Hacker News new | past | comments | ask | show | jobs | submit login

This is a LD_LIBRARY_PATH emulator. No CUDA installation required.



You probably still want to use things like cublas if you want to run existing CUDA software.


I would want an equivalent of cublas optimized for my specific GPU model and implementing the same API.

AFAIK cublas and other first-party libraries are hand-optimized by nVidia for different generations of their hardware, with dynamic dispatch in runtime for optimal performance. Pretty sure none of these versions would run optimally on AMD GPUs because ideally AMD GPUs run 64 threads / wavefront, nVidia GPUs run 32 threads / wavefront.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: