Hacker News new | past | comments | ask | show | jobs | submit login

AFAIK there is no general purpose, "do this on the ANE" API. You have to be using specific higher level APIs like CoreML or VisionKit in order for it to end up on the ANE.



This, plus metal acceleration works quite well. 7~8B parameter models quantized to 3bpw or so run with good tok/s on my iphone 15 pro


It works quite well as long as you don't care about battery.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: