Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
wpm
3 months ago
|
parent
|
context
|
favorite
| on:
Apple's On-Device and Server Foundation Models
AFAIK there is no general purpose, "do this on the ANE" API. You have to be using specific higher level APIs like CoreML or VisionKit in order for it to end up on the ANE.
bt1a
3 months ago
[–]
This, plus metal acceleration works quite well. 7~8B parameter models quantized to 3bpw or so run with good tok/s on my iphone 15 pro
sanxiyn
3 months ago
|
parent
[–]
It works quite well as long as you don't care about battery.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: