Is there also something available to make use of the ANE during _training_? E.g ... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

callen43 on March 24, 2023 | parent | context | favorite | on: Transformer architecture optimized for Apple Silic...

Is there also something available to make use of the ANE during _training_? E.g fine-tuning BERT on an M1 Mac in a couple of hours?

(This here only applies to inference, right?)

machinekob on March 24, 2023 [–]

Its FP16/Int8 inference only (cause you can only access it via apple frameworks that dosent support training). Also its only used if your data is small enough (4mb cache) it wont be useful for big transformers/ big images processing in a while.

Join us for AI Startup School this June 16-17 in San Francisco!
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact