Hacker News

That's a 10x speed increase. What's the secret behind Apple's M3? Faster-clocked RAM? Dedicated AI hardware?



Unified memory and optimizations in llama.cpp (which Ollama wraps).


Is that using the GPU?


It's configurable. There are details in the repo, but llama.cpp uses Metal for GPU acceleration.
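For what it's worth, a sketch of how that configuration typically looks on the command line (flag and parameter names taken from llama.cpp and Ollama; the model file name is a placeholder, and this is illustrative rather than a full invocation):

```shell
# llama.cpp: -ngl / --n-gpu-layers sets how many model layers are
# offloaded to the GPU; on Apple Silicon this goes through the Metal
# backend. "model.gguf" is a placeholder path.
./llama-cli -m model.gguf -ngl 99 -p "Hello"

# Ollama exposes the same knob as the num_gpu option, e.g. in a
# Modelfile:
#   PARAMETER num_gpu 99
```

Because the memory is unified, offloading layers to the GPU doesn't require copying weights into separate VRAM, which is part of why large models run well on these machines.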




