Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
3abiton
8 months ago
|
parent
|
context
|
favorite
| on:
Many options for running Mistral models in your te...
That's 10x speed increase. What's the secret behind apple M3? Faster clocked RAMs? Specific AI hardware?
bugglebeetle
8 months ago
[–]
Unified memory and optimizations in llama.cpp (which Ollama wraps).
ithkuil
8 months ago
|
parent
[–]
Is that using the GPU?
bugglebeetle
8 months ago
|
root
|
parent
[–]
It can be variably configured. There are details in the repo, but llama.cpp makes use of Metal.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: