If consumer cards can run the big models, then datacenter cards will be able to ... | Hacker News


sebzim4500 6 months ago | on: The Era of 1-bit LLMs: ternary parameters for cost...

If consumer cards can run the big models, then datacenter cards will be able to efficiently run the really big models.

leroman 6 months ago

Some of the tasks we use LLMs for perform very close to GPT-4 levels with 7B models, so it really depends on what value you are looking to get.
