Hacker News new | past | comments | ask | show | jobs | submit login

"Now, we are starting to see why, thanks to the release of their first model, Mistral.

A 7-billion-parameter model that, despite its size, has crazy credentials already and comes packed with surprises that few saw coming."





anyone know why they have a forked version of the python Transformers library there?




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: