Hacker News new | past | comments | ask | show | jobs | submit login

Nice! For what it is worth, a colleague and I made a library a while ago that factors out most shared model code, with which many models can be implemented in about 100 lines (excluding Python import ceremony and comments). E.g.:

BERT:

https://github.com/explosion/curated-transformers/blob/main/...

Llama 1/2:

https://github.com/explosion/curated-transformers/blob/main/...

MPT:

https://github.com/explosion/curated-transformers/blob/main/...

With various stuff enabled, including support for TorchScript JIT, PyTorch flash attention, etc.




Nice. I will definitely be taking a look at this. Have you looked at the xformers library ? They are looking at the same problem as you but their focus is more on providing performant transformer modules using triton. Using specific components from the library though is not as simple. I kept running into runtime errors so I've kept it aside for now. I am building something based on the Bert architecture so I will give this a look. Thanks for all the work!


I would've loved to look at xFormers, but I avoided looking at other implementations to make sure that ours is a clean room implementation.

Curated Transformers started as a very small library just for spaCy (spaCy 3.7 transformer pipelines use Curated Transformers) with just the older encoder models (BERT, RoBERTa, etc.). spaCy used Hugging Face Transformers prior for the provided transformer models, but we wanted something where we could easily hook into different parts of the model (e.g. for distillation).

After the functionality needed for spaCy was done, Matt @ Explosion encouraged us to extend it into a more general PyTorch library that would also support decoder architectures, generation, etc.


I’m floored by this library. Great concept.

I’ve never been a fan of HFs implementation. This is a beautiful API at exactly the right level of abstraction.

I’ll give this a try on my next project.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: