
The model targets the decoder part of the system, which is the speed bottleneck, so it is unlikely to help with tasks like classification. However, a similar method could be used for that use case. (Coauthor)


