Hacker News new | past | comments | ask | show | jobs | submit login

I might be wrong, but looks like this could help with speculative decoding which can already vastly improves the inference speed?



Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: