Hacker News new | past | comments | ask | show | jobs | submit login

Commonvoice is actually not very good test set. Their texts are very specific (mostly wikipedia and such) and also the texts overlap between train and test which leads to overtraining of most transformer models. If you test on variety of domains, you'll see totally different picture.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: