
How well would it work with technical or specific jargon? (Anatomy and medical terminology, in our specific use-case)

Is there a way to feed some kind of (text) dictionary to aid recognition? Or does it also need audio samples to learn from?




Yes! You can upload a corpus of text from which the service learns new words (and their context), and/or you can give it specific words and their pronunciations. No audio samples are needed; the customization works on the existing language models.

More details: https://www.ibm.com/watson/developercloud/doc/speech-to-text...
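To make the "specific words and their pronunciation" part concrete, here is a rough sketch of what a custom-word list could look like before it is sent to the service. The field names `word`, `sounds_like`, and `display_as` are my recollection of the custom-word format and should be checked against the linked docs; the helper `build_custom_words` and the example terms are hypothetical, and no request is actually made here.

```python
import json


def build_custom_words(entries):
    """Build a JSON-serializable list of custom words.

    entries: iterable of (word, sounds_like, display_as) tuples.
    sounds_like is a list of pronunciation spellings (may be empty);
    display_as is the spelling to show in transcripts (may be None).
    Field names are assumed, not confirmed against the service docs.
    """
    words = []
    for word, sounds_like, display_as in entries:
        item = {"word": word}
        if sounds_like:
            item["sounds_like"] = list(sounds_like)
        if display_as:
            item["display_as"] = display_as
        words.append(item)
    return {"words": words}


# Hypothetical medical terms with rough "sounds like" spellings.
payload = build_custom_words([
    ("sternocleidomastoid", ["sterno cleido mastoid"], None),
    ("NSAID", ["en sed"], "NSAID"),
])

print(json.dumps(payload, indent=2))
```

For ordinary terminology the corpus upload alone is probably enough, since the service infers pronunciations from spelling; explicit entries like these would mainly matter for acronyms and words that are not pronounced the way they are written.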


If you use Kaldi, you can mix in any kind of domain-specific text. It usually improves accuracy significantly, particularly for technical domains. You do not need audio for that.
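A toy illustration of why mixing in text helps, assuming "mixing" means interpolating a general language model with one built from domain text. This is not Kaldi's own tooling: a real setup would build n-gram models with a language-model toolkit and compile the result into the decoding graph. The unigram models, the two sample sentences, and the 0.5 weight below are all made up for the example.

```python
from collections import Counter


def unigram_model(text):
    """Estimate unigram probabilities from whitespace-separated text."""
    counts = Counter(text.lower().split())
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def interpolate(general, domain, domain_weight):
    """Linearly interpolate two unigram models over their joint vocabulary."""
    vocab = set(general) | set(domain)
    return {
        word: (1 - domain_weight) * general.get(word, 0.0)
        + domain_weight * domain.get(word, 0.0)
        for word in vocab
    }


# Made-up sample texts standing in for a general corpus and a medical one.
general_text = "the patient went to the store and the bank"
domain_text = "the femur and the tibia articulate at the knee"

general = unigram_model(general_text)
domain = unigram_model(domain_text)
mixed = interpolate(general, domain, domain_weight=0.5)

# "femur" is impossible under the general model but not under the mix.
print(general.get("femur", 0.0), mixed["femur"])
```

The point is only that a word the general model has never seen gets nonzero probability once domain text is mixed in, which is what lets the recognizer output it at all.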



