About the "Text-to-Speech" section there: I was really impressed with the updated Swedish "Alva" voice in OS X El Capitan. It correctly pronounces "tomten" differently in its first and second occurrence in this example:

say -v Alva "Tomten dricker julmust på tomten"

"Tomten" can mean either "Santa Claus" or "the yard"/"the plot" depending on context, and apparently the voice is able to detect this properly.




OS X makes progress on this front with every release. I typically test it with a few tricky French sentences (think "les poules du couvent couvent") and it seems to improve, but it's hard to say from the outside what gets better in the model. "Mes fils ont cassé mes fils" still fails, for instance, but that one seems harder to detect to me.
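
For anyone who wants to repeat this kind of spot check, here is a minimal sketch that feeds the homograph sentences quoted in this thread to the `say` command. The voice name "Thomas" for French is an assumption (check `say -v '?'` for what is installed on your machine), and the helper names `build_say_command` and `speak_all` are made up for illustration. It only plays the audio; judging whether the pronunciation is right is still up to your ears.

```python
import shutil
import subprocess

# Homograph test sentences quoted in the thread, grouped by voice.
# "Alva" is the voice named in the thread; "Thomas" is an ASSUMED French
# voice name and may not be installed on every system.
SENTENCES = {
    "Alva": ["Tomten dricker julmust på tomten"],
    "Thomas": ["Les poules du couvent couvent", "Mes fils ont cassé mes fils"],
}


def build_say_command(voice, text):
    """Return the argv list for `say -v VOICE TEXT` (hypothetical helper)."""
    return ["say", "-v", voice, text]


def speak_all(sentences=SENTENCES):
    """Speak every sentence; return how many were sent to `say`."""
    if shutil.which("say") is None:
        # `say` is not on the PATH (e.g. not running on OS X): do nothing.
        return 0
    spoken = 0
    for voice, texts in sentences.items():
        for text in texts:
            # check=False: a missing voice should not abort the whole run.
            subprocess.run(build_say_command(voice, text), check=False)
            spoken += 1
    return spoken


if __name__ == "__main__":
    print(f"sent {speak_all()} sentence(s) to say")
```

Passing the text as a separate list element avoids shell quoting problems with the accented characters.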


I think the OP was talking about Text-to-Speech, and you are (maybe?) talking about speech recognition?

(The irony of this misunderstanding being kicked off by a comment about the text-to-speech engine understanding the context of a word amuses me)


What is Apple's approach to NLP and speech? What algorithms are they using?



