What is the current state of offline, non-cloud-connected speech-to-text?

My phone has a voice processing chip, and it recognizes my speech pretty well, but I still can't figure out whether it's completely disconnected from the cloud (despite my intentionally not agreeing to the privacy policy).

His demo is just a shim for Amazon's API...




Kaldi is pretty good. Not sure if you can run it on a phone, but it definitely runs on a single desktop with purely local processing.

Results depend on the trained model; I think the TED-LIUM one is alright. They also depend on the quality of the input signal: far-field or noisy audio is much more prone to errors, which is where the mic array on the Echo helps a lot.

Here's a relatively easy way to set it up:

https://github.com/alumae/kaldi-gstreamer-server


> My phone has a voice processing chip, and it recognizes my speech pretty well, but I still can't figure out if it's completely disconnected from the cloud

Does it work when you are in airplane mode?


It does.

There's an offline language pack installed, though.



