Hacker News new | past | comments | ask | show | jobs | submit login

I'd love to hear some opinions from anyone with speech recognition experience on how this stacks up to the commercial alternatives.



I don't think this is quite the sort of answer you want:

My sibling (using Debian Testing) had wrist RSI this summer and we tried to set up Simon Listens (the previous version; 0.4 looks from the release notes like it's improving). We are both Linux nerds. We were not able to get it to do anything useful after a few days of work, and I estimated a 50-50 chance that working harder on it would help[1]. I did not find any other FOSS speech-to-text that I could get working either. (FOSS Linux text-to-speech is much better; e.g. Orca is good.) We did not try any commercial products; Dragon Naturally Speaking is the only one I know of having a good reputation but it is Windows-based. Also, it's hard to integrate well into the Linux stack without being FOSS. A list of products we looked at: https://en.wikipedia.org/wiki/Speech_recognition_in_Linux

[1] Issues with compiling, dependencies, figuring out the conceptual model Simon Listens uses, trying to figure out whether Simon-not-doing-anything was because we miscompiled it, or audio input, or incompatible dep versions or misconfigured deps, or us just doing the wrong thing because the English documentation wasn't super thorough... Imagine setting up Apache, MySQL and PHP if there weren't a billion tutorials online, you'd never used Apache, and MySQL wasn't compatible with your GCC unless you pulled the git version and hoped you didn't get confused by dev-version-only bugs.


That is unfortunate, but somehow that sort of user experience doesn't come as a surprise to me. I think it is the "blog as the main project page, and no version control link in sight" thing that gives off that feeling. The only other thing that gives off vibes that bad is a sourceforge page with no hint of a real project page.

Really frustrating when you see projects with potential botch basic developer usability so badly.


There's also http://www.simon-listens.org/ . If you speak German there might be some info there that we couldn't read (some is in English, but I think the developers are mostly in Germany).


Ah, that's bit better. Found the version countrol: http://sourceforge.net/projects/speech2text/


Actualy Simon project has recently moved to KDE infrastructure: http://dot.kde.org/2012/04/08/simon-speech-recognition-proje... and the code is here: https://projects.kde.org/projects/extragear/accessibility/si...




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: