I would like to include a speech recognition in my commercial projects, but the license is GPL and is tied to KDE.
I've also tried sphinx before but the recognition is kind of poor and it lacks a gui for user/developer configuration of the grammars.
I wonder if speech recognition software can be developed with a :
- Dynamic Time Wrapping (DTW) algorithm for comparing utterances/words.
- A recording device for the users to record their words.
- Context separation like simon uses for limiting the phrases to listen at any time and improving accuracy.
"A recording device for the users to record their words."
Do you mean a software device like some kind of control panel? While that's a solution that eases the software developer's job, that's not how people want their software to work. I'm a software developer myself and I don't want my speech software to require training. Or if it's going to require training, fake it for me. Maybe a wizard: "Hi, I'm Simon! I need to hear your voice a bit before we get started. Please read the following sentence: ..." or something.
Sure, while this is a <1.0 release, maybe this recorder will help the devs learn their problem domain a bit more deeply, but I'd sincerely hope it doesn't become an engineer's crutch- IMO, it's Not Good expecting users to adapt themselves to the technology that's supposed to be serving them.
I wonder if speech recognition software can be developed with a : - Dynamic Time Wrapping (DTW) algorithm for comparing utterances/words. - A recording device for the users to record their words. - Context separation like simon uses for limiting the phrases to listen at any time and improving accuracy.