Hacker News new | past | comments | ask | show | jobs | submit login

The main problem I have faced with the whisper model (large) is when there is silence or a sizable gap without audio, it hallucinates and just puts out some random gibberish repeatedly until the transcription ends. How does this app handle this?



I've run into that, many times. Would be nice to have a fix.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: