Hacker News new | past | comments | ask | show | jobs | submit login

Good point but the problem with local hosting is that if you want to use the larger models it will take a long time to transcribe a file. We use multiple gpus and we do speaker detection, sound detection and it is has a rich audio editor.



Totally agree, having built a similar app I know speaker diarization is a killer feature that's hard to get. My problem is I'll never share these recordings ;).




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: