Hacker News new | past | comments | ask | show | jobs | submit login

I would love to read this in transcription. I find my attention span for videos is less than for text. Does anybody have any other info on this topic?




That transcription seems to be really bad. It's full of strange errors. :(


I wonder what they used to generate that transcription?


I was kinda wondering too, and did a (very shallow) dive into the JavaScript on that page. I'm almost positive they are using Deepgram(dot com)'s speech-to-text service. I ran whisper.cpp on that audio file on my laptop, and it does a reasonably well job too.


They discuss their transcription process at 23:15 in episode #621

https://syntax.fm/621?t=0:23:15




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: