Hacker News new | past | comments | ask | show | jobs | submit login

I used one of the TTS pre-trained models to turn the Frankenstein e-book from Project Gutenberg into a podcast, and it worked pretty darn well (especially when I compared it with the terminal "say" command). Here's my write-up:

https://www.charlieharrington.com/flow-and-creative-computin...

And the podcast RSS feed:

https://whatrocks.github.io/castellan/podcastjr.xml

It's great when these ML models link to a Google Colab notebook. It makes it super easy and dare-I-say fun to try them out.




Wow, that neural voice sounds so much better than the TTS that I use with my screenreader for reading books with my print-related disability. Thank you for the writeup! :-)


You should also totally check out silero-models, which are also available in colab with 10 speakers:

- https://github.com/snakers4/silero-models#text-to-speech - https://habr.com/ru/post/549482/

Disclaimer, this is my independent project




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: