
For my sci-fi story (alpha readers wanted; see profile), I used Whisper to transcribe an interview with a Malawian president. From there, I built a vocabulary comprising only the president's words, which I used almost exclusively when writing his speech.
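For the vocabulary step, a shell pipeline along these lines works (a rough sketch; transcript.txt and vocabulary.txt are placeholder names I've chosen, not files the workflow below produces by default):

    # split on non-letters, lowercase, and keep one copy of each word
    tr -cs '[:alpha:]' '\n' < transcript.txt | tr '[:upper:]' '[:lower:]' | sort -u > vocabulary.txt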

The results from Whisper are incredible, with very few mistakes, though it did get Nelson Mandela's first name wrong (transcribed as "Nesson"). What's more, Whisper finished transcribing a 60-minute audio stream in 20 minutes on commodity hardware (an NVIDIA T1000 G8 GPU). Broadly, here are the steps I used:

* Download and install podman.

* Download and install git.

* Download and install curl.

* Open a command prompt.

* Run the following commands to containerize Whisper:

    # fetch a simple Flask wrapper around Whisper
    git clone https://github.com/lablab-ai/whisper-api-flask whisper
    cd whisper
    # podman looks for a Containerfile by default
    mv Dockerfile Containerfile
    podman build --network="host" -t whisper .
    # the API listens on port 5000
    podman run --network="host" -p 5000:5000 whisper
* Download an MP3 file (e.g., filename.mp3).

* Run the following command to produce a transcription:

    curl -F "file=@filename.mp3" http://localhost:5000/whisper
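The endpoint responds with JSON, so to keep a copy for later processing you can redirect the output (the field holding the transcript text depends on the wrapper, so inspect the response before parsing it):

    # save the raw JSON response to disk
    curl -sF "file=@filename.mp3" http://localhost:5000/whisper -o transcript.json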



Whisper is great. You can get faster results by running the tiny model. I used it for podcast transcription, and it is much faster while the quality is no worse than the medium model's; for some podcast episodes the transcriptions are identical.
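If you have the stock openai-whisper package installed, switching sizes is just a flag (a minimal example; episode.mp3 is a placeholder filename):

    # the tiny model trades a little accuracy for a large speedup
    whisper episode.mp3 --model tiny --language en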


If speed is important, you're much better off using a larger model and whisper.cpp.
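Getting whisper.cpp running looks roughly like this (a sketch; the binary name and helper scripts have moved around between versions, and clip.mp3 is a placeholder):

    git clone https://github.com/ggerganov/whisper.cpp
    cd whisper.cpp
    make
    # fetch a converted ggml model
    ./models/download-ggml-model.sh base.en
    # whisper.cpp expects 16 kHz mono WAV input, so convert first
    ffmpeg -i clip.mp3 -ar 16000 -ac 1 -c:a pcm_s16le clip.wav
    ./main -m models/ggml-base.en.bin -f clip.wav -t 8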


Wow, thank you! That's a nice speedup indeed. With whisper I get

    33,53s user 2,05s system 443% cpu 8,023 total
with the 'tiny.en' model whereas whisper.cpp gives me

    22,71s user 0,12s system 745% cpu 3,062 total
with the 'base.en' model for a 15s audio clip on an i7-3770 (8 threads).


Awesome! Thanks for posting the stats.

In my workflows I've found rare but noticeable quality differences between the model sizes, so when practical I try to use the larger ones.


Why not just run whisper from the command line directly? Why put it into a Docker container?


Why not keep everything tightly contained?


Hm, I'm on a Mac, so it takes up a bunch of RAM, and I'm not used to this workflow. Good point, though.


Unless you actually use the memory (e.g. allocate it), it won’t impact system performance, but yeah, it definitely is overhead.


Some people just love making their environments needlessly complicated.


Complexity is in the eye of the beholder; some people are comfortable enough with Docker that it adds no friction.

Now, installing the dependencies of every git repo I want to try onto my host system: that's how an environment becomes needlessly complicated.


Thank you for this.



