I suspect Whisper is more robust than other "SOTA" models, but this release is likely leaving a fair bit of accuracy on the table considering the amount of resources OpenAI is capable of throwing at training it.
Comparing results on the readily available test sets from the paper against some of my personal robust models (numbers are word error rates; for the Talon models, this is greedy decoding with no language model):
                       Talon   Talon   Talon   Whisper   wav2vec 2.0
                       28M     300M    1B      Large     960h
    librispeech clean   3.21    2.52    2.40    2.7        2.7
    librispeech other   8.21    6.56    5.63    5.6        6.2
    common voice       13.88   11.65    8.86    9.5       29.9
    tedlium             7.51    6.55    5.47    4.0       10.5
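For anyone unfamiliar with the metric: word error rate is (substitutions + insertions + deletions) divided by the number of reference words. A minimal sketch of that computation (my own illustrative helper, not the exact scoring script used for the table above):

    # Word error rate via word-level Levenshtein distance.
    def wer(reference: str, hypothesis: str) -> float:
        ref = reference.split()
        hyp = hypothesis.split()
        # Dynamic programming edit-distance table over words.
        d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
        for i in range(len(ref) + 1):
            d[i][0] = i
        for j in range(len(hyp) + 1):
            d[0][j] = j
        for i in range(1, len(ref) + 1):
            for j in range(1, len(hyp) + 1):
                cost = 0 if ref[i - 1] == hyp[j - 1] else 1
                d[i][j] = min(d[i - 1][j] + 1,         # deletion
                              d[i][j - 1] + 1,         # insertion
                              d[i - 1][j - 1] + cost)  # substitution
        return d[len(ref)][len(hyp)] / len(ref)

    print(wer("the cat sat on the mat", "the cat sat on mat"))  # ~0.167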
I have a battery of more difficult tests on hand (including adversarial tests and diverse accent-specific metrics). I'll look at running these tests on each of the Whisper model sizes and following up with a larger comparison.
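A rough sketch of what that per-model-size loop might look like with the openai-whisper package (the audio paths and reference transcripts are placeholders, jiwer is just one common WER implementation, and the text normalization here is deliberately simplified):

    # Score each Whisper model size on a small (path, reference) test set.
    import whisper
    import jiwer

    test_set = [
        ("audio/sample_001.wav", "the quick brown fox jumps over the lazy dog"),
        # ... more (audio path, reference transcript) pairs
    ]

    for size in ["tiny", "base", "small", "medium", "large"]:
        model = whisper.load_model(size)
        references, hypotheses = [], []
        for path, reference in test_set:
            result = model.transcribe(path)
            hypotheses.append(result["text"].lower().strip())
            references.append(reference)
        print(size, "WER:", jiwer.wer(references, hypotheses))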