Somewhat tangential question, have you looked for/found any audio models/tools t...

bane · on April 6, 2023

That's called "speaker diarization" and there's quite a bit of work in the field. https://github.com/topics/speaker-diarization

I have no idea what's good or the best, but there's a starting point!

bick_nyers · on April 6, 2023

Thank you for the proper terminology. Googling these things in plain English just doesn't do it anymore.

bane · on April 7, 2023

Not a problem and good luck!

tomcam · on April 6, 2023

I had speaker diarization and my wife and dog wouldn’t even come near me for weeks

iKlsR · on April 6, 2023

I haven't tested this with multiple voices, and it sounds like you want something more specific, but it's produced 10/10 results with the couple dozen audio files I've thrown at it. Might be of use: https://vocalremover.org/

rahimnathwani · on April 6, 2023

https://speechbrain.readthedocs.io/en/latest/API/speechbrain...

Slow_Hand · on April 6, 2023

iZotope RX Pro, which is software for cleaning and refining audio for music and audio post-production, includes 'Multiple Speaker Detection', which analyzes the different voices in a recording and lets you process them independently.

https://www.izotope.com/en/products/rx.html

I can't speak to its effectiveness because I don't have any need for it. RX 10 Advanced is also commercial software and pretty expensive for a casual user, but the feature seems to be on the horizon for other apps.

ukuina · on April 7, 2023

Does Ultimate Vocal Remover 5 fit the bill?

Despite the name, it can also do audio separation.

dvngnt_ · on April 6, 2023

Meta has Demucs, which is the best I've used so far.