Hacker News new | past | comments | ask | show | jobs | submit login

Somewhat tangential question, have you looked for/found any audio models/tools that can be used for separating out individual voices to separate audio tracks automatically? Perhaps this is already possible with existing tools that I am uninitiated in.



That's called "speaker diarization" and there's quite a bit of work in the field. https://github.com/topics/speaker-diarization

I have no idea what's good or the best, but there's a starting point!


Thank you for the proper terminology, googling these things in plain English just doesn't do it anymore


Not a problem and good luck!


I had speaker diarization and my wife and dog wouldn’t even come near me for weeks


I haven't tested this with multiple voices and it sounds like you want something more specific but it's produced 10/10 results with a couple dozen audio files I've thrown at it, might be of use... https://vocalremover.org/



Izotope RX Pro, which is software for the cleaning and refinement of audio for music and audio post production includes 'Multiple Speaker Detection' which analyzes different voices in a recording and allows you to process them independently.

https://www.izotope.com/en/products/rx.html

I can't speak to it's effectiveness because I don't have any need for it, and also RX 10 Advanced is commercial software and pretty expensive for a casual user, but the feature seems to be on the horizon for other apps.


Does Ultimate Vocal Remover 5 fit the bill?

Despite the name, it can also do audio separation.


meta has demucs which is the best I've used so far




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: