I really hope Meta (FB research) releases this wonderful noise reduction library: https://github.com/facebookresearch/denoiser

I tested MANY while building various audio tech and this was by far the best (beats the shit out of all the python Pandas Hugging Face lists etc). Incredible ability to cut noise (tho, it must be noted that perceptual improvements for humans do not usually improve machine transcription, as AI models strangely seem to pull info out of the dead space between words, and the noise around words...not just the voiced words themselves...the hidden vibrations, beyond our human ken, oh mere mortals unworthy of the grand perceptive machines...ugh...:p :o ;p xx ;p)

I contacted the authors via their FB emails but never heard back. Right now it's non-commercial and I was building a commercial product.




Somewhat tangential question: have you looked for/found any audio models/tools that can automatically separate individual voices into separate audio tracks? Perhaps this is already possible with existing tools that I am not familiar with.


That's called "speaker diarization" and there's quite a bit of work in the field. https://github.com/topics/speaker-diarization

I have no idea what's good or the best, but there's a starting point!


Thank you for the proper terminology, googling these things in plain English just doesn't do it anymore


Not a problem and good luck!


I had speaker diarization and my wife and dog wouldn’t even come near me for weeks


I haven't tested this with multiple voices and it sounds like you want something more specific but it's produced 10/10 results with a couple dozen audio files I've thrown at it, might be of use... https://vocalremover.org/



iZotope RX Pro, which is software for cleaning and refining audio for music and audio post production, includes 'Multiple Speaker Detection', which analyzes different voices in a recording and allows you to process them independently.

https://www.izotope.com/en/products/rx.html

I can't speak to its effectiveness because I don't have any need for it, and also RX 10 Advanced is commercial software and pretty expensive for a casual user, but the feature seems to be on the horizon for other apps.


Does Ultimate Vocal Remover 5 fit the bill?

Despite the name, it can also do audio separation.


Meta has Demucs, which is the best I've used so far


Have you tested https://hushaudioapp.com ? Its quality is amazing.


AFAIU their point was they wanted a reusable library, and specifically NOT a closed-source paid-only solution.


Have you tested the FB one?


How does it compare to Adobe's recent AI audio enhancement?


Not sure. Anyone else wanna chime in on that?

BTW - in my search just now in response to your question, I see Dolby also has an API: https://dolby.io/products/enhance/


Interesting that all of these seem to work extremely well but none are actually available to end users as an unrestricted product.



