
I'm still looking over this to see its capabilities, but if I'm reading this right, we can turn any mp3/wav into a set of MIDI files, which we can then import into music editing software (like Finale). If this works, this is huge. Congrats to the team.



That is a tremendously big “if” considering how many times that problem has been attempted. Even just detecting the key of a song is awfully fuzzy.


Detecting the key of a song is also not deterministic. Some songs' keys are truly ambiguous and/or subjective.


Aye. Polyphonic pitch detection is a much simpler problem than "key", which is more akin to rigorous sentiment analysis or some other intractable goal.


My gut is that computing key (given reliable pitch data) is a lot easier than computing polyphonic pitch data (given audio).

For relatively "conventional" music, there are very strong signals of key like beginning and ending chords, and overall note distributions which will generally cluster around one particular scale. For less conventional music, this will be more ambiguous, but it would have been more ambiguous for a human listener too.
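To make the "note distributions" point concrete, here is a rough sketch of one common approach: build a duration-weighted pitch-class histogram and correlate it against a key profile for each of the 24 major and minor keys. This assumes the pitch data is already reliable and arrives as (MIDI pitch, duration) pairs. The profile numbers are the Krumhansl-Kessler ratings as I remember them, and the function names are my own, not taken from the project being discussed.

```python
from math import sqrt

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

# Krumhansl-Kessler key profiles (assumed values), index 0 = tonic.
MAJOR_PROFILE = [6.35, 2.23, 3.48, 2.33, 4.38, 4.09,
                 2.52, 5.19, 2.39, 3.66, 2.29, 2.88]
MINOR_PROFILE = [6.33, 2.68, 3.52, 5.38, 2.60, 3.53,
                 2.54, 4.75, 3.98, 2.69, 3.34, 3.17]


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is flat)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def pitch_class_histogram(notes):
    """Sum note durations per pitch class (0 = C ... 11 = B)."""
    hist = [0.0] * 12
    for midi_pitch, duration in notes:
        hist[midi_pitch % 12] += duration
    return hist


def estimate_key(notes):
    """Return (score, key name) pairs for all 24 keys, best match first."""
    hist = pitch_class_histogram(notes)
    scores = []
    for tonic in range(12):
        # Rotate the histogram so the candidate tonic sits at index 0.
        rotated = hist[tonic:] + hist[:tonic]
        scores.append((pearson(rotated, MAJOR_PROFILE), NOTE_NAMES[tonic] + " major"))
        scores.append((pearson(rotated, MINOR_PROFILE), NOTE_NAMES[tonic] + " minor"))
    scores.sort(reverse=True)
    return scores


if __name__ == "__main__":
    # A made-up melody: (MIDI pitch, duration in beats).
    melody = [(60, 4), (62, 1), (64, 2), (65, 1), (67, 3), (69, 1), (71, 1)]
    for score, name in estimate_key(melody)[:3]:
        print(f"{name}: {score:.3f}")
```

The runner-up scores are the interesting part: when the top two or three candidates are close, that is the same ambiguity a human listener would hear. This sketch ignores the beginning and ending chords, which would be an extra signal to weigh in.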


The result can be used as an audio fingerprint, which is not a new idea. Fingerprinting is related to how services like Shazam work.



