I'm still looking over this to see its capabilities, but if reading this right, ...

emerged · on Dec 18, 2021

That is a tremendously big “if” considering how many times that problem has been attempted. Even just detecting the key of a song is awfully fuzzy.

pindab0ter · on Dec 18, 2021

Detecting a key of a song is also not deterministic. Some song’s keys are truly ambiguous and/or subjective.

andybak · on Dec 18, 2021

Aye. Polyphonic pitch is a much simpler problem than "key" which is more akin to rigorous sentiment analysis or some other intractable goal.

haberman · on Dec 19, 2021

My gut is that computing key (given reliable pitch data) is a lot easier than computing polyphonic pitch data (given audio).

For relatively "conventional" music, there are very strong signals of key like beginning and ending chords, and overall note distributions which will generally cluster around one particular scale. For less conventional music, this will be more ambiguous, but it would have been more ambiguous for a human listener too.

nsonha · on Dec 20, 2021

the result can be used as audio fingerprints, which is not a new thing. This has something to do with how things like Shazam work.