The remote possibilities here are staggering. If we could send beeps in the human vocal range over phone lines to control things, it could revolutionize telephony!
Very cool but you wouldn't get very far in an FPS by sending one event at a time. This way you can't did things like strafe turning or shoot while walking.
A flute normally has about 2 octaves of diatonic notes, certainly true for a recorder. I'd suggest expanding the number of usable notes so that some could be used for compound commands.
In saying that, what would be even better, would be to allow the use multiple notes at ones, polyphonic instruments, so that chords could be used for this effect. Say you play a C to walk forwards, a G to strafe right, and E to shoot, then a C major triad could be played to do all these things simultaneously. A flute quartet could maybe play the game together.
Maybe the flute's pitch could be decoded back into the required fingering combination. This way the player could treat each hole as an individual button. The actual notes might not all sound musical (or be properly distinguishable).
That's a cool idea but subtlety of those differences would be much, much more complex to pick up and everything would certainly need to be recalibrated for every different model of flute whereas the current solution would work for flute tuned A=440Hz
Yes, I made a game that you controlled by whistling, back in 2018 with the WebAudio API.
My version of the cursor kept a continuous scan velocity horizontally across the screen, and you controlled the y-axis via pitch. Button presses were (intentional) collisions.
This is hilarious. Reminds me of the YID (yelling input device) I made for the stupid shit hackathon in 2018[1]. Playing with alternative ways to control systems is a fun exercise in creativity.