As someone working on singing synthesis, I know how hard it is to get that last ...

imtringued · on Jan 13, 2023

If you are going to have such an intensive particle effect in your videos at least bother to upload a 4k version so there is a tiny chance that not every single frame consists of nothing but artifacts.

Also don't put gumi and English in the same search query on YouTube. I don't know how they did it but the voices from six years ago sound better than SOTA TTS based on deep learning today...

singedproxy · on Jan 14, 2023

Clearly the point of the video is its AUDIO content, not the visuals. The lack of a "4k version" does not make any difference other than saving you bandwith :-)

meremortals · on Jan 13, 2023

Very well done! Any suggestions on where/how one might learn to do something similar? I love the idea of being able to swap singers on a given track