I have trained GANs on raw JPEG coefficients with moderate success as a pet proj...

akrymski · on April 29, 2021

Using a standard loss function like MSE?

Yeah it kinda works when you feed JPEG coefficients into a typical time-domain CNN, but mathematically it seems that if you're using frequencies as inputs, your convolutions should become simple multiplications. Am I wrong?

mochomocha · on April 29, 2021

Yep, you can successfully do it without convolutions. Here is a pointer if you want to dig deeper: https://eng.uber.com/neural-networks-jpeg/ (there's prior art to that, but this one is well written)