
So streaming services can save money on bandwidth



That's absurd. I think everybody is aware that it is far better to compress in the frequency domain than to downsample your image. If you don't believe me, just compare a JPEG-compressed image with the same image compressed to the same file size by downsampling. You will notice a night-and-day difference.

Downsampling is a bad way to do compression. It makes no sense to do NN reconstruction on a downsampled image when you could have compressed the image better and reconstructed from that data instead.


An image downscaled and then upscaled back to its original size is effectively low-pass filtered; the degree of edge preservation is dictated by the resampling kernel used in each step.
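
A quick way to see this for yourself, with Pillow (the filename and scale factor here are arbitrary):

    from PIL import Image

    # Downscale then upscale back to the original size; the resampling
    # kernel in each step controls how much edge detail survives.
    img = Image.open("frame.png")  # any source image
    w, h = img.size
    small = img.resize((w // 4, h // 4), Image.LANCZOS)
    roundtrip = small.resize((w, h), Image.LANCZOS)
    roundtrip.save("frame_lowpassed.png")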

Are you saying low-pass filtering is bad for compression?


The word is "blur." Low-pass filtering is blurring.

Is blurring good for compression? I don't know what that means. If the image size (not the file size) is held constant, a blurry image and a clear image take up exactly the same amount of space in memory.

Blurring is bad for quality. Our vision is sensitive to high-frequency stuff, and low-pass filtering is by definition the indiscriminate removal of high-frequency information. Most compression schemes are smarter about the information they filter.


> Is blurring good for compression? I don't know what that means.

Consider lossless RLE compression schemes. In this case, would data with low or high variance compress better?

Now consider RLE against sets of DCT coefficients. See where this is going?

In general, having lower variance in your data results in better compression.
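
A toy example in Python, using the number of runs as a stand-in for compressed size:

    from itertools import groupby

    def rle(data):
        # Encode as (value, run_length) pairs.
        return [(v, len(list(g))) for v, g in groupby(data)]

    low_var  = [5, 5, 5, 5, 6, 6, 6, 7, 7, 7, 7, 7]   # smooth/blurred
    high_var = [5, 9, 2, 8, 1, 7, 3, 9, 0, 6, 4, 8]   # noisy/detailed

    print(len(rle(low_var)))   # 3 runs
    print(len(rle(high_var)))  # 12 runs: RLE gains nothing here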

> Our vision is sensitive to high-frequency stuff

Which is exactly why we pick up HF noise so well! Post-processing houses are very often presented with the challenge of choosing just the right filter chain to maximize fidelity under size constraint(s).

> low-pass filtering is by definition the indiscriminate removal of high-frequency information

It's trivial to perform edge detection and build a mask to retain the most visually-meaningful high frequency data.
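
A sketch of that idea with OpenCV (the thresholds and kernel sizes are arbitrary choices):

    import cv2
    import numpy as np

    img = cv2.imread("frame.png", cv2.IMREAD_GRAYSCALE)  # hypothetical input

    # Binary edge map, dilated so the mask covers regions around edges.
    edges = cv2.Canny(img, 100, 200)
    mask = cv2.dilate(edges, np.ones((5, 5), np.uint8)).astype(np.float32) / 255.0

    # Keep original pixels near edges, low-pass filtered pixels elsewhere.
    blurred = cv2.GaussianBlur(img, (9, 9), 0).astype(np.float32)
    out = mask * img.astype(np.float32) + (1.0 - mask) * blurred
    cv2.imwrite("frame_masked.png", out.astype(np.uint8))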


Do you seriously think downsampling is superior to JPEG?


No. I never made this claim. My argument is pedantic.


Are you saying that when Netflix streams a 480p version of a 4K movie to my TV, they do not perform downsampling?


Yes. Downsampling only makes sense if you store per-pixel data, which is obviously a dumb idea. What you get is a 480p stream containing frames that were compressed from the source files (or from the 4K version). At some point downsampling may have been involved, but you never actually receive any of that per-pixel data; you get the compressed version of it.


Not sure if I’m being dumb, or if it’s you not explaining it clearly: if Netflix produced low-resolution frames from high-resolution ones (4K to 480p), and these 480p frames are what my TV is receiving, are you saying it’s not downsampling, and my TV would not benefit from this new upsampling method?


Your TV never receives per-pixel data. Why would you use an NN to enhance the data your TV has decoded instead of enhancing the data it actually receives?


OK, I admit I don’t know much about video compression. So what does my TV receive from Netflix if it’s not pixels? And when my TV does “upsampling” (according to the marketing), what does it do exactly?


It receives information about the spatial frequency content of the image. If you're unfamiliar, it's definitely worth looking into the specifics of how this works, as it's quite impressive! Here are a few relevant Wikipedia articles, and a Computerphile video:

https://en.wikipedia.org/wiki/JPEG#JPEG_codec_example

https://en.wikipedia.org/wiki/Discrete_cosine_transform

https://www.youtube.com/watch?v=Q2aEzeMDHMA
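
If you want to poke at the transform step yourself, here's a minimal SciPy sketch (an 8x8 block of random values stands in for real pixel data):

    import numpy as np
    from scipy.fft import dctn

    block = np.random.randint(0, 256, (8, 8)).astype(np.float64) - 128  # level shift
    coeffs = dctn(block, norm="ortho")

    # The top-left coefficient is the DC (average) term; the rest encode
    # progressively higher spatial frequencies. JPEG gets most of its
    # savings by quantizing the high-frequency corner toward zero.
    print(np.round(coeffs).astype(int))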


I think you're missing the point of this paper: the precise thing it's showing is upscaling previously downscaled video with minimal perceptual difference from ground truth.

So you could downscale, then compress as usual, and then upscale on playback.

It would obviously be quite attractive to ship compressed 480p (or 720p, etc.) footage and blow it up to 4K at high quality on playback. Of course you will have higher quality if you just compress the 4K directly, but the file size will be an order of magnitude larger.
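
In code, the pipeline looks roughly like this (a Pillow sketch; the NN step is stubbed out with plain bicubic, since the actual model is the paper's contribution):

    from PIL import Image

    def nn_upscale(img, factor):
        # Stand-in for a learned super-resolution model.
        w, h = img.size
        return img.resize((w * factor, h * factor), Image.BICUBIC)

    def encode(src_path, out_path, factor=4, quality=80):
        img = Image.open(src_path).convert("RGB")
        w, h = img.size
        small = img.resize((w // factor, h // factor), Image.LANCZOS)
        small.save(out_path, "JPEG", quality=quality)  # ship the small file

    def decode(in_path, factor=4):
        return nn_upscale(Image.open(in_path), factor)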


Why would you not enhance the compressed data?


In our hypothetical example, the compressed 4K data or the compressed 480p data? You would enhance the compressed 480p; that's what the example is. You would probably not enhance the 4K, because there's very little benefit to increasing resolution beyond 4K.


Or low-connectivity scenarios that push more local processing.

I think it's a bit unimaginative to see no use cases for this.


There is no use case, because it is a stupid idea. Downscaling and then reconstructing is a stupid idea for exactly the same reasons that downscaling for compression is a bad idea.

The issue isn't NN reconstruction, but that you are reconstructing the wrong data.


If the NN is part of the codec, you can choose to only downscale the regions that get reconstructed correctly.
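
Something like this per-block decision, where the round-trip callable and PSNR threshold are assumptions:

    import numpy as np

    def psnr(a, b):
        mse = np.mean((a.astype(np.float64) - b.astype(np.float64)) ** 2)
        return float("inf") if mse == 0 else 10 * np.log10(255.0 ** 2 / mse)

    def choose_blocks(frame, roundtrip, block=64, threshold_db=38.0):
        # roundtrip(patch) = downscale + NN-upscale back to patch size.
        decisions = {}
        h, w = frame.shape[:2]
        for y in range(0, h, block):
            for x in range(0, w, block):
                patch = frame[y:y + block, x:x + block]
                decisions[(y, x)] = psnr(patch, roundtrip(patch)) >= threshold_db
        return decisions  # True = safe to ship downscaled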


Why would you not let the NN work on the compressed data? That is actually where the information is.


That's like asking why you don't train an LLM on gzipped text. The compressed data is much harder to reason about.


Meh.

I think upscaling framerate would be more useful.


TVs already do this… and it's basically a bad thing



