
This isn't like lossless compression. Both techniques involve throwing lots of information away, with the justification that doing so does not significantly affect the end result.

The extent to which using both techniques together helps will depend on how much overlap there is between the information each ends up discarding.
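
A toy numerical sketch of that overlap point, assuming (purely for illustration; the thread doesn't name the techniques) that the two lossy steps are something like weight quantization and magnitude pruning. The idea: the combined loss stays near the larger of the individual losses when the discarded information overlaps, and drifts toward their sum when it doesn't.

    import numpy as np

    rng = np.random.default_rng(0)
    w = rng.normal(size=10_000).astype(np.float32)  # stand-in for a weight tensor

    # Lossy step 1 (hypothetical example): 8-bit-style uniform quantization.
    scale = np.abs(w).max() / 127
    w_q = np.round(w / scale) * scale

    # Lossy step 2 (hypothetical example): prune the 50% smallest-magnitude weights.
    thresh = np.quantile(np.abs(w), 0.5)
    w_p = np.where(np.abs(w) >= thresh, w, 0.0)

    # Apply both: prune first, then quantize the surviving weights.
    w_pq = np.round(w_p / scale) * scale

    # Absolute error as a rough proxy for "information thrown away".
    err_q  = np.abs(w - w_q).sum()    # lost to quantization alone
    err_p  = np.abs(w - w_p).sum()    # lost to pruning alone
    err_pq = np.abs(w - w_pq).sum()   # lost to the combination

    # High overlap: err_pq stays close to max(err_q, err_p).
    # Low overlap:  err_pq creeps toward err_q + err_p.
    print(err_q, err_p, err_pq)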




My joke was more along the lines of entropy. Entropy is information and you can't throw away all of it, otherwise you have nothing useful left.
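
(The formal version of that claim is Shannon entropy, the irreducible number of bits per symbol left once all redundancy has been squeezed out:)

    H(X) = -\sum_{x} p(x)\,\log_2 p(x)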


Modern LLMs are still quite inefficient in their representation of information. We're at something like the DEFLATE era of compression and have yet to invent our zstd, the point where gains become marginal and incremental; so right now there's a lot of waste to prune away.


Hence the idea of throwing away only almost all of it.



