I took a look at the diff linked in the article with code that "we are all runni...

justinsaccount · 2024-04-03T12:56:59 1712149019

Also because the 'safe' version only checks

  dict->pos == dict->limit

and not

  dict->pos >= dict->limit

if you can get one call of dict_put somewhere to pass the limit, all later calls of dict_put_safe will happily overwrite memory and not actually be safe.

Calzifer · 2024-04-04T08:22:55 1712218975

No, because dict_put will update the limit value if the new pos exceed it.

justinsaccount · 2024-04-04T11:55:16 1712231716

I don't see anything like what you are describing. What line exactly are you talking about?

ahartmetz · 2024-04-03T21:04:19 1712178259

Wow, that is 1000% obviously malicious

Matumio · 2024-04-03T12:26:36 1712147196

Agree, nice catch. Also, there are many other opportunities in this patch to hide memory safety bugs.

This is the kind of optimization I might have done in C 10 years ago. But coming back from Rust, I wouldn't consider it any more. Rust, despite its focus on performance, will simply not allow it (without major acrobatics). And you can usually find a way to make the compiler optimize it out of the critical path.

kmfpl · 2024-04-03T11:36:25 1712144185

I agree, this looks extremely sketchy. Especially because the code is just writing a fully controlled byte in the buffer and incrementing its index.

This would give you a controlled relative write primitive if you can repeatedly call this function in a loop and going OOB.

liendolucas · 2024-04-03T12:13:16 1712146396

I think at this point is clear that everybody has to assume that XZ is completely rotten and can no longer be trusted. Is it XZ easy to replace with some other compression tool? Or has it been so widely adopted that is going to take huge effort moving out of it?

dralley · 2024-04-03T12:39:30 1712147970

There is no reason to assume that. Even if you assume every commit since Jia became a maintainer is malicious, the version from 3 years ago is perfectly fine.

Zstd has a number of benefits over Xz that may warrant its use as a replacement of the latter, and this will likely be a motivating factor to do so. But calling it entirely rotten is going way too far IMO

mmd45 · 2024-04-03T13:19:34 1712150374

There is an interesting argument to be made that pre-JT xz code is probably pretty secure due to the fact that the threat actors would have already audited the code for existing exploits prior to exerting effort to subvert it.

tripflag · 2024-04-03T13:16:04 1712150164

I always use "zstd --long=31 -T0 -19" to compress disk images, since that is a usecase where it generally offers vastly superior compression to xz, deduplicating across bigger distances.

XZ offers slightly better compression on average, but decompression is far slower than Zstd.

dralley · 2024-04-03T13:19:57 1712150397

IIRC memory consumption is generally worse for Zstd at comparable levels of compression. Which, these days, is generally fine, but my point is you can't thoughtlessly substitute the two.

liendolucas · 2024-04-03T12:48:32 1712148512

What keeps ringing in my head is the "." that was found that invalidates compilation. I personally don't buy it (but is my opinion).

dralley · 2024-04-03T13:08:45 1712149725

What do you mean "don't buy it"?

liendolucas · 2024-04-03T14:22:05 1712154125

My bad. I thought that the person who made that commit was someone else than JT. Can't delete comment nor self-down-vote it.

kzrdude · 2024-04-03T12:30:17 1712147417

Huge effort, because it is the default .deb compressor in Debian for example

rthnbgrredf · 2024-04-03T12:36:03 1712147763

Arch Linux has replaced it with zstd in 2020 already. It's doable for the next major release of Debian.

kzrdude · 2024-04-03T13:43:53 1712151833

Certainly, but we need an xz decompressor to read the current debian repo versions for the next decades, when they are oldstable or archived.

formerly_proven · 2024-04-03T14:36:48 1712155008

Decoding is easy.

logro · 2024-04-03T18:09:52 1712167792

This is 100% malicious or novice coder. And we surely know it's not the latter.

If you need an unsafe call, you add a dict_put_unsafe(). That again should of course be rejected in a code review.