I'm surprised that this pays attention to '\r' (CR) specifically, and not '\n' (...

rurban · 2024-06-14T05:59:59

I also have no idea why he needs the newline mask for \r. Only <pre> blocks, and only on Windows, would need that.

account42 · 2024-06-14T09:47:01

<pre> blocks don't depend on the OS, that would be ridiculous.

kristianp · 2024-06-14T06:40:04

Is it something to do with HTTP headers? They have CR LF pairs terminating the lines.

JoshTriplett · 2024-06-14T07:22:21

I wondered about that, but that wouldn't be described as parsing HTML, and it shouldn't involve parsing '<' and '&'.

eknkc · 2024-06-14T10:19:34

They probably need to do a pass to find all \r chars and check whether the next char is \n. If it is, discard the \r so the pair collapses to a single \n. Otherwise convert the \r to \n.
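A minimal sketch of that pass, just to make the idea concrete. This is a plain scalar version, not the SIMD code under discussion, and `normalize_newlines` is a hypothetical name chosen for illustration:

```python
def normalize_newlines(text: str) -> str:
    """Collapse CRLF pairs to LF and turn any remaining lone CR into LF.

    Hypothetical helper for illustration only; it is not taken from the
    parser being discussed.
    """
    out = []
    i = 0
    n = len(text)
    while i < n:
        ch = text[i]
        if ch == "\r":
            # Every CR produces exactly one LF in the output.
            out.append("\n")
            # If the CR is followed by LF, skip that LF so CRLF
            # collapses to a single newline.
            if i + 1 < n and text[i + 1] == "\n":
                i += 1
        else:
            out.append(ch)
        i += 1
    return "".join(out)
```

With this, a string that mixes all three conventions, such as "a\r\nb\rc\nd", should come out with only \n line breaks, which would explain why a parser cares about finding \r at all.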