I'm thinking that a multi-pass parser:
1. Is this sane HTML5 markup? I'll eat that.
2. Is this a borked-but-manageable variant? Strip the cruft.
3. Is this Beyond All Hope? Just dump it as ASCII. It'll be blockquoted in reply.
HTML is no longer a peresistent, stable, or universal interchange format.
I'm thinking that a multi-pass parser:
1. Is this sane HTML5 markup? I'll eat that.
2. Is this a borked-but-manageable variant? Strip the cruft.
3. Is this Beyond All Hope? Just dump it as ASCII. It'll be blockquoted in reply.
HTML is no longer a peresistent, stable, or universal interchange format.