Hacker News new | past | comments | ask | show | jobs | submit login

This is false. If you're doing WARC correctly, HTTP resources/responses are stored verbatim.

Perhaps the only possible purer format would be packet captures, say of the full HTTPS session, along with the session keymatter and connection metadata to later extract the verbatim HTTP resources. That'd be interesting, but I doubt that's what this "22120 format" (for which I see no documentation links) does.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: