
Quote from Mike:

    > Rather, you simply split the file into 218 parts ending with the
    > character "5" and then stripped that final character from each part.  Thus the
    > "decompressor" is nothing more than a reassembler,
    > concatenating the parts and reappending
    > the character "5" after each.
Well, that's exactly the definition of lossless compression. Look at how e.g. js crunch works: you build a dictionary of common sequences, split the file on those sequences recursively, and then reassemble it by joining in reverse. Gzip, bzip2, &c, &c, it's all the same thing: split the file on a common sequence and reassemble it by rejoining on that sequence. Patrick just created a customized compressor that goes only one level deep.

Normally you'd need a delimiter to separate those chunks, one that doesn't occur in the chunks themselves, e.g. made safe through padding or escaping. That, in turn, increases the file size, and now you're in trouble. What Patrick did was use EOF as a fresh "delimiter" that doesn't occur anywhere in the data, and at a cost of zero bytes, no less.

Cheating, or inventive.
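
For illustration, here's a minimal sketch of that scheme in Python (the function names and the choice of split byte are mine, and it splits on every occurrence rather than the 218 specific points from the quote): the "compressor" cuts the data at the chosen byte and would store each part as its own file, letting end-of-file stand in for the stripped byte; the "decompressor" just joins the parts back together with that byte.

    SPLIT = b"5"

    def compress(data):
        # each part (except possibly the last) originally ended in SPLIT;
        # bytes.split drops that byte, which is the whole "compression"
        return data.split(SPLIT)

    def decompress(parts):
        # re-append the stripped byte between parts; in the real scheme the
        # parts would be read back from separate files, in order, up to EOF
        return SPLIT.join(parts)

    data = b"any old data with 5s scattered through it: 5 55 5"
    parts = compress(data)              # each part would become its own file
    assert decompress(parts) == data
    # the counted bytes across all parts shrink by one per split point; that
    # information now lives in the number and sizes of the part files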




The EOF doesn't come at a cost of zero bytes; it costs as much as storing the length of each constituent file. The extra space used is in the file system's accounting.


It's at a cost of zero bytes of competition score. Mike screwed up by allowing an alphabet of 257 symbols and then only counting 256 of them. Pretty much any compression or repacking algorithm could have been used at that point.


Dylan16807, that's a very concise way to put it; thanks for making that comment.


In return, Patrick screwed up by relying on the extra filesystem metadata without actually ensuring that it would be left unaltered; only the file contents were guaranteed to be left untouched.
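
To put rough, assumed numbers on that accounting point (the part count comes from the quote above; the 4-byte size field is just an illustrative minimum, and real per-file filesystem overhead is larger):

    parts = 218             # parts ending in "5", per Mike's description
    saved = parts * 1       # one counted byte removed per part
    size_field = 4          # assume even a bare 4-byte length per extra file
    metadata = parts * size_field
    print(saved, metadata)  # 218 bytes shaved off the score, at least 872 bytes of accounting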


It's inventive, but what if instead of doing it at the byte level, we did it at the bit level? Instead of splitting files on every "5" encountered, we split them at every 1 in binary, which would give about 50% "compression".


That would essentially be a kind of run-length encoding. I doubt that a 50% compression ratio is achievable, though.
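
A quick sketch of that bit-level variant, just to make the run-length-encoding point concrete (bits represented as a string purely for clarity): each part is a run of zeros, so stripping the 1s only moves their positions into the part boundaries.

    def split_on_ones(bits):
        return bits.split("1")       # parts are (possibly empty) runs of "0"

    def reassemble(parts):
        return "1".join(parts)       # put the stripped 1s back

    original = "0110100010111"
    parts = split_on_ones(original)  # ['0', '', '0', '000', '0', '', '', '']
    assert reassemble(parts) == original
    # roughly half the bits are gone from the parts, but storing each part
    # boundary (a separate file, or even a length field) costs far more than
    # the single bit saved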


I wonder how you got a downvote for that; it's quite accurate. A version of this simplistic token replacement even has a Wikipedia page: http://en.wikipedia.org/wiki/Byte_pair_encoding
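
For reference, a small sketch of byte pair encoding along the lines of that page (symbols kept as a list of ints so that values >= 256 can serve as the "unused" replacement tokens; a toy, not an efficient implementation):

    from collections import Counter

    def bpe_compress(data, unused_symbols):
        table = []
        for sym in unused_symbols:
            pairs = Counter(zip(data, data[1:]))
            if not pairs:
                break
            (a, b), count = pairs.most_common(1)[0]
            if count < 2:
                break                          # no pair worth replacing
            out, i = [], 0
            while i < len(data):
                if i + 1 < len(data) and (data[i], data[i + 1]) == (a, b):
                    out.append(sym)            # replace the most common pair
                    i += 2
                else:
                    out.append(data[i])
                    i += 1
            data = out
            table.append((sym, a, b))
        return data, table

    def bpe_decompress(data, table):
        for sym, a, b in reversed(table):      # undo replacements in reverse
            expanded = []
            for x in data:
                expanded.extend([a, b] if x == sym else [x])
            data = expanded
        return data

    text = list(b"abababcabab")
    packed, table = bpe_compress(text, unused_symbols=[256, 257])
    assert bpe_decompress(packed, table) == text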



