There's a commit (https://github.com/ctrl-alt-test/mouton/commit/79d2d1eab7a22...) where we save many bytes by removing a performance optimization. We originally wanted to keep it, but we realized we were short on bytes and that optimization was not required on recent-ish GPUs.
Would love to hear more technical details on how corners have been cut to shave off some bytes.