Super-optimizers always strike me the same way when I see examples like this. On the one hand, it's awesome to have truly optimal (under chosen assumptions) code sequences for certain operations. On the other hand, so many of the examples just hammer home how many practical opportunities are being missed by peephole optimizations (in this case) and other traditional, local optimization techniques. I almost feel like the most valuable use of a super-optimizer would be to run it against a bunch of real-world code patterns, and then feed the results back into those simple traditional optimizations. Something like the sketch below.
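The classic illustration is the signum function from Massalin's original superoptimizer paper: the superoptimizer discovers a branch-free instruction sequence, and that sequence can then be baked into the compiler as an ordinary peephole rule. A rough C rendering of the idea (assuming 32-bit two's-complement ints; the exact branchless expression is my illustration, not Massalin's machine code):

    #include <stdio.h>

    /* What a naive compiler emits: compares and conditional jumps. */
    int sign_naive(int x) {
        if (x > 0) return 1;
        if (x < 0) return -1;
        return 0;
    }

    /* The kind of branch-free sequence a superoptimizer finds.
       Assumes 32-bit int; the signed right shift is
       implementation-defined but arithmetic on mainstream targets. */
    int sign_branchless(int x) {
        return (x >> 31) | (int)(-(unsigned)x >> 31);
    }

    int main(void) {
        for (int x = -2; x <= 2; x++)
            printf("%2d -> %2d %2d\n", x, sign_naive(x), sign_branchless(x));
        return 0;
    }

Once verified, "sign of an int" becomes a pattern the compiler can match and rewrite locally, with no search needed at compile time.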
LLVM generates so much more code than GCC for that. Crazy - obviously the compiler is damned if it does, damned if it doesn't, but optimizing for small strings is probably not a bad idea.
Semi-related: does anyone know if there's been promising research into using LLMs for compilation? It seems like an LLM that can understand a codebase at higher levels of abstraction could (in theory) optimize it in ways traditional compilers can't.
I think using an LLM specifically would be wasteful and produce too many errors.
There are many research projects out there exploring different ways to use neural networks for compiler optimisation though.
My impression is that most of them use neural networks to replace the heuristics that find candidate solutions for local optimisation problems.
AlphaDev[1][2] seems to be the only project, in the news at least, that approaches low-level compilation research, and even there it covers only a small subset of optimizations.
It depends on what parameters you give the system control over. It doesn't necessarily have to generate the code itself; it can simply control one of the many other knobs. For example, if you only give it control over some dimensional parameters like command-line flags, or warp/block/thread scheduling parameters (e.g. in GPU programming), then you could ask questions like "Given this block of code, running on this hardware, what are the best compilation flags and build parameters?" A toy version of the flag-tuning idea is sketched below.
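Roughly this shape, as a sketch (the flag list, the bench.c target, and the timing harness are all assumptions, and it needs POSIX clock_gettime; a real system would search a far larger space and control for measurement noise):

    #define _POSIX_C_SOURCE 199309L
    #include <stdio.h>
    #include <stdlib.h>
    #include <time.h>

    /* Wall-clock seconds via a monotonic clock (POSIX). */
    static double now(void) {
        struct timespec ts;
        clock_gettime(CLOCK_MONOTONIC, &ts);
        return ts.tv_sec + ts.tv_nsec / 1e9;
    }

    int main(void) {
        /* The search space: a handful of flag sets to sweep. */
        const char *flagsets[] = {
            "-O2", "-O3", "-O3 -funroll-loops", "-O3 -march=native",
        };
        for (size_t i = 0; i < sizeof flagsets / sizeof *flagsets; i++) {
            char cmd[128];
            snprintf(cmd, sizeof cmd, "cc %s -o bench bench.c", flagsets[i]);
            if (system(cmd) != 0) {
                fprintf(stderr, "build failed: %s\n", flagsets[i]);
                continue;
            }
            double t0 = now();
            if (system("./bench") != 0)
                continue;
            printf("%-22s %6.3f s\n", flagsets[i], now() - t0);
        }
        return 0;
    }

The learned model would replace the exhaustive sweep: given the source (or some representation of it) plus a hardware description, it predicts which point in this space to try first.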
There was an interesting paper a while back, before the LLM craze, that basically did this for OpenMP programs. You'd feed it the source code of some loop body or whatever, and it would try to pick the right OMP block/loop scheduling parameters from there. Something like the knobs in the snippet below.
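Concretely, the schedule kind and chunk size in the pragma are exactly the per-loop parameters such a model predicts. The values here are made up for illustration (compile with cc -fopenmp):

    #include <stdio.h>

    #define N 1000000

    int main(void) {
        static double a[N];
        /* schedule(dynamic, 1024) is one point in the search space:
           static vs. dynamic vs. guided, plus the chunk size, are the
           knobs the model picks based on the loop body. */
        #pragma omp parallel for schedule(dynamic, 1024)
        for (int i = 0; i < N; i++)
            a[i] = (double)i * i / (i + 1.0);
        printf("a[N-1] = %f\n", a[N - 1]);
        return 0;
    }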
Naturally, if you're training it on open code, you will have to censor the AI so it doesn't use flags like -ffast-math too much. :)