It's impossible for me to know how long the engines thought on each move, so I just analyzed the games at a variety of times and depths. Stockfish 8 at two minutes per move on a machine that is slower than that used in the match finds plenty of issues. If we know the amount of time AZ thought (or indeed... had access to AZ) it'd be possible to more closely reproduce the games.
This is all either of us would have done if we were peer reviewing the paper, don't really understand the hostility about trying to reproduce a published paper.