It's time to design a public benchmark for these types of systems to compare bet...

jacquesm · on May 31, 2023

But this is the same version that changes without a change of the version number.

airgapstopgap · on May 31, 2023

Well, people suspect it isn't, and it's not like we can see the internal version designation, and it's not even like we would care a lot, if it performed identically from day to day.

Indeed, you could do better or worse with the exact same raw checkpoint, just depending on inference-optimizing tricks.

pmontra · on May 31, 2023

So the version number is the day the benchmark is run. Version yyyy-mm-dd