I'd not put any trust in geekbench. If you read the whitepaper[0], you'd notice ...

a-french-anon · on Oct 4, 2023

In general, I'd say disregard benchmarks that aren't open source and not disclosing their full compilation stack.

Don't want a repeat of Cinebench compiled with ICC or https://hothardware.com/news/antutu-mobile-benchmark-cited-b... and https://www.anandtech.com/show/7384/state-of-cheating-in-and...

nine_k · on Oct 4, 2023

I don't see why a CPU test should be only about number crunching, and not about RAM and cache access efficiency, provided that the test is not incurring gratuitous cache trashing.

Also, working with a ton of small files is a typical real-world task.

xxs · on Oct 4, 2023

>Also, working with a ton of small files is a typical real-world task.

Yes, but not compressing them individually, and while staying in the userland (and having SHA1 on each, just for kicks). The real world scenarios tend to be make a big archive (e.g. tar) and compress that thing.

>and not about RAM and cache access efficiency

It's about the size of the cache, workloads that fit L2 vs such that don't, exhibit an amazing performance boost. Pretty much, the performance drops off a cliff when it doesn't fit the L2.

Overall microbenchmarks are extremely/notoriously difficult to get right, and more often than not, they are gamed. However the compression/decompression of geekbench is just bad.

mpixel · on Oct 4, 2023

I agree with all your points and also agree that honestly, these scenarios aren't far off from real world tasks.

I get the main issue, which is you could adjust the workload by 10% and achieve a 50% performance loss when you do this at the point where we cross the cache threshold and whatnot.

However I see CPUs unique in that I rank them _for_ these scenarios. A particular might be ranked unfairly, but as long as the test is equal, the better one is infact better, just not by the 50% the test might show but it's still going to be 5% better. I expect my GPU to be idle when it isn't training AI or rendering frames, but for the CPU, it's general purpose in real life, and anything goes.

xxs · on Oct 4, 2023

>just not by the 50% the test might show but it's still going to be 5% better.

Sort of, indeed. Yet, when you see any promotional/marketing material - you see all those phallic bar graphs, and how much bigger it is. Other than that - heavy cache utilization hides inferior memory subsystem (latency/throughput), the latter tends to be quite important in the real world. Overall benchmarks/tests that feature handful of MB as datasets, and run in 100s of ms - should not be used as representative... for most use cases.

That was my initial point - 'don't trust'.

skavi · on Oct 4, 2023

geekbench 5 and 6 scores generally correspond well to spec2017 which is pretty much the industry standard.