Actually sounds pretty cool, but the graph on expert level tasks is confusing my...

random_cynic · 2025-02-03T03:08:09 1738552089

I think that's mostly because of the access to information it has. Much of the highly useful information is not on the public internet or shows up on search engines, only domain experts know about them. Also, the websites may be paywalled or gated by login. So a better comparison would be if the models had the same level of access as an expert.