> p50/p99 retrieval times at realistic loads or it didn't happen.

Therein lies the problem: how do you generate genuinely realistic load for a search engine without a large number of people actually using it to search? Simply hitting it with random search terms isn't realistic.

Some users will be on slow connections, queries for something specific might spike in only a certain region (an earthquake, for example), and so on.

If your terms are too random, it'll perform worse than it should (results not in the cache), and if not random enough it will perform better than it should.


One practical solution is to replay historical search logs. Just because "random" is a bad answer doesn't mean people don't try to build reasonable reproductions of production load to replay and benchmark against. Caching is also a big factor.
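
Rough sketch of the kind of replay harness I mean (Python; the log format, endpoint URL, and concurrency level are all placeholder assumptions, not anything standard):

    # replay_log.py - replay historical queries against a search endpoint
    # and report p50/p99 latency. Log format, endpoint, and concurrency
    # are assumptions; adapt them to your own setup.
    import math
    import time
    import urllib.parse
    import urllib.request
    from concurrent.futures import ThreadPoolExecutor

    SEARCH_URL = "http://localhost:8080/search"  # placeholder endpoint
    LOG_FILE = "queries.log"                     # one query string per line
    CONCURRENCY = 8                              # rough stand-in for parallel users

    def run_query(q):
        """Issue one search and return its wall-clock latency in milliseconds."""
        url = SEARCH_URL + "?" + urllib.parse.urlencode({"q": q})
        start = time.perf_counter()
        with urllib.request.urlopen(url, timeout=10) as resp:
            resp.read()  # drain the body so transfer time is included
        return (time.perf_counter() - start) * 1000.0

    def percentile(samples, p):
        """Nearest-rank percentile over a list of latency samples."""
        ordered = sorted(samples)
        k = min(len(ordered) - 1, max(0, math.ceil(p / 100.0 * len(ordered)) - 1))
        return ordered[k]

    def main():
        with open(LOG_FILE) as f:
            queries = [line.strip() for line in f if line.strip()]
        # Submit queries in log order so repeated terms exercise the cache
        # roughly the way they did in production.
        with ThreadPoolExecutor(max_workers=CONCURRENCY) as pool:
            latencies = list(pool.map(run_query, queries))
        print(f"queries: {len(latencies)}")
        print(f"p50: {percentile(latencies, 50):.1f} ms")
        print(f"p99: {percentile(latencies, 99):.1f} ms")

    if __name__ == "__main__":
        main()

Replaying in log order (rather than shuffled) matters for exactly the cache-hit-rate reason mentioned above.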


I don't know if this is true for Elasticsearch, but at least with Solr, when you update the index the default is to re-run some of the queries from the old searcher's caches to warm up the new one.
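
Conceptually the warming step looks something like this toy sketch (Python, purely illustrative; the Searcher class, cache size, and method names are made up and this is not Solr's actual code):

    # Toy illustration of cache autowarming: when a new searcher replaces
    # an old one, re-run the most recently used cached queries against the
    # new index so the first real users don't all hit a cold cache.
    from collections import OrderedDict

    class Searcher:
        def __init__(self, index_version, autowarm_count=16):
            self.index_version = index_version
            self.autowarm_count = autowarm_count
            self.query_cache = OrderedDict()  # query -> results, in LRU order

        def search(self, query):
            if query in self.query_cache:
                self.query_cache.move_to_end(query)  # mark as recently used
                return self.query_cache[query]
            results = self._execute(query)  # the expensive part
            self.query_cache[query] = results
            return results

        def _execute(self, query):
            # Stand-in for actually running the query against the index.
            return f"results for {query!r} on index v{self.index_version}"

        def warm_from(self, old_searcher):
            # Re-run the N most recently used queries from the old searcher's
            # cache so they are already cached when this searcher goes live.
            recent = list(old_searcher.query_cache)[-self.autowarm_count:]
            for query in recent:
                self.search(query)

    # After an index update: build the new searcher, warm it, then swap it in.
    old = Searcher(index_version=1)
    old.search("p99 latency")
    old.search("earthquake news")

    new = Searcher(index_version=2)
    new.warm_from(old)       # runs the cached queries against the new index
    current_searcher = new   # only now does it start serving traffic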