I skimmed through the article but that's a lot of assumptions there if so. 1. So...

cbolton · on Dec 23, 2023

No the distribution doesn't matter at all. I've given an extreme example here: https://news.ycombinator.com/item?id=38742735

Keyframe · on Dec 24, 2023

I see what you did there. So basically an overlapped proportion (or hits proportion) would be overlapping hits divided by samples run, and then an estimated total would be this proportion divided by total space of possibilities. That would work.

remus · on Dec 23, 2023

Video IDs are generated by hashing a a secret identifier, so they should be uniformly distributed.