>Will the published caches be 99% crap

Yes. It will be exactly as crap as whatever's published on the web.

And the utility of Google's search engine would be to perform its proprietary processing on top of the publicly available crawl results, analogous to how its search is already performing proprietary processing on top of a crawl cache.

>If you don't then you fail to appreciate the amount of labor it takes to thwart bad actors from ruining indexes.

Did you miss the part where I said "Assuming this hypothetical shared crawl cache were to exist, it does not preclude Google (and all consumers of that cache) doing their own processing downstream of that cache. Does it?"



