Hacker News

How does Google figure out relevance in real time? With Twitter, it's user-driven content with tags etc.

But with the net at large, blogs etc., this becomes difficult. Incoming links etc. are hard to determine in real time (primarily because they haven't occurred yet).




The site's PR (PageRank), uptime/age, update rate, and uniqueness of content. By this measure HN is near the ideal: it has amazing inbound PR but does not link out much. It's been running fast and fine for ~3 years, and the content is often unique, in the sense that it contains phrases Googlebot has never encountered before.

An experiment: here is a search that matches an exact phrase in this comment. http://www.google.com/search?q=%22By+this+measure+HN+is+near...

At the moment it returns nothing. Within a minute or two this comment will be the first result.
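
For anyone repeating the experiment, here is a rough sketch of how such an exact-phrase search link can be built. The function name `google_search_url` and its `exact` parameter are made up for illustration, and the phrase used is only the visible prefix of the truncated link above, not the full query. It only constructs the URL; it does not query Google.

```python
from urllib.parse import urlencode

def google_search_url(phrase: str, exact: bool = True) -> str:
    """Build a search URL; wrap the phrase in double quotes for an exact match.

    Hypothetical helper for illustration only.
    """
    query = f'"{phrase}"' if exact else phrase
    # urlencode percent-encodes the quotes as %22 and turns spaces into '+'
    return "http://www.google.com/search?" + urlencode({"q": query})

# Exact-phrase form (quoted) and the looser unquoted form
print(google_search_url("By this measure HN is near"))
print(google_search_url("By this measure HN is near", exact=False))
```

The quoted form asks for the words in that exact order, while the unquoted form lets any page containing the individual words match.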


52 minutes and counting. ;)


Yep. Fail. Stuff from yesterday is indexed, however:

http://www.google.com/search?q=confusingly+called+copy-regio...


The quotes seem to be the problem; it works without them for me.


12 hours later, still no match.


It has appeared in the index after somewhere between 12 and 17 hours.



