How does Google figure out relevance in realtime? With twitter, it's user driver...

aristus · on May 19, 2009

The site's PR, uptime/age, update rate, uniqueness of content. By this measure HN is near the ideal: it has amazing inbound PR but does not link out much. It's been running fast and fine for ~3 years, the content is often unique, in the sense that it contains phrases Googlebot has never encountered before.

An experiment: here is a search that matches an exact phrase in this comment. http://www.google.com/search?q=%22By+this+measure+HN+is+near...

At the moment it returns nothing. within a minute or two this comment till be the first result.

foulmouthboy · on May 19, 2009

52 minutes and counting. ;)

aristus · on May 19, 2009

Yep. Fail. Stuff from yesterday is indexed however:

http://www.google.com/search?q=confusingly+called+copy-regio...

ZeroGravitas · on May 20, 2009

The quotes seem to be the problem, works without for me.

ntoshev · on May 20, 2009

12 hours later, still no match.

ntoshev · on May 20, 2009

It has appeared in the index after somewhere between 12 and 17 hours.