
Search being a democracy is really just a crude way of building a better ranking system than, say, just counting keyword occurrences. Humans are great at filtering out spammy and useless websites, and the democratic system picks up on that.
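That "democracy" is essentially PageRank-style link voting. A minimal sketch of the idea (toy graph plus power iteration; obviously not Google's actual implementation):

    import numpy as np

    # Toy link graph: adjacency[i][j] = 1 if page i links to page j.
    adjacency = np.array([
        [0, 1, 1, 0],
        [0, 0, 1, 0],
        [1, 0, 0, 1],
        [0, 0, 1, 0],
    ], dtype=float)

    def pagerank(adj, damping=0.85, iters=50):
        n = adj.shape[0]
        # Each page splits its "vote" evenly among the pages it links to.
        transition = (adj / adj.sum(axis=1, keepdims=True)).T
        rank = np.full(n, 1.0 / n)
        for _ in range(iters):
            rank = (1 - damping) / n + damping * transition @ rank
        return rank

    print(pagerank(adjacency))  # the page with the most incoming "votes" ranks highest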

As a next step, privacy issues aside, what if they "profiled" you by the types of things you search for, and tried to guess what you need based on other people who "think like you"?

For example, I'm a programmer, and if I search "python", I'm probably searching for something different than a biologist who is researching reptiles. This would be fairly easy to figure out from the other types of things I typically Google for.
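A crude sketch of that disambiguation idea: score each sense of the query by its overlap with the user's recent queries. The keyword sets and queries below are made up for illustration.

    # Guess which sense of an ambiguous query a user means, based on
    # overlap between their recent queries and per-sense keyword sets.
    SENSES = {
        "python (language)": {"django", "numpy", "pip", "flask", "debugging"},
        "python (snake)": {"reptile", "habitat", "venom", "species", "zoo"},
    }

    def guess_sense(recent_queries):
        tokens = {word for q in recent_queries for word in q.lower().split()}
        return max(SENSES, key=lambda s: len(tokens & SENSES[s]))

    print(guess_sense(["pip install numpy", "flask routing", "debugging tips"]))
    # -> "python (language)"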

I'm sure Google is probably already researching how to do this. It sounds difficult to me, though, because of the sheer number of models you'd have to train, store, and then figure out how to run a distributed index over. It might be more feasible to create some small set (e.g. ~1000) of "types of people" profiles and then match you into one of those types. This could also mildly alleviate the privacy issue, since the profiling could be done offline on the client.
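A rough sketch of that "small set of profile types" idea, assuming each user can be summarized as a vector of topic affinities (the features and numbers below are invented):

    import numpy as np
    from sklearn.cluster import KMeans

    # Hypothetical: each user is a vector of topic affinities, e.g. the
    # fraction of their queries about programming, biology, sports, ...
    rng = np.random.default_rng(0)
    user_topic_vectors = rng.random((10_000, 50))  # 10k users, 50 topics

    # Learn a fixed set of ~1000 "types of people" offline.
    profiles = KMeans(n_clusters=1000, n_init=1, random_state=0).fit(user_topic_vectors)

    # Matching a new user to a type is just nearest-centroid, which is
    # cheap enough that it could plausibly run client-side on the user's
    # own history, keeping raw queries off the server.
    new_user_vector = rng.random((1, 50))
    profile_id = profiles.predict(new_user_vector)[0]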




I made this point below about the inability to provide context. In the other thread I linked, I think I explained why. I'm no machine learning specialist, but I think it's because:

Google can never necessarily know what you want and can never truly know you achieved your goal, so you could not train it properly.

Not only would you need to discover what profession I'm in (assuming you had a fully updated, linked profile, etc.), you would also need to build a comparable universe of like-minded people and calibrate against it.

Then you would have to assume which inputs are similar, in that they have the same or similar parameters, and expect similar results.

Then you would have to assume which link I clicked was the answer, for every person who did the same thing.

Then you would have to discount your own bias as an engine, because you provide the top results to me and (for now) people trust the engine, so they typically face a false choice among the first 5-10 results. If those 5-10 results are wrong, the whole model is in error to the extent they are wrong.
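That last point is roughly what the click-model literature calls position bias. One standard counter-measure is to weight clicks by the inverse of how likely a user was to even look at that rank. A toy sketch (the examination probabilities are made up; in practice they're estimated from randomized ranking experiments):

    # examination_prob[i] = estimated chance a user even looks at rank i.
    examination_prob = [0.95, 0.80, 0.60, 0.45, 0.30]

    clicks = [
        {"rank": 0, "clicked": 1},
        {"rank": 2, "clicked": 1},
        {"rank": 4, "clicked": 0},
    ]

    # A click at rank 2 counts for more than a click at rank 0, because
    # fewer users ever examined rank 2 in the first place.
    weighted_relevance = [c["clicked"] / examination_prob[c["rank"]] for c in clicks]
    print(weighted_relevance)  # [1.05..., 1.66..., 0.0]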

Any one of these would introduce error, and the cascade leads to a larger disparity. Google IS SO AMAZINGLY GOOD that it has actually managed to make this a non-problem for a very long time.


> Google can never necessarily know what you want and can never truly know you achieved your goal, so you could not train it properly.

They do have some amount of confirmation. All of their search results are redirect links, so they're tracking which links you click on. Based on the timing of those clicks, they can tell if you clicked on a result, left that site a few seconds later, and then clicked on another result further down the page, which probably means the first result didn't give you what you wanted. It's not perfect, but it's still potential training data.
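A rough sketch of how that timing signal could be turned into a training label (the log format and the 10-second threshold are invented):

    # Flag "pogo-sticking" from one user's click log on a results page.
    click_log = [
        {"result_rank": 1, "clicked_at": 0.0},
        {"result_rank": 3, "clicked_at": 6.0},    # back after only 6s
        {"result_rank": 5, "clicked_at": 200.0},  # long dwell on rank 3 first
    ]

    SHORT_DWELL_SECONDS = 10

    def label_results(log):
        labels = {}
        for current, nxt in zip(log, log[1:]):
            dwell = nxt["clicked_at"] - current["clicked_at"]
            # A quick return followed by another click suggests the
            # earlier result didn't satisfy the query.
            labels[current["result_rank"]] = dwell >= SHORT_DWELL_SECONDS
        return labels  # rank -> looks satisfying?

    print(label_results(click_log))  # {1: False, 3: True}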

If that site has Google Ads or a Google '+1' icon, they can get slightly more information about how you spend your time on that site. I don't know about the legality of this but it's technically feasible.



