Hacker News new | past | comments | ask | show | jobs | submit login

The catch is this database would need to be kept away from spammers, lest they be able to test their site designs against it directly.



Not so much the database as the algorithm; if it's sufficiently understood (as seems to be the case with Google's algorithm, or at least a lot of people claim it to be the case), then spammers can target it directly. Merely having the original data used to train it doesn't give much insight into the algorithm itself.

On second thought, though, being able to identify common characteristics of the least spam-like websites would allow spammers to mimic those characteristics. It would take a lot of effort (figuring out the core bits), but they are clearly willing to put that in. So yes, I suppose that you're right.





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: