Hacker News new | past | comments | ask | show | jobs | submit login

All these strategies are interesting, but I'm afraid we are over-engineering the problem here. The pretty simplistic strategy I'm using now is basically just pattern matching, and so far I had only 4 misplaced posts out of the 840 for April alone: that is < 0.5%. And it's blazing fast! I can rebuild the entire db in less then 30 seconds.

Given these number I believe pretty much everything more complicated than that would be a total overkill... Good food for thoughts though!




I just manually curate in these cases. HN hiring threads don't ever exceed a level where 0.5% manual review would be onerous.


I think you will need 100% manual review to find those 0.5%




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: