Hacker News new | past | comments | ask | show | jobs | submit login

It's exactly how email spam filters evolved.

They used to be completely local and even some DIY solutions, evolved to signature updates, but eventually the attacks grew so advanced that only online services could be updated and aggressive enough, which is of course how gmail took over the internet with near perfect spam filter (when was the last time you checked a gmail spam folder).

The last generation of local spam filters were pretty good though. Anyone remember Eudora and Spamnix?




Local spam filtering still works quite fine. It just needs a lot of data most users probably don't have when starting out.

I just use bogofilter, and it worked almost perfectly from the start, just because I saved years upon years of SPAM and HAM. 10's of thousands of messages each.

It got slightly worse over years, because I incrementally only train it on new SPAM but not on new HAM, because of laziness.

People probably have HAM archives, but don't usually save their SPAM, to be able to start using Bayesian spam filters right away with great results.

Personally I find it much better than whatever Google uses. I don't even bother with SMTP level domain/IP blacklists, or reverse IP/domain checks anymore. All mail is just passed right to the mailbox and is then pre-filtered by a bogofilter to SPAM folder that I check once weekly, and barely find any HAM there. I receive about 500k mails a year.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: