They're not using any fancy algorithms...except when they implement a customized...

jfasi · on May 8, 2014

Say what you will, but while Boyer-Moore can be tricky to implement, it's not exactly a fancy algorithm.

snewman · on May 8, 2014

Exactly; "fancy" is relative. It's a bit tricky, but it's nothing like the complexity of maintaining and using a keyword index. The reference implementation given in the article we linked to is thirty-odd lines of code. What we're using in practice is somewhat larger, in part because Java is more verbose for this kind of thing, but still reasonable. (If there's interest, we'd be happy to post the code.)

victor106 · on May 8, 2014

Would love to look at your code...

j2kun · on May 8, 2014

I think the real point is that they did a bona-fide tradeoff analysis and found that for their use case one algorithm was better than another. It's not about how fancy the algorithm is, that's just how great engineering works. It's only surprising if you don't consider "brute force" to be just as valid a tool as any other.

twic · on May 8, 2014

Moreover, however fancy Boyer-Moore is, the data structure it is being applied to is incredible simple. There are no inverted indexes, B-trees, etc - just a string.