I don't see how the fact that he's a Mozilla blogger is relevant. The file conta...

jcromartie · on Oct 23, 2012

I'd imagine you could scrape a million people's publicly available info, too.

sneak · on Oct 23, 2012

Have you ever tried scraping Facebook?

Achshar · on Oct 23, 2012

What should I expect?

Permit · on Oct 23, 2012

They're pretty clever. When I started programming in 2009, I wrote a small scraper that would create accounts, friend people and steal their info if they accepted. (I never released it past my own friends list and never sold the data).

There were the obvious checks for CAPTCHAs when too much activity was detected, but other subtleties as well. If you looked at too many people's profiles, emails wouldn't be displayed as text, but as images. A person would be unlikely to notice as the pages looked identical, but dynamic changes like that make it harder to scrape some things. Introducing even rudimentary OCR requirements is enough to turn away a lot of programmers.

I'm not saying it's not possible to pull off. But Facebook has set it up so any money you might make this way will likely not be worth the development time required.

mkjones · on Oct 24, 2012

Glad you found our anti-scraping stuff to be neat! I work on the team that builds a lot of that technology at Facebook. Any interest in interning here sometime and helping us improve our systems even more?

Permit · on Oct 24, 2012

You guys do a really great job.

To be perfectly honest, I've kind of fallen out of love with web development in the last year and have taken more of an interest in algorithmic trading. I appreciate the interest, though. :)

fghh45sdfhr3 · on Oct 23, 2012

Soon we'll have very clever, slow going, open source Facebook scrapers, created for free just because we love a challenge.

Evbn · on Oct 24, 2012

You could friend people, get to know them, get their email, go to a party to meet their friends, friend them.... and eventually scrape the whole network, if you had a team working in parallel.

gailees · on Oct 23, 2012

Exactly what I'm saying....most of these people probably could care less whether their info is public or not.

Many people talk about caring about their security in an almost idealistic view; few actually care in application.

shardling · on Oct 23, 2012

It gives context?

bashzor · on Oct 23, 2012

It associates Mozilla too, unfairly imho.