Hacker News new | past | comments | ask | show | jobs | submit login

Or Common Crawl so other people could actually download and use it?



I didn't realise you couldn't download and use the data from Internet Archive. If not, that's pretty silly to back up the feeds to them, and I'm a bit annoyed to have contributed. I'd like to make them available to everyone to download, analyse, plug into their reader etc etc etc...



You can from the Internet Archive. The GGP is talking about Twingly, and the discussion is about integrating their data with the Archive Team.


For anything substantial (like say, their actual crawl), they'll only do it on a case by case basis with a rather restrictive license and you have to drive up there and plop down the machines to copy it onto.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: