I’m curious. Scraping seems to come up a lot lately. What is everyone scraping? ...

nickpsecurity · 2024-09-07T23:33:30 1725752010

To add to others’ points, we can do two more things:

1. Pretrain models with any legal, scraped content. That includes updating existing models with recent data.

2. Have our own private collection of pages we’ve looked at. Then, we can search them with a local engine.

jstanley · 2024-09-07T10:12:30 1725703950

With people making LLMs act as agents in the world, the line between "scraping" and "ordinary web usage" is becoming very blurred.

samrolken · 2024-09-07T09:50:46 1725702646

Context for LLMs, and use cases uniquely enabled by LLMs, mostly I think.