Hacker News

I've been trying to do something similar for a while now. In the past I tried using YaCy in private mode, scraping a few aggregators and RSS feeds I read, plus two levels of links out from them. That was cool, but YaCy is practically dead these days and has various issues. Currently I'm trying ArchiveBox for extraction and storage, and poking around with importing the results into Verba for RAG-style search using a local Mixtral model. ArchiveBox is nice in that it can extract text from different types of media through a number of plugins. It's early days, but I think that's got a future.
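
A rough sketch of the "feeds plus two levels of links" idea, for anyone who wants to try it without a full crawler. This is not how YaCy or ArchiveBox work internally; it is just a depth-limited breadth-first crawl using the Python standard library. The names `crawl`, `extract_links`, and the `fetch` callable are made up for illustration, and `fetch` is assumed to be any function that takes a URL and returns HTML text.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(base_url, html):
    """Return absolute http(s) links found in html, with fragments removed."""
    parser = LinkParser()
    parser.feed(html)
    links = []
    for href in parser.links:
        absolute, _fragment = urldefrag(urljoin(base_url, href))
        if absolute.startswith(("http://", "https://")):
            links.append(absolute)
    return links


def crawl(seeds, fetch, max_depth=2):
    """Breadth-first crawl. Seeds are depth 0; links are followed up to max_depth.

    `fetch` is a hypothetical callable (url -> html text) supplied by the caller.
    Returns a dict mapping each discovered URL to the depth it was first seen at.
    Pages at max_depth are recorded but not fetched.
    """
    seen = {}
    queue = deque((url, 0) for url in seeds)
    while queue:
        url, depth = queue.popleft()
        if url in seen:
            continue
        seen[url] = depth
        if depth == max_depth:
            continue
        try:
            html = fetch(url)
        except Exception:
            # Skip pages that fail to download; a real crawler would log this.
            continue
        for link in extract_links(url, html):
            if link not in seen:
                queue.append((link, depth + 1))
    return seen
```

The resulting URL list is what would then be handed to whatever does the archiving and text extraction. Passing `fetch` in as a parameter keeps the crawl logic testable offline and leaves rate limiting and robots.txt handling to the caller.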



I am working on pretty much exactly this same thing :)

Anything you can share yet?

Here is mine: https://github.com/ydennisy/kg1


Yep. ArchiveBox here too. Am poking at the same problem, using a variety of RAG prompts.



