Hacker News new | past | comments | ask | show | jobs | submit login

Additionally, using any Wiki page is misleading, as LLMs have seen their format many times during training, and can probably reproduce the original HTML from the stripped version fairly well.

Instead, using some random, messy, scattered-with-spam site would be a much more realistic test environment.




Also it can get partial credit on some of these questions without feeding in any data at all.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: