Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
whimsicalism
4 months ago
|
parent
|
context
|
favorite
| on:
GPT-4o's Memory Breakthrough – Needle in a Needles...
Increasingly convinced that nobody on the public internet knows how to do actual LLM evaluations.
tedeh
4 months ago
[–]
I'm just glad that we are finally past the "Who was the 29th president of the United States" and "Draw something in the style of Van Gogh" LLM evaluation test everyone did in 2022-2023.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: