
I wonder how much of models' difficulty saying "I don't know" comes from RLHF pushing them to complete the prompt no matter what. With custom instructions asking the model to be more honest, I do get some reasonable follow-up questions and the occasional admission that it doesn't know something. Not perfect, but that it works at all is interesting.


