Baeocystin on Nov 13, 2023 | on: You need a mental model of LLMs to build or use a ...

I wonder how much of LLMs' difficulty saying "I don't know" comes from RLHF pushing them to finish the prompt no matter what. With custom instructions asking the model to be more honest, I do get some reasonable follow-up questions and occasional admissions of lack of knowledge. Not perfect, but that it works at all is interesting.