Yep, the idea of truth or falsity is not part of the design, and if it were, it would be a different and vastly (like, many orders of magnitude) more complicated thing.
If, based on the training data, the most statistically likely series of words for a given prompt is the correct answer, it will give correct answers. Otherwise it will give incorrect answers. What it can never do is know the difference between the two.
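To make the "statistically likely series of words" point concrete, here's a minimal sketch of greedy next-token prediction. It assumes the Hugging Face transformers library and the small gpt2 checkpoint purely for illustration; ChatGPT is a much larger model served differently, but the basic loop is the same shape, and nothing in it checks whether the continuation is true, only which token the model scores as most probable:

```python
# Minimal sketch of greedy next-token prediction (assumes: pip install torch transformers).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

prompt = "The capital of Australia is"
ids = tokenizer(prompt, return_tensors="pt").input_ids

with torch.no_grad():
    for _ in range(5):
        logits = model(ids).logits          # scores for every vocabulary token
        next_id = logits[0, -1].argmax()    # pick the single most likely next token
        ids = torch.cat([ids, next_id.view(1, 1)], dim=-1)

print(tokenizer.decode(ids[0]))
# Prints whatever continuation the training data makes most probable,
# correct or not; there is no truth check anywhere in this loop.
```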
> If, based on the training data, the most statistically likely series of words for a given prompt is the correct answer, it will give correct answers.
ChatGPT does not work this way. It wasn't trained to produce "statistically likely" output; it was trained to produce output that humans rate highly.
Not exactly. ChatGPT absolutely was trained to produce statistically likely output; it just had an extra training step added on top for human ratings. If they had relied entirely on human ratings, there would not have been enough data to train the model.
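As a rough sketch of how the two training signals differ (PyTorch, with hypothetical tensor names and a stand-in `reward_model`, just to illustrate the idea): the pretraining stage minimizes next-token cross-entropy over a huge text corpus, and the later RLHF-style stage fine-tunes the same model toward responses a reward model, itself trained on human ratings, scores highly:

```python
# Illustrative sketch only, not OpenAI's actual training code.
import torch
import torch.nn.functional as F

# --- Stage 1: pretraining on raw text (the "statistically likely" part) ---
# logits:  [batch, seq_len, vocab] predicted by the model
# targets: [batch, seq_len] the tokens that actually came next in the corpus
def pretraining_loss(logits, targets):
    # Standard next-token cross-entropy: push probability mass toward
    # whatever token actually followed in the training data.
    return F.cross_entropy(logits.view(-1, logits.size(-1)), targets.view(-1))

# --- Stage 2: RLHF-style fine-tuning (the human-ratings part) ---
# reward_model is a hypothetical callable that scores whole responses; it is
# itself trained on human preference ratings, which is where "highly rated
# by humans" comes in.
def rlhf_objective(reward_model, response_tokens):
    # Nudge the already-pretrained model toward responses the reward model
    # scores highly (real implementations use PPO plus a KL penalty against
    # the pretrained model, omitted here).
    return -reward_model(response_tokens).mean()
```

The point of the split is the one made above: stage 2 only reshapes a model that already exists because of stage 1, since human ratings alone are far too scarce to train it from scratch.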