> I’d also note this isn’t confidence in the answer but in the token prediction.
I really don't understand the distinction you're trying to make here. Nor do I understand how you define "computable confidence" - when you ask an LLM to give you a confidence value, it is indeed computed. (It may not be the value you want, but... it exists)
The assertion that token likelihood is some sort of accuracy metric is false. More traditional AI techniques do compute probabilistic reasoning scores that genuinely are likelihoods of accuracy; token likelihood isn't one of them.
I’d note you can’t ask an LLM for a confidence value and get an answer that isn’t total nonsense. The likelihood scores for token predictions given prior tokens aren’t directly accessible to the LLM, and they aren’t intrinsically meaningful in the way people hope they might be anyway. A model can quite confidently produce nonsense with a high likelihood score.
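To make that concrete, here’s a rough sketch (assuming Hugging Face transformers and the small gpt2 checkpoint, both purely as stand-ins) of where those per-token likelihoods actually live: in the decoding loop, visible to whoever runs the model, not to the model’s own generated text.

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tok = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    prompt = "The capital of Australia is"
    inputs = tok(prompt, return_tensors="pt")

    with torch.no_grad():
        out = model.generate(
            **inputs,
            max_new_tokens=5,
            do_sample=False,
            output_scores=True,
            return_dict_in_generate=True,
        )

    # Per-step log-probabilities of the chosen tokens: log P(token_t | tokens_<t).
    # These describe the sampling process, not the truth of the resulting claim,
    # and nothing in the generated text has access to them.
    gen_tokens = out.sequences[0, inputs["input_ids"].shape[1]:]
    for score, tok_id in zip(out.scores, gen_tokens):
        logprob = torch.log_softmax(score[0], dim=-1)[tok_id]
        print(tok.decode(tok_id), float(logprob))

A confidently stated wrong answer can come out of that loop with very high per-token log-probabilities, which is exactly why they shouldn’t be read as answer-level confidence.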