LLMs are non-deterministic by nature.

richk449 · 2024-12-28T21:24:34 1735421074

Is this really true? The linear algebra is deterministic, although maybe there is some chaotic behavior in the floating-point handling. The non-deterministic part mostly comes from intentionally added randomness, which can be turned off, right?

Maybe the argument is that if you turn off the randomness you don’t have an LLM-like result any more?

hansvm · 2024-12-29T06:09:11 1735452551

Floats are deterministic too (this winds up being helpful if you want to do something like test an algorithm on every single float); you just might get different deterministic outcomes on different compilation targets, or when intermediate values are computed across threads and combined in a different order.
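A minimal sketch of that point, assuming the "different outcomes" come from the order in which intermediate values get combined (my reading, not something stated above). Each grouping is perfectly repeatable, but the two groupings disagree because float addition is not associative:

```python
# Same three numbers, two groupings. A threaded reduction may pick either
# grouping depending on scheduling; a single-threaded loop always picks one.
left_to_right = (0.1 + 0.2) + 0.3
right_to_left = 0.1 + (0.2 + 0.3)

# Each result is deterministic: recomputing it gives the identical bits.
repeat = (0.1 + 0.2) + 0.3

print(left_to_right == repeat)         # True
print(left_to_right == right_to_left)  # False
```

So "deterministic" and "same answer everywhere" are separate properties; you only get the second if the evaluation order is pinned down too.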

The argument is, as you suggest, that without randomness you don't have an LLM-like result any more. You _can_ use the most likely token every time, or beam search, or any number of other strategies to try to tease out an answer. Doing so gives you a completely different result distribution, and it's not even guaranteed to give a "likely" output. Imagine, e.g., a string of tokens where the greedy choice at every step is only 10% likely, vs a different string where the first token is 9% likely and the remainder are 90% each. With a 10-token answer the second option is roughly 349 million times more likely under random sampling (0.09 × 0.9^9 vs 0.1^10), but it will never happen with a simple greedy strategy, and you can tweak the example slightly to keep beam search and similar from finding good results.
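For anyone who wants to check the arithmetic in that example, here is a small sketch. The token names and the two-entry first-step table are made up for illustration; only the 10%, 9%, and 90% figures come from the example above.

```python
LENGTH = 10  # tokens in the answer

# Path the greedy decoder follows: the top choice at every step, each only 10% likely.
p_greedy_path = 0.10 ** LENGTH

# Alternative path: a 9% first token (so greedy skips it), then 90% for each of the rest.
p_other_path = 0.09 * 0.90 ** (LENGTH - 1)

# How much more often random sampling produces the alternative path.
ratio = p_other_path / p_greedy_path  # roughly 3.49e8

# Greedy decoding only looks at the first step, where 10% beats 9%.
first_step = {"greedy_first_token": 0.10, "other_first_token": 0.09}  # hypothetical names
greedy_pick = max(first_step, key=first_step.get)

print(f"{ratio:.3e}", greedy_pick)
```

The ratio lands in the hundreds of millions, yet the greedy pick at step one already rules the better path out.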

That brings up an interesting UI/UX question.

Suppose (as a simplified example) that you have a simple yes/no question and only know the answer probabilistically, something like "will it rain tomorrow" with an appropriate answer being "yes" 60% of the time and "no" 40%. Do you try to lengthen the answer to include that uncertainty? Do you respond "yes" always? 60% of the time? To 60% of the users and then deterministically for a period of time for each user to prevent flip-flopping answers?
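The last option there (60% of users get "yes", and each user's answer stays put for a while) could be sketched roughly like this. The function name, the one-day window, and the use of SHA-256 as the bucketing hash are all assumptions for illustration, not anything a real product is known to do:

```python
import hashlib

def sticky_answer(user_id: str, epoch_seconds: int,
                  p_yes: float = 0.6, window_seconds: int = 86400) -> str:
    """Hypothetical helper: answer "yes" for about p_yes of users,
    holding each user's answer fixed within one time window."""
    window = epoch_seconds // window_seconds
    digest = hashlib.sha256(f"{user_id}:{window}".encode()).digest()
    # Map the first 8 bytes of the hash to a number in [0, 1).
    u = int.from_bytes(digest[:8], "big") / 2**64
    return "yes" if u < p_yes else "no"

# Same user, same day: the answer never flip-flops.
now = 1735452551
same = sticky_answer("alice", now) == sticky_answer("alice", now + 60)

# Across many users the share of "yes" is close to 60%.
answers = [sticky_answer(f"user{i}", now) for i in range(10_000)]
yes_share = answers.count("yes") / len(answers)
print(same, round(yes_share, 2))
```

This keeps the population-level distribution while making each individual experience deterministic, which is one possible answer to the question, not necessarily the right one.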

The LD50 question is just a more complicated version of that conundrum. The model isn't quite sure. The question forces its hand a bit in terms of the classes of answers. What should its result distribution be?

chimpanzee · 2024-12-28T19:30:52 1735414252

Yes, that’s the main issue, as ideally they wouldn’t be non-deterministic on well-established quantitative facts.

griomnib · 2024-12-28T19:37:06 1735414626

But they can never be. RAG gets you somewhere, but it’s still a pile of RNGs under a trenchcoat.

chimpanzee · 2024-12-28T20:17:25 1735417045

>> ideally

griomnib · 2024-12-28T21:57:59 1735423079

It’s just not possible. You can do a lot with nondeterministic systems and they have value, but it’s apples and oranges compared with deterministic ones. The two need to coexist.

chimpanzee · 2024-12-28T22:42:48 1735425768

ideal (def. #2) = Existing only in the mind; conceptual, imaginary

https://en.m.wiktionary.org/wiki/ideal

(We’re allowed to imagine the impossible.)

griomnib · 2024-12-28T23:55:29 1735430129

Fair, I am loath to take away your dreams!