
They do a phenomenal job of guessing the next word, and our language is redundant enough that that alone, carried out recursively, can produce quite interesting results (there's a toy sketch of that loop after the exchange below). But reasoning? I'm certain everybody has gotten into this pattern, because it happens on pretty much anything where the LLM doesn't answer right on the first shot:

---

LLM: The answer is A.

Me: That's wrong. Try again.

LLM: Oh I'm sorry, you're completely right. The answer is B.

Me: That's wrong. Try again.

LLM: Oh I'm sorry, you're completely right. The answer is A.

Me: Time to short NVDA.

LLM: As an AI language model without real-time market data or the ability to predict future stock movements, I can't advise on whether it's an appropriate time to short NVIDIA or any other stock.

---
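(Purely for illustration, here is a toy version of that "guess the next word, carried out recursively" loop. GPT-2 and greedy decoding are arbitrary choices on my part, not a claim about how any production model is actually served.)

    # Toy autoregressive loop: repeatedly predict the most likely next token
    # and append it to the prompt. GPT-2 and greedy decoding are illustrative.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")
    model.eval()

    input_ids = tokenizer("The answer is", return_tensors="pt").input_ids

    with torch.no_grad():
        for _ in range(20):                      # generate 20 tokens, one at a time
            logits = model(input_ids).logits     # scores for every vocabulary token
            next_id = logits[0, -1].argmax()     # greedily pick the most likely next token
            input_ids = torch.cat([input_ids, next_id.view(1, 1)], dim=1)

    print(tokenizer.decode(input_ids[0]))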




Guess you have not tried more advanced prompting techniques like CoT, Agents and RAG.
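(For anyone unfamiliar, zero-shot CoT is not much more than asking for the reasoning before the answer. A minimal sketch with the OpenAI Python client; the model name and prompt wording are my own illustrative picks, not what anyone above actually ran:)

    # Minimal zero-shot chain-of-thought (CoT) prompt: ask for step-by-step
    # reasoning before the final answer, rather than the answer alone.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    question = ("A bat and a ball cost $1.10 in total. The bat costs $1.00 "
                "more than the ball. How much does the ball cost?")

    response = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative model choice
        messages=[
            {"role": "user",
             "content": question + "\nLet's think step by step, then state the final answer."},
        ],
    )
    print(response.choices[0].message.content)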


Those are buzzwords, not advanced prompting techniques.


What are you even on about? This sentence has absolutely zero value for the discussion at hand.


Yeah, if an LLM were truly capable of reasoning, then whenever it makes a mistake, e.g. due to randomness or a lack of knowledge, pointing out the mistake and giving it steps to correct it should result in basically a 100% success rate, since the person prompting it has effectively unlimited capacity to accommodate the LLM's weaknesses.
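(Something like the loop below is what I mean. The model name and the is_correct() checker are hypothetical stand-ins; in the transcript above, the "checker" is the human.)

    # Sketch of a "point out the mistake and retry" loop with bounded attempts.
    # Model name and is_correct() are hypothetical stand-ins, not a real setup.
    from openai import OpenAI

    client = OpenAI()

    def is_correct(answer: str) -> bool:
        # Hypothetical oracle; in practice this is the human reading the output.
        return "B" in answer

    messages = [{"role": "user", "content": "Is the answer A or B? Explain briefly."}]
    for _ in range(5):                                   # bounded retries
        reply = client.chat.completions.create(model="gpt-4o-mini", messages=messages)
        answer = reply.choices[0].message.content or ""
        if is_correct(answer):
            break
        messages.append({"role": "assistant", "content": answer})
        messages.append({"role": "user",
                         "content": "That's wrong. Check your reasoning step by step and try again."})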

When you look at things like https://arxiv.org/abs/2408.06195 you notice that the number of tokens needed to solve trivial tasks is somewhat ridiculous: on the order of 300k tokens for a simple grade-school problem. That is roughly three hours at a rate of 30 tokens/s. You could fill 400 pages of a book with that many tokens.
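Back-of-the-envelope, those figures are consistent; the tokens-per-page value below is an assumption I inferred from the 400-page estimate:

    # Quick check of the numbers above. tokens_per_page is an assumption
    # (the 400-page figure works out to roughly 750 tokens per page).
    tokens = 300_000            # tokens reported for one grade-school problem
    tokens_per_second = 30
    tokens_per_page = 750       # assumed; denser layouts give fewer pages

    hours = tokens / tokens_per_second / 3600
    pages = tokens / tokens_per_page
    print(f"~{hours:.1f} hours of generation, ~{pages:.0f} book pages")
    # -> ~2.8 hours of generation, ~400 book pages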




