Once we train models on chain-of-thought outputs, next-token prediction can act as a heuristic for the halting problem (e.g., recognizing that this chain of thinking matches some other chain of thinking seen before). It doesn't decide the problem, it pattern-matches around it.
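A toy sketch of what I mean, with made-up data: instead of analyzing a computation, you look up the most similar remembered reasoning chain and reuse its outcome. Everything here (`SEEN_CHAINS`, `guess_halts`) is hypothetical, just to make the pattern-matching idea concrete.

```python
from difflib import SequenceMatcher

# Hypothetical memory of (chain-of-thought, halted?) pairs picked up in training.
SEEN_CHAINS = [
    ("subtract 1 until zero", True),
    ("add 1 forever", False),
]

def guess_halts(chain: str) -> bool:
    """Guess halting by similarity to the most similar remembered chain."""
    best = max(
        SEEN_CHAINS,
        key=lambda pair: SequenceMatcher(None, chain, pair[0]).ratio(),
    )
    return best[1]

print(guess_halts("subtract 2 until zero"))  # True: matches a halting pattern
```

Note the lookup never runs the computation; it just bets the new chain behaves like the closest old one, which is exactly why it can be fast and wrong at the same time.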
I think that is how human brains work. When we practice, at first we have to be deliberate (thinking slow). Then we “learn” from our own experience and it becomes muscle memory (thinking fast). Of course, that shortcut increases the odds we are wrong.