That is a terrible headline. It implies that the _abilities_ are a mirage, but it's actually the "emergent" part that might be a mirage -- which is to say it's not an unpredictable "phase transition" but gradual and predictable improvement in ability as the models scale.
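To see why that distinction matters, here's a toy sketch (my own illustration, not from the study) of how a perfectly smooth underlying ability can look like a sudden phase transition purely because of how it's measured: if per-token accuracy improves gradually with scale, an all-or-nothing exact-match metric over a multi-token answer sits near zero and then appears to jump.

```python
# Toy illustration (mine, not from the study): a smoothly improving
# per-token accuracy looks like sudden "emergence" under an
# all-or-nothing exact-match metric over a 10-token answer.
for scale in range(1, 11):
    per_token = scale / 10          # assume accuracy grows linearly with scale
    exact_match = per_token ** 10   # all 10 tokens must be correct
    bar = "#" * round(exact_match * 40)
    print(f"scale={scale:2d}  per-token={per_token:.1f}  "
          f"exact-match={exact_match:.4f}  {bar}")
```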
Not only that, but what's the point in evaluating a language model on its ability to do arithmetic? It's like criticizing a talking dog because the C++ code it wrote was full of buffer-overflow bugs.
In reality, if you ask a good LLM to solve a math-related problem, it will write and run a Python program to return the answer. Sometimes this even works. Sometimes it returns garbage. And sometimes it realizes that its own answer isn't realistic and tries a different approach. Sometimes people claim this isn't a valid manifestation of intelligence.
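For a sense of what that looks like in practice, here's the kind of throwaway script an LLM typically emits for a math question (the prompt and numbers are hypothetical, chosen only for illustration):

```python
# Hypothetical example of LLM-generated code for the prompt
# "what is 127 * 493, and is it prime?"
n = 127 * 493
# trial division up to sqrt(n) as a primality check
is_prime = n > 1 and all(n % d for d in range(2, int(n ** 0.5) + 1))
print(n, "prime" if is_prime else "not prime")  # 62611 not prime
```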
Completely pointless study, unworthy of Wired or HN.
This is a study of "emergent" properties of LLMs — whether something unexpected shows up (e.g., imagine that talking dog suddenly becoming great at pointer arithmetic and never using memory after freeing it).
It has been noticed that LLMs can do some arithmetic, but we are still uncertain how much they can actually do and exactly how they do it.
Arithmetic is treated as a proxy for general reasoning abilities.
Which is stupid, because it's not. A pocket calculator can perform arithmetic, as can a Python program, or for that matter a Japanese cormorant, which counts the fish it helps you catch. None of those are considered capable of "reasoning" on their own.
Meanwhile, GPT-4 will cheerfully write a program to carry out the required arithmetic operations (and then some). A study that doesn't acknowledge that is worthless at best.
> Meanwhile, GPT-4 will cheerfully write a program to carry out the required arithmetic operations (and then some). A study that doesn't acknowledge that is worthless at best.
We don't want to see whether the LLM can be used as a tool to do arithmetic, but whether it can learn complex data relationships like arithmetic. Arithmetic is a stepping stone, not a goal, so the model solving it by invoking a calculator isn't relevant. The problems we actually want it to solve don't come with calculator-like tools, so that doesn't get us any closer.
Exactly, because as Gödel showed, arithmetic can in principle model many other formal systems, so if you can see arithmetic examples, generalize to the full set of rules, and learn to apply them consistently, then you have a powerful reasoning tool.
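To make "arithmetic can model other formal systems" concrete, here's a toy Gödel-style encoding (an arbitrary coding, for illustration only): any finite sequence of symbol codes can be packed into a single natural number via prime exponents, so claims about formulas become claims about numbers.

```python
# Toy Gödel numbering: encode a sequence of symbol codes (each >= 1)
# as one integer using prime-power exponents, then recover it by factoring.
PRIMES = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]

def encode(codes):
    n = 1
    for p, c in zip(PRIMES, codes):
        n *= p ** c
    return n

def decode(n):
    codes = []
    for p in PRIMES:
        c = 0
        while n % p == 0:
            n //= p
            c += 1
        if c == 0:      # a zero exponent marks the end of the sequence
            break
        codes.append(c)
    return codes

g = encode([3, 1, 4])    # 2**3 * 3**1 * 5**4 = 15000
print(g, decode(g))      # 15000 [3, 1, 4]
```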
These models are clearly capable of doing this. There is no theoretical reason why you should expect them to fail at this. One day they will be able to do this perfectly, and nobody will get the silly idea of generating a program to do it anymore. There is no need for another bitter lesson where "clever" AI researchers and engineers waste their careers adding a hundred different workarounds to these minor problems.
I don't know... the ability to write code to solve an otherwise ill-suited problem seems pretty general to me. It seems like a big step in a concrete direction, as opposed to a lot of Goedelian navel-gazing about arithmetic and Peano axioms and whatnot.
Agreed that generalized architectures will ultimately win out over hand-tweaked ones. But the patent wars that will eventually be fought over this stuff are where the real bitter lessons will come into play. At some point, we'll be forced back into the hand-optimization business because someone like OpenAI (or another Microsoft proxy) will have locked down a lot of powerful basic techniques.
And “emergent” in this context usually means “behaviour resulting from the interaction of a large number of individually simple units”, not “suddenly appearing.”