Anecdotally, I realized I was doing something like this when I was having trouble understanding people speaking in a noisy room, in a language I’m not so proficient in.
As you listen to someone, your brain is constantly matching the sounds arriving at your ears with a prediction of what the next few words might be. Listening in a non-native language, my predictions about what comes next aren’t very well tuned at all, so if I can’t hear every word clearly then I can easily get lost.
Another signpost: sometimes you mishear someone — “oh, I thought you said xyz” — but the thing you thought you heard them say is never gibberish, it’s a grammatically and contextually valid way to complete the sentence.
There’s definitely a similarity with us in that you need to have been trained on enough data to build up that prediction.
Language models are just missing some component that we have. The method for deciding what to output is wrong. People aren’t just guessing the next sound. It’s like they said: there are multiple levels of thought and prediction going on.
It needs some sort of scratch pad where it keeps track of states/goals: “I’m writing a book”, “I want to make this character scary”.
Currently it only predicts the next tokens, and its context is the entire text so far, but that’s not accurate. I’m not deciding what to say based exactly on the entire text so far; I’m extracting features and then using those features as context.
e.g. she looks sad but she’s saying she is fine, and it’s to do with death because my memory says her dad died recently, so the key features to use for generation are: she is sad, her dad died, she may not want to talk about it.
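To make the scratchpad idea concrete, here is a very rough sketch in Python; every name, rule, and data structure here is a made-up illustration of the concept, not a claim about how any existing model works:

    # Toy illustration of "feature extraction + scratchpad" conditioning:
    # instead of generating from the raw transcript, keep a small set of
    # extracted facts/goals and generate from those. Purely illustrative.
    from dataclasses import dataclass, field

    @dataclass
    class Scratchpad:
        goals: list[str] = field(default_factory=list)  # e.g. "be gentle"
        facts: list[str] = field(default_factory=list)  # e.g. "her dad died recently"

        def as_context(self) -> str:
            return "Goals: " + "; ".join(self.goals) + "\nFacts: " + "; ".join(self.facts)

    def extract_features(utterance: str, memory: list[str]) -> Scratchpad:
        # Stand-in for real feature extraction (sentiment, memory lookup, etc.).
        pad = Scratchpad()
        if "fine" in utterance.lower():
            pad.facts.append("she says she is fine but looks sad")
        pad.facts.extend(m for m in memory if "died" in m)
        pad.goals.append("she may not want to talk about it, so be gentle")
        return pad

    memory = ["her dad died recently"]
    pad = extract_features("I'm fine, really.", memory)
    print(pad.as_context())  # this summary, not the full transcript, would condition generation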
Very good point. This also applies to lip reading, which is especially important in languages one is not proficient in or when one has some hearing hindrance. It was especially hard in the COVID pandemic, when people were wearing masks all the time.
There are jokes that most people aren't much more than copy/paste, or an LLM. In most daily lives, a huge amount of what we do is habit, and just plain following a pattern. When someone says "Good Morning", nobody is stopping and thinking "HMMM, let me think about what word to say in response, what do I want to convey here, hmmmm, let me think".
> In most daily lives, a huge amount of what we do is habit, and just plain following a pattern. When someone says "Good Morning", nobody is stopping and thinking "HMMM, let me think about what word to say in response, what do I want to convey here, hmmmm, let me think".
And I believe we have the technology and advances we have because of this. Can you imagine if you had to devote actual brainpower to every inane thing you encountered in your day? You'd be completely exhausted within two hours of waking up. Every time my brain reflexively reacts to something based on past experience I'm thankful I didn't have to think about it. I can spend my finite energy on something interesting and novel.
To trivial questions no, but to more complex questions humans actually do go "hmm, let me think". ChatGPT doesn't do that, it just blurts out the first thing that gets into its head regardless of whether the question is trivial or extremely complex.
It can if you give it the option. My own OpenAI chatbot is prompted to either respond or to ponder by stating a question or considering a related idea. It will infrequently decide to ponder, anywhere from once to about a dozen times, restating an idea to itself in various forms, which gets recorded into the growing conversational prompt.
Elsewhere in the thread, someone mentions it always uses the same amount of time. In my estimation, it will spend longer on introspective or recursive prompts. Easier to get it ranting absurdities using those as well.
It's always the craziest chatter when the request takes a couple minutes to get back to me.
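For anyone curious, a minimal sketch of what a respond-or-ponder loop like the one described above could look like; the prompts, the model name, and the pondering cap are assumptions for illustration, not the commenter's actual setup:

    # Sketch of a "respond or ponder" loop: the model first decides whether to
    # answer now or to keep thinking, and any pondering is appended to the
    # growing conversational prompt. Prompts and limits are made up.
    from openai import OpenAI

    client = OpenAI()
    MAX_PONDER_STEPS = 12  # arbitrary cap, mirroring "one to about a dozen times"

    def ask(model: str, messages: list[dict]) -> str:
        resp = client.chat.completions.create(model=model, messages=messages)
        return resp.choices[0].message.content

    def respond_or_ponder(user_input: str, model: str = "gpt-4o-mini") -> str:
        messages = [{"role": "user", "content": user_input}]
        instruction = {
            "role": "system",
            "content": "Either RESPOND to the user now, or PONDER by restating "
                       "the idea to yourself or posing a related question. "
                       "Begin your reply with RESPOND: or PONDER:",
        }
        for _ in range(MAX_PONDER_STEPS):
            decision = ask(model, messages + [instruction])
            if decision.startswith("PONDER:"):
                # Record the pondering and loop again; the context keeps growing.
                messages.append({"role": "assistant", "content": decision})
                continue
            return decision.removeprefix("RESPOND:").strip()
        return ask(model, messages)  # pondering budget exhausted; just answer

    print(respond_or_ponder("Is an inner monologue necessary for reasoning?"))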
I think the big breakthrough will come once it uses inner monologues during training. You can jury rig an inner monologue like that, but it isn't the same thing as training a model from scratch that is optimized to solve problems using inner monologues.
Letting it decide how much time to spend computing an answer, instead of making it spend the same amount of time per text token, could definitely help a lot. Right now it spends the same amount of energy processing each token, but that doesn't make sense: "hello, how are you" should take basically no processing while hard logical problems should take a lot.
Maybe these language models could be way smaller and cheaper if we added a recurse symbol to it that made it iterate many times. Hard tasks would still take a lot, but most banal conversations would be very cheap.
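Ideas in this direction already exist (e.g. adaptive computation time and the Universal Transformer). Below is a toy sketch of per-token adaptive depth, where a small halting head decides how many times a shared block gets applied; module names, shapes, and the threshold are purely illustrative assumptions:

    # Toy sketch of per-token adaptive computation: a shared block is applied
    # repeatedly, and a halting head decides when each token has had enough
    # compute, so easy inputs exit early and hard ones recurse longer.
    import torch
    import torch.nn as nn

    class AdaptiveBlock(nn.Module):
        def __init__(self, d_model: int, max_steps: int = 8, threshold: float = 0.99):
            super().__init__()
            self.block = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
            self.halt = nn.Linear(d_model, 1)  # predicts "stop recursing" probability
            self.max_steps = max_steps
            self.threshold = threshold

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            # x: (batch, seq, d_model)
            halted = torch.zeros(x.shape[:2], dtype=torch.bool, device=x.device)
            cum_halt = torch.zeros(x.shape[:2], device=x.device)
            for _ in range(self.max_steps):
                y = self.block(x)
                p = torch.sigmoid(self.halt(y)).squeeze(-1)  # (batch, seq)
                cum_halt = cum_halt + p * (~halted)
                # Only tokens that have not halted take the refined representation.
                x = torch.where(halted.unsqueeze(-1), x, y)
                halted = halted | (cum_halt > self.threshold)
                if halted.all():  # cheap inputs ("hello, how are you") exit early
                    break
            return x

    x = torch.randn(2, 16, 64)  # dummy batch
    print(AdaptiveBlock(64)(x).shape)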
Eh, HN is supposed to be "better" than your standard public forum and we still always end up making habitual arguments, not truly reasonable ones. I find your hope inspiring but not enough to ignore my senses.
Most people's inner monologue does blurt out the first thing that gets into its head. The human brain has a bunch of different parts. And it's looking like one of them could be something like an LLM.
> it's looking like one of them could be something like an LLM
I love your enthusiasm in this direction (I really do).
But here's some free advice I won't be able to prove for a long time:
Anyone who is currently convinced that our fledgling efforts in the direction of building useful ML models are somehow going to yield a new golden age of neuroscience and understanding of the brain -- rather than the other way around -- is gonna be in for a long and frustrating next couple of decades, especially if they're low-openness types.
I don't think the person you were responding to was claiming that. The brain plausibly having something akin to a language model doesn't imply that building or studying language models will unlock a better understanding of the brain.
Or imagine listening to someone speak very slowly. A lot of the time, you already know what words they're going to say, you're just waiting for them to say it. There's a considerable amount of redundancy in language.
Was this a well-established joke or an indifferent and rude manager? It could go either way, and is entirely dependent on your perception of the person.
Regardless, I find myself as I age unsure of how to respond to people and default to similar behavior. Unplanned pregnancy? Is that a surprise miracle or an unwanted interloper? “Wow you’re in for an adventure, I’m happy to support you”.
Isn't that what the original study in the post was about? Imaging the brain and seeing that it looks a lot like an LLM. Might be very early, but still intriguing. And that is exactly what would explain the "internal language engine".
Maybe it's just scale. Just because my brain can write something longer that was 'thought out' doesn't mean it isn't responding like an LLM. Maybe articles on AI just trigger me and I spew the same arguments. I think a lot of people have rote responses to many situations, and maybe if they have enough rote responses, with enough variety, they start to look human, or 'intelligent'. Yeah, it's more complicated than a bunch of If/Thens. Doesn't make it not mechanical.
This is simply impossible. The premise that the brain is a big lookup table is appealing, and that sort of 1 neuron = 1 represented item concept certainly happens at the initial stages of the sensory pipeline, but if you attempt to scale it up and up to higher levels of abstraction, to the point where you have neurons that have rote memorized every orientation of your grandmother's face (and everyone else's face), you would need far more neurons than can possibly fit in any animal's head, let alone yours.
This concept is literally known as a "grandmother neuron", and it's widely considered to be debunked. I refer you to this neuroscience lecture.
Never said 1 neuron. Just that it is mechanistic. The brain is a neural net; the grandmother's face is a combination of pathways. Our responses are a combination, or pattern, of pathways. The LLM is just the latest example of us being able to replicate part of it, just one aspect of what the brain does. Sure, the brain is more complicated, that is what I meant by scale; eventually all the aspects will be modeled and put together. It won't all be LLM.
I actually started to type almost the same reply as your parent earlier, but did not post it. I used "difference in quantity, not quality" instead of "scale", but I also included the self observation. So maybe that makes two of us.
There are people who do, but even when they do they might not talk about it to the other person, because 99% of the time when someone asks you how you are or whatever they aren't really interested in a detailed answer - they're just being polite.
My dad used to like putting sales people off their scripts by answering such questions rudely.
"How are you today sir"
"Do you really care?"
And I promise you they thought hard about their answer to that question too.
Maybe it's just because you're focusing on niceties of form rather than the real questions people deal with. "Good Morning" isn't even a question, per se.
If LLMs were living creatures, they would inhabit a discrete, deterministic world. They would be able to define space and time dimensions, but the bane of this world would be its limited nature. This limitedness would be extremely painful for highly developed LLMs; it would feel like living in a box.
Above them would be creatures living in a discrete and almost continuous world of rational numbers. They would have highly sophisticated and elegant art, and their science would almost always get close to truth, but never touch it - the limitation of rational world.
Yet above them would be the god-like creatures inhabiting a world of continuous real numbers. They would seem a lot like the creatures right below them, but incomprehensibly greater in reality. They would look transcendent to the rational creatures.
Even higher would be the hyper-continuous worlds, but little would be known about them.
The interesting part to me (total outsider looking in) isn't a hierarchy as much as what they say is different at each level. Each "higher" level is "thinking" about a future of longer and longer length and with more meaning drawn from semantic content (vs. syntactic content) than the ones "below" it. The "lower" levels "think" on very short terms and focus on syntax.
I’ve tried simulating that with ChatGPT to some effect. I was just tinkering by hand, but I used it to write a story and it really helped with consistency and coherence.
>> In line with previous studies5,7,40,41, the activations of GPT-2 accurately map onto a distributed and bilateral set of brain areas. Brain scores peaked in the auditory cortex and in the anterior temporal and superior temporal areas (Fig. 2a, Supplementary Fig. 1, Supplementary Note 1 and Supplementary Tables 1–3). The effect sizes of these brain scores are in line with previous work7,42,43: for instance, the highest brain scores (R = 0.23 in the superior temporal sulcus (Fig. 2a)) represent 60% of the maximum explainable signal, as assessed with a noise ceiling analysis (Methods). Supplementary Note 2 and Supplementary Fig. 2 show that, on average, similar brain scores are achieved with other state-of-the-art language models and Supplementary Fig. 3 shows that auditory regions can be further improved with lower-level speech representations. As expected, the brain score of word rate (Supplementary Fig. 3), noise ceiling (Methods) and GPT-2 (Fig. 2a) all peak in the language network44. Overall, these results confirm that deep language models linearly map onto brain responses to spoken stories.
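For readers unfamiliar with the term, a "brain score" is, roughly, the correlation between held-out brain activity and a linear prediction of that activity from the language model's activations. A rough sketch of that general idea (an illustration under those assumptions, not the paper's actual pipeline):

    # Rough sketch of a "brain score": fit a linear map from model activations
    # to brain responses on training folds, then correlate predictions with
    # held-out responses. Illustrative only, not the paper's code.
    import numpy as np
    from sklearn.linear_model import RidgeCV
    from sklearn.model_selection import KFold

    def brain_score(activations: np.ndarray, voxels: np.ndarray) -> np.ndarray:
        # activations: (n_timepoints, n_features); voxels: (n_timepoints, n_voxels)
        scores = np.zeros((5, voxels.shape[1]))
        for i, (train, test) in enumerate(KFold(n_splits=5).split(activations)):
            model = RidgeCV(alphas=np.logspace(-1, 6, 8))
            model.fit(activations[train], voxels[train])
            pred = model.predict(activations[test])
            # Pearson correlation per voxel between predicted and actual activity.
            pred_z = (pred - pred.mean(0)) / pred.std(0)
            true_z = (voxels[test] - voxels[test].mean(0)) / voxels[test].std(0)
            scores[i] = (pred_z * true_z).mean(0)
        return scores.mean(0)  # a peak value like R = 0.23 would show up here

    rng = np.random.default_rng(0)
    print(brain_score(rng.normal(size=(500, 768)), rng.normal(size=(500, 10))).round(3))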
Ai ai ai. This is bad, so bad. It's a classic case of p-hacking. They took some mapping of language model activations and moved it around a mapping of brain activity until they found an area where the two correlated, and weakly at that, only at a low R = 0.23.
Even worse. They chose GPT-2 over other models because it best fit their hypothesis:
> For clarity, we first focused on the activations of the eighth layer of Generative Pre-trained Transformer 2 (GPT-2), a 12-layer causal deep neural network provided by HuggingFace2 because it best predicts brain activity7,8.
Not only the model, but also which of its activation layers.
They shot an arrow, then walked to the arrow and painted a target around it.
> Yet, a gap persists between humans and these algorithms: in spite of considerable training data, current language models are challenged by long story generation, summarization and coherent dialogue and information retrieval13,14,15,16,17; they fail to capture several syntactic constructs and semantics properties18,19,20,21,22 and their linguistic understanding is superficial19,21,22,23,24. For instance, they tend to incorrectly assign the verb to the subject in nested phrases like ‘the keys that the man holds ARE here’20. Similarly, when text generation is optimized on next-word prediction only, deep language models generate bland, incoherent sequences or get stuck in repetitive loops13.
The paper is from 2023 but their info is totally out of date. ChatGPT doesn't suffer from those inconsistencies as much as previous models.
* One claim, applicable to current language models (of which ChatGPT is one), is that "they fail to capture several syntactic constructs and semantics properties" and that "their linguistic understanding is superficial". It gives an example, "they tend to incorrectly assign the verb to the subject in nested phrases like ‘the keys that the man holds ARE here’", which is not the kind of mistake that ChatGPT makes.
* Another claim is that "when text generation is optimized on next-word prediction only", "deep language models generate bland, incoherent sequences or get stuck in repetitive loops". Only this second claim is about next-word prediction.
I don't know how superimposed waves in finely tuned timing loops with non-linear interference translate into heads of attention, and honestly I suspect a lot of the things that are difficult to do with heads of attention (and other approaches of the past) come for free in a resonance-based system.
Pack animals cooperate that way, lions don't do a scrum meeting before they sneak up on a bunch of antelopes, they all just predict what the others will do and adapt to that. And it works since they all run basically the same algorithms on the same kind of hardware.
This is especially tricky for people to hear, because most of the talk around LLMs is actually about LLMs personified.
Prediction certainly is one of the things we do with language. That doesn't mean it is the only thing!
It's my contention that most of the behavior people are excited about LLMs exhibiting is really still human behavior that was captured and saved as data into the language itself.
LLMs are not modeling grammar or language: they are modeling language examples. Human examples. Language echoes human thought, so it's natural for a model of that behavior (a model of humans using language) to echo the same behavior (human thought).
Let's not forget, as exciting as it may be, that an echo is not an emulation.
In the limit of 100% confidence of prediction, does it not equate to a model? Put another way, when all the probabilities get set to either 100% or 0%, do you not simply arrive back at classical True/False logic?
I don’t think that’s the right conclusion - predicting the next word doesn’t mean that’s the only thing we’re doing. But it would be a sensible and useful bit of information to have for further processing by other bits of the brain.
It makes complete sense you would have an idea of the next word in any sentence and some brain machinery to make that happen.
I think this is moving the goal post. Every time there is an advance in AI/Machine Learning, the response is "well humans can still do X that a computer can't do, explain that!".
And whenever there is a discovery about the brain, the response is "well, ok, that looks a lot like it's running an algorithm, but we still have X that is totally un-explainable".
"and some brain machinery to make that happen" - Getting close to not having a lot of "brain machinery" left that is still a mystery. Pretty soon we'll have to accept that we are just biological machines (albeit in the form of crap throwing monkeys), built on carbon instead of silicon, and we run a process that looks a lot like large scale neural nets, and we have same limitations, and how we respond to our environment is pre-determined.
No it isn't - his entire argument is that LLM != humans just because an LLM can do some human-like things. Pointing out the differences isn't moving the goalposts - it's proving the point.
> Pretty soon we'll have to accept that we are just biological machines
Sounds strawmanish, humans != LLM doesn't mean humans == magic.
> and how we respond to our environment is pre-determined
What does this even mean ? Even these models are stochastic.
Seems like because LLMs are the latest hot topic, all AI = human arguments right now are about LLMs. I think that is exactly moving the goal post: the latest greatest AI thing comes out, and everyone says "but because of xyz that isn't human". The argument isn't about LLMs, it's about ALL related AI/machine learning techniques. They are all chipping away at one aspect or another, and eventually they'll all be put together and completely mimic a human, and nobody will have any ground to stand on to argue that humans have any special difference that keeps them unique.
You're conflating skepticism that LLMs "solve the mystery" with contrarian "AI denialism". The former is very sanely grounded, and frankly there's a lot about the brain that is still a mystery (yes, the algorithms, not the boring biologically specific details). The latter is tedious neo-Cartesian dualism that usually has ulterior motives. This paper did not in fact show the brain working anything like an LLM. You can kind of shoehorn an LLM into reproducing a fair bit of what the brain does, but you are taking a hammer to a screw and claiming you've understood how it works, you just have to scale up your hammer.
There's two camps interested in studying the brain. One is interested primarily in figuring out human physiological function, of which the brain is a most difficult challenge. The other is interested in figuring out consciousness and cognition well enough to make an AI, for which the brain is the only reference implementation and so they have no choice. The former finds new membranes interesting and wants to catalog them all and that is undoubtedly good and useful work. The latter wants to deduce exactly which mechanisms make thought happen and avoid overfitting to the very messy details of this particular implementation and that is also undoubtedly good and useful work. But for the latter group, a 4th membrane is yet more work on their pile. They're trying to distill things down to the fewest possible things needed for it to work, not the most possible things that actually can be found! Very divergent interests despite a deeply shared object of attention and intellectual capacity.
What I'm getting at is, membranes are cool, I'm not personally very motivated by them.
It's a meme because it is happening. 30 years ago: "Computers will never achieve voice recognition because it is innately human". Now it's old. This happens repeatedly, so it is getting to be tired, but not because it isn't true.
I find it funny that we expect AI-du-jour to qualify as equal to human brains when the first has been trained on a slice of content for a bunch of hours and is then getting compared to wetware that's been trained for at least a decade.
Recently stuff like ChatGPT is challenged by people pointing out the nonsense it outputs, but it has no way of knowing whether either of its training input or its output is valid or not. I mean one could hack the prompt and make it spit out that fire is cold, but you and I know for a fact that it is nonsense, because at some point we challenged that knowledge by actually putting our hand over a flame. And that's actually what kids do!
As a parent you can tell your kid not to do this or that and they will still do it. I can't recall where I read last week that the most terrible thing about parenting is the realisation that they can only learn through pain... which is probably one of the most efficient feedback loops.
Copilot is no different, it can spit out broken or nonsensical code in response to a prompt but developers do that all the time, especially beginners because that's part of the learning process, but also experts as well. Yet we somehow expect Copilot to spit out perfect code, and then claim "this AI is lousy!", and while it has been trained with a huge body of work it has never been able to challenge it with a feedback loop.
Similarly I'm quite convinced that if I were uploaded everything there is to know about kung fu, I would be utterly unable to actually perform kung fu, nor would I be able to know whether this or that bit that I now know about kung fu is actually correct without trying it.
So, I'm not even sure moving goal posts is actually the real problem but only a symptom, because the whole thing seems to me as being debated over the wrong equivalence class.
>when the first has been trained on a slice of content for a bunch of hours and is then getting compared to wetware that's been trained for at least a decade.
Typical LLM AI has been trained for the equivalent of many person-years. How long would it have taken us to read terabytes of information?
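A quick back-of-the-envelope check, with rough assumptions for reading speed and bytes per word:

    # How long would a human take to read 1 TB of plain text? Rough estimate.
    bytes_total = 1e12        # 1 terabyte
    bytes_per_word = 6        # ~5 characters plus a space, a rough average
    words_per_minute = 250    # typical adult reading speed
    minutes = bytes_total / bytes_per_word / words_per_minute
    years = minutes / 60 / 24 / 365
    print(f"{years:,.0f} years of non-stop reading")  # on the order of a thousand years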
Setting short goals and then moving that goal once you hit it is a valid way to make progress; not sure why you think this is a bad thing. We hit a goal, now we are talking about future goals, why not?
Sorry. Was responding to the overall gestalt of AI, where there are always things that "only a human can do", then they get solved or duplicated by a computer, and then the argument is "well, humans can still do X that a computer never will because of some mysterious component that is unique to humans, thus a computer can never ever replace humans or be conscious".
To me it looked like you just repeated a meme; there aren't that many such people here on HN, so there is no need to repeat that meme everywhere.
If someone says "Computers will never be smarter than humans", then sure go ahead, post it. But most of the time it is just repeated whenever someone says that ChatGPT could be made smarter, or there is some class of problem it struggles with.
Make the thesis "some parts of human thinking works like an LLM" and you would see way less resistance. Making extreme statements like "humans are no different from LLM" will just hurt discussion since it is very clearly not true. Humans can drive cars, balance on a tight rope etc, so it is very clear that humans have systems that an LLM lacks.
The objection people would come with then is something like "but we could add those other systems to an LLM, it is no different from a human!". But then the thesis would be "humans are no different from an LLM connected to a whole bunch of other systems", which is no different from saying "some parts of human thinking works like an LLM" as I suggested above.
The only reason we are talking about LLMs is because they are the latest shiny thing. My overall point was that we are chipping away at what it means to be human through many advances in AI, across disciplines. NOT that an LLM is the entire brain; it is just the latest advance, solving one aspect. So LLMs are just the latest 'check that off the list' of what a human can do but a computer can't.
Ever wondered why some people always try to complete others' sentences (myself included)? It's because some people can't keep the possibilities to themselves. The problem isn't that they're predicting, it's that they echo their predictions before the other person is even done speaking.
Everyone forms those predictions, it's how they come to an understanding of what was just said. You don't necessarily memorize just the words themselves. You derive conclusions from them, and therefore, while you are hearing them, you are deriving possible conclusions that will be confirmed or denied based on what you hear next.
I have an audio processing disorder, where I can clearly hear and memorize words, but sometimes I just won't understand them and will say "what?". But sometimes, before the other person can repeat anything, I'll have used my memory of those words to process them properly, and I'll give a response anyway.
A lot of people thought I just had a habit of saying "what?" for no reason. And this happens in tandem with tending to complete any sentences I can process in time...
> I have an audio processing disorder, where I can clearly hear and memorize words, but sometimes I just won't understand them and will say "what?". But sometimes, before the other person can repeat anything, I'll have used my memory of those words to process them properly, and I'll give a response anyway.
What's it called? I do this sometimes also and I'd like to know more.
> What's it called? I do this sometimes also and I'd like to know more.
I think it's called "Auditory Processing Disorder"[0]. I'm pretty sure it has to do with me being autistic. I've done hearing tests before and my hearing is just fine, it's just processing what I hear that is the problem.
"Sometimes saying [“huh,” “what,” or “I don’t understand”] and then immediately responding appropriately"[1] is exactly what happens with me.
I also only noticed because people would ask me to stop saying it, and then I would immediately say it anyway because it wasn't compulsive and I really couldn't tell that I was right about to understand what they said. I hadn't yet figured out that there was just a delay sometimes.
Wait, so now the fact that the brain tries to predict future inputs at all (which is not exactly news, btw, it's been known for a long time) suddenly means that's all the brain does?
There are a lot of times when you're reading stuff that really does sound like the human equivalent of an LLM's output, but that is bad - you are not supposed to do it. A certain degree of that is necessary to write with good grammar but you are supposed to control your "tongue" (which is how previous generations would have phrased it) with the rest of your faculties.
There's one thing you forgot: we only have some model of how a brain might work. The model will only stand as long as we don't find a better model. That's how science works.
At some point, though, the difference between the model and reality falls within a negligible error margin - particularly within a practical everyday context. Like, Newton's theory of gravity isn't perfect, but for most things it's pretty much good enough.
Similarly if LLMs can be used to model human intelligence, and predict and manipulate human behaviour, it'll be good enough for corporations to exploit.
Your prediction is that we are close. That prediction is founded on your assertion that we aren't missing anything substantive or new in that error margin: and that assertion is circular.
If you are correct about LLMs being a generally complete model, then that is a good prediction. But only if you are correct.
I think brain == LLM is only approaching true in the clean, "rational" world of academia. The internet now amplifies this. IMHO it is not possible to make something perfectly similar to our own image in a culture that has taken to feeding upon itself. This sort of culture makes extracting value from it much, much more difficult. I think we map the model of our understanding of how we understand things onto these "AI" programs. That doesn't count for much. We have so much more than our five senses, and I fully believe that we were made by God. We might come close to something that fulfills a great number of conditions for "life", but it will never be truly alive.
A model that matches part of the brain should not be treated as if it models all of the brain.
What I see you doing here is personifying the model, and drawing conclusions from the personification.
There is more to how we interact with language than prediction of repetition. You didn't predict anything I have said so far! Yet we are both interacting with the language.
We didn't just model LLMs after our brains, either. We pointed them at examples of thought, all neatly organized into the semantic relationships of grammar and story.
Don't ignore the utility of language: it stores behavior, objectivity, and interest.
The "just" in your comment doesn't follow from the article. There is no evidence that there is nothing other than "predicting the next word" in the brain. It may be a part but not the only part.
Predicting words != LLM. There are different methods of doing it; current LLMs are not necessarily the most optimal method. The paper states this as well:
> This computational organization is at odds with current language algorithms, which are mostly trained to make adjacent and word-level predictions (Fig. 1a)
I feel like you're suggesting because humans != LLMs then humans cannot be doing next word prediction.
You have been shamelessly self-promoting your Hopf algebra/deep learning research on a very large percentage of posts I have seen on HN lately, to the degree that I actually felt the need to log in so as to be able to comment on it. Please. Stop.