Hacker News new | past | comments | ask | show | jobs | submit login

I'm not so sure about the simulation, but Ray's right that the brain cannot be more complex than the data that specifies it, speaking information-theoretically. A biologist probably wouldn't get this, because he has too much knowledge of how difficult and complex the actual translation is. It's a mathematical idea, like non-constructive proofs, which freak out sensible folk - and rightfully so. (EDIT no offence intended, they freak me out too)

The only other source of information is the non-genomic environment - extra-nucleur DNA like mitochondria, and the womb (which is arguably already specified in the genome, unless mother nature has done a Ken Thompson http://cm.bell-labs.com/who/ken/trust.html at some point.)

But it's weird to claim that 50% of our genome encodes the brain. Really? Perhaps it's just that 50% is required by the brain, much of it being foundational to the whole organism (like standard libraries.)




> I'm not so sure about the simulation, but Ray's right that the brain cannot be more complex than the data that specifies it

Which would be true if the DNA specification for that particular part of the body was the only part of what specifies the brain. Myers is pointing out that assertion is patently false. The environment of developing creature and the interactions between cell types and their environment (and themselves) is a giant information content multiplier, and the DNA need not explicitly specify any of this information for it to exist and be relevant.

Bringing this to a familiar compsci example; imagine software for creating neural net recognizers. You can look at the source code for a net and say, "This will take N inputs and produce N outputs" You can look at a finished classifier and say, "Ah, I see what this does! It tells the airbags in this car when to deploy!" But that's as far as you can go without the training data that was used to train the classifier. This is a doubly good example because it's often very difficult to determine HOW a complex neural net is doing what it does, but it's fairly easy to explain how to train one to do that task.


Environment being another source of information was mentioned (in my 2nd paragraph), but interactions that are determined by their starting points don't add information - even if the result looks more complex. This idea (and the comment about the mandlebrot set) is similar to "pi holding all the information in the world" (assuming it's normal, meaning not repeating/regular) http://news.ycombinator.com/item?id=1567225

Whereas the training data for a neural net is extra information - but in utero, what is the training data that is not a predictable consequence of the genome? (ex utero there might be an argument, since humans not exposed to language don't develop it; although a group of isolated humans have developed language spontaneously - complex grammatical structures, the whole bit - which makes sense, given the variety of human languages. This supports the idea that language is genetic, or as Pinker provocatively describes it, the language instinct).

EDIT here's a thought experiment to illustrate why predictable interactions don't add information: taking the figure of six billion bits for the whole human genome, this means it can specify 2^6,000,000,000 different genomes (a lot). You can imagine changing one single bit, and all those complex interactions leading to a slightly different human phenome. Most of the possible phenomes wouldn't be a living human, or even anything recognizably human (or living). But the crucial point is that you simply can't specify any other phenomes (apart from those 2^6,000,000,000). You've changed all the bits - what else is there left to change within the genome?


> Whereas the training data for a neural net is extra information - but in utero, what is the training data that is not a predictable consequence of the genome?

For starters, the mother's chemistry which is a function of her DNA and environment. And the mother's physical environment, diet, health, etc. These are things that have no representation in the genome but can radically change brain structure in a developing mammal.

I'm not claiming the environment magnifies existing information, I'm claiming it's part of the total set and Kurzweil (and you) are vastly underestimating the amount of state that is associated with the exact details of a developing organism. This seems to be the thrust of Myers's point (at least in the beginning): you are simplifying and you are not allowed to do that.

Myers then follows that point by saying that even if we do manage to isolate all that information and understand it, we actually don't have certain critical problems like protein folding solved, or even reliably simulated yet.

Even if you hand-wave all this and assume it's possible, the notion that 10 years is the timeframe for this seems... excessively optimistic.


Actually, we're not in agreement.

> vastly underestimating the amount of state that is associated with the exact details of a developing organism.

Sir, kindly indicate where Myers makes that point. I read him as going straight into protein folding, and the complex interactions required for the expression of the genome. (I believe you agree that state in the environment that is caused by the expression of the DNA is not information originating in the environment - ie that this is merely magnifying information, as you put it.)

While there is an incredible amount of state created, in the from of gradients and so on, this is directed by the genome...

Or maybe this is our basic disagreement: do you think that an image of a mandelbrot set creates information as it is generated (and that pi creates information as each digit is calculated), or do you think that the information is defined within the algorithm that calculates it? [there are other issues, but just taking this one alone]

So I'm not 100% sure what you mean; please clarify if I understand you correctly.

While there's information in the environment, there's not very much: consider the white and yoke of a chicken egg. Like letters etched in metal with acid, most of the information is in the placement of the letters; the exact nature of the chemical reaction contributes a very small amount of information. Can you indicate why you think there is a great deal of information originating in the environment?

Yes, the heath of the mother can have an effect, but that's if she is unhealthy, and development does not proceed normally. Assuming she's healthy, the specific condition of the mother doesn't determine whether or not a human being is created. Are you suggesting otherwise?


Looks like I placed too much weight on your parenthetical "and themselves"

> interactions between cell types and their environment (and themselves) is a giant information content multiplier

and we're actually in agreement; my first comment included:

> I'm not so sure about the simulation

> The only other source of information is the non-genomic environment - extra-nucleur DNA like mitochondria, and the womb (which is arguably already specified in the genome, unless mother nature has done a Ken Thompson http://cm.bell-labs.com/who/ken/trust.html at some point.)


Is this sort of like having a pre-processor and system code to get a program loaded, linked and running?


Perhaps, only the pre-processor would be in a feedback loop with the early parts stages of the program life-cycle. Typically the linker and pre-processor are one way operations. In a developing organism, the environment feeds back on the cells developing and triggers new responses which causes new environmental changes which causes new feedback.

Biology is full of bizarre examples of when this process goes awry and our bodies have bizarre features. Dawkins's example of the Laryngeal nerve that makes a crazy loop down into the mammalian chest cavity is the classic example.


Maybe, if you consider the pre-processor to be, well, reality...


The only other source of information is the non-genomic environment - extra-nucleur DNA like mitochondria, and the womb (which is arguably already specified in the genome, unless mother nature has done a Ken Thompson http://cm.bell-labs.com/who/ken/trust.html at some point.)

That just gives you a newborn baby's brain, which is a pretty poor standard for displaying "human" intelligence. If we didn't get any smarter than that, we'd be pretty dumb by animal standards. To get to "human" intelligence, you have to be able to simulate a rich environment for the brain to learn from. You also have to model the growth of the brain and its response to stimulus -- the physics, chemistry, and biochemistry of the brain. DNA doesn't have to do that, because it runs on a platform with that functionality built-in (i.e., the real world.)


Yes, but it gives you the newborn baby's brain in toto, including its ability to grow up into a normal adult human. If you get to newborn's brain you are 99.9% of the way there from the AI side. By the time we get there, feeding it stimulus will be a relatively simple problem by comparison.


Not if the world is actually more complicated than a newborn baby's brain. Can we simulate a baby's interaction with its mother without simulating the mother's brain?


We have a world in hand. We're not trying to build a world simulation, we're trying to build intelligences. By the time we get this far, the infant will probably be embodied, and we can use real "mothers". Some speculate that a non-embodied being can never become intelligent.

Bear in mind we are talking about at least 20 years hence, in my mind.


Yup, wire it up to a baby-shaped I/O package and ask one of your grad students to take it home with her.


> but Ray's right that the brain cannot be more complex than the data that specifies it

Indeed, however due to the Kolmogorov complexity http://en.wikipedia.org/wiki/Kolmogorov_complexity one can argue both for DNA being a resource and, with more common sense, that DNA and all appropriate resources during a lifetime up to a point where you take a measure of a whole brain all funnels into a resource from which you can describe a brain.

And this is only about Kolmogorov Complexity and under assumption that brain is a discrete set. I am not much well read on biology, but I think there were recent advancements where some organisms also showed quantum effects playing a biological role. Even with disregarding our current lack of knowledge we can still argue if brain is a discrete set or not, proving either would be a major contributing factor in our understanding of it I think.


This strikes me as very similar to a Mandelbrot fractal. The fractal is overwhelmingly complex and intricate, but wholly mathematically described by a very simple formula. The complexity of the brain is similar: A few simple cells and proteins can interact in extraordinarily complex ways.

Some biologists understand this just fine, thank you very much.


In fact, some biologists have even written books on the subject:

http://books.google.com/books?id=lZcSpRJz0dgC&lpg=PP1...


But even looking at it in an information-theoretic way, what is being stated is that the instruction set is far more powerful than the binary analogy Kurzweil is using. The instruction I give: "Cure Cancer" has 11 bytes. But that instruction implies a ridiculously complicated set of sequences that we haven't figured out yet.

In this situation, look at the genome as the instruction set for protein construction and folding, an ongoing problem in research we have just begun to investigate. The information contained in the genome is combinatorially descriptive and therefore not as simple as made out to be, if you define information as the amount of "surprise" in the outcome.


I don't think it can be compared to instructions. More like just the startup conditions, like a rom or something. All analogies that compare computers to biological things are terrible. Mine is as much so as any other.


I see it as an upper bound for the Kolmogorov Complexity of the brains structure. I am very dubious that the first successful attempts at doing this will be as efficient as the genetic code though...


You are making the assumption that the information quantity is a good measure of the complexity, which is far from obvious. It is also wrong in general for the genetic code (for example, many organisms have a much bigger genetic code than human, even though they are arguably much simpler - e.g. rice has more encoding genes than human).

Also, having a few set of simple rules is not enough to understand or reproduce a system in general.


but Ray's right that the brain cannot be more complex than the data that specifies it, speaking information-theoretically.

You're conflating two very different concepts of complexity here.

One is the complexity of a static state of information, and the other is the complexity arising from dynamical systems.

As roadnottaken pointed out, fractals are perfect examples of systems that are described by very "simple" formulas, yet contain infinite complexity. It could therefore be said that the simple equation of the Mandelbrot set represents infinite complexity.

However, if you take a particular iteration of the formula, then you can get a finite concept of its complexity, i.e. how many bits it takes to represent the image you're seeing.

So, to say that the "the brain cannot be more complex than the data that specifies it" is true in one sense, but completely useless in another.

To put this into terms of the Mandelbrot, you can gather up all the bits that represents some particular iteration of the Mandelbrot, but that doesn't tell you anything about how it works, or even how to generate the next frame. You need the equation for that.

That's just one place where Ray fails. The second is the lingering question of whether computers in their current state are even capable of "simulating" a brain. It's still not an answered question of what the role of the non-determinism found in Quantum Mechanics plays in the brain and the interactions of various chemicals. It was recently shown that DNA relies on QM entanglement to "hold it together". If it turns out that non-determinism and QM effects play a crucial role in biology (which they almost certainly do), then the very rigid and deterministic system that is the CPU may simply be incapable of simulating a human brain.


The objective is to simulate the real-time activities of the brain, not the evolutionary-time activities that were and are acting on the brain.

Continuing from your extension of the fractal analogy, the former is more like an "iteration of the Mandelbrot" while the latter is "how to generate the next frame".

We do not need to know how our human-precursor brains worked, nor do we need to know what our human-successor brains will be like to successfully simulate current-human-brain intelligence.

It seems plausible to me that we will be able to understand how to simulate the functions of the brain without necessarily simulating the physical universe and its remarkable evolutionary unfolding--which seems to be the ultimate level of complexity and one that I agree is far beyond us.




Consider applying for YC's W25 batch! Applications are open till Nov 12.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: