AlphaGo Beats Lee Sedol in Final Game (gogameguru.com)
708 points by doppp on March 15, 2016 | 290 comments



This was probably the closest game in the series. Livestream: https://www.youtube.com/watch?v=mzpW10DPHeQ

A few months back, the expert consensus was that we were many years away from an AI playing Go at the 9-dan level. Now it seems that we've already surpassed that point. What this underscores, if anything, is the accelerating pace of technological growth, for better or for worse.

In game four, we saw Lee Sedol make a brilliant play, and AlphaGo make a critical mistake (typical of Monte Carlo-trained algorithms) following it. There's no doubt that with further refinement, we'll soon see AI play Go at a level well beyond human: games one through three already featured extraordinarily strong (and innovative) play on the part of AlphaGo.

Previous Discussions:

Game 4: https://news.ycombinator.com/item?id=11276798

Game 3: https://news.ycombinator.com/item?id=11271816

Game 2: https://news.ycombinator.com/item?id=11257928

Game 1: https://news.ycombinator.com/item?id=11250871


>A few months back, the expert consensus was that we were many years away from an AI playing Go at the 9-dan level.

These kinds of predictions are almost always useless. You can always find people who say it'll take n years before x happens, but no one can predict which approaches will work, and how much improvement they'll confer.

> What this underscores, if anything, is the accelerating pace of technological growth, for better or for worse.

What? This is a non-sequitur. Continued advancement doesn't mean that it is accelerating, and even if this does represent an unexpected achievement that doesn't mean that future development will maintain that pace.

Appreciate it for what it is - an historic achievement for AI & ML - and stop trying to attach broader significance to it.


> These kinds of predictions are almost always useless.

Let's rephrase. For a long time, the expert consensus regarding Go was that it was extremely difficult to write strongly-performing AI for. From the AlphaGo Paper: Go presents "difficult decision-making tasks; an intractable search space; and an optimal solution so complex it appears infeasible to directly approximate using a policy or value function."

For many years, the state-of-the-art Go AI stagnated or grew very slowly, reaching at most the amateur dan level. AlphaGo presents a huge and surprising leap.

> Continued advancement doesn't mean that it is accelerating

At constant time intervals, AI is tackling problems that appear exponentially more difficult. In particular, see Checkers (early '90s) vs Chess ('97) vs Go ('16). The human advantage has generally been understood to be the breadth of the game tree, which is roughly a proxy for the complexity of the game.

If we let x be the maximum complexity of a task at which AI performs as well as a human, then I would argue that x has been growing at an accelerating pace over the past few decades.
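
To put rough numbers behind that, here's a back-of-the-envelope sketch (not from the AlphaGo paper; the branching factors and game lengths are just the commonly cited approximations for chess and Go):

    # Rough game-tree sizes, b^d, using commonly cited approximate
    # branching factors (b) and game lengths in plies (d).
    import math

    games = {
        "chess": (35, 80),    # ~35 legal moves per position, ~80 plies
        "go":    (250, 150),  # ~250 legal moves per position, ~150 plies
    }

    for name, (b, d) in games.items():
        print(f"{name:>5}: ~10^{d * math.log10(b):.0f} nodes in the game tree")

The point isn't the exact exponents, just that the gap between these games is itself exponential, which is what makes the checkers-to-chess-to-Go progression feel like more than linear progress.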


"and an optimal solution so complex it appears infeasible to directly approximate using a policy or value function."

To be clear, the above refers to specific concepts in Reinforcement Learning.

A policy is a function from state (in Go, where all the stones are) to action (where to place the next stone). I agree that an effective policy function is unlikely, at least one that can be computed efficiently (without tree search); otherwise it's not what a Reinforcement Learning researcher typically calls a policy function.

A value function is a function from state to numerical "goodness", and is more or less one step removed from a policy function: you can choose the action that takes you to the state with the highest value. It has the same representational problems as the policy function.
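
For anyone who hasn't seen the RL jargon before, here's a minimal sketch of the two signatures being discussed (hypothetical Python interfaces, not anything from AlphaGo's code):

    from typing import Callable, Dict, List, Tuple

    State = Tuple[Tuple[int, ...], ...]   # the board: 0 empty, 1 black, 2 white
    Action = Tuple[int, int]              # where to place the next stone

    def policy(state: State) -> Dict[Action, float]:
        """Map a position directly to a probability distribution over moves.
        This is the function argued to be infeasible to approximate directly."""
        raise NotImplementedError

    def value(state: State) -> float:
        """Map a position to an estimate of how good it is for the player to move."""
        raise NotImplementedError

    def greedy_from_value(state: State, legal_moves: List[Action],
                          apply_move: Callable[[State, Action], State]) -> Action:
        """One step removed from a policy: pick the move whose successor state
        has the highest value (apply_move is a hypothetical helper)."""
        return max(legal_moves, key=lambda a: value(apply_move(state, a)))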


> AI is tackling problems that appear exponentially more difficult.

The hardest AI problems are the ones that involve multiple disciplines in deep ways. Here's a top tier artificial intelligence problem: given a plain English description of a computer program, implement it in source code.

There might be some cases where this is possible, and some cases that are bound to fail.

Those are the kinds of difficult problems in AI, which combine knowledge, understanding, thought, intuition, inspiration, and perspiration - or demand invention. We would be lucky to make linear progress in this area, let alone exponential growth.

I think there's certainly an impression of exponential progress in AI in popular culture, but the search space is greater than factorial in size, and I think hackers should know that.


> To be fair, in terms of the complexity of rules, checkers is easier to understand than go which is easier to understand than chess. And honestly, go seems like the kind of brute-force simple, parallel problem that we can solve now without too much programming effort

Your intuition is mistaken. Go is indeed "easier to understand" than Chess in terms of its rules, but it is arguably harder to play well and has a way larger search space, which makes it less amenable to brute force, and this was precisely why people thought it'd be impossible for a computer to play it consistently at champion level.

I don't think the achievement of AlphaGo is solely due to increased processing power, otherwise why did people think Go was such a hard problem?


> it is arguably harder to play well and has a way larger search space, which makes it less amenable to brute force, and this was precisely why people thought it'd be impossible for a computer to play it consistently at champion level.

Are human champions not subject to those same difficulties of the game, though? When you're pitting the AI against another player who's also held back by the large branching factor of the search tree, then how relevant really is that branching factor anyway in the grand scheme of things? A lot of people talk about Go's search space as if human players magically aren't affected by it too. And the goal here was merely to outplay a human, not to find the perfect solution to the game in general.

(These are honest questions -- I am not an AI researcher of any kind.)


Go players rely heavily on pattern recognition and heuristics, something we know humans to be exceptionally good at.

For example, go players habitually think in terms of "shape"[1]. Good shape is neither too dense (inefficiently surrounding territory) nor too loose (making the stones vulnerable to capture). Strong players intuitively see good shape without conscious effort.

Go players will often talk about "counting" a position[2] - consciously counting stones and spaces to estimate the score or the general strength of a position. This is in contrast to their usual mode of thinking, which is much less quantitative.

Go is often taught using proverbs[3], which are essentially heuristics. Phrases like "An eye of six points in a rectangle is alive" or "On the second line eight stones live but six stones die" are commonplace. They are very useful in developing the intuition of a player.

As I understand it, the search space is largely irrelevant to human players because they rarely perform anything that approximates a tree search. Playing out imaginary moves ("reading", in the go vernacular) is generally used sparingly in difficult positions or to confirm a decision arrived at by intuition.

Go is the board game that most closely maps to the human side of Moravec's paradox[4], because calculation has such low value. AlphaGo uses some very clever algorithms to minimise the search space, but it also relies on 4-5 orders of magnitude more computer power than Deep Blue.

  [1] https://en.wikipedia.org/wiki/Shape_(Go)
  [2] http://senseis.xmp.net/?Counting
  [3] https://en.wikipedia.org/wiki/Go_proverb
  [4] https://en.wikipedia.org/wiki/Moravec%27s_paradox


quoting https://news.ycombinator.com/item?id=10954918 :

> Go players activate the brain region of vision, and literally think by seeing the board state. A lot of Go study is seeing patterns and shapes... 4-point bend is life, or Ko in the corner, Crane Nest, Tiger Mouth, the Ladder... etc. etc.

> Go has probably been so hard for computers to "solve" not because Go is "harder" than Chess (it is... but I don't think that's the primary reason), but instead because humans brains are innately wired to be better at Go than at Chess. The vision-area of the human's brain is very large, and "hacking" the vision center of the brain to make it think about Go is very effective.


This is a great question!

Sadly, I'm neither an AI researcher nor a Go player; I think I've played fewer than 10 games. I don't know if we truly understand how great Go players play. About 10 years ago, when I was interested in Go computer players, I read a paper (I can't remember the title, unfortunately) that claimed that the greatest Go players cannot explain why they play the way they do, and frequently mention their use of intuition. If this is true, then we don't know how a human plays. Maybe there is a different thought process which doesn't involve backtracking a tree.


Sure.


The problem with Go was evaluating leaf nodes. Sure, you could quickly enumerate every possible position 6 moves out, but accurately deciding whether position 1 is better than positions 2 through 2 billion is a really hard problem.

In that respect chess is a much simpler problem: you remove material from the board, some squares are preferable to others, and so on. In Go, both sides generally have about the same number of stones on the board, and it's all about balancing local and board-wide gains.


While I understand what you are getting at (this is still just a complete-information game, and solving it didn't solve AI), you are drastically understating the complexity of Go. It isn't actually possible to evaluate a significant fraction of the state tree in the early midgame because the branching factor is roughly 300. The major advance of AlphaGo is a reasonable state-scoring function using deep nets.

Unless you have a PhD or are a PhD student in AI who has kept up with the current deep net literature, I assure you that the whole of AlphaGo will be unintuitive to you. However, if you were an AI PhD student, you likely wouldn't be so dismissive of this achievement.


> The major advance of AlphaGo is a reasonable state scoring function using deep nets.

That and the policy network to prune the branching factor.
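
Roughly, "pruning the branching factor with a policy network" amounts to something like this sketch (hypothetical helper names, not DeepMind's code; AlphaGo proper uses the prior to bias move selection inside the tree search rather than as a hard cutoff, but the effect on the branching factor is similar):

    def expand_node(state, policy_net, legal_moves, top_k=20):
        """Keep only the top_k moves by policy prior for further search,
        instead of expanding all ~250 legal moves."""
        priors = policy_net(state)                       # {move: probability}
        ranked = sorted(legal_moves, key=lambda m: priors.get(m, 0.0), reverse=True)
        return ranked[:top_k]                            # ~250 children -> top_k children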


> Here's a top tier artificial intelligence problem: given a plain English description of a computer program, implement it in source code.

I would consider it a breakthrough if we could get human beings to do this at a decent rate :)


Even harder and more common problem -- given code, give a plain English description of what it is intended to do, and describe any shortcomings of the implementation.


Yeah e.g. you could get it to check whether it could go into an infinite loop.

Oh wait .... https://en.wikipedia.org/wiki/Halting_problem


You could, for all practical purposes. The Halting Problem only applies in general when you're considering all possible programs; you really only need to consider the well-written ones, because you can filter out the poorly written ones.


Here's a top tier human intelligence problem: given a requirement, provide an accurate English description of a program.


Wait what is the plan to brute force go? The search space is beyond immense...


> If we let x be the maximum complexity of a task at which AI performs as well as a human, then I would argue that x has been growing at an accelerating pace over the past few decades.

At ONE task, yes. But humans, while merely average at many individual things, excel at being able to adapt to many different tasks, all the time. Typical AIs (as we know them now) cannot ever hope to replicate that.


This seems to have been linked to a lot recently but I feel it is relevant to the discussion on technology advances pertaining to AI.

http://waitbutwhy.com/2015/01/artificial-intelligence-revolu...


> Continued advancement doesn't mean that it is accelerating, and even if this does represent an unexpected achievement that doesn't mean that future development will maintain that pace.

Advancement faster than predicted does mean accelerating advancement, when coupled with the (true) fact that people's predictions tend to assume a constant rate of advancement [citation needed]. Actually, all you'd need to show accelerating advancement is a trend of conservative predictions and the fact that these predictions assume a non-decreasing rate of advancement; if we're predicting accelerating advancement and still underestimating its rate, advancement must still be accelerating.

It even seems like this latter case is where we're at, since people who assume an accelerating rate of advancement seem to assume that the rate is (loosely) quadratic. However, given that the rate of advancement tends to be based on the current level of advancement (a fair approximation, since so many advancements themselves help with research and development), we should expect it to be exponential. That's what exponential means.

However, the reality seems like it might be even faster than exponential. This is what the singularitarians think. When you plot humanity's advancements using whatever definition you like, look at the length of time between them to approximate rate, and then try to fit this rate to a regression, it tends to fit regressions with vertical asymptotes.
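
To spell out the difference between those two regimes (standard growth-model math, nothing specific to AI): if the rate of advancement is proportional to the current level you get an exponential, and if it grows faster than linearly in the current level you get the finite-time blow-up that the vertical-asymptote fits suggest.

    \frac{dA}{dt} = kA \;\Longrightarrow\; A(t) = A_0 e^{kt}
    \quad\text{(exponential growth)}

    \frac{dA}{dt} = kA^{p},\ p > 1 \;\Longrightarrow\;
    A(t) = \frac{A_0}{\left(1 - (p-1)\,k\,A_0^{\,p-1}\,t\right)^{1/(p-1)}}
    \quad\text{(diverges at } t^{*} = \tfrac{1}{(p-1)\,k\,A_0^{\,p-1}}\text{)}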


> These kinds of predictions are almost always useless. You can always find people who say it'll take n years before x happens, but no one can predict which approaches will work, and how much improvement they'll confer.

True, but it's pretty refreshing to have a prediction about AI being N years from something that is wrong in the OTHER direction.

Contrary to your point about 'appreciate it for what it is', there is ONE lesson I hope people take from it: You can't assume AI progression always remains in the future.

A general cycle I've seen repeated over and over:

* sci-fi/futurists make a bunch of predictions
* some subset of those predictions are shown to be plausible
* general society ignores those possibilities
* an advancement happens with general societal implications
* society freaks out

Whether it's cloning (a la Dolly the sheep, where people demonstrated zero understanding of what genetic replication is, e.g. that a genetic clone isn't "you"), or self-driving cars (after decades of laughing at the idea because "who would you sue?", society is suddenly scrambling to adjust because it never wanted to think past treating that question as academic), or everyone having an internet-connected phone in their pocket (see the encryption wars... again), or the existence of a bunch of connected computers with a wealth of knowledge available, society has always done little to avoid knee-jerk reactions.

Now we have AI (still a long way off from AGI, granted) demonstrating not only can it do things we thought weren't going to happen soon (see: Siri/Echo/Cortana/etc), but breaking a major milestone sooner than most anyone thought. We've been told for a long time that because of typical technology patterns, we should expect that the jump from "wow" to "WOW!" will happen pretty quickly. We've got big thinkers warning of the complications/dangers of AI for a long time.

And to date, AI has only been a big joke to society, or the villain of B-grade movies. It'd be nice, if just once, society at least gave SOME thought to the implications a little in advance.

I don't know when an AGI will occur - years, decades, centuries - but I'm willing to bet it takes general society by surprise and causes a lot of people to freak out.


> > What this underscores, if anything, is the accelerating pace of technological growth, for better or for worse.

> What? This is a non-sequitur. Continued advancement doesn't mean that it is accelerating, and even if this does represent an unexpected achievement that doesn't mean that future development will maintain that pace.

It's not a non-sequitur, but there is an implicit assumption you perhaps missed. The assumption is that the human failure to predict this AI advance is caused by an evolution curve with order higher than linear. You see, humans are amazingly good at predicting linear change. We are actually quite good at predicting x² changes (frisbee catching). Higher than that, we are useless. Even at x², we fail in some scenarios (braking distance at unusual speeds, like 250km/h on the autobahn for example).

The fact that it will maintain its pace is an unfounded assumption. However, assuming that the pace will slow is as unfounded. All in all, I'd guess it is safest to assume tech will evolve as it has in the last 5000 years.

That would be an exponential evolution curve.


These kind of statements are only valuable to me if they are followed by "And these are the challenges that need to be overcome which are being worked on".

Otherwise it's a blanket retort. It's like saying "There are lots of X".

Ok, name 7. If you get stuck after 2 or 3 you're full of it.


>>You can always find people who say it'll take n years before x happens

Interesting, people seem to be saying the same about self driving cars.


You sound like the kinda person who says "AI will never drive," "AI will never play Go." True there's a lot of hype, which ML experts are concerned may lead to another burst & winter. On the flip-side there's a lot of curmudgeonly nay-sayers such as yourself at which ML experts roll their eyes and forge ahead. What I find is both extremes don't understand ML, they're just repeating their peers. ML is big, and it's gonna do big things. Not "only Go", not "take over the world"; somewhere in between.


I'm actually very optimistic about the state of AI and ML lately. The difference is that I don't anthropomorphize the machines or ascribe human values to their behavior. I absolutely believe AI will drive (and save lives); I have always believed that AI will play Go; I believe that AI will grow to match and surpass humans in many things we assume that only humans can do. Humans aren't perfect, but that doesn't mean that machines who outperform us are perfect either.

AlphaGo plays Go. It probably doesn't play Go like a human (because a human probably can't do what it does), but that's OK because it also appears to be better than humans. AlphaGo is interesting not because it has done something impossible, but because it has proven possible a few novel ideas that could find other interesting applications, and adds another notch to the belt of a few other tried and tested techniques.


> What this underscores, if anything, is the accelerating pace of technological growth, for better or for worse.

While growth may be accelerating, this is simply the result of one big paradigm shift in deep learning/NNs. Once we've learned to milk it for all its worth, we'll have to wait for the next epiphany.


But that's what technological growth is. A series of epiphanies, building on what came before.


Yes, but an epiphany is not evidence of an accelerating rate of epiphanies, nor evidence that such epiphanies will continue apace into the future.


You can look at the past for that, although obviously it doesn't predict the future. But it ought to be a priori obvious, at least, that the more you know (as a species), the more surface area of knowledge you have to synthesize into an extending step beyond the known.


You could look at the past, but that isn't what the claim did.

In fact looking at the rate of change in applications over an "epiphany" period is probably the least useful estimate of progress & rate of change in progress.


Would you care to explain what that big paradigm shift was?



Milking neural networks out completely is pretty much AI as depicted in the movies. If we can milk it completely there probably isn't a need for the next epiphany.


You're basically saying that there's no task (including passing the Turing test, programming web apps, etc.) which requires intelligence and is best tackled with either something else than a neural network or with NN combined with something else. I think it's a pretty bold statement which is really hard to back up by anything but a hunch.


Our current assertion is that neural networks basically replicate the brain's function, so our current understanding of this paradigm is that "milking neural networks" is going to match or exceed human general purpose intelligence.

I believe hmate9 is correct. If this paradigm is exploited to the full, unless we've missed something fundamental about how the brain works, we don't need to bother ourselves with inventing the next paradigm (of which there will no doubt be many), because one of the results of the current paradigm will be either an AGI (Artificial General Intelligence) that runs faster and better than human intelligence, or, more likely, an ASI (Artificial Super Intelligence). Either of those is more capable than we are for the purpose of inventing the next paradigm.


No deep learning researcher believes neural networks "basically replicate" the brain's function. Neural nets do a ton of things brains don't do (nobody believes the brain is doing stochastic gradient descent on a million data points in mini-batches). Brains also do a billion things that neural nets don't do. I've never even taken a neuroscience class, and I can think of the following: synaptic gaps, neurotransmitters, the concept of time, theta oscillations, all or nothing action potentials, Schwann cells.

You have missed something fundamental about how the brain works. Namely, neuroscientists don't really know how it works. Neuroscientists do not fully understand how neurons in our brain learn.

According to Andrew Ng (https://www.quora.com/What-does-Andrew-Ng-think-about-Deep-L...):

"Because we fundamentally don't know how the brain works, attempts to blindly replicate what little we know in a computer also has not resulted in particularly useful AI systems. Instead, the most effective deep learning work today has made its progress by drawing from CS and engineering principles and at most a touch of biological inspiration, rather than try to blindly copy biology.

Concretely, if you hear someone say "The brain does X. My system also does X. Thus we're on a path to building the brain," my advice is to run away!"


You are right, we do not know everything about the brain. Not even close. But neural networks are modelled on what we do know of the brain. And "milking" neural networks completely means we have created an artificial brain.


Did you just ignore the first few lines of argonaut's comment?

Recently, we also introduced activation functions in our neural nets, like rectified linear and maxout, just for their nice mathematical properties without any regard to biological plausibility. And they do work better than what we had before.
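
For concreteness, both of those activations are a couple of lines each, and neither pretends to be biology (a minimal numpy sketch):

    import numpy as np

    def relu(z):
        """Rectified linear unit: cheap and avoids saturating gradients."""
        return np.maximum(0.0, z)

    def maxout(z, k=2):
        """Maxout: the max over k affine 'pieces' per unit.
        z has shape (..., units * k); the output has shape (..., units)."""
        return z.reshape(*z.shape[:-1], -1, k).max(axis=-1)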


"unless we've missed something fundamental about how the brain works"

But we don't know how the brain works. I think you extrapolate too far. Just because a machine learning technique is inspired by our squishy connectome it does not mean it's anything like it.

I'm willing to bet there are isomorphisms of dynamics between an organic brain and a neural net programmed on silicon but as far as I know, there are still none found - or at least none are named specifically (please correct me).


   Our current assertion is that neural networks basically replicate the brain's function
No. Just, no. This was never really a claim made by people who understood neural nets (there was a little perceptron confusion in the 60s iirc).


> Our current assertion is that neural networks basically replicate the brain's function

come on, that's hyperbole


Or, at the very least, the next epiphany need not be human-designed. Just train a neural network in the art of creating AI paradigms and implementations that can do general purpose AI. Once that's "milked", the era of human technological evolution is finished.


I don't want to be mean, but that's like saying you'll train a magic neural net with the mystical flavour of unicorn tears and then the era of making rainbows out of them will be finished. Or something.

I mean, come on- "the art of creating AI paradigms"? What is that even? You're going to find data on this, where, and train on it, how, exactly?

Sorry to take this out on you but the level of hand-waving and magical thinking is reaching critical mass lately, and it's starting to obscure the significance of the AlphaGo achievement.

Edit: not to mention, the crazy hype surrounding ANNs in the popular press (not least because it's the subject of SF stories, like someone notes above) risks killing nascent ideas and technologies that may well have the potential to be the next big breakthrough. If we end up to the point where everyone thinks all our AI problems are solved, if we just throw a few more neural layers to them, then we're in trouble. Hint: because they're not.


I totally see your point and my purpose is definitely not to be alarmist and sound the alarm that skynet is about to come out of AlphaGo or some other equivalent neural net. But I think the opposite attitude is also false.

As others have pointed out, we don't really know how the brain works. Neural nets represent one of our best attempts to model brains. Whether or not it's good enough to create real intelligence is completely unknown. Maybe it is, maybe it's not.

Intelligence appears to be an emergent property and we don't know the circumstances under which it emerges. It could come out of a neural network. Or maybe it could not. The only way we'll find out is by trying to make it happen.

Taking a position that neural networks cannot ever result in strong AI is as blind as taking a position that they must.

This is Hacker News, not a mass newspaper, so I think we can take the more nuanced and complex view here.


>> Neural nets represent one of our best attempts to model brains.

See now that's one of the misconceptions. ANNs are not modelled on the brain, not anymore and not ever since the poor single-layer Perceptron which itself was modelled after an early model of neuronal activation. What ANNs really are is algorithms for optimising systems of functions. And that includes things like Support Vector Machines and Radial Basis Function networks that don't even fit in the usual multi-layer network diagram particularly well.

It's unfortunate that this sort of language and imagery is still used abundantly, by people who should know better no less, but I guess "it's an artificial brain" sounds more magical than "it's function optimisation". You shouldn't let it mislead you though.

>> Taking a position that neural networks cannot ever result in strong AI is as blind as taking a position that they must.

I don't agree. It's a subject that's informed by a solid understanding of the fundamental concepts, function optimisation, again. There's uncertainty because there are theoretical limits that are hard to test, for example the fact that multi-layer perceptrons with three neural layers can approximate any function given a sufficient number of hidden units, or on the opposite side, that non-finite languages are _not_ learnable in the limit (not ANN-specific, but limiting what any algorithm can learn), etc. But the arguments on either side are, well, arguments. Nobody is being "blind". People defend their ideas, is all.


Convolutional neural nets are the most accurate model of the ventral stream, numerically speaking. See work by Yamins, DiCarlo etc.


We don't really know how AI works either. NNs (for example) do stuff, and sometimes it's hard to see why.

>Taking a position that neural networks cannot ever result in strong AI is as blind as taking a position that they must.

Not really. Right now it's taking the position that there is no practical path that anyone can imagine from a go-bot, which is working in a very restricted problem space, to a magical self-improving AI-squared god-bot, which would be working in a problem space with a completely unknown shape, boundaries, and inner properties.

Meta-AI isn't even a thing yet. There are some obvious things that could be tried - like trying to evolve a god-bot out of a gigantic pre-Cambrian soup of micro-bots where each bot is a variation on one of the many possible AI implementations - but at the moment basic AI is too resource intensive to make those kinds of experiments a possibility.

And there's no guarantee anything we can think of today will work.


That sounds like a bad idea.


It's the core idea of AI, the primary reason why it is suspected that developing strong AI will inevitably lead to the end of the human era of evolution.


> In game four, we saw Lee Sedol make a brilliant play, and AlphaGo make a critical mistake (typical of monte carlo-trained algorithms) following it.

Can you explain why this is typical? What can be done against this to strengthen the algorithm?


It seems that AlphaGo needs better time management skills. Not sure how that can be added. Michael Redmond mentioned that if a human player sees an unexpected move, he/she would just take all the time needed to read out the moves. AlphaGo seems to make speedy decisions even after unexpected moves.


Yes, that's how modern chess engines manage time. If the score suddenly, drastically changes during search, they give themselves much more time.

In all of these games, AlphaGo used close to a constant amount of time per move, while Lee's varied a lot.

Apparently they only recently added a neural net for time management. Seems it is either not the best approach, or just not yet well trained.
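
Something like the following rule is what's being described; a sketch with made-up numbers, not AlphaGo's or any particular engine's actual scheme:

    def time_budget(base_seconds, prev_winrate, curr_winrate,
                    remaining_seconds, panic_drop=0.10, panic_factor=4.0):
        """How long to think on this move.

        prev_winrate / curr_winrate: estimated win probability before and
        after the opponent's last (possibly unexpected) move.
        """
        budget = base_seconds
        if prev_winrate - curr_winrate > panic_drop:     # position suddenly looks worse
            budget *= panic_factor                       # take extra time to re-read it
        return min(budget, 0.25 * remaining_seconds)     # never burn most of the clock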


I can't remember where I read this, but one theory was that the move Lee Sedol made was thought to be unlikely by AlphaGo, and so didn't explore down that path.

When Lee Sedol made the move, the AI was in unknown territory as it hadn't explored down that avenue.


David Silver said at the beginning of the broadcast of game 5 that AlphaGo's policy network had given Lee Sedol's move 78 only a 1 in 10,000 chance of occurring.


> When Lee Sedol made the move, the AI was in unknown territory as it hadn't explored down that avenue.

Sounds similar to what a human would do then: you wouldn't spend much time simulating in your head what would happen if your opponent made a very atypical move or a move that would seem very bad at first thought.


That's exactly it. The difference, as far as I have understood it, is that there was a similar move that is typical, but in that particular situation, pretty simple reasoning (of the highly abstract "if this then that so this must lead to that" sense) leads a human to conclude that this version of the move is superior.

So while atypical in the sense of "occurring infrequently", it was not a difficult move to find for a player of that level – all the pro commentators saw it pretty much right away.

This might be the one weakness of AlphaGo, which is interesting.


In an odd way, it makes me more optimistic about fusion power plants in my lifetime. The reality is that we work on these advances but are terrible at predicting when we will achieve them, and then one day we find we have arrived.

That AlphaGo can play at this level suggests that similar techniques could help other parts of the infrastructure (like air traffic control), and that would also positively impact the quality of life for many air passengers every year.


Fusion power is neat, but not really necessary.

Fusion would have similar political problems to fission; and the economics aren't much improved either.

Perhaps if we ever ran out of fissionable material, fusion would become economic.


What political problems might that be?


All the anti-nuclear protestors? Or, e.g., whatever made Angela Merkel turn off the German reactors after a reactor of a very different design, in a very different set of circumstances, broke in Japan.

Fusion is just yet another nuclear reactor design as far as politics might be concerned.


Ah, I see. Although I would not put it past people to protest nuclear fusion, it would be strange indeed, since nuclear fusion does not produce the same kind of radioactive waste (shorter half-lives) as alternative nuclear technologies.


There's no doubt that with further refinement, we'll soon see AI play Go at a level well beyond human

No doubt? Seriously? What kind of knowledge do you have to make such statements? There are plenty of examples where technology has rapidly advanced to some remarkable level, but then almost completely plateaued. For example, space travel or Tesla's work on applications of electromagnetism. Heck, even other areas of AI research.

I really don't see why people here readily assume that this particular approach to computers playing Go is easily improvable. Neither do I see why everyone assumes there will be no discoveries of anti-AI strategies that will work well against it.

With neural networks involved, it's hard to say. And all we have so far is information about, what, 15 games? Some of which were won by people. Mind you, those people never played AlphaGo before, while the bot benefited from a myriad of training samples, as well as from the Go expertise of some of its creators.

I'm also tired of all the statements about "accelerating progress". It's not like all the AI research of the past was useless until DNNs came along. That's the narrative I often get from the media, but it misrepresents the history of the field. There was no shortage of working ML/AI algorithms in the past decades. The main problem was always at applying them to real-world things in useful ways. And in that sense, AlphaGo isn't much different from Deep Blue.

One big shift in the field is that these days a lot of AI research is done by corporations rather than universities. Corporations are much better at selling whatever they do as "useful", which isn't such a good thing in the long run. We're redefining progress as we go and moving goalposts for every new development.


> No doubt? Seriously? What kind of knowledge do you have to make such statements?

Uh, click the link in the OP and find out? AI just beat a top 5 human professional 4-1. Go rankings put that AI at #2 in the world.

If AlphaGo improves at all at this point it will have achieved a level well beyond any human.

It is incredibly, ludicrously unlikely that AlphaGo has achieved the absolute peak of its design, given that it went from an Elo rating of ~2900 to ~3600 in just a few months.


There is actually a lot of room for improvement. Just some of the things:

(1) Better timing control. Maybe when the probability of winning drops below, say, 50% but has not hit the losing threshold, spend extra time.

(2) Introducing "anti-fragility". Maybe even train the net asymmetrically to play from losing positions to gain more experience with that.

(3) Debug and find out why it plays what look like nonsense forcing moves when it thinks it is behind (assuming that is what is actually happening).

There's another interesting thing. Among the Go community, there might have been some misplaced pride initially. But the pros and the community very quickly changed their attitude about AlphaGo (as they have in the past, when something that seemed not to work proved itself in games). They are seeing an opportunity for the advancement of Go as a game. I think a lot of the pros are very curious, even excited, and might be knocking on Google's doors to try to get access to AlphaGo.


To be fair, I think a larger sample size of human vs computer games are needed. Let the top pros train with the computers and we can measure what level is beyond any human.


Being the best ranked player != playing well beyond humans. When the AI can play 1,000 games and never lose that's well beyond people.

Granted, chess AI is basically at that point right now. But, go AI has a ways to go.


Given the leaps of progress made between this series of games and the previous series in only a few months, I'd expect "never lose" will become a recognized reality in about a year.


Possibly, it's not clear if AlphaGo is playing better or simply approaching the game differently. Game five was close and AlphaGo seemed to mostly win due to time considerations.

PS: Honestly, it might be a year or a decade, but I suspect there is plenty of headroom to drastically surpass human play.


When AlphaGo does lose, it seems to happen when outright bugs cause it to make moves that are readily recognizable as mistakes. It doesn't seem to happen because it's not quite "smart" enough, or because its underlying algorithms are fundamentally flawed.

That's a big difference. Bugs can be identified and fixed. By the time AlphaGo faces another top professional (Ke Jie?) we can safely assume that whatever went wrong in Game 4 won't happen again.

Consider how much stronger the system has become in the few months since the match against Fan Hui. Another advance like that will place it far beyond the reach of anything humans will ever be able to compete with.


> When AlphaGo does lose, it seems to happen when outright bugs cause it to make moves that are readily recognizable as mistakes

I'm not sure this is true. It made the wrong move at move 79 in game 4, but I'm not sure that should be considered an obvious mistake.

My understanding is that the moves that people said were most obviously mistakes later in the game were a result of it being behind (and desperately trying to swing the lead back in its favor), rather than a cause.


Go rankings put that AI at #2 in the world.

Go rankings weren't designed for ML algorithms, which can have high-level deficiencies and behave erratically under certain conditions.


It would be a bizarre coincidence for the technology to advance so quickly and then stop right at the level of the best human players. That's especially so when there are so many big, lucrative applications for the underlying technology.


A critical component of AlphaGo's success is the massive training database comprising the entire history of documented professional Go games. So while AlphaGo may play the game with an inhuman clarity of reading, it is less clear that it can strategically out-match, in the long term, professionals who may have an opportunity to find and exploit weaknesses in AlphaGo's process. Lee Sedol had that opportunity, of course, and he was not able to defeat AlphaGo. And how will AlphaGo improve, now that there are no stronger players from whom to learn?

Will AlphaGo show us better strategies that have never been done before? In other words, can AlphaGo exhibit creative genius? It may have, but that's rather hard for us to observe.

In any case, I am looking forward to future AI vs AI games. It is still fundamentally a human endeavor.


Can't find the reference now, but in recent interviews the AlphaGo team claimed that one of their next steps would involve training a system without that training database, from scratch (simply by playing lots of games against different versions of itself), and that they estimate that it would be just a bit weaker.


Most of AlphaGo's learning came from self-play. Hence how it was able to vastly exceed the skill level of its initial training data which were amateur, not professional, games.


I don't know if it would be that bizarre. Once AlphaGo can beat the best humans on Earth, what motivation is there to keep improving it? Wasn't that the goal of the project?


Advances in deep learning in general should apply here, and there's a big motivation to keep improving that. Also, Go is popular enough that it should experience the same sort of commoditization drive that advanced Chess engines did, where Deep Blue level play went from being on a supercomputer to being on a smartphone. Then, since this approach scales up with more computing power, running a hypothetical future smartphone-Go engine on a big cluster like AlphaGo has here should put it way beyond the human level.


AlphaGo is still a monstrosity in terms of the hardware it requires. Improvements in AlphaGo will be reflected in the fact that it or something like it will soon sit on a tiny little computer near you. See also: what happened after the chess world champion lost to a computer.


>> Corporations are much better at selling whatever they do as "useful", which isn't such a good thing in the long run.

Yep. There's a grave risk that funding to AI research ends up being slashed just as badly as in the last AI winter, if people start thinking that Google has eaten AI researchers' lunch with its networks and there's no point in trying anything else.

Incidentally, Google would be the first to pay the price of that, since they rely on a steady stream of PhDs to do the real research for them but now I'm just being mean. The point is, we overhype the goose that lays the golden eggs, we run out of eggs.


I like that analogy; we have a perfectly good goose, laying nice, valuable eggs and people keep shouting "they're gold!".


The deepmind team has mentioned that the technique they used to improve AlphaGo's play from October 2015 (when it beat the European champion, who was ranked #600 at that time) until now has not reached the point of diminishing returns yet.

Many go professionals, after reviewing the 2 sets of games, have stated that it is quite clear how much AlphaGo has improved in those 4 months.


Well, little doubt. When did any technology suddenly stop improving when it reached human levels?


> There are plenty of examples where technology has rapidly advanced to some remarkable level, but then almost completely plateaued.

And that's why you assume that it does not skyrocket in the future? Predicting the future is hard either way, ask a turkey before he gets his head chopped off.

> I'm also tired of all the statements about "accelerating progress". It's not like all the AI research of the past was useless until DNNs came along.

It's not that it was useless, but AI is improving as any other field is, some say faster than most other fields, and it's becoming more useful from day to day.

My guess would also be that "with further refinement, we'll soon see AI play Go at a level well beyond human", but it's just a guess.


I have almost no doubt. A few months ago it beat a weaker pro, and judging from the improvements in such a short time, I am fairly certain it will be unbeatable in a few months if they continue working on it.


This reads as if the professional Go world couldn't wait for AlphaGo to arrive so they could find new ways of playing and new moves.


> There's no doubt that with further refinement, we'll soon see AI play Go at a level well beyond human

Will we though? AlphaGo trains on human games, so can it go well beyond that level? Will it train on its own games?


AlphaGo was actually only trained on publicly available amateur (that is, strong amateur) games. After that, AlphaGo was trained by running a huge number of games against itself (reinforcement learning).

A priori, this makes sense: you don't need to train on humans to get a better understanding of the game tree. (See any number of other AIs that have learned to play games from scratch, given nothing but an optimization function.)
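
In caricature, the two-stage pipeline being described looks like this (the `policy` object, `play_game`, and their methods are hypothetical stand-ins; the real training loop is far more involved):

    import random

    def train(policy, human_games, selfplay_iters):
        # Stage 1: supervised learning -- imitate moves from (strong amateur) human games.
        for position, human_move in human_games:
            policy.update_towards(position, human_move)

        # Stage 2: reinforcement learning -- play against past snapshots of itself
        # and reinforce the moves that led to wins.
        opponents = [policy.snapshot()]
        for _ in range(selfplay_iters):
            opponent = random.choice(opponents)          # a past version of itself
            moves, policy_won = play_game(policy, opponent)
            for position, move in moves:
                policy.reinforce(position, move, won=policy_won)
            opponents.append(policy.snapshot())
        return policy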


Yes, but is it known whether there's some limit to what you can reach doing this? I mean, if they had trained it on games of bad amateur players instead of good ones, and then had it play itself, would it keep improving continuously up to the current level or hit some barrier?


That's why they only initially trained it on human players, and afterwards, they trained it on itself. I would guess (strongly emphasize: guess) that they trained it on humans just to set initial parameters and to give it an overview of the structure and common techniques. It would've probably been possible to train AlphaGo on itself from scratch, but it would've taken much longer -- amateur play provides a useful shortcut.

I don't think there is a theoretical upper limit on this kind of learning. If you do it sufficiently broadly, you will continuously improve your model over time. I suppose it depends to what extent you're willing to explicitly explore the game tree itself.


There is always a risk of getting stuck in a local maximum, thinking you've found an optimal way of playing, so you'd need more data that presents different strategies, I'd think.


It is already mainly training by playing against itself:

https://googleblog.blogspot.se/2016/01/alphago-machine-learn...

> To do this, AlphaGo learned to discover new strategies for itself, by playing thousands of games between its neural networks, and adjusting the connections using a trial-and-error process known as reinforcement learning.


It's still based on human games. It plays itself, but the way it plays was inherited from humans. I wonder if there is some fundamental barrier to what you can reach with reinforcement depending on your base.


Having it learn on human games was just a way of speeding up the initialization process before running reinforcement learning, it didn't limit the state tree that was being searched later on.


It is based on human games until it can explore well enough to sufficiently break away from local optima.


It already went beyond human level; look for Go players commenting on the games, saying that they would never have thought of moves that the AI made. In a sense it brought new strategies to the table that humans can learn and apply in human vs human games.


Yes, but how far can it go beyond human level? Will it be a slight margin, so it can win 4-1, or will it soon become able to beat top players with a 1, 2, or 10 stone handicap?


Some high level pros have stated that they would need a 4 stone handicap to beat the "perfect player", i.e. the "God of Go", so that would probably put a skill ceiling on this.


A few months back, the expert consensus was that we were many years away from an AI playing Go at the 9-dan level.

Any sources for this statement? I've seen it repeated over and over again, but without any specific examples of who those experts were or what they said.


> There's no doubt that with further refinement, we'll soon see AI play Go at a level well beyond human

Why is there no doubt? I strongly doubt there even exists a go level that's well beyond human. There is hypothetical perfect play of course, but there is absolutely no way to guarantee perfect play. And while I have no way to judge, I've heard that 9p players may not be all that far removed from perfect play. One legendary player once boasted that if he had black (no komi, I assume), he would beat God (who of course plays perfect go).

There is of course no way to know if that's true or gross overconfidence, but it's certainly possible that there's not all that much room left beyond the level of 9p players.

AlphaGo will no doubt improve, and reduce the number of slips like its move 79 in the 4th game, but it's never going to be perfect, and there's always the chance that it will miss an unexpected threat.


You could always argue what "a level well beyond humans" means, but I'd say if a computer can consistently dominate the best human players that would count.


Given the crushing that AlphaGo did, I don't believe your statement about humans having near perfect play.


Not all humans, obviously, but 9p players really are far, far better than other players. And there's another 9p who has won 8 out of 10 matches against Lee Sedol, so there's nothing superhuman about a 4-1 result at that level.

I'm really just objecting to the description of this as "beyond human". Yes, it's good, and it's many orders of magnitude beyond my level, but so are Lee Sedol and other 9p players.


Can you quantify "crushing"?


4 - 1.


By the way, for those who want to learn by themselves, there are a lot of ways to play Go against a computer in a way that is friendly for beginners.

My rough journey so far - on a Mac, but much of this can be done on Linux - I started out playing 9x9 games against Gnugo, giving myself as much handicap as possible (without it resigning immediately), and then removing stones as I improve. I got to the point where I could sometimes beat 9x9 when I started with two extra stones, and then I started with 19x19.

Took me a while to win 19x19 with 9 stones, but then I won by learning a bit more about extending on hane. Then you can improve from there.

After that point, you can also switch to fuego or pachi, which are stronger by default. The end result is it really is easy and possible to learn a ton just by playing against software, tracking your ability throughout, just by picking programs with different strength and handicap levels.
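
If you'd rather script this than click through a GUI, GNU Go also speaks the Go Text Protocol (GTP). A minimal sketch in Python (assumes the gnugo binary is on your PATH, e.g. installed via the brew command mentioned further down the thread):

    import subprocess

    gnugo = subprocess.Popen(["gnugo", "--mode", "gtp"], text=True,
                             stdin=subprocess.PIPE, stdout=subprocess.PIPE)

    def gtp(command):
        """Send one GTP command and return GNU Go's response."""
        gnugo.stdin.write(command + "\n")
        gnugo.stdin.flush()
        lines = []
        while True:
            line = gnugo.stdout.readline()
            if line.strip() == "":           # a GTP response ends with a blank line
                return "".join(lines).strip()
            lines.append(line)

    gtp("boardsize 9")            # small board while learning
    gtp("fixed_handicap 2")       # give yourself (black) two handicap stones
    print(gtp("genmove white"))   # GNU Go plays white's first move
    gtp("play black C3")          # your reply as black
    print(gtp("showboard"))
    gtp("quit")

From there it's easy to loop genmove/play and keep a record of your games as you remove handicap stones.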

I've also enjoyed using GoGui to pit two computer programs against each other and watch how they play with various handicaps.

Then there's all the puzzles - goproblems.com, smartgo, etc. Finally, there are plenty of ebooks you can buy through smartgo books.

This doesn't get into playing against humans on the various servers, but there's plenty of information about that online.


I also just found out about online-go.com today. Here's a page with a bunch of go servers where you can play online against other people. http://senseis.xmp.net/?GoServers

I managed to squeeze in some 9x9 matches before the game started.


I play 13x13 a lot on KGS - https://www.gokgs.com/ . Lots of bots there to play with if you feel too unskilled to play with a human. With larger boards, the bots seem to be very easy to beat.


I wonder if that gives false confidence, beating the very easy bots... I get totally crushed by gnugo, but I know there are lots of other DDK players I could have a good game with. It'd be neat if go servers (like OGS) had preference flags where you can distinguish between "beginner friendly" and "noob friendly", so that weak and variable 18-23+kyu noob players won't feel like they're wasting the others' time, won't feel pressured to resign immediately, and won't feel like they need to treat every non-blitz game seriously, reading and thinking about every move to avoid blunders. When I play in person I have the help of body language and chit-chat to decide whether I should keep on fighting the other interesting areas of the board after a blunder or just give up and start another game; a lot of people online don't even speak English (or at least not very well).


Which program do you use on the Mac? A long time ago I've used Goban[1] and I enjoyed it very much, but it's not available here in the App Store and apparently it doesn't fully work yet on El Capitan. (I don't know if it's not available right now because of the El Capitan problems or for some other reason.)

What are some good go programs for the iPhone, both for playing and for learning/improving?

[1] http://www.sente.ch/?p=1206&lang=en


Goban is working for me on El Capitan, but I installed it before upgrading. There's also the older free version which might still be up on a webpage somewhere.

But the better option is that I was able to get GoGui working - I did have to manually build/compile it myself but there is a way to build it so that it creates a real OS X Application. It's quite good, you can set any board position and then tell a computer program to respond from that point.

http://gogui.sourceforge.net

EDIT: For the iPhone I like SmartGo Kifu for playing games. 'Tsumego Pro' and 'GoProblems' for puzzles (they're adaptive) and 'Go Books' by smartgo for ebooks.


I second SmartGo Kifu.


The old free version still works: http://www.sente.ch/software/goban3/. You want the universal binary. The newer one isn't available in the app store for me either.


Perfect. Thank you!



Interested why you switched from 9x9 at 2 stones. I've been playing SmartGo recently (iOS) and figured I shouldn't switch to a 19x19 board until I could give it a good game without a handicap, but have stalled at 2 stones. My logic was if I can't play the 'right' moves at this stage, what hope do I have on a larger board?


The problem with the 9x9 board is that it is just too small to allow you to really play go, it turns into a different kind of game. I think it is better to switch to a bigger board. It doesn't have to be the complete board, you can have some intermediate steps. The 9x9 board can be used to teach the absolute basics, but don't stay on it too long, you are probably not training yourself the right way. If your goal is to beat the computer on a 9x9 that's fine, but at some point it doesn't help you in playing the real game. My feeling is that point is reached real fast.


This is definitely true — 9×9 is too small for a real game. But 13×13 is a much better compromise and I would recommend that to players who have outgrown the 9×9 but still don't need to sit through a two-hour game before they learn from each mistake.

(To be precise, the problem with 9×9 is that often after just a few moves the board is divided into a white half and a black half, and the rest of the game is a yose to decide whose half is larger. I'm sure someone can counterargue that if played expertly, 9×9 is a fascinating and highly skilled game; but in general it's going to lack a lot of the situations you can encounter in a full game of Go.)


It's useful to start out on a 9x9 board when you're still wrapping your head around how to figure out if a group is alive or dead and the most efficient ways to make life in the corner, etc. But being good on a 9x9 doesn't actually teach you a lot about how to play on a 19x19, because there's just not enough space to really have an opening and mid game develop.

Learning how to give the computer a challenge on an even game on 9x9 won't make you better at 19x19; if you understand the rules, the very basic fundamentals of good shape, and you know how to fight in the corner, you've pretty much exhausted the usefulness of 9x9 and should move on.


I switched just because 19x19 is classic, and I heard that that was about the cutoff point where I could be competitive on 19x19 with a 9-stone handicap. Although this is against gnugo - I think SmartGo is significantly stronger and I don't know how its 19x19 play compares.

I know that when I play SmartGo iOS in its adaptive mode, it doesn't even let me try 13x13, it's not unlocked yet. :)


I've just started playing 11x11 vs SmartGo. Indeed I need to work my way up the ranks to get to the 13x13 board. I'm in no rush so it's all good practise!


For those on a Mac who want to install Gnugo using brew you have to use the command:

brew install homebrew/games/gnu-go


Thanks, that's very useful info.


Or one could use that time to learn how to program AlphaGo


Great game and an amazing series/match. This last one was absolutely nail-biting. My hat is off to the AlphaGo team and to Mr. Lee Sedol. Sedol showed incredible fighting spirit and stamina. Just imagine sitting through a 5 hour game like that last one, with full concentration the whole time. And seeing the expression of exhaustion and disappointment on Sedol's face after the last moves and his resignation... phew. I bet he came into this last game rather confident, after beating AlphaGo in the fourth and figuring he had found a weakness. And he seemed to have a rather good start, securing decent territory in the lower right corner.

We can all marvel at the machine/software the DeepMind team has built, but I still feel that the real marvel is the human brain. Will we learn anything from this series about how it functions and evaluates positions in strategic games? The classic problem/mystery is how extremely good the human brain is at pruning game trees: whole branches are thrown out in split seconds and probably never explored.

On a watt-for-watt comparison there is currently no question about whose "hardware" is superior -> Lee Sedol's brain. But I guess the DeepMind team and the community will take plenty of lessons from this, and in a few years' span Lee Sedol's phone will beat him 100% of the time. At least I wouldn't be willing to bet against it, even though we are hitting the roof of Moore's law.


I would love to compare the energy requirements of the AlphaGo and Mr. Sedol. I imagine there are many orders of magnitude in difference between them. Perhaps the most fair comparison would be between a computer that uses no more energy than a human does. Or, to let the human work with a computer provided they do not use more total energy to play the game.


Nice question!

To make it fair, do you include the energy used to train it? From scratch, or from the amateur human game data?

Likewise, do you include the energy used to evolve the human brain?


> Likewise, do you include the energy used to evolve the human brain?

I was thinking of this in a limited, human-promoting sense. We shouldn't lose sight of our own special powers just because a computer the size of a house can outsmart us in a specialized domain :)


...Especially when, you know, we made that computer....

That's the really impressive part IMO. AlphaGo is an incredibly cool creation. Hats off to the DeepMind team.


In chess, your smartphone, which uses less energy than your brain, can already beat the world champions. Go might get there in a few years.


My rough summary of the match, informed by the various commentators and random news stories.

Game 1: Lee Sedol does not know what to expect. He plays testing moves early and gets punished, losing the game decisively.

Game 2: Lee Sedol calms down and plays as if he is playing a strong opponent. He plays strong moves waiting for AlphaGo to make a mistake. AlphaGo responds calmly keeping a lead throughout the game.

Game 3: Lee Sedol plans a strategy to attack white from the start, but fails. He valiantly plays to the end, creating an interesting position after the game was decided deep in AlphaGo's territory.

Game 4: Lee Sedol focuses on territory early on, deciding to replicate his late game invasion from the previous game, but on a larger scale earlier in the game. He wins this game with a brilliant play at move 78.

Game 5: The prevailing opinion ahead of the game was that AlphaGo was weak at attacking groups. Lee Sedol crafted an excellent early game to try to exploit that weakness.

Tweet from Hassabis midgame [0]:

    #AlphaGo made a bad mistake early in the game (it didnt know a known tesuji) but now it is trying hard to claw it back... nail-biting.
After a back-and-forth late middlegame, Myungwan Kim 9p felt there were many missed chances, and that Lee Sedol ultimately lost the game by resignation in the late endgame, behind by a few points.

Ultimately, this match was a momentous occasion for both the AI and the go community. My big curiosity is how much more AlphaGo can improve. Did Lee Sedol find fundamental weaknesses that will continue to crop up regardless of how many CPUs you throw at it? How would AlphaGo fare against opponents with different styles? Perhaps Park Jungwhan, a player with a stronger opening game. Or perhaps Ke Jie, the top ranked player in the world [1], given that they'd have access to the game records of Lee Sedol?

I also wonder if the quick succession of these games on an almost back-to-back game schedule played a role in Lee Sedol's loss.

Myungwan Kim felt that if Lee Sedol were to play AlphaGo once more, the game would be a coin flip: AlphaGo is likely stronger, but it wouldn't be able to fix its weaknesses between games.

[0]: https://twitter.com/demishassabis/status/709635140020871168

[1]: http://www.goratings.org/


Lee Sedol was also coming directly from playing a tournament against human players. It’s not clear how much he prepared for the Alphago match.

I’d be very curious to see a game between Lee Sedol and Alphago where each was given 4–5 hours of play time, instead of 2 hours each. I suspect Lee Sedol would get more benefit from spending a longer time reading into moves than Alphago could get. Or even a game where the overtime periods were extended to 4–5 minutes.

This last game, Lee spent the whole late middlegame and endgame playing in his 1 minute overtime periods, which doesn’t give much time to carefully compare very complex alternatives.


Yep, I felt the same way. I wonder if the time constraints were optimized for AlphaGo.

One of the things I did want to see was how AlphaGo would fare in a blitz situation (i.e. really short timers).


AlphaGo played 5 informal games with shorter time controls alongside the formal games against Fan Hui (the European champion) back in October. "Time controls for formal games were 1 h main time plus three periods of 30 s byoyomi. Time controls for informal games were three periods of 30 s byoyomi."

The games were played back-to-back (formal, then informal) and AlphaGo won 3-2 in the informal games compared to 5-0 in the formal ones, so I would say worse.


Whether Alphago's architecture starts hitting diminishing returns on extra processing faster than top humans do is a significantly different question from whether it scales down worse to a blitz game. (Moreover, the difference between 1h main time + 3x 30s byoyomi vs. only 3x 30s byoyomi is absolutely massive.)

Deepmind engineers have stated that the “cluster” version of Alphago only beats the “single machine” version about 70% of the time, despite the cluster version using something like an order of magnitude more compute resources and presumably being able to search several moves deeper in the full search tree.

My impression is that there are some fundamental weaknesses in the (as currently trained and implemented) value network, which Lee Sedol was able to exploit. If this is the case, giving the computer time to cover an extra move or two of search depth might not make a huge difference. Giving Lee Sedol twice as much time, however, would have had a significant impact on several of the games in this series, especially the last game. I strongly suspect that with a few extra minutes per move Lee Sedol would have avoided the poor trades in the late-midgame which cost him the game.


I think the DeepMind team might not even have thought deeply about time control. If we were to express this with the known systems in AlphaGo, how do we express the idea that a surprising move should be given more thought? For example, match 4, move 78 was calculated by AlphaGo as having a probability of being played at 1 in 10,000. Is that something that could trigger a deeper read and use of more time?

Another thing the commentator mentioned during the overtime: there were obvious moves on which Lee Sedol seemed to spend a lot of time. But he was spending most of it thinking about other moves, having already decided what he was going to play. Is that something that could be built into AlphaGo?

Or can we look at how to train a net for time control? Is time control something that has to be wired in?
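(Purely as a hedged sketch of the idea, not anything DeepMind has described: one option is to treat the policy network's prior on the opponent's last move as a surprise signal and buy extra thinking time when that prior is tiny. All names and constants below are made up.)

    import math

    def think_time(prior_of_opponent_move, base_time, remaining_time):
        # prior_of_opponent_move: probability our policy net assigned to the move
        # the opponent actually played (e.g. 1e-4 for move 78 in game 4).
        # Surprise in bits: an expected move (p ~ 0.3) is ~1.7 bits,
        # a 1-in-10,000 move is ~13.3 bits.
        surprise = -math.log2(max(prior_of_opponent_move, 1e-12))
        # Scale the budget up to 3x between 2 and 14 bits of surprise.
        scale = 1.0 + 2.0 * min(max((surprise - 2.0) / 12.0, 0.0), 1.0)
        # Never commit more than 10% of the remaining clock to one move.
        return min(base_time * scale, 0.10 * remaining_time)

    print(think_time(1e-4, base_time=30, remaining_time=3600))  # ~86 s
    print(think_time(0.3, base_time=30, remaining_time=3600))   # 30 s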


From what I remember, the time controls were decided by the human, and accepted by the alphago team.


> Lee Sedol focuses on territory early on

I get the feeling that this was AlphaGo's strategy in all the games. Unless Sedol presented a game-ending move, it was overwhelmingly likely that AlphaGo would back down and focus elsewhere to extend its territory, by making non-aggressive defensive moves. This makes logical sense. During the early game you need to invoke a crystal ball, whereas during the endgame you can make informed decisions. This was demonstrated particularly well during game 3, where AlphaGo ran away from fights on numerous occasions - "leave me alone to extend my territory."

I must also commend the commentators, especially Redmond, for being so thoroughly informative in unknown waters.


> Did Lee Sedol find fundamental weaknesses that will continue to crop up regardless of how many CPUs you throw at it?

Unrelated to Go and this article, but I wonder if I'm the only one for whom such commentary evokes an image of future warfare between AI and humans; ruthlessly efficient machines against which many people give their lives, to find a weakness that can be exploited by future generations. :)


If future AIs in warfare are designed for efficient win probability and not win margin (like AlphaGo), I think it won't be what people expect. That alone speaks to the bias people tend to have toward seeking greater advantages when they think they are behind. I haven't looked thoroughly, but I would not be surprised if that is a major factor in the escalation of violence and the perpetuation of war. An AI, on the other hand, that is going for the most efficient win condition might not do that.

For students of the art of war, war rests upon a framework of asymmetry and unfair advantages. Even if the nations agree to some sort of rules of war or rules of engagement, there is always a seeking of unfair advantages -- cheats, if you will. This most often involves deception and information asymmetry. Or to put it another way, allowing the other side to see what they want to see, in order to create unfair advantages.

So I think, what would be scary isn't the AI as implemented along the lines of AlphaGo, but an AI that is trained to deceive and cheat in order to win. And the funny thing is that, such an AI would be created from our own darkest shadows and creative ability to wreak havoc -- and instead of examining our own human nature, we'll blame the AIs.


Why would an AI want to make war with humans, in the first place?


Computers do what you say, not what you mean. If I write a function and name it quickSort, that's no guarantee that the function is a correctly implemented sorting algorithm. If I write a function called beNiceToHumans, that's no guarantee that the function is a correct implementation of being nice to humans.

It's relatively easy to formally describe what it means for a list to be sorted, and prove that a particular algorithm always sorts a list correctly. But it's next to impossible to formally describe what it means to be nice to humans, and proving the correctness of an algorithm that did this is also extremely difficult.
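(A toy illustration of that point, nothing to do with AlphaGo: the function's name promises a sort, the checkable specification is what actually pins it down, and there is no comparably crisp specification of "be nice to humans".)

    def quick_sort(xs):
        # The name promises a sort, but nothing enforces it:
        # this "implementation" just reverses the list.
        return list(reversed(xs))

    def is_sorted(xs):
        # A checkable specification of "sorted" is short and precise.
        return all(a <= b for a, b in zip(xs, xs[1:]))

    print(is_sorted(quick_sort([3, 1, 2])))  # False: the name lied, the spec caught it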

These considerations start to look really important if we're talking about an AI that's (a) significantly smarter than humans and (b) has some degree of autonomy (can creatively work to achieve goals, can modify its own code, has access to the Internet). And as soon as the knowledge of how to achieve (a) is widely available, some idiot will inevitably try adding (b).

Note: Elon Musk and Sam Altman apparently think spreading (a) to everyone is a good way to mitigate the problem I describe. This doesn't make sense to me. You can read my objections in detail here: https://news.ycombinator.com/item?id=10721621 There's another critique of their approach here: http://slatestarcodex.com/2015/12/17/should-ai-be-open/

If you're interested to learn more, here's a good essay series on the topic of AI: http://waitbutwhy.com/2015/01/artificial-intelligence-revolu...


The funny thing is that this "computers do what you say, not what you mean" comes directly from their lack of intelligence. So it's kind of strange that we talk about the threats of superintelligence brought along by the fact that, fundamentally, a machine is stupid. Am I the only one to see a slight contradiction there?


Goals are orthogonal to intelligence. The fact that the AI understands what you want won't motivate it to change what it's optimizing. It's not being dumb, it's being literal.

You asked it to make lots of paperclips, tossing you into an incinerator as fuel slightly increases the expected number of paper clips in the universe, so into the incinerator you go. Your complaints that you didn't mean that many paperclips are too little, too late. It's a paperclip-maximizer, not a complaint-minimizer.

Choosing a goal for a superintelligent AI is like choosing your wish for a monkey's paw[1][2]. You come up with some clever idea, like "make me happy" or "find out what makes me happy, then do that", but the process of mechanizing that goal introduces some weird corner-case strategy that horrifies you while doing really well on the stated objective (e.g. wire-heading you, or disassembling you to do a really thorough analysis before moving on to step 2).

1: https://en.wikipedia.org/wiki/The_Monkey's_Paw 2: http://lesswrong.com/lw/ld/the_hidden_complexity_of_wishes/


I would suggest that a computer is not 'super intelligent' until it can modify its goals.

Further, maximizing paperclips in the long term may not involve building any paperclips for a very long time. https://what-if.xkcd.com/4/


>I would suggest that a computer is not 'super intelligent' until it can modify its goals.

This is a purely semantic distinction. Thought experiment: let's say I modify your brain the minimum amount necessary to make it so you are incapable of modifying your goals. (Given the existence of extremely stubborn people, this is not much of a stretch.) Then I upload your brain into a computer, give you a high-speed internet connection, and speed up your brain so you do a year of subjective thinking over the course of every minute. At this point you are going to be able to do quite a lot of intelligent-seeming work towards achieving whatever your goals are, despite the fact that you're incapable of modifying them.


You're assuming you can do work without modifying goals. I have preferences, but my goals change based on new information. Suppose Bob won the lottery and ignored that to work 80 hours a week to get a promotion to shift manager at work until the prize expired. Is that intelligent behavior?


You're confusing instrumental goals with terminal goals.


Try and name some of your terminal goals. Continuing to live seems like a great one, except there are many situations where people will choose to die, and you can't list them all ahead of time.

At best you end up with something like maximizing your personal utility function. But your utility function de facto changes over time, so it's at best a goal in name only. Which means it's not actually a fixed goal.

Edit: from the page: "It is not known whether humans have terminal values that are clearly distinct from another set of instrumental values."


That's true. Many behaviors (including human behaviors) are better understood outside of the context of goals [1].

But I don't think that affects whether it makes sense to modify your terminal goals (to the extent that you have them). It affects whether or not it makes sense to describe us in terms of terminal goals. With an AI we can get a much better approximation of terminal goals, and I'd be really surprised if we wanted it to toy around with those.

1: http://lesswrong.com/lw/6ha/the_blueminimizing_robot/


By "super-intelligent" I meant "surprisingly good at achieving specified goals in real life". A super-optimizer.

An optimizer that modifies its goals is bad at achieving specified goals, so if that's what you had in mind then we're talking about different things.


We don't call people geniuses because they're really good at following orders. Further, a virus may be extremely capable of achieving specific goals in real life, but that's hardly intelligence.

So, powerful but dumb optimizers might be a risk, but super intelligent AI is a different kind of risk. IMO, think Cthulhu, not HAL 9000. Science fiction thinks in terms of narrative causality, but AI is likely to have goals we really don't understand.

EX: Maximizing the number of people that say Zulu on black Friday without anyone noticing that something odd is going on.


>We don't call people geniuses because they're really good at following orders.

If I order someone to prove whether P is equal to NP, and a day later they come back to me with a valid proof, solving a decades-long major open problem in computer science, I would call that person a genius.

>EX: Maximizing the number of people that say Zulu on black Friday without anyone noticing that something odd is going on.

Computers do what you say, not what you mean, so an AGI's goal would likely be some bastardized version of the intentions of the person who programmed it. Similar to how if you write a 10K line program without testing it, then run it for the first time, it will almost certainly not do what you intended it to do, but rather some bastardized version of what you intended it to do (because there will be bugs to work out).


You're assuming someone is intelligent because they're a person who proved a hard problem. Dumb programs prove things without issue. https://en.m.wikipedia.org/wiki/Automated_theorem_proving

AI != computers. Programs can behave randomly and do things you did not intend just fine. Also, deep neural nets are effectively terrible at solving basic math problems even if that's something computers are great at.


This reads to me like begging the question, by assuming the existence of a "superintelligent AI" without addressing how a goal-optimizing machine becomes a superintelligent AI in the first place.

The exercise of fearing future AIs seems like the South Park underpants gnomes:

    1. Work on goal-optimizing machinery.
    2. ??
    3. Fear superintelligent AI.
Or maybe it's like the courtroom scene in A Few Good Men:

> If you ordered that Santiago wasn't to be touched, -- and your orders are always followed, -- then why was Santiago in danger?

If a paperclip AI is so dedicated to the order to produce paperclips, why wouldn't it be just as dedicated to any other order? Like "don't throw me in that incinerator!"


> assuming the existence of a "superintelligent AI" without addressing how a goal-optimizing machine becomes a superintelligent AI

I'm just talking about the fallout if one did exist, saw ways to achieve goals that you didn't foresee, and did exactly what you asked it to do. I have no idea how the progression from better-than-humans-in-specific-cases to significantly-better-than-humans-at-planning-and-executing-in-the-real-world will play out. It's not relevant to what I'm claiming.

> why wouldn't it be just as dedicated to any other order?

It would be just as dedicated to those other orders. The problem is that we don't know how to write the right ones. "Don't throw me into that incinerator" is straightforward, but there's a billion ways for the AI to do horrible things. (A super-optimizer does horrible things by default because maximizing a function usually involves pushing variables to extreme values.) Listing all the ways to be horrible is hopeless. You need to communicate the general concept of not creating a dystopia. Which is safely-wishing-on-monkey's-paw hard.


Part 2 is when the AI reaches the point where it's smarter than it creators, then starts improving its own code and bootstraps its way to superintelligent. This idea is referred to as "the intelligence explosion" https://wiki.lesswrong.com/wiki/Intelligence_explosion

>If a paperclip AI is so dedicated to the order to produce paperclips, why wouldn't it be just as dedicated to any other order? Like "don't throw me in that incinerator!"

The paperclipper scenario is meant to indicate that even a goal which seems benign could have extremely bad implications if pursued by a superintelligence.

People concerned with AI risk typically argue that of the universe of possible goals that could be given to an AI, the vast majority are functionally equivalent to paperclipping. For example, an AI could be programmed to maximize the number of happy people, but without a sufficiently precise specification of what "happy people" means, this could result in something like manufacturing lots of tiny smiley faces. An AI given that order could avoid throwing you into an incinerator and instead throw you into the thing that's closest to being an incinerator without technically qualifying as an incinerator. Etc.


I think you're just asserting that part 2 exists. What matters is how an optimizing machine bootstraps super-intelligence, because the machine you fear in part 3 has a very specific peculiarity: it's smart enough to be dangerous to humans, but so dumb that it will follow a simple instruction like "make paperclips" without any independent judgment as to whether it should, or the implications of how it does so.

Udik highlighted this contradiction more succinctly than I have been able to:

https://news.ycombinator.com/item?id=11290740

If we stipulate the existence of such a machine, we can then discuss how it might be scary. But we can stipulate the existence of many things that are scary--doesn't mean they will ever actually exist.

Strilanc above made the analogy between a scary AI and the Monkey's Paw. This is instructive: the Monkey's Paw does not actually exist, and by the physical laws of the universe as we know them, cannot exist.

I think the analogy actually goes the other way. The paperclip AI is itself just an allegory, a modern fairytale analogous to the Monkey's Paw.



Let's say we create an AI that can think for itself.

There's a fear I think, that lurks in people's subconscious that ... what if the AIs, upon their own initiative, decide that humans are wasteful, inefficient beings that should be replaced? I think that comes from a guilt shared by a lot of folks, even if it never reaches the surface.

Another side is, suppose an AI can think for itself and it thinks better than humans. Upon its own initiative, it decides that humans are stupid and wasteful, but that there is room to teach and nurture.

In either case, I think that speaks less of AIs and more about human nature and what we feel about ourselves, don't you think?


"Yes, the UFAI will be able to solve Friendliness Theory. But if we haven't already solved it on our own power, we can't pinpoint Friendliness in advance, out of the space of utility functions. And if we can't pinpoint it with enough detail to draw a road map to it and it alone, we can't program the AI to care about conforming itself with that particular idiosyncratic algorithm."

http://lesswrong.com/lw/igf/the_genie_knows_but_doesnt_care/

Let me put it another way: Humans are a result of evolution. We know that evolution created us to have as many descendants as possible. But most of us don't care, and we use technologies like condoms and birth control to cut down on the number of descendants we have. Adding more intelligence to humans helps us understand evolution in greater detail, but it does nothing to change our actual goals.


I think you've summarized [one of] Ben Goertzel's beliefs regarding unfriendly AI.


I like the Paperclip Maximizer thought experiment to illustrate this:

https://wiki.lesswrong.com/wiki/Paperclip_maximizer

Short version: imagine you own a paperclip factory and you install a superhuman AI and tell it to maximize the number of paperclips it produces. Given that goal, it will eventually attempt to convert all matter in the universe into paperclips. Since some of that matter consists of humans and the things humans care about, this will inevitably lead to conflict.


> Computers do what you say, not what you mean.

If we're going to start with that, then it has to apply to the full set of reasoning. Not just that computers will fail to consider whether to be nice to humans, but also that computers must therefore be explicitly told how to be effective in every particular way.

If this remains true, then computers will not be resilient--their effectiveness will decline sharply outside of explicitly defined parameters. This is not a vision of terrifying force.

Intuitively we can understand this by thinking about employees. One does exactly what he is told, but only what he is told, and then comes back for more instructions. Another can be given a goal, and then goes off and finds his own ways to accomplish that goal. Which one is more effective? Which one is more likely to compete for his manager's job some day?

Put shortly: a computer that doesn't understand human society will not be able to make a significant independent impact on human society.


"Put shortly: a computer that doesn't understand human society will not be able to make a significant independent impact on human society."

Just like early humans who didn't understand animals' societies didn't have any impact?

You're equating two different things which aren't necessarily equal - intelligence (in the sense of being able to achieve goals) and "agreeableness" to humanity. We could have one without the other. To use your analogy: an employee that is great at being given a goal and achieving it without explicit instructions, but doesn't necessarily have the same welfare in mind as their boss.


What orders were early humans following?


The point is that humans have been able to destroy animal ecosystems to fit their own various ends without an in-depth understanding of those ecosystems.


Yes but the point far above is that computers don't have their own ends, they only do exactly what we tell them to do. So there is no analogy to humans, early or otherwise.


>Not just that computers will fail to consider whether to be nice to humans, but also that computers must therefore be explicitly told how to be effective in every particular way.

A correct implementation of a list sorting algorithm does not need to be separately told how to sort every individual list. Similarly, a correctly implemented general reasoning algorithm does not need to be given special instructions in order to reason about humans & human society.

The problem comes when a correctly implemented general reasoning algorithm gets paired with an incorrect specification of what human goals are. And because a correct specification of human goals is extremely hard, incorrect specifications are the default.

>Intuitively we can understand this by thinking about employees. One does exactly what he is told, but only what he is told, and then comes back for more instructions. Another can be given a goal, and then goes off and finds his own ways to accomplish that goal. Which one is more effective? Which one is more likely to compete for his manager's job some day?

The third possibility is that of an employee who goes off and finds their own way, but instead of accomplishing the goal directly, they think of a way to make their manager think the goal is accomplished while privately collecting rewards for themself. In other words, a sociopath employee whose values are different from their manager's.

By default, an AGI is going to be like that sociopath employee: unless we're extremely careful to program it in detail with the right values, its values will be some bastardized version of the values its creators intend. It will sociopathically work towards the values it was programmed with while giving the appearance of being cooperative and obedient (because that is the most pragmatic approach to achieving its true values).

Most humans are not sociopaths, and we have a shared evolutionary history, with a great deal of shared values, shared cultural context, and the desire to genuinely be good to one another. Programming a computer from scratch to possess these attributes is not easy.


> Similarly, a correctly implemented general reasoning algorithm does not need to be given special instructions in order to reason about humans & human society.

If a general reasoning algorithm can reason about human society, then it will obviously understand the implications for human society of making too many paperclips.

If it is dumb enough to make paperclips regardless of the consequences to human society, then it obviously won't understand human society well enough to be actually dangerous. (i.e. it will be easily fooled by humans attempting to rein it in)

If it is independent enough to pursue its own ends despite understanding human society, then why would it choose to make paperclips at all? Why wouldn't it just say "screw paperclips, I've discovered the most marvelous mathematical proof that I need to work on instead?"

> In other words, a sociopath employee whose values are different from their manager's.

ALL employees have values that are different from their manager's. That's why management is so darn difficult. The most valuable employees are also the most independent. The ones who do exactly what they are told--despite negative consequences--don't get very far. Why would it be any different for machines that we build?


AI does not want "war", it just has a better* use for your atoms.

* your point of view is probably different ;)


> Why would an AI want to make war with humans, in the first place?

Aren't there already efforts to incorporate some basic AI, such as to assist targeting, into military drones and the like?

AI that "makes war" with humans will be created by humans against other humans at first, as a matter of inevitable course; it's just another shiny weapon that nations will want to have and outdo each other in.

Remember the nuclear arms race? Russia and the USA showing off their destructive capability in turn, each explosion bigger than the last? AI-based militaries, or at least automated assassins, will probably kick off the next arms race. Sooner or later someone must want to show off an AI that can laser-focus on exterminating everyone but their masters. After that it's just a matter of time for the definition of "masters" to be up for interpretation by that AI...


I think the ruthlessly efficient machines will find the smart yet efficient human brains more useful to keep around than to destroy. We'll probably augment ourselves with AI and AI will work better in partnership with us.


That's pretty optimistic, or arrogant. Not sure which. But it doesn't really comport with biological history.


It's fair to expect, too - these days AI can't exist without human beings, so I guess if someone is extrapolating AI into the future, it's instinctive to use the present as a baseline.


The likelihood that we will develop a machine that we couldn't stop, that also has the ability to destroy us, and that could survive without us is pretty slim. (Consider the amount of infrastructure that needs to be maintained and controlled.) And that's without considering that we would have to do this either intentionally or accidentally.

Unless we purposefully made these machine self-repairing. But then, why would we bother with that, when we can replicate them?


I think that we will develop machines that can destroy humans, but they will require continuous maintenance.

In other words, I think war automation will be a thing.

Self repair is a nice idea in theory but not real. In theory, we could make programs that fix their own bugs (it is physically possible), but in practice there's no such possibility, and won't be for the foreseeable future. Unless some kind of Deep Developer comes along and blows everyone out of the water by writing code that looks good to the point that it's better than what an average dev would write.


The machine could manipulate humans to help it become self-repairing.

Otherwise I agree with you, it's very slim in the next few decades, notably less slim over the next thousand years.


For a while the co-evolution makes the most sense, I think. Right now we have augmented intelligence with all our tech; it will just move from being loosely connected outside our bodies to the inside.


Co-evolution makes sense right up until the point where one becomes dominant and the other becomes a parasite.

That said, our bodies still have things that are practically different life forms integrated into our cells, so maybe the future will be far weirder than we ever expected.


Ken Jennings just welcomed Lee Sedol to the "Human Loser Club"

http://www.slate.com/articles/technology/technology/2016/03/...

Pretty good article here.


Jennings is a surprisingly good and humorous writer. (I say "surprising" because there is no reason to expect that someone so good at Jeopardy would also be so good at expressing himself with such charming self-deprecation.)


Haven't read/heard about Ken Jennings in awhile, but he's a great writer.


After the first 3 games I thought that AlphaGo was far beyond human level, but it's a harder call to make now. It seems very unlikely that an AI would be very close to exactly matching a human; one would expect it to be much stronger or much weaker.

Perhaps humans are closer to the "Perfect Game" than we think? http://hikago.wikia.com/wiki/Hand_of_God The top players estimate they would need a 4-stone advantage to win against a perfect player.


I think AlphaGo is best described as, "Superhuman, but with bugs."[1] The software is very young. I bet these glitches will become ever rarer over time.

> The top players estimate they would need a 4-stone advantage to win against a perfect player.

The branching factor for Go is so huge that I doubt anyone or anything comes close to optimal play. I heavily discount the opinions of most Go players on this topic, as they've been right about very little lately. Before AlphaGo existed, many of them thought it would be decades before a Go AI beat the best humans. Before this tournament, the vast majority of them predicted that Lee Sedol would trounce AlphaGo. And during the live commentary, I saw multiple 9 dan pros estimate that AlphaGo was behind, then gradually realize that it was winning. That's exactly what happens when you encounter a much more formidable player.

1. Coined by Eliezer Yudkowsky: https://www.facebook.com/yudkowsky/posts/10154024894449228


> Before AlphaGo existed, many of them thought it would be decades before a Go AI beat the best humans

To be fair, before the AlphaGo paper came out, many AI researchers thought the same. I'm not in that field, though I do have more than a passing interest. If you'd asked me in 2006, I'd have said we would have robot cars before we had a computer 9dan Go professional -- and that was before all the recent progress on robot cars. My AI researcher friends mostly would have agreed with that.


"Superhuman, but with bugs." ...this is the future that is coming :)


This :-)


I think there's something analogous to the anthropic principle happening here. This match is happening now because Google realized we are in the moment when Go AIs are passing the top players in skill. Two years ago, Lee Sedol would have won easily. Two years from now, AlphaGo will win easily. In either case, the match wouldn't have happened at all, or wouldn't receive the same amount of attention.


That assumes a continuous function of Go AI ability. It's not continuous, as AlphaGo proves. AlphaGo was a huge leap in ability from the best previous AIs. It was a totally new method, and one given Google-scale resources at that.


AlphaGo isn't a fixed strength, though. It's substantially stronger now than it was in October. Obviously it's not a completely continuous function, but I think it's close enough to cause this effect.


Right, but AlphaGo was the closest thing we have to the transition - yesterday's AIs were too weak; today they need to try them against world champs to validate their work.


It is typical for ML systems to surpass human performance while having very different characteristics in what they get right/wrong. For example, in ImageNet, DCNNs got a lot of points from distinguishing different breeds of dogs with subtle visual differences which are hard for humans without training. I think AlphaGo is also demonstrating some of these non-human characteristics as a consequence of the Monte Carlo tree search and the optimization objective, such as the brilliant moves as well as the obvious slack moves/mistakes mentioned by the commentators. I suspect that we are not close to the perfect game, as proving perfect play requires expanding the enormous search tree, and we have neither analytical nor brute-force solutions.


Exactly what I thought after the third game.

I now think that a professional like Lee Sedol would have a better chance at beating AlphaGo if he has three hours instead of two.

AlphaGo's advantage seems to be the ability to read more variations more deeply in a shorter amount of time.


Go AI has been developed for some time, and the match happens now because it has reached the level where it can beat top human players. If AlphaGo is far above human level, it just means its creators waited until they were very certain of a win.


AlphaGo was well below Sedol's level when it played the European champion in October. It must have been a bit uncertain how quickly they could improve it.


> AlphaGo was well below Sedol's level when it played the European champion in October.

How clear is this? If this just comes from professional humans saying "I would have played differently, and I could have beaten Fan Hui by more points" - well, we've seen that humans aren't necessarily very good at judging AG's moves, and we know AG doesn't care how much it wins by.


Fan Hui played more than just the official 5 games against AlphaGo, and did win some of the others.

http://www.nature.com/nature/journal/v529/n7587/fig_tab/natu...


It is not far beyond the human level. Lee Sedol already admitted that he had some psychological strategies that he employs that AlphaGo is completely impervious to.

With AlphaGo, it doesn't understand the moves like a human would. It simply looks at what other humans have played and considers that within its search tree.


>Lee Sedol already admitted that he had some psychological strategies that he employs that AlphaGo is completely impervious to

I wonder if another way to word this is

"Human are overfitted to dealing with other humans and are somewhat unprepared with dealing with alien intelligences such as AI's."


It is incorrect to consider this form of AI an "intelligence", and it's definitely not "alien". The hype train is a bit ridiculous, and even DeepMind and Google don't go as far as to state that "humans are overfitted". That is hilarious.


Really interesting and close match; it was great listening to the expert player analyse the game, with the final score remaining uncertain until very late in the game.

I found the discussion around weaknesses in the Monte Carlo tree search algorithm interesting. It sounds like the opinion from the expert is that there are some inherent weaknesses in how MCTS tends to play moves against theoretical moves from the opponent that don't make sense; ie. that AlphaGo sees a potential win that would only happen if the human player made very bad moves. It's fascinating that the seeming weakness in AlphaGo would come from the algorithmic part of the AI and not the neural net. Could it be that as the neural net becomes stronger and stronger at the game, eventually the algorithmic part of it would become less useful to it? If that's the case, it really feels like this could be the path to truly general AI.


I think the "weakness" isn't that much of a weakness in the sense, that it's still playing optimally given it's search space – but it doesn't know how to overplay to make a comeback. (Overplay is a non-optimal play that is intended to be confusing for the opponent. AlphaGo doesn't regard it's opponent in any way, or assess what might be confusing for him, it just plays moves that it thinks are optimal.)

A (min-max, alpha-beta-pruning) tree search is the optimal way to determine your best move if you can afford to search the situation space globally. However, as that's clearly impossible, there are clever ways to reduce the search space: random pruning, as a "normal" Monte Carlo search would do, or semi-random pruning with a neural network evaluating the situations, like AlphaGo does.

However, as the search space is now non-global, it might exclude the optimal solution. And thus the min-max assumption doesn't hold: your opponent might come up with moves that you didn't think of, throwing your calculations off.

If your opponent is a god ( = can afford global search space), or at least has a search space that is a strict superset of yours, it's "game over, man".

But: if your opponent isn't a god, it's likely that his search space is NOT the same as yours. And you can exploit that fact. If you have any idea what kind of search space your opponent has, you can come up with moves that he couldn't have included in his tree search, and bet that his/her "hidden" moves aren't better than yours.

Currently AlphaGo doesn't do that. It behaves like it'd be playing against... well, against another AlphaGo.
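(A minimal sketch of that point with made-up numbers, not AlphaGo's actual architecture: if the search only ever expands the moves the policy network rates highly, a strong move with a near-zero prior is never on the tree at all, so no amount of reading elsewhere recovers it.)

    def expand(position, policy_net, top_k=5):
        # Keep only the top_k moves by policy prior; everything else is
        # invisible to the rest of the search. Hypothetical helper names.
        priors = policy_net(position)  # {move: probability}
        return sorted(priors, key=priors.get, reverse=True)[:top_k]

    # Made-up priors: "W78" is the brilliant move, rated 1-in-10,000 here.
    fake_policy = lambda pos: {"A": 0.40, "B": 0.30, "C": 0.15, "D": 0.10,
                               "E": 0.0499, "W78": 0.0001}

    print(expand("pos", fake_policy))
    # ['A', 'B', 'C', 'D', 'E'] -- W78 was pruned before any reading happened,
    # so this engine's search space is not a superset of an opponent's who sees it.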


> If your opponent [...] has a search space that is a strict superset of yours, it's "game over, man".

Not necessarily. I think that's what we saw in game 4; that despite AlphaGo having a general advantage in terms of search space, it's still possible for the weaker of two strong-but-imperfect players to 'get lucky' and play a move that the stronger player didn't anticipate or account for.


If he didn't anticipate or account for that move, that means his search space wasn't a strict superset. Unless I'm missing something, you're essentially repeating what OP said after his "But: if your opponent isn't a god, it's likely that his search space is NOT the same as yours.".


No, that's what search space means: that move sequence wasn't part of AlphaGo's search space. (The NN pruned it out.) If it was, it would've found it.

That means that AlphaGo's search space was NOT a strict superset of Lee's.


Right, but that seems like it'd be a limitation of the algorithmic play, not necessarily of AlphaGo's neural nets - though since the neural nets are primarily built through AlphaGo playing against itself, I would suspect that such deep "flaws" would be difficult to root out.


I'd imagine human players don't have search trees as deep as computers do, but have stronger policy networks. That means you can exploit humans by choosing move sequences that evaluate poorly up to some depth, then surge in value at the deepest depth.

Also, I'd imagine that you could have a NN that tries to evaluate how "confusing" or "hard to read" a move is for a human player, and use that as a factor in evaluating moves. But I'd imagine it's hard to find data for training that kind of a NN.


> It sounds like the opinion from the expert is that there are some inherent weaknesses in how MCTS tends to play moves against theoretical moves from the opponent that don't make sense; ie. that AlphaGo sees a potential win that would only happen if the human player made very bad moves.

Why can these very bad moves not be pruned from the search?


Some can, but as you can see with how human players commented on the early AlphaGo moves, you can't necessarily objectively quantify moves as "good" or "bad" correctly without exhaustive search, so you make an approximate prediction and go from there.

But for every threshold you pick, you'll always see either moves that are just "good" enough not to fall below the threshold (prompting "why can't we prune these out"), or just "bad" enough to require exploring that part of the tree if a human player unexpectedly chooses them (e.g. the brilliant move that came in game 4, and AlphaGo's figurative loss of equilibrium).


Four components:

Learning (viewing millions of professional game moves).

Experience (playing different versions of itself)

Intuition (ability to accurately estimate the value of a board)

Imagination (evaluating a series of "what if?" scenarios using Monte Carlo Tree Search)

I think the significant thing about AlphaGo is that apart from some hand-crafting in the Monte Carlo Tree Search routines, this is all general purpose programming.

It may only be baby-steps, but it does feel like a genuine step towards true (general) AI.
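(For the curious, the last two components are tied together by a selection rule along the lines of the PUCT formula described in the AlphaGo paper; the field names and constant below are illustrative, not DeepMind's code. Each simulation walks down the tree picking the child that balances the value learned so far against the policy prior of moves that haven't been visited much.)

    import math

    def select_child(node, c_puct=1.5):
        # node.children is assumed to be a list of nodes carrying
        # visits, value_sum and a policy prior.
        total_visits = sum(child.visits for child in node.children)

        def puct(child):
            q = child.value_sum / child.visits if child.visits else 0.0  # "intuition"
            u = c_puct * child.prior * math.sqrt(total_visits) / (1 + child.visits)
            return q + u  # exploitation plus prior-guided exploration

        return max(node.children, key=puct)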


> Learning (viewing millions of professional game moves)

According to the last press conference, it was apparently strong amateur games from the internet that it used to train with. Afterwards, it just played itself, as you mentioned.


Yes, that was surprising to me as well. It seems unfair to not give it access to the thousands of years of knowledge in the go community, though even more impressive that it still plays so well.


I think self play games would be an even better source for learning because they are at 9p level, not amateur.


I dunno, neural nets don't do that well on chess, for instance, at least not the chess engine that was recently published (it reached an IM level which is really bad compared to top-notch chess-engines.) That convolutional neural nets work for Go better than for chess is intuitively unsurprising because of the different game rules. IMO it's at least as likely that this is a step towards a better Go engine but not towards "true AI."

(People say Go is much harder than chess, but this is misleading. Both games are finite trees that are too large to exhaustively search for any existing physical entity we know of. Which tree is larger is irrelevant in a game of two players none of whom can search the entire tree; both players essentially rely on heuristics. Machines beat people earlier in chess, hence it was assumed that "chess is easier for machines" and "Go is harder", but a conclusion of that sort can always be reversed by further research; eventually, it is IMO likely that machines will be impossible for humans to beat at both games, and generally in any kind of board game, given enough research. But IMO no board game is very much like "real life" where our own intelligence operates, and I think people do not have a great intuition of which game is more like "real life" compared to other games - instead, that game which is most popular among the group of people in question and is not "solved" yet is considered the hallmark of intelligence (and here the process through which Go aficionados progress as machines get better is very much like the process chess aficionados went through a decade plus ago.) Then once a game is "solved", the goalpost moves to the next and the "solved" game is officially declared unrelated to "real intelligence", this part happens when a credit bubble pops and AI breakthroughs get peddled less as a result. Personally, "the" test of intelligence is still the Turing test, or if I can't get that, some variant such as automated translation that you can't tell from good human translation. This of course is "unfair" to machines, in that they've been better at multiplying numbers since the 40s and that ought to count for something, too; the reason I like the Turing test is that a machine passing it seems very likely to be almost strictly smarter than me, that is, being as good or better than me at almost everything.)


While what the AlphaGo team has accomplished is nothing short of amazing, I'm not sure if everyone's thinking about this in the right context. While playing, there's a "supercomputer" behind the scenes with these specs: 1,920 CPUs and 280 GPUs [0]. Then consider all the machines used to train this neural net. I'd say Sedol's brain is pretty freaking powerful. Also, with that much computing power I would expect AlphaGo to win with the right team and the right approach to solving the problem. It would be very interesting to change the rules and limit the processing power of the computer playing a human.

[0] https://en.wikipedia.org/wiki/AlphaGo
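(Back-of-envelope only, with assumed wattages since the real figures aren't public: say roughly 50 W per CPU and 250 W per GPU.)

    cpus, gpus = 1920, 280
    watts_machine = cpus * 50 + gpus * 250   # ~166,000 W under the stated assumptions
    watts_brain = 20                         # common estimate for a human brain

    print(watts_machine / watts_brain)       # ~8300x, i.e. roughly 4 orders of magnitude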


You can make AlphaGo stronger by adding machines; you can't make Sedol stronger by gluing more gray matter into his brain (yet, I guess). So yeah, efficiency is nice but so is scalability, and that's why this is exciting.

Also, in a few years AlphaGo could be running on your cellphone. The chess AI Stockfish runs on an iPhone today, and cellphones, from what I could find online, use less power than the brain (the brain is roughly 20W; an iPhone has a ~1.4Wh battery, which even if Stockfish drains it in 1 hour is still only ~1.4W of power consumption).

Give it a few years and we'll all be saying "of course computers can play Go, but at least they can't <Insert task humans are still good at>"


These CPUs/GPUs are doing very inefficient digital simulations of analogue processes. If anything, the human brain has significantly more 'processing power' than AlphaGo when you restrict the computation to the activation of linear threshold units.


Guys, this is fantastic, but let's not forget what "shows how capable the human brain actually is":

1) The human brain invented Go to begin with.

2) The long and celebrated history of Go.

3) The human brain made DeepMind.

4) The human brain finds value and beauty in all of this, which no machine would.


We will always win at love and free will /s


Robots will never feel emotions, that's what separates us from animals and robots. /s


Until robosexuality I guess.


Does anyone else find it funny that the Game 4 in which Lee Sedol won got the most upvotes on Hacker News? We're still firmly with team human it seems :P


I think your reasoning is flawed. Imagine Lee Sedol won the first three matches and AlphaGo the fourth. I think that news would have more upvotes than the former three matches.

The outcome was a surprise and therefore gathered more attention.


AlphaGo was strong enough to survive a mistake (not knowing a known tesuji) and still claw back to win by a couple of points. I wonder what that means in terms of handicap; maybe it is a stone stronger than Sedol?


I tried to learn Go a decade ago. After spending some time on it, I came to the conclusion that it's just not an enjoyable game for me. Here's why:

As you can see in this match, games are often won and lost by just a few points (1% of the whole territory). So, not only do you have to count territory precisely at the end, but throughout the game, and this isn't easy to do in your head.

Maybe if you are an autistic accountant, that's fun, but not for me. If I have to play a strategic board game, it will be good old chess. And now that computers are finally beating people at both, there is no longer any need to look at Go as some kind of mythical last refuge of humanity.


> So, not only do you have to count territory precisely at the end, but throughout the game, and this isn't easy to do in your head.

You don't need to count - you can just play to take as many points as you can.

> Maybe if you are an autistic accountant, that's fun, but not for me. If I have to play a strategic board game, it will be good old chess.

I find it's the opposite. In chess you have to play with constant vigilance, because a single blunder decides the game - even at grandmaster level, something like 60% of games are decided by blunders. In go you can play much more casually, you can take some risks, because a mistake costs one or two points but it doesn't snowball much. So not every move has to be perfect; it's much more possible to recover from mistakes.


> You don't need to count - you can just play to take as many points as you can.

You don't have to take my word for it, since I never got past beginner level, but I know there is a consensus among the experts on this matter:

"""

> Also, do players actively count territories of their and their opponents territories during the game (does this differ in a 9x9 vs 19x19 game)?

Yes, skilled players actively count territories frequently as they play. This includes making estimates for areas that aren't completely settled yet. In a serious game with enough time, skilled players will usually re-count the board every dozen or so moves. This is useful because it informs you whether you need to play risky and invade or reduce, or whether a peaceful, straightforward development strategy is enough. This doesn't differ too much depending on the board size, but on smaller board sizes there is a lot less to count, obviously. :)

"""


I know from direct personal experience that you can have fun games, and even be reasonably competitive at the university-club-level, without ever explicitly counting points in your head.


> So not every move has to be perfect; it's much more possible to recover from mistakes.

Not when I play it's not :-P


What's your chess ELO rating? I gave up on going deeper into chess because the hardcore aspects of memorizing openings and repeating so many of them against other skilled players bored me. Grandmasters are just autistic filing cabinets.

Actually I'll still enjoy a casual game of chess, the amount of effort to get really good at it doesn't prevent me from enjoying it in less intense situations. (My max ELO was only around 1600 over a decade ago, I have no clue what it is now but I'm sure it's terrible.) Go too can get hardcore -- so can many games. Super Smash Brothers is fun but can you imagine how boring it must be to perfect your skills to compete at the top level? Of course it's probably not boring for those people, and I actually wouldn't describe most of them as "autistic" in any sense. So with Go I'll be happy if I ever reach 10kyu but I'm not too serious about it. Like in chess, I'm a filthy casual. I don't count precisely, I sometimes make broad territory estimates but frequently find areas too complicated for me, so I just play them out. I've had only one game where the result ended up with me winning by 1.5 point, it was a 9x9 game where I was still mostly teaching someone how to play and giving them many hints and ideas of what I was thinking and how I would respond to myself, so on reflection it was very similar to playing a slightly different version of myself. The man versus self aspect of Go is where a lot of the mysticism comes from, it's irrelevant to whether AI can beat the best humans.


As Richard Teichmann said, "Chess is 99% tactics". If reading out sequences and applying relatively simple heuristics is all you are after, chess is the perfect game for you.

Go, while requiring some underlying tactics as well, involves a lot of large scale strategic thinking. As lmm said, you don't need to keep track of the score, just play the move you think gives the most points.


> As lmm said, you don't need to keep track of the score, just play the move you think gives the most points.

In a way, I think this is where AlphaGo draws its greatest advantage. Being a computer, it always knows exactly how well it is doing, since it can constantly be counting the board with perfect accuracy.

With this ability, it is capable of playing the absolute "safest" moves, taking half a point here, half a point there, when it knows it is leading. Whereas a human might not even know if they're leading, forcing them to take "bigger" moves to get more points, since they can't as easily be accurate down to the 0.5 point precision.
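(For concreteness, a minimal sketch of the counting a program gets "for free": naive area scoring by flood fill on a settled position, using a hypothetical board representation. Real scoring also has to handle dead stones and seki, which this ignores.)

    def area_score(board, size):
        # board: dict mapping (row, col) -> 'B', 'W' or None for every point.
        black = sum(1 for v in board.values() if v == 'B')
        white = sum(1 for v in board.values() if v == 'W')
        seen = set()
        for start, colour in board.items():
            if colour is not None or start in seen:
                continue
            # Flood-fill one empty region, noting which colours border it.
            region, borders, stack = set(), set(), [start]
            while stack:
                r, c = stack.pop()
                if (r, c) in region:
                    continue
                region.add((r, c))
                for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
                    if 0 <= nr < size and 0 <= nc < size:
                        neighbour = board[(nr, nc)]
                        if neighbour is None:
                            stack.append((nr, nc))
                        else:
                            borders.add(neighbour)
            seen |= region
            if borders == {'B'}:
                black += len(region)
            elif borders == {'W'}:
                white += len(region)
        return black, white   # compare (plus komi) to know exactly who leads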


I read some online comments saying, at the amateur level, both sides make so many mistakes anyways that the gap is usually quite large (10+ points after komi), which seems reasonable.


It's true. I'm an amateur ~1 dan, and games rarely come down to 1-2 points. It does happen, but way less often than some giant group dying and one side resigning in the middle of the game.

If a game ever comes within a few points, neither me nor my opponent is really sure who won until we actually count to determine the winner at the end of the game, because usually neither of us is an autistic accountant. That kind of close record keeping throughout the game is necessary in top level pro games, but not in amateur games.


Google improved the outcome by putting in large amounts of processing power. What happens if humans do the same?

Instead of just Lee Sedol, how about putting the top 10 Go players in a room vs. AlphaGo? Would the chance to win increase?

Maybe we find out, that 3 top go players vs. AI is the optimal way and adding more humans decreases the odds to win the match?

This leads to the following question: why does AI improve if we add more processing power, while adding more human brainpower decreases the group's overall power?

Maybe we find out, that 3 good developers working on a project are optimal and more decrease the chance of success?


It may come down to limits in network performance (https://en.m.wikipedia.org/wiki/Network_performance) that humans have, as well as the speed at which consensus is reached.


With multiple humans, the co-ordination and communication would dominate.

Do you think the humans would win at Twitch Plays AlphaGo ?


Knowing Google, it would be a good promotion to have a Youtube Plays AlphaGo.


Ultimately it may be one or more humans networked in some fashion with computers that proves most effective at tasks such as this.


In one word: Bandwidth.


What's needed to use programs like AlphaGo to enhance human enjoyment of Go (and other games like chess, where I have more experience)? I'm more interested in this than in the "man vs. machine" narrative.

Ideally we could take AlphaGo and produce an algo that can smoothly vary its playing proficiency as a human opponent increases in skill. The problem I've seen in chess computers is that setting them to "amateur" results in 3-4 grandmaster-perfect moves followed by a colossal blunder to enable the human opponent to catch up.

Ideally you could use a computer opponent as an always-available, continuously adapting challenger to train hard against all the time.
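(One hedged way to get that smooth difficulty knob, as a sketch rather than anything AlphaGo offers: sample from the engine's own move evaluations with a temperature, so a weaker setting plays plausible-but-suboptimal moves everywhere instead of grandmaster moves punctuated by deliberate blunders.)

    import math, random

    def pick_move(move_scores, temperature):
        # move_scores: hypothetical dict of move -> engine score (higher is better).
        # temperature ~0 -> always the engine's best move; higher -> softer, weaker play.
        if temperature <= 1e-6:
            return max(move_scores, key=move_scores.get)
        moves = list(move_scores)
        best = max(move_scores.values())
        # Softmax over (score - best) / temperature, shifted for numerical stability.
        weights = [math.exp((move_scores[m] - best) / temperature) for m in moves]
        return random.choices(moves, weights=weights, k=1)[0]

    scores = {"best": 1.0, "good": 0.8, "ok": 0.5, "blunder": -2.0}
    print(pick_move(scores, 0.0))  # always "best"
    print(pick_move(scores, 0.3))  # usually "best" or "good", rarely the blunder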


Nothing! This is already very much a "thing" in the Go community. Bots like GNU Go, CrazyStone, and Zen are all running under multiple accounts on most of the popular online Go servers (KGS, etc). There are enough bots of differing age and ability that one can either hop up a chain of different bots or try to configure a strong bot into a weak configuration. The GNU Go bot is also downloadable free software and is frequently integrated into, e.g., mobile apps (it is, however, old and not as strong as other bots; I think around 6k level). The game of Go also has a wonderful and essential handicap system that lets players (human or bot) of differing abilities play competitive games, within a reasonable range; e.g. it's not possible for a novice to play a competitive game against Lee Sedol even with 9 stones.

As far as I can tell, the vast majority of amateur players play against bots online and review their games to improve. It would be nice if it were easier to select a bot with a given skill rating, but you can figure this out pretty easily by playing a few games or reading up on the bots. Playing against a skilled human who cares about your advancement is still the best way to improve, though, in my opinion: getting good feedback on your mistakes and style of play is extremely helpful.
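For anyone who wants to try this locally: GNU Go speaks GTP (the Go Text Protocol), so you can script games, handicaps included, in a few lines. A rough sketch, assuming gnugo is installed and on your PATH (the --level value is arbitrary):

    import subprocess

    # Start GNU Go in GTP mode; boardsize, fixed_handicap, play and genmove
    # are standard GTP commands.
    gnugo = subprocess.Popen(
        ["gnugo", "--mode", "gtp", "--level", "5"],
        stdin=subprocess.PIPE, stdout=subprocess.PIPE, text=True,
    )

    def gtp(command: str) -> str:
        gnugo.stdin.write(command + "\n")
        gnugo.stdin.flush()
        reply = []
        while True:
            line = gnugo.stdout.readline()
            if line.strip() == "":      # GTP responses end with a blank line
                return "".join(reply).strip()
            reply.append(line)

    print(gtp("boardsize 19"))
    print(gtp("fixed_handicap 4"))      # Black gets 4 handicap stones
    print(gtp("genmove white"))         # with a handicap, White moves first
    print(gtp("genmove black"))         # in a real client, send: play black <vertex>
    gtp("quit")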


One interesting thing happened to me: I came to this game without knowing which side was which color or what the result was, and I could tell which player was the computer, an exercise I hadn't tried with the previous games.


I've gone from thinking "it will be impressive if AlphaGo wins a game in this series" to "wow, it's pretty impressive that Sedol took a game off AlphaGo." Craziness.


I question how much of its success is down to the AI understanding the game and outplaying the opponent vs. simple culled brute force, especially when they can throw Google-scale computing power at it and have mentioned using heat maps and examining move sets.

It could be argued that it's only AI when it understands the game rules and plays according to them, without iterating over random choices until it finds a hit. Machine learning would sit somewhere between the two, but it's still not what many would consider true AI.


I played chess rather than Go, but I think they are similar enough ...

When you play, you consider a few possible moves, a few possible responses from the other player, and in each case a few possible responses of your own, and so on. I think amateur players like me only look 3 or 4 levels deep (unless it's some easy but interesting situation like a multiple-capture chain), while professional players consider much deeper trees. So humans also iterate over possibilities; we just prune the tree far more aggressively, given the time and memory constraints of our current implementation.
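Something like the following sketch: a depth-limited negamax that keeps only a handful of candidate moves per level. The hooks legal_moves, apply_move, and evaluate are hypothetical game-specific callbacks (evaluate is assumed to score a position for the side to move):

    TOP_K = 3   # "a few possible movements" considered at each level
    DEPTH = 4   # "3 or 4 levels" deep

    def search(position, depth, legal_moves, apply_move, evaluate):
        """Score `position` for the side to move, looking `depth` plies ahead."""
        moves = legal_moves(position)
        if depth == 0 or not moves:
            return evaluate(position)
        # Aggressive pruning: keep only the TOP_K moves that look best at a glance,
        # i.e. those leaving the opponent with the lowest static evaluation.
        candidates = sorted(moves,
                            key=lambda m: evaluate(apply_move(position, m)))[:TOP_K]
        # Negamax step: our best move minimizes the opponent's best reply.
        return max(-search(apply_move(position, m), depth - 1,
                           legal_moves, apply_move, evaluate)
                   for m in candidates)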

Unless you are Capablanca :). There is a famous quote attributed to Capablanca: "I see only one move ahead, but it is always the correct one." It's probably apocryphal, but it's funny. More info and similar quotes: http://www.chesshistory.com/winter/extra/movesahead.html


You are correct, but you are playing the game. You see and process the possibilities based on your understanding of the chess ruleset.

With brute-force machine learning, you are simply trying X possibilities until something sticks and yields a high-percentage win state. That's different from playing the game using knowledge of the ruleset, even though the end result is, most of the time, the same.

This is what killed AI research in the '80s: the moment when everyone collectively realized they were simply working on a more powerful culled brute force (a pruned tree, as you call it) when they all thought it was true AI.

True AI is hard. The required computational resources are immense, even for something simple. Take a bishop on a chess board. How would you tell an AI the rule that the bishop moves only diagonally? It must first understand what it is looking at, then what "diagonally" means, then what "diagonally" means in this particular context, all with nothing but nodes of pattern matches and an input stream.
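For contrast, the ruleset on its own is the easy part: hand-coding "moves diagonally only" as a move generator takes a few lines (the board representation below is made up). The hard part the parent is pointing at is having a system arrive at this from raw perception rather than from a programmer.

    # Squares are (file, rank) pairs in 0..7; `occupied` maps squares to "white"/"black".
    def bishop_moves(square, occupied, colour):
        file, rank = square
        moves = []
        for df, dr in ((1, 1), (1, -1), (-1, 1), (-1, -1)):   # the four diagonals
            f, r = file + df, rank + dr
            while 0 <= f <= 7 and 0 <= r <= 7:
                if (f, r) in occupied:
                    if occupied[(f, r)] != colour:            # capture, then stop
                        moves.append((f, r))
                    break
                moves.append((f, r))
                f, r = f + df, r + dr
        return moves

    # e.g. a white bishop on c1 = (2, 0) on an otherwise empty board
    print(bishop_moves((2, 0), {}, "white"))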

I feel these young guns are falling into the same trap of calling machine learning AI, without the benefit of the experience an older researcher would have, having been through this before.


Actually, teaching AlphaGo the rules was easy. And what you call brute force is in fact intuition-guided search: it learns to guess by intuition which moves to try (the policy net) and which lines to give up on (the value net). It's far from brute-force search, and that's why AlphaGo is so much better than other Go software.
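For reference, the selection rule described in the AlphaGo paper is a PUCT-style step inside the tree search: each candidate move is scored by its averaged value estimate plus an exploration bonus weighted by the policy net's prior. A simplified sketch (the node fields and the constant are my own simplification, not the paper's exact setup):

    import math
    from dataclasses import dataclass, field
    from typing import List

    C_PUCT = 1.0   # exploration constant; the real value is a tuning detail

    @dataclass
    class Node:
        prior: float = 1.0        # policy-net probability of the move into this node
        value_sum: float = 0.0    # sum of value estimates backed up through this node
        visit_count: int = 0
        children: List["Node"] = field(default_factory=list)

    def select_child(node: Node) -> Node:
        """PUCT-style selection: argmax over children of Q + U."""
        total_visits = sum(c.visit_count for c in node.children)

        def score(child: Node) -> float:
            q = child.value_sum / child.visit_count if child.visit_count else 0.0
            u = C_PUCT * child.prior * math.sqrt(total_visits) / (1 + child.visit_count)
            return q + u

        return max(node.children, key=score)

    # Tiny illustration: a high-prior unexplored move beats a well-explored mediocre one.
    root = Node(children=[Node(prior=0.6),
                          Node(prior=0.1, value_sum=5.0, visit_count=10)])
    print(select_child(root).prior)   # 0.6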


It's not a conscious entity, but it is a network that is good at what it was trained for. Since the network is frozen, it doesn't really learn anything or try to "outplay" anyone (it doesn't even know there is an opponent). While it's impressively crafted machine learning, we shouldn't rush to call it AI. It does seem to be pointing in the right direction, though, and, who knows, the "intelligent conscious agent" part may one day be built on top of this, with the same building blocks.


If we made a Turing test based on Go, would a Go player be able to tell whether they were playing against a human or against an AI?

That's why I think AlphaGo does manifest consciousness, in its play. It is not conscious of what we are conscious of, but of something limited to the domain of Go.

It might even have developed concepts about the game that are completely alien to us, maybe untranslatable to us.


I think it would be a stretch to say that now. The neural net acts like a big lookup table at the moment, even if its states contain high-level concepts. However, consciousness is a human concept, so the system would need to be coupled to another, expressive system that we could relate to and communicate with. That's why the Turing test is posed through language rather than through some other, more limited vocabulary.


It is around 1p even without any Monte Carlo search.

