I'm going to read the paper carefully and think hard, but going into it I still...

jpapon · on Dec 21, 2017

>> Nobody can assess the entire game tree, so the fact that it is theoretically knowable isn't actually relevant in a game.

That's not the point, though. The point is that in Go or Chess you only need to consider the current state when choosing your next move. This is not the case in an incomplete-information game, where the sequence of moves that led to the current state carries a lot of information about what is hidden from you. Basically, Chess and Go satisfy the Markov property, while Poker does not.

roenxi · on Dec 22, 2017

The current state in Go is the state of the board; the current state in poker includes some historical data. This difference is only significant to humans because we have quite poor memories.

An AI has perfect recall, and the fact that a variable is temporal really doesn't make a difference to the theory.

(As an aside, this is an issue in Go as well because of the ko rule, but it isn't as major a part of the game as it is in poker.)
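
To illustrate the ko aside, here is a minimal Python sketch of folding one step of history into the state so that the legality check depends on the state alone. It assumes the simple ko rule only (not superko), and the names `GoState`, `violates_simple_ko`, and `advance`, as well as the flat board encoding, are hypothetical and chosen just for this example.

```python
from typing import NamedTuple, Tuple

# Hypothetical flat encoding: one character per board point.
Board = Tuple[str, ...]


class GoState(NamedTuple):
    board: Board           # position now, with us to move
    previous_board: Board  # position before the opponent's last move


def violates_simple_ko(state: GoState, next_board: Board) -> bool:
    """True if the candidate move would recreate the position from one move ago."""
    return next_board == state.previous_board


def advance(state: GoState, next_board: Board) -> GoState:
    """Roll the state forward, keeping exactly one step of history."""
    return GoState(board=next_board, previous_board=state.board)
```

With the previous position carried inside `GoState`, nothing outside the state is needed to decide legality, which is the sense in which a temporal variable can simply be treated as part of the state.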

jononor · on Dec 23, 2017

Which AI architecture has perfect recall, in a way that actually makes use of this information? Pretty sure AlphaGo etc. do not make use of memory at all.

antognini · on Dec 21, 2017

It's true that you can't model the entire game perfectly for any interesting game. But to play a game like Go you only need to know the current state in order to play optimally. If you play poker in a way that only accounts for your current state (i.e., you just take an expected value given the cards on the table, the cards in your hand, and the size of the pot), it won't take long for a competent player to learn how to consistently beat you. (A player who ignores history would get tricked by bluffing, for instance.) In order to play correctly you need to model your opponent's strategy by understanding what their past moves were so that you can guess their probability of bluffing.
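
A rough Python sketch of that last point, under heavily simplified assumptions: a single call-or-fold decision, an opponent whose bets are either pure bluffs or value hands, and a history of past showdowns recorded as booleans. The names `call_ev`, `estimate_bluff_rate`, and `should_call`, the smoothing prior, and all the numbers are made up for illustration; this is not how any real poker bot works.

```python
def call_ev(pot, bet, p_win):
    """Expected value of calling `bet` when `pot` already includes the opponent's bet."""
    return p_win * pot - (1 - p_win) * bet


def estimate_bluff_rate(history, prior_bluffs=1, prior_values=1):
    """Smoothed fraction of the opponent's past bets that turned out to be bluffs.

    `history` is a list of booleans, True meaning a bluff was shown down.
    The prior counts are an assumption to avoid dividing by zero.
    """
    bluffs = sum(history)
    return (bluffs + prior_bluffs) / (len(history) + prior_bluffs + prior_values)


def should_call(history, pot, bet, p_beat_value=0.0):
    """Call if the EV is positive, assuming we beat every bluff and beat
    value hands with probability `p_beat_value`."""
    p_bluff = estimate_bluff_rate(history)
    p_win = p_bluff + (1 - p_bluff) * p_beat_value
    return call_ev(pot, bet, p_win) > 0


# Same cards, same pot, same bet; only the opponent's history differs.
tight = [False] * 9 + [True]      # bluffed in 1 of 10 showdowns
loose = [True] * 5 + [False] * 5  # bluffed in 5 of 10 showdowns
print(should_call(tight, pot=150, bet=50))  # False
print(should_call(loose, pot=150, bet=50))  # True
```

The current state is identical in both calls, yet the better decision flips, which is why a player who conditions only on the visible state can be exploited.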