SlideShare a Scribd company logo
1 of 26
Phantom Games




Phantom Games

                                      1. Phantom-games & phantom-go
                                                              2. Maths
                                                        3. Experiments


F. Teytaud, O. Teytaud
TAO, Inria-Saclay IDF, Cnrs 8623, Lri, Univ. Paris-Sud,
OASE Lab,


Korea,
Summer 2011
                                             1
Phantom Games

What are phantom games ?
phantom-X = partial information counterpart of
               (full info) game X,
you're not informed of your opponent's moves
so you might play illegal moves:
  then you're informed they're illegal;
  and you just replay them.

Extremal case: no other information (just illegal moves)
More convenient: a bit more information:
 informed of ataris (in Go)
 see all the locations you can reach (Dark Chess).
                           2
Phantom Games

What are phantom games ?
phantom-X = partial information counterpart of
               (full info) game X,
you're not informed of your opponent's moves
so you might play illegal moves:
  then you're informed they're illegal;
  and you just replay them.

       Example: phantom Tic-Tac-Toe


                           3
Phantom Games

What are phantom games ?
phantom-X = partial information counterpart of
               (full info) game X,
you're not informed of your opponent's moves
so you might play illegal moves:
  then you're informed they're illegal;
  and you just replay them.

                  My opponent plays (I don't know
                  where)

                           4
Phantom Games

What are phantom games ?
phantom-X = partial information counterpart of
               (full info) game X,
you're not informed of your opponent's moves
so you might play illegal moves:
  then you're informed they're illegal;
  and you just replay them.

                               I try this...
                               ==> illegal move!

                           5
Phantom Games

What are phantom games ?
phantom-X = partial information counterpart of
               (full info) game X,
you're not informed of your opponent's moves
so you might play illegal moves:
  then you're informed they're illegal;
  and you just replay them.

                                      I know the
                                      state...
                                       ==> good :-)
                           6
Example: Dark Chess




- Different from Chinese Dark Chess
- Also known as “Fog of War”7
Example: phantom-Go




             8
Phantom Games




A little bit of maths (sorry)

                                            1. Phantom-games & phantom-go
                                                                 2. Maths
                                                            3. Experiments


F. Teytaud, O. Teytaud
TAO, Inria-Saclay IDF, Cnrs 8623, Lri, Univ. Paris-Sud,
OASE Lab,


Korea,
Summer 2011
                                             9
Simple things


Consider a 2-player game with: - finite state space;
                               - one of the two players wins.
Then:
- Full information: one of the player has a
    winning strategy. We can know who by
    Minimax. Possibly 2EXP-complete (Go with
    Japanese rules, Robson's paper).

- Partial information, finite horizon: there exists p,
   Such that player 1 wins with proba p in case of
   perfect play. p is computable.

- Partial information, infinite horizon: p not
    computable ! (Auger et al, 2010, submitted)
                           10
Other simple things




Previous stuff was known, and mathematically hard.
Now, simple stuff, with concrete applications.

Goals: making approximate solving of partially observable games
  more tractable.
  With precise bounds.




                                11
Other simple thing==> practice


Difference with full information games + applications:

- good strategies are randomized
     (when playing games with hidden information)
     (illustration: play rock-paper-scissor;
          if you play a fixed strategy,
          at least one opponent is much stronger than you)

- remark: there is an optimal strategy which is invariant w.r.t
          rotations/symmetries

==> so we can work with only one version, and then symmetrize
       (uniformly)
==> no loss of optimality (Nash sense)
                                 12
Yet another simple thing ==> practice

- Change the game as follows: player 2 chooses
   the hidden state when in state S.

- Then, the game is harder for player 1 (in term of game-theoretical
   value).

==> So we can lower bound the value by considering
  - the worst case on opponent's strategies and
  - assuming he is allowed to rebuild the hidden state (consistently
  with your observations, however).
     ==> you get a matrix game (see example later)

==> if you have both lower and upper bounds, you can estimate
   the value of an history of observations.

==> looks stupid, but simplifies13
                                 analysis (examples next slide)
Examples: 4x4 Ponnuki              Simple case
(phantom version)              (you do it naturally)




           <=== Sure win in 4x4 ponnuki
               (phantom or not)



          ==>


                     ==> at least 1/3 for black
                14
Examples: 4x4 Ponnuki

                     Better case (you don't
                     do it without thinking
                         at the method)
           <=== Sure win in 4x4 ponnuki
               (phantom or not)



          ==>


                         ==> at least 1/3 for black
                15
One more simple thing ==> practice


- Specifically for phantom-games: if a move is either a win, or an
   “illegal” move, then play it.

- Trivially ok (no optimality loss),
          reduces (very much) the set of strategies

==> it can't hurt

==> very compact representation




                                16
One last simple thing ==> practice


Specifically for phantom-games:

 - If in fully observable game X, there are N possible sequences of
   actions and player 1 wins surely.

 - Then, player 1 wins with probability at least 1/N in phantom-X.

Proof: Player 1 can reach proba 1/N of winning by playing
  randomly a sequence of actions = optimal sequence with
  proba 1/N (at least).




                               17
Good in Go, bad in phantom-Go:
tightness


Black to play.
Go: black has lost
Phantom-Go:
 Black wins with
 Proba 1-1/8!

(==> bound from
previous slide
Is nearly tight)


                     18
Phantom Games




Some results on real games.

                                   1. Phantom-games & phantom-go
                                                         2. Maths
                         3. Experiments (manually performed :-) )


F. Teytaud, O. Teytaud
TAO, Inria-Saclay IDF, Cnrs 8623, Lri, Univ. Paris-Sud,
OASE Lab,


Korea,
Summer 2011
                                             19
Phantom-tic-tac-toe

Strategy for 1st player / phantom-tic-tac-toe




==> dominating moves = moves which
      are either illegal or wins
                       20
Phantom-tic-tac-toe: bounds

Then, define 6 families of strategies for
 white, covering all possible cases;
using the “simple facts”, we show that in
 all cases 1st player wins with proba at
 least 3/4
But, 2nd player can ensure a draw in TTT.
So by Lemma: value of Phantom-TTT in



                    21
Phantom-tic-tac-toe: bounds

Then, define 6 families of strategies for
 white, covering all possible cases;
using the “simple facts”, we show that in
 all cases 1st player wins with proba at
 least 3/4                     384 = nb
                               of legal
But, 2nd player can ensure a draw in TTT.
                              sequences
                              as 2nd player
So by Lemma: value of Phantom-TTT in



                    22
Phantom-ponnuki

3x3 is a win for black.
4x4 is a win for black with proba:



(by conversion/inequalities with
   matrix games)



                    23
Conclusions



Here some simple tools, with rigorous bounds on
   Phantom-tic-tac-toe
   Phantom-Ponnuki in 3x3 and 4x4

The main tool is generic (opponent chooses hidden state
   ==> matrix game)

Main further work:
    Implementation inside a search algorithm (e.g. for ranking
      moves or evaluating leafs)
    Other simplification ideas ?
         e.g. more on worst 24
                             case analysis
Conclusions
PO board games = great challenge
   Phantom-Go (humans still stronger than computers ?)
   Fog of War (don't know)
   MineSweeper: usual solvers are not optimal (they optimize
    the short-term only: minimum proba of mine)
                                                      We got
                                                      optimal
                                                    play in 6x6,
                                                      4 mines.

   ==> better models than board games for real AI ?
   ==> involves taste of danger; beyond IQ ?
   ==> my feeling: many CI improvements possible here,
              maths can help.
  (human-level performance25at Urban Rivals, a PO card game)
Finished!




...thanks for your attention ! ...




                   26

More Related Content

Viewers also liked

Keywords and examples of machine learning
Keywords and examples of machine learningKeywords and examples of machine learning
Keywords and examples of machine learningOlivier Teytaud
 
Bias and Variance in Continuous EDA: massively parallel continuous optimization
Bias and Variance in Continuous EDA: massively parallel continuous optimizationBias and Variance in Continuous EDA: massively parallel continuous optimization
Bias and Variance in Continuous EDA: massively parallel continuous optimizationOlivier Teytaud
 
Examples of operational research
Examples of operational researchExamples of operational research
Examples of operational researchOlivier Teytaud
 
Simulation-based optimization: Upper Confidence Tree and Direct Policy Search
Simulation-based optimization: Upper Confidence Tree and Direct Policy SearchSimulation-based optimization: Upper Confidence Tree and Direct Policy Search
Simulation-based optimization: Upper Confidence Tree and Direct Policy SearchOlivier Teytaud
 
Planning for power systems
Planning for power systemsPlanning for power systems
Planning for power systemsOlivier Teytaud
 
Simple regret bandit algorithms for unstructured noisy optimization
Simple regret bandit algorithms for unstructured noisy optimizationSimple regret bandit algorithms for unstructured noisy optimization
Simple regret bandit algorithms for unstructured noisy optimizationOlivier Teytaud
 
Artificial intelligence for power systems
Artificial intelligence for power systemsArtificial intelligence for power systems
Artificial intelligence for power systemsOlivier Teytaud
 

Viewers also liked (9)

Keywords and examples of machine learning
Keywords and examples of machine learningKeywords and examples of machine learning
Keywords and examples of machine learning
 
Bias and Variance in Continuous EDA: massively parallel continuous optimization
Bias and Variance in Continuous EDA: massively parallel continuous optimizationBias and Variance in Continuous EDA: massively parallel continuous optimization
Bias and Variance in Continuous EDA: massively parallel continuous optimization
 
Power systemsilablri
Power systemsilablriPower systemsilablri
Power systemsilablri
 
Functional programming
Functional programmingFunctional programming
Functional programming
 
Examples of operational research
Examples of operational researchExamples of operational research
Examples of operational research
 
Simulation-based optimization: Upper Confidence Tree and Direct Policy Search
Simulation-based optimization: Upper Confidence Tree and Direct Policy SearchSimulation-based optimization: Upper Confidence Tree and Direct Policy Search
Simulation-based optimization: Upper Confidence Tree and Direct Policy Search
 
Planning for power systems
Planning for power systemsPlanning for power systems
Planning for power systems
 
Simple regret bandit algorithms for unstructured noisy optimization
Simple regret bandit algorithms for unstructured noisy optimizationSimple regret bandit algorithms for unstructured noisy optimization
Simple regret bandit algorithms for unstructured noisy optimization
 
Artificial intelligence for power systems
Artificial intelligence for power systemsArtificial intelligence for power systems
Artificial intelligence for power systems
 

Recently uploaded

VIP Call Girls Service Banjara Hills Hyderabad Call +91-8250192130
VIP Call Girls Service Banjara Hills Hyderabad Call +91-8250192130VIP Call Girls Service Banjara Hills Hyderabad Call +91-8250192130
VIP Call Girls Service Banjara Hills Hyderabad Call +91-8250192130Suhani Kapoor
 
Call Girls Nashik Gayatri 7001305949 Independent Escort Service Nashik
Call Girls Nashik Gayatri 7001305949 Independent Escort Service NashikCall Girls Nashik Gayatri 7001305949 Independent Escort Service Nashik
Call Girls Nashik Gayatri 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Science City Kolkata ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sex...
Science City Kolkata ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sex...Science City Kolkata ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sex...
Science City Kolkata ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sex...rahim quresi
 
Model Call Girls In Pazhavanthangal WhatsApp Booking 7427069034 call girl ser...
Model Call Girls In Pazhavanthangal WhatsApp Booking 7427069034 call girl ser...Model Call Girls In Pazhavanthangal WhatsApp Booking 7427069034 call girl ser...
Model Call Girls In Pazhavanthangal WhatsApp Booking 7427069034 call girl ser... Shivani Pandey
 
Call Girl Nagpur Roshni Call 7001035870 Meet With Nagpur Escorts
Call Girl Nagpur Roshni Call 7001035870 Meet With Nagpur EscortsCall Girl Nagpur Roshni Call 7001035870 Meet With Nagpur Escorts
Call Girl Nagpur Roshni Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Call Girls Agency In Goa 💚 9316020077 💚 Call Girl Goa By Russian Call Girl ...
Call Girls  Agency In Goa  💚 9316020077 💚 Call Girl Goa By Russian Call Girl ...Call Girls  Agency In Goa  💚 9316020077 💚 Call Girl Goa By Russian Call Girl ...
Call Girls Agency In Goa 💚 9316020077 💚 Call Girl Goa By Russian Call Girl ...russian goa call girl and escorts service
 
Call Girl Nashik Saloni 7001305949 Independent Escort Service Nashik
Call Girl Nashik Saloni 7001305949 Independent Escort Service NashikCall Girl Nashik Saloni 7001305949 Independent Escort Service Nashik
Call Girl Nashik Saloni 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Russian Call Girl South End Park - Call 8250192130 Rs-3500 with A/C Room Cash...
Russian Call Girl South End Park - Call 8250192130 Rs-3500 with A/C Room Cash...Russian Call Girl South End Park - Call 8250192130 Rs-3500 with A/C Room Cash...
Russian Call Girl South End Park - Call 8250192130 Rs-3500 with A/C Room Cash...anamikaraghav4
 
Behala ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Ready ...
Behala ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Ready ...Behala ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Ready ...
Behala ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Ready ...ritikasharma
 
Russian Escorts Agency In Goa 💚 9316020077 💚 Russian Call Girl Goa
Russian Escorts Agency In Goa  💚 9316020077 💚 Russian Call Girl GoaRussian Escorts Agency In Goa  💚 9316020077 💚 Russian Call Girl Goa
Russian Escorts Agency In Goa 💚 9316020077 💚 Russian Call Girl Goasexy call girls service in goa
 
2k Shot Call girls Laxmi Nagar Delhi 9205541914
2k Shot Call girls Laxmi Nagar Delhi 92055419142k Shot Call girls Laxmi Nagar Delhi 9205541914
2k Shot Call girls Laxmi Nagar Delhi 9205541914Delhi Call girls
 
VIP Call Girls Nagpur Megha Call 7001035870 Meet With Nagpur Escorts
VIP Call Girls Nagpur Megha Call 7001035870 Meet With Nagpur EscortsVIP Call Girls Nagpur Megha Call 7001035870 Meet With Nagpur Escorts
VIP Call Girls Nagpur Megha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Call Girls Service Bantala - Call 8250192130 Rs-3500 with A/C Room Cash on De...
Call Girls Service Bantala - Call 8250192130 Rs-3500 with A/C Room Cash on De...Call Girls Service Bantala - Call 8250192130 Rs-3500 with A/C Room Cash on De...
Call Girls Service Bantala - Call 8250192130 Rs-3500 with A/C Room Cash on De...anamikaraghav4
 
Karnal Call Girls 8860008073 Dyal Singh Colony Call Girls Service in Karnal E...
Karnal Call Girls 8860008073 Dyal Singh Colony Call Girls Service in Karnal E...Karnal Call Girls 8860008073 Dyal Singh Colony Call Girls Service in Karnal E...
Karnal Call Girls 8860008073 Dyal Singh Colony Call Girls Service in Karnal E...Apsara Of India
 
Book Paid Sonagachi Call Girls Kolkata 𖠋 8250192130 𖠋Low Budget Full Independ...
Book Paid Sonagachi Call Girls Kolkata 𖠋 8250192130 𖠋Low Budget Full Independ...Book Paid Sonagachi Call Girls Kolkata 𖠋 8250192130 𖠋Low Budget Full Independ...
Book Paid Sonagachi Call Girls Kolkata 𖠋 8250192130 𖠋Low Budget Full Independ...noor ahmed
 
👙 Kolkata Call Girls Sonagachi 💫💫7001035870 Model escorts Service
👙  Kolkata Call Girls Sonagachi 💫💫7001035870 Model escorts Service👙  Kolkata Call Girls Sonagachi 💫💫7001035870 Model escorts Service
👙 Kolkata Call Girls Sonagachi 💫💫7001035870 Model escorts Serviceanamikaraghav4
 
↑Top Model (Kolkata) Call Girls Behala ⟟ 8250192130 ⟟ High Class Call Girl In...
↑Top Model (Kolkata) Call Girls Behala ⟟ 8250192130 ⟟ High Class Call Girl In...↑Top Model (Kolkata) Call Girls Behala ⟟ 8250192130 ⟟ High Class Call Girl In...
↑Top Model (Kolkata) Call Girls Behala ⟟ 8250192130 ⟟ High Class Call Girl In...noor ahmed
 

Recently uploaded (20)

VIP Call Girls Service Banjara Hills Hyderabad Call +91-8250192130
VIP Call Girls Service Banjara Hills Hyderabad Call +91-8250192130VIP Call Girls Service Banjara Hills Hyderabad Call +91-8250192130
VIP Call Girls Service Banjara Hills Hyderabad Call +91-8250192130
 
Call Girls Nashik Gayatri 7001305949 Independent Escort Service Nashik
Call Girls Nashik Gayatri 7001305949 Independent Escort Service NashikCall Girls Nashik Gayatri 7001305949 Independent Escort Service Nashik
Call Girls Nashik Gayatri 7001305949 Independent Escort Service Nashik
 
Science City Kolkata ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sex...
Science City Kolkata ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sex...Science City Kolkata ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sex...
Science City Kolkata ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sex...
 
Model Call Girls In Pazhavanthangal WhatsApp Booking 7427069034 call girl ser...
Model Call Girls In Pazhavanthangal WhatsApp Booking 7427069034 call girl ser...Model Call Girls In Pazhavanthangal WhatsApp Booking 7427069034 call girl ser...
Model Call Girls In Pazhavanthangal WhatsApp Booking 7427069034 call girl ser...
 
Call Girl Nagpur Roshni Call 7001035870 Meet With Nagpur Escorts
Call Girl Nagpur Roshni Call 7001035870 Meet With Nagpur EscortsCall Girl Nagpur Roshni Call 7001035870 Meet With Nagpur Escorts
Call Girl Nagpur Roshni Call 7001035870 Meet With Nagpur Escorts
 
Call Girls Agency In Goa 💚 9316020077 💚 Call Girl Goa By Russian Call Girl ...
Call Girls  Agency In Goa  💚 9316020077 💚 Call Girl Goa By Russian Call Girl ...Call Girls  Agency In Goa  💚 9316020077 💚 Call Girl Goa By Russian Call Girl ...
Call Girls Agency In Goa 💚 9316020077 💚 Call Girl Goa By Russian Call Girl ...
 
Call Girl Nashik Saloni 7001305949 Independent Escort Service Nashik
Call Girl Nashik Saloni 7001305949 Independent Escort Service NashikCall Girl Nashik Saloni 7001305949 Independent Escort Service Nashik
Call Girl Nashik Saloni 7001305949 Independent Escort Service Nashik
 
Russian Call Girl South End Park - Call 8250192130 Rs-3500 with A/C Room Cash...
Russian Call Girl South End Park - Call 8250192130 Rs-3500 with A/C Room Cash...Russian Call Girl South End Park - Call 8250192130 Rs-3500 with A/C Room Cash...
Russian Call Girl South End Park - Call 8250192130 Rs-3500 with A/C Room Cash...
 
Desi Bhabhi Call Girls In Goa 💃 730 02 72 001💃desi Bhabhi Escort Goa
Desi Bhabhi Call Girls  In Goa  💃 730 02 72 001💃desi Bhabhi Escort GoaDesi Bhabhi Call Girls  In Goa  💃 730 02 72 001💃desi Bhabhi Escort Goa
Desi Bhabhi Call Girls In Goa 💃 730 02 72 001💃desi Bhabhi Escort Goa
 
Call Girls New Ashok Nagar Delhi WhatsApp Number 9711199171
Call Girls New Ashok Nagar Delhi WhatsApp Number 9711199171Call Girls New Ashok Nagar Delhi WhatsApp Number 9711199171
Call Girls New Ashok Nagar Delhi WhatsApp Number 9711199171
 
Behala ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Ready ...
Behala ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Ready ...Behala ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Ready ...
Behala ( Call Girls ) Kolkata ✔ 6297143586 ✔ Hot Model With Sexy Bhabi Ready ...
 
Russian Escorts Agency In Goa 💚 9316020077 💚 Russian Call Girl Goa
Russian Escorts Agency In Goa  💚 9316020077 💚 Russian Call Girl GoaRussian Escorts Agency In Goa  💚 9316020077 💚 Russian Call Girl Goa
Russian Escorts Agency In Goa 💚 9316020077 💚 Russian Call Girl Goa
 
Goa Call Girls 9316020077 Call Girls In Goa By Russian Call Girl in goa
Goa Call Girls 9316020077 Call Girls  In Goa By Russian Call Girl in goaGoa Call Girls 9316020077 Call Girls  In Goa By Russian Call Girl in goa
Goa Call Girls 9316020077 Call Girls In Goa By Russian Call Girl in goa
 
2k Shot Call girls Laxmi Nagar Delhi 9205541914
2k Shot Call girls Laxmi Nagar Delhi 92055419142k Shot Call girls Laxmi Nagar Delhi 9205541914
2k Shot Call girls Laxmi Nagar Delhi 9205541914
 
VIP Call Girls Nagpur Megha Call 7001035870 Meet With Nagpur Escorts
VIP Call Girls Nagpur Megha Call 7001035870 Meet With Nagpur EscortsVIP Call Girls Nagpur Megha Call 7001035870 Meet With Nagpur Escorts
VIP Call Girls Nagpur Megha Call 7001035870 Meet With Nagpur Escorts
 
Call Girls Service Bantala - Call 8250192130 Rs-3500 with A/C Room Cash on De...
Call Girls Service Bantala - Call 8250192130 Rs-3500 with A/C Room Cash on De...Call Girls Service Bantala - Call 8250192130 Rs-3500 with A/C Room Cash on De...
Call Girls Service Bantala - Call 8250192130 Rs-3500 with A/C Room Cash on De...
 
Karnal Call Girls 8860008073 Dyal Singh Colony Call Girls Service in Karnal E...
Karnal Call Girls 8860008073 Dyal Singh Colony Call Girls Service in Karnal E...Karnal Call Girls 8860008073 Dyal Singh Colony Call Girls Service in Karnal E...
Karnal Call Girls 8860008073 Dyal Singh Colony Call Girls Service in Karnal E...
 
Book Paid Sonagachi Call Girls Kolkata 𖠋 8250192130 𖠋Low Budget Full Independ...
Book Paid Sonagachi Call Girls Kolkata 𖠋 8250192130 𖠋Low Budget Full Independ...Book Paid Sonagachi Call Girls Kolkata 𖠋 8250192130 𖠋Low Budget Full Independ...
Book Paid Sonagachi Call Girls Kolkata 𖠋 8250192130 𖠋Low Budget Full Independ...
 
👙 Kolkata Call Girls Sonagachi 💫💫7001035870 Model escorts Service
👙  Kolkata Call Girls Sonagachi 💫💫7001035870 Model escorts Service👙  Kolkata Call Girls Sonagachi 💫💫7001035870 Model escorts Service
👙 Kolkata Call Girls Sonagachi 💫💫7001035870 Model escorts Service
 
↑Top Model (Kolkata) Call Girls Behala ⟟ 8250192130 ⟟ High Class Call Girl In...
↑Top Model (Kolkata) Call Girls Behala ⟟ 8250192130 ⟟ High Class Call Girl In...↑Top Model (Kolkata) Call Girls Behala ⟟ 8250192130 ⟟ High Class Call Girl In...
↑Top Model (Kolkata) Call Girls Behala ⟟ 8250192130 ⟟ High Class Call Girl In...
 

Simple Lemmas on Partially Observable Games, and Applications to Phantom tic-tac-toe, Kriegspiel and Phantom-Go

  • 1. Phantom Games Phantom Games 1. Phantom-games & phantom-go 2. Maths 3. Experiments F. Teytaud, O. Teytaud TAO, Inria-Saclay IDF, Cnrs 8623, Lri, Univ. Paris-Sud, OASE Lab, Korea, Summer 2011 1
  • 2. Phantom Games What are phantom games ? phantom-X = partial information counterpart of (full info) game X, you're not informed of your opponent's moves so you might play illegal moves: then you're informed they're illegal; and you just replay them. Extremal case: no other information (just illegal moves) More convenient: a bit more information: informed of ataris (in Go) see all the locations you can reach (Dark Chess). 2
  • 3. Phantom Games What are phantom games ? phantom-X = partial information counterpart of (full info) game X, you're not informed of your opponent's moves so you might play illegal moves: then you're informed they're illegal; and you just replay them. Example: phantom Tic-Tac-Toe 3
  • 4. Phantom Games What are phantom games ? phantom-X = partial information counterpart of (full info) game X, you're not informed of your opponent's moves so you might play illegal moves: then you're informed they're illegal; and you just replay them. My opponent plays (I don't know where) 4
  • 5. Phantom Games What are phantom games ? phantom-X = partial information counterpart of (full info) game X, you're not informed of your opponent's moves so you might play illegal moves: then you're informed they're illegal; and you just replay them. I try this... ==> illegal move! 5
  • 6. Phantom Games What are phantom games ? phantom-X = partial information counterpart of (full info) game X, you're not informed of your opponent's moves so you might play illegal moves: then you're informed they're illegal; and you just replay them. I know the state... ==> good :-) 6
  • 7. Example: Dark Chess - Different from Chinese Dark Chess - Also known as “Fog of War”7
  • 9. Phantom Games A little bit of maths (sorry) 1. Phantom-games & phantom-go 2. Maths 3. Experiments F. Teytaud, O. Teytaud TAO, Inria-Saclay IDF, Cnrs 8623, Lri, Univ. Paris-Sud, OASE Lab, Korea, Summer 2011 9
  • 10. Simple things Consider a 2-player game with: - finite state space; - one of the two players wins. Then: - Full information: one of the player has a winning strategy. We can know who by Minimax. Possibly 2EXP-complete (Go with Japanese rules, Robson's paper). - Partial information, finite horizon: there exists p, Such that player 1 wins with proba p in case of perfect play. p is computable. - Partial information, infinite horizon: p not computable ! (Auger et al, 2010, submitted) 10
  • 11. Other simple things Previous stuff was known, and mathematically hard. Now, simple stuff, with concrete applications. Goals: making approximate solving of partially observable games more tractable. With precise bounds. 11
  • 12. Other simple thing==> practice Difference with full information games + applications: - good strategies are randomized (when playing games with hidden information) (illustration: play rock-paper-scissor; if you play a fixed strategy, at least one opponent is much stronger than you) - remark: there is an optimal strategy which is invariant w.r.t rotations/symmetries ==> so we can work with only one version, and then symmetrize (uniformly) ==> no loss of optimality (Nash sense) 12
  • 13. Yet another simple thing ==> practice - Change the game as follows: player 2 chooses the hidden state when in state S. - Then, the game is harder for player 1 (in term of game-theoretical value). ==> So we can lower bound the value by considering - the worst case on opponent's strategies and - assuming he is allowed to rebuild the hidden state (consistently with your observations, however). ==> you get a matrix game (see example later) ==> if you have both lower and upper bounds, you can estimate the value of an history of observations. ==> looks stupid, but simplifies13 analysis (examples next slide)
  • 14. Examples: 4x4 Ponnuki Simple case (phantom version) (you do it naturally) <=== Sure win in 4x4 ponnuki (phantom or not) ==> ==> at least 1/3 for black 14
  • 15. Examples: 4x4 Ponnuki Better case (you don't do it without thinking at the method) <=== Sure win in 4x4 ponnuki (phantom or not) ==> ==> at least 1/3 for black 15
  • 16. One more simple thing ==> practice - Specifically for phantom-games: if a move is either a win, or an “illegal” move, then play it. - Trivially ok (no optimality loss), reduces (very much) the set of strategies ==> it can't hurt ==> very compact representation 16
  • 17. One last simple thing ==> practice Specifically for phantom-games: - If in fully observable game X, there are N possible sequences of actions and player 1 wins surely. - Then, player 1 wins with probability at least 1/N in phantom-X. Proof: Player 1 can reach proba 1/N of winning by playing randomly a sequence of actions = optimal sequence with proba 1/N (at least). 17
  • 18. Good in Go, bad in phantom-Go: tightness Black to play. Go: black has lost Phantom-Go: Black wins with Proba 1-1/8! (==> bound from previous slide Is nearly tight) 18
  • 19. Phantom Games Some results on real games. 1. Phantom-games & phantom-go 2. Maths 3. Experiments (manually performed :-) ) F. Teytaud, O. Teytaud TAO, Inria-Saclay IDF, Cnrs 8623, Lri, Univ. Paris-Sud, OASE Lab, Korea, Summer 2011 19
  • 20. Phantom-tic-tac-toe Strategy for 1st player / phantom-tic-tac-toe ==> dominating moves = moves which are either illegal or wins 20
  • 21. Phantom-tic-tac-toe: bounds Then, define 6 families of strategies for white, covering all possible cases; using the “simple facts”, we show that in all cases 1st player wins with proba at least 3/4 But, 2nd player can ensure a draw in TTT. So by Lemma: value of Phantom-TTT in 21
  • 22. Phantom-tic-tac-toe: bounds Then, define 6 families of strategies for white, covering all possible cases; using the “simple facts”, we show that in all cases 1st player wins with proba at least 3/4 384 = nb of legal But, 2nd player can ensure a draw in TTT. sequences as 2nd player So by Lemma: value of Phantom-TTT in 22
  • 23. Phantom-ponnuki 3x3 is a win for black. 4x4 is a win for black with proba: (by conversion/inequalities with matrix games) 23
  • 24. Conclusions Here some simple tools, with rigorous bounds on Phantom-tic-tac-toe Phantom-Ponnuki in 3x3 and 4x4 The main tool is generic (opponent chooses hidden state ==> matrix game) Main further work: Implementation inside a search algorithm (e.g. for ranking moves or evaluating leafs) Other simplification ideas ? e.g. more on worst 24 case analysis
  • 25. Conclusions PO board games = great challenge Phantom-Go (humans still stronger than computers ?) Fog of War (don't know) MineSweeper: usual solvers are not optimal (they optimize the short-term only: minimum proba of mine) We got optimal play in 6x6, 4 mines. ==> better models than board games for real AI ? ==> involves taste of danger; beyond IQ ? ==> my feeling: many CI improvements possible here, maths can help. (human-level performance25at Urban Rivals, a PO card game)
  • 26. Finished! ...thanks for your attention ! ... 26