SlideShare uma empresa Scribd logo
1 de 25
On the Convergence of Regret Minimizing Dynamics in Concave Games Joint work with Eyal Even Dar , Yishay Mansour Microsoft Research, Cambridge UK, March 26, 2009 Uri Nadav Tel Aviv University, Tel Aviv Israel
Nash Equilibrium ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Player I Player II 0 ,0 4 ,1 1 ,4 3 ,3
Dynamics Player 1: Player 2: Day 1 Day 2 Day 3 Day 4 Day 5 0,0 4,1 1,4 3,3
Example Dynamics ,[object Object],[object Object],[object Object],Player 1: Player 2: Day 1 Day 2 Day 3 Day 4 Day 5 Unfortunately, does not always converge to equilibrium 0,0 4,1 1,4 3,3
No External Regret ,[object Object],[object Object],( total cost of best fixed row in hindsight ) ( total cost of alg ) - Regret Alg  (  T  ) =  A procedure is  “without external regret”  if for every sequence the external regret is sublinear in  T ,[object Object],4 Alg Cost Weather 3 Umbrella 4 No umbrella  0 1 1 0
Our Main Result ,[object Object],Resource Allocation Cournot Oligopoly Socially concave games Selfish Routing TCP Congestion Control
Cournot Oligopoly  [Cournot 1838] ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],X Y Cost 1 (X) Cost 2 (Y) We will show no-regret dynamics converges to NE for any number of players Market  price Overall quantity X y P
Resource Allocation Games ,[object Object],$5M $10M $17M $25M ,[object Object],‘ s allocated rate =  5+10+17+25 25 ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],We can show that the best response dynamics generally diverges for linear resource allocation games f   ( U  =  )   - $25M
Routing Games ,[object Object],s 1 t 1 t 2 s 2 f 1, L f 1, R f 1,L f 2,T Latency on edge e = L e (f 1,L  + f 2,T ) e f 2, T f 2, B ,[object Object],f 2 f 1 ,[object Object]
Socially Concave Games ,[object Object],[object Object],[object Object],R There exists   1 ,…,  n  > 0 Such that  1  u 1  (x) +   2  u 2 (x)+…+  n  u n (x) Zero Sum Games  ½  Socially concave games ,[object Object],[object Object],[object Object],[object Object],[object Object]
Our Main Result ,[object Object],Player 1: Player 2: Player  n : Day 1 Day 2 Day 3 Day  T Average of days  1… T    - Nash equilibrium ,[object Object],If each players uses a  procedure without regret  in  socially concave games  then their joint play converges to Nash equilibrium: - Nash Eq
Convergence to NE – Proof Outline ,[object Object],Utility of player  i   at average   Utility of i playing Best  Response to the average   Definition of    - Nash equilibrium Player 1: Player 2: Player  n : Day 1 Day 2 Day 3 Day  T Average
Convergence to NE – Proof Outline ,[object Object],For each player  i : By definition of  Best Response Sum of utilities Utility of average action profile Convexity: The minimum utility is attained when the others play fixed action  Holds for every action  z
Convergence to NE – Proof Outline ,[object Object],By assumption, there exists   1 ,…,  n  such that : is concave Utility of average action profile (Average) Sum of utilities Concavity: The sum of utilities at average is greater than the average utility
Convergence to NE – Proof Outline ,[object Object],Upper Bound = Lower Bound +  Average Regret Upper Bound  ¸   ¸  Lower Bound Q.E.D
Convergence in Almost Socially Concave Games ,[object Object],[object Object],[object Object],[object Object],Therefore, the convergence theorem cannot be directly applied Playing gradient based dynamics, guarantees “playing” in a “socially concave zone” Playing gradient based dynamics, guarantees “no regret” in concave decision making [Zinkevich]
Regret Minimization & Equilibrium ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Ongoing Research I ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Ongoing Research II ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Other stuff I work on FIFO ,[object Object],[object Object],Slot #1 Slot #2 Slot #3 Slot #4 Slot #5 Slot #6 time ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Tha n k you! ,[object Object]
TCP Congestion Control  [kkps  01 ] User Utility:   u i  = g i  –  i  l i f i g i l i User action:   push flow  f i good-put: fraction  of  f i  forwarded loss:  l i   = fraction  of  f i  discard Channel Fraction of  good-put determined by router policy  i   : associated cost with lost flow retransmission, lost bandwidth, utilization
Router Policy ,[object Object],[object Object],[object Object],[object Object],Amount of flow to discard depends on the total amount of flow ,[object Object]
Resource Allocation Games ,[object Object],$5M $10M $17M $25M ,[object Object],‘ s allocated rate =  5+10+17+25 25 ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],We can show that the best response dynamics generally diverges for linear resource allocation games f   ( U  =  )   - $25M
Nash Equilibrium ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Player I: Player II ½  ½  ½  ½  0 ,0 4 ,1 1 ,4 3 ,3

Mais conteúdo relacionado

Mais procurados (20)

Game Theory Presentation
Game Theory PresentationGame Theory Presentation
Game Theory Presentation
 
Applications of game theory on event management
Applications of game theory on event management Applications of game theory on event management
Applications of game theory on event management
 
Game Theory Assignment
Game Theory AssignmentGame Theory Assignment
Game Theory Assignment
 
Game theory
Game theoryGame theory
Game theory
 
Game theory project
Game theory projectGame theory project
Game theory project
 
Game design 4
Game design 4Game design 4
Game design 4
 
Robust Price of Anarchy
Robust Price of AnarchyRobust Price of Anarchy
Robust Price of Anarchy
 
Game theory in Economics
Game theory in EconomicsGame theory in Economics
Game theory in Economics
 
Game theory in network security
Game theory in network securityGame theory in network security
Game theory in network security
 
Game theory
Game theoryGame theory
Game theory
 
Game design 3
Game design 3Game design 3
Game design 3
 
Game Theory by Chadha
Game Theory by ChadhaGame Theory by Chadha
Game Theory by Chadha
 
Introduction to Game Theory
Introduction to Game TheoryIntroduction to Game Theory
Introduction to Game Theory
 
Game theory application
Game theory applicationGame theory application
Game theory application
 
Game theory
Game theoryGame theory
Game theory
 
TermPaper
TermPaperTermPaper
TermPaper
 
Game
Game Game
Game
 
Game theory
Game theoryGame theory
Game theory
 
Game Theory - Quantitative Analysis for Decision Making
Game Theory - Quantitative Analysis for Decision MakingGame Theory - Quantitative Analysis for Decision Making
Game Theory - Quantitative Analysis for Decision Making
 
Game Theory Economics
Game Theory EconomicsGame Theory Economics
Game Theory Economics
 

Semelhante a Dynamics

A Brief Introduction to the Basics of Game Theory
A Brief Introduction to the Basics of Game TheoryA Brief Introduction to the Basics of Game Theory
A Brief Introduction to the Basics of Game TheoryTrading Game Pty Ltd
 
Ssrn a brief inrtoduction to the basic of game theory
Ssrn a brief inrtoduction to the basic of game theorySsrn a brief inrtoduction to the basic of game theory
Ssrn a brief inrtoduction to the basic of game theoryYing wei (Joe) Chou
 
A brief introduction to the basics of game theory
A brief introduction to the basics of game theoryA brief introduction to the basics of game theory
A brief introduction to the basics of game theoryWladimir Augusto
 
A brief introduction to the basics of game theory
A brief introduction to the basics of game theoryA brief introduction to the basics of game theory
A brief introduction to the basics of game theoryYing wei (Joe) Chou
 
navingameppt-191018085333.pdf
navingameppt-191018085333.pdfnavingameppt-191018085333.pdf
navingameppt-191018085333.pdfDebadattaPanda4
 
Game Theory in Cryptoeconomics
Game Theory in CryptoeconomicsGame Theory in Cryptoeconomics
Game Theory in CryptoeconomicsJongseung Kim
 
Module 3 Game Theory (1).pptx
Module 3 Game Theory (1).pptxModule 3 Game Theory (1).pptx
Module 3 Game Theory (1).pptxDrNavaneethaKumar
 
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptx
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptxAI3391 Artificial Intelligence Session 14 Adversarial Search .pptx
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptxAsst.prof M.Gokilavani
 
Game Theory SV.docx
Game Theory SV.docxGame Theory SV.docx
Game Theory SV.docxsnehil35
 
Arman cdc11
Arman cdc11Arman cdc11
Arman cdc11armkiani
 
Game Theory_1.pptx
Game Theory_1.pptxGame Theory_1.pptx
Game Theory_1.pptxssuser8c2631
 
Cascading Behavior in Networks
Cascading Behavior in NetworksCascading Behavior in Networks
Cascading Behavior in NetworksOsamah Al-Ghammari
 
Game Theory_ 2.pptx
Game Theory_ 2.pptxGame Theory_ 2.pptx
Game Theory_ 2.pptxssuser8c2631
 
DeepStack: Expert-Level Artificial Intelligence in Heads-Up No-Limit Poker
DeepStack: Expert-Level Artificial Intelligence in Heads-Up No-Limit PokerDeepStack: Expert-Level Artificial Intelligence in Heads-Up No-Limit Poker
DeepStack: Expert-Level Artificial Intelligence in Heads-Up No-Limit PokerOfir Shalev
 

Semelhante a Dynamics (20)

A Brief Introduction to the Basics of Game Theory
A Brief Introduction to the Basics of Game TheoryA Brief Introduction to the Basics of Game Theory
A Brief Introduction to the Basics of Game Theory
 
Ssrn a brief inrtoduction to the basic of game theory
Ssrn a brief inrtoduction to the basic of game theorySsrn a brief inrtoduction to the basic of game theory
Ssrn a brief inrtoduction to the basic of game theory
 
A brief introduction to the basics of game theory
A brief introduction to the basics of game theoryA brief introduction to the basics of game theory
A brief introduction to the basics of game theory
 
A brief introduction to the basics of game theory
A brief introduction to the basics of game theoryA brief introduction to the basics of game theory
A brief introduction to the basics of game theory
 
game THEORY ppt
game THEORY pptgame THEORY ppt
game THEORY ppt
 
navingameppt-191018085333.pdf
navingameppt-191018085333.pdfnavingameppt-191018085333.pdf
navingameppt-191018085333.pdf
 
file1
file1file1
file1
 
Game Theory in Cryptoeconomics
Game Theory in CryptoeconomicsGame Theory in Cryptoeconomics
Game Theory in Cryptoeconomics
 
Module 3 Game Theory (1).pptx
Module 3 Game Theory (1).pptxModule 3 Game Theory (1).pptx
Module 3 Game Theory (1).pptx
 
tutorial_kearns
tutorial_kearnstutorial_kearns
tutorial_kearns
 
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptx
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptxAI3391 Artificial Intelligence Session 14 Adversarial Search .pptx
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptx
 
Game Theory SV.docx
Game Theory SV.docxGame Theory SV.docx
Game Theory SV.docx
 
nips-gg
nips-ggnips-gg
nips-gg
 
Arman cdc11
Arman cdc11Arman cdc11
Arman cdc11
 
Unit II A - Game Theory
Unit II A - Game TheoryUnit II A - Game Theory
Unit II A - Game Theory
 
Game Theory_1.pptx
Game Theory_1.pptxGame Theory_1.pptx
Game Theory_1.pptx
 
Cascading Behavior in Networks
Cascading Behavior in NetworksCascading Behavior in Networks
Cascading Behavior in Networks
 
Game Theory_ 2.pptx
Game Theory_ 2.pptxGame Theory_ 2.pptx
Game Theory_ 2.pptx
 
DeepStack: Expert-Level Artificial Intelligence in Heads-Up No-Limit Poker
DeepStack: Expert-Level Artificial Intelligence in Heads-Up No-Limit PokerDeepStack: Expert-Level Artificial Intelligence in Heads-Up No-Limit Poker
DeepStack: Expert-Level Artificial Intelligence in Heads-Up No-Limit Poker
 
socialpref
socialprefsocialpref
socialpref
 

Dynamics

  • 1. On the Convergence of Regret Minimizing Dynamics in Concave Games Joint work with Eyal Even Dar , Yishay Mansour Microsoft Research, Cambridge UK, March 26, 2009 Uri Nadav Tel Aviv University, Tel Aviv Israel
  • 2.
  • 3. Dynamics Player 1: Player 2: Day 1 Day 2 Day 3 Day 4 Day 5 0,0 4,1 1,4 3,3
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22. TCP Congestion Control [kkps 01 ] User Utility: u i = g i –  i l i f i g i l i User action: push flow f i good-put: fraction of f i forwarded loss: l i = fraction of f i discard Channel Fraction of good-put determined by router policy  i : associated cost with lost flow retransmission, lost bandwidth, utilization
  • 23.
  • 24.
  • 25.

Notas do Editor

  1. If it reaches to NE it stays there
  2. Say why it id important
  3. Different slopes