39
Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Embed Size (px)

Citation preview

Page 1: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Computing Equilibria

Christos H. Papadimitriou

UC Berkeley

“christos”

Page 2: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

1,-1 -1,1

-1,1 1,-1

4,4 1,5

5,1 0,0

3,3 0,4

4,0 1,1

matching pennies prisoner’s dilemmachicken

Games help us understand rational behavior in competitive situations

Page 3: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Concepts of rationality

• Nash equilibrium (or double best response)

• Problem: may not exist

• Idea: randomized Nash equilibrium

Theorem [Nash 1951]: Always exists....

Page 4: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

can it be found in polynomial time?

Page 5: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

is it then NP-complete?

No, because a solution always exists

Page 6: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

…and why bother?(a parenthesis)

• Equilibrium concepts provide some of the most intriguing specimens of problems

• They are notions of rationality, aspiring models of behavior

• Efficient computability is an important modeling prerequisite“if your laptop can’t find it, then neither can the market…”

Page 7: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Complexity of Nash Equilibria?

• Nash’s existence proof relies on Brouwer’s fixpoint theorem

• Finding a Brouwer fixpoint is a hard problem

• Not quite NP-complete, but as hard as any problem that always has an answer can be…

• Technical term: PPAD-complete [P 1991]

Page 8: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Complexity? (cont.)

• But how about Nash?

• Is it as hard as Brouwer?

• Or are the Brouwer functions constructed in the proof specialized enough so that fixpoints can be computed?

(cf contraction maps)

Page 9: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

An Easier Problem:Correlated equilibrium

4,4 1,5

5,1 0,0

Chicken:

•Two pure equilibria {me, you}

•Mixed (½, ½) (½, ½) payoff 5/2

Page 10: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Idea (Aumann 1974)

• “Traffic signal”

with payoff 3

• Compare with

Nash equilibrium

• Even better

with payoff 3 1/3

0 ½

½ 0

1/4 1/4

1/4 1/4

1/3 1/3

1/3 0

Probabilities in a lottery drawn by an impartial outsider, and announced to each player separately

Page 11: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Correlated equilibria

• Always exist (Nash equilibria are examples)

• Can be found (and optimized over) efficiently by linear programming

Page 12: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Linear programming?

• A variable x(s) for each box s

• Each player does not want to deviate from the signal’s recommendation – assuming that the others will play along

• For every player i and any two rows of boxes s, s':

( ) [ ( ) ( ')] 0 i is

x s u s u s

Page 13: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Linear programming!

• n players, s strategies each

• ns2 inequalites

• sn variables!

• Nice for 2 or 3 players

• But many players?

Page 14: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

The embarrassing subject of many players

• With games we are supposed to model markets and the Internet

• These have many players

• To describe a game with n players and s strategies per player you need nsn numbers

Page 15: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

The embarrassing subject of many players (cont.)

• These important games cannot require astronomically long descriptions

“if your problem is important, then its input cannot be astronomically long…”

• Conclusion: Many interesting games are

1. multi-player

2. succinctly representable

Page 16: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

e.g., Graphical Games

• [Kearns et al. 2002] Players are vertices of a graph, each player is affected only by his/her neighbors

• If degrees are bounded by d, nsd numbers suffice to describe the game

• Also: multimatrix, congestion, location, anonymous, hypergraphical, …

Page 17: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Surprise!

Theorem: A correlated equilibrium in a succinct game can be found in polynomial time provided the utility expectation over mixed strategies can be computed in polynomial time.

Corollaries: All succinct games in the literature

Page 18: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

U 0x max ix

0x TU 1y

0y

need to show dual is infeasible

show it is unbounded

Page 19: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Lemma [Hart and Schmeidler, 89]:For every y there is an x such that

xUTy = 0

•and in fact, x is the product of thesteady-state distributions of the Markovchains implied by y

•Idea: run “ellipsoid against hope”

Page 20: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Leonid Khachiyan [1953-2005]

Page 21: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

1 0TxU y

2 0Tx U y

... 0Tkx U y

These k inequalities are themselves infeasible!

Page 22: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

TU 1y

0y infeasible

TXU 1y

0y also infeasible

UXT 0x

0x just need to solve

Page 23: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

as long as we can solve…

given a succinct representation of a game,

and a product distribution x,

find the expected utility of a player,

in polynomial time.

Page 24: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

And it so happens that…

…in all known cases,

this problem can be solved

by applying one, two, or all three

of the following tricks:

• Explicit enumeration

• Dynamic programming

• Linearity of expectation

Page 25: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Corollaries:

• Graphical games (on any graph!)

• Polymatrix games

• Hypergraphical games

• Congestion games and local effect games

• Facility location games

• Anonymous games

• Etc…

Page 26: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Back to Nash complexity: summary

2-Nash 3-Nash 4-Nash … k-Nash …

1-GrNash 2-GrNash 3-GrNash … d-GrNash …

|||

Theorem (with Paul Goldberg, 2005):All these problems are equivalent

Page 27: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

From d-graphical games to d2-normal-form games

• Color the graph with d2 colors• No two vertices affecting the same vertex

have the same color• Each color class is represented by a single

player who randomizes among vertices, strategies

• So that vertices are not “neglected:” generalized matching pennies

Page 28: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

From k-normal-form games to graphical games

• Idea: construct special, very expressive graphical games

• Our vertices will have 2 strategies each

• Mixed strategy = a number in [0,1]

(= probability vertex plays strategy 1)

• Basic trick: Games that do arithmetic!

Page 29: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

“Multiplicationis the name of the gameand each generationplays the same…”

Page 30: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

The multiplication game

x

y

z = x · y

“affects”

w0 0

0 1

if w plays 0,then it gets xy.if it plays 1,then it gets z,but z gets punished

z wins whenit plays 1 and w plays 0

Page 31: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

From k-normal-form games to 3-graphical games (cont.)

• At any Nash equilibrium, z = x y

• Similarly for +, -, “brittle comparison”

• Construct graphical game that checks the equilibrium conditions of the normal form game

• Nash equilibria in the two games coincide

Page 32: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Finally, 4 players

• Previous reduction creates a bipartite graph of degree 3

• Carefully simulate each side by two players, refining the previous reduction

Page 33: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Nash complexity, summary

2-Nash 3-Nash 4-Nash … k-Nash …

1-GrNash 2-GrNash 3-GrNash … d-GrNash …

|||

Theorem (with Paul Goldberg, 2005):All these problems are equivalentTheorem (with Costas Daskalakis andPaul Goldberg, 2005): …and PPAD-complete

Page 34: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Nash is PPAD-complete

• Proof idea: Start from a PPAD-complete stylized version of Brouwer on the 3D cube

• Use arithmetic games to compute Brouwer functions

• Brittle comparator problem solved by averaging

Page 35: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Open problems

• Conjecture 1: 3-player Nash is also PPAD-complete

• Conjecture 2: 2-player Nash can be found in polynomial time

• Approximate equilibria? [cf. Lipton Markakis and Mehta 2003]

Page 36: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

In November…

• Conjecture 1: 3-player Nash is also PPAD-complete

• Proved!! [Chen&Deng05, DP05]

Page 37: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

In December…

• Conjecture 2: 2-player Nash is in P• PPAD-complete [Chen&Deng05b]

Page 38: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

game over!

Page 39: Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Khachiyan lecture, March 2 06

Thank You!