22
1 ISCI 330 Lecture 18 Evolutionary Game Theory ISCI 330 Lecture 18

Evolutionary Game Theory - University of British Columbia

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Evolutionary Game Theory - University of British Columbia

1 ISCI 330 Lecture 18

Evolutionary Game Theory

ISCI 330 Lecture 18

Page 2: Evolutionary Game Theory - University of British Columbia

2 ISCI 330 Lecture 18

Outline• A bit about historical origins of Evolutionary

Game Theory• Main (competing) theories about how

cooperation evolves• PD and other social dilemma games• Iterated PD, TFT, etc.• Evolutionary Stable Strategy (ESS)• N-player PD (and other games)• Simpson’s paradox and the

role of assortment

Page 3: Evolutionary Game Theory - University of British Columbia

3 ISCI 330 Lecture 18

Evolution by Natural Selection• Lewontin’s principles (from Darwin)

– 1) Phenotypic variation– 2) Differential fitness– 3) Heritability

• In Evolutionary Game Theory– 1) Population of strategies– 2) Utility determines number of offspring (fitness)– 3) Strategies breed true

• Frequency-dependent selection– One of the first examples is Fisher’s

sex ratio findings– Introduces idea of strategic phenotypes

Page 4: Evolutionary Game Theory - University of British Columbia

4 ISCI 330 Lecture 18

opponent’s behaviour

Dove

Hawk

actor’s

Dove

V / 2 5

0

behaviour Hawk

V 10

(V- c) / 2 -5

Ritualized Fighting

• V = 10; c = 20• The rare strategy has an advantage

(i.e. frequency dependent selection)• Hawk-Dove, Chicken, Snowdrift, Brinkmanship • If 0 < c < V, then game is PD instead

Page 5: Evolutionary Game Theory - University of British Columbia

5 ISCI 330 Lecture 18

Main Theories:Evolution of Altruism

• Multilevel Selection∆Q = ∆QB + ∆QW (Price Equation)

• Inclusive Fitness/Kin Selection– wincl. = wdirect + windirect

∆Q > 0 if rb > c (Hamilton’s rule)• Reciprocal Altruism

∆Q > 0 if altruists are sufficiently compensated for their sacrifices via reciprocity (ESS)

Page 6: Evolutionary Game Theory - University of British Columbia

6 ISCI 330 Lecture 18

Additive Prisoner’s Dilemma (PD) Actor's Fitness (Utility)

opponent’s behaviour

C

contributes b

D

contributes 0

actor’s

C

sacrifices c

w0 + b – c 4

w0 – c 0

behaviour D

sacrifices 0

w0 + b 5

w0 1

• w0 = 1; b = 4; c = 1

Page 7: Evolutionary Game Theory - University of British Columbia

7 ISCI 330 Lecture 18

Non-Additive PDActor's Fitness (Utility)

opponent’s behavior

C

contributes b

D

contributes 0

actor’s

C

sacrifices c

w0 + b – c (+d) 3

w0 – c 0

behavior D

sacrifices 0

w0 + b 5

w0 1

• w0 = 1; b = 4; c = 1; d = -1

Page 8: Evolutionary Game Theory - University of British Columbia

8 ISCI 330 Lecture 18

Main Theories:Evolution of Altruism

• Multilevel Selection– Predominate models are in terms of public good

• Inclusive Fitness/Kin Selection– Predominate models is in terms of individual

contributions (b and c)• Reciprocal Altruism

– Predominate models in terms of iterated PD (iPD)

Page 9: Evolutionary Game Theory - University of British Columbia

9 ISCI 330 Lecture 18

Evolutionarily Social Dilemma Games• What features do Hawk-Dove and the PD

have in common? – Cs do better in CC pairs than Ds do in DD pairs– Ds do better than Cs in mixed pairs

• Given 4 utility levels (1st, 2nd, 3rd, 4th) how many 2-player, symmetric games are there that capture this idea of “social dilemma”?

• With a partner, find these other games. Can you name them?

Page 10: Evolutionary Game Theory - University of British Columbia

10 ISCI 330 Lecture 18

6 evolutionarily interesting“social dilemmas”

• How do these games compare in terms of– Nash equilibria?– Pareto optimality?– Is it better to be rare or common?

• Consider populations of strategies rather than 2-players

• Relative vs. Absolute fitness

Page 11: Evolutionary Game Theory - University of British Columbia

11 ISCI 330 Lecture 18

Common EGT Assumptions• Population of strategies • Replicator equations

– Number of individuals of a certain strategy in next generation depends on:

• Average fitness (utility) of individuals with that strategy• Which depends on frequency distribution of strategies

• Often assume– infinite populations where replicator equations give

proportion of strategy (scaled by average fitness)– continuous (or discrete) time– complete mixing (random

interactions)– strategies breed true

(no sex or mutation)

Page 12: Evolutionary Game Theory - University of British Columbia

12 ISCI 330 Lecture 18

Iterated Prisoner’s Dilemma• Robert Triver’s concept of reciprocal

altruism• Robert Axelrod’s tournaments

– Every strategy plays every other one • Or at random for evolutionary experiments

– On average 200 rounds– Final score is cumulative

payoffs from all rounds– Can condition current behaviour

on any amount of history • opponent’s and actor’s

Page 13: Evolutionary Game Theory - University of British Columbia

13 ISCI 330 Lecture 18

Conditional Strategies• Anatol Rappoport’s Tit-For-Tat strategy

– Unless provoked, the agent will always cooperate

– If provoked, the agent will retaliate swiftly– The agent is quick to forgive – Susceptible to noise– Backwards induction issue

• Imagine gene for when to start defecting

• Alternatives– Forgiving TFT, TF2T, Pavlov,

walk-away• Under what conditions does

TFT beat it’s opponent?

Page 14: Evolutionary Game Theory - University of British Columbia

14 ISCI 330 Lecture 18

Simple Iterated PD Model

TFT

ALLD

TFT

TFT

TFTTFT

TFT

TFTTFT

TFT

TFTTFT

ALLD ALLD ALLD

ALLD

ALLD

ALLD

ALLD

ALLD

ALLD

ALLDALLD

TFTTFT

TFT

opponent’s behaviour

C

contributes b

D

contributes 0

actor’s

C

sacrifices c

w0 + b – c 4

w0 – c 0

behaviour D

sacrifices 0

w0 + b 5

w0 1

pick pairsat random

TFTTFT

TFTALLD ALLD

ALLD

TFT

offspring replace parents

offspring proportional to cumulative payoff

play i times

Page 15: Evolutionary Game Theory - University of British Columbia

15 ISCI 330 Lecture 18

Numerical Simulations of Iterated PD varying Q, i, and b (c = 1)

-0.1

0

0.1

1 3 5 7 9 11 13 15

b

∆QQ = 0.2; i = 2Q = 0.15; i = 2Q = 0.15; i = 4Q = 0.15; i = 15Q = 0.1; i = 2

– Fletcher & Zwick, 2006. The American Naturalist

Page 16: Evolutionary Game Theory - University of British Columbia

16 ISCI 330 Lecture 18

Simple Iterated PD Model

ALLD

ALLD

ALLD

ALLD

ALLD

ALLDALLD

ALLDTFT

ALLD

ALLDALLD

ALLD ALLD ALLD

ALLD

ALLD

ALLD

ALLD

ALLD

ALLD

ALLDALLD

ALLD

ALLDALLD

opponent’s behaviour

C

contributes b

D

contributes 0

actor’s

C

sacrifices c

w0 + b – c 4

w0 – c 0

behaviour D

sacrifices 0

w0 + b 5

w0 1

pick pairsat random

ALLD ALLD

TFTALLD ALLD

ALLD

ALLD

offspring replace parents

offspring proportional to cumulative payoff

play i times

Page 17: Evolutionary Game Theory - University of British Columbia

17 ISCI 330 Lecture 18

Simple Iterated PD Model

TFT

TFT

TFT

TFT

TFTTFT

TFT

TFTTFT

TFT

TFTTFT

TFT TFT TFT

TFT

TFT

TFTTFT

TFT

TFT

TFTALLD

TFTTFT

TFT

opponent’s behaviour

C

contributes b

D

contributes 0

actor’s

C

sacrifices c

w0 + b – c 4

w0 – c 0

behaviour D

sacrifices 0

w0 + b 5

w0 1

pick pairsat random

TFTTFT

TFTTFT TFT

ALLD

TFT

offspring replace parents

offspring proportional to cumulative payoff

play i times

Page 18: Evolutionary Game Theory - University of British Columbia

18 ISCI 330 Lecture 18

Evolutionary Stable Strategy (ESS)• A strategy which if adopted by a population

cannot be invaded by any competing alternative strategy

• How does this compare to a Nash Equilibrium?– Here assume almost all players play the same

strategy (call it S)– S is a Nash Eq. if u(S, S) ≥ u(S', S) for any S'– S is an ESS if

• u(S, S) > u(S', S) for any S' (strict Nash)• Or u(S, S) = u(S', S) AND u(S, S’) > u(S', S')

– If S is an ESS, then it is a Nash Eq.– If S is a strict Nash Eq. (given a population of S),

then it is an ESS

Page 19: Evolutionary Game Theory - University of British Columbia

19 ISCI 330 Lecture 18

ESS and Hawk-Dove (Chicken)• Is Hawk an ESS? Is Dove and ESS?• Is there a mixture of playing Hawk and Dove

that is an ESS?– Find it assuming original game

• Note that we can think of this in 2 ways– Agents playing a mixed strategy

• Monomorphic solution where an allele for this mixture has fixed in the population

– A mixed population of strategies at this ratio

• Polymorphic stable equilibrium

• Learning vs. Evolving

opponent’s behaviour

Dove

Hawk

actor’s

Dove

5

0

behaviour Hawk

10

-5

Page 20: Evolutionary Game Theory - University of British Columbia

20 ISCI 330 Lecture 18

ESS and the PD• Is C an ESS? Is D and ESS?• Is there a mixture of playing C and D that is an

ESS?• What if we break the assumption of random

interactions?• Is TFT an ESS? Is ALLD and ESS?

– Does it depend on iterations in iPD?– Find game (payoff matrix) for TFT vs. ALLD if i = 3– What do we call this game?

• Can think of TFT in an iPD in two ways:– Conditional behaviour causes C

behaviours of others to be more assorted with TFT than with ALLD

– In iPD, TFT and ALLD change the game to Assurance

opponent’s behaviour

C

contributes b

D

contributes 0

actor’s

C

sacrifices c

w0 + b – c 4

w0 – c 0

behaviour D

sacrifices 0

w0 + b 5

w0 1

Page 21: Evolutionary Game Theory - University of British Columbia

21 ISCI 330 Lecture 18

N-Player Prisoner’s Dilemma(Tragedy of the Commons)

q, fraction of cooperators in a group

fitne

ss p

er in

divi

dual

0 0.5 1.00

c

b

fitness to a cooperator, wa

fitness to a defector, ws

average fitness, wav

• ai' = ai [1 + wa(qi)] si' = si [1 + ws(qi)]• wa(qi) = bqi – c + w0 ws(qi) = bqi + w0

– Fletcher & Zwick, 2007. Journal of Theoretical Biology

Page 22: Evolutionary Game Theory - University of British Columbia

22 ISCI 330 Lecture 18

Simpson’s Paradox

Non-altruists

Altruists

TotalTi

me

• Altruists become a smaller portion of each group• But altruists become a larger portion of the whole

Group 1 Group 2