25
Evolving New Evolving New Strategies Strategies The Evolution of The Evolution of Strategies in the Strategies in the Iterated Prisoner’s Iterated Prisoner’s Dilemma Dilemma 01 / 25

Evolving New Strategies

Embed Size (px)

DESCRIPTION

Evolving New Strategies. The Evolution of Strategies in the Iterated Prisoner’s Dilemma. 01 / 25. What is the Prisoner’s Dilemma?. There are two prisoners Each one has taken part in the same criminal act The authorities are interrogating each one - PowerPoint PPT Presentation

Citation preview

Page 1: Evolving New Strategies

Evolving New StrategiesEvolving New Strategies

The Evolution of Strategies in the The Evolution of Strategies in the Iterated Prisoner’s DilemmaIterated Prisoner’s Dilemma

01 / 25

Page 2: Evolving New Strategies

What is the Prisoner’s Dilemma?What is the Prisoner’s Dilemma?

There are two prisonersThere are two prisoners Each one has taken part in the same criminal actEach one has taken part in the same criminal act The authorities are interrogating each oneThe authorities are interrogating each one Each prisoner can choose to keep their mouth shut or rat out their Each prisoner can choose to keep their mouth shut or rat out their

partnerpartner If both prisoners stay quiet, they each get n months of jail timeIf both prisoners stay quiet, they each get n months of jail time If only one prisoner gets ratted out, that prisoner gets n + x months If only one prisoner gets ratted out, that prisoner gets n + x months

of jail time while the other prisoner gets n – y months of jail timeof jail time while the other prisoner gets n – y months of jail time If the prisoners rat each other out, they each get n + z months of jail If the prisoners rat each other out, they each get n + z months of jail

time.time. In this case, n, x, y, and z are all greater than zero.In this case, n, x, y, and z are all greater than zero. In this case, x is greater than z.In this case, x is greater than z.

02 / 25

Page 3: Evolving New Strategies

What is the Iterated Prisoner’s What is the Iterated Prisoner’s Dilemma?Dilemma?

Prisoner’s Dilemma performed several timesPrisoner’s Dilemma performed several times The two criminals have committed several The two criminals have committed several

crimes togethercrimes together They are interrogated for each crime, with each They are interrogated for each crime, with each

set of interrogations being an instance of the set of interrogations being an instance of the original Prisoner’s Dilemmaoriginal Prisoner’s Dilemma

These interrogations are performed in sequence These interrogations are performed in sequence (or (or iterativelyiteratively), and the jail time distributed to ), and the jail time distributed to each prisoner is cumulativeeach prisoner is cumulative

03 / 25

Page 4: Evolving New Strategies

How does the IPD relate to GAs?How does the IPD relate to GAs?

No optimal solutionNo optimal solutionNo real strategyNo real strategyNo clueNo clueHard problemHard problem

So back to the paperSo back to the paper

04 / 25

Page 5: Evolving New Strategies

What This Paper ShowsWhat This Paper Shows

GAs in a rich social settingGAs in a rich social settingAdvantage of developing new strategiesAdvantage of developing new strategies

One parentOne parentTwo parentTwo parent

Early commitments to pathsEarly commitments to pathsEvolutionary processes optimal or arbitraryEvolutionary processes optimal or arbitrary

05 / 25

Page 6: Evolving New Strategies

How Does It Show It?How Does It Show It?

SimulationSimulationMultiple casesMultiple casesComparative outputComparative output

06 / 25

Page 7: Evolving New Strategies

The SimulationThe Simulation

Specify the environmentSpecify the environmentSpecify the encodingSpecify the encodingTesting the effects of random mutationTesting the effects of random mutationRun the simulationRun the simulationAnalyze the resultsAnalyze the results

07 / 25

Page 8: Evolving New Strategies

The EnvironmentThe Environment

Prisoner’s dilemmaPrisoner’s dilemmaMultiple prisonersMultiple prisonersGoal is to achieve mutual cooperationGoal is to achieve mutual cooperation Individuals may meet more than onceIndividuals may meet more than once

08 / 25

Page 9: Evolving New Strategies

Initial ExperimentInitial Experiment

Original strategies were submitted by Original strategies were submitted by fourteen peoplefourteen peopleGame TheoryGame TheoryEconomicsEconomicsSociologySociologyPolitical SciencePolitical ScienceMathematicsMathematics

Various levels of intricacyVarious levels of intricacy

09 / 25

Page 10: Evolving New Strategies

Initial ExperimentInitial Experiment

Most complex strategyMost complex strategyMarkov process Markov process Bayesian inferenceBayesian inference

Least complex strategyLeast complex strategyTFTTFT

TFT won TFT won

10 / 25

Page 11: Evolving New Strategies

Second ExperimentSecond Experiment

Sixty-two entriesSixty-two entriesSix countriesSix countriesComputer hobbyists, professorsComputer hobbyists, professors

TFT was submitted againTFT was submitted again It wonIt won

11 / 25

Page 12: Evolving New Strategies

The GAThe GA

PopulationPopulationEncodingEncodingGenerationGenerationCrossoverCrossoverMutationMutationFifty GenerationsFifty Generations

12 / 25

Page 13: Evolving New Strategies

PopulationPopulation

Twenty chromosomesTwenty chromosomesSeventy genesSeventy genes

13 / 25

Page 14: Evolving New Strategies

EncodingEncoding

For each prisoner’s dilemma, there are For each prisoner’s dilemma, there are four possibilitiesfour possibilities

Each “player” has memoryEach “player” has memoryWhat each gene representsWhat each gene represents

14 / 25

Page 15: Evolving New Strategies

A Single GenerationA Single Generation

Multiple gamesMultiple games Each game had one-hundred and fifty-one movesEach game had one-hundred and fifty-one moves Each chromosome played eight othersEach chromosome played eight others

Fitness was assignedFitness was assigned Ratted out – Zero pointsRatted out – Zero points Mutually ratted out – One pointMutually ratted out – One point Mutual cooperation – Three pointsMutual cooperation – Three points You ratted, other person stayed quit – Five pointsYou ratted, other person stayed quit – Five points

15 / 25

Page 16: Evolving New Strategies

CrossoverCrossover

Fitness proportional selectionFitness proportional selection Involved standard deviation from meanInvolved standard deviation from mean

Strictly ten crossoversStrictly ten crossoversSingle pointSingle pointTwo parentsTwo parents

16 / 25

Page 17: Evolving New Strategies

MutationMutation

Single gene flipSingle gene flipOne gene per two chromosomesOne gene per two chromosomes

17 / 25

Page 18: Evolving New Strategies

ResultsResults

Median resultant memberMedian resultant memberJust as good as TFTJust as good as TFTResembled TFTResembled TFT

Five properties were foundFive properties were found Don’t rock the boatDon’t rock the boat Be provocableBe provocable Accept apologiesAccept apologies ForgetForget Accept a rutAccept a rut

18 / 25

Page 19: Evolving New Strategies

ResultsResults

ADJUSTERADJUSTERSpecial chromosome which consistently Special chromosome which consistently

seeks to exploitseeks to exploitTFTTFTMajority of other chromosomesMajority of other chromosomes

19 / 25

Page 20: Evolving New Strategies

ResultsResults

Twenty-five percent of runsTwenty-five percent of runsMedian was betterMedian was betterExploit one chromosomeExploit one chromosome

20 / 25

Page 21: Evolving New Strategies

Results: Why is this important?Results: Why is this important?

Chromosomes had to learnChromosomes had to learnDiscriminatory based on evidenceDiscriminatory based on evidenceSelf adjusting for exploitationSelf adjusting for exploitationNo alienationNo alienation

Break primary rule of first tournamentBreak primary rule of first tournamentBe nice? I don’t think soBe nice? I don’t think so

21 / 25

Page 22: Evolving New Strategies

Results: Misleading?Results: Misleading?

MedianMedianExploitativeExploitativeMenacingMenacingA true criminal?A true criminal?

Fixed size population and tournamentsFixed size population and tournamentsSimulate real evolutionSimulate real evolution

22 / 25

Page 23: Evolving New Strategies

Results: A Slight TwistResults: A Slight Twist

Asexual reproductionAsexual reproductionTFTTFTLess than half of the mediansLess than half of the medians

Changing environmentChanging environmentPlay against everyonePlay against everyoneEveryone starts aggressiveEveryone starts aggressive

Fitness rapidly declinesFitness rapidly declinesFitness begins to even outFitness begins to even outFitness begins to riseFitness begins to rise

23 / 25

Page 24: Evolving New Strategies

ConclusionsConclusions

The GA is good for searching, large, multi-The GA is good for searching, large, multi-dimensional spacesdimensional spaces

Multiple parent crossover helpsMultiple parent crossover helpsArbitrary aspects of evolutionArbitrary aspects of evolution

Hitch hikersHitch hikersExploration vs. ExploitationExploration vs. Exploitation

Selection PressureSelection PressureEvolutionary Commitments can be Evolutionary Commitments can be

irreversibleirreversible

24 / 25

Page 25: Evolving New Strategies

Related TopicsRelated Topics

MutationMutationCrossoverCrossover InversionInversionCoding principlesCoding principlesDominant/RecessiveDominant/RecessiveRate of evolutionRate of evolutionPopulation viscosityPopulation viscositySpeciation and nichesSpeciation and niches

25 / 25