Enhancements for Multi-Player Monte-Carlo Tree Search

J. (Pim) A.M. Nijssen, Mark H.M. Winands

29 September 2010

5 October 2010 Enhancements for Multi-Player Monte-Carlo Tree Search 2

Overview
• Introduction
• Progressive History
• MP-MCTS-Solver
• Test domains
• Experiments and Results
• Conclusions
• Future Research

Introduction
• Enhancements for Multi-Player Monte-Carlo Tree Search
  – More than 2 players
  – Techniques
    • max^n (Luckhardt and Irani, 1986)
    • Paranoid (Sturtevant and Korf, 2000)
  – Games
    • Chinese Checkers
    • Hearts

• Monte-Carlo Tree Search
  – Best-first search technique
  – Monte Carlo simulations
  – Four phases
    • Selection (UCT)
    • Expansion (1 node per sample)
    • Playout (ε-greedy)
    • Backpropagation
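The selection phase can be illustrated with a minimal sketch of the standard UCT rule. The function names, the (score, visits) pair layout, and the choice to try unvisited children first are assumptions made for illustration; the slides only name UCT and, later, the setting C = 0.2.

```python
import math


def uct_value(child_score, child_visits, parent_visits, c=0.2):
    """UCT value of a child: average score plus an exploration bonus."""
    if child_visits == 0:
        # Assumption: unvisited children are always tried first.
        return math.inf
    exploitation = child_score / child_visits
    exploration = c * math.sqrt(math.log(parent_visits) / child_visits)
    return exploitation + exploration


def select_child(children, parent_visits, c=0.2):
    """Return the index of the child with the highest UCT value.

    `children` is a list of (score, visits) pairs, where score is the
    cumulative reward for the player to move at the parent node.
    """
    values = [uct_value(s, n, parent_visits, c) for s, n in children]
    return max(range(len(children)), key=values.__getitem__)
```

With a small C such as 0.2 the exploration bonus is modest, so selection leans heavily on the average score once every child has been visited.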

• Multi-Player Monte-Carlo Tree Search
  – Each node stores a tuple of size N (N = number of players)
  – Each game returns a tuple of size N
    • The winner gets a score of 1, losers get a score of 0
    • The score is split in case of multiple winners
      – e.g. [½, ½, 0] is returned if Players 1 and 2 both win
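The reward tuple and its backpropagation can be sketched as follows. The names `reward_tuple` and `backpropagate`, the 0-based player indices, and the dictionary node layout are illustrative assumptions, not taken from the authors' program.

```python
from fractions import Fraction


def reward_tuple(winners, num_players):
    """Split a total score of 1 equally among the winners; losers get 0.

    `winners` is a collection of 0-based player indices.
    """
    if not winners:
        raise ValueError("at least one winner is required")
    share = Fraction(1, len(winners))
    return [share if p in winners else Fraction(0) for p in range(num_players)]


def backpropagate(path, reward):
    """Add the reward tuple to every node on the path from leaf to root.

    Each node is a dict with a 'visits' counter and a 'scores' list of size N.
    """
    for node in path:
        node["visits"] += 1
        node["scores"] = [s + r for s, r in zip(node["scores"], reward)]
```

For three players where the first two both win, `reward_tuple({0, 1}, 3)` yields the tuple [½, ½, 0] from the example above.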

• Enhancements for Multi-Player Monte-Carlo Tree Search
  – Progressive History
  – Multi-Player Monte-Carlo Tree Search Solver (MP-MCTS-Solver)

Progressive History
• Combination of Progressive Bias (Chaslot et al., 2008) and the history heuristic (Schaeffer, 1983)
• Move selection strategy uses action information
  • More information available
  • Information is less accurate
  • Influence decreases over time

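Progressive History

$$v_i = \frac{s_i}{n_i} + C \times \sqrt{\frac{\ln n_p}{n_i}} + \frac{s_a}{n_a} \times \frac{W}{n_i - s_i + 1}$$

Here s_i and n_i are the score and visit count of child i, n_p is the visit count of its parent, s_a and n_a are the score and play count of move a, and C and W are constants.
• History heuristic: s_a / n_a, the average score of move a over the playouts in which it was played
• Progressive Bias: W / (n_i − s_i + 1), whose influence decreases as the losses of child i accumulate
• Divide by number of losses: the denominator uses n_i − s_i (visits minus score) instead of the visit count n_i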
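A minimal sketch of the Progressive History selection value, assuming it is the UCT value plus the history heuristic s_a / n_a weighted by a Progressive Bias term W / (n_i − s_i + 1). The function name, the argument names, and the handling of moves with no history are illustrative assumptions.

```python
import math


def progressive_history_value(s_i, n_i, n_p, s_a, n_a, w, c=0.2):
    """UCT value of child i plus a history term that fades with losses.

    s_i, n_i: score and visit count of child i (n_i must be positive)
    n_p:      visit count of the parent node
    s_a, n_a: score and play count of move a in the history table
    w, c:     constants W and C
    """
    uct = s_i / n_i + c * math.sqrt(math.log(n_p) / n_i)
    # Assumption: a move without any history contributes nothing.
    history = s_a / n_a if n_a > 0 else 0.0
    # Dividing by (losses + 1): the bias shrinks as child i keeps losing.
    bias = w / (n_i - s_i + 1)
    return uct + history * bias
```

With w = 0 the function reduces to plain UCT, which makes it easy to compare the two selection strategies in the same program.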

MP-MCTS-Solver
• Multi-Player version of MCTS-Solver (Winands et al., 2008)
• Updating game-theoretical values
• Update rules
  – Standard (mate in one, one winner)
  – Paranoid
  – First winner
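The standard update rule can be sketched under one reading of "mate in one, one winner": a node becomes proven if some child is a proven win for the player to move, or if all children are proven with the same value. This reading, the function name, and the use of `None` for unproven children are assumptions; the Paranoid and First winner rules are not sketched because the slides do not spell them out.

```python
def standard_update(player_to_move, child_values):
    """Return the proven value of a node, or None if it stays unproven.

    `child_values` holds one entry per child: a proven reward tuple such as
    (0, 1, 0), or None when the child is not yet proven.
    `player_to_move` is a 0-based player index.
    """
    # "Mate in one": a child is a proven win for the player to move.
    for value in child_values:
        if value is not None and value[player_to_move] == 1:
            return value
    # "One winner": every child is proven and all agree on the outcome.
    if child_values and all(v is not None for v in child_values):
        first = child_values[0]
        if all(v == first for v in child_values):
            return first
    return None
```

When the children are proven wins for different opponents, this rule leaves the node unproven; that is the case the other two update rules are meant to resolve.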

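MP-MCTS-Solver

[Figure: example three-player search tree with nodes E, F, G, H, I on levels labelled Player 3 and Player 1, and proven values [0,1,0] and [1,0,0] at the nodes. Annotations in the figure: [?]; Paranoid [0,1,0]; First winner [1,0,0].]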

Test domains
• Multi-player games
• Zero-sum
• Perfect information
• Focus
• Chinese Checkers

Focus
• Capturing pieces by creating stacks
• Goal
  – Total number of pieces captured
  – Number of pieces captured from each opponent

Focus
• Moving
  – Only stacks one owns
  – Orthogonally
  – Move as many squares as the number of pieces
  – Maximum stack size is 5
• Capture pieces by creating larger stacks

Chinese Checkers
• Goal: move pieces to other side of the board
• Move pieces to adjacent fields or jump over other pieces
  – Sequential jumps

Experiments and Results
• Processor: AMD64 2.4 GHz
• Programming language: Java 6
• MCTS settings: C = 0.2, ε = 0.05
• Time: 2.5 s per turn
• 3360 games per tournament
• All possible configurations
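To get a feel for what 3360 games per tournament can resolve, the sketch below computes an approximate 95% margin of error for a win rate. It assumes independent games and a normal approximation to the binomial distribution; the slides do not state how significance was assessed, so this is only a rough guide, and the function name is hypothetical.

```python
import math


def win_rate_margin(win_rate, games=3360, z=1.96):
    """Half-width of an approximate 95% confidence interval for a win rate.

    Assumes independent games (binomial outcomes) and a normal approximation.
    """
    return z * math.sqrt(win_rate * (1.0 - win_rate) / games)
```

Under these assumptions the margin at a 50% win rate is about 1.7 percentage points, so win rates a few points above 50% can already be distinguished from an even match.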

Experiments and Results
• Progressive History in Focus

W       2 players   3 players   4 players
0       52.0%       51.2%       50.8%
0.5     59.0%       61.1%       57.5%
0.1     59.8%       63.0%       58.9%
0.25    61.3%       62.9%       59.4%
0.5     64.1%       65.5%       59.9%
1       66.0%       65.4%       58.2%
3       62.2%       65.2%       59.6%
5       57.9%       63.8%       59.6%
7.5     51.3%       60.6%       57.1%
10      47.4%       57.8%       56.9%

Experiments and Results
• Progressive History in Chinese Checkers

W       2 players   3 players   4 players
0.25    52.8%       59.0%       56.9%
0.5     58.2%       62.8%       58.3%
1       67.8%       63.5%       61.9%
3       79.9%       66.7%       66.4%
5       83.5%       65.8%       66.8%
10      83.2%       65.3%       69.6%
15      81.0%       65.0%       69.2%
20      60.8%       60.2%       63.2%

Experiments and Results
• Divide by number of losses

Game               2 players   3 players   4 players
Focus              64.8%       61.0%       52.0%
Chinese Checkers   57.6%       54.8%       53.9%

Experiments and Results
• MP-MCTS-Solver in Focus

Update rule    2 players   3 players   4 players
Standard       53.0%       54.9%       53.3%
Paranoid       51.9%       50.4%       44.9%
First winner   52.8%       51.5%       43.4%

Conclusions
• Progressive History
  – Significant enhancement in Chinese Checkers and Focus
  – Dividing by the number of losses in the Progressive Bias part increases performance
• MP-MCTS-Solver
  – Small but significant enhancement in Focus
  – Standard update rule works best

Future Research
• Test Progressive History in other games
• Compare Progressive History with similar techniques, like RAVE, prior knowledge (Gelly and Silver, 2007), Gibbs Sampling (Björnsson and Finnsson, 2009), etc.
• Create new update rules for MP-MCTS-Solver
