44
Programs that Play better than Us Melvin Zhang [email protected] @melvinzhangzy

Programs that Play better than Us

Embed Size (px)

Citation preview

Page 1: Programs that Play better than Us

Programs that Play better than Us

Melvin [email protected]

@melvinzhangzy

Page 2: Programs that Play better than Us

https://en.wikipedia.org/wiki/File:ST Battle Chess.png

Page 3: Programs that Play better than Us

https://en.wikipedia.org/wiki/Deep Blue (chess computer)

Deep Blue (IBM, 1996)

Page 4: Programs that Play better than Us

http://afflictor.com/2012/09/11/chess-programs-regularly-play-at-good-amateur-level/

Page 5: Programs that Play better than Us

Game tree

Page 6: Programs that Play better than Us

Optimal play

Terminal

min player

max player

Page 7: Programs that Play better than Us

Optimal play

1 01 1 1Terminal

min player

max player

Page 8: Programs that Play better than Us

Optimal play

1 01 1 1

0

Terminal

min player

max player

Page 9: Programs that Play better than Us

Optimal play

1 01 1 1

0 1

Terminal

min player

max player

Page 10: Programs that Play better than Us

Optimal play

1 01 1 1

0 1

1

Terminal

min player

max player

Page 11: Programs that Play better than Us

Chess has about 1046 states!

Page 12: Programs that Play better than Us

Minimax algorithm

Cut-off

min player

max player

Page 13: Programs that Play better than Us

Minimax algorithm

.7 .1 .6 .9Cut-off

min player

max player

Page 14: Programs that Play better than Us

Minimax algorithm

.7 .1 .6 .9

.1

Cut-off

min player

max player

Page 15: Programs that Play better than Us

Minimax algorithm

.7 .1 .6 .9

.1 .6

Cut-off

min player

max player

Page 16: Programs that Play better than Us

Minimax algorithm

.7 .1 .6 .9

.1 .6

.6

Cut-off

min player

max player

Page 17: Programs that Play better than Us

https://stockfishchess.org/

Stockfish

Page 18: Programs that Play better than Us

https://tests.stockfishchess.org/

Testing AI changes is crucial

Page 19: Programs that Play better than Us

Value functions are hard!

Page 20: Programs that Play better than Us

http://mathworld.wolfram.com/Go.html

Page 21: Programs that Play better than Us

http://www.remi-coulom.fr/CrazyStone/

Remi Coulom

Page 22: Programs that Play better than Us

http://www.wired.com/2014/05/the-world-of-computer-go/

Page 23: Programs that Play better than Us

Monte Carlo evaluations

Cut-off

min player

max player

Page 24: Programs that Play better than Us

Monte Carlo evaluations

Cut-off

min player

max player

Page 25: Programs that Play better than Us

Monte Carlo evaluations

Cut-off

min player

max player

Page 26: Programs that Play better than Us

Monte Carlo evaluations

Cut-off

min player

max player

Page 27: Programs that Play better than Us

Monte Carlo evaluations

Cut-off

min player

max player

.7

Page 28: Programs that Play better than Us

Monte Carlo Tree Search (MCTS)

Page 29: Programs that Play better than Us

by Google Deepmind

https://deepmind.com/research/alphago/

Page 30: Programs that Play better than Us

https://gogameguru.com/alphago-races-ahead-2-0-lee-sedol/

Page 31: Programs that Play better than Us

MCTS + Policy and value networks

Page 32: Programs that Play better than Us

http://www.nature.com/nature/journal/v529/n7587/full/nature16961.html

MCTS + Policy and value networks

Page 33: Programs that Play better than Us

Some games have hidden information!

Page 34: Programs that Play better than Us

http://magic.wizards.com/en/events/coverage/gpsin15/father-son-2015-06-27

Page 35: Programs that Play better than Us

https://magarena.github.io

Page 36: Programs that Play better than Us
Page 37: Programs that Play better than Us

Determinization: choose a random instance of thehidden information during simulation

Page 38: Programs that Play better than Us

Comparison of Minimax and MCTS

At 1s thinking time:Minimax MCTS

1 0.88

Page 39: Programs that Play better than Us

Comparison of Minimax and MCTS

At 1s thinking time:Minimax MCTS

1 0.88

At 4s thinking time:Minimax MCTS

1 1.71

Page 40: Programs that Play better than Us

Open problems

Page 41: Programs that Play better than Us

MCTS is bad at tight tactical play.

Page 42: Programs that Play better than Us

MCTS plays badly when it is behind in the game.

Page 43: Programs that Play better than Us

Further readings

Page 44: Programs that Play better than Us

Further readings