Programs that Play better than Us

Melvin Zhang

melvin@melvinzhang.net

@melvinzhangzy

https://en.wikipedia.org/wiki/File:ST_Battle_Chess.png

https://en.wikipedia.org/wiki/Deep_Blue_(chess_computer)

Deep Blue (IBM, 1996)

http://afflictor.com/2012/09/11/chess-programs-regularly-play-at-good-amateur-level/

Game tree

Optimal play

[Figure: game tree in which the max player and the min player move on alternating levels; the terminal nodes are labelled with 1s and 0s.]
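As a minimal sketch of optimal play on a tree small enough to search down to its terminal nodes: the max player takes the largest child value and the min player the smallest. The nested-list tree and the name `optimal_value` are illustrative assumptions, not taken from the slides.

```python
from typing import Union

# A leaf is a terminal value (1 or 0); an inner node is a list of child trees.
Tree = Union[int, list]

def optimal_value(node: Tree, max_to_move: bool = True) -> int:
    """Value of `node` when both players play optimally to the end."""
    if isinstance(node, int):            # terminal position: value is known
        return node
    child_values = [optimal_value(child, not max_to_move) for child in node]
    return max(child_values) if max_to_move else min(child_values)

# Hypothetical tree: max moves at the root, min replies, leaves are terminal.
tree = [[1, 0], [1, 1]]
print(optimal_value(tree))  # 1: the right branch guarantees a win for max
```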

Chess has about 10^46 states!
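To get a feel for that number, here is a rough back-of-the-envelope calculation, taking the figure as 10^46 states. The rate of one billion states per second is an assumption chosen only for illustration.

```python
STATES = 10**46                      # state count quoted on the slide
STATES_PER_SECOND = 10**9            # assumed enumeration speed, illustration only
SECONDS_PER_YEAR = 365 * 24 * 3600

# Time needed just to visit every state once at the assumed speed.
years = STATES / STATES_PER_SECOND / SECONDS_PER_YEAR
print(f"{years:.1e} years")
```

At that assumed speed the answer is on the order of 10^29 years, which is why searching the whole game tree is not an option.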

Minimax algorithm

[Figure: game tree searched down to a cut-off, with max-player and min-player levels; the positions at the cut-off carry estimated values .7, .1, .6, .9.]
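A minimal sketch of minimax with a cut-off: the search stops at a fixed depth and a value function supplies an estimate there. The tree shape, the node names, and the helpers `children` and `value` are hypothetical; only the four estimates .7, .1, .6, .9 come from the slide.

```python
def minimax(state, depth, max_to_move, children, value):
    """Depth-limited minimax: use `value` at the cut-off or at a terminal state."""
    kids = children(state)
    if depth == 0 or not kids:
        return value(state)
    results = [minimax(k, depth - 1, not max_to_move, children, value) for k in kids]
    return max(results) if max_to_move else min(results)

# Hypothetical two-level tree; the leaf estimates are the ones shown on the slide.
TREE = {"root": ["a", "b"], "a": ["a1", "a2"], "b": ["b1", "b2"]}
ESTIMATE = {"a1": 0.7, "a2": 0.1, "b1": 0.6, "b2": 0.9}

def children(state):
    return TREE.get(state, [])

def value(state):
    # Assumed value function: look up the estimate, default to "even" (0.5).
    return ESTIMATE.get(state, 0.5)

print(minimax("root", 2, True, children, value))  # 0.6
```

Min would answer move a with the .1 position and move b with the .6 position, so max prefers b.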

https://stockfishchess.org/

Stockfish

https://tests.stockfishchess.org/

Testing AI changes is crucial

Value functions are hard!

http://mathworld.wolfram.com/Go.html

http://www.remi-coulom.fr/CrazyStone/

Remi Coulom

http://www.wired.com/2014/05/the-world-of-computer-go/

Monte Carlo evaluations

[Figure: game tree with a cut-off line and alternating max-player and min-player levels.]
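One way to read a Monte Carlo evaluation: instead of a hand-written value function, play many random games from the position and use the win rate as its value. The sketch below assumes a toy subtraction game (take 1 to 3 stones, whoever takes the last stone wins); the game and all function names are illustrative, not from the talk.

```python
import random

def legal_moves(stones):
    return [m for m in (1, 2, 3) if m <= stones]

def random_playout(stones, max_to_move, rng):
    """Play random moves to the end; return 1 if the max player takes the last stone."""
    while True:
        stones -= rng.choice(legal_moves(stones))
        if stones == 0:
            return 1 if max_to_move else 0   # the player who just moved wins
        max_to_move = not max_to_move

def monte_carlo_value(stones, max_to_move, playouts=1000, seed=0):
    """Estimate the value of a non-terminal position (stones >= 1) as a win rate."""
    rng = random.Random(seed)
    wins = sum(random_playout(stones, max_to_move, rng) for _ in range(playouts))
    return wins / playouts

print(monte_carlo_value(10, max_to_move=True))
```

The estimate reflects random play by both sides, so it is only a rough guide to the value under optimal play.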

Monte Carlo Tree Search (MCTS)
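A compact sketch of MCTS in its common UCT form, with the usual four steps: selection, expansion, simulation, backpropagation. It assumes the same toy subtraction game (take 1 to 3 stones, last stone wins); the `Node` class, the exploration constant 1.4, and the function names are assumptions for illustration, not the talk's implementation.

```python
import math
import random

class Node:
    def __init__(self, stones, max_to_move, parent=None):
        self.stones = stones            # stones left in the pile
        self.max_to_move = max_to_move  # whose turn it is at this node
        self.parent = parent
        self.children = {}              # move -> Node
        self.visits = 0
        self.wins = 0.0                 # wins for the player who moved INTO this node

    def untried_moves(self):
        return [m for m in (1, 2, 3) if m <= self.stones and m not in self.children]

def uct_child(node, c=1.4):
    # Win rate plus an exploration bonus that shrinks as a child is visited more.
    return max(node.children.values(),
               key=lambda ch: ch.wins / ch.visits
               + c * math.sqrt(math.log(node.visits) / ch.visits))

def rollout(stones, max_to_move, rng):
    """Random playout; True if the max player takes the last stone."""
    if stones == 0:
        return not max_to_move          # the previous mover took the last stone
    while True:
        stones -= rng.choice([m for m in (1, 2, 3) if m <= stones])
        if stones == 0:
            return max_to_move
        max_to_move = not max_to_move

def mcts_best_move(stones, iterations=2000, seed=0):
    rng = random.Random(seed)
    root = Node(stones, max_to_move=True)
    for _ in range(iterations):
        node = root
        # 1. Selection: follow UCT while the node is fully expanded and not terminal.
        while node.stones > 0 and not node.untried_moves():
            node = uct_child(node)
        # 2. Expansion: add one new child if the node is not terminal.
        if node.stones > 0:
            move = rng.choice(node.untried_moves())
            child = Node(node.stones - move, not node.max_to_move, parent=node)
            node.children[move] = child
            node = child
        # 3. Simulation: random playout from the new node.
        max_won = rollout(node.stones, node.max_to_move, rng)
        # 4. Backpropagation: credit each node from its mover's point of view.
        while node is not None:
            node.visits += 1
            mover_is_max = not node.max_to_move
            if max_won == mover_is_max:
                node.wins += 1
            node = node.parent
    # Recommend the most visited move at the root.
    return max(root.children, key=lambda m: root.children[m].visits)

print(mcts_best_move(5))
```

Storing wins from the point of view of the player who moved into a node lets the same UCT formula serve both the max and the min player.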

AlphaGo by Google DeepMind

https://deepmind.com/research/alphago/

https://gogameguru.com/alphago-races-ahead-2-0-lee-sedol/

MCTS + Policy and value networks

http://www.nature.com/nature/journal/v529/n7587/full/nature16961.html

Some games have hidden information!

http://magic.wizards.com/en/events/coverage/gpsin15/father-son-2015-06-27

https://magarena.github.io

Determinization: choose a random instance of the hidden information during simulation
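A minimal sketch of the determinization idea: sample several concrete guesses at the hidden information, score every move in each guess, and pick the move with the best total. In a real program `evaluate` would be a simulation or search on the sampled state; here it is a stand-in, and the card game, `sample_card`, and `score` are hypothetical.

```python
import random
from collections import defaultdict

def determinized_choice(moves, sample_hidden, evaluate, samples=100, seed=0):
    """Pick the move with the best total score over randomly sampled hidden states."""
    rng = random.Random(seed)
    totals = defaultdict(float)
    for _ in range(samples):
        hidden = sample_hidden(rng)      # one random instance of the hidden information
        for move in moves:
            totals[move] += evaluate(move, hidden)
    return max(moves, key=lambda m: totals[m])

# Hypothetical toy decision: the opponent holds one unseen card, numbered 1 to 10.
def sample_card(rng):
    return rng.randint(1, 10)

def score(move, card):
    if move == "attack":
        return 1.0 if card <= 2 else 0.0  # attacking only pays off against a low card
    return 0.5                             # blocking gives a safe middling result

print(determinized_choice(["attack", "block"], sample_card, score))
```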

Comparison of Minimax and MCTS

                       Minimax   MCTS
At 1s thinking time:   1         0.88
                       1         1.71

Open problems

MCTS is bad at tight tactical play.

MCTS plays badly when it is behind in the game.
