Upload
reynold-fox
View
213
Download
1
Embed Size (px)
Citation preview
SAL: A Game Learning Machine
Joel Paulson & Brian LannersJoel Paulson & Brian Lanners
Introduction
Why AI game playing?Why AI game playing? Why AI game learning?Why AI game learning? SAL (Michael Gherrity, 1993)SAL (Michael Gherrity, 1993) Search and LearningSearch and Learning
Consistency Search
Basic ConceptBasic Concept Reasons for UseReasons for Use
Allows for Errors in Evaluation FunctionAllows for Errors in Evaluation Function Pathological GamesPathological Games
Procedure
Consistent PositionsConsistent Positions Evaluation of a position is equal to its Evaluation of a position is equal to its
minimax valueminimax value Inconsistent PositionsInconsistent Positions
Identifying and Correcting ErrorsIdentifying and Correcting Errors
General Example
If B is inconsistent, then one of the If B is inconsistent, then one of the evaluations of B, D, or E is incorrectevaluations of B, D, or E is incorrect
Consistency Search in Play
Organization of SAL
Game Independent KernelGame Independent KernelConsistency Search AlgorithmConsistency Search AlgorithmEvaluation FunctionsEvaluation Functions
Game Specific Move Generator Game Specific Move Generator Incorporates rules of gameIncorporates rules of gameThree Subroutines: MoveGenerator, Three Subroutines: MoveGenerator,
MakeMove, EndOfGameMakeMove, EndOfGame
Features
Used as input for Neural NetworkUsed as input for Neural Network Feature Discovery ProblemFeature Discovery Problem Features in SALFeatures in SAL
Neural Networks
SAL uses features for InputSAL uses features for Input Weights altered following each game using Weights altered following each game using
Temporal Difference LearningTemporal Difference Learning
Performance of SAL Tic-Tac-ToeTic-Tac-Toe Connect FourConnect Four ChessChess