CSE 473: Artificial Intelligence - courses.cs.washington.edu · 2014. 5. 2. · CSE 473: Artificial...

CSE 473: Artificial Intelligence Spring 2014

Markov Models

Hanna Hajishirzi

Many slides adapted from Pieter Abbeel, Dan Klein, Dan Weld, Stuart Russell, Andrew Moore & Luke Zettlemoyer

Markov Chains

Reasoning over Time or Space

§  Often, we want to reason about a sequence of observations
   §  Speech recognition
   §  Robot localization
   §  User attention
   §  Medical monitoring

§  Need to introduce time (or space) into our models

Markov Models (Markov Chains)

§  A Markov model is:
   §  an MDP with no actions (and no rewards)

X1 → X2 → X3 → X4

§  A Markov model includes:
   §  Random variables Xt for all time steps t (the state)
   §  Parameters: called transition probabilities or dynamics, specify how the state evolves over time (also, initial probs)

Markov Models (Markov Chains)

§  A Markov model defines
   §  a joint probability distribution: $P(X_1, \ldots, X_T) = P(X_1) \prod_{t=2}^{T} P(X_t \mid X_{t-1})$

X1 → X2 → X3 → X4

§  One common inference problem:
   §  Compute marginals P(Xt) for all time steps t

Joint Distribution of a Markov Model

X1 → X2 → X3 → X4

§  Joint distribution:

$$P(X_1, X_2, X_3, X_4) = P(X_1)\, P(X_2 \mid X_1)\, P(X_3 \mid X_2)\, P(X_4 \mid X_3)$$

§  More generally:

$$P(X_1, X_2, \ldots, X_T) = P(X_1)\, P(X_2 \mid X_1)\, P(X_3 \mid X_2) \cdots P(X_T \mid X_{T-1}) = P(X_1) \prod_{t=2}^{T} P(X_t \mid X_{t-1})$$

§  Questions to be resolved:
   §  Does this indeed define a joint distribution?
   §  Can every joint distribution be factored this way, or are we making some assumptions about the joint distribution by using this factorization?
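The factorization can be checked numerically. The sketch below is a minimal illustration, not part of the original slides: it assumes a two-state weather chain with an initial distribution of 1.0 sun and the transition table used in the stationary-distribution example later in the deck, and the names `INITIAL`, `TRANSITION`, and `joint_probability` are hypothetical. It also touches the first question above, since summing the factored joint over every length-4 sequence should give 1.

```python
from itertools import product

# Assumed example chain (not specified on this slide): two weather states.
INITIAL = {"sun": 1.0, "rain": 0.0}            # P(X1)
TRANSITION = {                                  # TRANSITION[prev][cur] = P(Xt = cur | Xt-1 = prev)
    "sun": {"sun": 0.9, "rain": 0.1},
    "rain": {"sun": 0.3, "rain": 0.7},
}


def joint_probability(sequence, initial=INITIAL, transition=TRANSITION):
    """Return P(x1, ..., xT) = P(x1) * prod over t >= 2 of P(xt | xt-1)."""
    prob = initial[sequence[0]]
    for prev, cur in zip(sequence, sequence[1:]):
        prob *= transition[prev][cur]
    return prob


# Probability of one particular four-day sequence: 1.0 * 0.9 * 0.1 * 0.7.
example = joint_probability(("sun", "sun", "rain", "rain"))

# Summing over all 2^4 sequences should give 1 if this is a valid joint.
total = sum(joint_probability(seq) for seq in product(INITIAL, repeat=4))

print(example, total)
```

The cost of evaluating one sequence is linear in T, because each factor looks only at the previous state.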

Chain Rule and Markov Models

§  From the chain rule, every joint distribution over $X_1, X_2, X_3, X_4$ can be written as:

$$P(X_1, X_2, X_3, X_4) = P(X_1)\, P(X_2 \mid X_1)\, P(X_3 \mid X_1, X_2)\, P(X_4 \mid X_1, X_2, X_3)$$

§  Assuming that $X_3 \perp X_1 \mid X_2$ and $X_4 \perp X_1, X_2 \mid X_3$ results in the expression posited on the previous slide:

$$P(X_1, X_2, X_3, X_4) = P(X_1)\, P(X_2 \mid X_1)\, P(X_3 \mid X_2)\, P(X_4 \mid X_3)$$

X1 → X2 → X3 → X4

Chain Rule and Markov Models

§  From the chain rule, every joint distribution over $X_1, X_2, \ldots, X_T$ can be written as:

$$P(X_1, X_2, \ldots, X_T) = P(X_1) \prod_{t=2}^{T} P(X_t \mid X_1, X_2, \ldots, X_{t-1})$$

§  Assuming that for all t: $X_t \perp X_1, \ldots, X_{t-2} \mid X_{t-1}$ gives us the expression posited on the earlier slide:

$$P(X_1, X_2, \ldots, X_T) = P(X_1) \prod_{t=2}^{T} P(X_t \mid X_{t-1})$$

X1 → X2 → X3 → X4

Conditional Independence

§  Basic conditional independence:
   §  Past and future are independent given the present
   §  Each time step only depends on the previous
   §  This is called the (first order) Markov property

X1 → X2 → X3 → X4

Implied Conditional Independencies

§  We assumed: $X_3 \perp X_1 \mid X_2$ and $X_4 \perp X_1, X_2 \mid X_3$

§  Do we also have $X_1 \perp X_3, X_4 \mid X_2$?
   §  Yes!
   §  Proof:

$$
\begin{aligned}
P(X_1 \mid X_2, X_3, X_4) &= \frac{P(X_1, X_2, X_3, X_4)}{P(X_2, X_3, X_4)} \\
&= \frac{P(X_1)\, P(X_2 \mid X_1)\, P(X_3 \mid X_2)\, P(X_4 \mid X_3)}{\sum_{x_1} P(x_1)\, P(X_2 \mid x_1)\, P(X_3 \mid X_2)\, P(X_4 \mid X_3)} \\
&= \frac{P(X_1, X_2)}{P(X_2)} \\
&= P(X_1 \mid X_2)
\end{aligned}
$$

X1 → X2 → X3 → X4
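The proof can also be sanity-checked by brute force. The following sketch is illustrative only: it assumes a hypothetical two-state chain (a uniform prior over X1 and the weather transition table used later in these slides), builds the four-variable joint from the Markov factorization, and compares P(X1 | X2, X3, X4) with P(X1 | X2) for every assignment. The names `joint`, `posterior_x1`, and `max_gap` are invented for this example.

```python
from itertools import product

# Assumed example parameters; any valid chain with positive probabilities would do.
STATES = ("sun", "rain")
INITIAL = {"sun": 0.5, "rain": 0.5}
TRANSITION = {
    "sun": {"sun": 0.9, "rain": 0.1},
    "rain": {"sun": 0.3, "rain": 0.7},
}


def joint(x1, x2, x3, x4):
    """Joint probability under the first-order Markov factorization."""
    return INITIAL[x1] * TRANSITION[x1][x2] * TRANSITION[x2][x3] * TRANSITION[x3][x4]


def posterior_x1(x2, x3=None, x4=None):
    """P(X1 | X2=x2, and optionally X3=x3, X4=x4), by summing the joint."""
    weights = {}
    for x1 in STATES:
        weights[x1] = sum(
            joint(x1, x2, b, c)
            for b, c in product(STATES, repeat=2)
            if (x3 is None or b == x3) and (x4 is None or c == x4)
        )
    normalizer = sum(weights.values())
    return {x1: w / normalizer for x1, w in weights.items()}


# Largest difference between P(X1 | X2, X3, X4) and P(X1 | X2) over all assignments.
max_gap = max(
    abs(posterior_x1(x2, x3, x4)[x1] - posterior_x1(x2)[x1])
    for x1, x2, x3, x4 in product(STATES, repeat=4)
)
print(max_gap)
```

A gap at the level of floating-point rounding is consistent with the claim that X3 and X4 carry no extra information about X1 once X2 is known.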

Markov Models (Recap)

§  Explicit assumption for all t: $X_t \perp X_1, \ldots, X_{t-2} \mid X_{t-1}$
§  Consequence: the joint distribution can be written as:

$$P(X_1, X_2, \ldots, X_T) = P(X_1) \prod_{t=2}^{T} P(X_t \mid X_{t-1})$$

§  Implied conditional independencies (try to prove this!):
   §  Past variables are independent of future variables given the present, i.e., if $t_1 < t_2 < t_3$ or $t_1 > t_2 > t_3$ then: $X_{t_1} \perp X_{t_3} \mid X_{t_2}$

§  Additional explicit assumption: $P(X_t \mid X_{t-1})$ is the same for all t

Example: Markov Chain

§  Weather:
   §  States: X = {rain, sun}
   §  Transitions: [Figure: state-transition diagram over rain and sun; only the edge label 0.1 survives extraction.] This is a conditional distribution

§  Initial distribution: 1.0 sun
§  What’s the probability distribution after one step?
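As a worked answer to the one-step question above, assume the sun row of the transition table given in the stationary-distribution example later in these slides, P(sun | sun) = 0.9 and P(rain | sun) = 0.1. Because the initial distribution puts all of its mass on sun, the rain row does not contribute:

```latex
\begin{align*}
P(X_2 = \text{sun})
  &= P(\text{sun} \mid \text{sun})\, P(X_1 = \text{sun})
   + P(\text{sun} \mid \text{rain})\, P(X_1 = \text{rain}) \\
  &= 0.9 \cdot 1.0 + P(\text{sun} \mid \text{rain}) \cdot 0.0 = 0.9, \\
P(X_2 = \text{rain})
  &= P(\text{rain} \mid \text{sun})\, P(X_1 = \text{sun})
   + P(\text{rain} \mid \text{rain})\, P(X_1 = \text{rain}) \\
  &= 0.1 \cdot 1.0 + P(\text{rain} \mid \text{rain}) \cdot 0.0 = 0.1.
\end{align*}
```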

Markov Chain Inference

§  Question: probability of being in state x at time t?
§  Slow answer:
   §  Enumerate all sequences of length t which end in x
   §  Add up their probabilities
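The slow answer can be written down directly. This is a minimal sketch under assumed parameters (the weather chain with the transition table from the stationary-distribution example; `marginal_by_enumeration` is a hypothetical name): it sums the factored joint over every length-t sequence that ends in the queried state, so its running time grows as |states|^(t-1).

```python
from itertools import product

# Assumed example chain, used only to exercise the function.
INITIAL = {"sun": 1.0, "rain": 0.0}
TRANSITION = {
    "sun": {"sun": 0.9, "rain": 0.1},
    "rain": {"sun": 0.3, "rain": 0.7},
}


def marginal_by_enumeration(state, t, initial, transition):
    """P(Xt = state), summing the joint over all length-t sequences ending in state."""
    states = list(initial)
    total = 0.0
    # Every choice of the first t-1 states, followed by the fixed final state.
    for prefix in product(states, repeat=t - 1):
        sequence = prefix + (state,)
        prob = initial[sequence[0]]
        for prev, cur in zip(sequence, sequence[1:]):
            prob *= transition[prev][cur]
        total += prob
    return total


p_sun_day3 = marginal_by_enumeration("sun", 3, INITIAL, TRANSITION)
print(p_sun_day3)
```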

Mini-Forward Algorithm

§  Question: What’s P(X) on some day t?
§  We don’t need to enumerate every sequence!

Forward simulation
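Forward simulation rests on the standard recurrence P(x_t) = sum over x_{t-1} of P(x_t | x_{t-1}) P(x_{t-1}), starting from the known P(x_1). The sketch below is one possible way to code it, not the course's reference implementation; `mini_forward` and the weather parameters (taken from the transition table in the stationary-distribution example) are assumptions made for illustration.

```python
# Assumed example chain; TRANSITION[prev][cur] = P(Xt = cur | Xt-1 = prev).
TRANSITION = {
    "sun": {"sun": 0.9, "rain": 0.1},
    "rain": {"sun": 0.3, "rain": 0.7},
}


def mini_forward(initial, transition, t):
    """Return P(Xt) as a dict by pushing P(X1) forward t-1 steps."""
    belief = dict(initial)
    for _ in range(t - 1):
        # One step of the recurrence: new belief in cur sums over all previous states.
        belief = {
            cur: sum(transition[prev][cur] * belief[prev] for prev in belief)
            for cur in belief
        }
    return belief


from_sun = mini_forward({"sun": 1.0, "rain": 0.0}, TRANSITION, 3)
from_rain = mini_forward({"sun": 0.0, "rain": 1.0}, TRANSITION, 3)
print(from_sun, from_rain)
```

Each step costs on the order of |states|^2 operations, so computing P(Xt) is linear in t instead of exponential as in sequence enumeration.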

Example

§  From initial observation of sun

§  From initial observation of rain

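[Figure: for each starting observation, the distributions P(X1), P(X2), P(X3), ..., P(X∞) over rain and sun; the numeric values did not survive extraction, apart from the diagram labels rain, sun, 0.9 and 0.1.]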

Stationary Distributions

§  If we simulate the chain long enough:
   §  What happens?
   §  Uncertainty accumulates
   §  Eventually, we have no idea what the state is!

§  Stationary distributions:
   §  For most chains, the distribution we end up in is independent of the initial distribution
   §  Called the stationary distribution of the chain
   §  Usually, can only predict a short time out

Stationary Distributions

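Example: Stationary Distributions

§  Question: What’s P(X) at time t = infinity?

X1 → X2 → X3 → X4

Xt-1    Xt      P(Xt | Xt-1)
sun     sun     0.9
sun     rain    0.1
rain    sun     0.3
rain    rain    0.7

$$
\begin{aligned}
P_\infty(\text{sun}) &= P(\text{sun} \mid \text{sun})\, P_\infty(\text{sun}) + P(\text{sun} \mid \text{rain})\, P_\infty(\text{rain}) \\
P_\infty(\text{rain}) &= P(\text{rain} \mid \text{sun})\, P_\infty(\text{sun}) + P(\text{rain} \mid \text{rain})\, P_\infty(\text{rain})
\end{aligned}
$$

$$
\begin{aligned}
P_\infty(\text{sun}) &= 0.9\, P_\infty(\text{sun}) + 0.3\, P_\infty(\text{rain}) \\
P_\infty(\text{rain}) &= 0.1\, P_\infty(\text{sun}) + 0.7\, P_\infty(\text{rain})
\end{aligned}
$$

$$
\begin{aligned}
P_\infty(\text{sun}) &= 3\, P_\infty(\text{rain}) \\
P_\infty(\text{rain}) &= \tfrac{1}{3}\, P_\infty(\text{sun})
\end{aligned}
$$

Also: $P_\infty(\text{sun}) + P_\infty(\text{rain}) = 1$

$$P_\infty(\text{sun}) = 3/4, \qquad P_\infty(\text{rain}) = 1/4$$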
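The same fixed point can be approached numerically by applying the transition table repeatedly until the distribution stops changing. This is a sketch of that idea using the table from this example, not a general-purpose solver: `stationary_by_iteration`, its tolerance, and its step limit are choices made for illustration, and convergence is assumed rather than guaranteed for arbitrary chains.

```python
# Transition table from this example; TRANSITION[prev][cur] = P(Xt = cur | Xt-1 = prev).
TRANSITION = {
    "sun": {"sun": 0.9, "rain": 0.1},
    "rain": {"sun": 0.3, "rain": 0.7},
}


def stationary_by_iteration(transition, tol=1e-12, max_steps=10_000):
    """Iterate the forward update from a uniform start until it stops changing."""
    states = list(transition)
    belief = {s: 1.0 / len(states) for s in states}
    for _ in range(max_steps):
        updated = {
            cur: sum(transition[prev][cur] * belief[prev] for prev in states)
            for cur in states
        }
        if max(abs(updated[s] - belief[s]) for s in states) < tol:
            return updated
        belief = updated
    return belief  # best estimate if the tolerance was not reached


stationary = stationary_by_iteration(TRANSITION)
print(stationary)
```

For this chain the iteration settles near 3/4 sun and 1/4 rain, matching the algebraic solution.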

Pac-man Markov Chain

§  Pac-man knows the ghost’s initial position, but gets no observations!

Web Link Analysis

§  PageRank over a web graph
   §  Each web page is a state
   §  Initial distribution: uniform over pages
   §  Transitions:
      §  With prob. c, follow a random outlink (solid lines)
      §  With prob. 1-c, uniform jump to a random page (dotted lines, not all shown)
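The transition rule above can be simulated on a toy graph. The sketch below is an illustration under stated assumptions, not Google's algorithm as deployed: the four-page graph `GRAPH`, the value c = 0.85, the fixed number of steps, and the handling of pages with no outlinks (treated as a uniform jump) are all choices made for this example, and `pagerank` is a hypothetical function name.

```python
def pagerank(outlinks, c=0.85, steps=100):
    """Approximate the stationary distribution of the random-surfer chain."""
    pages = list(outlinks)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}           # initial distribution: uniform over pages
    for _ in range(steps):
        # With prob. 1-c the surfer jumps to a uniformly random page.
        updated = {p: (1.0 - c) / n for p in pages}
        for page, links in outlinks.items():
            # With prob. c the surfer follows a random outlink.
            # Assumption: a page without outlinks jumps uniformly instead.
            targets = links if links else pages
            share = c * rank[page] / len(targets)
            for target in targets:
                updated[target] += share
        rank = updated
    return rank


# Hypothetical web graph: page -> list of pages it links to.
GRAPH = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(GRAPH)
print(ranks)
```

In this toy graph page c has the most inlinks and ends up with the largest share, while page d, which nothing links to, keeps only the mass from uniform jumps.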

PageRank

§  Stationary distribution
   §  Will spend more time on highly reachable pages
   §  E.g. many ways to get to the Acrobat Reader download page
   §  Somewhat robust to link spam
   §  Google 1.0 returned the set of pages containing all your keywords in decreasing rank. Now all search engines use link analysis along with many other factors (rank actually getting less important over time)
