
CSE-571 Robotics

Planning and Control:

Markov Decision Processes

Problem Classes

• Deterministic vs. stochastic actions

• Full vs. partial observability

Deterministic, fully observable

Stochastic, Fully Observable

Stochastic, Partially Observable

Markov Decision Process (MDP)

[Figure: example MDP graph with states s1–s5, stochastic transition probabilities (e.g., 0.7/0.3, 0.9/0.1, 0.8/0.2, 0.99/0.01), and rewards r = -10, 20, 0, 1, 0]


Markov Decision Process (MDP)

• Given:
  • States x
  • Actions u
  • Transition probabilities p(x' | u, x)
  • Reward / payoff function r(x, u)

• Wanted:
  • Policy π(x) that maximizes the future expected reward
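As a concrete sketch (not from the slides), a finite MDP with exactly these ingredients can be stored as plain arrays; the variable names and the random transition model below are illustrative assumptions.

```python
import numpy as np

# Minimal finite-MDP container matching the definition above:
# states x, actions u, transition model p(x' | u, x), reward r(x, u).
num_states, num_actions = 5, 2
rng = np.random.default_rng(0)

# p[u, x, x'] = probability of landing in x' after taking action u in state x.
# Dirichlet samples guarantee each row is a valid distribution (sums to 1).
p = rng.dirichlet(np.ones(num_states), size=(num_actions, num_states))

# r[x, u] = immediate reward; e.g., give one state a high payoff, as in the figure.
r = np.zeros((num_states, num_actions))
r[4, :] = 20.0

def expected_next_value(V, x, u):
    """E[ V(x') | x, u ] = sum_{x'} p(x' | u, x) * V(x')."""
    return p[u, x] @ V
```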

Rewards and Policies

• Policy (general case):
    π : z_{1:t-1}, u_{1:t-1} → u_t

• Policy (fully observable case):
    π : x_t → u_t

• Expected cumulative payoff:
    R_T = E[ Σ_{τ=1}^{T} γ^τ r_{t+τ} ]

• T = 1: greedy policy
• T > 1: finite-horizon case, typically no discount
• T = ∞: infinite-horizon case, finite reward if discount γ < 1
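As a small illustrative sketch (assumptions: a single sampled reward sequence rather than the expectation, and γ = 0.9), the discounted sum above can be computed directly:

```python
def discounted_return(rewards, gamma):
    """R_T = sum_{tau=1}^{T} gamma^tau * r_{t+tau} for one sampled reward sequence."""
    return sum(gamma ** (tau + 1) * reward for tau, reward in enumerate(rewards))

# T = 3: later rewards are weighted down geometrically by the discount.
print(discounted_return([0.0, 1.0, 20.0], gamma=0.9))   # 0.9**2 * 1 + 0.9**3 * 20 = 15.39
```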

Policies contd.

• Expected cumulative payoff of policy π:
    R_T^π(x_t) = E[ Σ_{τ=1}^{T} γ^τ r_{t+τ} | u_{t+τ} = π(z_{1:t+τ-1}, u_{1:t+τ-1}) ]

• Optimal policy:
    π* = argmax_π R_T^π(x_t)

• 1-step optimal policy:
    π_1(x) = argmax_u r(x, u)

• Value function of 1-step optimal policy:
    V_1(x) = γ max_u r(x, u)

2-step Policies

• Optimal policy:
    π_2(x) = argmax_u [ r(x, u) + ∫ V_1(x') p(x' | u, x) dx' ]

• Value function:
    V_2(x) = γ max_u [ r(x, u) + ∫ V_1(x') p(x' | u, x) dx' ]

T-step Policies

• Optimal policy:
    π_T(x) = argmax_u [ r(x, u) + ∫ V_{T-1}(x') p(x' | u, x) dx' ]

• Value function:
    V_T(x) = γ max_u [ r(x, u) + ∫ V_{T-1}(x') p(x' | u, x) dx' ]
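A minimal sketch of one backup of this recursion for a finite MDP, using the same array layout as the MDP sketch earlier (P[u, x, x'] for the transition model, R[x, u] for rewards, both assumptions); the integral over x' becomes a sum over successor states.

```python
import numpy as np

def bellman_backup(V_prev, P, R, gamma):
    """One step of the recursion:
    V_T(x) = gamma * max_u [ r(x, u) + sum_{x'} p(x' | u, x) * V_{T-1}(x') ]."""
    # Q[x, u] = r(x, u) + E[ V_{T-1}(x') | x, u ]
    Q = R + np.einsum('uxn,n->xu', P, V_prev)
    return gamma * Q.max(axis=1), Q.argmax(axis=1)   # (V_T, pi_T)
```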

Infinite Horizon

• Optimal value function:
    V_∞(x) = γ max_u [ r(x, u) + ∫ V_∞(x') p(x' | u, x) dx' ]

• This recursion is the Bellman equation.
• Its fixed point is the optimal value function, from which the optimal policy follows.
• Satisfying the Bellman equation is a necessary and sufficient condition for optimality.


Value Iteration

• for all x do
    V̂(x) ← r_min
• endfor
• repeat until convergence
  • for all x do
      V̂(x) ← γ max_u [ r(x, u) + ∫ V̂(x') p(x' | u, x) dx' ]
  • endfor
• endrepeat
• Policy extraction:
    π(x) = argmax_u [ r(x, u) + ∫ V̂(x') p(x' | u, x) dx' ]
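A compact sketch of this loop for a finite MDP, reusing the bellman_backup sketch above; the tolerance-based convergence test is an assumption, since the slides only say "repeat until convergence".

```python
import numpy as np

def value_iteration(P, R, gamma=0.9, tol=1e-6):
    """Value iteration: initialize V_hat with r_min, then repeat Bellman backups."""
    V = np.full(R.shape[0], R.min())            # for all x: V_hat(x) <- r_min
    while True:                                 # repeat until convergence
        V_new, policy = bellman_backup(V, P, R, gamma)
        if np.max(np.abs(V_new - V)) < tol:
            return V_new, policy                # pi(x) = argmax_u [ ... ]
        V = V_new
```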

[Figure: value iteration on a grid world after k = 0, 1, 2, ..., 12, and 100 iterations; Noise = 0.2, Discount = 0.9, Living reward = 0]

Value Function and Policy

• Each iteration takes O(|A| |S|²) time.
• The number of iterations required is polynomial in |S|, |A|, and 1/(1-γ).
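To mirror the grid-world slides at a small scale, here is a hypothetical 4-state corridor (not the exact grid from the lecture) run through the value_iteration sketch above; the 0.8/0.2 "stay put" noise model, the single goal reward, and γ = 0.9 are assumptions.

```python
import numpy as np

# Hypothetical 4-state corridor, actions 0 = left, 1 = right.
# With probability 0.8 the move succeeds; with probability 0.2 the robot stays put
# (a stand-in for the slides' Noise = 0.2). Living reward is 0; pushing right
# in the last state pays r = 1.
S, A = 4, 2
P = np.zeros((A, S, S))
R = np.zeros((S, A))
for s in range(S):
    left, right = max(s - 1, 0), min(s + 1, S - 1)
    P[0, s, left] += 0.8; P[0, s, s] += 0.2
    P[1, s, right] += 0.8; P[1, s, s] += 0.2
R[3, 1] = 1.0

V, policy = value_iteration(P, R, gamma=0.9)
print(np.round(V, 2))   # values increase toward the goal state
print(policy)           # greedy policy points right everywhere
```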

Value Iteration for Motion Planning (assumes knowledge of the robot's location)

Frontier-based Exploration

• Every unknown location is a target point.


Manipulator Control

[Figure: arm with two joints and its configuration space]

Manipulator Control Path

[Figure: planned path shown in state space and in configuration space]

POMDPs

• In POMDPs we apply the very same idea as in MDPs.

• Since the state is not observable, the agent has to make its decisions based on the belief state, which is a posterior distribution over states.

• For finite-horizon problems, the resulting value functions are piecewise linear and convex.

• In each iteration the number of linear constraints grows exponentially.

• Full-fledged POMDPs have only been applied to very small state spaces with small numbers of possible observations and actions.

• Approximate solutions are becoming more and more capable.
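As a hedged sketch of the belief state mentioned above, a standard Bayes-filter update of a discrete belief after an action u and observation z could look like this; the array layouts P[u, x, x'] and Z[x', z] are assumptions, not part of the slides.

```python
import numpy as np

def belief_update(belief, u, z, P, Z):
    """Posterior over states after executing action u and observing z.

    belief: (S,)      prior b(x)
    P:      (A, S, S) transition model,  P[u, x, x'] = p(x' | u, x)
    Z:      (S, O)    observation model, Z[x', z]    = p(z | x')
    """
    predicted = belief @ P[u]            # prediction: sum_x p(x' | u, x) b(x)
    posterior = Z[:, z] * predicted      # correction: weight by observation likelihood
    return posterior / posterior.sum()   # normalize to a valid distribution
```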
