Upload
morrie
View
51
Download
0
Embed Size (px)
DESCRIPTION
Chapter 7: Hybrid Deliberative/Reactive Paradigm. Part 1: Overview & Managerial Architectures Part 2: State Hierarchy Architectures. Objectives. Describe the hybrid paradigm in terms of 1) SPA and 2) sensing organization - PowerPoint PPT Presentation
Citation preview
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 1
7Chapter 7:
Hybrid Deliberative/Reactive Paradigm
Part 1: Overview & Managerial ArchitecturesPart 2: State Hierarchy Architectures
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 2
7 Objectives
• Describe the hybrid paradigm in terms of 1) SPA and 2) sensing organization
• Given a list of responsibilities, be able to say whether it belongs in the deliberative layer or in the reactive layer
• List the five basic components of a Hybrid architecture: sequencer agent, resource manager, cartographer, mission planner, performance monitoring and problem solving agent.
• Be able to describe the difference between managerial, state hierarchy, and model-oriented styles of Hybrid architectures.
• Be able to describe the use of state to define behaviors and deliberative responsibilities in state hierarchy styles of Hybrid architectures
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 3
7 Motivating Example for Deliberation: USAR
• Worker places robot at entrance to unstable building, loads in the floor plan, contextual knowledge and tells robot to look for survivors efficiently and map out safe routes for workers to pass through
• contextual knowledge includes probability of where people are more likely to be
What can a reactive architecture do? What can’t it do?
Path planning, handling detours due to blockage, map making, learn from past rescues
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 4
7 Organization: Plan, Sense-Act
SENSE
PLAN
ACT
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 5
7 Sensing Organization
BEHAVIOR
BEHAVIOR
BEHAVIOR
SENSOR 1
SENSOR 2
ACTUATORS
WORLD MAP/KNOWLEDGE REP
SENSOR 3
virtual sensor
Deliberative functions*Can “eavesdrop”*Can have their ownSensors*Have output which Looks like a sensorOutput to a behavior(virtual sensor)
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 6
7 Deliberation v. Reactionas a function of TIME
• Past, Present, Future
• Reactive– exists in the PRESENT (will a bit of
duration)
• Deliberative– can reason about the PAST– can project into the FUTURE
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 7
7 Architectures:Key Questions
• How does the architecture distinguish between reaction and deliberation?
• How does it organize responsibilities in the deliberative portion?
• How does overall behavior emerge?
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 8
7 Architectures: Common Functionality
• Mission planner – interacts with human, puts commands into robot terms, constructs mission
• Cartographer – creates, stores, maintains map or world model, knowledge;
• Sequencer agent – generates set of behaviors to use to accomplish task
• Behavioral/resource manager – allocates resources (schemas, etc.) to behaviors
• Performance monitor/problem solving agent (fairly rare)
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 9
7 Architectures: 3 Styles
• Managerial (division of responsibilities looks like in business)– AuRA, SFX
• State Hierarchies (strictly by time scope)– 3T
• Model-Oriented (models serve as virtual sensors)– Saphira, TCA
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 10
7 Mgr Architecture 1:AuRA (Autonomous Robot Arch.)
Ron Arkin, Georgia Institute of Technology
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 11
7 AuRA Architectural Layout
reac
tive
del
iber
ativ
eInterfaces with
humanComputes path,
breaks into subtasks
Generates behaviors for subtasks
Uses potential fields
Changes gains of behaviors’
potential fields
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 12
7 Architectures: Common Functionality
• Mission planner • Cartographer• Sequencer agent• Behavioral manager• Performance monitor/problem solving
agent
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 13
7 AuRA Architectural LayoutCartographer
Sequencer
MissionPlanner
Behavioral manager(mgr+schemas)
PerformanceMonitoring
Emergent behavior
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 14
7 HOW WOULD THIS DO USAR TASK?
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 15
7 Motivating Example for Deliberation: USAR
• Worker places robot at entrance to unstable building, loads in the floor plan, contextual knowledge and tells robot to look for survivors efficiently and map out safe routes for workers to pass through
• contextual knowledge includes probability of where people are more likely to be
What can a reactive architecture do? What can’t it do?
Path planning, handling detours due to blockage, map making, learn from past rescues
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 16
7 Motivating Example for Deliberation: USAR
• Worker places robot at entrance to unstable building, loads in the floor plan, contextual knowledge and tells robot to look for survivors efficiently and map out safe routes for workers to pass through
• contextual knowledge includes probability of where people are more likely to be
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 17
7 Example USAR (overlay)• Cartographer accepts the map
• Navigator uses path planning algorithm to visit nodes in order of likelihood of survivors
• Pilot determines the list of behaviors, Motor Schema Manager instantiates them (MS & PS) and waits for termination
• Homeostatic might notice that robot is running out of power, so opportunistically picks up low probability room on way back to home
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 18
7 Mgr. Architecture 2:SFX (Sensor Fusion Effects)
• Focus on sensing
• Biomimetic organization
• deliberative layer consists of managerial agents
• reactive layer has tactical behaviors
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 19
7 SFX (Sensor Fusion Effects)
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 20
7 SFX (Sensor Fusion Effects)
Behaviors(using direct
perception, fusion)
SenseSenseSenseSenseMuscleMuscleMuscleActuators
Deliberative Layer Managers
SenseSenseSenseSensor
SenseSenseSenseReceptiveField
Choice of behaviors, resourceallocation, motivation, context
Focus of attention,recalibration
SensorWhiteboard
BehavioralWhiteboard
Del
iber
ativ
e L
ayer
Rea
cti v
e L
a yer
Parameters to behaviors,sensor failures, task progress
actions
SuperiorColliculus-likefunctions
CerebralCortex-likefunctions
Cartographer(model/map
making)
Recognitionperception
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 21
7
Cartographer
SFX Implementation
Sensor Mgr
Task Planner
Interface Mgr
BehaviorsBehaviorsBehaviors
SensorsSensors
SensorsSensors
Sensors
AcuatorsAcuators
Acuators
Effector Mgr
Lis
pC
++
Beh
avio
rsU
se, F
use
In
form
atio
nd
irec
ts s
ensi
ng
Interacts with human, specifies
constraints
Managers are independent agents;
negotiate; use AI planning, scheduling,
problem solving to allaocate effector and sensor resources to
accomplish task
monitors task
performance and
whether habitat has
changed Map making, path planning
Layered into strategic and tactical behaviors
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 22
7 Ability to Substitute Components
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 23
7 Motivating Example for Deliberation: USAR
• Worker places robot at entrance to unstable building, loads in the floor plan, contextual knowledge and tells robot to look for survivors efficiently and map out safe routes for workers to pass through
• contextual knowledge includes probability of where people are more likely to be
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 24
7 Example USAR (overlay)
• Cartographer accepts the map
• Task Planner agent asks for path, requests behaviors, passes to managerial layer
• Sensing and Effector Mgrs negotiate allocation
• Behaviors run until terminate or encounter exception (either preset condition by mgrs or through monitoring)
• Mgrs can see “below” but not above--cannot relax constraint of Planner/Boss
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 25
7 Tactical Behaviors
sensors strategic behaviors tactical behaviors actuators
follow-path speed-controlcamera drive
motor
avoidsonar
steermotor
center-cameracamerapanmotor
inclino-meter
slope
clutter
obstacleshow much vehicle turns
direction to path safe direction
safe velocity
swivel camera
strategicvelocity
Tactical behaviors subsume strategic behaviors; interaction
of stratregic and tactical behaviors create emergent
behavior
Task typically has 1 strategic, several tactical behaviors
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 26
7 Summary:Managerial Architectures
• How does the architecture distinguish between reaction and deliberation?
– Deliberation: global knowledge or world models, projection forward or backward in time
– Reaction: behaviors which have some past/persistence of perception and external state
• How does it organize responsibilities in the deliberative portion?– hierarchy of managerial responsibility, managers may be peer
software agents
• How does overall behavior emerge?– From interactions of a set of behaviors dynamically instantiated
and modified by the deliberative layer– assemblages of behaviors
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 27
7Chapter 7:
Hybrid Deliberative/Reactive Paradigm
Part 1: Overview & Managerial ArchitecturesPart 2: State Hierarchy & Model-Based Architectures
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 28
7 Architectures: 3 Styles
• Managerial (division of responsibilities looks like in business)– AuRA, SFX
• State Hierarchies (strictly by time scope or “state”)– 3T
• Model-Oriented (models serve as virtual sensors)– Saphira, TCA
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 29
7 State Hierarchy Architectures
• How does the architecture distinguish between reaction and deliberation?– Deliberation: requires PAST or FUTURE knowledge– Reaction: behaviors are purely reflexive and have only local,
behavior specific; require only PRESENT
• How does it organize responsibilities in the deliberative portion?– By internal temporal state
• PRESENT (controller)• PAST (sequencer)• FUTURE (planner)
– By speed of execution
• How does overall behavior emerge?– From generation and monitoring of a sequence of behaviors– assemblages of behaviors called skills– subsumption
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 30
7 3T Architecture
• Used extensively at NASA
• Merging of subsumption variation (Gat, Bonasso), RAPs (Firby), and vision (Kortenkamp)
• Has 3 layers– reactive
– deliberative
– in-between (reactive planning)
• Arranges by time
• Arranges by execution rate– ex. vision in deliberation
Dave Kortenkamp,TRAC Labs (NASA JSC)
Used for planetary rovers, robot assistants for astronauts, underwater vehiclesReactive
action packages
– a reactive planner
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 31
7
Uses RAPS to select behaviors (skills),
develop network of tasks
Skills have events that indicate than an action has had the
correct effect
present
Past and present
Uses past and present to
predict future
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 32
7cartographer
Mission planner
sequencer
Behaviormgr
Performancemonitor
Emergent behavior
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 33
7 Model-Oriented Architectures• How does the architecture distinguish between reaction and
deliberation?– Deliberation: anything relating a behavior to a goal or objective– Reaction: behaviors are “small control units” operating in
present, but may use global knowledge as if it were a sensor (virtual sensor)
• How does it organize responsibilities in the deliberative portion?– Behavioral component– Model of the world and state of the robot– throwback to Hierarchical Paradigm with global world model
but virtual sensors– Deliberative functions
• How does overall behavior emerge?– From generation and monitoring of a sequence of behaviors– voting or fuzzy logic for combination
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 34
7Saphira Architecture• Developed at SRI by
Konolige, Myers, Saffioti
• Comes with Pioneer robots
• Behaviors produce fuzzy outputs, fuzzy logic combines them
• Has a global rep called a Local Perceptual Structure to filter noise
• Instead of RAPs, uses PRS-Lite
Procedural Reasoning System
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 35
7 Saphira and LPS
Local Perceptual Space (the world model)
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 36
7 Sequencer agent,Mission Planning,Performance mon.
Cartographer
Behavior mgr
Emergent behavior
Fuses behaviors with fuzzy logic
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 37
7 Symbol-Grounding Problem
• Computers (and AI) reasons using symbols– Ex. “room”, “box,” “corner,” “door”
• Robots perceive raw data
• How to convert sensor readings to these labels?
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 38
7 Spatial World Knowledge
• What do you see?
• How could a robot reliably extract the same labels?
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 39
7 Types of Knowledge (Arkin)
• Spatial World knowledge
• Object knowledge
• Perceptual knowledge
• Behavioral knowledge
• Ego knowledge
• Intentional knowledge
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 40
7 Task Control Architecture• Developed by Reid Simmons
• Used extensively by CMU Field Robotics Projects– NASA’s Nomad, Ambler,
Dante
• Closest to Hierarchical in philosophy, but strong reactive theme showing up in implementation
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 41
7 TCATask Control Architecture –
operating system flavor
low-level tasks are like behaviors
Evidence grids (Ch 11) – sensor
info percolates up through the global model
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 42
7 Evaluation of Hybrids
• Support of Modularity: high• Niche targetability: high (ex. Lower levels of
AuRA, SFX, 2 1/2 T is just reactive)• Robustness: SFX and 3T explicitly monitor
• Think in closed world, act in open world
Introduction to AI Robotics (MIT Press), copyright Robin Murphy 2000 Chapter 7: Hybrid Deliberative/Reactive Paradigm 43
7 Hybrid Summary
• P,S-A, deliberation uses global world models, reactive uses behavior-specific or virtual sensors
• Architectures generally have modules for mission planner, sequencer, behavioral mgr, cartographer, and performance monitoring
• Deliberative component is often divided into sub-layers (sequencer/mission planner or managers/mission planner)
• Reactive component tends to use assemblages of behaviors