94
Teaching an Old Robot New Tricks: Learning via Interaction with People and Things Matthew J. Marjanovic MIT AI Lab Marjanovic: Old Robots, New Tricks – p.1

Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Teaching an Old Robot New Tricks:

Learning via Interaction

with People and Things

Matthew J. MarjanovicMIT AI Lab

Marjanovic: Old Robots, New Tricks – p.1

Page 2: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Embodied AI

Lots of structure built-in. . . What about building structures?

Marjanovic: Old Robots, New Tricks – p.2

Page 3: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Overview

Three contributions to Embodied AI:

sok : Environment for behavior-based programming.

meso: A biologically-motivated motor-control system(virtual muscles).

pamet : A modular system for learning from interactionwith the environment and people, with enough modulesto learn some simple behaviors.

What you will see:

Cog is taught to perform some simple motor tasks.

Marjanovic: Old Robots, New Tricks – p.3

Page 4: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

An Example: The FingerBot

s

θ

One degree-of-freedom motion; single tactile sensor.

Our goal: teach FingerBot to point in response to touch.

Marjanovic: Old Robots, New Tricks – p.4

Page 5: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

FingerBot initial state

meso tactile

emotion

extend

curl

θ s

R

Marjanovic: Old Robots, New Tricks – p.5

Page 6: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

FingerBot learns to move

modellermodeller

modeller modeller

meso tactile

emotion

extend

curl

θ s

R

Marjanovic: Old Robots, New Tricks – p.6

Page 7: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

FingerBot controller

meso tactile

emotion

extend

curl

model

model

controller

θ′

θ s

RA

A

Marjanovic: Old Robots, New Tricks – p.7

Page 8: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Teach FingerBot how to point

meso tactile

emotion

actionmodeller

θ′

θ s

R

Marjanovic: Old Robots, New Tricks – p.8

Page 9: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

FingerBot points

meso tactile

emotion

actorA

θ s

R

θ′

Marjanovic: Old Robots, New Tricks – p.9

Page 10: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Teach FingerBot when to point

meso tactile

emotion

actor

triggermodeller

A

θ s

R

θ′ A

Marjanovic: Old Robots, New Tricks – p.10

Page 11: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

FingerBot points when touched

meso tactile

emotion

actor

trigger

A

θ s

R

θ′

Marjanovic: Old Robots, New Tricks – p.11

Page 12: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Overview

sok

touch sense

tactile

processes, ports

happysad

emotion

pamet

models, modellers,movers, controllers,

actions, triggers,

virtual muscles, joint limits, fatigue

meso

transformers

vision

tracking,head control

attention,

Marjanovic: Old Robots, New Tricks – p.12

Page 13: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

The Robot: Cog

Humanoid, anthropomorphic robot:

4-dof head, with 3-dofstereoscopic eyes

two 6-dof torque-controlledarms

3-dof torque control in thetorso

off-board processing: ~28off-the-shelf x86 PC’s run-ning QNX4

Marjanovic: Old Robots, New Tricks – p.13

Page 14: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Cog’s Head

4-dof in head: tilt, pan, roll, neck “lean”

gyroscope: angular velocity, inclination

3-dof in two eyes: separate tilt, linked pan

dual cameras: wide-angle, narrow-angle “foveal”

Marjanovic: Old Robots, New Tricks – p.14

Page 15: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Cog’s Hand

reach grasp pinch point

2-dof: linked finger/thumb, paddle

Four recognizable gestures (sorry, no “thumbs up”)

Six surfaces of tactile sensation

Marjanovic: Old Robots, New Tricks – p.15

Page 16: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Why build a humanoid robot?

Assorted “practical” reasons:

Create machines/tools which adapt to humanenvironments.

Test theories from cognitive science, neuroscience.

Grand philosophical reason:

We want to create a human-level machine intelligence.

Intelligence is predicated on how an agent interactswith the world.

Marjanovic: Old Robots, New Tricks – p.16

Page 17: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Metaphors We Live By

[Lakoff & Johnson, 1980]

Metaphors are not just linguistic constructs; we think inmetaphors.

Happy is Up; Sad is Down.Significant is Big.Time is Money.

No literal meaning; no reduction to discretepropositional forms or symbols.

Operational meaning: represent consistent patterns ofinteraction.

Grounded in experience.

Marjanovic: Old Robots, New Tricks – p.17

Page 18: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

The Body in the Mind

[Johnson, 1987]Some metaphors are cultural;some arise from basic human physical interactions: imageschemata.

IN−OUTCONTAINER

Simple relationships which appear at all levels of thought:

Physical: “George put the toys in the box.”

Abstract: “Donald left out some facts.”Marjanovic: Old Robots, New Tricks – p.18

Page 19: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

So, what does this mean?

If you want to build a human-like machine intelligence:

1. You need a system which can develop and manipulatemetaphorical structures.

2. Those structures must be grounded in raw sensory andmotor experiences.

3. Those experiences must be human-like.

(You need a humanoid robot.)

Marjanovic: Old Robots, New Tricks – p.19

Page 20: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Cog is a humanoid robot.

Much work on different aspects of humanoid-ness. . .

Matt Williamson: motor control via couplednon-linear oscillators (central pattern generators).Robert Irie: auditory localization.Cynthia Breazeal (Kismet): facial/vocal gestures,social interaction, emotional models.Brian Scasselatti: visual animate/inanimatedistinction, theory of body, imitation of simplemotions.Paul Fitzpatrick: visuomotor object understanding.

Isolated parts, lacking direction/means for integration.

Mostly learning parameters, not structures.

(Where are the metaphors gonna go?!)Marjanovic: Old Robots, New Tricks – p.20

Page 21: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Metta’s Babybot

[Metta, 1999]5-dof head with stereoscopic vision,gyroscope

6-dof torque-controlled arm

Learns a developmental progression ofsensorimotor coordination:

Saccades to visual targets via visualfeedback (head motion rare).Coordinated head and eye movement(random arm movement).Controlling movement of arm, as avisual target.

Culminates in reaching out toward movingvisual stimuli.

Marjanovic: Old Robots, New Tricks – p.21

Page 22: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Drescher’s Schema Mechanism

Simulated robot learns progressively more abstractrelations by interacting with its world. [Drescher, 1991]

Mechanism has items, actions, schemas: predictors,CONTEXT + ACTION ⇒ RESULT

Learning and abstraction:

New schemas for more specific context or result.Composite actions and synthetic items.

Marjanovic: Old Robots, New Tricks – p.22

Page 23: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Schema Issues

Completely unrealistic simulation:

2-D grid-world with binary features.

10 discrete primitive actions.

Total of 141 bits of primitive binary state.

No physics, no noise.

Serialized state-machine for action execution.

It’s a symbolic AI engine for a symbolic universe.

Marjanovic: Old Robots, New Tricks – p.23

Page 24: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

A Design Philosophy

First and foremost, Real-World Constraints:

Real robot, real physics.

Real-time operation — human time-scales.

Distributed computation — expandable, scalable.

Noise — because separating signal from noise is whatit’s all about!

Marjanovic: Old Robots, New Tricks – p.24

Page 25: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

A Design Philosophy

Dynamic: create new models/concepts.

Transparent: compatible representations acrossmodalities.

Reuse same machinery at different levels ofdevelopment or abstraction.

Malleable: states/actions/etc allowed to change/evolve.

Distributed control: no central planner, arbitrator, actionselector, modeller.

Only serialize operations when required by resourcecontention.

Grounded close to the raw physics.

Marjanovic: Old Robots, New Tricks – p.25

Page 26: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Three Pieces

virtual muscles, joint limits, fatigue

meso

sokprocesses, ports

pamet

models, modellers,movers, controllers,

actions, triggers,transformers

happysad

emotion

touch sense

tactile

vision

tracking,head control

attention,

Marjanovic: Old Robots, New Tricks – p.26

Page 27: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

sok: Behavior-based Environment

Dynamic network of sok processes, connected via inportsand outports.

Provides interprocesscommunication and processcontrol.

Connection-basedmessage-passing.

Event-driven code.

Distributed: workstransparently over a networkof processors (via QNX).

C, C++ libraries, plus utilities.

state

portstate

init code

body code

runtimestate

portstate

init codebody code

runtimestate

portstate

init codebody code

runtimestate

portstate

runtime

init codebody code

runtimestate

portstate

init codebody code

runtimestate

portstate

init codebody code

runtimestate

portstate

init codebody code

runtimestate

portstate

init codebody code

Marjanovic: Old Robots, New Tricks – p.27

Page 28: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

sok Features

Processes and connections are dynamic.

Network is tangible.

Processes can be aware of the network topology.Processes can search for other ports/processes bytraversing the network.Ports are typed.

Separation of message-handling code and body code.

Processes can be killed and restarted withoutaffecting connectivity.

State of sok space can be dumped/restored from disk.

Marjanovic: Old Robots, New Tricks – p.28

Page 29: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

sok Type System

Ports are typed: C-like (or IDL)

10 atomic int and float types, plus arrays andstructures.Can define constants, too.

Scheme-based type compiler generates .h and .c filesfrom .stc files.

Type compiler is embedded in the sok library; typedescriptions can be processed at runtime.

Functions for walking typed buffers, generatingcomposite type identifiers, housekeeping.

Marjanovic: Old Robots, New Tricks – p.29

Page 30: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

sok Arbitrators

Message passing is asynchronous and anonymous:

Data received by an inport is buffered, and bodyprocess receives an event.

Default behavior is to overwrite — no queueing.

Body code doesn’t know about multiple connections.

Arbitrator: code which will change reception behavior, e.g.

Summing: Inport keeps a running sum of received data.

Averaging: Body process sees the mean of all datareceived between reads.

Blocking: One connection wins right to monopolize theport; messages from others are discarded.

Marjanovic: Old Robots, New Tricks – p.30

Page 31: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Dump and Restore

sok provides a dynamic environment, processes andconnections come and go as a system develops.

sok processes can be told to dump state to disk:

Connections to ports.Persistent data, arbitrator state.Command-line arguments, etc.

Entire state of machine can be dumped or restored withsingle shell command.

Marjanovic: Old Robots, New Tricks – p.31

Page 32: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Handy Utilities

sok command-line interface:

inspect network, processes, portsmake/break connectionsspawn/kill processes, send signalsinitiate dump and restore

Megasliderama Swiss-army GUI:

create panels of widgets connected to portssliders, stripcharts, buttons, image planes, . . .

Marjanovic: Old Robots, New Tricks – p.32

Page 33: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Sensory and Motor Boxes

virtual muscles, joint limits, fatigue

meso

sokprocesses, ports

pamet

models, modellers,movers, controllers,

actions, triggers,transformers

vision

tracking,head control

attention,

touch sense

tactile

happysad

emotion

Marjanovic: Old Robots, New Tricks – p.33

Page 34: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

meso: Biological Motor Control

Implements spring-like virtual muscles.

Allows for multi-joint coupling.

Provides performance feedback:

Muscle fatigue.Joint pain.

Marjanovic: Old Robots, New Tricks – p.34

Page 35: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Why muscles? Fatigue? Pain?

Movement is influenced by limitations of physiology.

We try to minimize energy use, take advantage ofphysics.We avoid uncomfortable motions.

Concepts linked to physical interaction are derived fromthose limitations.

“heavy” = “things which make muscles tired”“stretching limits”

Cog’s electric motors have no innate fatigue.

Marjanovic: Old Robots, New Tricks – p.35

Page 36: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Why spring-like? Multi-joint?

Why spring-like?

Real muscles are very non-linear force-producingelements, operate in antagonistic pairs.

At spinal level, feedback loops make muscle groupsbehave like simple springs.

Equilibrium-point control hypothesis. [Bizzi, Kelso]

Why multi-joint coupling?

Real muscles do span multiple joints.

Necessary for complete control of endpoint stiffness.[Hogan, 1985]

Multi-joint muscles enhance efficiency of certainmovements. [Gielen, 1993] Marjanovic: Old Robots, New Tricks – p.36

Page 37: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

meso: Endpoint Stiffness

Single-joint springs donot allow for full control ofstiffness in endpointspace.

Endpoint stiffness shouldbe tuned to match thetask.

Handshake: low vertical stiffness

Key in lock: high vertical stiffness

Marjanovic: Old Robots, New Tricks – p.37

Page 38: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

meso: Multi-joint Efficiency

~τa~τa

~τb

~x ~x

~τb

Depending on configuration, single-joint muscles mayundergo lengthening contraction, i.e. dissipating energy.

A multi-joint muscle acting as stiff linkage prevents this,makes motion more efficient.

To Cog, such motion will feel more efficient. Marjanovic: Old Robots, New Tricks – p.38

Page 39: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

meso: Three Layers

joint angle

A/D

upper arm

D/A

lower arm

D/Atorso/hand

D/A

skeletal model

~l~F

~F0~B ~v, ~K

Skeletal Model

Torque-Control

Muscular Modelmuscular modelδ

Low-level torque control.

Skeletal model (muscle kinematics).

Muscle model (muscle dynamics, fatigue).

Marjanovic: Old Robots, New Tricks – p.39

Page 40: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

meso: Skeletal Model

r

Fra

F rb

Compute muscle lengths ~l(~θ), joint torques ~τ( ~F , ~θ).

Simple “pulley” model:

lm =∑

j

rjmθj , τj =∑

m

rjmFm

Marjanovic: Old Robots, New Tricks – p.40

Page 41: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

meso: Muscle Model

Compute forces from lengths, i.e. spring-law:

F = K(l − l0) − B(l̇) + F0

Setpoint l0 and stiffness K are control parameters.

Damping constant B needs to be tuned.

Bias force F0 can be used for feedforward control (e.g.gravity/posture).

Marjanovic: Old Robots, New Tricks – p.41

Page 42: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Fatigue Model

Adams [2000]: detailed biochemical fatigue model:

Glucose/glycogen, adipose tissue, heart rate,insulin/glucagon/epinephrine. . .

Abstracted further:

Required power: P = α|Fv| + β|F | + γK

Metabolic supply: PR

Energy reserves: short-term SS , long-term SL.

Discomfort and muscle efficiency due to SL:

δ = 1 −

(

SL

SL0

)

, φ =

(

SL

SL0

)1/2

Marjanovic: Old Robots, New Tricks – p.42

Page 43: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

meso as a Black-box

meso

virtual muscles, joint limits, fatigue

~B, ~F0~θ,~l~v, ~K δ, ρ

Inputs: muscle setpoint velocity, stiffness.

Outputs: joint angles, muscle lengths, fatigue, pain.

Marjanovic: Old Robots, New Tricks – p.43

Page 44: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Next Box: Vision

virtual muscles, joint limits, fatigue

meso

sokprocesses, ports

pamet

models, modellers,movers, controllers,

actions, triggers,transformers

touch sense

tactile

happysad

emotion

vision

tracking,head control

attention,

Marjanovic: Old Robots, New Tricks – p.44

Page 45: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Vision System

Saliency filters:

Color, skin tone, motion.

Attention:

Weights/adds salienciesChooses potentialtargets.

Tracking system:

Locks onto imagepatch.Follows it around.

motion color skin

cameras

attention

motors MEI card

tracker disparity

motor control

(thanks Paul and Giorgio!)

Marjanovic: Old Robots, New Tricks – p.45

Page 46: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Vision as a Black Box

vision

tracking,head control

attention,

(R, G, B, Skin, Vivid)

θ, φ, d

Outputs only:

Target gaze-angles, disparity: (θ, φ, d)

Visual features of target: (R, G, B, Skin, Vivid)

Marjanovic: Old Robots, New Tricks – p.46

Page 47: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Next Box: Tactile Sense

touch sense

tactile

sokprocesses, ports

pamet

models, modellers,movers, controllers,

actions, triggers,transformers

happysad

emotion

virtual muscles, joint limits, fatigue

meso

vision

tracking,head control

attention,

Marjanovic: Old Robots, New Tricks – p.47

Page 48: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Tactile Sense

touch sense

tactile~s

22 force-sensitive-resistor pads.

Wired in parallel to yield six sensor surfaces.

Vector of six values, [0,1].

Marjanovic: Old Robots, New Tricks – p.48

Page 49: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Last Box: “Emotion”

actions, triggers,

touch sense

tactiletransformers

happysad

emotion

sokprocesses, ports

pamet

models, modellers,

virtual muscles, joint limits, fatigue

meso

movers, controllers,

vision

tracking,head control

attention,

Marjanovic: Old Robots, New Tricks – p.49

Page 50: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

“Emotion”

tactile

emo/happy

fatigue

painemo/sad

meso

s

R

s

R

Combines sources of innate positive/negative reward.

Positive: hand-squeezes.Negative: muscle fatigue, joint pain.

Leaky integrator dynamics:

R(t) = λR(t − 1) +∑

ri(t − 1, t)

Marjanovic: Old Robots, New Tricks – p.50

Page 51: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Final layer: pamet

touch sense

tactiletransformers

happysad

emotion

sokprocesses, ports

pamet

models, modellers,vision

tracking,attention,

head control

movers, controllers,

virtual muscles, joint limits, fatigue

meso

actions, triggers,

Marjanovic: Old Robots, New Tricks – p.51

Page 52: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

pamet: connecting, learning

Static modules (sok processes) provide initial structure:

Sensory and motor primitives (black boxes, movers).Proto-modeller modules to create modellers.

Modeller modules try to discover relations betweenstate parameters, and create models.

Mover models, action models, trigger models,transform models.

Instance modules use the models to produce behavior,create new state parameters.

Controllers, actors, triggers, transformers.

Knowledge is acquired in network structure and models.

Marjanovic: Old Robots, New Tricks – p.52

Page 53: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

An Example: The FingerBot

s

θ

One degree-of-freedom: joint angle θ.

Controlled by a single virtual muscle, length l.

Single tactile sensor s.

Goal: Teach the finger to point in response to touch.Marjanovic: Old Robots, New Tricks – p.53

Page 54: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Implementation of the FingerBot

Marjanovic: Old Robots, New Tricks – p.54

Page 55: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

A Menagerie of Modules

Module Subclass Inputs Outputs

mover meso A

controller ~v A

actor position-constant A, ~s ~v

position-parameter A, ~s1, ~s2 “

velocity-constant A, ~s “

trigger position ~s A

activation-delay A1, A2 A

transformer ~s1, ~v2 ~s2, ~v1, A

Basic data types: state parameters ~s, activations A,drive signals ~v

Marjanovic: Old Robots, New Tricks – p.55

Page 56: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

FingerBot initial state

meso tactile

emotion

extend

curl

θ s

R

Two hard-coded mover modules: “curl” and “extend”.

Mover modules activate at random: exploration.

Random activation is supressed by explicit activation.

Marjanovic: Old Robots, New Tricks – p.56

Page 57: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Mover

“Causes movement, rate modulated via a scalar parameter.”

meso movers are the primitive connections to the motorsystem.

Send a vector of muscle velocities, scaled by controlparameter (and fixed stiffness).

Effect on state parameters is modelled by movermodels.

Marjanovic: Old Robots, New Tricks – p.57

Page 58: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

FingerBot mover modelling

modellermodeller

modeller modeller

meso tactile

emotion

extend

curl

θ s

R

Mover modellers are spawned for each pair of stateparameter and mover activation.

Learn/discover the relationship between the moversand the joint velocity.

Marjanovic: Old Robots, New Tricks – p.58

Page 59: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

FingerBot controller

meso tactile

emotion

extend

curl

model

model

controller

θ′

θ s

RA

A

A controller is created for the joint angle: controls jointvelocity by activating movers.

Joint angle becomes a controllable parameter.

Marjanovic: Old Robots, New Tricks – p.59

Page 60: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Controller

“Controls the velocity of a state parameter by activatingmovers.”

Uses the mover modelsto determine how moverswill affect stateparameter.

At each instant, activatesthe mover which will yieldthe velocity closest to thecontrol input ~v:

m = arg max(~v · ~vm

‖~vm‖)

model

model

model

mover

mover

mover

controller

~vA1

A2

Ak

Marjanovic: Old Robots, New Tricks – p.60

Page 61: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Training FingerBot to Point

meso tactile

emotion

actionmodeller

θ′

θ s

R

An action modeller is created for each controllableparameter.

Correlates reward with joint angle; creates an actionmodel for the pointing posture.

Marjanovic: Old Robots, New Tricks – p.61

Page 62: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Training FingerBot to Point

meso tactile

emotion

actorA

θ s

R

θ′

An actor is spawned, which moves finger to the pointingposition at random times.

Marjanovic: Old Robots, New Tricks – p.62

Page 63: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Actor

model

actorcontroller

model

actorcontroller

~s0

~s = ~s0

A~v

~s ~s = ~s0

A~v

~s

“Drives a controllable parameter to a goal value.”

position-constant : constant prototype value in actionmodel.

position-parameter : goal derived from another inputparameter.

Have random activation.

Unused actors/actions will expire.

Marjanovic: Old Robots, New Tricks – p.63

Page 64: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Pointing in response to Touch

meso tactile

emotion

actor

triggermodeller

A

θ s

R

θ′ A

A trigger modeller is created for pairs of actors andstate parameters.

Correlates state parameter with reward and activation.

Creates a trigger model linking pointing action to arange of tactile sense values.

Marjanovic: Old Robots, New Tricks – p.64

Page 65: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Pointing in response to Touch

meso tactile

emotion

actor

trigger

A

θ s

R

θ′

A trigger is spawned, which activates the pointing actorin response to touch.

Marjanovic: Old Robots, New Tricks – p.65

Page 66: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Trigger

model

triggeractor

model

triggeractor

A0

~s ∼ ~s0? τ, Amin

A ~s A

“Activates an actor when input state satisfies a context.”

position: context is a region in input state space.

activation-delay : context is another activation signal.

Activation output indicates how well context is satisfied.

Marjanovic: Old Robots, New Tricks – p.66

Page 67: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Video of Three Arm Actions

Marjanovic: Old Robots, New Tricks – p.67

Page 68: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Cog, Initial State� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

E+

Sb+E−

Sb+E+

Sa+E−

Sa+E+

Sa+

Sb+Sc−

Sb+Sc+

Sc+

proto−controller

proto−action−modellerproto−mover−modeller

happysad

emotion[Ch. 5]

touch sense

tactile[Ch. 4]

vision

attention,tracking,

head control

[Ch. 4]

proto−trigger−modeller

proto−transform−modeller

pinch

point

virtual muscles, joint limits, fatigue

meso [Ch. 3]

Wa+Wb+

Wb+

Wa+

E+Wa+

Marjanovic: Old Robots, New Tricks – p.68

Page 69: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Mover Modelling

Modeller’s question:“Which state parameters are affected by the mover?”

Linear model of state velocity as a function of activation

~vs = A~m

Modeller operation:

1. Record samples (~s,A); calculate velocities ~v.2. Perform linear regression to estimate ~m.

3. Check correlation coefficient R2 for each axis of ~v.

Marjanovic: Old Robots, New Tricks – p.69

Page 70: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Elbow Mover (vs. Thumb)

-0.03

-0.02

-0.01

0

0.01

0.02

0.03

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4

ThumbElbow

Thumb FitElbow Fit

Elbow: R2 = 0.82, Thumb: R2 = 0.08

Marjanovic: Old Robots, New Tricks – p.70

Page 71: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Cog, with Controllers

� � � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

Sb+E+

Sa+E−

Sa+E+

Sa+

Sb+Sc−

Sb+Sc+

Sc+

controller

proto−controller

action−modeller

action−modeller

controller

proto−action−modellerproto−mover−modeller

happysad

emotion[Ch. 5]

touch sense

tactile[Ch. 4]

vision

attention,tracking,

head control

[Ch. 4]

proto−trigger−modeller

proto−transform−modeller

pinch

point

virtual muscles, joint limits, fatigue

meso [Ch. 3]

Wa+Wb+

Wb+

Wa+

E+Wa+

E+

Sb+E−

Marjanovic: Old Robots, New Tricks – p.71

Page 72: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Three Arm Actions

outward

forward

across Marjanovic: Old Robots, New Tricks – p.72

Page 73: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Action Modelling

modellercontroller modellercontroller

~sg

R

~s

R

~sc

Modeller’s question:

Is there a prototype state sample being rewarded?

If so, what is it?

Model acts as a binary classifier: rewarded vs. unrewarded.

Marjanovic: Old Robots, New Tricks – p.73

Page 74: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Action Modelling, cont.

0

0.5

1

1.5

2

2.5

3

0 500 1000 1500 2000 2500

raw rewardstate parameterreward windowsraw rewardreward windows

state parameter

Collect samples (~s,R).

Calculate reward windows, preceding the reward.

Label ~s samples:

Within windows: rewarded, “T”.Outside windows: unrewarded, “F”.

Marjanovic: Old Robots, New Tricks – p.74

Page 75: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Is there an Action?

0

0.2

0.4

0.6

0.8

1

1.6 1.8 2 2.2 2.4 2.6 2.8 3

RewardedBackground

Compare per-axis distributions p(si|T ) and p(si|F ).

Measure absolute areal difference of CDF’s:

αi =

|P (si|T ) − P (si|F )| dsi

If αi > 0.1, axis i is relevant. (None = no model!)Marjanovic: Old Robots, New Tricks – p.75

Page 76: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

What is the Action?

Model distributions p(si|T ) of relevant axes by gaussian.

Measure mean and variance of rewarded samples.

Mean is the prototype position — goal of action.

Marjanovic: Old Robots, New Tricks – p.76

Page 77: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Cog, with Actors

� � � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

Sb+E+

Sa+E−

Sa+E+

Sa+

Sb+Sc−

Sb+Sc+

Sc+

controller

proto−controller

actor

action−modeller

actor

trigger−modeller

trigger−modeller

trigger

trigger−modeller

trigger−modeller

trigger

trigger−modeller

trigger−modeller

actor

trigger−modeller

trigger−modeller

action−modeller

actor

trigger−modeller

trigger−modeller

actor

controller

proto−action−modellerproto−mover−modeller

happysad

emotion[Ch. 5]

touch sense

tactile[Ch. 4]

vision

attention,tracking,

head control

[Ch. 4]

proto−trigger−modeller

proto−transform−modeller

pinch

point

virtual muscles, joint limits, fatigue

meso [Ch. 3]

Wa+Wb+

Wb+

Wa+

E+Wa+

E+

Sb+E−

Marjanovic: Old Robots, New Tricks – p.77

Page 78: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Visual Triggers for the Arm

Marjanovic: Old Robots, New Tricks – p.78

Page 79: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Trigger Modelling

modelleractor

vision

A

A ~s

R

Modeller’s question:

Is there a prototype stimulus coincident with both actionand reward?

If so, what is it?

Model acts as a binary classifier: stimulus vs. background.

Marjanovic: Old Robots, New Tricks – p.79

Page 80: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Trigger Modelling, cont.

0

0.5

1

1.5

2

2.5

3

3.5

4

0 500 1000 1500 2000 2500 3000 3500 4000

raw Rewardaction activation

SkinR

reward events

Collect samples (~s,A,R).

Determine reward windows and action windows.

Label samples ~s:

Within both windows: stimulus, “T”.Outside both windows: background, “F”.

Compare distributions of T and F samples, per axis.

Marjanovic: Old Robots, New Tricks – p.80

Page 81: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Trigger Modelling Windows

0

0.5

1

1.5

2

2.5

0 500 1000 1500 2000 2500 3000 3500 4000

reward eventsAction Windows

ActionsReward Windows

Reward is “associated” with action if near actionendpoint.

Reward windows relative to reward event.

Action windows = reward windows for associatedrewards.

Marjanovic: Old Robots, New Tricks – p.81

Page 82: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Position Trigger

Relevant axes of T and F modelled by gaussiandistributions.

“Stimulus” and “background” distributions determinesalient region of state space:

ρ = P (T |~s) =p(~s|T )P (T )

p(~s|T )P (T ) + p(~s|F )P (F )

Trigger context is satisfied when ρ > 0.5.

ρ sent as activation value to actor.

Background distribution is updated/adapted over time.

Trigger will habituate to a continuous stimulus.

Marjanovic: Old Robots, New Tricks – p.82

Page 83: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Example Trigger Context

Decision boundary of “Red Ball” trigger:

0 0.2 0.4 0.6 0.8 1 0

0.2

0.4

0.6

0.8

1

0 0.2 0.4 0.6 0.8 1 0

0.2

0.4

0.6

0.8

1

0 0.2 0.4 0.6 0.8 1 0

0.2

0.4

0.6

0.8

1

red

green

blue

bluered

green

Marjanovic: Old Robots, New Tricks – p.83

Page 84: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Cog, with Triggers

� � � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � �

� � � � � � � � � � � � � � � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

� � �

Sb+E+

Sa+E−

Sa+E+

Sa+

Sb+Sc−

Sb+Sc+

Sc+

controller

proto−controller

actor

action−modeller

actor

trigger−modeller

trigger−modeller

trigger

trigger−modeller

trigger−modeller

trigger

trigger−modeller

trigger

trigger−modeller

actor

trigger−modeller

trigger−modeller

trigger

action−modeller

actor

trigger−modeller

trigger−modeller

actor

controller

proto−action−modellerproto−mover−modeller

happysad

emotion[Ch. 5]

touch sense

tactile[Ch. 4]

vision

attention,tracking,

head control

[Ch. 4]

proto−trigger−modeller

proto−transform−modeller

pinch

point

virtual muscles, joint limits, fatigue

meso [Ch. 3]

Wa+Wb+

Wb+

Wa+

E+Wa+

E+

Sb+E−

Marjanovic: Old Robots, New Tricks – p.84

Page 85: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Action Model Refinement

Action modeller continues to collect data, to discovernew actions.

If a new model would be similar to an existing one, theold model is refined.

Connections to triggers, etc., remain intact.Marjanovic: Old Robots, New Tricks – p.85

Page 86: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Conclusion

Robot learns from interaction with self and environment.

Robot can be taught a few simple behaviors.

Holds true to design philosophy:

Dynamic: new models, structure created.Malleable: models can be refined.Distributed: no central planner/learner.Real robot, real world conditions.

Marjanovic: Old Robots, New Tricks – p.86

Page 87: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Future Work

Negative reward/reinforcement

Using the muscle-fatigue/joint-pain signals.

Inhibition and un-learning

Learning when not to do something.Forgetting triggers.

Accounting/distributing reward.

Consistency, “success” as a reinforcer.

Marjanovic: Old Robots, New Tricks – p.87

Page 88: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Future Work, cont.

Models for Johnson’s “important” image schemata?

CONTAINER BALANCE COMPULSION

BLOCKAGE COUNTERFORCE RESTRAINT REMOVAL

ENABLEMENT ATTRACTION MASS-COUNT

PATH LINK CENTER-PERIPHERY

CYCLE NEAR-FAR SCALE

PART-WHOLE MERGING SPLITTING

FULL-EMPTY MATCHING SUPERIMPOSITION

ITERATION CONTACT PROCESS

SURFACE OBJECT COLLECTION

Marjanovic: Old Robots, New Tricks – p.88

Page 89: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Future Work, cont.

Incorporate earlier/other work into framework:

Richer perceptual primitives.Social primitives; emotional system.Reflexes which encourage interaction.

. . . but, “black boxes” should not be opaque.

Marjanovic: Old Robots, New Tricks – p.89

Page 90: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Brain-teasers. . .

How many primitives do we need to implement?

How small do the building blocks need to be?

How dense must the interconnections be?

Do we need to simulate cells? Proteins?

Marjanovic: Old Robots, New Tricks – p.90

Page 91: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

EXTRA SLIDES

Marjanovic: Old Robots, New Tricks – p.91

Page 92: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Phases of Fatigue

b

0

5000

F producedF required

0

0.5

1discomfortefficiency

0

20

40

60Sl

Ss(x10)

a c d

0 2 4 6 8

0 20 40 60 80 100 120 140 160

P required

Marjanovic: Old Robots, New Tricks – p.92

Page 93: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Transformer

model

transformercontroller actor

actor

~y ~y = f(~x)

~̇yx~̇x

~x ~yx

Axy ~s

~v

A

Transformer module can do three things:

Predict state of ~x in y coordinate frame. (~yx = f(~x))

Decide whether or not ~x and ~y are observing the sameprocess. (Axy)

Control ~x via its velocity ~̇yx in the y coordinate frame.(~̇x = F−1~̇yx)

Marjanovic: Old Robots, New Tricks – p.93

Page 94: Teaching an Old Robot New Trickspeople.csail.mit.edu/maddog/phdtalk-dist.pdf · Saccades to visual targets via visual feedback (head motion rare). Coordinated head and eye movement

Trigger Modelling Windows, v1.0

0

0.5

1

1.5

2

2.5

0 500 1000 1500 2000 2500 3000 3500 4000

reward eventsAction Windows

ActionsReward Windows

Action is “rewarded” if reward event soon after actionendpoint.

Action windows near action onset.

Reward windows = action windows for rewardedactions.

To be effective, need to slide action windows.Marjanovic: Old Robots, New Tricks – p.94