23
Institute for Systems and Robotics Computer and Robot Vision Laboratory http://vislab.isr.ist.utl.pt On Learning and Using Affordances with Humanoid Robots José Santos-Victor , Alexandre Bernardino, Luis Montesano and Manuel Lopes Object-Action Complexes Workshop Humanoids, Paris, Dec. 2009 Instituto Superior Técnico Computer and Robot Vision Lab Institute for Systems and Robotics Outline Motivation: affordances – Modeling – Learning Using the model Imitation games Task learning Mirror an canonical neurons Extensions: multimodal perception (words) • Discussion

Outline - h2t-projects.webarchiv.kit.edu

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Outline - h2t-projects.webarchiv.kit.edu

Institute for Systems and Robotics

Computer and Robot Vision Laboratoryhttp://vislab.isr.ist.utl.pt

On Learning and Using Affordances with Humanoid Robots

José Santos-Victor, Alexandre Bernardino, Luis Montesano and Manuel Lopes

Object-Action Complexes WorkshopHumanoids, Paris, Dec. 2009

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Outline

• Motivation: affordances– Modeling – Learning

• Using the model– Imitation games– Task learning– Mirror an canonical neurons

• Extensions: multimodal perception (words)

• Discussion

Page 2: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Outline

• Motivation: affordances– Modeling – Learning

• Using the model– Imitation games– Task learning– Mirror an canonical neurons

• Extensions: multimodal perception (words)

• Discussion

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

BaltazarBaltazar

Page 3: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

What are affordances?

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

What are affordances?

Actions(A)

Objects(O)

Effects(E)

Motor Perception

Perception Perception

Page 4: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

What are affordances?

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

What are affordances?

Page 5: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Robotic Affordances

Actions(A)

Objects(O)

Effects(E)

Motor Perception

Perception Perception

• Relations between actions, objects and effects• Involves action and perception• Bayesian networks to model relations• Learn through experimentation or observation

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Mathematical model: BN

• We use Bayesian Networks to represent affordances

• Nodes are:– Actions and action parameters– Object properties– Resulting effects

• BNs provide a unified and sound probabilistic framework for learning and using affordances

Page 6: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Learning Bayesian Networks

• Use a set of acquired data D to learn the network

Structure Learning Parameter Learning

Prior knowledge

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Learning Bayesian Networks

• Previous development phases provide the Actions, the object descriptors and the effect categories

• Bayesian structure learning

• Likelihood of the data given the model

• Prior over models

• Interventional data

Page 7: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Object and effects description

• Percepts of object features and effects are clusterized in unsupervised manner

• The categories form the space of

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Object and effects description

• Object features and effects are described using unsupervised learned clusters from

Page 8: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Learning affordances: Baltazar

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Learned object affordances

• 300 trials using random exploration, i.e. object action pair• The structure network learnt by the MC3 algorithm (Markov

Chain Monte Carlo Model Composition algorithm )

Not all network structures explain the data equally well

Page 9: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Robotic Affordances

Affordances as the link between sensory-motor coordination and social interaction

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Outline

• Motivation: affordances– Modeling – Learning

• Using the model– Imitation games– Task learning– Mirror an canonical neurons

• Extensions: multimodal perception (words)

• Discussion

Page 10: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

From SMM to social interaction

• Affordances can answer the following questions:

• Core capabilities for social interaction• Emulation: achieve the same effect

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Simple emulation

• One step Bayesian decision

• It is possible to define different simple reward functions to obtain different behaviors:– Matching an effect:

– Matching effect and Object:

Page 11: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Demonstration

(grasp on small box)

Imitation games

Objective: select action and object to obtain the same effect on a similar object

Which action gives the same effect?

The reward now also includes information about

the object features

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Imitation games

Page 12: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Outline

• Motivation: affordances– Modeling – Learning

• Using the model– Imitation games– Task learning– Mirror an canonical neurons

• Extensions: multimodal perception (words)

• Discussion

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Imitation (task)

Affordance based imitation

-World model (cause-effect)-Action recognition

Task interpretation (BIRL)-Bayesian Inverse Reinforcement Learning-(sequences of actions, estimating the

task reinforcement function.)

Imitation (emulation)

Demonstration{(s1,e1), (s2,e2), … }

Demo Interpretation(affordances+BIRL)

Imitation(execution)

Page 13: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Experiments

Robot

Drop the boxes in the pile

Touch the large ball

Kick the balls out of the table

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Inaccurate and incomplete demonstration

Page 14: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Imitation

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Outline

• Motivation: affordances– Modeling – Learning

• Using the model– Imitation games– Task learning– Mirror an canonical neurons

• Extensions: multimodal perception (words)

• Discussion

Page 15: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Mirror Neurons

Active during observation of anothermonkey’s or experimenter’s handsinteracting with objects.

Observed & executed actions are the same:

Observed & executed action are NOT the same (tool):

[Gallese, Fadiga, Fogassi and Rizzolati, Brain, 1996]

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Canonical Neurons (affordances)

Responses for a selective neuron forring shapes.

Motor neurons

Respond also to the presentation of

– food or– graspable 3D objects, – even in the absence of

subsequent movement.

Object specific

(size and shape must becongruent with the typeof grip coded by theneuron).

Ring

Cylinder

Sphere

[L. Fadiga et al. Intl. Journal of Psychophysiology, 2000]

Page 16: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Individual A Individual B

Action observation/executionressonance

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Affordances vs mirror/canonical

Page 17: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

iCub example

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Outline

• Motivation: affordances– Modeling – Learning

• Using the model– Imitation games– Task learning– Mirror an canonical neurons

• Extensions: multimodal perception (words)

• Discussion

Page 18: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

The setup

W1….Wj

E1...En

F1…Fm

A1….Ak

P(A,W,F,E)

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Adding words to the model

• p(X) represents the world behaviour, i.e. what the robot has learned through experience

• W a set of words, description of one experiment,

• Goal: to find mapping between W and X, achieved by estimating p(X,W)

(1)

• is a subset of nodes X which are parents of word

• strong assumption is the independence among words, a “bag of words”

• to choose from all models described by (1) we use variation of the simple greedy approach – K2 algorithm.

∏∈

=W

i

i

iXpXpWXp

ωωω )()|(),(

iXω iω

Page 19: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Results

• Comparing results obtained with ideal speech recognizer (100%) and with the real one (81%), it is visible that mistakes from the real recognizer have only a small influence on the model performance

• model distinguishes non-referential words

• model sensitive to unbalanced training data

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Results

• grasp action and its associationPLAY VIDEO

Page 20: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Results – Instructing the Robot

• incomplete instructions

• both factors taken into consideration: available objects and verbal task assignment

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Grasping (where & how)

Page 21: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Outline

• Motivation: affordances– Modeling – Learning

• Using the model– Imitation games– Task learning– Mirror an canonical neurons

• Extensions: multimodal perception (words)

• Discussion

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Discussion

• Bridging sensorimotor layers and “higher cognitive behavior”

• Relationship to the “mirror” and “canonical” neurons

• Extension to other perceptual modalities (word/meaning)

• Future developments• Scale (structural learning with large dimensional graphs)• Batch versus incremental• Continuous versus discrete variables (parameterized is ok)• Application to grasping and handling

Page 22: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Credits (http://vislab.isr.ist.utl.pt)

Head, face, cover design: Ricardo Beira, M. Praça, L. VargasVision & attention: Alexandre Bernardino, J.Ruesch

Learning, affordances; Manuel Lopes, Luis MontesanoLanguage acquisition: Jonas Hornstein, Cláudia SoraesG. Salvi, Verica Krunic – Vision-speech associationMatteo Tajana – model based trackingRicardo Nunes – robot life support!

http://vislab.isr.ist.utl.ptJosé Santos-Victor – [email protected]

Page 23: Outline - h2t-projects.webarchiv.kit.edu

Instituto Superior Técnico

Computer and Robot Vision LabInstitute for Systems and Robotics

Credits (http://vislab.isr.ist.utl.pt)

Head, face, cover design: Ricardo Beira, Miguel Praça, L. Vargas

Vision & attention: Alexandre Bernardino, J.RueschLearning, affordances; Manuel Lopes, Luis MontesanoLanguage acquisition: Jonas Hornstein, Cláudia SoraesG. Salvi, Verica Krunic – Vision-speech associationMatteo Tajana – model based trackingRicardo Nunes – robot life support!

http://vislab.isr.ist.utl.pt