
Twenty Second Conference on Artificial Intelligence

AAAI 2007

Improved State Estimation in Multiagent Settings with Continuous or Large Discrete State Spaces

Prashant Doshi

Dept. of Computer Science

University of Georgia

Speaker: Yifeng Zeng

Aalborg University, Denmark

State Estimation

[Figure: agent i acts on the physical state (Loc, Orient, ...) with action a_i^{t-1} and receives observation o_i^t]

Single agent setting

b_i^t(s^t) \propto O_i(s^t, a_i^{t-1}, o_i^t) \sum_{s^{t-1}} T_i(s^{t-1}, a_i^{t-1}, s^t) b_i^{t-1}(s^{t-1})
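For concreteness, a minimal sketch of this belief update for a discrete state space in Python (the arrays below are illustrative, not from the talk):

import numpy as np

def pomdp_belief_update(b, T, O, a, o):
    # b: belief over states, shape (S,)
    # T: transition probabilities T[a, s, s'], shape (A, S, S)
    # O: observation probabilities O[a, s', o], shape (A, S, Obs)
    predicted = b @ T[a]              # sum_s T(s, a, s') b(s)
    updated = O[a, :, o] * predicted  # multiply by the observation likelihood O(s', a, o)
    return updated / updated.sum()    # normalise

# Tiny two-state example with hypothetical numbers.
b = np.array([0.5, 0.5])
T = np.array([[[0.9, 0.1], [0.2, 0.8]]])    # single action
O = np.array([[[0.85, 0.15], [0.3, 0.7]]])  # single action, two observations
print(pomdp_belief_update(b, T, O, a=0, o=1))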

Interactive state

State Estimation

[Figure: agents i and j both act on the physical state (Loc, Orient, ...), with actions a_i^{t-1}, a_j^{t-1} and observations o_i^t, o_j^t]

Multiagent setting

Interactive state space IS_i = S \times M_j: physical states coupled with models of agent j (See AAMAS'05)

State Estimation in Multiagent Settings

Ascribe intentional models (POMDPs) to other agents

Update the other agents' beliefs

Estimate the interactive state IS_i

(See JAIR’05)
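For orientation, the level-1 (singly nested) interactive belief update from the I-POMDP framework has roughly the following structure; the exact statement and normalisation are in the JAIR'05 paper:

b_i^t(s^t, b_j^t) \propto \sum_{s^{t-1}, b_j^{t-1}} b_i^{t-1}(s^{t-1}, b_j^{t-1}) \sum_{a_j^{t-1}} \Pr(a_j^{t-1} | b_j^{t-1}) T(s^{t-1}, a_i^{t-1}, a_j^{t-1}, s^t) O_i(s^t, a_i^{t-1}, a_j^{t-1}, o_i^t) \sum_{o_j^t} O_j(s^t, a_i^{t-1}, a_j^{t-1}, o_j^t) \delta(SE_j(b_j^{t-1}, a_j^{t-1}, o_j^t), b_j^t)

where \Pr(a_j^{t-1} | b_j^{t-1}) comes from solving j's model, SE_j denotes agent j's own belief update, and \delta is 1 when its arguments match (a Dirac delta for continuous beliefs).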

Previous Approach

Interactive particle filter (I-PF; see AAMAS'05, AAAI'05)

Generalizes PF to multiagent settings

Approximate simulation of the state estimation

Limitations of the I-PF

Large no. of particles needed even for small state spaces

Distributes particles over the physical state and model spaces

Poor performance when the physical state space is large or continuous

Factoring the State Estimation

Update the physical state space

Update other agent's model

Factoring the State Estimation

Sample particles from just the physical state space

Substitute the samples into the state estimation

Implement the physical-state update using a PF

Perform the update over the other agent's model as exactly as possible

Rao-Blackwellisation of the I-PF
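A schematic sketch of one RB-IPF update step in Python (with illustrative stand-in transition, observation, and model-update functions; not the authors' implementation): each particle pairs a sampled physical state with an analytic Gaussian belief over the other agent's belief; the physical part is propagated by sampling and weighted by the observation likelihood, while the model part is updated in closed form.

import numpy as np

rng = np.random.default_rng(0)

def rb_ipf_step(particles, a_i, o_i, transition, obs_likelihood, model_update):
    # particles: list of (s, (mu, var), weight) triples; s is the sampled
    # physical state, (mu, var) is the analytic Gaussian belief over j's belief.
    propagated = []
    for s, (mu, var), w in particles:
        s_next = transition(s, a_i)                      # sample the physical state
        mu_new, var_new = model_update(mu, var, s_next)  # closed-form model update
        w_new = w * obs_likelihood(o_i, s_next)          # importance weight
        propagated.append((s_next, (mu_new, var_new), w_new))
    weights = np.array([w for _, _, w in propagated])
    weights /= weights.sum()
    idx = rng.choice(len(propagated), size=len(propagated), p=weights)  # resample
    return [(propagated[i][0], propagated[i][1], 1.0 / len(propagated)) for i in idx]

# Illustrative stand-in models for a hypothetical 1-D problem.
def transition(s, a_i):
    return s + a_i + rng.normal(0.0, 0.1)

def obs_likelihood(o_i, s):
    return np.exp(-0.5 * (o_i - s) ** 2)   # unnormalised Gaussian likelihood

def model_update(mu, var, s):
    # Placeholder for the exact update over j's belief (Steps 1-3 of the talk).
    k = var / (var + 1.0)
    return mu + k * (s - mu), (1.0 - k) * var

particles = [(rng.normal(), (0.0, 1.0), 1.0) for _ in range(100)]
particles = rb_ipf_step(particles, a_i=0.0, o_i=0.3,
                        transition=transition, obs_likelihood=obs_likelihood,
                        model_update=model_update)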

Assumptions on Distributions Prior beliefs

Singly nested and conditional linear Gaussian (CLG)

Transition functions

Deterministic or CLG

Observation functions

Softmax or CLG
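For reference, the standard forms of these families (notation mine, not taken from the talk):

\Pr(s^t | s^{t-1}, a) = N(s^t ; A_a s^{t-1} + c_a, \Sigma_a)    (conditional linear Gaussian transition)

\Pr(o | s) = \exp(w_o \cdot s + w_{o,0}) / \sum_{o'} \exp(w_{o'} \cdot s + w_{o',0})    (softmax observation function)

The CLG keeps continuous physical states in closed form, while the softmax models discrete observations of a continuous state.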

Why these distributions?

Good statistical properties

Well-known methods for learning these distributions from data

Applications in target tracking, fault diagnosis

Belief Update over Models

Step 1: Update other agent's level 0 beliefs

Product of a Gaussian and Softmax

Use variational approximation of softmax (see Jordan '99)

Softmax replaced by a Gaussian-form factor that is a tight lower bound

Update is then analogous to the Kalman filter
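As an illustration of the kind of bound used (in the two-category, logistic case; this is the Jaakkola-Jordan bound described in the Jordan '99 variational-methods work, not necessarily the exact form in the paper):

\sigma(x) \geq \sigma(\xi) \exp((x - \xi)/2 - \lambda(\xi)(x^2 - \xi^2)),   \lambda(\xi) = \tanh(\xi/2) / (4\xi)

The right-hand side is exponential-quadratic in x, so multiplying it with a Gaussian prior yields another (unnormalised) Gaussian, which is why the resulting mean/covariance update resembles a Kalman filter step.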

Belief Update over Models

Step 2: Update belief over other's beliefs

Solve other's models – compute other's policy

e.g., large variance in the other's belief – Listen

Obtain piecewise distributions

Updated belief over other's belief = { updated Gaussian, if the prior belief supports the action; 0, otherwise }

Approximate piecewise with Gaussian using ML
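A rough Monte Carlo sketch of this final approximation (not the paper's exact procedure): the piecewise density, a Gaussian on the region where the action is supported and zero elsewhere, is approximated by a single Gaussian fit by maximum likelihood, i.e., by the mean and variance of the retained mass.

import numpy as np

rng = np.random.default_rng(1)

def fit_gaussian_to_piecewise(mu, sigma, action_is_optimal, n=20000):
    # mu, sigma: the Gaussian piece of the piecewise density (1-D for illustration).
    # action_is_optimal(b): True where the observed action is optimal given belief b.
    samples = rng.normal(mu, sigma, size=n)
    mask = np.array([action_is_optimal(b) for b in samples])
    kept = samples[mask]            # density is zero where the action is not supported
    return kept.mean(), kept.std()  # ML estimates of the approximating Gaussian

# Example: suppose Listen is optimal only when j's belief lies in a middle
# region, say between 0.1 and 0.9 (hypothetical thresholds).
mu_new, sigma_new = fit_gaussian_to_piecewise(0.5, 0.3, lambda b: 0.1 < b < 0.9)
print(mu_new, sigma_new)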

Belief Update over Models

Step 3: Form a mixture of Gaussians

Each Gaussian is for the optimal action and a possible observation of the other agent

Weight each Gaussian with the likelihood of receiving the observation

Mixture components grow unbounded

The number of components is multiplied at every update (one new component per possible action-observation pair of the other agent), so it grows exponentially with the number of update steps t
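Schematically (notation mine), the updated density over j's belief after one step is a mixture

\Pr(b_j^t) \approx \sum_{o_j^t} \Pr(o_j^t | b_j^{t-1}, a_j^{t-1}) N(b_j^t ; \mu_{o_j^t}, \Sigma_{o_j^t})

with one Gaussian per possible observation of j (given its optimal action) and weights equal to the observation likelihoods.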

Comparative Performance

Compare accuracy of state estimation with I-PF (L1 metric)

Continuous multi-agent tiger problem

Public good problem with punishment

RB-IPF focuses particles on the large physical state space

Updates beliefs over other's models more accurately (supporting plots in paper)

Comparative Performance

Compare run times with I-PF (Linux, Xeon 3.4GHz, 4GB RAM)

Sensitivity to the Gaussian approximation of the piecewise distribution

Discussion

How restrictive are the assumptions on the distributions?

Can we generalize RB-IPF, like I-PF?

Will RB-IPF scale to a large number of update steps?

Closed-form mixtures are needed

Is RB-IPF applicable to multiply-nested beliefs?

Recursive application may not improve performance over I-PF

Thank you

Questions?
