6
LEARNING BY SIMPLIFIED COST-REFERENCE PARTICLE FILTERING USING BIASED DATA onica F. Bugallo, Ting Lu, and Petar M. Djuri´ c, Department of Electrical and Computer Engineering Stony Brook University, Stony Brook, NY 11794, USA e-mail: {monica,tinglu,djuric}@ece.sunysb.edu ABSTRACT In this paper we address the problem of online learning by cost-reference particle filtering combined with Kalman fil- tering. We propose an efficient learning scheme applicable to problems where some of the unknowns of a dynamic sys- tem of interest are linear given the remaining unknowns, which are nonlinear. To that end, we exploit a concept that is analogous to Rao-Blackwellization, and we imple- ment it by using only one Kalman filter. The resulting al- gorithm is tested and compared to standard particle filtering for the problem of target tracking using bearings-only mea- surements acquired by two sensors. 1. INTRODUCTION Many methods for learning belong to the class of sequential methods, where learning is implemented recursively. The latest measurements containing information about the un- knowns of interest modify the existing knowledge about the unknowns following a certain theory. One such method- ology, known as particle filtering (PF), was introduced to the engineering community in the early nineties [1]. Since then it has advanced considerably so that today it is mostly well understood with all its advantages and disadvantages. PF is primarily used for learning in scenarios where the un- knowns are nonlinear and evolve with time and where the noises used in the models may be non-Gaussian. PF is based on Bayesian theory and therefore uses as- sumptions about the probability distributions of the noises in the modeling equations [2]. The objective of PF is very ambitious and it amounts to tracking the posterior distri- butions of the unknowns since they capture the complete knowledge about the unknowns. PF is based on approximat- ing these distributions by discrete random measures, which are composed of samples (particles) and weights associated to the particles. More recently, a PF method that is not based This work has been supported by the National Science Foundation under CCF-0515246 and the Office of Naval Research under Award N00014-06-1-0012. on probabilistic assumptions was proposed in [3]. In other words, the probability distributions of the noises in the state and observation equations are assume unknown and no at- tempts are made to estimate them. It is also based on parti- cles but instead of weights, they have costs that measure the “quality” of the weights according to an adopted criterion. The original name of the methodology is cost-reference par- ticle filtering (CRPF). The philosophy of CRPF is practically the same as that of PF. The complete knowledge about the unknowns is described by a cost function defined on the space of the unknowns. This function is approximated by a discrete measure that is updated with every measurement. In its original form, CRPF in spirit is close to auxiliary PF [4]. Since CRPF does not require probabilistic assumptions, it is not surprising that it is much more robust than PF. In many problems some of the unknowns are con- ditionally linear on the remaining (nonlinear) unknowns. For such problems there is a methodology known as Rao-Blackwellization (RB) [5], which combines PF with Kalman filtering (KF) [6]. According to the scheme, ev- ery particle stream has its Kalman filter which takes care of the linear unknowns. RB allows for more accurate estimates of the unknowns because the dimension of the space that is explored with particles is reduced and therefore it is much better searched. In this paper we propose to use an analogous scheme for CRPF. That is, the nonlinear unknowns are processed by us- ing particles and the linear ones by using KF. Moreover, we propose to use only one Kalman filter for all the particle streams instead of as many as there are streams 1 . We il- lustrate the use of the method on a target tracking problem where the tracking is achieved by bearings-only measure- ments obtained by two sensors. The linear unknowns are the biases of the measurement devices. The paper is organized as follows. The learning prob- lem is stated in Section 2. In Section 3 we describe the 1 Here we assume that the cost function used for computing the costs is unimodal. In cases of multimodality the number of KFs corresponds, in general, to the number of modes. 1-4244-1566-7/07/$25.00 ©2007 IEEE. 402

[IEEE 2007 IEEE Workshop on Machine Learning for Signal Processing - Thessaloniki, Greece (2007.08.27-2007.08.29)] 2007 IEEE Workshop on Machine Learning for Signal Processing - Learning

  • Upload
    petar-m

  • View
    214

  • Download
    0

Embed Size (px)

Citation preview

Page 1: [IEEE 2007 IEEE Workshop on Machine Learning for Signal Processing - Thessaloniki, Greece (2007.08.27-2007.08.29)] 2007 IEEE Workshop on Machine Learning for Signal Processing - Learning

LEARNING BY SIMPLIFIED COST-REFERENCE PARTICLE FILTERING

USING BIASED DATA

Monica F. Bugallo, Ting Lu, and Petar M. Djuric,

Department of Electrical and Computer Engineering

Stony Brook University, Stony Brook, NY 11794, USA

e-mail: {monica,tinglu,djuric}@ece.sunysb.edu

ABSTRACT

In this paper we address the problem of online learning by

cost-reference particle filtering combined with Kalman fil-

tering. We propose an efficient learning scheme applicable

to problems where some of the unknowns of a dynamic sys-

tem of interest are linear given the remaining unknowns,

which are nonlinear. To that end, we exploit a concept

that is analogous to Rao-Blackwellization, and we imple-

ment it by using only one Kalman filter. The resulting al-

gorithm is tested and compared to standard particle filtering

for the problem of target tracking using bearings-only mea-

surements acquired by two sensors.

1. INTRODUCTION

Many methods for learning belong to the class of sequential

methods, where learning is implemented recursively. The

latest measurements containing information about the un-

knowns of interest modify the existing knowledge about the

unknowns following a certain theory. One such method-

ology, known as particle filtering (PF), was introduced to

the engineering community in the early nineties [1]. Since

then it has advanced considerably so that today it is mostly

well understood with all its advantages and disadvantages.

PF is primarily used for learning in scenarios where the un-

knowns are nonlinear and evolve with time and where the

noises used in the models may be non-Gaussian.

PF is based on Bayesian theory and therefore uses as-

sumptions about the probability distributions of the noises

in the modeling equations [2]. The objective of PF is very

ambitious and it amounts to tracking the posterior distri-

butions of the unknowns since they capture the complete

knowledge about the unknowns. PF is based on approximat-

ing these distributions by discrete random measures, which

are composed of samples (particles) and weights associated

to the particles. More recently, a PF method that is not based

This work has been supported by the National Science Foundation

under CCF-0515246 and the Office of Naval Research under Award

N00014-06-1-0012.

on probabilistic assumptions was proposed in [3]. In other

words, the probability distributions of the noises in the state

and observation equations are assume unknown and no at-

tempts are made to estimate them. It is also based on parti-

cles but instead of weights, they have costs that measure the

“quality” of the weights according to an adopted criterion.

The original name of the methodology is cost-reference par-

ticle filtering (CRPF).

The philosophy of CRPF is practically the same as that

of PF. The complete knowledge about the unknowns is

described by a cost function defined on the space of the

unknowns. This function is approximated by a discrete

measure that is updated with every measurement. In its

original form, CRPF in spirit is close to auxiliary PF [4].

Since CRPF does not require probabilistic assumptions, it

is not surprising that it is much more robust than PF.

In many problems some of the unknowns are con-

ditionally linear on the remaining (nonlinear) unknowns.

For such problems there is a methodology known as

Rao-Blackwellization (RB) [5], which combines PF with

Kalman filtering (KF) [6]. According to the scheme, ev-

ery particle stream has its Kalman filter which takes care of

the linear unknowns. RB allows for more accurate estimates

of the unknowns because the dimension of the space that is

explored with particles is reduced and therefore it is much

better searched.

In this paper we propose to use an analogous scheme for

CRPF. That is, the nonlinear unknowns are processed by us-

ing particles and the linear ones by using KF. Moreover, we

propose to use only one Kalman filter for all the particle

streams instead of as many as there are streams1. We il-

lustrate the use of the method on a target tracking problem

where the tracking is achieved by bearings-only measure-

ments obtained by two sensors. The linear unknowns are

the biases of the measurement devices.

The paper is organized as follows. The learning prob-

lem is stated in Section 2. In Section 3 we describe the

1Here we assume that the cost function used for computing the costs is

unimodal. In cases of multimodality the number of KFs corresponds, in

general, to the number of modes.

1-4244-1566-7/07/$25.00 ©2007 IEEE. 402

Page 2: [IEEE 2007 IEEE Workshop on Machine Learning for Signal Processing - Thessaloniki, Greece (2007.08.27-2007.08.29)] 2007 IEEE Workshop on Machine Learning for Signal Processing - Learning

new CRPF for a system with conditionally linear parame-

ters which makes use of only one Kalman filter in its im-

plementation. Simulation results regarding the problem of

bearings-only tracking are given in Section 4. Final conclu-

sions are given in Section 5.

2. PROBLEM FORMULATION

Many learning problems in signal processing can be stated

in terms of estimating an unobserved discrete-time random

state in a dynamic system of the form

xt = f(xt−1,ut) (1)

yt = g(xt) + bt + vt (2)

where, in equation (1), xt is a nonlinear unknown and

represents the system state at time t; f(·) is a known vector

function, which, in general, may be nonlinear; and ut is

a zero-mean noise vector whose probability distribution is

unknown. The measurements, yt, are functions of the

unknown state modeled using (2) where g(·) denotes a

known vector function of the state; bt = b represents a

vector of unknown biases2; and vt is another zero-mean

noise vector with an unknown probability distribution. The

objective is to obtain the posterior probability distribution

of the state, xt, given the sensor measurements, y1:t, i.e., to

acquire p(xt|y1:t) in the presence of the unknown biases b.

3. PROPOSED METHOD

We seek a solution to the learning problem based on

a probabilistic-free approach, i.e., we do not make any

assumptions about the probability distributions of the noise

vectors. We only assume that the mean of the state noise

vector is known. Moreover, for the state-space model

described by (1)-(2), we apply the RB philosophy that

reduces the variance of the nonlinear state estimate. We

can do that because the vector of biases b constitutes a

linear parameter given xt. Note, however, that to apply

such scheme by means of KF and using CRPF reasoning,

one has to circumvent properly the need for knowledge of

the covariances of the noises required by the traditional RB

approach. We include a method that estimates the necessary

covariances. Also, unlike traditional RB-PF approaches

where one Kalman filter is used per particle stream, we

use a single Kalman filter for the entire filter. We have

reported the use of one Kalman filter for RB in the context

of standard PF in [7] and [8].

Although CRPF is a probabilistic-free method, its se-

quential procedure follows a similar structure as that of

standard PF (SPF) [3]. In particular, the discrete measure

2In the paper we consider a vector of unknown constant biases. With a

slight modification of the algorithm, dynamic biases can also be treated.

in SPF is composed of particles and weights associated to

them, and in CRPF, it also contains particles, but its weights

represent costs that capture the quality of the state particles

computed by using the observations.

In CRPF, we denote the discrete measure by ζt ={x

(m)0:t , c

(m)t

}M

m=1, where x

(m)0:t represent the particle

streams and c(m)t are the costs associated to x

(m)0:t . To al-

low for online processing of the observations, the costs are

updated following [3]

c(m)t = c(x

(m)0:t |y1:t)

= λc(x(m)0:t−1|y1:t−1) + �c(x

(m)t |yt) (3)

where λ is a control factor, which prevents assignment

of excessive weight to past particles (0 ≤ λ ≤ 1), and

�c(x(m)t |yt) is an incremental cost, which measures the

“quality” of the particles by the latest measurement. Here

we use as incremental cost function

�c(x(m)t |yt) = ||yt − y

(m)t ||q

where y(m)t is an estimate of the observation based on the

particle x(m)t , and q > 0. Other forms of �c(·) can also be

considered.

In this paper the novelty is the special treatment of the

linear parameters in the system (the unknown biases). We

propose tracking them by using a Kalman filter-type algo-

rithm which can be used because the biases are condition-

ally linear parameters. In particular, the proposed method

can be implemented as follows. At time instant t and using

the new observation yt, the random measure ζt−1 is updated

using the next five steps:

1. Particle selection: Compute the risks of the particles,

i.e., the prediction of the costs of propagating the

particles from the previous step3. One possibility in

computing the risks is the use of

r(m)t = λc

(m)t−1 + ||yt − g(f(x

(m)t−1),0) − bt−1||

q.

The particles are resampled according to a probability

mass function (pmf) defined by π(k)t ∼ μ(r

(k)t ),

where μ(·) is a monotonically decreasing function.

As a result, we generate a new resampled random

measure

ζt−1 ={

x(m)0:t−1, c

(m)t−1

}M

m=1.

Note that the computational complexity of this step,

mainly due to the calculation of the pmf, can be

reduced employing other alternative selection tech-

niques which are based on sorting [9].

3This initial step mimics the idea of auxiliary particle filtering [4].

403

Page 3: [IEEE 2007 IEEE Workshop on Machine Learning for Signal Processing - Thessaloniki, Greece (2007.08.27-2007.08.29)] 2007 IEEE Workshop on Machine Learning for Signal Processing - Learning

2. Particle propagation: We draw new particles ac-

cording to a proposal distribution

x(m)t ∼ pt(xt|x

(m)t−1)

where pt(·) is an appropriate probability density func-

tion (pdf). Then we evaluate the corresponding costs

following the expression

c(m)t = λc

(m)t−1 + �c

(m)t .

3. State estimation: This step can be carried out by

using different schemes. One possible estimate is the

mean-square error (MSE) estimate,

xt =M∑

m=1

π(m)t x

(m)t

with π(m)t ∝ μ(c

(m)t ) [3]. Another alternative is the

minimum cost particle [9].

4. Observation noise estimation: Here we estimate the

covariance matrix of the observation noise in order to

apply a blind Kalman filter that does not know the

value of that matrix. The following derivations allow

us to obtain it

zt = yt − g(xt) − bt−1

zt =t − 1

tzt−1 +

1

tzt

Czt=

t − 1

tCzt−1

+1

t(zt − zt)(zt − zt)

Cbt=

t − 1

tCbt−1

+1

tCbt−1

Cv,t = Czt− Cbt

.

5. Measurement update of the linear states: The

equations of the blind Kalman filter then become

Kt = Cbt−1

(Cbt−1

+ Cv,t

)−1

bt = bt−1 + Kt zt

Cbt= Cbt−1

(I − Kt

).

4. EXAMPLE

In this section we present computer simulations that show

the validity and performance of the proposed algorithm.

4.1. Bearings-only tracking problem

For illustration purposes, we considered the problem of

tracking one target using two static sensors which collected

sensor 2

target

sensor 1

y2,ty1,t

Fig. 1. A system with two static sensors and their bearings-

only measurements.

bearings-only measurements. Figure 1 depicts the geometry

of the problem, whose mathematical formulation is stated as

xt = Gxxt−1 + Guut (4)

yt = h(xt) + b + vt (5)

where t = 1, 2, · · · , represents discrete-time index.

In (4), x�

t = [x1,t x2,t x1,t x2,t]� denotes the state vec-

tor composed of the coordinates and velocity components

of the target at time t; Gx and Gu are state- and noise-

transition matrices defined by

Gx =

⎡⎢⎢⎣

1 0 Ts 00 1 0 Ts

0 0 1 00 0 0 1

⎤⎥⎥⎦ , Gu =

⎡⎢⎢⎣

T 2s /2 00 T 2

s /2Ts 00 Ts

⎤⎥⎥⎦

with Ts being the sampling interval; and ut ∈ �2 is the state

noise vector modeled according to the following mixture

Gaussian distribution:

ut ∼ .3N (0, I2) + .5N (0, .25I2) + .2N (0, .01I2).

In our experiment, the observation vector y�

t =[y1,t y2,t]

� in (5) consists of collected measurements where

the function h(·) is defined as

h(xt) =

⎡⎣arctan

(x2,t−l2,1

x1,t−l1,1

)

arctan(

x2,t−l2,2

x1,t−l1,2

)⎤⎦

where (l1,1, l2,1) and (l1,2, l2,2) denote the positions of

sensors 1 and 2, respectively; the vector b� = [b1 b2]�

is composed of unknown biases from each of the sensors;

and the observation noise, vt ∈ �2, independent from ut,

is also modeled as a mixture Gaussian distribution, but of

the form

vt ∼ .5N (0, 10−2I2) + .4N (0, 10−4I2) + .1N (0, 10−5I2) .

404

Page 4: [IEEE 2007 IEEE Workshop on Machine Learning for Signal Processing - Thessaloniki, Greece (2007.08.27-2007.08.29)] 2007 IEEE Workshop on Machine Learning for Signal Processing - Learning

Assuming that the observations from the sensors are

sent to a fusion center, the objective is to apply the pro-

posed algorithm for estimation of the target’s position as

accurately as possible.

−1200 −1000 −800 −600 −400 −200 0 200 400−500

0

500

1000

1500

2000

2500

3000

x1 (m)

x 2 (m

)

True trajectorySPFnCRPF−1KF

Fig. 2. Trajectory of the target and the estimates obtained

by the proposed CRPF-1KF and the SPFn methods.

4.2. Algorithms

We applied the algorithm proposed in the previous section

(the algorithm is labeled as CRPF-1KF to indicate that we

used one cost-reference particle filter and one Kalman fil-

ter) for the considered bearings-only tracking problem. For

comparison and benchmarking purposes, we also imple-

mented the following algorithms:

• A CRPF that, by imitating the traditional marginal-

ized PF, used one Kalman filter per particle (labeled

as CRPF-MKF to indicate that we used M Kalman

filters)

• A CRPF that made a wrong assumption by consider-

ing that there were no biases (labeled as CRPFn)

• A standard particle filter that considered complete

knowledge of the noise probability distributions and

of the biases (labeled as SPF), and therefore it did not

have to include them in the estimation problem

• A standard particle filter (labeled as SPFn) that

assumed (a) no biases and (b) wrong noise probability

distributions given by

ut ∼ N (0, 0.25I2)

wt ∼ N (0, 10−4I2).

The implemented CRPF algorithms used Gaussian

propagation4, N (0, 4I2), and λ = 0, q = 2 and μ(x) =1/x.

0 50 100 150 200 250 3000

0.2

0.4

0.6

0.8

t (s)

bias

0 50 100 150 200 250 30010

−4

10−2

100

102

t (s) st

d

true bias sensor 1estimated bias sensor 1

0 50 100 150 200 250 3000

0.1

0.2

0.3

0.4

t

bias

0 50 100 150 200 250 30010

−4

10−2

100

102

t

std

true bias sensor 2estimated bias sensor 2

Fig. 3. Estimates and standard deviations of the biases

obtained by CRPF-1KF. Top: results corresponding to the

bias of sensor 1, b1. Bottom: results corresponding to the

bias of sensor 2, b2.

4.3. Results

We simulated a scenario where the target evolved from the

origin for T = 300s with a sampling period of Ts = 1s and

zero initial velocity. The coordinates of the sensors were

placed at (l1,1, l2,1) = (−12000, 13000) and (l1,2, l2,2) =(10000, 15000), and their biases were set to b1 = 0.606and b2 = 0.225, respectively. In the implementation of the

4Note that other propagation distributions can be used [3].

405

Page 5: [IEEE 2007 IEEE Workshop on Machine Learning for Signal Processing - Thessaloniki, Greece (2007.08.27-2007.08.29)] 2007 IEEE Workshop on Machine Learning for Signal Processing - Learning

0 50 100 150 200 250 30010

0

101

102

103

104

105

t (s)

MS

E lo

catio

n

SPFnSPFCRPF−1K

0 50 100 150 200 250 30010

0

101

102

103

104

105

t (s)

MS

E lo

catio

n

CRPFnCRPF−MKCRPF−1K

Fig. 4. MSE in m2 for the position of the target obtained by the different methods.

various methods we used M = 500 particles and for the

methods which use KF we set b0 = 0 and Cb0 = 100I2.

Figure 2 shows the trajectories of the target and the

obtained estimates in the two-dimensional space resulting

from a single simulation of the dynamic system. To

demonstrate the robustness of the proposed method, we

also show the result obtained by the SPFn. It is clear

that the proposed algorithm remained locked to the state

trajectory while the SPFn had a poor performance. It is

important to remark that the proposed algorithm did not use

the probabilistic information about the system and still had

a very good performance.

Although the biases were considered nuisance parame-

ters for the estimation of the nonlinear state, the algorithm

can still provide their estimates (see step 5 of the previous

Section). Figure 3 illustrates the estimates of the biases and

the evolution of the standard deviation of the estimated bi-

ases, which as expected, decreased with time.

We also computed the mean-square error (MSE) of the

position of the target measured in square meters according

to the formula

MSEt =1

2

1

J

J∑j=1

[(xj

1,t − xj1,t)

2 + (xj2,t − xj

2,t)2]

where [xj1,t xj

2,t]� was the true position of the target at time

t in the j-th run, and [xj1,t xj

2,t]� was the corresponding

estimate obtained by the filter. The MSE plots were

computed by averaging J = 50 independent simulations.

Figure 4 depicts the obtained results. For clarity in the

presentation, we show a comparison of the proposed method

with other SPFs on the left and a comparison of the

proposed method with other CRPFs on the right. We clearly

see that the worst performance was achieved by the particle

filters that assumed there were no biases, i.e., SPFn and

CRPFn. The SPF, which assumed complete knowledge of

the biases, achieved the best performance and constituted

a lower bound of the estimation problem. The proposed

method showed a performance close to the bound and

very similar to the CRPF-MKF that used one Kalman filter

per particle. We reiterate that the new method had such

performance even though it did not use any probabilistic

assumption unlike the SPF. Note also that since the method

only uses one Kalman filter, its computational complexity is

considerably reduced compared to the CRPF-MKF .

5. CONCLUSIONS

We presented a new simplified cost-reference particle

filtering-based algorithm for online learning in problems

with biased measurements. The proposed method, unlike

standard statistical learning techniques, does not use any

knowledge about the probabilistic distributions of the noises

in the system. Besides, following the Rao-Blackwellization

philosophy, the new algorithm treats the biases as nuisance

parameters and marginalizes them out of the estimation

problem using only one Kalman filter. The validity of the

method was tested through computer simulations by ap-

plying it to a bearings-only tracking problem. The results

showed that the new method clearly outperforms the par-

ticle filter that does not assume biased sensors and sup-

poses wrong probabilistic information and is close to the

performance of the standard particle filter that has complete

knowledge of the biases. Furthermore, when compared to

the traditional implementation of Rao-Blackwellized cost-

reference particle filter, it performs practically the same,

406

Page 6: [IEEE 2007 IEEE Workshop on Machine Learning for Signal Processing - Thessaloniki, Greece (2007.08.27-2007.08.29)] 2007 IEEE Workshop on Machine Learning for Signal Processing - Learning

while at the same time it requires much less computations.

6. REFERENCES

[1] N. J. Gordon, D. J. Salmond, and A. F. M. Smith,

“Novel approach to nonlinear/non-Gaussian Bayesian

state estimation,” IEE Proceedings-F, vol. 140, no. 2,

pp. 107–113, 1993.

[2] A. Doucet, N. de Freitas, and N. Gordon, Eds., Sequen-

tial Monte Carlo Methods in Practice, Springer, New

York, 2001.

[3] J. Mıguez, M. F. Bugallo, and P. M. Djuric, “A new

class of particle filters for random dynamical systems

with unknown statistics,” EURASIP Journal on Applied

Signal Processing, vol. 2004(15), pp. 2278–2294, 2004.

[4] M. Pitt and N. Shepard, “Filtering via simulation:

auxiliary particle filters,” Journal of the American

Statistical Association, vol. 94, no. 446, pp. 590–599,

June 1999.

[5] C. P. Robert and G. Casella, Monte Carlo Statistical

Methods, Springer, New York, 1999.

[6] T. Schon, F. Gustafsson, and P. Nordlund, “Marginal-

ized particle filters for mixed linear/nonlinear state-

space models,” IEEE Transactions on Signal Process-

ing, vol. 50, pp. 2279–2289, 2005.

[7] M. F. Bugallo, T. Lu, and P. M. Djuric, “Tracking

with biased measurements of signal strength sensors,”

in Proceedings of the 15th International Conference on

Digital Signal Processing.

[8] M. F. Bugallo, T. Lu, and P. M. Djuric, “Biased sensors:

Localization and data fusion,” in Proceedings of the

15th European Signal Processing Conference.

[9] M. F. Bugallo, J. Mıguez, and P. M. Djuric, “Positioning

by cost reference particle filters: study of various imple-

mentations,” in Proceedings of the 2005 International

Conference on “Computer as a tool”, EUROCON, Bel-

grade, Serbia and Montenegro, 2005.

407