1
Estimators
SOLO HERMELIN
Updated: 22.02.09 17.06.14
http://www.solohermelin.com
2
Table of Content

Parameter Estimation
History
Optimal Parameter Estimate
Optimal Weighted Least-Square Estimate
Recursive Weighted Least Square Estimate (RWLS)
Markov Estimate
Maximum Likelihood Estimate (MLE)
Bayesian Maximum Likelihood Estimate (Maximum A-Posteriori – MAP Estimate)
The Cramér-Rao Lower Bound on the Variance of the Estimator
Kalman Filter Discrete Case
Properties of the Discrete Kalman Filter: (1) $E\left[ \hat{x}_{k+1|k+1}\, \tilde{x}_{k+1|k+1}^T \right] = 0$; (2) Innovation = White Noise for Kalman Filter Gain
Summary of Discrete Case Kalman Filter
Extended Kalman Filter
Unscented Kalman Filter
Kalman Filter Discrete Case & Colored Measurement Noise
Table of Content (continue – 1)
Optimal State Estimation in Linear Stationary Systems
Kalman Filter Continuous Time Case
Applications
Multi-sensor Estimate
Target Acceleration Models
Kalman Filter for Filtering Position and Velocity Measurements
α - β (2-D) Filter with Piecewise Constant White Noise Acceleration Model
Optimal Filtering
Continuous Filter-Smoother Algorithms
References
End of Estimation Presentation
Review of Probability
Random Variables
Matrices
Inner Product
Signals
4
Parameter Estimation

System: z = h(x, v), with parameter x, measurement z, and noise v.

Estimate the parameters x of a given system by using measurements z corrupted by noise v.

A parameter is a quantity (scalar or vector-valued) that is usually assumed to be time-invariant. If the parameter does change with time, it is designated a time-varying parameter, but its time variation is assumed slow relative to the system states. The estimation is performed on different measurements j = 1, ..., k that provide different results z(j) because of the random variables (noises) v(j):

$$ z(j) = h\left( x, v(j) \right), \qquad j = 1, \dots, k $$

We define the observation (information) vector as:

$$ Z^k := \left\{ z(j) \right\}_{j=1}^{k} = \left[ z(1) \cdots z(k) \right]^T $$

We want to find the estimate of x, given the measurements $Z^k$:

$$ \hat{x}(k) = \hat{x}\left( k, Z^k \right) $$

Assuming that the parameters x are observable (defined later) from the measurements, and given knowledge of the system h(x, v), the estimation of x will be done in some sense.
5
Desirable Properties of Estimators

1. Unbiased Estimator:
$$ E[\hat{x}(k)] = E\left[ \hat{x}(k, Z^k) \right] = x(k) $$

2. Consistent or Convergent Estimator:
$$ \lim_{k \to \infty} \mathrm{Prob}\left\{ \left[ \hat{x}(k) - x(k) \right]^T \left[ \hat{x}(k) - x(k) \right] > \varepsilon \right\} = 0 \qquad \forall\, \varepsilon > 0 $$

3. Efficient or Asymptotically Efficient Estimator, if for all unbiased estimators $\hat{\gamma}$:
$$ E\left\{ \left[ \hat{x}(k) - x(k) \right] \left[ \hat{x}(k) - x(k) \right]^T \right\} \le E\left\{ \left[ \hat{\gamma}(k) - x(k) \right] \left[ \hat{\gamma}(k) - x(k) \right]^T \right\} \qquad \text{for } k > K $$

4. Sufficient Estimator, if $\hat{x}(k)$ contains all the information in the set of observed values $Z^k$ regarding the parameter to be estimated.
Table of Content
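The first two properties can be checked numerically for the simplest estimator, the sample mean of k scalar measurements z(j) = x + v(j). This is a minimal sketch, not from the slides; the values x_true = 5.0 and sigma = 1.0 are arbitrary assumptions:

```python
import random

def sample_mean(zs):
    """Estimate x from measurements z(j) = x + v(j) by averaging."""
    return sum(zs) / len(zs)

random.seed(0)
x_true, sigma = 5.0, 1.0   # assumed true parameter and noise level

# Unbiasedness: averaging the estimator over many experiments recovers x.
grand = sum(sample_mean([random.gauss(x_true, sigma) for _ in range(10)])
            for _ in range(2000)) / 2000

# Consistency: the mean absolute error shrinks as k grows.
def mean_abs_err(k, trials=200):
    return sum(abs(sample_mean([random.gauss(x_true, sigma) for _ in range(k)])
                   - x_true) for _ in range(trials)) / trials

print(abs(grand - x_true) < 0.1)              # no systematic bias
print(mean_abs_err(1000) < mean_abs_err(10))  # error decreases with k
```

Both checks print True: the estimator has no systematic bias and its error shrinks as the number of measurements grows.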
6
History

Linear Estimation Theory is credited to Gauss, who, in 1798, at the age of 18, invented the method of Least Squares.

On January 1st, 1801, the Italian astronomer Giuseppe Piazzi discovered the asteroid Ceres and was able to track its path for 40 days before it was lost in the glare of the sun. Based on this data, it was desired to determine the location of Ceres after it emerged from behind the sun without solving Kepler's complicated nonlinear equations of planetary motion. The only predictions that successfully allowed the German astronomer Franz Xaver von Zach to relocate Ceres, on 7 December 1801, were those performed by the 24-year-old Gauss using least-squares analysis. However, Gauss did not publish the method until 1809, when it appeared in volume two of his work on celestial mechanics, "Theoria Motus Corporum Coelestium in sectionibus conicis solem ambientium".

Giuseppe Piazzi 1746 – 1826
Franz Xaver von Zach 1754 – 1832
Gauss' portrait published in Astronomische Nachrichten 1828
Johann Carl Friedrich Gauss 1777 – 1855
7
"In this work Gauss systematically developed the method of orbit calculation from three observations he had devised in 1801 to locate the planetoid Ceres, the earliest discovered of the 'asteroids,' which had been spotted and lost by G. Piazzi in January 1801. Gauss predicted where the planetoid would be found next, using improved numerical methods based on least squares, and a more accurate orbit theory based on the ellipse rather than the usual circular approximation. Gauss's calculations, completed in 1801, enabled the astronomer W. M. Olbers to find Ceres in the predicted position, a remarkable feat that cemented Gauss's reputation as a mathematical and scientific genius" (Norman 879).
http://www.19thcenturyshop.com/apps/catalogitem?id=84#
Theoria motus corporum coelestium (1809)
8
Sketch of the orbits of Ceres and Pallas, by Gauss
http://www.math.rutgers.edu/~cherlin/History/Papers1999/weiss.html
9
History
Legendre published a book on determining the orbits of comets in 1806. His method involved three observations taken at equal intervals, and he assumed that the comet followed a parabolic path, so that he ended up with more equations than there were unknowns. He applied his method to the data known for two comets. In an appendix, Legendre gave the least-squares method of fitting a curve to the available data. However, Gauss published his version of the least-squares method in 1809 and, while acknowledging that it appeared in Legendre's book, still claimed priority for himself. This greatly hurt Legendre, who fought for many years to have his priority recognized.

Adrien-Marie Legendre 1752 – 1833

The idea of least-squares analysis was independently formulated by the Frenchman Adrien-Marie Legendre in 1805 and the American Robert Adrain in 1808.

Robert Adrain 1775 – 1843

Legendre, A.M., "Nouvelles Méthodes pour la Détermination des Orbites des Comètes", Paris, 1806
10
History

Mark Grigorievich Krein 1907 – 1989
Andrey Nikolaevich Kolmogorov 1903 – 1987
Norbert Wiener 1894 – 1964

The first studies of minimum-mean-square estimation in stochastic processes were made by Kolmogorov (1939), Krein (1945) and Wiener (1949).

Kolmogorov, A.N., "Sur l'interpolation et extrapolation des suites stationnaires", C.R. Acad. Sci. Paris, vol. 208, 1939, pp. 2043–2045
Krein, M.G., "On a problem of extrapolation of A.N. Kolmogorov", C.R. (Dokl.) Akad. Nauk SSSR, vol. 46, 1945, pp. 306–309
Wiener, N., "Extrapolation, Interpolation and Smoothing of Stationary Time Series, with Engineering Applications", MIT Press, Cambridge, MA, 1949 (secret version 1942)

Kolmogorov developed a comprehensive treatment of the linear prediction problem for discrete-time stochastic processes.

Krein extended the results to continuous time by the clever use of the bilinear transformation.

Wiener independently formulated the continuous-time linear prediction problem and derived an explicit formula for the optimal predictor. Wiener also considered the filtering problem of estimating a process corrupted by additive noise.
11
Kalman Filter History

Rudolf E. Kalman 1920 –
Peter Swerling 1929 – 2000
Thorvald Nicolai Thiele 1838 – 1910
Stanley F. Schmidt 1926 –

The filter is named after Rudolf E. Kalman, though Thorvald Nicolai Thiele and Peter Swerling actually developed a similar algorithm earlier. Stanley F. Schmidt is generally credited with developing the first implementation of a Kalman filter. It was during a visit by Kalman to the NASA Ames Research Center that he saw the applicability of his ideas to the problem of trajectory estimation for the Apollo program, leading to its incorporation in the Apollo navigation computer. The filter was developed in papers by Swerling (1958), Kalman (1960), and Kalman and Bucy (1961).

The filter is sometimes called the Stratonovich–Kalman–Bucy filter because it is a special case of a more general, nonlinear filter developed earlier by Ruslan L. Stratonovich. In fact, the equations of the special-case linear filter appeared in papers by Stratonovich that were published before summer 1960, when Rudolf E. Kalman met Ruslan L. Stratonovich during a conference in Moscow. In control theory, the Kalman filter is most commonly referred to as the linear quadratic estimator (LQE).

Kalman, R.E., "A New Approach to Filtering and Prediction Problems", J. Basic Eng., March 1960, pp. 35–46
Kalman, R.E., Bucy, R.S., "New Results in Filtering and Prediction Theory", J. Basic Eng., March 1961, pp. 95–108

Table of Content
12
Optimal Parameter Estimate

System: z = H x + v.

The optimal procedure to estimate x depends on the amount of knowledge of the process that is initially available. The following estimators are known and are used as a function of the assumed initial knowledge available:

1. Weighted Least Square (WLS) & Recursive WLS: only a weighting matrix W is assumed.
2. Markov Estimator: the noise mean $\bar{v} = E[v]$ and variance $R = E\left[ (v - \bar{v})(v - \bar{v})^T \right]$ are known.
3. Maximum Likelihood Estimator: $p_{z|x}(Z|x) =: L(Z, x)$ is known.
4. Bayes Estimator: $p_{x,v}(x, v)$ or $p_{x|Z}(x|Z)$ is known.

The amount of assumed initial knowledge available on the process increases in this order.

Table of Content
13
Estimators for Static Systems

Optimal Weighted Least-Square Estimate

Assume that the set of p measurements can be expressed as a linear combination of the elements of a constant vector x plus a random, additive measurement error v:

$$ z = H\,x + v $$

$$ z = (z_1, \dots, z_p)^T, \qquad x = (x_1, \dots, x_n)^T, \qquad v = (v_1, \dots, v_p)^T $$

We want to find $\hat{x}_0$, the estimate of the constant vector x, that minimizes the cost function:

$$ J = (z - H x)^T W^{-1} (z - H x) = \| z - H x \|^2_{W^{-1}} $$

W is a Hermitian ($W^H = W$, where $^H$ stands for complex-conjugate transpose), positive-definite weighting matrix.

The $\hat{x}_0$ that minimizes J is obtained by solving:

$$ \nabla_x J = \frac{\partial J}{\partial x} = -2 H^T W^{-1} \left( z - H \hat{x}_0 \right) = 0 $$

$$ \hat{x}_0 = \left( H^T W^{-1} H \right)^{-1} H^T W^{-1} z $$

This solution minimizes J if and only if

$$ \frac{\partial^2 J}{\partial x^2} = 2 H^T W^{-1} H > 0 $$

i.e., the matrix $H^T W^{-1} H$ is positive definite.
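For a single unknown (n = 1) the matrix formula $\hat{x}_0 = (H^T W^{-1} H)^{-1} H^T W^{-1} z$ reduces to scalar sums, which allows a compact sanity check. A minimal sketch, assuming a diagonal W and made-up values for h, z and the weights:

```python
# Scalar-parameter weighted least squares: z_j = h_j * x + v_j.
def wls_scalar(z, h, w):
    """Return x_hat minimizing sum_j (z_j - h_j*x)**2 / w_j."""
    info = sum(hj * hj / wj for hj, wj in zip(h, w))          # H^T W^-1 H
    proj = sum(hj * zj / wj for hj, zj, wj in zip(h, z, w))   # H^T W^-1 z
    return proj / info

# Noise-free check: measurements generated by x = 3 are recovered exactly.
h = [1.0, 2.0, 0.5]
z = [3.0 * hj for hj in h]
w = [1.0, 4.0, 0.25]          # arbitrary positive weights
print(wls_scalar(z, h, w))    # -> 3.0
```

With noise-free data the estimate equals the generating parameter for any positive weights, since the residual can be driven to zero.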
14
Optimal Weighted Least-Square Estimate (continue – 1)

$$ \hat{x}_0 = \left( H^T W^{-1} H \right)^{-1} H^T W^{-1} z $$

Since $z = H x + v$ is random with mean (using $E[v] = 0$)

$$ E[z] = E[H x + v] = H x $$

the estimate $\hat{x}_0$ is also random, with mean:

$$ E[\hat{x}_0] = \left( H^T W^{-1} H \right)^{-1} H^T W^{-1} E[z] = \left( H^T W^{-1} H \right)^{-1} H^T W^{-1} H\, x = x $$

Since the mean of the estimate is equal to the estimated parameter, the estimator is unbiased.

Using $H^T W^{-1} z = H^T W^{-1} H \hat{x}_0$ we want to find the minimum value of J:

$$ J^* = (z - H \hat{x}_0)^T W^{-1} (z - H \hat{x}_0) = z^T W^{-1} z - 2\, \hat{x}_0^T H^T W^{-1} z + \hat{x}_0^T H^T W^{-1} H \hat{x}_0 = z^T W^{-1} z - \hat{x}_0^T H^T W^{-1} H \hat{x}_0 $$

that is,

$$ J^* = \| z - H \hat{x}_0 \|^2_{W^{-1}} = \| z \|^2_{W^{-1}} - \| H \hat{x}_0 \|^2_{W^{-1}} $$
15
Optimal Weighted Least-Square Estimate (continue – 2)

$$ J^* = \| z - H \hat{x}_0 \|^2_{W^{-1}} = \| z \|^2_{W^{-1}} - \| H \hat{x}_0 \|^2_{W^{-1}} $$

where $\| a \|^2_{W^{-1}} := a^T W^{-1} a$ is a norm.

Using $H^T W^{-1} z = H^T W^{-1} H \hat{x}_0$ we obtain:

$$ \left( H \hat{x}_0 \right)^T W^{-1} \left( z - H \hat{x}_0 \right) = \hat{x}_0^T H^T W^{-1} z - \hat{x}_0^T H^T W^{-1} H \hat{x}_0 = 0 $$

This suggests the definition of an inner product of two vectors a and b (relative to the weighting matrix W) as:

$$ \left\langle a, b \right\rangle_{W^{-1}} := a^T W^{-1} b $$

Projection Theorem

The Optimal Estimate $\hat{x}_0$ is such that $H \hat{x}_0$ is the projection (relative to the weighting matrix W) of z on the H x plane.

Table of Content
16
Optimal Weighted Least-Square Estimate (continue – 3)

Substituting $z = H x + v$ into $\hat{x}_0 = \left( H^T W^{-1} H \right)^{-1} H^T W^{-1} z$ gives the estimation error:

$$ \hat{x}_0 - x = \left( H^T W^{-1} H \right)^{-1} H^T W^{-1} (H x + v) - x = \left( H^T W^{-1} H \right)^{-1} H^T W^{-1} v $$

Table of Content
18
Recursive Weighted Least Square Estimate (RWLS)

Assume that a set of N measurements can be expressed as a linear combination of the elements of a constant vector x plus a random, additive measurement error $v_0$:

$$ z_0 = H_0 x + v_0 $$

We found that the optimal estimator $\hat{x}(-)$ that minimizes the cost function

$$ J_0 = (z_0 - H_0 x)^T W_0^{-1} (z_0 - H_0 x) = \| z_0 - H_0 x \|^2_{W_0^{-1}} $$

is

$$ \hat{x}(-) = \left( H_0^T W_0^{-1} H_0 \right)^{-1} H_0^T W_0^{-1} z_0 $$

An additional measurement set, $z = H x + v$, is obtained, and we want to find the optimal estimator $\hat{x}(+)$.

Define the following matrices for the complete measurement set:

$$ \tilde{H} := \begin{bmatrix} H_0 \\ H \end{bmatrix}, \qquad \tilde{z} := \begin{bmatrix} z_0 \\ z \end{bmatrix}, \qquad \tilde{W} := \begin{bmatrix} W_0 & 0 \\ 0 & W \end{bmatrix}, \qquad P(-) := \left( H_0^T W_0^{-1} H_0 \right)^{-1} $$

Therefore:

$$ \hat{x}(-) = P(-)\, H_0^T W_0^{-1} z_0, \qquad \hat{x}(+) = \left( \tilde{H}^T \tilde{W}^{-1} \tilde{H} \right)^{-1} \tilde{H}^T \tilde{W}^{-1} \tilde{z} $$
19
Recursive Weighted Least Square Estimate (RWLS) (continue – 1)

$$ P(-) := \left( H_0^T W_0^{-1} H_0 \right)^{-1}, \qquad \hat{x}(-) = P(-)\, H_0^T W_0^{-1} z_0 $$

$$ \hat{x}(+) = \left( \begin{bmatrix} H_0^T & H^T \end{bmatrix} \begin{bmatrix} W_0^{-1} & 0 \\ 0 & W^{-1} \end{bmatrix} \begin{bmatrix} H_0 \\ H \end{bmatrix} \right)^{-1} \begin{bmatrix} H_0^T & H^T \end{bmatrix} \begin{bmatrix} W_0^{-1} & 0 \\ 0 & W^{-1} \end{bmatrix} \begin{bmatrix} z_0 \\ z \end{bmatrix} = \left( H_0^T W_0^{-1} H_0 + H^T W^{-1} H \right)^{-1} \left( H_0^T W_0^{-1} z_0 + H^T W^{-1} z \right) $$

Define

$$ P(+)^{-1} := H_0^T W_0^{-1} H_0 + H^T W^{-1} H = P(-)^{-1} + H^T W^{-1} H $$

By the Matrix Inversion Lemma:

$$ P(+) = \left[ P(-)^{-1} + H^T W^{-1} H \right]^{-1} = P(-) - P(-) H^T \left[ H P(-) H^T + W \right]^{-1} H P(-) $$

$$ P(+) H^T W^{-1} = \left\{ P(-) - P(-) H^T \left[ H P(-) H^T + W \right]^{-1} H P(-) \right\} H^T W^{-1} = P(-) H^T \left[ H P(-) H^T + W \right]^{-1} $$

Therefore:

$$ \hat{x}(+) = P(+) \left( H_0^T W_0^{-1} z_0 + H^T W^{-1} z \right) = \left\{ P(-) - P(-) H^T \left[ H P(-) H^T + W \right]^{-1} H P(-) \right\} \left( P(-)^{-1} \hat{x}(-) + H^T W^{-1} z \right) $$
20
Recursive Weighted Least Square Estimate (RWLS) (continue – 2)

Expanding $\hat{x}(+) = P(+) \left( H_0^T W_0^{-1} z_0 + H^T W^{-1} z \right)$ and collecting terms:

$$ \hat{x}(+) = \hat{x}(-) + P(-) H^T \left[ H P(-) H^T + W \right]^{-1} \left[ z - H \hat{x}(-) \right] = \hat{x}(-) + P(+) H^T W^{-1} \left[ z - H \hat{x}(-) \right] $$

Summarizing the Recursive Weighted Least Square Estimate (RWLS):

$$ \hat{x}(-) = P(-)\, H_0^T W_0^{-1} z_0 $$
$$ P(+)^{-1} = P(-)^{-1} + H^T W^{-1} H $$
$$ \hat{x}(+) = \hat{x}(-) + P(+) H^T W^{-1} \left[ z - H \hat{x}(-) \right] $$

Block diagram: each new measurement set z enters the Estimator through the gain $P(+) H^T W^{-1}$, and the updated estimate $\hat{x}(+)$ is fed back through a Delay to become the prior estimate $\hat{x}(-)$ for the next step.
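The recursion above can be exercised in the scalar case, where P, W and H are numbers; a useful check is that processing the measurements one at a time reproduces the batch WLS answer. A minimal sketch with made-up measurement values:

```python
# Scalar RWLS sketch: 1/P(+) = 1/P(-) + h*h/w,
# x(+) = x(-) + P(+) * (h/w) * (z - h*x(-)).
def rwls_update(x_prev, P_prev, z, h, w):
    P_new = 1.0 / (1.0 / P_prev + h * h / w)
    x_new = x_prev + P_new * (h / w) * (z - h * x_prev)
    return x_new, P_new

# Process three measurements of x recursively...
z = [2.9, 3.2, 3.05]
h = [1.0, 1.0, 1.0]
w = [1.0, 1.0, 1.0]
x_hat, P = z[0], w[0]          # initialize from the first measurement
for zj, hj, wj in zip(z[1:], h[1:], w[1:]):
    x_hat, P = rwls_update(x_hat, P, zj, hj, wj)

# ...and compare with the batch WLS answer on the full set.
batch = sum(hj * zj / wj for hj, zj, wj in zip(h, z, w)) / \
        sum(hj * hj / wj for hj, wj in zip(h, w))
print(abs(x_hat - batch) < 1e-12)   # recursive == batch
```

With unit h and w this reduces to a running average, and the recursive result matches the batch estimate exactly (the check prints True).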
21
Recursive Weighted Least Square Estimate (RWLS) (continue – 3)

Second Way

With $\tilde{z} = \begin{bmatrix} z_0 \\ z \end{bmatrix}$, $\tilde{H} = \begin{bmatrix} H_0 \\ H \end{bmatrix}$, $\tilde{W} = \begin{bmatrix} W_0 & 0 \\ 0 & W \end{bmatrix}$, the cost function splits as:

$$ J = (\tilde{z} - \tilde{H} x)^T \tilde{W}^{-1} (\tilde{z} - \tilde{H} x) = (z_0 - H_0 x)^T W_0^{-1} (z_0 - H_0 x) + (z - H x)^T W^{-1} (z - H x) $$

We want to prove that, with $P(-)^{-1} := H_0^T W_0^{-1} H_0$ and $\hat{x}(-) := P(-)\, H_0^T W_0^{-1} z_0$,

$$ (z_0 - H_0 x)^T W_0^{-1} (z_0 - H_0 x) = \left[ x - \hat{x}(-) \right]^T P(-)^{-1} \left[ x - \hat{x}(-) \right] $$

Therefore

$$ J = \left[ x - \hat{x}(-) \right]^T P(-)^{-1} \left[ x - \hat{x}(-) \right] + (z - H x)^T W^{-1} (z - H x) $$
22
Recursive Weighted Least Square Estimate (RWLS) (continue – 4)

Second Way (continue – 1)

We want to prove that

$$ (z_0 - H_0 x)^T W_0^{-1} (z_0 - H_0 x) = \left[ x - \hat{x}(-) \right]^T P(-)^{-1} \left[ x - \hat{x}(-) \right] $$

where $P(-)^{-1} := H_0^T W_0^{-1} H_0$ and $\hat{x}(-) := P(-)\, H_0^T W_0^{-1} z_0$, so that

$$ H_0^T W_0^{-1} z_0 = P(-)^{-1} \hat{x}(-), \qquad z_0^T W_0^{-1} H_0 = \hat{x}(-)^T P(-)^{-1} $$

Expand the left side:

$$ (z_0 - H_0 x)^T W_0^{-1} (z_0 - H_0 x) = z_0^T W_0^{-1} z_0 - z_0^T W_0^{-1} H_0 x - x^T H_0^T W_0^{-1} z_0 + x^T H_0^T W_0^{-1} H_0 x $$

Expand the right side:

$$ \left[ x - \hat{x}(-) \right]^T P(-)^{-1} \left[ x - \hat{x}(-) \right] = x^T P(-)^{-1} x - x^T P(-)^{-1} \hat{x}(-) - \hat{x}(-)^T P(-)^{-1} x + \hat{x}(-)^T P(-)^{-1} \hat{x}(-) $$

Matching terms, $x^T P(-)^{-1} x = x^T H_0^T W_0^{-1} H_0 x$ and $x^T P(-)^{-1} \hat{x}(-) = x^T H_0^T W_0^{-1} z_0$.

Estimators for Static Systems
23
Recursive Weighted Least Square Estimate (RWLS) (continue – 5)

Second Way (continue – 2)

It remains to show that

$$ \hat{x}(-)^T P(-)^{-1} \hat{x}(-) = z_0^T W_0^{-1} H_0\, P(-)\, H_0^T W_0^{-1} z_0 = z_0^T W_0^{-1} z_0 $$

Use the identity (a form of the Matrix Inversion Lemma):

$$ W_0^{-1} - W_0^{-1} H_0 \left( H_0^T W_0^{-1} H_0 + \varepsilon I \right)^{-1} H_0^T W_0^{-1} \equiv \left( W_0 + \tfrac{1}{\varepsilon} H_0 H_0^T \right)^{-1} $$

and take the limit $\varepsilon \to 0$:

$$ \lim_{\varepsilon \to 0} \left[ W_0^{-1} - W_0^{-1} H_0 \left( H_0^T W_0^{-1} H_0 + \varepsilon I \right)^{-1} H_0^T W_0^{-1} \right] = \lim_{\varepsilon \to 0} \left( W_0 + \tfrac{1}{\varepsilon} H_0 H_0^T \right)^{-1} = 0 $$

so that

$$ W_0^{-1} H_0 \left( H_0^T W_0^{-1} H_0 \right)^{-1} H_0^T W_0^{-1} = W_0^{-1} H_0\, P(-)\, H_0^T W_0^{-1} = W_0^{-1} $$

and therefore

$$ \hat{x}(-)^T P(-)^{-1} \hat{x}(-) = z_0^T W_0^{-1} H_0\, P(-)\, H_0^T W_0^{-1} z_0 = z_0^T W_0^{-1} z_0 \qquad \text{q.e.d.} $$

Estimators for Static Systems
24
Recursive Weighted Least Square Estimate (RWLS) (continue – 6)

Second Way (continue – 3)

Choose $\hat{x}$ that minimizes the scalar cost function

$$ J = \left[ x - \hat{x}(-) \right]^T P(-)^{-1} \left[ x - \hat{x}(-) \right] + (z - H x)^T W^{-1} (z - H x) $$

Solution:

$$ \frac{\partial J}{\partial x}\bigg|_{x^*} = 2\, P(-)^{-1} \left[ x^* - \hat{x}(-) \right] - 2\, H^T W^{-1} \left( z - H x^* \right) = 0 $$

Define $P(+)^{-1} := P(-)^{-1} + H^T W^{-1} H$. Then:

$$ P(+)^{-1} x^* = P(-)^{-1} \hat{x}(-) + H^T W^{-1} z = P(+)^{-1} \hat{x}(-) + H^T W^{-1} \left[ z - H \hat{x}(-) \right] $$

$$ \hat{x}(+) = x^* = \hat{x}(-) + P(+) H^T W^{-1} \left[ z - H \hat{x}(-) \right] $$

$$ \frac{\partial^2 J}{\partial x^2} = 2\, P(-)^{-1} + 2\, H^T W^{-1} H = 2\, P(+)^{-1} $$

If $P(+)^{-1}$ is a positive definite matrix, then $x^*$ is a minimum solution.

Estimators for Static Systems
25
Recursive Weighted Least Square Estimate (RWLS) (continue – 7)

$$ J = (z - H x)^T W^{-1} (z - H x) = \| z - H x \|^2_{W^{-1}} $$

How to choose W?

1. For W = I (the Identity Matrix) we have the Least-Square Estimator (LSE).

2. If x(i) is not constant, we can use either one step of measurement or, if we assume that x(i) changes continuously, we can choose

$$ W = \begin{bmatrix} \lambda^{k} & & & \\ & \lambda^{k-1} & & \\ & & \ddots & \\ & & & 1 \end{bmatrix}, \qquad 0 < \lambda < 1 $$

where λ is the fading factor.

Table of Content
26
Markov Estimate

For the particular vector measurement equation

$$ z_0 = H_0 x + v $$

where, for the measurement noise v, we know the mean $\bar{v} = E[v]$ and the variance $R = E\left[ (v - \bar{v})(v - \bar{v})^T \right]$, we choose W = R in the WLS and obtain:

$$ \hat{x} = \left( H_0^T R^{-1} H_0 \right)^{-1} H_0^T R^{-1} z_0 $$

In the Recursive WLS we obtain, for a new observation $z = H x + v$:

$$ P(-)^{-1} := H_0^T R^{-1} H_0 $$
$$ P(+)^{-1} = P(-)^{-1} + H^T R^{-1} H $$
$$ \hat{x}(+) = \hat{x}(-) + P(+) H^T R^{-1} \left[ z - H \hat{x}(-) \right] $$

RWLS with W = R = Markov Estimate

Table of Content
27
Maximum Likelihood Estimate (MLE)

For the particular vector measurement equation

$$ z = H x + v $$

where the measurement noise v is Gaussian (normal) with zero mean, $v \sim N(0, R)$, and independent of x, the conditional probability can be written, using Bayes rule, as:

$$ p_{z|x}(z|x) = \frac{ p_{x,z}(x, z) }{ p_x(x) } $$

The measurement noise can be related to x and z by the function:

$$ v = z - H x = f(x, z) $$

The joint densities are related through the Jacobian of this transformation:

$$ p_{x,z}(x, z) = p_{x,v}(x, v)\, |J|, \qquad J = \frac{\partial f}{\partial z} = \begin{bmatrix} \partial f_1 / \partial z_1 & \cdots & \partial f_1 / \partial z_p \\ \vdots & & \vdots \\ \partial f_p / \partial z_1 & \cdots & \partial f_p / \partial z_p \end{bmatrix} = I_p $$

Since the measurement noise v is independent of x:

$$ p_{x,z}(x, z) = p_{x,v}(x, v) = p_x(x) \cdot p_v(v) $$
28
Maximum Likelihood Estimate (continue – 1)

$$ p_{z|x}(z|x) = \frac{ p_{x,z}(x, z) }{ p_x(x) } = p_v(v)\Big|_{v = z - Hx} = \frac{1}{(2\pi)^{p/2} |R|^{1/2}} \exp\left[ -\tfrac{1}{2}\, v^T R^{-1} v \right] $$

since v is Gaussian (normal) with zero mean. Hence

$$ p_{z|x}(z|x) = \frac{1}{(2\pi)^{p/2} |R|^{1/2}} \exp\left[ -\tfrac{1}{2} (z - Hx)^T R^{-1} (z - Hx) \right] $$

$$ \max_x p_{z|x}(z|x) \;\Leftrightarrow\; \min_x (z - Hx)^T R^{-1} (z - Hx) \;\Rightarrow\; \text{WLS with } W = R $$

$$ \frac{\partial}{\partial x} \left[ (z - Hx)^T R^{-1} (z - Hx) \right] = -2\, H^T R^{-1} (z - Hx) = 0 $$

$$ H^T R^{-1} z - H^T R^{-1} H x^* = 0 \;\Rightarrow\; \hat{x} = x^* = \left( H^T R^{-1} H \right)^{-1} H^T R^{-1} z $$

$$ \frac{\partial^2}{\partial x^2} \left[ (z - Hx)^T R^{-1} (z - Hx) \right] = 2\, H^T R^{-1} H $$

This is a positive definite matrix; therefore the solution $x^*$ minimizes $(z - Hx)^T R^{-1} (z - Hx)$ and maximizes $p_{z|x}(z|x)$.

$$ L(z, x) := p_{z|x}(z|x) $$

is called the Likelihood Function and is a measure of how likely the parameter x is, given the observation z.
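For scalar measurements $z_j = x + v_j$ with $v_j \sim N(0, \sigma^2)$, maximizing the likelihood is equivalent to minimizing $\sum_j (z_j - x)^2 / \sigma^2$, so the MLE is the sample mean. A minimal sketch with assumed data values:

```python
import math

# MLE for z_j = x + v_j, v_j ~ N(0, sigma^2): the maximizer of L(z, x)
# is the sample mean.
def gaussian_mle(z):
    return sum(z) / len(z)

def log_likelihood(z, x, sigma):
    """ln L(z, x) for independent Gaussian measurements."""
    return sum(-0.5 * math.log(2 * math.pi * sigma**2)
               - (zj - x)**2 / (2 * sigma**2) for zj in z)

z = [1.8, 2.1, 2.4, 1.9]
x_hat = gaussian_mle(z)        # 2.05
# The likelihood at x_hat is no smaller than at nearby values of x.
print(log_likelihood(z, x_hat, 1.0) >= log_likelihood(z, x_hat + 0.3, 1.0))
print(log_likelihood(z, x_hat, 1.0) >= log_likelihood(z, x_hat - 0.3, 1.0))
```

Both comparisons print True, confirming that the sample mean sits at the peak of the (log-)likelihood.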
29
Maximum Likelihood Estimate (continue – 2)

$$ L(z, x) := p_{z|x}(z|x) $$

is called the Likelihood Function and is a measure of how likely the parameter x is, given the observation z.

Fisher, Sir Ronald Aylmer 1890 – 1962

R.A. Fisher first used the term Likelihood. His reason for the term likelihood function was that if the observation is $Z = z$ and $L(z, x_1) > L(z, x_2)$, then it is more likely that the true value of X is $x_1$ than $x_2$.

Estimators for Static Systems
30
Bayesian Maximum Likelihood Estimate (Maximum A-Posteriori – MAP Estimate)

Consider a Gaussian vector $x \sim N\left( \hat{x}(-), P(-) \right)$ and the measurement $z = H x + v$, where the Gaussian noise $v \sim N(0, R)$ is independent of x.

$$ p_x(x) = \frac{1}{(2\pi)^{n/2} |P(-)|^{1/2}} \exp\left\{ -\tfrac{1}{2} \left[ x - \hat{x}(-) \right]^T P(-)^{-1} \left[ x - \hat{x}(-) \right] \right\} $$

$$ p_{z|x}(z|x) = p_v(z - Hx) = \frac{1}{(2\pi)^{p/2} |R|^{1/2}} \exp\left[ -\tfrac{1}{2} (z - Hx)^T R^{-1} (z - Hx) \right] $$

$$ p_z(z) = \int_{-\infty}^{+\infty} p_{z,x}(z, x)\, dx = \int_{-\infty}^{+\infty} p_{z|x}(z|x)\, p_x(x)\, dx $$

$p_z(z)$ is Gaussian, with

$$ E[z] = E[H x + v] = H\, E[x] + \underbrace{E[v]}_{0} = H \hat{x}(-) $$

$$ \mathrm{cov}[z] = E\left\{ \left[ z - E(z) \right] \left[ z - E(z) \right]^T \right\} = H P(-) H^T + R $$

$$ p_z(z) = \frac{1}{(2\pi)^{p/2} \left| H P(-) H^T + R \right|^{1/2}} \exp\left\{ -\tfrac{1}{2} \left[ z - H \hat{x}(-) \right]^T \left[ H P(-) H^T + R \right]^{-1} \left[ z - H \hat{x}(-) \right] \right\} $$
31
Bayesian Maximum Likelihood Estimate (Maximum A-Posteriori Estimate) (continue – 1)

By Bayes rule, the a-posteriori density is

$$ p_{x|z}(x|z) = \frac{ p_{z|x}(z|x)\, p_x(x) }{ p_z(z) } = \frac{ \left| H P(-) H^T + R \right|^{1/2} }{ (2\pi)^{n/2} |R|^{1/2} |P(-)|^{1/2} } \cdot \exp\left\{ -\tfrac{1}{2} (z - Hx)^T R^{-1} (z - Hx) - \tfrac{1}{2} \left[ x - \hat{x}(-) \right]^T P(-)^{-1} \left[ x - \hat{x}(-) \right] + \tfrac{1}{2} \left[ z - H \hat{x}(-) \right]^T \left[ H P(-) H^T + R \right]^{-1} \left[ z - H \hat{x}(-) \right] \right\} $$

from which the exponent must be rearranged as a quadratic form in x.
32
Bayesian Maximum Likelihood Estimate (Maximum A-Posteriori Estimate) (continue – 2)

Define:

$$ P(+)^{-1} := P(-)^{-1} + H^T R^{-1} H $$

Completing the square in the exponent, and using the Matrix Inversion Lemma in the form

$$ \left[ H P(-) H^T + R \right]^{-1} = R^{-1} - R^{-1} H \left[ P(-)^{-1} + H^T R^{-1} H \right]^{-1} H^T R^{-1} $$

we obtain the identity

$$ (z - Hx)^T R^{-1} (z - Hx) + \left[ x - \hat{x}(-) \right]^T P(-)^{-1} \left[ x - \hat{x}(-) \right] - \left[ z - H \hat{x}(-) \right]^T \left[ H P(-) H^T + R \right]^{-1} \left[ z - H \hat{x}(-) \right] = \left[ x - \hat{x}(-) - P(+) H^T R^{-1} \left( z - H \hat{x}(-) \right) \right]^T P(+)^{-1} \left[ x - \hat{x}(-) - P(+) H^T R^{-1} \left( z - H \hat{x}(-) \right) \right] $$

Therefore

$$ p_{x|z}(x|z) = \frac{1}{(2\pi)^{n/2} |P(+)|^{1/2}} \exp\left\{ -\tfrac{1}{2} \left[ x - \hat{x}(-) - P(+) H^T R^{-1} \left( z - H \hat{x}(-) \right) \right]^T P(+)^{-1} \left[ x - \hat{x}(-) - P(+) H^T R^{-1} \left( z - H \hat{x}(-) \right) \right] \right\} $$
33
Bayesian Maximum Likelihood Estimate (Maximum A-Posteriori Estimate) (continue – 3)

where

$$ P(+)^{-1} := P(-)^{-1} + H^T R^{-1} H $$

The MAP estimate maximizes the a-posteriori density:

$$ \max_x p_{x|z}(x|z) \;\Rightarrow\; \hat{x}(+) = x^* = \hat{x}(-) + P(+) H^T R^{-1} \left[ z - H \hat{x}(-) \right] $$

Table of Content
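In the scalar case with H = 1, the MAP update blends the prior and the measurement in proportion to their precisions. A minimal sketch; the prior and measurement values are assumptions for illustration:

```python
# Scalar MAP sketch: prior x ~ N(x_prior, P_prior), measurement z = x + v,
# v ~ N(0, R).  Update: 1/P(+) = 1/P(-) + 1/R,
# x(+) = x(-) + P(+) * (z - x(-)) / R     (H = 1).
def map_update(x_prior, P_prior, z, R):
    P_post = 1.0 / (1.0 / P_prior + 1.0 / R)
    x_post = x_prior + P_post * (z - x_prior) / R
    return x_post, P_post

x_post, P_post = map_update(x_prior=0.0, P_prior=4.0, z=2.0, R=1.0)
print(x_post)   # 1.6 : pulled toward the more precise measurement
print(P_post)   # 0.8 : posterior variance below both 4.0 and 1.0
```

Because the measurement variance (1.0) is much smaller than the prior variance (4.0), the posterior estimate lands close to the measurement, and the posterior variance is smaller than either source alone.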
34
The Cramér-Rao Lower Bound (CRLB) on the Variance of the Estimator

The estimate $\hat{x}$ of x, using the measurements z of a system corrupted by noise v, is a random variable with:

$$ E[\hat{x}] \quad \text{(the estimated mean vector)} $$

$$ \sigma_{\hat{x}}^2 = E\left\{ \left[ \hat{x} - E(\hat{x}) \right] \left[ \hat{x} - E(\hat{x}) \right]^T \right\} = E\left[ \hat{x} \hat{x}^T \right] - E[\hat{x}]\, E[\hat{x}]^T \quad \text{(the estimated variance matrix)} $$

For a good estimator we want:

$$ E[\hat{x}] = x \quad \text{(unbiased estimator)} $$

$$ \sigma_{\hat{x}}^2 = E\left[ \hat{x} \hat{x}^T \right] - E[\hat{x}]\, E[\hat{x}]^T \quad \text{(minimum estimation variance)} $$

We have:

$$ z = (z_1, \dots, z_p)^T, \qquad x = (x_1, \dots, x_n)^T, \qquad v = (v_1, \dots, v_p)^T $$

$$ Z^k := \left[ z(1) \cdots z(k) \right]^T \quad \text{(the observation matrix after k observations)} $$

$$ L(Z^k, x) = L\left( z(1), \dots, z(k), x \right) \quad \text{(the Likelihood, i.e., the joint density function of } Z^k \text{)} $$

$$ L(Z^k, x) = p_{z|x}(Z^k | x) = \int p_{z|v}(Z^k | v; x)\, p_v(v)\, dv $$

Therefore:

$$ E\left[ \hat{x}(Z^k) \right] = \int \hat{x}(Z^k)\, L(Z^k, x)\, dZ^k = x + b(x) $$

where b(x) is the estimator bias.
35
The Cramér-Rao Lower Bound on the Variance of the Estimator (continue – 1)

We have:

$$ E\left[ \hat{x}(Z^k) \right] = \int \hat{x}(Z^k)\, L(Z^k, x)\, dZ^k = x + b(x) $$

Differentiating with respect to x:

$$ \frac{\partial}{\partial x} E\left[ \hat{x}(Z^k) \right] = \int \hat{x}(Z^k)\, \frac{\partial L(Z^k, x)}{\partial x}\, dZ^k = 1 + \frac{\partial b(x)}{\partial x} $$

Since $L(Z^k, x)$ is a joint density function, we have:

$$ \int L(Z^k, x)\, dZ^k = 1 \;\Rightarrow\; \int \frac{\partial L(Z^k, x)}{\partial x}\, dZ^k = \frac{\partial}{\partial x} \int L(Z^k, x)\, dZ^k = 0 \;\Rightarrow\; \int x\, \frac{\partial L(Z^k, x)}{\partial x}\, dZ^k = 0 $$

Subtracting the two results:

$$ \int \left[ \hat{x}(Z^k) - x \right] \frac{\partial L(Z^k, x)}{\partial x}\, dZ^k = 1 + \frac{\partial b(x)}{\partial x} $$

Using the fact that

$$ \frac{\partial L(Z^k, x)}{\partial x} = L(Z^k, x)\, \frac{\partial \ln L(Z^k, x)}{\partial x} $$

we obtain:

$$ \int \left[ \hat{x}(Z^k) - x \right] L(Z^k, x)\, \frac{\partial \ln L(Z^k, x)}{\partial x}\, dZ^k = 1 + \frac{\partial b(x)}{\partial x} $$
36
The Cramér-Rao Lower Bound on the Variance of the Estimator (continue – 2)

$$ \int \left[ \hat{x}(Z^k) - x \right] L(Z^k, x)\, \frac{\partial \ln L(Z^k, x)}{\partial x}\, dZ^k = 1 + \frac{\partial b(x)}{\partial x} $$

Hermann Amandus Schwarz 1843 – 1921

Let us use the Schwarz Inequality:

$$ \left[ \int f(t)\, g(t)\, dt \right]^2 \le \int f^2(t)\, dt \cdot \int g^2(t)\, dt $$

The equality occurs if and only if f(t) = k g(t). Choose:

$$ f := \left[ \hat{x}(Z^k) - x \right] \sqrt{ L(Z^k, x) }, \qquad g := \frac{\partial \ln L(Z^k, x)}{\partial x} \sqrt{ L(Z^k, x) } $$

Then

$$ \left[ 1 + \frac{\partial b(x)}{\partial x} \right]^2 = \left[ \int \left( \hat{x}(Z^k) - x \right) \frac{\partial \ln L}{\partial x}\, L\, dZ^k \right]^2 \le \int \left[ \hat{x}(Z^k) - x \right]^2 L\, dZ^k \cdot \int \left[ \frac{\partial \ln L}{\partial x} \right]^2 L\, dZ^k $$

so that

$$ \int \left[ \hat{x}(Z^k) - x \right]^2 L(Z^k, x)\, dZ^k \;\ge\; \frac{ \left[ 1 + \dfrac{\partial b(x)}{\partial x} \right]^2 }{ \displaystyle\int \left[ \frac{\partial \ln L(Z^k, x)}{\partial x} \right]^2 L(Z^k, x)\, dZ^k } $$
37
The Cramér-Rao Lower Bound on the Variance of the Estimator (continue – 3)

$$ \int \left[ \hat{x}(Z^k) - x \right]^2 L(Z^k, x)\, dZ^k \;\ge\; \frac{ \left[ 1 + \dfrac{\partial b(x)}{\partial x} \right]^2 }{ \displaystyle\int \left[ \frac{\partial \ln L(Z^k, x)}{\partial x} \right]^2 L(Z^k, x)\, dZ^k } $$

This is the Cramér-Rao bound for a biased estimator.

Harald Cramér 1893 – 1985
Calyampudi Radhakrishna Rao 1920 –

Using $E[\hat{x}(Z^k)] = x + b(x)$ and $\int L(Z^k, x)\, dZ^k = 1$:

$$ \int \left[ \hat{x}(Z^k) - x \right]^2 L\, dZ^k = \int \left[ \hat{x} - E(\hat{x}) \right]^2 L\, dZ^k + 2\, b(x) \underbrace{ \int \left[ \hat{x} - E(\hat{x}) \right] L\, dZ^k }_{0} + b^2(x) \underbrace{ \int L\, dZ^k }_{1} $$

Therefore

$$ \sigma_{\hat{x}}^2 = \int \left[ \hat{x}(Z^k) - E\left( \hat{x}(Z^k) \right) \right]^2 L(Z^k, x)\, dZ^k \;\ge\; \frac{ \left[ 1 + \dfrac{\partial b(x)}{\partial x} \right]^2 }{ \displaystyle\int \left[ \frac{\partial \ln L}{\partial x} \right]^2 L\, dZ^k } - b^2(x) $$
38
The Cramér-Rao Lower Bound on the Variance of the Estimator (continue – 4)

From $\int L(Z^k, x)\, dZ^k = 1$:

$$ \int \frac{\partial \ln L}{\partial x}\, L\, dZ^k = \int \frac{\partial L}{\partial x}\, dZ^k = \frac{\partial}{\partial x} \int L\, dZ^k = 0 $$

Differentiating once more with respect to x:

$$ \int \frac{\partial^2 \ln L}{\partial x^2}\, L\, dZ^k + \int \frac{\partial \ln L}{\partial x} \frac{\partial \ln L}{\partial x}\, L\, dZ^k = 0 \;\Rightarrow\; E\left[ \left( \frac{\partial \ln L}{\partial x} \right)^2 \right] = - E\left[ \frac{\partial^2 \ln L}{\partial x^2} \right] $$

Therefore

$$ \sigma_{\hat{x}}^2 \;\ge\; \frac{ \left[ 1 + \dfrac{\partial b(x)}{\partial x} \right]^2 }{ E\left[ \left( \dfrac{\partial \ln L}{\partial x} \right)^2 \right] } - b^2(x) = \frac{ \left[ 1 + \dfrac{\partial b(x)}{\partial x} \right]^2 }{ - E\left[ \dfrac{\partial^2 \ln L}{\partial x^2} \right] } - b^2(x) $$
39
The Cramér-Rao Lower Bound on the Variance of the Estimator (continue – 5)

$$ \sigma_{\hat{x}}^2 \;\ge\; \frac{ \left[ 1 + \dfrac{\partial b(x)}{\partial x} \right]^2 }{ E\left[ \left( \dfrac{\partial \ln L}{\partial x} \right)^2 \right] } - b^2(x) = \frac{ \left[ 1 + \dfrac{\partial b(x)}{\partial x} \right]^2 }{ - E\left[ \dfrac{\partial^2 \ln L}{\partial x^2} \right] } - b^2(x) $$

For an unbiased estimator (b(x) = 0), we have:

$$ \sigma_{\hat{x}}^2 \;\ge\; \frac{1}{ E\left[ \left( \dfrac{\partial \ln L(Z^k, x)}{\partial x} \right)^2 \right] } = \frac{-1}{ E\left[ \dfrac{\partial^2 \ln L(Z^k, x)}{\partial x^2} \right] } $$
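For k independent measurements $z_j \sim N(x, \sigma^2)$, the score is $\sum_j (z_j - x)/\sigma^2$, so $E[(\partial \ln L / \partial x)^2] = k/\sigma^2$ and the CRLB is $\sigma^2 / k$. The sample mean is unbiased with exactly this variance, so it attains the bound. A minimal simulation sketch, with assumed values for x, σ and k:

```python
import random

# CRLB check for z_j ~ N(x, sigma^2), j = 1..k: bound = sigma^2 / k.
random.seed(1)
x_true, sigma, k = 2.0, 1.0, 25
crlb = sigma**2 / k                       # 0.04

# Empirical variance of the sample-mean estimator over many experiments.
trials = 5000
means = [sum(random.gauss(x_true, sigma) for _ in range(k)) / k
         for _ in range(trials)]
m = sum(means) / trials
var = sum((mj - m)**2 for mj in means) / trials
print(abs(var - crlb) < 0.005)            # empirical variance ~ CRLB
```

The check prints True: the empirical variance of the sample mean matches the Cramér-Rao bound to within sampling error.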
40
The Cramér-Rao Lower Bound on the Variance of the Estimator (continue – 6)

The multivariable form of the Cramér-Rao Lower Bound is:

$$ E\left\{ \left[ \hat{x}(Z^k) - x \right] \left[ \hat{x}(Z^k) - x \right]^T \right\} \;\ge\; \left[ I + \frac{\partial b(x)}{\partial x} \right] \mathbf{J}^{-1} \left[ I + \frac{\partial b(x)}{\partial x} \right]^T $$

where

$$ \hat{x}(Z^k) - x = \begin{bmatrix} \hat{x}_1(Z^k) - x_1 \\ \vdots \\ \hat{x}_n(Z^k) - x_n \end{bmatrix}, \qquad \nabla_x \ln L(Z^k, x) = \frac{\partial \ln L(Z^k, x)}{\partial x} = \begin{bmatrix} \partial \ln L(Z^k, x) / \partial x_1 \\ \vdots \\ \partial \ln L(Z^k, x) / \partial x_n \end{bmatrix} $$

and J is the Fisher Information Matrix:

$$ \mathbf{J} := E\left[ \frac{\partial \ln L(Z^k, x)}{\partial x}\, \frac{\partial \ln L(Z^k, x)}{\partial x^T} \right] = - E\left[ \frac{\partial^2 \ln L(Z^k, x)}{\partial x\, \partial x^T} \right] $$

Fisher, Sir Ronald Aylmer 1890 – 1962
41
Fisher, Sir Ronald Aylmer (1890–1962)

The Fisher information is the amount of information that an observable random variable z carries about an unknown parameter x upon which the likelihood of z, L(x) = f(Z; x), depends. The likelihood function is the joint probability of the data, the Zs, conditional on the value of x, as a function of x. Since the expectation of the score is zero, the variance is simply the second moment of the score, the derivative of the log of the likelihood function with respect to x. Hence the Fisher information can be written:

$$ \mathbf{J}(x) := E_x\left[ \nabla_x \ln L(Z^k, x)\, \left( \nabla_x \ln L(Z^k, x) \right)^T \right] = - E_x\left[ \nabla_x \nabla_x^T \ln L(Z^k, x) \right] $$

Table of Content
42
Kalman Filter Discrete Case

Assume a discrete dynamic system:

$$ x_k = \Phi_{k-1} x_{k-1} + G_{k-1} u_{k-1} + \Gamma_{k-1} w_{k-1} $$
$$ z_k = H_k x_k + v_k $$

with

$$ e_x(k) := x(k) - E[x(k)], \qquad E\left[ e_x(k)\, e_x^T(k) \right] = P(k) $$
$$ e_w(k) := w(k) - \underbrace{E[w(k)]}_{0}, \qquad E\left[ e_w(k)\, e_w^T(l) \right] = Q(k)\, \delta_{k,l} $$
$$ e_v(k) := v(k) - \underbrace{E[v(k)]}_{0}, \qquad E\left[ e_v(k)\, e_v^T(l) \right] = R(k)\, \delta_{k,l} $$
$$ E\left[ e_w(k)\, e_v^T(l) \right] = M(k)\, \delta_{k,l-1}, \qquad \delta_{k,l} = \begin{cases} 1 & k = l \\ 0 & k \ne l \end{cases} $$

Let us find a Linear Filter that works in two stages:

1. One-step prediction, before the measurement $z_k$, based on the estimate at step k-1:
$$ \hat{x}_{k|k-1} = \Phi_{k-1} \hat{x}_{k-1|k-1} + G_{k-1} u_{k-1} $$

2. Update after the measurement $z_k$ is received:
$$ \hat{x}_{k|k} = K_k' \hat{x}_{k|k-1} + K_k z_k $$

such that, by choosing the optimal gains $K_k$ and $K_k'$, it will minimize

$$ J_k = E\left[ \left( \hat{x}_{k|k} - x_k \right)^T \left( \hat{x}_{k|k} - x_k \right) \right] = E\left[ \tilde{x}_{k|k}^T \tilde{x}_{k|k} \right], \qquad \tilde{x}_{k|k} := \hat{x}_{k|k} - x_k $$

subject to the Unbiased Estimator condition $E[\hat{x}_{k|k}] = E[x_k]$, i.e., $E[\tilde{x}_{k|k}] = E[\hat{x}_{k|k}] - E[x_k] = 0$.
43
Kalman Filter Discrete Case (continue – 1)

Define

$$ \tilde{x}_{k|k-1} := \hat{x}_{k|k-1} - x_k, \qquad \tilde{x}_{k|k} := \hat{x}_{k|k} - x_k $$

The Linear Estimator we want is $\hat{x}_{k|k} = K_k' \hat{x}_{k|k-1} + K_k z_k$. Therefore

$$ \tilde{x}_{k|k} = K_k' \left( \tilde{x}_{k|k-1} + x_k \right) + K_k \left( H_k x_k + v_k \right) - x_k = \left( K_k' + K_k H_k - I \right) x_k + K_k' \tilde{x}_{k|k-1} + K_k v_k $$

The unbiasedness conditions $E[\tilde{x}_{k|k}] = E[\tilde{x}_{k|k-1}] = 0$ give:

$$ E[\tilde{x}_{k|k}] = \left( K_k' + K_k H_k - I \right) E[x_k] + K_k' \underbrace{E[\tilde{x}_{k|k-1}]}_{0} + K_k \underbrace{E[v_k]}_{0} = 0 $$

or:

$$ K_k' = I - K_k H_k $$

Therefore the Unbiased Linear Estimator is:

$$ \hat{x}_{k|k} = \hat{x}_{k|k-1} + K_k \left[ z_k - H_k \hat{x}_{k|k-1} \right] $$
44
Kalman Filter Discrete Case (continue – 2)

The discrete dynamic system: $x_k = \Phi_{k-1} x_{k-1} + G_{k-1} u_{k-1} + \Gamma_{k-1} w_{k-1}$, $z_k = H_k x_k + v_k$.

The Linear Filter (Linear Observer):

$$ \hat{x}_{k|k-1} = \Phi_{k-1} \hat{x}_{k-1|k-1} + G_{k-1} u_{k-1} $$
$$ \hat{x}_{k|k} = \hat{x}_{k|k-1} + K_k \left[ z_k - H_k \hat{x}_{k|k-1} \right] $$

The prediction error is

$$ \tilde{x}_{k|k-1} := \hat{x}_{k|k-1} - x_k = \Phi_{k-1} \tilde{x}_{k-1|k-1} - \Gamma_{k-1} w_{k-1} $$

Using $E[\tilde{x}_{k|k}] = E[\tilde{x}_{k|k-1}] = 0$, $E[v_k] = E[w_k] = 0$, and $E\left[ \tilde{x}_{k-1|k-1} w_{k-1}^T \right] = 0$:

$$ P_{k|k-1} := E\left[ \tilde{x}_{k|k-1} \tilde{x}_{k|k-1}^T \right] = E\left[ \left( \Phi_{k-1} \tilde{x}_{k-1|k-1} - \Gamma_{k-1} w_{k-1} \right) \left( \Phi_{k-1} \tilde{x}_{k-1|k-1} - \Gamma_{k-1} w_{k-1} \right)^T \right] = \Phi_{k-1} P_{k-1|k-1} \Phi_{k-1}^T + \Gamma_{k-1} Q_{k-1} \Gamma_{k-1}^T $$

and, for the cross-correlation with the measurement noise,

$$ E\left[ \tilde{x}_{k|k-1} v_k^T \right] = E\left[ \left( \Phi_{k-1} \tilde{x}_{k-1|k-1} - \Gamma_{k-1} w_{k-1} \right) v_k^T \right] = -\Gamma_{k-1} \underbrace{ E\left[ w_{k-1} v_k^T \right] }_{M_{k-1}} = -\Gamma_{k-1} M_{k-1} $$
45
Kalman Filter Discrete Case (continue – 3)

The update error is

$$ \tilde{x}_{k|k} = \hat{x}_{k|k} - x_k = \tilde{x}_{k|k-1} + K_k \left( v_k - H_k \tilde{x}_{k|k-1} \right) = \left( I - K_k H_k \right) \tilde{x}_{k|k-1} + K_k v_k $$

Using $E\left[ \tilde{x}_{k|k-1} v_k^T \right] = -\Gamma_{k-1} M_{k-1}$:

$$ P_{k|k} := E\left[ \tilde{x}_{k|k} \tilde{x}_{k|k}^T \right] = \left( I - K_k H_k \right) \underbrace{ E\left[ \tilde{x}_{k|k-1} \tilde{x}_{k|k-1}^T \right] }_{P_{k|k-1}} \left( I - K_k H_k \right)^T + K_k \underbrace{ E\left[ v_k v_k^T \right] }_{R_k} K_k^T - \left( I - K_k H_k \right) \Gamma_{k-1} M_{k-1} K_k^T - K_k M_{k-1}^T \Gamma_{k-1}^T \left( I - K_k H_k \right)^T $$
46
Kalman Filter Discrete Case (continue – 4)

$$ P_{k|k} = \left( I - K_k H_k \right) P_{k|k-1} \left( I - K_k H_k \right)^T + K_k R_k K_k^T - \left( I - K_k H_k \right) \Gamma_{k-1} M_{k-1} K_k^T - K_k M_{k-1}^T \Gamma_{k-1}^T \left( I - K_k H_k \right)^T $$

This is the Joseph Form (true for all $K_k$). Expanding and collecting terms:

$$ P_{k|k} = P_{k|k-1} - K_k \left( H_k P_{k|k-1} + M_{k-1}^T \Gamma_{k-1}^T \right) - \left( P_{k|k-1} H_k^T + \Gamma_{k-1} M_{k-1} \right) K_k^T + K_k \left( R_k + H_k \Gamma_{k-1} M_{k-1} + M_{k-1}^T \Gamma_{k-1}^T H_k^T + H_k P_{k|k-1} H_k^T \right) K_k^T $$

which can be written compactly as

$$ P_{k|k} = \begin{bmatrix} I & K_k \end{bmatrix} \begin{bmatrix} A & -B \\ -B^T & C \end{bmatrix} \begin{bmatrix} I \\ K_k^T \end{bmatrix} $$

with

$$ A := P_{k|k-1}, \qquad B := P_{k|k-1} H_k^T + \Gamma_{k-1} M_{k-1}, \qquad C := R_k + H_k \Gamma_{k-1} M_{k-1} + M_{k-1}^T \Gamma_{k-1}^T H_k^T + H_k P_{k|k-1} H_k^T $$
47
Kalman Filter Discrete Case (continue – 5)

$$ \min_{K_k} J_k = \min_{K_k} E\left[ \tilde{x}_{k|k}^T \tilde{x}_{k|k} \right] = \min_{K_k} \mathrm{trace}\, E\left[ \tilde{x}_{k|k} \tilde{x}_{k|k}^T \right] = \min_{K_k} \mathrm{trace}\, P_{k|k} $$

Completion of Squares. Use the matrix identity:

$$ \begin{bmatrix} A & -B \\ -B^T & C \end{bmatrix} = \begin{bmatrix} I & -B C^{-1} \\ 0 & I \end{bmatrix} \begin{bmatrix} A - B C^{-1} B^T & 0 \\ 0 & C \end{bmatrix} \begin{bmatrix} I & 0 \\ -C^{-1} B^T & I \end{bmatrix} $$

to obtain

$$ P_{k|k} = \begin{bmatrix} I & K_k - B C^{-1} \end{bmatrix} \begin{bmatrix} \Delta & 0 \\ 0 & C \end{bmatrix} \begin{bmatrix} I \\ \left( K_k - B C^{-1} \right)^T \end{bmatrix} = \Delta + \left( K_k - B C^{-1} \right) C \left( K_k - B C^{-1} \right)^T $$

where

$$ \Delta := A - B C^{-1} B^T = P_{k|k-1} - \left( P_{k|k-1} H_k^T + \Gamma_{k-1} M_{k-1} \right) \left( R_k + H_k \Gamma_{k-1} M_{k-1} + M_{k-1}^T \Gamma_{k-1}^T H_k^T + H_k P_{k|k-1} H_k^T \right)^{-1} \left( H_k P_{k|k-1} + M_{k-1}^T \Gamma_{k-1}^T \right) $$
48
Kalman Filter Discrete Case (continue – 6)

To obtain the optimal $K_k$ that minimizes $J_k = \mathrm{trace}\, P_{k|k}$ we perform

$$ \frac{\partial J_k}{\partial K_k} = \frac{\partial}{\partial K_k} \mathrm{trace}\, P_{k|k} = \frac{\partial}{\partial K_k} \mathrm{trace}\left[ \Delta + \left( K_k - B C^{-1} \right) C \left( K_k - B C^{-1} \right)^T \right] = 0 $$

Using the matrix equation (see next slide)

$$ \frac{\partial}{\partial A} \mathrm{trace}\left( A B A^T \right) = A \left( B + B^T \right) $$

we obtain

$$ \frac{\partial J_k}{\partial K_k} = \left( K_k^* - B C^{-1} \right) \left( C + C^T \right) = 0 $$

or the Kalman Filter Gain:

$$ K_k^* = B C^{-1} = \left( P_{k|k-1} H_k^T + \Gamma_{k-1} M_{k-1} \right) \left( R_k + H_k \Gamma_{k-1} M_{k-1} + M_{k-1}^T \Gamma_{k-1}^T H_k^T + H_k P_{k|k-1} H_k^T \right)^{-1} $$

The second derivative

$$ \frac{\partial^2 J_k}{\partial K_k^2} = C + C^T = 2 \left( R_k + H_k \Gamma_{k-1} M_{k-1} + M_{k-1}^T \Gamma_{k-1}^T H_k^T + H_k P_{k|k-1} H_k^T \right) > 0 $$

confirms a minimum, and at the optimum

$$ \min_{K_k} J_k = \mathrm{trace}\, \Delta = \mathrm{trace}\left[ P_{k|k-1} - K_k^* \left( H_k P_{k|k-1} + M_{k-1}^T \Gamma_{k-1}^T \right) \right] $$
49
Matrices

Differentiation of the Trace of a square matrix

$$ \mathrm{trace}\left( A B A^T \right) = \sum_l \sum_p \sum_k a_{lp}\, b_{pk}\, a_{lk} $$

$$ \left[ \frac{\partial}{\partial A} \mathrm{trace}\left( A B A^T \right) \right]_{ij} = \frac{\partial}{\partial a_{ij}} \mathrm{trace}\left( A B A^T \right) = \sum_k a_{ik}\, b_{jk} + \sum_p a_{ip}\, b_{pj} $$

$$ \frac{\partial}{\partial A} \mathrm{trace}\left( A B A^T \right) = A B^T + A B = A \left( B + B^T \right) $$
50
Estimators
Kalman Filter Discrete Case (continue – 7)

When $M_{k-1} = 0$ (where $E\left[ e_w(k)\, e_v^T(l) \right] = M(k)\, \delta_{k,l-1}$), the optimal $K_k$ that minimizes $J_k$ is

$$ K_k^* = P_{k|k-1} H_k^T \left( R_k + H_k P_{k|k-1} H_k^T \right)^{-1} $$

$$ P_{k|k} = \left( I - K_k^* H_k \right) P_{k|k-1} \left( I - K_k^* H_k \right)^T + K_k^* R_k K_k^{*T} $$

and, by the Matrix Inversion Lemma (when $R_k^{-1}$ and $P_{k|k-1}^{-1}$ exist):

$$ P_{k|k} = P_{k|k-1} - P_{k|k-1} H_k^T \left( R_k + H_k P_{k|k-1} H_k^T \right)^{-1} H_k P_{k|k-1} = \left( I - K_k^* H_k \right) P_{k|k-1} = \left( P_{k|k-1}^{-1} + H_k^T R_k^{-1} H_k \right)^{-1} $$
51

Kalman Filter Discrete Case (continue – 8)

We found that the optimal K_k that minimizes J_k (when M_{k-1} = 0) is

  K_k^* = P_{k|k-1} H_k^T [H_k P_{k|k-1} H_k^T + R_k]^{-1}

If R_k^{-1} and P_{k|k-1}^{-1} exist, the Matrix Inversion Lemma gives

  [H_k P_{k|k-1} H_k^T + R_k]^{-1} = R_k^{-1} - R_k^{-1} H_k [P_{k|k-1}^{-1} + H_k^T R_k^{-1} H_k]^{-1} H_k^T R_k^{-1}

so that

  K_k^* = P_{k|k-1} H_k^T R_k^{-1} - P_{k|k-1} H_k^T R_k^{-1} H_k [P_{k|k-1}^{-1} + H_k^T R_k^{-1} H_k]^{-1} H_k^T R_k^{-1}
        = [P_{k|k-1}^{-1} + H_k^T R_k^{-1} H_k]^{-1} H_k^T R_k^{-1} = P_{k|k} H_k^T R_k^{-1}

Table of Content
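The two gain expressions can be checked against each other numerically. A sketch (my own illustration, not part of the presentation) with random symmetric positive-definite P_{k|k-1} and R_k:

```python
import numpy as np

rng = np.random.default_rng(1)
n, m = 4, 2
A = rng.standard_normal((n, n))
P = A @ A.T + n * np.eye(n)      # prior covariance P_{k|k-1}, SPD
Bm = rng.standard_normal((m, m))
R = Bm @ Bm.T + m * np.eye(m)    # measurement noise covariance R_k, SPD
H = rng.standard_normal((m, n))  # measurement matrix H_k

# standard form: K = P H^T (H P H^T + R)^-1
K_std = P @ H.T @ np.linalg.inv(H @ P @ H.T + R)

# information form: K = (P^-1 + H^T R^-1 H)^-1 H^T R^-1
Ri = np.linalg.inv(R)
K_inf = np.linalg.inv(np.linalg.inv(P) + H.T @ Ri @ H) @ H.T @ Ri
```

Both forms yield the same gain; the information form is preferred when R is easy to invert (e.g. diagonal) and the state dimension is small.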
52

Kalman Filter Discrete Case (continue – 9)

Properties of the Kalman Filter:  E[x̂_{k|k} x̃_{k|k}^T] = 0

Proof (by induction):

k = 1:
  x_1 = Φ_0 x_0 + G_0 u_0 + Γ_0 w_0,   z_1 = H_1 x_1 + v_1,   x̂_{0|0} = E[x_0]

  x̂_{1|1} = Φ_0 x̂_{0|0} + G_0 u_0 + K_1 (z_1 - H_1 (Φ_0 x̂_{0|0} + G_0 u_0))
          = Φ_0 x̂_{0|0} + G_0 u_0 + K_1 (H_1 Φ_0 x̃_{0|0} + H_1 Γ_0 w_0 + v_1)

  x̃_{1|1} = x_1 - x̂_{1|1} = (I - K_1 H_1)(Φ_0 x̃_{0|0} + Γ_0 w_0) - K_1 v_1

53

Kalman Filter Discrete Case (continue – 10)

Properties of the Discrete Kalman Filter:  E[x̂_{k|k} x̃_{k|k}^T] = 0

Proof (by induction) (continue – 1):

k = 1: since x̃_{0|0}, w_0, v_1 are zero-mean and mutually uncorrelated, with P_{1|0} = Φ_0 P_{0|0} Φ_0^T + Γ_0 Q_0 Γ_0^T,

  E[x̂_{1|1} x̃_{1|1}^T] = K_1 H_1 P_{1|0} (I - K_1 H_1)^T - K_1 R_1 K_1^T
      = K_1 [H_1 P_{1|0} - (H_1 P_{1|0} H_1^T + R_1) K_1^T]
      = 0    since  K_1 = P_{1|0} H_1^T (H_1 P_{1|0} H_1^T + R_1)^{-1}

In the same way we continue for k > 1, and by induction we prove the result.

Table of Content
54

Kalman Filter Discrete Case (continue – 9)

Properties of the Kalman Filter:  E[x̃_{k|k} z_j^T] = 0,   j = 1, …, k-1

Proof: with  z_j = H_j x_j + v_j  and  x̂_{k|k} = x̂_{k|k-1} + K_k (z_k - H_k x̂_{k|k-1}),

  x̃_{k|k} := x_k - x̂_{k|k} = x_k - x̂_{k|k-1} - K_k (H_k x_k + v_k - H_k x̂_{k|k-1})
           = (I - K_k H_k) x̃_{k|k-1} - K_k v_k

  E[x̃_{k|k} z_j^T] = E[((I - K_k H_k) x̃_{k|k-1} - K_k v_k)(H_j x_j + v_j)^T]
      = (I - K_k H_k) E[x̃_{k|k-1} x_j^T] H_j^T + (I - K_k H_k) E[x̃_{k|k-1} v_j^T]
        - K_k E[v_k x_j^T] H_j^T - K_k E[v_k v_j^T]

For k > j the noise terms vanish: E[x̃_{k|k-1} v_j^T] → 0, E[v_k x_j^T] = 0, E[v_k v_j^T] = R_k δ_{k,j} = 0. Hence

  E[x̃_{k|k} z_j^T] = (I - K_k H_k) E[x̃_{k|k-1} z_j^T] - K_k E[v_k z_j^T] = 0,   k > j

by the orthogonality of the prediction error to past measurements.
55

Kalman Filter Discrete Case – Innovation

Assume a discrete dynamic system

  x_k = Φ_{k-1} x_{k-1} + G_{k-1} u_{k-1} + Γ_{k-1} w_{k-1}
  z_k = H_k x_k + v_k

  e_x(k) := x(k) - E[x(k)],   E[e_x(k) e_x^T(k)] = P_x(k)
  e_w(k) := w(k) - E[w(k)],   E[e_w(k) e_w^T(l)] = Q_k δ_{k,l}
  e_v(k) := v(k) - E[v(k)],   E[e_v(k) e_v^T(l)] = R_k δ_{k,l}
  E[e_w(k) e_v^T(l)] = 0  ∀ k, l        δ_{k,l} = { 1 if k = l, 0 if k ≠ l }

The Innovation is defined as:

  ι_k := z_k - ẑ_{k|k-1} = H_k x̃_{k|k-1} + v_k

The Linear Filter (Linear Observer):

  x̂_{k|k-1} = Φ_{k-1} x̂_{k-1|k-1} + G_{k-1} u_{k-1}
  x̂_{k|k} = x̂_{k|k-1} + K_k (z_k - H_k x̂_{k|k-1})

  x̃_{k|k-1} := x_k - x̂_{k|k-1} = Φ_{k-1} x̃_{k-1|k-1} + Γ_{k-1} w_{k-1}

  E[ι_k] = H_k E[x̃_{k|k-1}] + E[v_k] = 0

with the Kalman Filter gain

  K_k^{F.K.} := P_{k|k-1} H_k^T (H_k P_{k|k-1} H_k^T + R_k)^{-1}

Properties of the Discrete Kalman Filter (2)
56

Kalman Filter Discrete Case – Innovation (continue – 1)

Define

  F_i := Φ_i (I - K_i H_i),   F_{i,j} := F_{i-1} F_{i-2} ⋯ F_j,   F_{i,i} := I

Then, since  x̃_{i|i} = (I - K_i H_i) x̃_{i|i-1} - K_i v_i,

  x̃_{i+1|i} = Φ_i x̃_{i|i} + Γ_i w_i = F_i x̃_{i|i-1} - Φ_i K_i v_i + Γ_i w_i

and, iterating (assume i > j):

  x̃_{i+1|i} = F_{i+1,j+1} x̃_{j+1|j} + Σ_{k=j+1}^{i} F_{i+1,k+1} (Γ_k w_k - Φ_k K_k v_k)

Therefore, for i > j, since x̃_{j+1|j} is uncorrelated with the later noises w_k, v_k (k ≥ j+1):

  E[x̃_{i+1|i} x̃_{j+1|j}^T] = F_{i+1,j+1} E[x̃_{j+1|j} x̃_{j+1|j}^T] = F_{i+1,j+1} P_{j+1|j}

The innovations satisfy  ι_i = H_i x̃_{i|i-1} + v_i,  so

  E[ι_i ι_j^T] = H_i E[x̃_{i|i-1} x̃_{j|j-1}^T] H_j^T + H_i E[x̃_{i|i-1} v_j^T]
               + E[v_i x̃_{j|j-1}^T] H_j^T + E[v_i v_j^T]
57

Kalman Filter Discrete Case – Innovation (continue – 2)

Assume i > j. From

  x̃_{i+1|i} = F_{i+1,j+1} x̃_{j+1|j} + Σ_{k=j+1}^{i} F_{i+1,k+1} (Γ_k w_k - Φ_k K_k v_k)

with  E[v_k v_{j+1}^T] = R_{j+1} δ_{k,j+1},  E[w_k v_{j+1}^T] = 0,  E[x̃_{j+1|j} v_{j+1}^T] = 0
(x̃_{j+1|j} depends only on noises up to time j), only the k = j+1 term survives:

  E[x̃_{i+1|i} v_{j+1}^T] = F_{i+1,j+1} E[x̃_{j+1|j} v_{j+1}^T]
        + Σ_{k=j+1}^{i} F_{i+1,k+1} (Γ_k E[w_k v_{j+1}^T] - Φ_k K_k E[v_k v_{j+1}^T])
      = - F_{i+1,j+2} Φ_{j+1} K_{j+1} R_{j+1}

Also  E[v_{i+1} x̃_{j+1|j}^T] = 0  and  E[v_{i+1} v_{j+1}^T] = R_{j+1} δ_{i+1,j+1},  so the innovation cross-covariance is

  E[ι_{i+1} ι_{j+1}^T] = H_{i+1} E[x̃_{i+1|i} x̃_{j+1|j}^T] H_{j+1}^T + H_{i+1} E[x̃_{i+1|i} v_{j+1}^T]
        + E[v_{i+1} x̃_{j+1|j}^T] H_{j+1}^T + E[v_{i+1} v_{j+1}^T]
58

Kalman Filter Discrete Case – Innovation (continue – 3)

Assume i > j. Collecting the terms (the δ_{i+1,j+1} term vanishes for i > j):

  E[ι_{i+1} ι_{j+1}^T] = H_{i+1} F_{i+1,j+1} P_{j+1|j} H_{j+1}^T - H_{i+1} F_{i+1,j+2} Φ_{j+1} K_{j+1} R_{j+1} + R_{j+1} δ_{i+1,j+1}

Using  F_{i+1,j+1} = F_{i+1,j+2} F_{j+1} = F_{i+1,j+2} Φ_{j+1} (I - K_{j+1} H_{j+1}):

  E[ι_{i+1} ι_{j+1}^T] = H_{i+1} F_{i+1,j+2} Φ_{j+1} [(I - K_{j+1} H_{j+1}) P_{j+1|j} H_{j+1}^T - K_{j+1} R_{j+1}]
                       = H_{i+1} F_{i+1,j+2} Φ_{j+1} [P_{j+1|j} H_{j+1}^T - K_{j+1} (H_{j+1} P_{j+1|j} H_{j+1}^T + R_{j+1})]

If K_{j+1} is the Kalman gain,

  K_{j+1} = K_{j+1}^{F.K.} = P_{j+1|j} H_{j+1}^T [H_{j+1} P_{j+1|j} H_{j+1}^T + R_{j+1}]^{-1}

the bracket vanishes, hence for i > j

  E[ι_{i+1} ι_{j+1}^T] = 0    and    E[ι_{i+1}] = 0

Innovation = White Noise for the Kalman Filter Gain!!!

Table of Content
59

Kalman Filter – State Estimation in a Linear System (one cycle)

State vector prediction:          x̂_{k|k-1} = Φ_{k-1} x̂_{k-1|k-1} + G_{k-1} u_{k-1}
Covariance matrix extrapolation:  P_{k|k-1} = Φ_{k-1} P_{k-1|k-1} Φ_{k-1}^T + Q_{k-1}
Innovation:                       i_k = z_k - ẑ_{k|k-1} = z_k - H_k x̂_{k|k-1}
Innovation covariance:            S_k = H_k P_{k|k-1} H_k^T + R_k
Gain matrix computation:          K_k = P_{k|k-1} H_k^T S_k^{-1}
Filtering:                        x̂_{k|k} = x̂_{k|k-1} + K_k i_k
Covariance matrix updating:
  P_{k|k} = P_{k|k-1} - K_k S_k K_k^T
          = (I - K_k H_k) P_{k|k-1}
          = (I - K_k H_k) P_{k|k-1} (I - K_k H_k)^T + K_k R_k K_k^T

k = k + 1
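The cycle above translates directly into code. A minimal sketch (function and variable names are my own, not from the presentation), using the Joseph form for the covariance update because it preserves symmetry numerically:

```python
import numpy as np

def kf_cycle(x, P, z, Phi, G, u, Q, H, R):
    """One predict/update cycle of the discrete Kalman filter."""
    # state and covariance prediction
    x_pred = Phi @ x + G @ u
    P_pred = Phi @ P @ Phi.T + Q
    # innovation and its covariance
    i = z - H @ x_pred
    S = H @ P_pred @ H.T + R
    # gain, filtering, covariance update (Joseph form)
    K = P_pred @ H.T @ np.linalg.inv(S)
    x_new = x_pred + K @ i
    I_KH = np.eye(len(x)) - K @ H
    P_new = I_KH @ P_pred @ I_KH.T + K @ R @ K.T
    return x_new, P_new, i, S
```

For a constant-velocity target with a position measurement, Φ = [[1, T], [0, 1]] and H = [[1, 0]] give the familiar position/velocity filter of the later slides.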
60

Kalman Filter – State Estimation in a Linear System (one cycle)

[Figure: generic tracking-system block diagram – input data, sensor data processing and measurement formation, observation-to-track association, track maintenance (initialization, confirmation and deletion), filtering and prediction, gating computations]

Samuel S. Blackman, "Multiple-Target Tracking with Radar Applications", Artech House, 1986
Samuel S. Blackman, Robert Popoli, "Design and Analysis of Modern Tracking Systems", Artech House, 1999

Rudolf E. Kalman (1920 - )
61

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 18)

Innovation

The innovation is the quantity:  i_k := z_k - H_k x̂_{k|k-1} = z_k - ẑ_{k|k-1}

We found that:

  E[i_k | Z_{1:k-1}] = E[z_k - ẑ_{k|k-1} | Z_{1:k-1}] = E[z_k | Z_{1:k-1}] - ẑ_{k|k-1} = 0

  E[(z_k - ẑ_{k|k-1})(z_k - ẑ_{k|k-1})^T | Z_{1:k-1}] = E[i_k i_k^T | Z_{1:k-1}] = R_k + H_k P_{k|k-1} H_k^T =: S_k

Using the smoothing property of the expectation:

  E_Y[ E_{X|Y}[x|y] ] = ∫ ( ∫ x p_{X|Y}(x|y) dx ) p_Y(y) dy = ∫∫ x p_{X,Y}(x,y) dx dy = ∫ x p_X(x) dx = E[x]

we have:

  E[i_k i_j^T] = E[ E[i_k i_j^T | Z_{1:k-1}] ]

Assuming, without loss of generality, that k-1 ≥ j, the innovation i_j is determined by Z_{1:k-1} and can be taken outside the inner expectation:

  E[i_k i_j^T] = E[ E[i_k | Z_{1:k-1}] i_j^T ] = E[ 0 · i_j^T ] = 0
62

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 18)

Innovation (continue – 1)

The innovation is the quantity:  i_k := z_k - H_k x̂_{k|k-1} = z_k - ẑ_{k|k-1}

We found that:

  E[i_k | Z_{1:k-1}] = 0,   E[i_k i_k^T | Z_{1:k-1}] = R_k + H_k P_{k|k-1} H_k^T =: S_k,   E[i_k i_j^T] = 0 (k ≠ j)

hence  E[i_k i_j^T] = S_k δ_{k,j}.

Thus the innovation sequence is zero-mean and white for the Kalman (Optimal) Filter. The uncorrelatedness property implies that, since the innovations are Gaussian, they are independent of each other and the innovation sequence is strictly white. Without the Gaussian assumption, the innovation sequence is white in the wide sense.

The innovation for the Kalman (Optimal) Filter extracts all the available information from the measurement, leaving only zero-mean white noise in the measurement residual.
63

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 19)

Innovation (continue – 2)

Define the quantity:  χ²_{n_z} := i_k^T S_k^{-1} i_k

Since S_k is symmetric and positive definite, it can be written as:

  S_k = T_k D_S T_k^H,   T_k T_k^H = I,   D_S = diag(λ_1, …, λ_{n_z}),   λ_i > 0
  S_k^{-1} = T_k D_S^{-1} T_k^H,   S_k^{-1/2} = T_k D_S^{-1/2} T_k^H,   D_S^{-1/2} = diag(λ_1^{-1/2}, …, λ_{n_z}^{-1/2})

Let  u_k := S_k^{-1/2} i_k.  Since i_k is Gaussian, u_k (a linear combination of the n_z components of i_k) is Gaussian too, with:

  E[u_k] = S_k^{-1/2} E[i_k] = 0
  E[u_k u_k^T] = E[S_k^{-1/2} i_k i_k^T S_k^{-1/2}] = S_k^{-1/2} E[i_k i_k^T] S_k^{-1/2} = I_{n_z}

where I_{n_z} is the identity matrix of size n_z. Therefore, since the covariance matrix of u is diagonal, its components u_i are uncorrelated and, since they are jointly Gaussian, they are also independent:

  χ²_{n_z} = i_k^T S_k^{-1} i_k = u_k^T u_k = Σ_{i=1}^{n_z} u_i²,   Pr{u_i} = N(u_i; 0, 1)

Therefore χ²_{n_z} is chi-square distributed with n_z degrees of freedom.
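This distributional claim is easy to verify by simulation. A sketch (the covariance values are my own illustration, not from the text) drawing innovations from N(0, S) and comparing the sample moments of iᵀS⁻¹i with the chi-square values (mean n_z, variance 2 n_z):

```python
import numpy as np

rng = np.random.default_rng(2)
nz, N = 2, 200_000
A = rng.standard_normal((nz, nz))
S = A @ A.T + nz * np.eye(nz)          # an arbitrary SPD innovation covariance
L = np.linalg.cholesky(S)
iv = L @ rng.standard_normal((nz, N))  # N innovation samples ~ N(0, S)

Sinv = np.linalg.inv(S)
q = np.sum(iv * (Sinv @ iv), axis=0)   # q_j = i_j^T S^{-1} i_j
```

The sample mean of q should be close to n_z = 2 and the sample variance close to 2 n_z = 4, as the chi-square derivation on the following slides predicts.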
64

Review of Probability – Chi-square Distribution

Assume an n-dimensional vector x is Gaussian, with mean E[x] and covariance P. Define the (scalar) random variable:

  q := (x - E[x])^T P^{-1} (x - E[x]) = e_x^T P^{-1} e_x

Since P is symmetric and positive definite, it can be written as:

  P = T D_P T^H,   T T^H = I_n,   D_P = diag(λ_1, …, λ_n),   λ_i > 0
  P^{-1} = T D_P^{-1} T^H,   P^{-1/2} = T D_P^{-1/2} T^H,   D_P^{-1/2} = diag(λ_1^{-1/2}, …, λ_n^{-1/2})

Let  u := P^{-1/2} (x - E[x]) = P^{-1/2} e_x.  Since x is Gaussian, u (a linear combination of the n components of x - E[x]) is Gaussian too, with:

  E[u] = P^{-1/2} E[x - E[x]] = 0
  E[u u^T] = E[P^{-1/2} e_x e_x^T P^{-1/2}] = P^{-1/2} E[e_x e_x^T] P^{-1/2} = I_n

where I_n is the identity matrix of size n. Therefore, since the covariance matrix of u is diagonal, its components u_i are uncorrelated and, since they are jointly Gaussian, they are also independent:

  q = e_x^T P^{-1} e_x = u^T u = Σ_{i=1}^{n} u_i²,   Pr{u_i} = N(u_i; 0, 1)

Therefore q is chi-square distributed with n degrees of freedom.
65

Review of Probability – Derivation of Chi and Chi-square Distributions

Given k normal, independent random variables X_1, X_2, …, X_k with zero mean values and the same variance σ², their joint density is given by

  p_{X_1⋯X_k}(x_1, …, x_k) = Π_{i=1}^{k} (2πσ²)^{-1/2} exp(-x_i²/2σ²)
                            = (2π)^{-k/2} σ^{-k} exp(-(x_1² + ⋯ + x_k²)/2σ²)

Define

  Chi-square:  y = χ_k² := x_1² + ⋯ + x_k² ≥ 0
  Chi:         χ_k := √(x_1² + ⋯ + x_k²) ≥ 0

  p_{Χ_k}(χ_k) dχ_k = Pr{ χ_k ≤ √(x_1² + ⋯ + x_k²) ≤ χ_k + dχ_k }

The region in χ_k space where p_{Χ_k}(χ_k) is constant is a hyper-shell of volume (A to be defined)

  dV = A χ^{k-1} dχ       (e.g. for k = 3:  dV = 4π χ² dχ)

so

  p_{Χ_k}(χ_k) dχ_k = (2π)^{-k/2} σ^{-k} exp(-χ_k²/2σ²) · A χ_k^{k-1} dχ_k

  p_{Χ_k}(χ_k) = (A / ((2π)^{k/2} σ^k)) χ_k^{k-1} exp(-χ_k²/2σ²)

It remains to compute A.
66

Review of Probability – Derivation of Chi and Chi-square Distributions (continue – 1)

  p_{Χ_k}(χ_k) = (A / ((2π)^{k/2} σ^k)) χ_k^{k-1} exp(-χ_k²/2σ²) U(χ_k)

Chi-square:  y = χ_k² ≥ 0. As a function of one random variable, with dy/dχ_k = 2χ_k:

  p_Y(y) = p_{Χ_k}(χ_k) / |dy/dχ_k| = p_{Χ_k}(√y) / (2√y)
         = (A / (2 (2π)^{k/2} σ^k)) y^{k/2-1} exp(-y/2σ²),   y ≥ 0;    p_Y(y) = 0,  y < 0

A is determined from the condition  ∫ p_Y(y) dy = 1:

  ∫_0^∞ (A / (2 (2π)^{k/2} σ^k)) y^{k/2-1} exp(-y/2σ²) dy
      = (A / (2 (2π)^{k/2} σ^k)) (2σ²)^{k/2} Γ(k/2) = 1   →   A = 2 π^{k/2} / Γ(k/2)

where Γ is the gamma function  Γ(a) = ∫_0^∞ t^{a-1} exp(-t) dt.  Hence

  p_Y(y; 1/2σ², k/2) = ( y^{k/2-1} / (2^{k/2} σ^k Γ(k/2)) ) exp(-y/2σ²) U(y)

  p_{Χ_k}(χ_k) = ( 2^{1-k/2} / (σ^k Γ(k/2)) ) χ_k^{k-1} exp(-χ_k²/2σ²) U(χ_k)

  U(a) := { 1 if a ≥ 0, 0 if a < 0 }
67

Review of Probability – Derivation of Chi and Chi-square Distributions (continue – 2)

Chi-square:  y = χ_k² := x_1² + ⋯ + x_k² ≥ 0,  where the x_i are Gaussian with  E[x_i] = 0,  E[x_i²] = σ²,  and (4th moment of a Gauss distribution)  E[x_i⁴] = 3σ⁴.

Mean value:

  E[χ_k²] = E[x_1²] + ⋯ + E[x_k²] = k σ²

Variance:

  E[(χ_k² - kσ²)²] = E[(Σ_i x_i²)²] - (kσ²)²
                   = Σ_i E[x_i⁴] + Σ_{i≠j} E[x_i²] E[x_j²] - k²σ⁴
                   = 3kσ⁴ + k(k-1)σ⁴ - k²σ⁴ = 2kσ⁴

  σ²_{χ_k²} = E[(χ_k² - kσ²)²] = 2kσ⁴
68

Review of Probability – Derivation of Chi and Chi-square Distributions (continue – 3)

[Table: tail probabilities of the chi-square and normal densities]

The table presents the points on the chi-square distribution for a given upper tail probability

  Q = Pr{ y > x }

where y = χ_n² and n is the number of degrees of freedom. This tabulated function is also known as the complementary distribution. An alternative way of writing the previous equation is:

  1 - Q = Pr{ χ_n² ≤ x }

which indicates that at the left of the point x the probability mass is 1 - Q. This is the 100(1 - Q) percentile point.

Examples

1. The 95% probability region for a χ_2² variable can be taken as the one-sided probability region (cutting off the 5% upper tail):  [0, χ_2²(0.95)] = [0, 5.99]

2. Or the two-sided probability region (cutting off both 2.5% tails):  [χ_2²(0.025), χ_2²(0.975)] = [0.051, 7.38]

3. For a χ_100² variable, the two-sided 95% probability region (cutting off both 2.5% tails) is:  [χ_100²(0.025), χ_100²(0.975)] = [74, 130]
69

Review of Probability – Derivation of Chi and Chi-square Distributions (continue – 4)

Note the skewedness of the chi-square distribution: the above two-sided regions are not symmetric about the corresponding means  E[χ_n²] = n.

[Table: tail probabilities of the chi-square and normal densities]

For degrees of freedom above 100, the following approximation of the points on the chi-square distribution can be used:

  χ_n²(1 - Q) = ½ [ G(1 - Q) + √(2n - 1) ]²

where G( ) is given in the last line of the table and shows the point x on the standard (zero-mean, unit-variance) Gaussian distribution for the same tail probabilities: with Pr{y} = N(y; 0, 1) and Q = Pr{y > x}, we have x(1 - Q) := G(1 - Q).
70

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 19)

Innovation (continue – 2)

The fact that the innovation sequence is zero-mean and white for the Kalman (Optimal) Filter is very important and can be used in Tracking Systems:

1. When a single target is detected with probability 1 (no false alarms), the innovation can be used to check Filter Consistency (in fact, the correctness of the Filter Parameters: Φ(k), G(k), H(k) – the target model; Q(k), R(k) – the system and measurement noises).

2. When a single target is detected with probability 1 (no false alarms), and the target initiates an unknown maneuver (changes model) at an unknown time, the innovation can be used to detect the start of the maneuver (change of target model) by detecting a Filter Inconsistency, and to choose from a bank of models (Φ_i(k), G_i(k), H_i(k), i = 1, …, n target models) the one with a white innovation (see IMM method).

3. When a single target is detected with probability less than 1 and false alarms are also detected, the innovation can be used to provide information on the probability of each detection being the real target (providing a Gating capability that eliminates less probable detections) (see PDAF method).

4. When multiple targets are detected with probability less than 1 and false alarms are also detected, the innovation can be used to provide Gating information for each target track and the probability of each detection being related to each track (data association). This is done by running a Kalman Filter for each initiated track (see JPDAF and MTT methods).

Table of Content
71

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 20)

Evaluation of Kalman Filter Consistency

A state estimator (filter) is called consistent if its state estimation errors satisfy

  E[x(k) - x̂(k|k)] = E[x̃(k|k)] = 0
  E[(x(k) - x̂(k|k))(x(k) - x̂(k|k))^T] = E[x̃(k|k) x̃^T(k|k)] = P(k|k)

This is a finite-sample consistency property; that is, the estimation errors based on a finite number of samples (measurements) should be consistent with the theoretical statistical properties:

• Have zero mean (i.e. the estimates are unbiased).
• Have a covariance matrix as calculated by the Filter.

The Consistency Criteria of a Filter are:

1. The state errors should be acceptable as zero-mean, with magnitude commensurate with the state covariance as yielded by the Filter.
2. The innovations should have the same property as in (1).
3. The innovations should be white noise.

Only the last two criteria (based on the innovation) can be tested in real-data applications. The first criterion, which is the most important, can be tested only in simulations.
72

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 21)

Evaluation of Kalman Filter Consistency (continue – 1)

When we design the Kalman Filter, we can perform Monte-Carlo simulations (N independent runs) to check the Filter Consistency (expected performance).

Real time (Single-Run Tests)

In real time we can use a single run (N = 1). In this case the simulations are replaced by assuming that the ensemble averages (of the simulations) can be replaced by time averages, based on the ergodicity of the innovations, and only tests (2) and (3), based on the innovation properties, are performed.

The innovation bias and covariance can be evaluated using

  î = (1/K) Σ_{k=1}^{K} i(k),    Ŝ = (1/(K-1)) Σ_{k=1}^{K} i(k) i^T(k)
73

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 22)

Evaluation of Kalman Filter Consistency (continue – 2)

Real time (Single-Run Tests) (continue – 1)

Test 2:  E[z(k) - ẑ(k|k-1)] = E[i(k)] = 0  and  E[i(k) i^T(k)] = S(k)

Using the Time-Average Normalized Innovation Squared (NIS) statistic

  ε̄_i := (1/K) Σ_{k=1}^{K} i^T(k) S^{-1}(k) i(k)

K ε̄_i must have a chi-square distribution with K n_z degrees of freedom.

The test is successful if ε̄_i ∈ [r_1, r_2], where the confidence interval [r_1, r_2] is defined using the chi-square distribution of ε̄_i:

  Pr{ ε̄_i ∈ [r_1, r_2] } = 1 - α

For example, for K = 50, n_z = 2, and α = 0.05, using the two tails of the chi-square distribution, K ε̄_i ~ χ_100², we get

  χ_100²(0.025) = 74   →  r_1 = 74/50 = 1.5
  χ_100²(0.975) = 130  →  r_2 = 130/50 = 2.6
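The single-run NIS test above can be sketched in a few lines (illustrative code of my own; the default bounds hard-code the slide's K = 50, n_z = 2, α = 0.05 example rather than computing chi-square quantiles):

```python
import numpy as np

def nis_test(innovations, S_list, r1=74 / 50, r2=130 / 50):
    """Time-average NIS consistency test: pass if r1 <= eps_bar <= r2.

    Default bounds correspond to [chi2_100(0.025), chi2_100(0.975)] / K
    for K = 50 scans and n_z = 2, as in the example above."""
    K = len(innovations)
    eps_bar = sum(i @ np.linalg.inv(S) @ i for i, S in zip(innovations, S_list)) / K
    return r1 <= eps_bar <= r2, eps_bar
```

In practice the bounds would be recomputed from the chi-square distribution for the actual K and n_z (e.g. with `scipy.stats.chi2.ppf`).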
74

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 23)

Evaluation of Kalman Filter Consistency (continue – 3)

Real time (Single-Run Tests) (continue – 2)

Test 3: Whiteness of Innovation

Use the Normalized Time-Average Autocorrelation

  ρ̄_i(l) := Σ_{k=1}^{K} i^T(k) i(k+l) / [ Σ_{k=1}^{K} i^T(k) i(k) · Σ_{k=1}^{K} i^T(k+l) i(k+l) ]^{1/2}

In view of the Central Limit Theorem, for large K this statistic is normally distributed. For l ≠ 0 the variance can be shown to be 1/K, which tends to zero for large K.

Denoting by ξ a zero-mean, unit-variance normal random variable, let r_1 be such that

  Pr{ ξ ∈ [-r_1, r_1] } = 1 - α

For α = 0.05 the normal distribution gives r_1 = 1.96. Since ρ̄_i has standard deviation 1/√K, the corresponding probability region for α = 0.05 will be [-r, r], where

  r = r_1 / √K = 1.96 / √K
75

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 24)

Evaluation of Kalman Filter Consistency (continue – 4)

Monte-Carlo Simulation Based Tests

The tests are based on the results of Monte-Carlo simulations (runs) that provide N independent samples

  x̃_i(k|k) := x_i(k) - x̂_i(k|k),   P_i(k|k) = E[x̃_i(k|k) x̃_i^T(k|k)],   i = 1, …, N

Test 1: for each run i we compute at each scan k the Normalized (state) Estimation Error Squared (NEES)

  ε_{x_i}(k) := x̃_i^T(k|k) P_i^{-1}(k|k) x̃_i(k|k),   i = 1, …, N

Under the hypothesis that the Filter is consistent and the system is Linear Gaussian, ε_{x_i}(k) is chi-square distributed with n_x (the dimension of x) degrees of freedom. Then

  E[ε_{x_i}(k)] = n_x

The average over the N runs of ε_{x_i}(k) is

  ε̄_x(k) := (1/N) Σ_{i=1}^{N} ε_{x_i}(k)
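The NEES average can be sketched in the same style as the NIS test (illustrative code of my own, not from the presentation):

```python
import numpy as np

def average_nees(errors, covariances):
    """Average Normalized Estimation Error Squared over N Monte-Carlo runs
    at one scan: (1/N) sum_i  x~_i^T P_i^{-1} x~_i."""
    N = len(errors)
    return sum(e @ np.linalg.inv(P) @ e for e, P in zip(errors, covariances)) / N
```

For a consistent filter the result should be close to n_x, with N times the average chi-square distributed with N n_x degrees of freedom, as the next slide describes.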
76

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 25)

Evaluation of Kalman Filter Consistency (continue – 5)

Monte-Carlo Simulation Based Tests (continue – 1)

Test 1 (continue – 1): the average over the N runs of ε_{x_i}(k) is

  ε̄_x(k) := (1/N) Σ_{i=1}^{N} ε_{x_i}(k)

N ε̄_x must have a chi-square distribution with N n_x degrees of freedom.

The test is successful if ε̄_x ∈ [r_1, r_2], where the confidence interval [r_1, r_2] is defined using the chi-square distribution of ε̄_x:

  Pr{ ε̄_x ∈ [r_1, r_2] } = 1 - α

For example, for N = 50, n_x = 2, and α = 0.05, using the two tails of the chi-square distribution, N ε̄_x ~ χ_100², we get

  χ_100²(0.025) = 74   →  r_1 = 74/50 = 1.5
  χ_100²(0.975) = 130  →  r_2 = 130/50 = 2.6
77

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 26)

Evaluation of Kalman Filter Consistency (continue – 6)

Monte-Carlo Simulation Based Tests (continue – 2)

Test 2:  E[z(k) - ẑ(k|k-1)] = E[i(k)] = 0  and  E[i(k) i^T(k)] = S(k)

Using the Normalized Innovation Squared (NIS) statistic, computed from N Monte-Carlo runs:

  ε̄_i(k) := (1/N) Σ_{j=1}^{N} i_j^T(k) S_j^{-1}(k) i_j(k)

N ε̄_i must have a chi-square distribution with N n_z degrees of freedom.

The test is successful if ε̄_i ∈ [r_1, r_2], where the confidence interval [r_1, r_2] is defined using the chi-square distribution of ε̄_i:

  Pr{ ε̄_i ∈ [r_1, r_2] } = 1 - α

For example, for N = 50, n_z = 2, and α = 0.05, using the two tails of the chi-square distribution, N ε̄_i ~ χ_100², we get

  χ_100²(0.025) = 74   →  r_1 = 74/50 = 1.5
  χ_100²(0.975) = 130  →  r_2 = 130/50 = 2.6
78

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 27)

Evaluation of Kalman Filter Consistency (continue – 7)

Monte-Carlo Simulation Based Tests (continue – 3)

Test 3: Whiteness of Innovation

Use the Normalized Sample-Average Autocorrelation

  ρ̄_i(k, m) := Σ_{j=1}^{N} i_j^T(k) i_j(m) / [ Σ_{j=1}^{N} i_j^T(k) i_j(k) · Σ_{j=1}^{N} i_j^T(m) i_j(m) ]^{1/2}

In view of the Central Limit Theorem, for large N this statistic is normally distributed. For k ≠ m the variance can be shown to be 1/N, which tends to zero for large N.

Denoting by ξ a zero-mean, unit-variance normal random variable, let r_1 be such that

  Pr{ ξ ∈ [-r_1, r_1] } = 1 - α

For α = 0.05 the normal distribution gives r_1 = 1.96. Since ρ̄_i has standard deviation 1/√N, the corresponding probability region for α = 0.05 will be [-r, r], where r = r_1/√N = 1.96/√N.
79

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 28)

Evaluation of Kalman Filter Consistency (continue – 8)

Examples (Bar-Shalom, Y., Li, X.-R., "Estimation and Tracking: Principles, Techniques and Software", Artech House, 1993, pg. 242)

Monte-Carlo Simulation Based Tests (continue – 4)

Single run, 95% probability, for the system  x(k) = Φ x(k-1) + q:

  ε̄_x := (1/K) Σ_{k=1}^{K} x̃^T(k|k) P^{-1}(k|k) x̃(k|k)

Test (a) passes if ε̄_x ∈ [0, 5.99]. A one-sided region is considered; for n_x = 2 we have

  [0, χ_2²(0.95)] = [0, 5.99]

See the behavior of ε̄_x for various values of the process noise q for filters that are perfectly matched.
80

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 29)

Evaluation of Kalman Filter Consistency (continue – 9)

Examples (Bar-Shalom, Y., Li, X.-R., "Estimation and Tracking: Principles, Techniques and Software", Artech House, 1993, pg. 244)

Monte-Carlo Simulation Based Tests (continue – 5)

Monte-Carlo, N = 50, 95% probability:

(a)  ε̄_x(k) := (1/N) Σ_{j=1}^{N} x̃_j^T(k|k) P_j^{-1}(k|k) x̃_j(k|k)

     Test (a) passes if ε̄_x ∈ [74/50, 130/50] = [1.5, 2.6]
     (n_x N = 100:  [χ_100²(0.025), χ_100²(0.975)] = [74, 130])

(b)  ε̄_i(k) := (1/N) Σ_{j=1}^{N} i_j^T(k) S_j^{-1}(k) i_j(k)

     Test (b) passes if ε̄_i ∈ [32.3/50, 71.4/50] = [0.65, 1.43]
     (n_z N = 50:  [χ_50²(0.025), χ_50²(0.975)] = [32, 71])

(c)  ρ̄_i(k, m) := Σ_{j=1}^{N} i_j^T(k) i_j(m) / [ Σ_{j=1}^{N} i_j^T(k) i_j(k) · Σ_{j=1}^{N} i_j^T(m) i_j(m) ]^{1/2}

     The corresponding probability region for α = 0.05 is [-r, r], where r = 1.96/√N = 1.96/√50 = 0.28.
81

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 30)

Evaluation of Kalman Filter Consistency (continue – 10)

Examples (Bar-Shalom, Y., Li, X.-R., "Estimation and Tracking: Principles, Techniques and Software", Artech House, 1993, pg. 245)

Monte-Carlo Simulation Based Tests (continue – 6)

Example: Mismatched Filter

A mismatched filter is tested for the system  x(k) = Φ x(k-1) + q:  real system process noise q = 9, filter model process noise q_F = 1.

(1) Single run:

  ε̄_x := (1/K) Σ_{k=1}^{K} x̃^T(k|k) P^{-1}(k|k) x̃(k|k)

  Test (1) passes if ε̄_x ∈ [0, 5.99]  (n_x = 2:  [0, χ_2²(0.95)] = [0, 5.99]).   → Test fails.

(2) An N = 50 runs Monte-Carlo with the 95% probability region:

  ε̄_x(k) := (1/N) Σ_{j=1}^{N} x̃_j^T(k|k) P_j^{-1}(k|k) x̃_j(k|k)

  Test (2) passes if ε̄_x ∈ [74/50, 130/50] = [1.5, 2.6]  ([χ_100²(0.025), χ_100²(0.975)] = [74, 130]).   → Test fails.
82

Recursive Bayesian Estimation – Linear Gaussian Markov Systems (continue – 31)

Evaluation of Kalman Filter Consistency (continue – 11)

Examples (Bar-Shalom, Y., Li, X.-R., "Estimation and Tracking: Principles, Techniques and Software", Artech House, 1993, pg. 246)

Monte-Carlo Simulation Based Tests (continue – 7)

Example: Mismatched Filter (continue – 1)

The same mismatched filter is tested for the system  x(k) = Φ x(k-1) + q:  real system process noise q = 9, filter model process noise q_F = 1.

(3) An N = 50 runs Monte-Carlo with the 95% probability region:

  ε̄_i(k) := (1/N) Σ_{j=1}^{N} i_j^T(k) S_j^{-1}(k) i_j(k)

  Test (3) passes if ε̄_i ∈ [32.3/50, 71.4/50] = [0.65, 1.43].   → Test fails.

(4) An N = 50 runs Monte-Carlo with the 95% probability region:

  ρ̄_i(k, m) := Σ_{j=1}^{N} i_j^T(k) i_j(m) / [ Σ_{j=1}^{N} i_j^T(k) i_j(k) · Σ_{j=1}^{N} i_j^T(m) i_j(m) ]^{1/2}

  The probability region for α = 0.05 is [-r, r] with r = 1.96/√50 = 0.28.   → Test fails.
83

Extended Kalman Filter

[Figure: generic tracking-system block diagram – input data, sensor data processing and measurement formation, observation-to-track association, track maintenance, filtering and prediction, gating computations]

In the Extended Kalman Filter (EKF) the state transition and observation models need not be linear functions of the state but may instead be (differentiable) nonlinear functions:

State vector dynamics:  x(k+1) = f[x(k), u(k), k] + w(k)
Measurements:           z(k+1) = h[x(k+1), u(k+1), k+1] + v(k+1)

  e_x(k) := x(k) - E[x(k)],   E[e_x(k) e_x^T(k)] = P_x(k)
  e_w(k) := w(k) - E[w(k)],   E[e_w(k) e_w^T(l)] = Q_k δ_{k,l}
  E[e_w(k) e_v^T(l)] = 0  ∀ k, l        δ_{k,l} = { 1 if k = l, 0 if k ≠ l }

The function f can be used to compute the predicted state from the previous estimate, and similarly the function h can be used to compute the predicted measurement from the predicted state. However, f and h cannot be applied to the covariance directly. Instead, a matrix of partial derivatives (the Jacobian) is computed.

Taylor's Expansion:

  e_x(k+1) = f[x(k), u(k), k] - f[E[x(k)], u(k), k] + e_w(k)
           = (∂f/∂x)|_{E[x(k)]} e_x(k) + ½ e_x^T(k) (∂²f/∂x²)|_{E[x(k)]} e_x(k) + ⋯ + e_w(k)

  e_z(k+1) = h[x(k+1), u(k+1), k+1] - h[E[x(k+1)], u(k+1), k+1] + e_v(k+1)
           = (∂h/∂x)|_{E[x(k+1)]} e_x(k+1) + ½ e_x^T(k+1) (∂²h/∂x²)|_{E[x(k+1)]} e_x(k+1) + ⋯ + e_v(k+1)

where ∂f/∂x and ∂h/∂x are the Jacobians and ∂²f/∂x², ∂²h/∂x² the Hessians.
84

Extended Kalman Filter – State Estimation (one cycle)

State vector prediction:          x̂_{k|k-1} = f(k-1, x̂_{k-1|k-1}, u_{k-1})
Jacobians computation:            Φ_{k-1} = (∂f/∂x)|_{x̂_{k-1|k-1}},    H_k = (∂h/∂x)|_{x̂_{k|k-1}}
Covariance matrix extrapolation:  P_{k|k-1} = Φ_{k-1} P_{k-1|k-1} Φ_{k-1}^T + Q_{k-1}
Innovation:                       i_k = z_k - ẑ_{k|k-1} = z_k - h(k, x̂_{k|k-1})
Innovation covariance:            S_k = H_k P_{k|k-1} H_k^T + R_k
Gain matrix computation:          K_k = P_{k|k-1} H_k^T S_k^{-1}
Filtering:                        x̂_{k|k} = x̂_{k|k-1} + K_k i_k
Covariance matrix updating:
  P_{k|k} = P_{k|k-1} - K_k S_k K_k^T
          = (I - K_k H_k) P_{k|k-1}
          = (I - K_k H_k) P_{k|k-1} (I - K_k H_k)^T + K_k R_k K_k^T

k = k + 1
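The EKF cycle above differs from the linear one only in the prediction steps and the Jacobians. A minimal sketch (function names and the range-only measurement example are my own, not from the presentation):

```python
import numpy as np

def ekf_cycle(x, P, z, f, F_jac, h, H_jac, Q, R):
    """One EKF cycle; F_jac and H_jac return the Jacobians of f and h,
    evaluated at the prior and predicted estimates respectively."""
    # nonlinear state prediction, linearized covariance extrapolation
    x_pred = f(x)
    Phi = F_jac(x)
    P_pred = Phi @ P @ Phi.T + Q
    # nonlinear measurement prediction, innovation, gain
    H = H_jac(x_pred)
    S = H @ P_pred @ H.T + R
    K = P_pred @ H.T @ np.linalg.inv(S)
    x_new = x_pred + K @ (z - h(x_pred))
    P_new = (np.eye(len(x)) - K @ H) @ P_pred
    return x_new, P_new
```

As an example, for a static 2-D position with a range-only sensor, h(x) = ‖x‖ and the Jacobian is the unit row vector xᵀ/‖x‖.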
85

Extended Kalman Filter – State Estimation (one cycle)

[Figure: generic tracking-system block diagram, as on the previous Kalman Filter slide]
86

Unscented Kalman Filter

Criticism of the Extended Kalman Filter

Unlike its linear counterpart, the extended Kalman filter is not an optimal estimator. In addition, if the initial estimate of the state is wrong, or if the process is modeled incorrectly, the filter may quickly diverge, owing to its linearization. Another problem with the extended Kalman filter is that the estimated covariance matrix tends to underestimate the true covariance matrix and therefore risks becoming inconsistent in the statistical sense unless "stabilizing noise" is added. Having said this, the extended Kalman filter can give reasonable performance and is arguably the de facto standard in navigation systems and GPS.
87

Unscented Kalman Filter

When the state transition and observation models – that is, the predict and update functions f and h – are highly nonlinear, the extended Kalman filter can give particularly poor performance [JU97]. This is because only the mean is propagated through the nonlinearity. The unscented Kalman filter (UKF) [JU97] uses a deterministic sampling technique known as the unscented transform to pick a minimal set of sample points (called sigma points) around the mean. These sigma points are then propagated through the nonlinear functions and the covariance of the estimate is then recovered. The result is a filter which more accurately captures the true mean and covariance. (This can be verified using Monte Carlo sampling or through a Taylor series expansion of the posterior statistics.) In addition, this technique removes the requirement to analytically calculate Jacobians, which for complex functions can be a difficult task in itself.

State vector dynamics:  x(k+1) = f[x(k), u(k), k] + w(k)
Measurements:           z(k+1) = h[x(k+1), k+1] + v(k+1)

  e_x(k) := x(k) - E[x(k)],   E[e_x(k) e_x^T(k)] = P_x(k)
  e_w(k) := w(k) - E[w(k)],   E[e_w(k) e_w^T(l)] = Q_k δ_{k,l}
  E[e_w(k) e_v^T(l)] = 0  ∀ k, l        δ_{k,l} = { 1 if k = l, 0 if k ≠ l }

The Unscented Algorithm, using  e_x(k) := x(k) - E[x(k)]  with  E[e_x(k) e_x^T(k)] = P_x(k),
determines  e_z(k) := z(k) - E[z(k)]  with  E[e_z(k) e_z^T(k)] = P_z(k).
Unscented Kalman FilterSOLO
( ) ( )[ ]
( )n
n
j jj
nx
nx
nx
x
xxx
fxn
xxf
∂
∂=∇⋅
∇⋅=+
∑
∑
=
∞
=
1
0ˆ
:
!
1ˆ
δδ
δδ
Develop the nonlinear function f in a Taylor series around
x
Define also the operator ( )[ ] ( )xfx
xfxfD
nn
j jjx
nx
nx
x
∂
∂=∇⋅= ∑=1
: δδδ
Propagating Means and Covariances Through Nonlinear Transformations
Consider a nonlinear function .( )xfy =
Let compute
Assume is a random variable with a probability density function pX (x) (known orunknown) with mean and covariance
x ( ) ( ) Txx xxxxEPxEx ˆˆ,ˆ −−==
( )
( )[ ] ∑ ∑∑
∑∞
= =
∞
=
∞
=
∂
∂=∇⋅=
=+=
0ˆ
10ˆ
0
!
1
!
1
!
1ˆˆ
nx
nn
j jj
nx
nx
n
nx
fx
xEn
fxEn
DEn
xxfEy
x
δδ
δ δ
( ) ( ) xxTT PxxxxExxE
xxExE
xxx
=−−=
=−=+=
ˆˆ
0ˆ
ˆ
δδ
δδ
89

Unscented Kalman Filter

Propagating Means and Covariances Through Nonlinear Transformations (continue – 1)

Consider a nonlinear function y = f(x), with E[δx] = 0, E[δx δx^T] = P_xx. Then

  ŷ = E[f(x̂ + δx)] = f(x̂) + E[(δx · ∇)] f + (1/2!) E[(δx · ∇)²] f + (1/3!) E[(δx · ∇)³] f + (1/4!) E[(δx · ∇)⁴] f + ⋯

Since all the differentials of f are computed around the (non-random) mean x̂:

  E[(δx · ∇)] f = (E[δx] · ∇)|_{x̂} f = 0

  E[(δx · ∇)²] f = E[δx^T ∇ ∇^T δx]|_{x̂} f = E[∇^T δx δx^T ∇]|_{x̂} f = (∇^T P_xx ∇)|_{x̂} f

so that

  ŷ = E[f(x̂ + δx)] = f(x̂) + ½ (∇^T P_xx ∇) f + (1/3!) E[D_{δx}³ f] + (1/4!) E[D_{δx}⁴ f] + ⋯
90

Unscented Kalman Filter

Propagating Means and Covariances Through Nonlinear Transformations (continue – 2)

The Unscented Transformation (UT), proposed by Simon J. Julier and Jeffrey K. Uhlmann, uses a set of "sigma points" to provide an approximation of the probabilistic properties through the nonlinear function.

A set of "sigma points" S consists of p+1 vectors and their associated weights, S = { (x^(i), W^(i)), i = 0, 1, …, p }.

(1) Compute the transformation of the sigma points through the nonlinear transformation f:

  y^(i) = f(x^(i)),   i = 0, 1, …, p

(2) Compute the approximation of the mean:

  ŷ ≈ Σ_{i=0}^{p} W^(i) y^(i)

The estimation is unbiased if

  E[ Σ_{i=0}^{p} W^(i) y^(i) ] = Σ_{i=0}^{p} W^(i) E[y^(i)] = ŷ Σ_{i=0}^{p} W^(i) = ŷ   ⇔   Σ_{i=0}^{p} W^(i) = 1

(3) The approximation of the output covariance is given by:

  P_yy ≈ Σ_{i=0}^{p} W^(i) (y^(i) - ŷ)(y^(i) - ŷ)^T
Unscented Kalman FilterSOLO
Propagating Means and Covariances Through Nonlinear Transformations
Consider a nonlinear function (continue – 3)( )xfy =
One set of points that satisfies the above conditions consists of a symmetric set of symmetric p = 2nx points that lie on the covariance contour Pxx:
th
xn
( ) ( )
( )
( )
( ) ( ) ( )( ) ( ) ( )
x
xni
xi
xxxni
i
xxxi
ni
nWW
nWW
PW
nxx
PW
nxx
WWxx
x
x ,,1
2/1
2/1
1ˆ
1ˆ
ˆ
0
0
0
0
000
=
−=
−=
−
−=
−
+=
==
+
+
where is the row or column of the matrix square root of nx Pxx /(1-W0)(the original covariance matrix Pxx multiplied by the number of dimensions of x, nx/(1-W0)). This implies:
( )( )i
xxx WPn 01/ −
xxxn
i
T
i
xxx
i
xxx PW
nP
W
nP
W
nx
01 00 111 −=
−
−∑
=
Unscented Transformation (UT) (continue – 1)
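The defining property of this symmetric set, that the weighted sample mean and covariance reproduce \hat{x} and P_xx exactly, is easy to check numerically. The sketch below is a minimal Python illustration; it assumes a diagonal P_xx so that the matrix square root reduces to an elementwise square root, and the function name `symmetric_sigma_set` and the choice W_0 = 1/3 are illustrative, not taken from the source.

```python
import math

def symmetric_sigma_set(xhat, Pdiag, W0=1.0 / 3.0):
    """Symmetric set of 2n sigma points plus the center point.

    Pdiag holds the diagonal of P_xx; assuming a diagonal covariance lets
    the matrix square root reduce to an elementwise sqrt (illustration only).
    """
    n = len(xhat)
    scale = n / (1.0 - W0)              # n_x / (1 - W0) from the slide
    pts, W = [list(xhat)], [W0]
    Wi = (1.0 - W0) / (2 * n)
    for i in range(n):
        step = math.sqrt(scale * Pdiag[i])
        for sgn in (+1.0, -1.0):
            p = list(xhat)
            p[i] += sgn * step
            pts.append(p)
            W.append(Wi)
    return pts, W

xhat, Pdiag = [1.0, -2.0], [0.5, 2.0]
pts, W = symmetric_sigma_set(xhat, Pdiag)
mean = [sum(w * p[j] for w, p in zip(W, pts)) for j in range(2)]
cov = [sum(w * (p[j] - mean[j]) ** 2 for w, p in zip(W, pts)) for j in range(2)]
```

Whatever W_0 is chosen (W_0 < 1), the weighted mean equals \hat{x} and the weighted covariance equals P_xx, which is exactly the identity stated above.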
92
Unscented Kalman Filter
Propagating Means and Covariances Through Nonlinear Transformations

Consider a nonlinear function y = f(x) (continue – 3).

Unscented Transformation (UT) (continue – 2)

Expand each transformed sigma point in a Taylor series about \hat{x}, with

    \delta x_{i} := x^{(i)} - \hat{x} = \pm\left(\sqrt{\tfrac{n_x}{1-W^{(0)}}\,P_{xx}}\right)_{i}

    y^{(i)} = f\left(x^{(i)}\right) =
    \begin{cases}
      f(\hat{x}) & i = 0\\[2pt]
      \sum_{n=0}^{\infty}\frac{1}{n!}\,D_{\delta x_i}^{n}f(\hat{x}) & i = 1,\ldots,n_x\\[2pt]
      \sum_{n=0}^{\infty}\frac{1}{n!}\,D_{-\delta x_i}^{n}f(\hat{x}) & i = n_x+1,\ldots,2n_x
    \end{cases}

Since

    D_{-\delta x_i}^{n}f(\hat{x}) = \left(-\sum_{j}\delta x_{ij}\,\frac{\partial}{\partial x_{j}}\right)^{n}f\Big|_{x=\hat{x}} =
    \begin{cases}
      +D_{\delta x_i}^{n}f(\hat{x}) & n\ \text{even}\\
      -D_{\delta x_i}^{n}f(\hat{x}) & n\ \text{odd}
    \end{cases}

the odd-order terms cancel pairwise in the weighted sum, and the Unscented Algorithm gives

    \hat{y}_{UT} = \sum_{i=0}^{2n_x}W^{(i)}y^{(i)}
        = W^{(0)}f(\hat{x}) + \frac{1-W^{(0)}}{2n_x}\sum_{i=1}^{2n_x}\sum_{n=0}^{\infty}\frac{1}{n!}D_{\delta x_i}^{n}f(\hat{x})
        = f(\hat{x}) + \frac{1-W^{(0)}}{n_x}\sum_{i=1}^{n_x}\left[\frac{1}{2!}D_{\delta x_i}^{2}f(\hat{x}) + \frac{1}{4!}D_{\delta x_i}^{4}f(\hat{x}) + \frac{1}{6!}D_{\delta x_i}^{6}f(\hat{x}) + \cdots\right]
93
Unscented Kalman Filter
Propagating Means and Covariances Through Nonlinear Transformations

Consider a nonlinear function y = f(x) (continue – 4).

Unscented Transformation (UT) (continue – 3)

Using \delta x_{i} = \pm\left(\sqrt{n_x P_{xx}/(1-W^{(0)})}\right)_{i} and the identity for the sum of the outer products of the sigma-point deviations, the second-order term sums to

    \frac{1-W^{(0)}}{n_x}\sum_{i=1}^{n_x}\frac{1}{2!}\,D_{\delta x_i}^{2}f(\hat{x}) = \frac{1}{2}\left(\nabla^{T}P_{xx}\nabla\right)f\Big|_{x=\hat{x}}

so the Unscented Algorithm yields

    \hat{y}_{UT} = f(\hat{x}) + \frac{1}{2}\left(\nabla^{T}P_{xx}\nabla\right)f\Big|_{x=\hat{x}} + \frac{1-W^{(0)}}{n_x}\sum_{i=1}^{n_x}\left[\frac{1}{4!}D_{\delta x_i}^{4}f(\hat{x}) + \frac{1}{6!}D_{\delta x_i}^{6}f(\hat{x}) + \cdots\right]

We found for the true mean

    \hat{y} = f(\hat{x}) + \frac{1}{2}\left(\nabla^{T}P_{xx}\nabla\right)f\Big|_{x=\hat{x}} + \frac{1}{3!}E\{D_{\delta x}^{3}f\} + \frac{1}{4!}E\{D_{\delta x}^{4}f\} + \cdots

We can see that the two expressions agree exactly to the third order.
94
Unscented Kalman Filter
Propagating Means and Covariances Through Nonlinear Transformations

Consider a nonlinear function y = f(x) (continue – 5).

Unscented Transformation (UT) (continue – 4)

Accuracy of the Covariance:

The true output covariance is

    P_{yy} = E\left\{\left(y-\hat{y}\right)\left(y-\hat{y}\right)^{T}\right\} = E\{y\,y^{T}\} - \hat{y}\,\hat{y}^{T}

with the series

    y = \sum_{n=0}^{\infty}\frac{1}{n!}D_{\delta x}^{n}f(\hat{x}),\qquad
    \hat{y} = f(\hat{x}) + \frac{1}{2!}\left(\nabla^{T}P_{xx}\nabla\right)f + \frac{1}{3!}E\{D_{\delta x}^{3}f\} + \frac{1}{4!}E\{D_{\delta x}^{4}f\} + \cdots

Multiplying out the two series term by term, the leading term is the linearized covariance \nabla f\,P_{xx}\,\nabla f^{T} (with \nabla f the Jacobian of f at \hat{x}), while all remaining terms involve fourth- and higher-order moments of \delta x. Carrying out the same multiplication for the UT covariance P_{yy} \approx \sum_i W^{(i)}(y^{(i)}-\hat{y})(y^{(i)}-\hat{y})^{T} reproduces the same series up to and including the second-order (P_xx) terms. As for the mean, the covariance is therefore matched exactly to the third order, with errors appearing only in the fourth- and higher-order terms.
95
Unscented Kalman Filter

96
Unscented Kalman Filter

[Figure: the Unscented Transformation. Sigma points \chi_i = \bar{x} \pm \alpha\sqrt{P_x} are drawn on the covariance contour of P_x and propagated through the nonlinearity f to points \psi_i; the weighted sample mean \bar{z} = \sum_i \beta_i\,\psi_i and the weighted sample covariance P_z = \sum_i \beta_i\,(\psi_i-\bar{z})(\psi_i-\bar{z})^{T} approximate the transformed statistics.]

Table of Content
97
Unscented Kalman Filter
UKF Summary

System Definition:

    x_k = f(k-1, x_{k-1}, u_{k-1}) + w_{k-1},\qquad E\{w_k\}=0,\quad E\{w_k w_l^{T}\} = Q_k\,\delta_{k,l}
    z_k = h(k, x_k) + v_k,\hspace{5.2em} E\{v_k\}=0,\quad E\{v_k v_l^{T}\} = R_k\,\delta_{k,l}

Initialization of the UKF:

    \hat{x}_0 = E\{x_0\},\qquad P_{0|0} = E\{(x_0-\hat{x}_0)(x_0-\hat{x}_0)^{T}\}

With the augmented state x^{a} := [x^{T}\ w^{T}\ v^{T}]^{T}:

    \hat{x}^{a}_0 = E\{x^{a}_0\} = [\hat{x}^{T}_0\ \ 0\ \ 0]^{T},\qquad
    P^{a}_{0|0} = E\{(x^{a}_0-\hat{x}^{a}_0)(x^{a}_0-\hat{x}^{a}_0)^{T}\} = \begin{bmatrix}P_0&0&0\\0&Q&0\\0&0&R\end{bmatrix}

For k = 1, …, ∞:

1  Calculate the Sigma Points (L is the state dimension, \gamma := \sqrt{L+\lambda}, with \lambda the scaling parameter, \lambda = \alpha^{2}(L+\kappa)-L in the scaled form):

    \hat{x}^{(0)}_{k-1|k-1} = \hat{x}_{k-1|k-1}
    \hat{x}^{(i)}_{k-1|k-1} = \hat{x}_{k-1|k-1} + \gamma\left(\sqrt{P_{k-1|k-1}}\right)_{i},\qquad i = 1,\ldots,L
    \hat{x}^{(i+L)}_{k-1|k-1} = \hat{x}_{k-1|k-1} - \gamma\left(\sqrt{P_{k-1|k-1}}\right)_{i},\qquad i = 1,\ldots,L

2  State Prediction and its Covariance:

    \hat{x}^{(i)}_{k|k-1} = f\left(k-1,\ \hat{x}^{(i)}_{k-1|k-1},\ u_{k-1}\right),\qquad i = 0,1,\ldots,2L

    \hat{x}_{k|k-1} = \sum_{i=0}^{2L} W^{(m)}_{i}\,\hat{x}^{(i)}_{k|k-1},\qquad
    W^{(m)}_{0} = \frac{\lambda}{L+\lambda},\quad W^{(m)}_{i} = \frac{1}{2(L+\lambda)},\ i = 1,\ldots,2L

    P_{k|k-1} = \sum_{i=0}^{2L} W^{(c)}_{i}\left(\hat{x}^{(i)}_{k|k-1}-\hat{x}_{k|k-1}\right)\left(\hat{x}^{(i)}_{k|k-1}-\hat{x}_{k|k-1}\right)^{T},
    W^{(c)}_{0} = \frac{\lambda}{L+\lambda}+(1-\alpha^{2}+\beta),\quad W^{(c)}_{i} = \frac{1}{2(L+\lambda)},\ i = 1,\ldots,2L
98
Unscented Kalman Filter
UKF Summary (continue – 1)

3  Measurement Prediction:

    \hat{z}^{(i)}_{k|k-1} = h\left(k,\ \hat{x}^{(i)}_{k|k-1}\right),\qquad i = 0,1,\ldots,2L

    \hat{z}_{k|k-1} = \sum_{i=0}^{2L} W^{(m)}_{i}\,\hat{z}^{(i)}_{k|k-1}

4  Innovation and its Covariance:

    i_k = z_k - \hat{z}_{k|k-1}

    S_k = P^{zz}_{k|k-1} = \sum_{i=0}^{2L} W^{(c)}_{i}\left(\hat{z}^{(i)}_{k|k-1}-\hat{z}_{k|k-1}\right)\left(\hat{z}^{(i)}_{k|k-1}-\hat{z}_{k|k-1}\right)^{T}

5  Kalman Gain Computation:

    P^{xz}_{k|k-1} = \sum_{i=0}^{2L} W^{(c)}_{i}\left(\hat{x}^{(i)}_{k|k-1}-\hat{x}_{k|k-1}\right)\left(\hat{z}^{(i)}_{k|k-1}-\hat{z}_{k|k-1}\right)^{T}

    K_k = P^{xz}_{k|k-1}\left(P^{zz}_{k|k-1}\right)^{-1}

6  Update the State and its Covariance:

    \hat{x}_{k|k} = \hat{x}_{k|k-1} + K_k\,i_k

    P_{k|k} = P_{k|k-1} - K_k\,S_k\,K_k^{T}

k = k+1 & return to 1
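Steps 1–6 above can be sketched compactly for a scalar state. The Python sketch below uses the common additive-noise variant (Q and R added directly to the predicted covariances) instead of the augmented-state form of the summary, and all numerical values, as well as the chosen f and h, are made up for illustration.

```python
import math

def ukf_step(x, P, z, f, h, Q, R, alpha=1.0, beta=2.0, kappa=0.0):
    """One UKF predict/update cycle for a scalar state (L = 1),
    additive-noise variant of the summary above."""
    L = 1
    lam = alpha ** 2 * (L + kappa) - L
    g = math.sqrt(L + lam)
    Wm = [lam / (L + lam)] + [1.0 / (2 * (L + lam))] * 2
    Wc = [Wm[0] + (1.0 - alpha ** 2 + beta)] + Wm[1:]
    # 1. sigma points
    s = math.sqrt(P)
    X = [x, x + g * s, x - g * s]
    # 2. state prediction
    Xp = [f(xi) for xi in X]
    xp = sum(w * xi for w, xi in zip(Wm, Xp))
    Pp = sum(w * (xi - xp) ** 2 for w, xi in zip(Wc, Xp)) + Q
    # 3. measurement prediction
    Zp = [h(xi) for xi in Xp]
    zp = sum(w * zi for w, zi in zip(Wm, Zp))
    # 4. innovation covariance and 5. gain
    S = sum(w * (zi - zp) ** 2 for w, zi in zip(Wc, Zp)) + R
    Pxz = sum(w * (xi - xp) * (zi - zp) for w, xi, zi in zip(Wc, Xp, Zp))
    K = Pxz / S
    # 6. update
    return xp + K * (z - zp), Pp - K * S * K

x1, P1 = ukf_step(0.0, 1.0, z=0.8,
                  f=lambda x: 0.9 * x, h=lambda x: x, Q=0.1, R=0.5)
```

Because P_{k|k} = P_{k|k-1} − K S Kᵀ with S > 0, the updated variance is never larger than the predicted one whenever P^{xz} ≠ 0, which the test below confirms for this run.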
99
Unscented Kalman Filter
State Estimation (one cycle)

[Figure: one cycle of a tracking loop. Input Data → Sensor Data Processing and Measurement Formation → Observation-to-Track Association → Track Maintenance (Initialization, Confirmation and Deletion) → Filtering and Prediction → Gating Computations, which feed back to the association stage.]

Samuel S. Blackman, "Multiple-Target Tracking with Radar Applications", Artech House, 1986
Samuel S. Blackman, Robert Popoli, "Design and Analysis of Modern Tracking Systems", Artech House, 1999
100
Estimators
Kalman Filter Discrete Case & Colored Measurement Noise

Assume a discrete dynamic system with a colored (Markov) measurement noise:

    x(k+1) = \Phi(k)\,x(k) + G(k)\,u(k) + \Gamma(k)\,w(k)
    z(k)   = H(k)\,x(k) + v(k)
    v(k+1) = \Psi(k)\,v(k) + \xi(k)

    e_x(k) := x(k) - E\{x(k)\},\qquad E\{e_x(k)\,e_x^{T}(k)\} = P_x(k)
    e_w(k) := w(k) - \underbrace{E\{w(k)\}}_{0},\qquad E\{e_w(k)\,e_w^{T}(l)\} = Q(k)\,\delta_{k,l}
    e_\xi(k) := \xi(k) - \underbrace{E\{\xi(k)\}}_{0},\qquad E\{e_\xi(k)\,e_\xi^{T}(l)\} = R(k)\,\delta_{k,l}
    E\{e_w(k)\,e_\xi^{T}(l)\} = 0,\qquad \delta_{k,l} = 1\ \text{for}\ k=l,\ 0\ \text{for}\ k\neq l

Solution

Define a new "pseudo-measurement" that whitens the measurement noise:

    \zeta(k) := z(k+1) - \Psi(k)\,z(k) = H(k+1)\,x(k+1) + v(k+1) - \Psi(k)\left[H(k)\,x(k) + v(k)\right]
        = H(k+1)\left[\Phi(k)x(k) + G(k)u(k) + \Gamma(k)w(k)\right] + \Psi(k)v(k) + \xi(k) - \Psi(k)H(k)x(k) - \Psi(k)v(k)
        = \underbrace{\left[H(k+1)\Phi(k) - \Psi(k)H(k)\right]}_{H^{*}(k)}x(k) + H(k+1)G(k)u(k) + \underbrace{\left[H(k+1)\Gamma(k)w(k) + \xi(k)\right]}_{\varepsilon(k)}

101
Kalman Filter Discrete Case & Colored Measurement Noise - Solution (continue – 1)

The new discrete dynamic system is

    x(k+1) = \Phi(k)\,x(k) + G(k)\,u(k) + \Gamma(k)\,w(k)
    \zeta(k) = H^{*}(k)\,x(k) + H(k+1)\,G(k)\,u(k) + \varepsilon(k)

with

    H^{*}(k) := H(k+1)\,\Phi(k) - \Psi(k)\,H(k)
    \varepsilon(k) := H(k+1)\,\Gamma(k)\,w(k) + \xi(k),\qquad E\{\varepsilon(k)\} = 0
    E\{\varepsilon(k)\,\varepsilon^{T}(l)\} = \left[H(k+1)\Gamma(k)Q(k)\Gamma^{T}(k)H^{T}(k+1) + R(k)\right]\delta_{k,l} =: R^{*}(k)\,\delta_{k,l}

The pseudo-measurement noise \varepsilon(k) is now correlated with the system noise w(k):

    E\{w(k)\,\varepsilon^{T}(l)\} = E\{w(k)\left[H(l+1)\Gamma(l)w(l)+\xi(l)\right]^{T}\} = Q(k)\,\Gamma^{T}(k)\,H^{T}(k+1)\,\delta_{k,l}

To de-correlate the measurement and system noises, add the zero term D(k)[\zeta(k) - H^{*}(k)x(k) - H(k+1)G(k)u(k) - \varepsilon(k)] to the state equation:

    x(k+1) = \left[\Phi(k) - D(k)H^{*}(k)\right]x(k) + G(k)u(k) - D(k)H(k+1)G(k)u(k) + D(k)\,\zeta(k) + \left[\Gamma(k)w(k) - D(k)\varepsilon(k)\right]

102
Kalman Filter Discrete Case & Colored Measurement Noise - Solution (continue – 2)

Choose D(k) such that the new process noise \Gamma(k)w(k) - D(k)\varepsilon(k) is uncorrelated with \varepsilon(k):

    E\{\left[\Gamma(k)w(k) - D(k)\varepsilon(k)\right]\varepsilon^{T}(k)\} = \Gamma(k)Q(k)\Gamma^{T}(k)H^{T}(k+1) - D(k)\,R^{*}(k) = 0

    D(k) = \Gamma(k)\,Q(k)\,\Gamma^{T}(k)\,H^{T}(k+1)\left[H(k+1)\,\Gamma(k)\,Q(k)\,\Gamma^{T}(k)\,H^{T}(k+1) + R(k)\right]^{-1}

The Discrete Kalman Filter time update (prediction) is then:

    \hat{x}(k+1|k) = \Phi(k)\,\hat{x}(k|k) + G(k)\,u(k) + D(k)\left[\zeta(k) - H^{*}(k)\,\hat{x}(k|k) - H(k+1)\,G(k)\,u(k)\right],\qquad \hat{x}(0|0) = E\{x(0)\}

    P(k+1|k) = \left[\Phi(k)-D(k)H^{*}(k)\right]P(k|k)\left[\Phi(k)-D(k)H^{*}(k)\right]^{T} + \Gamma(k)Q(k)\Gamma^{T}(k) - D(k)\,R^{*}(k)\,D^{T}(k),\qquad P(0|0) = P_0

103
Kalman Filter Discrete Case & Colored Measurement Noise - Solution (continue – 3)

and the measurement update is:

    P^{-1}(k+1|k+1) = P^{-1}(k+1|k) + H^{*T}(k+1)\,R^{*-1}(k+1)\,H^{*}(k+1)
    K(k+1) = P(k+1|k+1)\,H^{*T}(k+1)\,R^{*-1}(k+1)
    \hat{x}(k+1|k+1) = \hat{x}(k+1|k) + K(k+1)\left[\zeta(k+1) - H^{*}(k+1)\,\hat{x}(k+1|k)\right]

104
Kalman Filter Discrete Case & Colored Measurement Noise - Solution (continue – 4)

Summary: the filter processes the pseudo-measurement

    \zeta(k) = z(k+1) - \Psi(k)\,z(k)

with the modified matrices H^{*}(k), R^{*}(k) and the de-correlating gain D(k) defined above; the time- and measurement-update equations are those of the standard discrete Kalman filter written for this modified system.

Table of Content
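A quick numerical sanity check of the de-correlating gain D(k): for scalar, time-invariant values (all illustrative, not from the source), the sample cross-moment between the new process noise Γw − Dε and ε should average to zero.

```python
import random

random.seed(0)
Gam, H1, Q, R = 1.0, 2.0, 0.5, 0.3        # Gamma, H(k+1), Q, R (illustrative)
# scalar form of D = Gamma Q Gamma^T H^T (H Gamma Q Gamma^T H^T + R)^{-1}
D = Gam * Q * Gam * H1 / (H1 * Gam * Q * Gam * H1 + R)

N = 200_000
acc = 0.0
for _ in range(N):
    w = random.gauss(0.0, Q ** 0.5)       # process noise w(k)
    xi = random.gauss(0.0, R ** 0.5)      # measurement driving noise xi(k)
    eps = H1 * Gam * w + xi               # eps(k) = H(k+1) Gamma(k) w(k) + xi(k)
    acc += (Gam * w - D * eps) * eps      # sample cross-moment, ~0 by design
corr = acc / N
```

The defining equation ΓQΓᵀHᵀ − D R* = 0 holds exactly by construction, and the Monte Carlo average confirms it statistically.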
105
Estimators
Optimal State Estimation in Linear Stationary Systems

We want to estimate a vector signal s(t) (n×1) that, after being corrupted by noise n(t) (n×1), passes through a Linear Stationary Filter. We want to design the filter so as to estimate the signal using only the measured filter output vector y(t) (n×1).

The output of the Stationary Filter is given by:

    y(t) = \int_{t_0}^{t} H(t-\lambda)\left[s(\lambda)+n(\lambda)\right]d\lambda

where H(t) (n×n) is the impulse response matrix of the Stationary Filter. The noise n(t) has autocorrelation

    E\{n(t)\,n^{T}(t+\tau)\} = R_{nn}(\tau) = R_{nn}(-\tau)

and is uncorrelated with the signal:

    E\{s(t)\,n^{T}(t+\tau)\} = E\{n(t)\,s^{T}(t+\tau)\} = 0

The uncorrupted signal is observed through a linear system with impulse response I(t) and output y_i(t):

    y_i(t) = \int_{t_0}^{t} I(t-\lambda)\,s(\lambda)\,d\lambda

We want to choose the Stationary Filter that minimizes

    E\{\left[y_i(t)-y(t)\right]^{T}\left[y_i(t)-y(t)\right]\} = E\{e^{T}(t)\,e(t)\} = \text{trace}\,E\{e(t)\,e^{T}(t)\}

where the trace of a square matrix A = [a_{i,j}]_{n\times n} is the sum of its diagonal terms, \text{trace}\,A = \sum_{i=1}^{n}a_{i,i}.

106-108
Optimal State Estimation in Linear Stationary Systems (continue – 1 to 3)

Substituting e(t) = y_i(t) − y(t) and using the signal and noise correlation functions, the error autocorrelation R_{ee}(\tau) = E\{e(t)\,e^{T}(t+\tau)\} contains signal terms filtered through I − H and noise terms filtered through H. Taking the bilateral Laplace transform of each correlation function,

    S(s) = \int_{-\infty}^{+\infty} R(\tau)\,e^{-s\tau}\,d\tau,\qquad R(\tau) = R^{T}(-\tau)\ \Rightarrow\ S(s) = S^{T}(-s)

one obtains the error spectral density

    S_{ee}(s) = \left[I(s)-H(s)\right]S_{ss}(s)\left[I(-s)-H(-s)\right]^{T} + H(s)\,S_{nn}(s)\,H^{T}(-s)

109
Optimal State Estimation in Linear Stationary Systems (continue – 4)

The quantity to be minimized over the filter H(t) is

    \min_{H}\,\text{trace}\,E\{e^{T}(t)\,e(t)\} = \min_{H}\,\text{trace}\,R_{ee}(0) = \min_{H}\,\text{trace}\,\frac{1}{2\pi j}\int_{-j\infty}^{+j\infty} S_{ee}(s)\,ds

Using the Calculus of Variations, write H(s) = \hat{H}(s) + \varepsilon\,\Psi(s), \varepsilon \to 0, and set \partial/\partial\varepsilon|_{\varepsilon=0} = 0. The two resulting integrals (one in \Psi(s), one in \Psi(-s)) map into each other under s → −s, so each must vanish:

    \frac{1}{2\pi j}\int_{-j\infty}^{+j\infty}\text{trace}\left\{\left(\left[I(s)-\hat{H}(s)\right]S_{ss}(s) - \hat{H}(s)S_{nn}(s)\right)\Psi^{T}(-s)\right\}ds = 0

110
Optimal State Estimation in Linear Stationary Systems (continue – 5)

This integral is zero for all \Psi^{T}(-s) \neq 0 if and only if:

    \left[I(s)-\hat{H}(s)\right]S_{ss}(s) - \hat{H}(s)\,S_{nn}(s) = 0
    \quad\Rightarrow\quad \hat{H}(s)\left[S_{ss}(s)+S_{nn}(s)\right] = I(s)\,S_{ss}(s)

Since S_{ss}(s)+S_{nn}(s) = [S_{ss}(-s)+S_{nn}(-s)]^{T}, we can perform a Spectral Decomposition:

    S_{ss}(s)+S_{nn}(s) = \Delta(s)\,\Delta^{T}(-s)

    \Delta(s):\ \text{all poles and zeros in the left half of the s-plane}
    \Delta^{T}(-s):\ \text{all poles and zeros in the right half of the s-plane}

    \hat{H}(s)\,\Delta(s)\,\Delta^{T}(-s) = I(s)\,S_{ss}(s)
    \quad\Rightarrow\quad \hat{H}(s) = \left[I(s)\,S_{ss}(s)\,\Delta^{-T}(-s)\right]_{\text{Realizable Part}}\Delta^{-1}(s)

111
Example 8.3-2 (Sage, "Optimum System Control", Prentice Hall, 1968, pp. 191-192)

    S_{ss}(s) = \frac{3}{1-s^{2}},\qquad S_{nn}(s) = 1,\qquad I(s) = 1

Solution:

    S_{ss}(s)+S_{nn}(s) = \frac{3}{1-s^{2}}+1 = \frac{4-s^{2}}{1-s^{2}} = \underbrace{\frac{s+2}{s+1}}_{\Delta(s)}\cdot\underbrace{\frac{-s+2}{-s+1}}_{\Delta(-s)}

    I(s)\,S_{ss}(s)\,\Delta^{-1}(-s) = \frac{3}{(1-s)(1+s)}\cdot\frac{1-s}{2-s} = \frac{3}{(1+s)(2-s)}
        = \underbrace{\frac{1}{1+s}}_{\text{Realizable Part}} + \underbrace{\frac{1}{2-s}}_{\text{Un-realizable Part}}

    \hat{H}(s) = \frac{1}{1+s}\cdot\frac{s+1}{s+2} = \frac{1}{s+2}

The minimum error is

    \min_{H}E\{e^{T}e\} = \frac{1}{2\pi j}\int_{-j\infty}^{+j\infty}S_{ee}(s)\,ds
        = \frac{1}{2\pi j}\int_{-j\infty}^{+j\infty}\frac{4}{(2+s)(2-s)}\,ds = 1
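The algebra in Example 8.3-2 can be verified numerically at arbitrary complex frequencies. The sketch below checks the spectral factorization, the partial-fraction split into realizable and un-realizable parts, and the final filter Ĥ(s) = 1/(s+2); the sample frequencies are arbitrary points away from the poles.

```python
def S_ss(s):   return 3.0 / (1.0 - s * s)      # signal spectrum
def Delta(s):  return (s + 2.0) / (s + 1.0)    # spectral factor (LHP poles/zeros)
def H_hat(s):  return 1.0 / (s + 2.0)          # claimed optimal filter

checks = []
for s in (0.3 + 0.7j, -1.5 + 2.0j, 0.01 - 3.0j):
    # factorization: S_ss + S_nn = Delta(s) Delta(-s), with S_nn = 1
    checks.append(abs(S_ss(s) + 1.0 - Delta(s) * Delta(-s)))
    # partial fractions: 3/((1+s)(2-s)) = 1/(1+s) + 1/(2-s)
    checks.append(abs(S_ss(s) / Delta(-s) - (1.0 / (1 + s) + 1.0 / (2 - s))))
    # realizable part times Delta^{-1}: (1/(1+s)) (s+1)/(s+2) = 1/(s+2)
    checks.append(abs((1.0 / (1 + s)) / Delta(s) - H_hat(s)))
```

All three identities should hold to machine precision, since they are exact rational-function equalities.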
112
Estimators
Optimal State Estimation in Linear Stationary Systems (continue – 7)

Example 8.5-4 (Sage, "Optimum System Control", Prentice Hall, 1968, pp. 211-213)

    \dot{x} = A\,x + B\,w,\qquad y = x + v
    E\{w(t_1)\,w^{T}(t_2)\} = Q\,\delta(t_1-t_2),\qquad E\{v(t_1)\,v^{T}(t_2)\} = R\,\delta(t_1-t_2)
    E\{v(t_1)\,w^{T}(t_2)\} = E\{w(t_1)\,v^{T}(t_2)\} = 0\qquad \forall\,t_1,t_2
    s(t) = x(t),\qquad n(t) = v(t),\qquad I(t) = 1

Solution: with X(s) = (sI-A)^{-1}B\,W(s),

    S_{ss}(s) = (sI-A)^{-1}B\,Q\,B^{T}(-sI-A^{T})^{-1},\qquad S_{nn}(s) = R

    S_{ss}(s)+S_{nn}(s) = (sI-A)^{-1}B\,Q\,B^{T}(-sI-A^{T})^{-1} + R = \Delta(s)\,\Delta^{T}(-s)

113-115
Optimal State Estimation in Linear Stationary Systems (continue – 8 to 10)

Seek the spectral factor in the form

    \Delta(s) = (sI-A)^{-1}\left(sI-A+P\,R^{-1}\right)R^{1/2}

Matching \Delta(s)\,\Delta^{T}(-s) with (sI-A)^{-1}BQB^{T}(-sI-A^{T})^{-1}+R term by term, and separating the realizable and un-realizable parts of I(s)\,S_{ss}(s)\,\Delta^{-T}(-s), shows that the symmetric matrix P must satisfy the Continuous Algebraic Riccati Equation (CARE):

    A\,P + P\,A^{T} - P\,R^{-1}P + B\,Q\,B^{T} = 0

116
Optimal State Estimation in Linear Stationary Systems (continue – 11)

With this P, the optimal filter

    \hat{H}(s) = \left[I(s)\,S_{ss}(s)\,\Delta^{-T}(-s)\right]_{\text{Realizable Part}}\Delta^{-1}(s)

reduces, after a chain of matrix identities, to

    \hat{H}(s) = \left(sI - A + P\,R^{-1}\right)^{-1}P\,R^{-1}

which is realized by the differential equation

    \dot{\hat{x}} = A\,\hat{x} + P\,R^{-1}\left(y-\hat{x}\right)

where P is given by the CARE above. These solutions are particular solutions of the Kalman Filter algorithm for a Stationary System and infinite observation time (the Wiener Filter).

Table of Content
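For a scalar system the CARE above is a quadratic in p and can be solved in closed form. The sketch below uses a = −1, b = 1, q = 3, r = 1, which is a shaping-filter equivalent of Example 8.3-2 (it gives S_ss(s) = 3/(1−s²) and S_nn = 1), so the resulting filter pole should land at s = −2, matching Ĥ(s) = 1/(s+2); this correspondence is our observation, not stated explicitly in the source.

```python
import math

# scalar stationary system: xdot = a x + b w,  y = x + v
a, b, q, r = -1.0, 1.0, 3.0, 1.0     # shaping filter for S_ss(s) = 3/(1 - s^2)

# scalar CARE: 2 a p - p^2 / r + b^2 q = 0  (take the positive root)
p = r * (a + math.sqrt(a * a + b * b * q / r))
K = p / r                            # steady-state Kalman (Wiener) gain
pole = a - K                         # pole of  xhat_dot = a xhat + K (y - xhat)
```

Here p = 1, K = 1, and the filter pole sits at −2, i.e. the transfer function from y to x̂ is 1/(s+2), exactly the Wiener solution of Example 8.3-2.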
117
Estimators
Kalman Filter Continuous Time Case

Assume a continuous-time linear dynamic system

    \frac{d}{dt}x(t) = F(t)\,x(t) + G(t)\,w(t)
    z(t) = H(t)\,x(t) + v(t)

    e_x(t) := x(t)-E\{x(t)\},\qquad E\{e_x(t)\,e_x^{T}(t)\} = P(t)
    e_w(t) := w(t)-\underbrace{E\{w(t)\}}_{0},\qquad E\{e_w(t_1)\,e_w^{T}(t_2)\} = Q(t_1)\,\delta(t_1-t_2)
    e_v(t) := v(t)-\underbrace{E\{v(t)\}}_{0},\qquad E\{e_v(t_1)\,e_v^{T}(t_2)\} = R(t_1)\,\delta(t_1-t_2)
    E\{e_w(t_1)\,e_v^{T}(t_2)\} = 0

We look for a Linear Filter whose state \hat{x}(t) is a function of Z(t) (the history of z for t_0 < \tau < t):

    \hat{x}(t) = B(t,t_0)\,\hat{x}(t_0) + \int_{t_0}^{t} A(t,\tau)\,z(\tau)\,d\tau

such that it minimizes

    J = E\{\left[\hat{x}(t)-x(t)\right]^{T}\left[\hat{x}(t)-x(t)\right]\} = E\{\tilde{x}^{T}(t)\,\tilde{x}(t)\},\qquad \tilde{x}(t) := \hat{x}(t)-x(t)

subject to the Unbiased Estimator condition E\{\hat{x}(t)\} = E\{x(t)\}, i.e. E\{\tilde{x}(t)\} = 0.

118-119
Kalman Filter Continuous Time Case (continue – 1, 2)

Substituting the filter into E\{\tilde{x}\,\tilde{x}^{T}\} produces terms in E\{x(t)z^{T}(\tau)\}, E\{z(\tau)z^{T}(\lambda)\} and the initial-condition kernel B(t,t_0). Perturb the optimal kernels,

    A(t,\tau) = \hat{A}(t,\tau) + \varepsilon\,\eta(t,\tau),\qquad B(t,t_0) = \hat{B}(t,t_0) + \varepsilon\,\nu(t,t_0)

and set \partial J/\partial\varepsilon|_{\varepsilon=0} = 0 for all \eta(t,\tau) and \nu(t,t_0).

120
Kalman Filter Continuous Time Case (continue – 3)

This is possible for all \eta(t,\tau), \nu(t,t_0) if and only if

    \hat{B}(t,t_0) = 0

and the Wiener-Hopf Equation holds:

    E\{x(t)\,z^{T}(\lambda)\} = \int_{t_0}^{t}\hat{A}(t,\tau)\,E\{z(\tau)\,z^{T}(\lambda)\}\,d\tau,\qquad t_0 < \lambda < t

From this we can see that, with \tilde{x}(t) = \hat{x}(t)-x(t) = \int_{t_0}^{t}\hat{A}(t,\tau)z(\tau)d\tau - x(t),

    E\{\tilde{x}(t)\,z^{T}(\lambda)\} = 0,\qquad t_0 < \lambda < t\qquad\text{(Orthogonal Projection Theorem)}

[Photos: Norbert Wiener (1894-1964), Eberhard Frederich Ferdinand Hopf (1902-1983)]

121
Kalman Filter Continuous Time Case (continue – 4)

Solution of the Wiener-Hopf Equation. Differentiate it with respect to t:

    \frac{\partial}{\partial t}E\{x(t)z^{T}(\lambda)\} = E\left\{\frac{dx}{dt}z^{T}(\lambda)\right\} = F(t)\,E\{x(t)z^{T}(\lambda)\} + G(t)\underbrace{E\{w(t)z^{T}(\lambda)\}}_{0,\ \lambda<t}

    \frac{\partial}{\partial t}\int_{t_0}^{t}\hat{A}(t,\tau)E\{z(\tau)z^{T}(\lambda)\}d\tau = \hat{A}(t,t)\,E\{z(t)z^{T}(\lambda)\} + \int_{t_0}^{t}\frac{\partial\hat{A}(t,\tau)}{\partial t}E\{z(\tau)z^{T}(\lambda)\}d\tau

with

    \hat{A}(t,t)\,E\{z(t)z^{T}(\lambda)\} = \hat{A}(t,t)\,H(t)\,E\{x(t)z^{T}(\lambda)\} + \hat{A}(t,t)\underbrace{E\{v(t)z^{T}(\lambda)\}}_{0,\ \lambda<t}

Subtracting, and substituting the Wiener-Hopf equation for E\{x(t)z^{T}(\lambda)\}:

    \int_{t_0}^{t}\left[F(t)\,\hat{A}(t,\tau) - \hat{A}(t,t)\,H(t)\,\hat{A}(t,\tau) - \frac{\partial\hat{A}(t,\tau)}{\partial t}\right]E\{z(\tau)z^{T}(\lambda)\}\,d\tau = 0

122
Kalman Filter Continuous Time Case (continue – 5)

This is true for all \lambda only if

    \frac{\partial\hat{A}(t,\tau)}{\partial t} = F(t)\,\hat{A}(t,\tau) - \hat{A}(t,t)\,H(t)\,\hat{A}(t,\tau)

Define K(t) := \hat{A}(t,t). The Optimal Filter was found to be \hat{x}(t) = \int_{t_0}^{t}\hat{A}(t,\tau)z(\tau)d\tau, so

    \frac{d\hat{x}}{dt} = \hat{A}(t,t)\,z(t) + \int_{t_0}^{t}\frac{\partial\hat{A}(t,\tau)}{\partial t}z(\tau)d\tau
        = K(t)\,z(t) + \left[F(t)-K(t)H(t)\right]\int_{t_0}^{t}\hat{A}(t,\tau)z(\tau)d\tau

Therefore the Optimal Filter is given by:

    \frac{d}{dt}\hat{x}(t) = F(t)\,\hat{x}(t) + K(t)\left[z(t) - H(t)\,\hat{x}(t)\right]

123
Kalman Filter Continuous Time Case (continue – 6)

Evaluating \hat{A}(t,t) from the Wiener-Hopf equation requires the correlations E\{x(t)z^{T}(\lambda)\} = E\{x(t)x^{T}(\lambda)\}H^{T}(\lambda) and E\{z(t)z^{T}(\lambda)\}, together with the state transition representation of E\{x(t)x^{T}(\lambda)\}. It must still be proven that

    K(t) = \hat{A}(t,t) = P(t)\,H^{T}(t)\,R^{-1}(t)

Table of Content
124
Eberhard Frederich Ferdinand Hopf
1902 - 1983
In 1930 Hopf received a fellowship from the Rockefeller Foundation to study classical mechanics with Birkhoff at Harvard in the United States. He arrived in Cambridge, Massachusetts in October of 1930, but his official affiliation was not the Harvard Mathematics Department; it was, instead, the Harvard College Observatory. While in the Harvard College Observatory he worked on many mathematical and astronomical subjects, including topology and ergodic theory. In particular he studied the theory of measure and invariant integrals in ergodic theory, and his paper On time average theorem in dynamics, which appeared in the Proceedings of the National Academy of Sciences, is considered by many as the first readable paper in modern ergodic theory. Another important contribution from this period was the Wiener-Hopf equations, which he developed in collaboration with Norbert Wiener from the Massachusetts Institute of Technology. By 1960, a discrete version of these equations was being extensively used in electrical engineering and geophysics, their use continuing to the present day. Other work which he undertook during this period was on stellar atmospheres and on elliptic partial differential equations.
On 14 December 1931, with the help of Norbert Wiener, Hopf joined the Department of Mathematics of the Massachusetts Institute of Technology, accepting the position of Assistant Professor. Initially he had a three-year contract, but this was subsequently extended to four years (1931 to 1936). While at MIT, Hopf did much of his work on ergodic theory, which he published in papers such as Complete Transitivity and the Ergodic Principle (1932), Proof of Gibbs Hypothesis on Statistical Equilibrium (1932) and On Causality, Statistics and Probability (1934). In this 1934 paper Hopf discussed the method of arbitrary functions as a foundation for probability and many related concepts. Using these concepts Hopf was able to give a unified presentation of many results in ergodic theory that he and others had found since 1931. He also published a book, Mathematical problems of radiative equilibrium, in 1934, which was reprinted in 1964. In addition to being an outstanding mathematician, Hopf had the ability to illuminate the most complex subjects for his colleagues and even for non-specialists. Because of this talent, many discoveries and demonstrations of other mathematicians became easier to understand when described by Hopf.
http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Hopf_Eberhard.html
125
Estimators
Kalman Filter Continuous Time Case (Second Way)

Assume a continuous-time dynamic system

    \frac{d}{dt}x(t) = F(t)\,x(t) + G(t)\,w(t),\qquad z(t) = H(t)\,x(t) + v(t)

with the same noise statistics as before: w and v zero-mean, white, with intensities Q(t) and R(t), and E\{e_w(t_1)e_v^{T}(t_2)\} = 0. We look for a Linear Filter whose state \hat{x}(t) is a function of Z(t) (the history of z for t_0 < \tau < t). Assume the Linear Filter:

    \frac{d}{dt}\hat{x}(t) = K'(t)\,\hat{x}(t) + K(t)\,z(t)

where K'(t) and K(t) will be chosen such that:

1  The Filter is Unbiased: E\{\hat{x}(t)\} = E\{x(t)\}.
2  The Filter yields a maximum rate of decrease of the error by minimizing the scalar cost function:

    J = \min_{K',K}\,\text{trace}\,\frac{d}{dt}E\{\left[\hat{x}-x\right]\left[\hat{x}-x\right]^{T}\} = \min_{K',K}\,\text{trace}\,\frac{d}{dt}P(t)

126
Kalman Filter Continuous Time Case (Second Way – continue – 1)

Solution. Define \tilde{x}(t) := \hat{x}(t)-x(t). Then

    \dot{\tilde{x}} = K'\hat{x} + K(Hx+v) - Fx - Gw = K'\,\tilde{x} + \left[K' + KH - F\right]x + K\,v - G\,w

Taking expectations (E\{w\} = E\{v\} = 0, E\{\tilde{x}(t_0)\} = 0), we can see that the necessary condition for an unbiased estimator, for all x, is:

    K'(t) = F(t) - K(t)\,H(t)

Therefore:

    \dot{\tilde{x}} = \left[F(t)-K(t)H(t)\right]\tilde{x} + K(t)\,v(t) - G(t)\,w(t)

and the Unbiased Filter has the form:

    \frac{d}{dt}\hat{x}(t) = F(t)\,\hat{x}(t) + K(t)\left[z(t) - H(t)\,\hat{x}(t)\right]

127
Kalman Filter Continuous Time Case (Second Way – continue – 2)

The error covariance P(t) = E\{\tilde{x}\,\tilde{x}^{T}\} then propagates as

    \dot{P} = \left[F-KH\right]P + P\left[F-KH\right]^{T} + K\,R\,K^{T} + G\,Q\,G^{T}

To obtain the optimal K(t) that minimizes J, set \partial(\text{trace}\,\dot{P})/\partial K = 0. Using the matrix identities \partial\,\text{trace}(ABA^{T})/\partial A = A(B+B^{T}) and \partial\,\text{trace}(AB)/\partial A = B^{T}:

    \frac{\partial}{\partial K}\,\text{trace}\,\dot{P} = -2\,P\,H^{T} + 2\,K\,R = 0

which gives

    K(t) = P(t)\,H^{T}(t)\,R^{-1}(t)

Table of Content
128
Estimators
Applications
Table of Content
129
Estimators
Multi-sensor Estimate

Consider a system comprised of two sensors, each making a single measurement, z_i (i = 1,2), of a constant but unknown quantity, x, in the presence of random, dependent, unbiased measurement errors, v_i (i = 1,2). We want to design an optimal estimator that combines the two measurements.

    z_1 = x + v_1,\qquad E\{v_1\} = 0,\qquad E\{v_1^{2}\} = \sigma_1^{2}
    z_2 = x + v_2,\qquad E\{v_2\} = 0,\qquad E\{v_2^{2}\} = \sigma_2^{2}
    E\{v_1 v_2\} = \rho\,\sigma_1\sigma_2,\qquad -1 \le \rho \le 1

In the absence of any other information, we choose an estimator that combines the two measurements linearly:

    \hat{x} = k_1\,z_1 + k_2\,z_2

where k_1 and k_2 must be found such that:

1. The Estimator is Unbiased: E\{\tilde{x}\} = E\{\hat{x}-x\} = 0.

    E\{\hat{x}-x\} = E\{k_1(x+v_1)+k_2(x+v_2)-x\} = k_1\underbrace{E\{v_1\}}_{0} + k_2\underbrace{E\{v_2\}}_{0} + (k_1+k_2-1)x = 0
    \quad\Rightarrow\quad k_1 + k_2 = 1

130
Multi-sensor Estimate (continue – 1)

2. Minimize the Mean Square Estimation Error:

    \min_{k_1,k_2}E\{(\hat{x}-x)^{2}\} = \min_{k_1}E\{\left[k_1 v_1 + (1-k_1)v_2\right]^{2}\}
        = \min_{k_1}\left[k_1^{2}\sigma_1^{2} + (1-k_1)^{2}\sigma_2^{2} + 2k_1(1-k_1)\rho\,\sigma_1\sigma_2\right]

    \frac{\partial}{\partial k_1}:\quad 2k_1\sigma_1^{2} - 2(1-k_1)\sigma_2^{2} + 2(1-2k_1)\rho\,\sigma_1\sigma_2 = 0

    \hat{k}_1 = \frac{\sigma_2^{2}-\rho\,\sigma_1\sigma_2}{\sigma_1^{2}+\sigma_2^{2}-2\rho\,\sigma_1\sigma_2},\qquad
    \hat{k}_2 = 1-\hat{k}_1 = \frac{\sigma_1^{2}-\rho\,\sigma_1\sigma_2}{\sigma_1^{2}+\sigma_2^{2}-2\rho\,\sigma_1\sigma_2}

    \min E\{\tilde{x}^{2}\} = \frac{(1-\rho^{2})\,\sigma_1^{2}\sigma_2^{2}}{\sigma_1^{2}+\sigma_2^{2}-2\rho\,\sigma_1\sigma_2} \le \min\left(\sigma_1^{2},\sigma_2^{2}\right)\qquad\text{(Reduction of Covariance Error)}
Multi-sensor Estimate (continue – 2)

Estimator:

x̂ = [ (σ_2² − ρ σ_1 σ_2) z_1 + (σ_1² − ρ σ_1 σ_2) z_2 ] / (σ_1² + σ_2² − 2 ρ σ_1 σ_2)

min E{x̃²} = σ_1² σ_2² (1 − ρ²) / (σ_1² + σ_2² − 2 ρ σ_1 σ_2) ≤ min(σ_1², σ_2²)

1. Uncorrelated Measurement Noises (ρ = 0)

x̂ = (σ_2² z_1 + σ_1² z_2) / (σ_1² + σ_2²) = (σ_1^{−2} z_1 + σ_2^{−2} z_2) / (σ_1^{−2} + σ_2^{−2})
min E{x̃²} = σ_1² σ_2² / (σ_1² + σ_2²)

2. Fully Correlated Measurement Noises (ρ = ±1)

x̂ = (σ_1^{−1} z_1 ∓ σ_2^{−1} z_2) / (σ_1^{−1} ∓ σ_2^{−1}),   min E{x̃²} = 0

3. Perfect Sensor (σ_1 = 0)

x̂ = z_1,   min E{x̃²} = 0 – the estimator will use the perfect sensor, as expected.
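The two-sensor fusion formulas above can be checked numerically. A minimal sketch; the values of σ_1, σ_2 and ρ below are arbitrary examples, not taken from the slides:

```python
import numpy as np

# Optimal fusion of two correlated measurements of the same scalar x.
s1, s2, rho = 2.0, 3.0, 0.5          # sigma_1, sigma_2, correlation (example values)
den = s1**2 + s2**2 - 2*rho*s1*s2    # common denominator
k1 = (s2**2 - rho*s1*s2) / den       # optimal weight on z1
k2 = (s1**2 - rho*s1*s2) / den       # optimal weight on z2 (k1 + k2 = 1)
var_min = s1**2 * s2**2 * (1 - rho**2) / den   # minimal error variance

# Monte Carlo check of the variance formula
rng = np.random.default_rng(0)
n = 200_000
cov = np.array([[s1**2, rho*s1*s2], [rho*s1*s2, s2**2]])
v = rng.multivariate_normal([0.0, 0.0], cov, size=n)
x_true = 1.7
z1, z2 = x_true + v[:, 0], x_true + v[:, 1]
x_hat = k1*z1 + k2*z2
emp_var = np.var(x_hat - x_true)     # should approach var_min
```

Because ρ > 0 here, the fused variance 27/7 ≈ 3.86 is only slightly below the better sensor's 4.0; with ρ = 0 the improvement would be larger.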
Multi-sensor Estimate (continue – 3)

Consider a system comprised of n sensors, each making a single measurement, z_i (i = 1,…,n), of a constant but unknown quantity, x, in the presence of random, dependent, unbiased measurement errors, v_i (i = 1,…,n). We want to design an optimal estimator that combines the n measurements.

z_i = x + v_i,   E{v_i} = 0,   i = 1, 2, …, n

or, in vector form:

Z = [z_1; z_2; …; z_n] = U x + V,   U := [1; 1; …; 1],   V := [v_1; v_2; …; v_n]

E{V} = 0,   E{V V^T} = R = [ σ_1²           ρ_12 σ_1 σ_2   …   ρ_1n σ_1 σ_n
                              ρ_12 σ_1 σ_2   σ_2²           …   ρ_2n σ_2 σ_n
                              …
                              ρ_1n σ_1 σ_n   ρ_2n σ_2 σ_n   …   σ_n² ]

Estimator:

x̂ = k_1 z_1 + k_2 z_2 + … + k_n z_n = [k_1, k_2, …, k_n] Z = K^T Z
Multi-sensor Estimate (continue – 4)

Estimator:  x̂ = K^T Z

1. The Estimator is Unbiased:

E{x̃} = E{x̂ − x} = E{K^T U x + K^T V − x} = (K^T U − 1) x + K^T E{V} = (K^T U − 1) x = 0

⇒  K^T U = 1

2. Minimize the Mean Square Estimation Error:

min_{K^T U = 1} E{x̃²} = min_{K^T U = 1} E{K^T V V^T K} = min_{K^T U = 1} K^T R K

Use a Lagrange multiplier λ (to be determined) to include the constraint K^T U − 1 = 0:

J(K) = K^T R K − λ (K^T U − 1)

∂J/∂K = 2 R K − λ U = 0  ⇒  K = (λ/2) R^{−1} U

K^T U = (λ/2) U^T R^{−1} U = 1  ⇒  λ/2 = (U^T R^{−1} U)^{−1}

K = R^{−1} U (U^T R^{−1} U)^{−1}

min_{K^T U = 1} E{x̃²} = K^T R K = (U^T R^{−1} U)^{−1}
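The n-sensor solution K = R⁻¹U (UᵀR⁻¹U)⁻¹ can be verified with a small numeric sketch; the covariance matrix R below is an arbitrary positive-definite example, not taken from the slides:

```python
import numpy as np

# BLUE fusion of n correlated scalar measurements Z = U x + V.
R = np.array([[4.0, 1.0, 0.5],
              [1.0, 9.0, 2.0],
              [0.5, 2.0, 16.0]])           # E{V V^T}, example values
U = np.ones((3, 1))
Rinv = np.linalg.inv(R)
var_min = 1.0 / float(U.T @ Rinv @ U)      # min E{x_tilde^2} = (U^T R^-1 U)^-1
K = (Rinv @ U) * var_min                   # K = R^-1 U (U^T R^-1 U)^-1
```

Any other unbiased weighting (for example the plain average U/n) yields an error variance KᵀRK at least as large as var_min.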
SOLO RADAR Range-Doppler

Target Acceleration Models

The equations of motion of a point-mass object are described by:

d/dt [R; V] = [ 0_{3×3}  I_{3×3}; 0_{3×3}  0_{3×3} ] [R; V] + [ 0_{3×3}; I_{3×3} ] A

R – Range vector
V – Velocity vector
A – Acceleration vector

or, augmenting the acceleration as a state:

d/dt [R; V; A] = [ 0_{3×3}  I_{3×3}  0_{3×3}; 0_{3×3}  0_{3×3}  I_{3×3}; 0_{3×3}  0_{3×3}  0_{3×3} ] [R; V; A]

Since the target acceleration vector A is not measurable, we assume that it is a random process defined by one of the following models:

1. White Noise Acceleration Model
2. Wiener Process Acceleration Model
3. Piecewise (between samples) Constant White Noise Acceleration Model
4. Piecewise (between samples) Constant Wiener Process Acceleration Model
5. Singer Acceleration Model
Target Acceleration Models (continue – 1)

1. White Noise Acceleration Model – Second-Order Model

d/dt [R; V] = [ 0  I; 0  0 ] [R; V] + [ 0; I ] w(t),   E{w(t)} = 0,   E{w(t) w^T(τ)} = q δ(t − τ)

Discrete System:  x(k+1) = Φ(k) x(k) + Γ(k) w(k)

Φ(T) := exp(A T) = Σ_{i=0}^∞ (1/i!) A^i T^i = I_{6×6} + A T = [ I_{3×3}  T I_{3×3}; 0_{3×3}  I_{3×3} ]

since  A² = [ 0 I; 0 0 ][ 0 I; 0 0 ] = 0  ⇒  A^n = 0 for all n ≥ 2

E{Γ(k) w(k) w^T(k) Γ^T(k)} = q ∫_0^T Φ(T − τ) B B^T Φ^T(T − τ) dτ,   B = [ 0; I ]
Target Acceleration Models (continue – 2)

1. White Noise Acceleration Model (continue – 1)

Q(k) = E{Γ(k) w(k) w^T(k) Γ^T(k)} = q ∫_0^T Φ(T − τ) B B^T Φ^T(T − τ) dτ
     = q ∫_0^T [ (T − τ) I; I ] [ (T − τ) I   I ] dτ
     = q ∫_0^T [ (T − τ)² I   (T − τ) I; (T − τ) I   I ] dτ

Q(k) = q [ T³/3 I_{3×3}   T²/2 I_{3×3}
           T²/2 I_{3×3}   T I_{3×3} ]

Guideline for the Choice of Process Noise Intensity

The changes in velocity over a sampling period T are of the order of √Q_22 = √(q T). For the nearly constant velocity assumed by this model, q must be chosen so that these velocity changes are small compared to the actual velocity V.
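The closed-form Q above can be checked per axis (the I₃ₓ₃ blocks reduce to scalars), comparing it with a midpoint-rule evaluation of the convolution integral; a minimal sketch with example values for q and T:

```python
import numpy as np

q, T = 0.3, 0.5
A = np.array([[0.0, 1.0], [0.0, 0.0]])
Phi = np.eye(2) + A * T                      # exp(A T), exact since A^2 = 0
Q_closed = q * np.array([[T**3/3, T**2/2], [T**2/2, T]])

# Q = q * int_0^T Phi(T - s) B B^T Phi(T - s)^T ds, midpoint rule
B = np.array([[0.0], [1.0]])
N = 10_000
Q_num = np.zeros((2, 2))
for i in range(N):
    s = (i + 0.5) * T / N
    Phis = np.eye(2) + A * (T - s)
    Q_num += Phis @ B @ B.T @ Phis.T
Q_num *= q * T / N
```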
Target Acceleration Models (continue – 3)

2. Wiener Process Acceleration Model – Third-Order Model

d/dt [R; V; A] = [ 0  I  0; 0  0  I; 0  0  0 ] [R; V; A] + [ 0; 0; I ] w(t)
E{w(t)} = 0,   E{w(t) w^T(τ)} = q I_{3×3} δ(t − τ)

Since the derivative of acceleration is the jerk, this model is also called the White Noise Jerk Model.

Discrete System:  x(k+1) = Φ(k) x(k) + Γ(k) w(k)

Φ(T) := exp(A_x T) = I_{9×9} + A_x T + (1/2) A_x² T² = [ I  T I  T²/2 I; 0  I  T I; 0  0  I ]

since A_x² has a single I_{3×3} block in its upper-right corner and A_x³ = 0  ⇒  A_x^n = 0 for all n ≥ 3

E{Γ(k) w(k) w^T(k) Γ^T(k)} = q ∫_0^T Φ(T − τ) B B^T Φ^T(T − τ) dτ,   B = [ 0; 0; I ]
Target Acceleration Models (continue – 4)

2. Wiener Process Acceleration Model (continue – 1)

Q(k) = q ∫_0^T Φ(T − τ) B B^T Φ^T(T − τ) dτ
     = q ∫_0^T [ (T − τ)²/2 I; (T − τ) I; I ] [ (T − τ)²/2 I   (T − τ) I   I ] dτ

Q(k) = q [ T⁵/20 I   T⁴/8 I   T³/6 I
           T⁴/8 I    T³/3 I   T²/2 I
           T³/6 I    T²/2 I   T I ]

Guideline for the Choice of Process Noise Intensity

The changes in acceleration over a sampling period T are of the order of √Q_33 = √(q T). For the nearly constant acceleration assumed by this model, q must be chosen so that these changes are small compared to the actual acceleration A.
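The T⁵/20 structure of the white-noise-jerk covariance can likewise be verified per axis; a sketch using trapezoidal integration of g(s) g(s)ᵀ with g(s) = [s²/2, s, 1]ᵀ (the substitution s = T − τ in the convolution integral), with example values for q and T:

```python
import numpy as np

q, T = 1.0, 0.4
Q_closed = q * np.array([[T**5/20, T**4/8, T**3/6],
                         [T**4/8,  T**3/3, T**2/2],
                         [T**3/6,  T**2/2, T     ]])

# Q = q * int_0^T g(s) g(s)^T ds with g(s) = [s^2/2, s, 1]^T, trapezoidal rule
s = np.linspace(0.0, T, 20_001)
g = np.vstack([s**2/2, s, np.ones_like(s)])          # shape (3, N)
integrand = np.einsum('in,jn->ijn', g, g)            # shape (3, 3, N)
h = s[1] - s[0]
Q_num = q * (integrand[..., :-1] + integrand[..., 1:]).sum(axis=2) * h / 2
```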
Target Acceleration Models (continue – 5)

3. Piecewise (between samples) Constant White Noise Acceleration Model – Second-Order Model

d/dt [R; V] = [ 0  I; 0  0 ] [R; V] + [ 0; I ] w(t),   E{w(t)} = 0

Here the acceleration noise is held constant over each sampling interval.

Discrete System:  x(k+1) = Φ(k) x(k) + Γ(k) w(k),   E{w(k) w^T(l)} = q δ_kl

Φ(T) := exp(A T) = I_{6×6} + A T = [ I  T I; 0  I ]     (A^n = 0 for n ≥ 2, as before)

Γ(k) := ∫_0^T Φ(T − τ) B dτ = ∫_0^T [ (T − τ) I; I ] dτ = [ T²/2 I; T I ]
Target Acceleration Models (continue – 6)

3. Piecewise (between samples) Constant White Noise Acceleration Model (continue – 1)

E{Γ(k) w(k) w^T(l) Γ^T(l)} = Γ q Γ^T δ_kl = q [ T²/2 I; T I ] [ T²/2 I   T I ] δ_kl
  = q [ T⁴/4 I   T³/2 I; T³/2 I   T² I ] δ_kl

Guideline for the Choice of Process Noise Intensity

For this model √q should be of the order of the maximum acceleration magnitude a_M. A practical range is 0.5 a_M ≤ √q ≤ a_M.
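The piecewise-constant model's Q is simply the rank-one outer product Γ q Γᵀ; a minimal per-axis sketch with example values:

```python
import numpy as np

# Piecewise-constant white-noise acceleration: Gamma = [T^2/2, T]^T and
# Q = q * Gamma Gamma^T (rank-1), per axis.
q, T = 4.0, 0.5
Gamma = np.array([[T**2/2], [T]])
Q = q * (Gamma @ Gamma.T)
Q_expected = q * np.array([[T**4/4, T**3/2], [T**3/2, T**2]])
```

Unlike the continuous white-noise model (whose Q has T³/3 in the position block and is full rank), this Q is singular, reflecting that a single scalar noise sample drives both position and velocity over the interval.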
Target Acceleration Models (continue – 7)

4. Piecewise (between samples) Constant Wiener Process Acceleration Model

d/dt [R; V; A] = [ 0  I  0; 0  0  I; 0  0  0 ] [R; V; A] + [ 0; 0; I ] w(t),   E{w(t)} = 0

Discrete System:  x(k+1) = Φ(k) x(k) + Γ(k) w(k),   E{w(k) w^T(l)} = q δ_kl

Φ(T) := exp(A_x T) = I_{9×9} + A_x T + (1/2) A_x² T² = [ I  T I  T²/2 I; 0  I  T I; 0  0  I ]     (A_x^n = 0 for n ≥ 3)

with w(k) the (white) acceleration increment held over the sampling interval, entering through:

Γ(k) = [ T²/2 I; T I; I ]
Target Acceleration Models (continue – 8)

4. Piecewise (between samples) Constant Wiener Process Acceleration Model (continue – 1)

E{Γ(k) w(k) w^T(l) Γ^T(l)} = Γ q Γ^T δ_kl = q [ T²/2 I; T I; I ] [ T²/2 I   T I   I ] δ_kl
  = q [ T⁴/4 I   T³/2 I   T²/2 I
        T³/2 I   T² I     T I
        T²/2 I   T I      I ] δ_kl

Guideline for the Choice of Process Noise Intensity

For this model √q should be of the order of the maximum acceleration increment over a sampling period, Δa_M. A practical range is 0.5 Δa_M ≤ √q ≤ Δa_M.
Target Acceleration Models (continue – 9)

Singer Target Model

R. A. Singer, “Estimating Optimal Tracking Filter Performance for Manned Maneuvering Targets”, IEEE Trans. Aerospace & Electronic Systems, Vol. AES-6, July 1970, pp. 473–483

The target acceleration is modeled as a zero-mean random process with exponential autocorrelation:

R_T(τ) = E{a_T(t) a_T(t + τ)} = σ_m² e^{−|τ|/τ_T}

where σ_m² is the variance of the target acceleration and τ_T is the time constant of its autocorrelation (“decorrelation time”).

The target acceleration is assumed to:
1. Equal the maximum acceleration value a_max with probability p_M, and −a_max with the same probability.
2. Equal zero with probability p_0.
3. Be uniformly distributed on [−a_max, a_max] with the remaining probability 1 − 2 p_M − p_0 > 0.

p(a) = [δ(a − a_max) + δ(a + a_max)] p_M + δ(a) p_0 + [u(a + a_max) − u(a − a_max)] (1 − 2 p_M − p_0) / (2 a_max)
Target Acceleration Models (continue – 10)

Singer Target Model (continue – 1)

Using  ∫_{−a_max}^{a_max} f(a) δ(a − a_0) da = f(a_0) for −a_max ≤ a_0 ≤ a_max:

E{a} = ∫ a p(a) da = (a_max − a_max) p_M + 0 · p_0 + [(1 − 2 p_M − p_0)/(2 a_max)] ∫_{−a_max}^{a_max} a da = 0

E{a²} = ∫ a² p(a) da = 2 a_max² p_M + 0 · p_0 + [(1 − 2 p_M − p_0)/(2 a_max)] ∫_{−a_max}^{a_max} a² da
      = 2 a_max² p_M + (1 − 2 p_M − p_0) a_max²/3
      = (a_max²/3)(1 + 4 p_M − p_0)

σ_m² = E{a²} − (E{a})² = (a_max²/3)(1 + 4 p_M − p_0)
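The Singer variance formula σ_m² = (a_max²/3)(1 + 4p_M − p_0) can be checked by Monte Carlo sampling of the mixture density; the values of a_max, p_M, p_0 below are arbitrary examples:

```python
import numpy as np

# Singer acceleration prior: point masses +/-a_max (prob p_M each), 0 (prob p_0),
# uniform on [-a_max, a_max] with the remaining probability.
a_max, p_M, p_0 = 30.0, 0.1, 0.3
sigma_m2 = a_max**2 / 3 * (1 + 4*p_M - p_0)

rng = np.random.default_rng(1)
n = 400_000
u = rng.random(n)
a = np.where(u < p_M, a_max,
    np.where(u < 2*p_M, -a_max,
    np.where(u < 2*p_M + p_0, 0.0,
             rng.uniform(-a_max, a_max, size=n))))
```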
Target Acceleration Models (continue – 11)

Target Acceleration Approximation by a Markov Process

Given a continuous linear system:  d x(t)/dt = F(t) x(t) + G(t) w(t)

Start with the first-order linear system describing the target acceleration:

d a_T(t)/dt = −(1/τ_T) a_T(t) + w(t),   E{w(t)} = 0,   E{w(t) w(τ)} = q δ(t − τ)

φ(t, t_0) = e^{−(t − t_0)/τ_T},   where  d φ(t, t_0)/dt = −(1/τ_T) φ(t, t_0)

Define the autocorrelation and variance of a_T:

R_{aa}(t, t + τ) = E{[a_T(t) − E{a_T(t)}] [a_T(t + τ) − E{a_T(t + τ)}]}
V_{aa}(t) = R_{aa}(t, t) = σ_{a_T}²

The covariance propagation equation  d V_x/dt = F V_x + V_x F^T + G Q G^T  gives:

d V_{aa}(t)/dt = −(2/τ_T) V_{aa}(t) + q
Target Acceleration Models (continue – 12)

Target Acceleration Approximation by a Markov Process (continue – 1)

d V_{aa}(t)/dt = −(2/τ_T) V_{aa}(t) + q

V_{aa}(t) = V_{aa}(0) e^{−2t/τ_T} + (q τ_T/2)(1 − e^{−2t/τ_T})

R_{aa}(t, t + τ) = Φ(t + τ, t) V_{aa}(t) = e^{−τ/τ_T} V_{aa}(t),          τ > 0
R_{aa}(t, t + τ) = V_{aa}(t + τ) Φ(t, t + τ) = e^{τ/τ_T} V_{aa}(t + τ),   τ < 0

For t > 5 τ_T (steady state):

V_{aa}(t) ≈ V_{aa}(t + τ) ≈ V_{aa}^{steady state} = q τ_T/2 = σ_{a_T}²

R_{aa}(t, t + τ) ≈ R_{aa}(τ) ≈ (q τ_T/2) e^{−|τ|/τ_T}
Target Acceleration Models (continue – 13)

Target Acceleration Approximation by a Markov Process (continue – 2)

Area = ∫_{−∞}^{+∞} V_{aa}(τ) dτ = 2 ∫_0^{+∞} (q τ_T/2) e^{−τ/τ_T} dτ = q τ_T²

τ_T is the correlation time of the noise w(t): at τ = τ_T the autocorrelation V_{aa}(τ) has decayed to σ_a²/e.

Another way to find τ_T is by taking the double-sided Laplace transform L_2 on τ of:

Φ_ww(s) = L_2{q δ(τ)} = q ∫_{−∞}^{+∞} δ(τ) e^{−sτ} dτ = q

Φ_{aa}(s) = L_2{(q τ_T/2) e^{−|τ|/τ_T}} = q / (1/τ_T² − s²) = H(s) q H(−s),   H(s) = 1/(s + 1/τ_T)

τ_T defines ω_{1/2}, the frequency at which the power spectrum drops to half its peak value, through τ_T = 1/ω_{1/2}.

For t > 5 τ_T:  R_{aa}(τ) ≈ (q τ_T/2) e^{−|τ|/τ_T} = σ_a² e^{−|τ|/τ_T}  ⇒  q = 2 σ_a²/τ_T
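The first-order Gauss-Markov acceleration above has an exact discretization with step T: a_{k+1} = e^{−T/τ} a_k + w_k, with var(w_k) = σ_a²(1 − e^{−2T/τ}), which preserves the stationary variance σ_a² = qτ/2 and the one-step correlation e^{−T/τ}. A simulation sketch with example values:

```python
import numpy as np

tau, sigma_a, T = 5.0, 10.0, 0.1
phi = np.exp(-T/tau)
q = 2 * sigma_a**2 / tau                   # continuous-time noise intensity
w_var = sigma_a**2 * (1 - phi**2)          # discrete driving-noise variance

rng = np.random.default_rng(2)
n = 200_000
a = np.zeros(n)
w = rng.normal(0.0, np.sqrt(w_var), size=n)
for k in range(1, n):
    a[k] = phi * a[k-1] + w[k]
emp_var = a[2000:].var()                   # discard the start-up transient
lag1 = np.corrcoef(a[2000:-1], a[2001:])[0, 1]
```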
Target Acceleration Models (continue – 14)

Constant Speed Turning Model

Denote by V = V 1_V and ω = ω 1_ω the (constant-magnitude) velocity and turning-rate vectors, with V = dP/dt.

A := dV/dt = (dV/dt) 1_V + V (d1_V/dt) = V (ω × 1_V) = ω × V          (dV/dt = 0)

dA/dt = ω × (dV/dt) = ω × (ω × V) = (ω · V) ω − ω² V = −ω² V          (ω · V = 0, dω/dt = 0)

Define:  ω := (V(0) × A(0)) / V²

Denote by P the position vector of the vehicle relative to an inertial system. Therefore:

d/dt [P; V; A] = [ 0  I  0; 0  0  I; 0  −ω² I  0 ] [P; V; A] =: Λ [P; V; A]     (Continuous-Time Constant Speed Target Model)

We want to find Φ(T) such that  dΦ(T)/dT = Λ Φ(T).
Target Acceleration Models (continue – 15)

Constant Speed Turning Model (continue – 1)

We will find Φ(T) by direct computation of a rotation. Rotate the vector P_T = OA about the unit vector n̂ by the angle θ = ω T, to obtain the new vector P = OB.

From the drawing:  P = OB = OA + AC + CB,   OA = P_T

AC = [n̂ × (n̂ × P_T)] (1 − cos θ), since n̂ × (n̂ × P_T) points from A toward the rotation axis and the length of AC is |P_T| sin φ (1 − cos θ)

CB = (n̂ × P_T) sin θ, since CB has the direction of n̂ × P_T and absolute value |P_T| sin φ sin θ

P = P_T + [n̂ × (n̂ × P_T)] (1 − cos θ) + (n̂ × P_T) sin θ
  = P_T + [n̂ × (n̂ × P_T)] (1 − cos ω T) + (n̂ × P_T) sin ω T
Target Acceleration Models (continue – 16)

Constant Speed Turning Model (continue – 2)

P(T) = P_T + (n̂ × P_T) sin ω T + [n̂ × (n̂ × P_T)] (1 − cos ω T)

V(T) = dP/dT = ω (n̂ × P_T) cos ω T + ω [n̂ × (n̂ × P_T)] sin ω T,   V = V(0) = ω (n̂ × P_T)

A(T) = dV/dT = −ω² (n̂ × P_T) sin ω T + ω² [n̂ × (n̂ × P_T)] cos ω T,   A = A(0) = ω² [n̂ × (n̂ × P_T)]

Therefore:

P(T) = P + (sin ω T / ω) V + [(1 − cos ω T)/ω²] A
V(T) = cos ω T · V + (sin ω T / ω) A
A(T) = −ω sin ω T · V + cos ω T · A
Target Acceleration Models (continue – 17)

Constant Speed Turning Model (continue – 3)

P(T) = P + (sin ω T / ω) V + [(1 − cos ω T)/ω²] A
V(T) = cos ω T · V + (sin ω T / ω) A
A(T) = −ω sin ω T · V + cos ω T · A

[P(T); V(T); A(T)] = Φ(T) [P; V; A]

Φ(T) = [ I   (sin ω T / ω) I      ((1 − cos ω T)/ω²) I
         0   cos ω T · I          (sin ω T / ω) I
         0   −ω sin ω T · I       cos ω T · I ]     (Discrete-Time Constant Speed Target Model)
SOLO
Constant Speed Tourning Model (continuous – 4)
RADAR Range-Doppler
Target Acceleration Models (continue – 18)
( )( ) ( )[ ]
( ) ( )( ) ( )
−
−=Φ −
−−
TT
TT
TTI
T
ωωωωωω
ωωωω
cossin0
sincos0
cos1sin1
21
( )( ) ( )[ ]
( ) ( )( ) ( )
−−−
=Φ −
−−
−
TT
TT
TTI
T
ωωωωωω
ωωωω
cossin0
sincos0
cos1sin1
21
1( )( ) ( )
( ) ( )( ) ( )
−−−=Φ
−
TT
TT
TT
T
ωωωωωωω
ωωω
sincos0
cossin0
sincos0
2
1
We want to find Λ (t) such that
( ) ( ) ( )TTT ΦΛ=Φ therefore ( ) ( ) ( )TTT 1−ΦΦ=Λ
( ) ( ) ( )( ) ( )
( ) ( )( ) ( )
( ) ( )[ ]( ) ( )( ) ( )
−−−
−−−=ΦΦ=Λ −
−−−
−
TT
TT
TTI
TT
TT
TT
TTT
ωωωωωω
ωωωω
ωωωωωωω
ωωω
cossin0
sincos0
cos1sin
sincos0
cossin0
sincos01
21
2
1
1
−=
00
100
010
2ωWe recovered the transfer matrix for the continuouscase.
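The closed-form Φ(T) can be sanity-checked per axis (each I block reduces to a scalar): it must satisfy the semigroup property Φ(T)Φ(T) = Φ(2T) and have derivative Λ at T = 0. A minimal sketch with an example turn rate:

```python
import numpy as np

# Constant-speed turn, per-axis scalar form: state [p, v, a] with a_dot = -w^2 v.
def ct_phi(w, T):
    s, c = np.sin(w*T), np.cos(w*T)
    return np.array([[1.0, s/w,  (1.0 - c)/w**2],
                     [0.0, c,    s/w],
                     [0.0, -w*s, c]])

w, T = 0.2, 0.7
Lam = np.array([[0.0, 1.0,    0.0],
                [0.0, 0.0,    1.0],
                [0.0, -w**2,  0.0]])
P1 = ct_phi(w, T) @ ct_phi(w, T)               # semigroup: Phi(T)^2
P2 = ct_phi(w, 2*T)                            # should equal Phi(2T)
dPhi = (ct_phi(w, 1e-6) - np.eye(3)) / 1e-6    # numerical d(Phi)/dT at T = 0
```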
Target Acceleration Models (continue – 19)

Fixed Wing Air Vehicle Acceleration Model

Force Equations:  m A = F_A + T + m g,   g = g 1_{zL}

F_A = −D(α) 1_{xW} − L(α) 1_{zW}  – Drag and Lift aerodynamic forces, as functions of the angle of attack α
T = T 1_{xB}  – Thrust force
V = V 1_{xW}  – Air vehicle velocity vector

For a small angle of attack α the wind (W) and body (B) coordinates coincide, so we use only wind (W) and Local-Level Local-North (L) coordinates, related by C_L^W, the transformation matrix from (L) to (W).

A ≈ [(T − D)/m] 1_{xW} − (L/m) 1_{zW} + g 1_{zL}     (Force Equation)

By measuring the air vehicle trajectory we can estimate its position, velocity and acceleration vectors (P, V, A), the C_L^W matrix, and (T − D)/m and L/m.
Target Acceleration Models (continue – 20)

Fixed Wing Air Vehicle Acceleration Model (continue – 1)

dV/dt = d(V 1_{xW})/dt = V̇ 1_{xW} + V (ω_W × 1_{xW}) = V̇ 1_{xW} + V r_W 1_{yW} − V q_W 1_{zW}

Therefore the air vehicle acceleration in its wind (W) coordinates is given by:

A^W = [A_{xW}; A_{yW}; A_{zW}] = [V̇; r_W V; −q_W V] = [ f + C_L^W(1,3) g; C_L^W(2,3) g; −l + C_L^W(3,3) g ]

where  f := (T − D)/m = A_{xW} − C_L^W(1,3) g,   l := L/m = −A_{zW} + C_L^W(3,3) g

q_W = [l − C_L^W(3,3) g]/V,   r_W = C_L^W(2,3) g/V

Differentiating the aerodynamic/thrust part A = f 1_{xW} − l 1_{zW} in inertial axes:

Ȧ := dA/dt = ḟ 1_{xW} − l̇ 1_{zW} + f (r_W 1_{yW} − q_W 1_{zW}) − l (q_W 1_{xW} − p_W 1_{yW})

Ȧ^W = [ ḟ − l q_W;  f r_W + l p_W;  −l̇ − f q_W ]
SOLO
Fixed Wing Air Vehicle Acceleration Model (continue – 2)
RADAR Range-Doppler
Target Acceleration Models (continue – 21)
( )[ ]( ) VgCr
VgClqW
LW
WLW
/3,2
/3,3
=
−=We found:
( )mLl
mDTf
/:
/:
=−=
( )
−−
+−
=
W
WW
W
W
qfl
rfpl
qlf
A
, pW are pilot controlled and are modeled as zero mean random variables
lf ,
( ) ( )[ ]
( )[ ] ( )( )[ ] ( )[ ]
−−−
−
−−
=
−
−=
VgClgCV
VgCgCV
VgCll
qf
rf
ql
AEW
LW
L
WL
WL
WL
W
W
W
W
/3,31,3
/2,31,3
/3,3
( ) ( )( )[ ]
( )[ ] ( )( )[ ] ( )[ ]
−−−
−
−−
=
gClgCV
gCgCV
gCll
CV
AEW
LW
L
WL
WL
WL
TWL
L
3,31,3
2,31,3
3,31
( ) ( ) ( )
−
=−
l
pl
f
CAEA W
TWL
LL
( ) ( ) ( ) ( ) ( ) WL
l
p
f
TWL
TLLLL ClCAEAAEAE
W
=
−
−
2
22
2
00
00
00
σ
σ
σ
156
SOLO RADAR Range-DopplerTarget Acceleration Models (continue – 22)
( )tA
IA
V
R
I
I
A
V
R
td
d
B
x
x
x
A
xxx
xxx
xxx
x
+
=
33
33
33
333333
333333
333333
0
0
000
00
00
Discrete System
( ) ( ) ( ) ( ) ( )kAkkxkkx Γ+Φ=+1
( ) [ ]
=++===Φ ∑∫
∞
=333333
333333
2333333
2299
00 00
0
2/
2
1
!
1exp:
xxx
xxx
xxx
xi
iiT
I
TII
TITII
TATAITAi
dAT ττ
2
000
000
000
000
000
00
000
00
00
333333
333333
333333
333333
333333
333333
2
333333
333333
333333
≥∀
=→→
=→
= nA
I
AI
I
A
xxx
xxx
xxx
n
xxx
xxx
xxx
xxx
xxx
xxx
( ) ( ) ( ) ( )( )
( ) ( )( ) ( ) ( )kA
TI
TI
TI
kAd
II
TII
TITII
dkTABTkAk
x
x
xT
x
x
x
xxx
xxx
xxxT
kA
=
−−−
=+−Φ=Γ ∫∫33
33
333
033
33
33
333333
333333
2333333
0
2/
6/
0
0
00
0
2/
: ττττ
τττ
Fixed Wing Air Vehicle Acceleration Model (continue – 3)
Target Acceleration Models (continue – 23)

Fixed Wing Air Vehicle Acceleration Model (continue – 4)

Discrete System:

[R; V; A]^L_{k+1} = [ I  T I  T²/2 I; 0  I  T I; 0  0  I ] [R; V; A]^L_k + [ T³/6 I; T²/2 I; T I ] Ȧ^L(k)

with the statistics of Ȧ^L(k) as on the previous slide:

Ȧ^L − E{Ȧ^L} = (C_L^W)^T [ ḟ;  l p_W;  −l̇ ]

E{(Ȧ^L − E{Ȧ^L}) (Ȧ^L − E{Ȧ^L})^T} = (C_L^W)^T [ σ_ḟ²  0  0; 0  σ_{lp}²  0; 0  0  σ_l̇² ] C_L^W
Target Acceleration Models (continue – 24)

Fixed Wing Air Vehicle Acceleration Model (continue – 5)

We need to define the matrix C_L^W. Note that 1_{xW} is along V and 1_{zW} is along the lift direction:

(1_{xW})^L = (C_L^W)^T [1; 0; 0] = [C_L^W(1,1); C_L^W(1,2); C_L^W(1,3)]     (first row of C_L^W)
(1_{zW})^L = (C_L^W)^T [0; 0; 1] = [C_L^W(3,1); C_L^W(3,2); C_L^W(3,3)]     (third row of C_L^W)

Therefore  (C_L^W)^T = [ V^L/V   (V^L × L^L)/|V^L × L^L|   L^L/L ],  with L^L the lift-direction vector.

[C_L^W(1,1), C_L^W(1,2), C_L^W(1,3)] = [V_x, V_y, V_z]/V,   V = (V_x² + V_y² + V_z²)^{1/2}

V̇ = C_L^W(1,1) A_x + C_L^W(1,2) A_y + C_L^W(1,3) A_z = (V_x A_x + V_y A_y + V_z A_z)/V

From the force equation,  l 1_{zW} = f 1_{xW} + g 1_{zL} − A:

f = [C_L^W(1,1) A_x + C_L^W(1,2) A_y + C_L^W(1,3) A_z] − C_L^W(1,3) g = V̇ − C_L^W(1,3) g

l (1_{zW})^L = f [C_L^W(1,1); C_L^W(1,2); C_L^W(1,3)] + g [0; 0; 1] − [A_x; A_y; A_z]
Target Acceleration Models (continue – 25)

Fixed Wing Air Vehicle Acceleration Model (continue – 6)

Computation of C_L^W, f, l, q_W, r_W from the vectors V^L, A^L:

1  [C_L^W(1,1), C_L^W(1,2), C_L^W(1,3)] = [V_x, V_y, V_z]/V,   V = (V_x² + V_y² + V_z²)^{1/2}

2  V̇ = C_L^W(1,1) A_x + C_L^W(1,2) A_y + C_L^W(1,3) A_z = (V_x A_x + V_y A_y + V_z A_z)/V

3  [C_L^W(3,1); C_L^W(3,2); C_L^W(3,3)] = (1_{zW})^L
   = (1/Abs) [ V̇ V_x/V − g V_z V_x/V² − A_x
               V̇ V_y/V − g V_z V_y/V² − A_y
               V̇ V_z/V − g V_z²/V² + g − A_z ]
   with Abs := the norm of the bracketed vector

4  (C_L^W)^T = [ V^L/V   (V^L × L^L)/|V^L × L^L|   L^L/L ],  L^L the lift direction from step 3

5  f = V̇ − C_L^W(1,3) g
   l = −[C_L^W(3,1) A_x + C_L^W(3,2) A_y + C_L^W(3,3) A_z] + C_L^W(3,3) g
   q_W = [l − C_L^W(3,3) g]/V,   r_W = C_L^W(2,3) g/V
Target Acceleration Models (continue – 26)

Ballistic Missile Acceleration Model

Force Equations:  m A = F_A + T + m g

F_A = −D(α) 1_{xW} − L(α) 1_{zW} = −(ρ(Z) V²/2) S_ref [C_D(α) 1_{xW} + C_L(α) 1_{zW}]
  – Drag and Lift aerodynamic forces, as functions of the angle of attack α and the air density ρ(Z)

T = T 1_{xB}  – Thrust force
V = V 1_{xW}  – Air vehicle velocity vector
g = g 1_{zL} = (μ/R_T²) 1_{zL}  – earth gravitation

For a small angle of attack α the wind (W) and body (B) coordinates coincide, so we use only wind (W) and Local-Level Local-North (L) coordinates, related by C_L^W, the transformation matrix from (L) to (W).

A ≈ [(T − D)/m] 1_{xW} − (L/m) 1_{zW} + g 1_{zL}     (Force Equation)
SOLO RADAR Range-DopplerTarget Acceleration Models (continue – 27)
Ballistic Missile Acceleration Model (continue – 1)
MV
Bx
By
BzWz
Wy
Wx
αβ
αβ
Bp
Wp
Bq
WqBrWr
( )( )
2
0
0
0
0
0
1
0
0
sin
cos1
0
0
0
T
WL
W
W
zW
yW
xWW
L
W
RC
L
L
DT
mVq
Vr
V
A
A
A
td
VdA
µ
ϕϕ
+
−−
−=
−=
=
=
Therefore the Air vehicle Acceleration in it’s Wind0 (W0 – for which φ =0 ) Coordinates is given by:
( ) WWWWWWWWWWWWW
L
WW
L
zVqyVrxVxzryqxpVxVtd
xdVxV
td
Vd11111111
11 −+=×+++=+=
Define:
m
Tt =: ( )
m
CSdd
VZ
m
D DrefCC == :&
2:
2ρ
( ) ( ) ( ) ( )tzztm
CSzz
VZt
m
LCC
LrefCC ωωωρω sin:&cos:&
2:cos
2
−===
We assume that the ballistic missile performs a barrel-roll motion with constant rotation rate ω. Therefore at each instant the aerodynamic lift force will be at an
angle φ = ω t.
Assuming constant CL/m: (barrel-roll model)02 =+ CC zz ωAssuming constant ω (barrel-roll model)0=ω
162
SOLO RADAR Range-DopplerTarget Acceleration Models (continue – 28)
Ballistic Missile Acceleration Model (continue – 2)
CLW0 Computation:
( ) 2/1222 ZYXV ++=( )
=Z
Y
X
V L
Define: ψ - trajectory azimuth angle ( )XY ,tan 1−=ψγ - trajectory pitch angle ( )221 ,tan YXZ += −γ
[ ] [ ]
−
−=
−
−==
γψγψγψψ
γψγψγ
ψψψψ
γγ
γγψγ
cossinsincossin
0cossin
sinsincoscoscos
100
0cossin
0sincos
cos0sin
010
sin0cos
320W
LC
163
SOLO RADAR Range-DopplerTarget Acceleration Models (continue – 29)
Ballistic Missile Acceleration Model (continue – 3)
( )( )
( ) ( ) 2
2
2
2
1
0
0
2
12
12
1
0
ZRz
V
zV
dVt
C
Z
Y
X
td
VdA
c
C
C
C
TWL
L
L
L
+
+
+
−
−
=
=
= µ
ωρ
ρ
ρ
where:
Assuming constant CL/m (barrel-roll model)02 =+ CC zz ω
0=Cd Assuming constant CD/m
( ) 2/1222 ZYXV ++=
Assuming constant ω (barrel-roll model)0=ω
164
SOLO RADAR Range-DopplerTarget Acceleration Models (continue – 30)
Ballistic Missile Acceleration Model (continue – 4)
MV
Bx
By
BzWz
Wy
Wx
αβ
αβ
Bp
Wp
Bq
WqBrWr
( ) ( ) ( )
( ) ( ) ( )
( ) ( ) ( )
( )( )( ) ( )
++
+
−
−−
−−
−−
=
0
0
0
0
3,1
2,1
1,1
0
0
0
0000000000
0000000000
000000000
0000000000
02
13,1
2
13,3
2
13,2000000
02
12,1
2
12,3
2
12,2000000
02
11,1
2
11,3
2
11,2000000
0000100000
0000010000
0000001000
2
2
222
222
222
ZRtC
tC
tC
d
z
z
Z
Y
X
Z
Y
X
VCVCVC
VCVCVC
VCVCVC
d
z
z
Z
Y
X
Z
Y
X
td
d
C
WL
WL
WL
C
C
C
WL
WL
WL
WL
WL
WL
WL
WL
WL
C
C
C
µ
ωω
ρρω
ρ
ρρω
ρ
ρρω
ρ
ω
System Dynamics is given by:
Target Acceleration Models (continue – 31) … (continue – 36)

Ballistic Missile Acceleration Model (continue – 5) … (continue – 10)

(Slides 165–170 contain figures only: simulation results and the wind/body-axis geometry of the missile velocity vector, with the angles α, β and the rates p, q, r in body (B) and wind (W) axes.)
Kalman Filter for Filtering Position and Velocity Measurements

Assume a Cartesian model of a non-maneuvering target:

x = [x; ẋ]  ⇒  d/dt [x; ẋ] = [0 1; 0 0] [x; ẋ] + [0; 1] w

Φ(T) := exp(A T) = I + A T = [1 T; 0 1]     (A^n = 0 for n ≥ 2)

Γ(T) := ∫_0^T Φ(T − τ) B dτ = ∫_0^T [T − τ; 1] dτ = [T²/2; T]

Measurements (position and velocity):  z = [x; ẋ] + [v_1; v_2]

Discrete System:

x_{k+1} = Φ_k x_k + Γ_k w_k,   Φ_k = [1 T; 0 1],   Γ_k = [T²/2; T],   E{w_k w_j} = σ_q² δ_kj
z_{k+1} = H_{k+1} x_{k+1} + v_{k+1},   H_{k+1} = [1 0; 0 1],   R = E{v_{k+1} v_{k+1}^T} = [σ_P²  0; 0  σ_V²]
Kalman Filter for Filtering Position and Velocity Measurements (continue – 1)

The Kalman Filter:

x̂_{k+1|k} = Φ_k x̂_{k|k}
x̂_{k+1|k+1} = x̂_{k+1|k} + K_{k+1} (z_{k+1} − H_{k+1} x̂_{k+1|k})

P_{k+1|k} = Φ_k P_{k|k} Φ_k^T + Γ_k Q_k Γ_k^T

[p_11  p_12; p_12  p_22]_{k+1|k} = [1 T; 0 1] [p_11  p_12; p_12  p_22]_{k|k} [1 0; T 1] + σ_q² [T²/2; T] [T²/2  T]

p_11(k+1|k) = p_11 + 2 T p_12 + T² p_22 + σ_q² T⁴/4
p_12(k+1|k) = p_12 + T p_22 + σ_q² T³/2
p_22(k+1|k) = p_22 + σ_q² T²

(all p_ij on the right-hand sides evaluated at (k|k))
Kalman Filter for Filtering Position and Velocity Measurements (continue – 2)

The Kalman Filter gain (H = I):

K_{k+1} = P_{k+1|k} H^T [H P_{k+1|k} H^T + R]^{−1} = P_{k+1|k} [P_{k+1|k} + R]^{−1}

[P + R]^{−1} = [p_11 + σ_P²   p_12; p_12   p_22 + σ_V²]^{−1} = (1/det) [p_22 + σ_V²   −p_12; −p_12   p_11 + σ_P²]

det = (p_11 + σ_P²)(p_22 + σ_V²) − p_12²

K_{k+1} = (1/det) [ p_11 (p_22 + σ_V²) − p_12²     p_12 σ_P²
                    p_12 σ_V²                      p_22 (p_11 + σ_P²) − p_12² ]

(all p_ij evaluated at (k+1|k))
Kalman Filter for Filtering Position and Velocity Measurements (continue – 3)

The Kalman Filter:

K_{k+1} = P_{k+1|k} [P_{k+1|k} + R]^{−1}

P_{k+1|k} = [p_11  p_12; p_12  p_22]_{k+1|k} = Φ P_{k|k} Φ^T + σ_q² [T⁴/4   T³/2; T³/2   T²]

p_22(k+1|k) = p_22(k|k) + σ_q² T²
p_12(k+1|k) = p_12(k|k) + p_22(k|k) T + σ_q² T³/2
p_11(k+1|k) = p_11(k|k) + 2 p_12(k|k) T + p_22(k|k) T² + σ_q² T⁴/4
Kalman Filter for Filtering Position and Velocity Measurements (continue – 4)

The covariance update (Joseph form):

P_{k+1|k+1} = (I − K_{k+1} H_{k+1}) P_{k+1|k} (I − K_{k+1} H_{k+1})^T + K_{k+1} R K_{k+1}^T

which, for the optimal gain, reduces to P_{k+1|k+1} = (I − K_{k+1} H_{k+1}) P_{k+1|k}. With H = I and K = P (P + R)^{−1}:

I − K = R (P + R)^{−1} = (1/det) [ σ_P² (p_22 + σ_V²)   −σ_P² p_12
                                   −σ_V² p_12            σ_V² (p_11 + σ_P²) ]

P_{k+1|k+1} = (I − K) P_{k+1|k} = R (P + R)^{−1} P = R K^T

P_{k+1|k+1} = [σ_P²  0; 0  σ_V²] [K_11  K_21; K_12  K_22] = [ σ_P² K_11   σ_P² K_21
                                                              σ_V² K_12   σ_V² K_22 ]

(all p_ij evaluated at (k+1|k); det = (p_11 + σ_P²)(p_22 + σ_V²) − p_12²)
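The compact update P₊ = R Kᵀ can be verified numerically against the full Joseph form; a minimal sketch with an arbitrary example prior covariance:

```python
import numpy as np

# Position+velocity measurement (H = I): check K = P (P + R)^-1 and P_plus = R K^T.
sigma_P, sigma_V = 3.0, 1.5
P = np.array([[4.0, 1.2], [1.2, 0.9]])          # example prior covariance
R = np.diag([sigma_P**2, sigma_V**2])
S = P + R
K = P @ np.linalg.inv(S)
P_plus_joseph = (np.eye(2) - K) @ P @ (np.eye(2) - K).T + K @ R @ K.T
P_plus_short = R @ K.T                          # compact form from the slide
```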
α – β (2-D) Filter with Piecewise Constant White Noise Acceleration Model

We want to find the steady-state form of the filter for

d/dt [x; ẋ] = [0 1; 0 0] [x; ẋ] + [0; 1] w,   x – position,  ẋ – velocity

Assume that only position measurements are available:

z_{k+1} = H x_{k+1} + v_{k+1} = [1 0] x_{k+1} + v_{k+1},   E{v_{k+1}} = 0,   E{v_k v_j} = R δ_kj = σ_P² δ_kj

Discrete System:

x_{k+1} = Φ x_k + Γ w_k,   Φ = [1 T; 0 1],   Γ = [T²/2; T],   E{w_k w_j} = σ_w² δ_kj
α – β (2-D) Filter with Piecewise Constant White Noise Acceleration Model (continue – 1)

S(k+1) = H(k+1) P(k+1|k) H^T(k+1) + R(k+1)
K(k+1) = P(k+1|k) H^T(k+1) S^{−1}(k+1)

When the Kalman Filter reaches steady state:

lim_{k→∞} P(k+1|k) = [m_11  m_12; m_12  m_22],   lim_{k→∞} P(k+1|k+1) = [p_11  p_12; p_12  p_22]

S = [1 0] [m_11  m_12; m_12  m_22] [1; 0] + σ_P² = m_11 + σ_P²

K = [k_11; k_12] = [m_11; m_12] / (m_11 + σ_P²)

P(k+1|k+1) = [I − K(k+1) H(k+1)] P(k+1|k):

p_11 = (1 − k_11) m_11
p_12 = (1 − k_11) m_12 = m_12 − k_12 m_11
p_22 = m_22 − k_12 m_12
α – β (2-D) Filter with Piecewise Constant White Noise Acceleration Model (continue – 2)

From P(k+1|k) = Φ P(k|k) Φ^T + Q we obtain P(k|k) = Φ^{−1} [P(k+1|k) − Q] Φ^{−T}:

[p_11  p_12; p_12  p_22] = [1 −T; 0 1] ( [m_11  m_12; m_12  m_22] − σ_w² [T⁴/4   T³/2; T³/2   T²] ) [1 0; −T 1]

For the piecewise (between samples) constant white noise acceleration model this gives:

k_11 m_11 = 2 T m_12 − T² m_22 + σ_w² T⁴/4
k_11 m_12 = T m_22 − σ_w² T³/2
k_12 m_12 = σ_w² T²
α – β (2-D) Filter with Piecewise Constant White Noise Acceleration Model (continue – 3)

We obtained the following 5 equations with 5 unknowns k_11, k_12, m_11, m_12, m_22:

1   k_11 = m_11/(m_11 + σ_P²)   ⇒   m_11 = σ_P² k_11/(1 − k_11)
2   k_12 = m_12/(m_11 + σ_P²)   ⇒   m_12 = σ_P² k_12/(1 − k_11)
3   k_11 m_11 = 2 T m_12 − T² m_22 + σ_w² T⁴/4
4   k_11 m_12 = T m_22 − σ_w² T³/2
5   k_12 m_12 = σ_w² T²

Substituting the results of 1 and 2 into 3, 4, 5 and eliminating m_22 and σ_w²:

k_11² − 2 T k_12 + T k_11 k_12 + (T²/4) k_12² = 0
α – β (2-D) Filter with Piecewise Constant White Noise Acceleration Model (continue – 4)

We obtained:  k_11² − 2 T k_12 + T k_11 k_12 + (T²/4) k_12² = 0

Kalata introduced the α, β parameters defined as:  α := k_11,   β := k_12 T

in terms of which the previous equation reads:

α² + α β + β²/4 − 2 β = 0   ⇒   α = √(2β) − β/2

From equation 5 (k_12 m_12 = σ_w² T²) and m_12 = σ_P² k_12/(1 − k_11):

(1 − α) = β² σ_P²/(σ_w² T⁴) =: β²/λ²

λ := σ_w T²/σ_P – the Target Maneuvering Index, proportional to the ratio of the motion uncertainty σ_w T² to the observation uncertainty σ_P.
α – β (2-D) Filter with Piecewise Constant White Noise Acceleration Model (continue – 5)

Substituting α = √(2β) − β/2 into β²/(1 − α) = λ², and noting 1 − α = (1 − √(β/2))², gives, with s := √(2β):

s² + λ s − 2 λ = 0

The positive solution is:  s = (1/2)[−λ + √(λ² + 8λ)]

Therefore:  β = s²/2 = (1/4)[λ² + 4λ − λ √(λ² + 8λ)]

and:  α = s − s²/4 = −(1/8)[λ² + 8λ − (λ + 4) √(λ² + 8λ)]

Conversely:  λ² = β²/(1 − α) = β²/(1 − √(2β) + β/2)
α – β (2-D) Filter with Piecewise Constant White Noise Acceleration Model (continue – 6)

We found:

[p_11  p_12; p_12  p_22] = [ (1 − k_11) m_11   (1 − k_11) m_12
                             (1 − k_11) m_12   m_22 − k_12 m_12 ]

with  m_11 = σ_P² k_11/(1 − k_11),   m_12 = σ_P² k_12/(1 − k_11),
      m_22 = (k_11 m_12 + σ_w² T³/2)/T

so that, in terms of α and β:

p_11 = (1 − k_11) m_11 = k_11 σ_P² = α σ_P²
p_12 = (1 − k_11) m_12 = k_12 σ_P² = (β/T) σ_P²
p_22 = m_22 − k_12 m_12 = [β (α − β/2)/(1 − α)] σ_P²/T²
α – β (2-D) Filter with Piecewise Constant White Noise Acceleration Model (continue – 7)

We found:

α = −(1/8)[λ² + 8λ − (λ + 4) √(λ² + 8λ)]
β = (1/4)[λ² + 4λ − λ √(λ² + 8λ)]

(figure: the α, β gains as functions of λ, in semi-log and log-log scales)
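The closed-form Kalata gains can be checked against the two defining relations (the quadratic in α, β and the maneuvering-index identity); a minimal sketch with an example value of λ:

```python
import numpy as np

# alpha-beta gains from the target maneuvering index lambda = sigma_w T^2 / sigma_P
# (piecewise-constant white-noise acceleration model).
lam = 0.5
beta = 0.25 * (lam**2 + 4*lam - lam*np.sqrt(lam**2 + 8*lam))
alpha = np.sqrt(2*beta) - beta/2
```

For λ = 0.5 this gives α ≈ 0.628, β ≈ 0.305; as λ → 0 both gains vanish (heavy smoothing), and as λ grows the filter trusts measurements more.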
α – β (2-D) Filter with White Noise Acceleration Model

For the (continuous) white noise acceleration model,  Q(k) = q [T³/3   T²/2; T²/2   T],  and

P(k|k) = Φ^{−1} [P(k+1|k) − Q] Φ^{−T}:

[p_11  p_12; p_12  p_22] = [1 −T; 0 1] ( [m_11  m_12; m_12  m_22] − q [T³/3   T²/2; T²/2   T] ) [1 0; −T 1]

which gives:

k_11 m_11 = 2 T m_12 − T² m_22 + q T³/3
k_11 m_12 = T m_22 − q T²/2
k_12 m_12 = q T
α – β (2-D) Filter with White Noise Acceleration Model (continue – 1)

We obtained the following 5 equations with 5 unknowns k_11, k_12, m_11, m_12, m_22:

1   k_11 = m_11/(m_11 + σ_P²)   ⇒   m_11 = σ_P² k_11/(1 − k_11)
2   k_12 = m_12/(m_11 + σ_P²)   ⇒   m_12 = σ_P² k_12/(1 − k_11)
3   k_11 m_11 = 2 T m_12 − T² m_22 + q T³/3
4   k_11 m_12 = T m_22 − q T²/2
5   k_12 m_12 = q T

Substituting the results of 1 and 2 into 3, 4, 5 and eliminating m_22 and q:

k_11² − 2 T k_12 + T k_11 k_12 + (T²/6) k_12² = 0
186
EstimatorsSOLO
We obtained: 06
12 2
122
1211122
11 =++− kTkkTkTk
The α, β parameters defined as: Tkk 1211 :: == βα
and the previous equation is written as function of α, β as:
06
12 22 =++− ββαβα
which can be used to write α as a function of β:212
22 βββα −+=
αβσ
β −=
−===
1
/
1/ 11
212
1212
T
k
k
T
qT
k
qTm P
We obtained:
2
2
32
:1 c
P
qT λσα
β ==−
α - β (2-D) Filter with White Noise Acceleration Model (continue – 2)
2
2
22
:
122
21
1 cλβββ
βα
β =+−+
=−The equation for solving β is:
which can be solved numerically.
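A bisection sketch for this numerical solution, assuming the α(β) relation and the index λc² = qT³/σP² as derived in this section (helper names are hypothetical). Note that α(β) reaches 1 at β = 3 − √3, so the root is bracketed below that value:

```python
import math

def cwna_alpha(beta):
    # alpha from alpha**2 + alpha*beta + beta**2/6 - 2*beta = 0 (positive root)
    return 0.5 * (math.sqrt(beta ** 2 / 3.0 + 8.0 * beta) - beta)

def solve_cwna_beta(lc2):
    """Solve beta**2 / (1 - alpha(beta)) = lc2 by bisection,
    where lc2 = q*T**3/sigma_P**2."""
    f = lambda b: b ** 2 / (1.0 - cwna_alpha(b)) - lc2
    lo, hi = 1e-12, 3.0 - math.sqrt(3.0) - 1e-12
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if f(lo) * f(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)
```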
α - β (2-D) Filter with White Noise Acceleration Model (continue – 2)
187
We found
m11 = σP² k11/(1 − k11)
m12 = σP² k12/(1 − k11)
m22 = k11 m12/T + k12 m12/2
and the updated covariances:
p11 = (1 − k11) m11 = α σP²
p12 = (1 − k11) m12 = (β/T) σP²
p22 = m22 − k12 m12 = [β(2α − β)/(2(1 − α))] σP²/T²
α – β - γ (3-D) Filter with Piecewise Constant Wiener Process Acceleration Model
188
We want to find the steady-state form of the filter for
d/dt [x1; x2; x3] = [0 1 0; 0 0 1; 0 0 0] [x1; x2; x3] + [0; 0; 1] w
where x1 is position, x2 velocity and x3 acceleration.
Discrete system:
x(k+1) = Φ x(k) + Γ w(k),  z(k+1) = H x(k+1) + v(k+1)
Φ = [1 T T²/2; 0 1 T; 0 0 1],  Γ = [T²/2; T; 1],  H = [1 0 0]
Assume that only position measurements are available:
z(k+1) = x1(k+1) + v(k+1),  E[v(k+1)] = 0,  E[v(k) v(j)] = σP² δkj,  E[w(k) w(j)] = σw² δkj
189
Piecewise (between samples) constant Wiener process acceleration model:
E[Γ w(k) wᵀ(l) Γᵀ] = q Γ Γᵀ δkl,  Γᵀ = [T²/2  T  1]
Γ Γᵀ = [ T⁴/4  T³/2  T²/2 ; T³/2  T²  T ; T²/2  T  1 ]
Guideline for choice of process noise intensity:
For this model √q should be of the order of the maximum acceleration increment over a sampling period, ΔaM. A practical range is 0.5 ΔaM ≤ √q ≤ ΔaM.
α – β - γ (3-D) Filter with Piecewise Constant Wiener Process Acceleration Model (continue – 1)
190
The Target Maneuvering Index is defined, as for the α – β Filter, as λ := σw T²/σP.
The three equations that yield the optimal steady-state gains are:
γ² = 4 λ² (1 − α)
β = 2(2 − α) − 4 √(1 − α)   or:  α = √(2β) − β/2
γ = β²/α
This system of three nonlinear equations can be solved numerically.
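A one-dimensional bisection over α suffices, assuming the three relations as given above (the helper name `abg_gains` is hypothetical): β and γ are expressed through α, and the remaining residual γ² − 4λ²(1 − α) changes sign once on (0, 1).

```python
import math

def abg_gains(lmbda, iters=200):
    """Solve the alpha-beta-gamma steady-state relations
       beta = 2(2 - alpha) - 4*sqrt(1 - alpha),  gamma = beta**2 / alpha,
       gamma**2 = 4 * lmbda**2 * (1 - alpha)
    by bisection on alpha in (0, 1)."""
    def resid(alpha):
        beta = 2.0 * (2.0 - alpha) - 4.0 * math.sqrt(1.0 - alpha)
        return (beta ** 2 / alpha) ** 2 - 4.0 * lmbda ** 2 * (1.0 - alpha)
    lo, hi = 1e-9, 1.0 - 1e-12
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if resid(lo) * resid(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    alpha = 0.5 * (lo + hi)
    beta = 2.0 * (2.0 - alpha) - 4.0 * math.sqrt(1.0 - alpha)
    return alpha, beta, beta ** 2 / alpha
```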
The corresponding update state covariance expressions are (with p11 = α σP², p12 = (β/T) σP², p13 = [γ/(2T²)] σP²):
p22 = [8αβ + γ(β − 2α − 4)] σP² / [8 T² (1 − α)]
p23 = β(2β − γ) σP² / [4 T³ (1 − α)]
p33 = γ(2β − γ) σP² / [4 T⁴ (1 − α)]
α – β - γ (3-D) Filter with Piecewise Constant Wiener Process Acceleration Model (continue – 3)
191
α – β - γ Filter gains as functions of λ in semi-log and log-log scales
Table of Content
192
Optimal Filtering
An "Optimal Filter" is a filter that is optimal in some specific sense.
1. Minimum Mean-Square Error (MMSE):
min E[ ‖x̂n − xn‖² | Z0:n ] = min ∫ ‖x̂n − xn‖² p(xn | Z0:n) dxn
Solution: x̂n = E[xn | Z0:n] = ∫ xn p(xn | Z0:n) dxn
2. Maximum a Posteriori (MAP):
x̂n = mode of p(xn | Z0:n) ⇔ min E[1 − I(xn)], where I(xn) is the indicator function of the set ‖xn − x̂n‖ ≤ ζ and ζ is a small scalar.
3. Maximum Likelihood (ML): x̂n = argmax p(yn | xn)
4. Minimax: median of the posterior p(xn | Z0:n)
5. Minimum Conditional Inaccuracy:
min E[ −log p(x̂ | y) ] = min ∫∫ p(x, y) log [1/p(x̂ | y)] dx dy
193
Optimal Filtering (continued)
6. Minimum Conditional KL Divergence: choose x̂ to minimize
KL = ∫∫ p(x, y) log [ p(x, y) / (p(x̂ | y) p(x)) ] dx dy
7. Minimum Free Energy: the negative free energy is a lower bound on the log-likelihood, which one aims to maximize (equivalently, the free energy is minimized):
F(Q, P) = −E_Q[log P(x | y)] + E_Q[log Q(x)]
where Q(x) is an arbitrary distribution of x. F(Q, P) is the Kullback–Leibler (KL) divergence between the distribution Q(x) and P(x | y); the term E_Q[log Q(x)] is the negative entropy w.r.t. Q(x).
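A tiny discrete sketch of criteria 1 and 2 with made-up numbers: a scalar Gaussian prior and one linear measurement, with the posterior evaluated on a grid. The MMSE estimate is the posterior mean and the MAP estimate the posterior mode; for a Gaussian posterior the two coincide.

```python
import math

# Hypothetical scalar example: prior x ~ N(0, 1), measurement z = x + v,
# v ~ N(0, r). Posterior evaluated on a uniform grid over [-5, 5].
z, r = 1.0, 0.5
xs = [i * 0.001 - 5.0 for i in range(10001)]
post = [math.exp(-0.5 * x * x) * math.exp(-0.5 * (z - x) ** 2 / r) for x in xs]
norm = sum(post)
post = [p / norm for p in post]

x_mmse = sum(x * p for x, p in zip(xs, post))            # posterior mean (MMSE)
x_map = xs[max(range(len(xs)), key=post.__getitem__)]    # posterior mode (MAP)
```

Here the exact posterior is N(2/3, 1/3), so both estimates land at 2/3 up to grid resolution.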
Table of Content
194
Continuous Filter-Smoother Algorithms
Problem - Choose w(t) and x(t0) to minimize:
J = ½ ‖x(t0) − x̄0‖²_S0 + ½ ‖x(tf) − x̄f‖²_Sf + ½ ∫ from t0 to tf [ ‖z − H x‖²_R⁻¹ + ‖w − w̄‖²_Q⁻¹ ] dt
subject to:
d x(t)/dt = F(t) x(t) + G(t) w(t)
z(t) = H(t) x(t) + v(t)
and given: z(t), w̄(t), x̄0, x̄f, S0, Sf, R(t), Q(t), H(t), F(t), G(t)
where ‖x(t0) − x̄0‖²_S0 := [x(t0) − x̄0]ᵀ S0 [x(t0) − x̄0], and similarly for the other weighted norms.
Smoothing Interpretation
z(t) are noisy observations of H x; v(t) is a zero-mean white noise vector with density matrix R(t).
w(t) are random forcing functions, i.e., a white noise vector with prior mean w̄(t) and density matrix Q(t).
(x̄0, P0 = S0⁻¹) are the mean and covariance of the initial state vector from independent observations before the test.
(x̄f, Pf = Sf⁻¹) are the mean and covariance of the final state vector from independent observations after the test.
195
Solution to the Problem:
Hamiltonian: ℋ = ½ ‖z − H x‖²_R⁻¹ + ½ ‖w − w̄‖²_Q⁻¹ + λᵀ (F x + G w)
Euler-Lagrange equations:
dλ/dt = −∂ℋ/∂x = Hᵀ R⁻¹ (z − H x) − Fᵀ λ
0 = ∂ℋ/∂w ⇒ w = w̄ − Q Gᵀ λ
Boundary equations:
λᵀ(t0) = −∂J/∂x(t0) = −[x(t0) − x̄0]ᵀ S0
λᵀ(tf) = +∂J/∂x(tf) = [x(tf) − x̄f]ᵀ Sf
Two-Point Boundary Value Problem:
d/dt [x; λ] = [ F  −G Q Gᵀ ; −Hᵀ R⁻¹ H  −Fᵀ ] [x; λ] + [ G w̄ ; Hᵀ R⁻¹ z ]
First Way, Assumption 1 (Forward). Assumed solution:
x(t) = xF(t) − PF(t) λ(t)
so that the boundary conditions read x(t0) = x̄0 − P0 λ(t0) with P0 := S0⁻¹, and λ(tf) = Sf [x(tf) − x̄f].
196
Solution to the Problem (continue – 1):
Differentiate the assumed solution x = xF − PF λ and use the previous equations:
dx/dt = dxF/dt − (dPF/dt) λ − PF dλ/dt
Substituting dx/dt = F x + G w̄ − G Q Gᵀ λ and dλ/dt = Hᵀ R⁻¹ (z − H xF) + Hᵀ R⁻¹ H PF λ − Fᵀ λ and collecting terms:
dxF/dt − F xF − G w̄ − PF Hᵀ R⁻¹ (z − H xF) = [ dPF/dt − F PF − PF Fᵀ − G Q Gᵀ + PF Hᵀ R⁻¹ H PF ] λ
197
We want xF(t) to be independent of λ(t). This is obtained by choosing
dPF/dt = F PF + PF Fᵀ + G Q Gᵀ − PF Hᵀ R⁻¹ H PF,  PF(t0) = S0⁻¹ = P0
Therefore
dxF/dt = F xF + KF(t) [z − H xF] + G w̄,  xF(t0) = x̄0,  KF(t) := PF(t) Hᵀ R⁻¹
Substituting these results in the λ(t) equation:
dλ/dt = −[F − KF H]ᵀ λ + Hᵀ R⁻¹ (z − H xF)
with the terminal condition obtained from λ(tf) = Sf [x(tf) − x̄f] and x = xF − PF λ:
λ(tf) = [PF(tf) + Sf⁻¹]⁻¹ [xF(tf) − x̄f]
First Way, Assumption 1 (continue – 1)
198
Summary of First Assumption – Forward then Backward Algorithms
Problem - choose w(t) and x(t0) to minimize J (as above), subject to dx/dt = F x + G w, z = H x + v.
Forward Covariance Filter:
dxF/dt = F xF + KF [z − H xF] + G w̄,  xF(t0) = x̄0
dPF/dt = F PF + PF Fᵀ + G Q Gᵀ − PF Hᵀ R⁻¹ H PF,  PF(t0) = P0
where KF(t) := PF(t) Hᵀ R⁻¹. Store xF(t) and PF(t).
Backward Information Filter (τ = tf − t):
dλ/dτ = −dλ/dt = [F − KF H]ᵀ λ − Hᵀ R⁻¹ (z − H xF),  λ(tf) = [PF(tf) + Sf⁻¹]⁻¹ [xF(tf) − x̄f]
where ŵ(t) = w̄(t) − Q Gᵀ λ(t) = Estimate of w(t)
x̂(t) = xF(t) − PF(t) λ(t) = Smoothed Estimate of x(t)
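A minimal scalar sketch of the forward covariance filter's Riccati equation (all system numbers here are made up): Euler integration drives PF to the steady state that solves 2 F P + G Q Gᵀ − P² H²/R = 0.

```python
import math

# Scalar forward Riccati: dP/dt = 2*F*P + G*Q*G - (P*H)**2 / R
F, G, H = -1.0, 1.0, 1.0
Q, R = 2.0, 0.5
P = 1.0                    # PF(t0) = S0**-1
dt = 1e-4
for _ in range(200000):    # integrate 20 time units forward
    P += dt * (2.0 * F * P + G * Q * G - (P * H) ** 2 / R)

# Algebraic steady state of H**2/R * P**2 - 2*F*P - G*Q*G = 0 (positive root)
P_inf = (2.0 * F + math.sqrt(4.0 * F * F + 4.0 * (H * H / R) * G * Q * G)) / (2.0 * H * H / R)
K_inf = P_inf * H / R      # steady-state Kalman gain KF = PF * H' / R
```

With these numbers the Riccati equation reduces to P² + P − 1 = 0, so PF converges to (√5 − 1)/2.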
199
Solution to the Problem:
(The Hamiltonian, the Euler-Lagrange equations and the boundary equations are the same as before.)
Second Way, Assumption 2 (Forward). Assumed solution:
λ(t) = λF(t) − SF(t) x(t)
so that the boundary condition λ(t0) = −S0 [x(t0) − x̄0] gives λF(t0) = S0 x̄0, SF(t0) = S0.
200
Solution to the Problem (continue – 1):
Differentiate the assumed solution and use the previous equations:
dλ/dt = dλF/dt − (dSF/dt) x − SF dx/dt
Substituting dx/dt = F x + G w̄ − G Q Gᵀ (λF − SF x) and dλ/dt = −Hᵀ R⁻¹ H x − Fᵀ (λF − SF x) + Hᵀ R⁻¹ z and collecting terms:
dλF/dt + (Fᵀ + SF G Q Gᵀ) λF − SF G w̄ − Hᵀ R⁻¹ z = [ dSF/dt + SF F + Fᵀ SF + SF G Q Gᵀ SF − Hᵀ R⁻¹ H ] x
201
We want λF(t) to be independent of x(t). This is obtained by choosing
dSF/dt = −SF F − Fᵀ SF − CFᵀ Q⁻¹ CF + Hᵀ R⁻¹ H,  SF(t0) = S0,  CF(t) := Q Gᵀ SF(t)
(note that CFᵀ Q⁻¹ CF = SF G Q Gᵀ SF). Therefore
dλF/dt = −[F + G CF]ᵀ λF + Hᵀ R⁻¹ z + SF G w̄,  λF(t0) = S0 x̄0
Substituting these results in the x(t) equation:
dx/dt = [F + G CF] x + G w̄ − G Q Gᵀ λF
with the terminal condition obtained from λ(tf) = Sf [x(tf) − x̄f]:
x(tf) = [SF(tf) + Sf]⁻¹ [λF(tf) + Sf x̄f]
202
Summary of Second Assumption – Forward then Backward Algorithms
Forward Information Filter:
dλF/dt = −[F + G CF]ᵀ λF + Hᵀ R⁻¹ z + SF G w̄,  λF(t0) = S0 x̄0
dSF/dt = −SF F − Fᵀ SF − CFᵀ Q⁻¹ CF + Hᵀ R⁻¹ H,  SF(t0) = S0,  CF := Q Gᵀ SF
Store λF(t) and SF(t).
Backward Information Smoother (τ = tf − t):
dx̂/dτ = −dx̂/dt = −[F + G CF] x̂ − G w̄ + G Q Gᵀ λF,  x̂(tf) = [SF(tf) + Sf]⁻¹ [λF(tf) + Sf x̄f]
where λ(t) = λF(t) − SF(t) x̂(t), ŵ(t) = w̄(t) − Q Gᵀ λ(t) = Estimate of w(t), and x̂(t) = Smoothed Estimate of x(t)
SOLO EstimatorsContinuous Filter-Smoother Algorithms
Solution to the Problem :
( ) nHamiltonia:H =++−+−= −− wGxFwwxHz T
QRλ22
11
2
1
2
1
Euler-Lagrange equations:
( )
( )
+−=∂∂=
−−=∂∂−=
−
−
GQwww
H
FHRxHzx
H
TT
TTT
λ
λλ
1
1
0
Two-Point Boundary Value Problem
Define:
( ) ( )[ ]
( ) ( )[ ]
−=∂∂=
−−=∂∂−=
fT
ff
t
fT
T
t
T
Sxtxx
Jt
Sxtxx
Jt
f
λ
λ 0000
0
Boundary equations:
λTGQww −=
zRHFxHRH TTT 11 −− +−−= λλ
( )[ ] ( ) ( )( ) ( ) ( ) ( ) ( ) ( ) ( )ttPtxtx
tPtSxtx
tPtSxtxBB
ffffff
λλλ
λλ+=⇒
==−
==−−−
−
1
0001
000
Third Way, Assumption 3:
Backward
204
SOLO EstimatorsContinuous Filter-Smoother Algorithms
Solution to the Problem (continue – 1) :
Differentiate and use previous equations
( )( )
( )( )
+
−−−
=
−− zRH
wG
t
tx
FHRH
GQGF
t
txTTT
T
11 λλ
( ) ( ) ( ) ( ) ( ) ( )( ) ( ) ( ) ( ) ( ) ( ) ( )[ ] ( ) ( )
( ) ( ) ( )[ ] ( ) ( )twGtGQGttPtxF
tzRHtFttPtxHRHtPttPtx
ttPttPtxtx
TBB
TTBB
TBBB
BBB
+−+=
+−+−⋅++=
++=−−
λλλλλ
λλ11
( ) ( ) ( ) ( ) ( )[ ] ( )( ) ( ) ( ) ( ) ( )[ ] ( )ttPHRHtPGQGFtPtPFtP
twGtxHtzRHtPtFxtx
BT
BTT
BBB
BT
BBB
λ1
1
−
−
+−++−=
−−+−
( ) ( ) ( ) ( )ttPtxtx BB λ+=Third Way, Assumption 3
( ) ( )[ ]( ) ( )[ ]
−=
−−=
fT
fffT
TT
Sxtxt
Sxtxt
λλ 0000
or
205
SOLO EstimatorsContinuous Filter-Smoother Algorithms
Solution to the Problem (continue – 1) :
( )( )
( )( )
+
−−−
=
−− zRH
wG
t
tx
FHRH
GQGF
t
txTTT
T
11 λλ ( ) ( )[ ]
( ) ( )[ ]
−=
−−=
fT
fffT
TT
Sxtxt
Sxtxt
λλ 0000
We want to have xB(t) independent on λ(t). This is obtain by choosing
Therefore
Let substitute the results in the equation( )tλ
( ) ( ) ( ) ( )ttPtxtx BB λ+=Third Way, Assumption 3
( ) ( ) ( ) ( ) ( )[ ] ( )( ) ( ) ( ) ( ) ( )[ ] ( )ttPHRHtPGQGFtPtPFtP
twGtxHtzRHtPtFxtx
BT
BTT
BBB
BT
BBB
λ1
1
−
−
+−++−=
−−+−
( ) ( ) ( ) ( ) ( ) ( )( ) 1: −=
=−+−−=−
RHtPK
PtPtKRtKGQGFtPtPFtPT
BB
ffBBBTT
BBB
( ) ( ) ( ) ( ) ( )[ ] ( ) ( ) ffBBBBB xtxtwGtxHtztKtFxtx =−−+−=−
( ) ( ) ( ) ( )[ ] ( ) ( )
( ) ( ) ( ) ( )[ ]
( ) ( ) ( ) ( ) ( ) ( ) ( ) ( )[ ] ( )[ ]001
00000000000
1
11
1
xtxPtPttPxtxtxtxttP
txHtzRHtHKF
tzRHtFttPtxHRHt
BBBBB
BT
T
RHtP
B
TTBB
T
TB
−+−=⇒−+−=+−=
−+
+−=
+−+−=
−
−
−−
−
λλλ
λ
λλλ
( ) ( ) ( ) ( ) ( )[ ] ( ) ( )( ) ( ) ( ) ( ) ( ) ( )
( ) 1: −=
=−+−−=−
=−−+−=−
RHtPK
PtPtKRtKGQGFtPtPFtP
xtxtwGtxHtztKtFxtx
TBB
ffBBBTT
BBB
ffBBBBB
206
Summary of Third Assumption – Backward then Forward Algorithms
Backward Covariance Filter (τ = tf − t):
−dxB/dt = −F xB + KB [z − H xB] − G w̄,  xB(tf) = x̄f
−dPB/dt = −F PB − PB Fᵀ − KB R KBᵀ + G Q Gᵀ,  PB(tf) = Pf,  KB := PB Hᵀ R⁻¹
Store xB(t) and PB(t).
Forward Covariance Smoother:
dλ/dt = −[F + KB H]ᵀ λ + Hᵀ R⁻¹ (z − H xB),  λ(t0) = −[PB(t0) + S0⁻¹]⁻¹ [xB(t0) − x̄0]
where ŵ(t) = w̄(t) − Q Gᵀ λ(t) = Estimate of w(t)
x̂(t) = xB(t) + PB(t) λ(t) = Smoothed Estimate of x(t)
207
Solution to the Problem:
(The Hamiltonian, the Euler-Lagrange equations and the boundary equations are the same as before.)
Fourth Way, Assumption 4 (Backward). Assumed solution:
λ(t) = λB(t) + SB(t) x(t)
so that the boundary condition λ(tf) = Sf [x(tf) − x̄f] gives λB(tf) = −Sf x̄f, SB(tf) = Sf.
208
Solution to the Problem (continue – 1):
Differentiate the assumed solution and use the previous equations:
dλ/dt = dλB/dt + (dSB/dt) x + SB dx/dt
Substituting and collecting terms:
dλB/dt + (Fᵀ − SB G Q Gᵀ) λB + SB G w̄ − Hᵀ R⁻¹ z = −[ dSB/dt + SB F + Fᵀ SB − SB G Q Gᵀ SB + Hᵀ R⁻¹ H ] x
209
We want λB(t) to be independent of x(t). This is obtained by choosing (integrated backward):
−dSB/dt = SB F + Fᵀ SB − CBᵀ Q⁻¹ CB + Hᵀ R⁻¹ H,  SB(tf) = Sf,  CB(t) := Q Gᵀ SB(t)
Therefore
−dλB/dt = [F − G CB]ᵀ λB − Hᵀ R⁻¹ z + SB G w̄,  λB(tf) = −Sf x̄f
Substituting these results in the x(t) equation:
dx/dt = [F − G CB] x + G w̄ − G Q Gᵀ λB
with the initial condition obtained from λ(t0) = −S0 [x(t0) − x̄0]:
x(t0) = [SB(t0) + S0]⁻¹ [S0 x̄0 − λB(t0)]
SOLO EstimatorsContinuous Filter-Smoother Algorithms
Problem - Choose w(t) and x(t0) to minimize:
( ) ( ) ∫ −− −+−+−+−=f
f
t
tQRSffS
dtwwxHzxtxxtxJ0
110
2222
00 2
1
2
1
2
1
subject to: ( ) ( ) ( ) ( ) ( ) ( )twtGtxtFtxtxtd
d +==
( ) ( ) ( ) ( )tvtxtHtz +=
Backward InformationFilter (τ = tf – t)
Store λB(t) and SB(t)
Forward Information Smoother
Summary of Fourth Assumption – Backward then Forward Algorithms
( ) ( )[ ] ( ) ( ) ( )[ ] ( ) ( )[ ] ( )[ ]0001
000 xStStStxtQGtwGtxtCGFtx BBBT
B +−+=−+−= − λλ
where = Estimate of w(t)( ) ( ) ( )tGQtwtw Tλ−=
= Smoothed Estimate of x(t)( ) ( ) ( )tPtxtx FF λ−=
Table of Content
211
EstimatorsSOLO
References
Minkoff, J., “Signals, Noise, and Active Sensors”, John Wiley & Sons, 1992
Sage, A. P., Melsa, J. L., “Estimation Theory with Applications to Communication and Control”, McGraw Hill, 1971
Gelb, A.,Ed., written by the Technical Staff, The Analytic Sciences Corporation, “Applied Optimal Estimation”, M.I.T. Press, 1974
Bryson, A.E. Jr., Ho, Y-C., “Applied Optimal Control”, Ginn & Company, 1969
Kailath, T., Sayed, A. H., Hassibi, B., "Linear Estimation", Prentice Hall, 2000
Sage, A. P., “Optimal Systems Control”, Prentice-Hall, 1968, 1st Ed., Ch.8, Optimal State Estimation
Sage, A. P., White, C.C., III “Optimal Systems Control”, Prentice-Hall, 1977, 2nd Ed.,Ch.8, Optimal State Estimation
Y. Bar-Shalom, T.E. Fortmann, “Tracking and Data Association”, Academic Press, 1988
Y. Bar-Shalom, Xiao-Rong Li., “Multitarget-Multisensor Tracking: Principles and Techniques”, YBS Publishing, 1995
Haykin, S. “Adaptive Filter Theory”, Prentice Hall, 4th Ed., 2002
212
EstimatorsSOLO
References (continue – 1)
Minkler, G., Minkler, J., “Theory and Applications of Kalman Filters”, Magellan, 1993
Stengel, R. F., “Stochastic Optimal Control – Theory and Applications”, John Wiley & Sons, 1986
Kailath, T., “Lectures on Wiener and Kalman Filtering”, Springer-Verlag, 1981
Anderson, B. D. O., Moore, J. B., “Optimal Filtering”, Prentice-Hall, 1979
Deutch, R., “System Analysis Techniques”, Prentice Hall, 1969, ch. 6
Chui, C. K., Chen, G., “Kalman Filtering with Real Time Applications”, Springer-Verlag, 1987
Catlin, D. E., “Estimation, Control, and the Discrete Kalman Filter”, Springer-Verlag, 1989
Haykin, S., Ed., “Kalman Filtering and Neural Networks”, John Wiley & Sons, 2001
Zarchan, P., Musoff, H., “Fundamentals of Kalman Filtering – A Practical Approach”, AIAA, Progress in Astronautics & Aeronautics, vol. 190, 2000
Brookner, E., “Tracking and Kalman Filtering Made Easy”, John Wiley & Sons, 1998
214
References (continue – 2) – photographs:
Arthur E. Bryson Jr., Professor Emeritus, Aeronautics and Astronautics, Stanford University
Andrew P. Sage; Thomas Kailath (1935 – )
From left to right: Sam Blackman, Oliver Drummond, Yaakov Bar-Shalom and Rabinder Madan
Simon Haykin, University Professor, Director, Adaptive Systems Laboratory, McMaster University
Table of Content
January 10, 2015 215
SOLO
Technion – Israel Institute of Technology
1964 – 1968 BSc EE; 1968 – 1971 MSc EE
Israeli Air Force, 1970 – 1974
RAFAEL – Israeli Armament Development Authority, 1974 – 2013
Stanford University, 1983 – 1986 PhD AA
216
Review of Probability
Normal (Gaussian) Distribution
Karl Friedrich Gauss, 1777 – 1855
Probability Density Function: p(x; μ, σ) = exp[−(x − μ)²/(2σ²)] / (√(2π) σ)
Cumulative Distribution Function: P(x; μ, σ) = ∫ from −∞ to x exp[−(u − μ)²/(2σ²)] / (√(2π) σ) du
Mean Value: E(x) = μ
Variance: Var(x) = σ²
Characteristic (Moment Generating) Function:
E[exp(jωx)] = ∫ exp(jωu) exp[−(u − μ)²/(2σ²)] / (√(2π) σ) du = exp(jμω − σ²ω²/2)
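The density and distribution functions above can be evaluated directly, the CDF through the error function; a minimal sketch (helper names are hypothetical):

```python
import math

def normal_pdf(x, mu=0.0, sigma=1.0):
    """Gaussian density exp(-(x-mu)^2/(2 sigma^2)) / (sqrt(2 pi) sigma)."""
    return math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2)) / (math.sqrt(2.0 * math.pi) * sigma)

def normal_cdf(x, mu=0.0, sigma=1.0):
    """Gaussian CDF expressed through the error function."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))
```

For example, normal_cdf(1.96) − normal_cdf(−1.96) recovers the familiar 0.95 two-sided probability.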
217
Moments of the Normal Distribution
p_X(x; σ) = exp[−x²/(2σ²)] / (√(2π) σ)
E[xⁿ] = 1·3···(n − 1) σⁿ for n even, 0 for n odd
E[|x|ⁿ] = 1·3···(n − 1) σⁿ for n = 2k,  √(2/π) 2^k k! σ^(2k+1) for n = 2k + 1
Proof: start from ∫ exp(−a x²) dx = √(π/a), a > 0, and differentiate k times with respect to a:
∫ x^(2k) exp(−a x²) dx = [1·3···(2k − 1)/2^k] √π a^(−(2k+1)/2)
Substitute a = 1/(2σ²) to obtain E[xⁿ]:
E[x^(2k)] = [1/(√(2π) σ)] ∫ x^(2k) exp[−x²/(2σ²)] dx = 1·3···(2k − 1) σ^(2k)
Now let us compute: E[x⁴] = 3σ⁴ = 3 (E[x²])²
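A small helper for the even-moment product formula (hypothetical name), giving for instance E[x²] = σ², E[x⁴] = 3σ⁴, E[x⁶] = 15σ⁶:

```python
def gaussian_even_moment(n, sigma):
    """E[x**n] = 1*3*5*...*(n-1) * sigma**n for a zero-mean Gaussian, n even."""
    assert n % 2 == 0 and n >= 2
    prod = 1.0
    for j in range(1, n, 2):   # 1, 3, ..., n-1
        prod *= j
    return prod * sigma ** n
```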
218
Normal (Gaussian) Distribution (continue – 1)
A vector-valued Gaussian random variable has the probability density function
p(x; x̄, P) = (2π)^(−k/2) |P|^(−1/2) exp[ −½ (x − x̄)ᵀ P⁻¹ (x − x̄) ]
where x̄ = E[x] is the Mean Value and P = E[(x − x̄)(x − x̄)ᵀ] is the Covariance Matrix.
If P is diagonal, P = diag[σ1², σ2², …, σk²], then the components of the random vector are uncorrelated, and
p(x; x̄, P) = ∏ over i = 1…k of exp[−(xi − x̄i)²/(2σi²)] / (√(2π) σi)
therefore the components of the random vector are also independent
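A quick numerical confirmation that, for diagonal P, the joint density factors into the product of univariate normal densities (helper names are hypothetical):

```python
import math

def mvn_pdf_diag(x, mean, variances):
    """Multivariate normal pdf for a diagonal covariance P = diag(variances)."""
    k = len(x)
    det = 1.0
    quad = 0.0
    for xi, mi, vi in zip(x, mean, variances):
        det *= vi
        quad += (xi - mi) ** 2 / vi
    return math.exp(-0.5 * quad) / math.sqrt((2.0 * math.pi) ** k * det)

def univariate_product(x, mean, variances):
    """Product of the k univariate normal densities."""
    p = 1.0
    for xi, mi, vi in zip(x, mean, variances):
        p *= math.exp(-0.5 * (xi - mi) ** 2 / vi) / math.sqrt(2.0 * math.pi * vi)
    return p
```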
219
SOLO Review of ProbabilityMonte Carlo Method
Monte Carlo methods are a class of computational algorithms that rely on repeated random sampling to compute their results. Monte Carlo methods are often used when simulating physical and mathematical systems. Because of their reliance on repeated computation and random or pseudo-random numbers, Monte Carlo methods are most suited to calculation by a computer. Monte Carlo methods tend to be used when it is infeasible or impossible to compute an exact result with a deterministic algorithm.
The term Monte Carlo method was coined in the 1940s by physicists Stanislaw Ulam, Enrico Fermi, John von Neumann, and Nicholas Metropolis, working on nuclear weapon projects in the Los Alamos National Laboratory
Stanislaw Ulam (1909 – 1984), Enrico Fermi (1901 – 1954), John von Neumann (1903 – 1957), Nicholas Constantine Metropolis (1915 – 1999)
220
Estimation of the Mean and Variance of a Random Variable (Unknown Statistics)
A random variable x may take on any value in the range −∞ to +∞. Based on a sample of k values xi, i = 1, 2, …, k, we wish to compute the sample mean m̂k and sample variance σ̂k² as estimates of the population mean m and variance σ².
Define the estimate of the population mean:
m̂k := (1/k) Σ from i = 1 to k of xi
Using E[xi] = m for all i:
E[m̂k] = (1/k) Σ E[xi] = m → Unbiased
Compute, using E[xi²] = σ² + m² and E[xi xj] = E[xi] E[xj] = m² for i ≠ j (independent samples):
E[ (1/k) Σ (xi − m̂k)² ] = [(k − 1)/k] σ² → Biased
221
Therefore, the unbiased estimate of the sample variance of the population is defined as
σ̂k² := [1/(k − 1)] Σ from i = 1 to k of (xi − m̂k)²
since E[σ̂k²] = E[ [1/(k − 1)] Σ (xi − m̂k)² ] = σ² → Unbiased
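The bias factor (k − 1)/k above also shows up as an exact algebraic identity between the two normalizations of the same sum of squares, illustrated here on a made-up sample:

```python
# Made-up sample to illustrate the two variance normalizations.
data = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
k = len(data)
m_hat = sum(data) / k
ss = sum((x - m_hat) ** 2 for x in data)
biased = ss / k          # divides by k: E[.] = (k-1)/k * sigma^2
unbiased = ss / (k - 1)  # divides by k-1: E[.] = sigma^2
```

For any sample, biased = [(k − 1)/k] · unbiased; the expectation argument above says the same of their means over repeated samples.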
223
Estimation of the Mean and Variance of a Random Variable (continue – 3)
We found E[m̂k] = m and E[σ̂k²] = σ². Let us compute the variance of the mean estimate:
σ²m̂k := E[(m̂k − m)²] = E[ ((1/k) Σ (xi − m))² ] = (1/k²) Σ over i of E[(xi − m)²] = σ²/k
(the cross terms vanish, since E[(xi − m)(xj − m)] = 0 for i ≠ j).
224
Estimation of the Mean and Variance of a Random Variable (continue – 4)
Let us compute the variance of the variance estimate:
σ²σ̂k² := E[(σ̂k² − σ²)²] = E[ ( [1/(k − 1)] Σ (xi − m̂k)² − σ² )² ]
225
Estimation of the Mean and Variance of a Random Variable (continue – 5)
Expanding, and using the fact that (xi − m), (xj − m) and (m − m̂k) are all independent for i ≠ j, one obtains for large k:
σ²σ̂k² ≈ (μ4 − σ⁴)/k,  where μ4 := E[(xi − m)⁴]
226
Estimation of the Mean and Variance of a Random Variable (continue – 6)
Define the kurtosis of the random variable xi: λ := μ4/σ⁴. Then
σ²σ̂k² ≈ (λ − 1) σ⁴ / k
227
Estimation of the Mean and Variance of a Random Variable (continue – 7)
For high values of k, according to the Central Limit Theorem, the estimates m̂k and σ̂k² are approximately Gaussian random variables.
We want to find a region around σ̂k² that will contain σ² with a predefined probability φ, as a function of the number of runs k:
Prob[ |σ̂k² − σ²| ≤ nσ σσ̂k² ] = φ
Since σ̂k² is approximately Gaussian, nσ is given by solving
(1/√(2π)) ∫ from −nσ to +nσ exp(−ζ²/2) dζ = φ
Cumulative probability within nσ standard deviations for a Gaussian random variable:
nσ      φ
1.000   0.6827
1.645   0.9000
1.960   0.9500
2.576   0.9900
228
Estimation of the Mean and Variance of a Random Variable (continue – 8)
With σσ̂k² = σ² √((λ − 1)/k) this gives
σ̂k² − nσ √((λ − 1)/k) σ² ≤ σ² ≤ σ̂k² + nσ √((λ − 1)/k) σ²
i.e.
σ̂k² / [1 + nσ √((λ − 1)/k)] ≤ σ² ≤ σ̂k² / [1 − nσ √((λ − 1)/k)]
or, for the standard deviation,
σ− := σ̂k / √(1 + nσ √((λ − 1)/k)) ≤ σ ≤ σ̂k / √(1 − nσ √((λ − 1)/k)) =: σ+
231
Estimation of the Mean and Variance of a Random Variable (continue – 10)
Monte-Carlo Procedure
σ± := σ̂k0 / √(1 ∓ nσ √((λ − 1)/k))
1  Choose the confidence level φ and find the corresponding nσ using the normal (Gaussian) distribution:
nσ      φ
1.000   0.6827
1.645   0.9000
1.960   0.9500
2.576   0.9900
2  Run a few samples, k0 > 20, and estimate λ according to
λ̂ := [ (1/k0) Σ (xi − m̂k0)⁴ ] / [ (1/k0) Σ (xi − m̂k0)² ]²,  with m̂k0 := (1/k0) Σ from i = 1 to k0 of xi
3  Compute σ− and σ+ as functions of k.
4  Find k for which Prob[ |σ̂k² − σ²| ≤ nσ σσ̂k² ] = φ meets the required accuracy.
5  Run the remaining k − k0 simulations.
232
Estimation of the Mean and Variance of a Random Variable (continue – 11)
Monte-Carlo Procedure – Example: assume a Gaussian distribution, so λ = 3.
1  Choose the confidence level φ = 95%, which gives nσ = 1.96.
2  The kurtosis is λ = 3.
3  Find k for which Prob[ |σ̂k² − σ²| ≤ 1.96 √(2/k) σ² ] = 0.95.
Assume we also require |σ̂k² − σ²| ≤ 0.1 σ² with probability φ = 95%:
1.96 √(2/k) = 0.1 ⇒ k ≈ 768
4  Run k ≈ 800 simulations.
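Step 3 of the example amounts to solving nσ √((λ − 1)/k) = (relative tolerance) for k; a one-line sketch (the helper name `required_runs` is hypothetical):

```python
def required_runs(n_sigma, kurtosis, rel_tol):
    """Number of Monte-Carlo runs k so that n_sigma * sqrt((kurtosis - 1)/k)
    equals the desired relative tolerance on the variance estimate."""
    return (n_sigma / rel_tol) ** 2 * (kurtosis - 1.0)
```

With n_sigma = 1.96, kurtosis = 3 and a 10% tolerance this gives k ≈ 768, matching the "run about 800 simulations" rule of thumb above.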
233
Estimation of the Mean and Variance of a Random Variable (continue – 12)
Kurtosis of the random variable xi:
λ := E[(xi − m)⁴] / (E[(xi − m)²])² = E[(xi − m)⁴]/σ⁴
Kurtosis (from the Greek word κυρτός, kyrtos or kurtos, meaning bulging) is a measure of the "peakedness" of the probability distribution of a real-valued random variable. Higher kurtosis means more of the variance is due to infrequent extreme deviations, as opposed to frequent modestly-sized deviations.
In 1905 Pearson defined kurtosis as a measure of departure from normality in a paper published in Biometrika; λ = 3 for the normal distribution, and the terms leptokurtic (λ > 3), mesokurtic (λ = 3) and platykurtic (λ < 3) were introduced. (Karl Pearson, 1857 – 1936)
A leptokurtic distribution has a more acute "peak" around the mean (that is, a higher probability than a normally distributed variable of values near the mean) and "fat tails" (a higher probability of extreme values). A platykurtic distribution has a smaller "peak" around the mean and "thin tails".
234
Estimation of the Mean and Variance of a Random Variable (continue – 13)
Distribution | Functional representation | Kurtosis λ | Excess kurtosis λ − 3
Normal | exp[−(x − μ)²/(2σ²)] / (√(2π) σ) | 3 | 0
Laplace | exp(−|x − μ|/b) / (2b) | 6 | 3
Hyperbolic secant | ½ sech(πx/2) | 5 | 2
Uniform | 1/(b − a) for a ≤ x ≤ b, 0 otherwise | 1.8 | −1.2
Wigner semicircle | [2/(πR²)] √(R² − x²) for |x| ≤ R, 0 otherwise | 2 | −1
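The table entry for the uniform distribution can be reproduced by direct midpoint-rule integration of the central moments: on [−1, 1] with density 1/2, σ² = 1/3 and μ4 = 1/5, so λ = (1/5)/(1/3)² = 1.8.

```python
# Central moments of the uniform density 1/2 on [-1, 1], midpoint rule.
n = 20000
h = 2.0 / n
m2 = sum(((i + 0.5) * h - 1.0) ** 2 for i in range(n)) * h / 2.0
m4 = sum(((i + 0.5) * h - 1.0) ** 4 for i in range(n)) * h / 2.0
kurtosis = m4 / m2 ** 2
```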
235
Estimation of the Mean and Variance of a Random Variable (continue – 14)
Skewness of the random variable xi:
γ := E[(xi − m)³] / (E[(xi − m)²])^(3/2)
1  Negative skew: the left tail is longer; the mass of the distribution is concentrated on the right of the figure. The distribution is said to be left-skewed: more data in the left tail than would be expected in a normal distribution.
2  Positive skew: the right tail is longer; the mass of the distribution is concentrated on the left of the figure. The distribution is said to be right-skewed: more data in the right tail than would be expected in a normal distribution.
Karl Pearson suggested two simpler calculations as a measure of skewness:
(mean − mode) / standard deviation
3 (mean − median) / standard deviation
236
SOLO Review of ProbabilityEstimation of the Mean and Variance of a Random Variable using a Recursive Filter (Unknown Statistics)
We found that using k measurements the estimated mean and variance are given in batch form by:
∑=
=k
iik x
kx
1
1:ˆ
A random variable, x, may take on any values in the range - ∞ to + ∞.Based on a sample of k values, xi, i = 1,2,…,k, we wish to estimate the sample mean, ,and the variance pk, by a Recursive Filter
kx
The k+1 measurement will give:
( )1
1
11 ˆ
1
1
1
1ˆ +
+
=+ +
+=
+= ∑ kk
k
iik xxk
kx
kx
( )kkkk xxk
xx ˆ1
1ˆˆ 11 −
++= ++
Therefore the Recursive Filter form for the k+1 measurement will be:
( )∑=
−−
=k
ikik xx
kp
1
2ˆ1
1:
( )∑+
=++ −=
1
1
211 ˆ
1 k
ikik xx
kp
237
SOLO    Review of Probability
Estimation of the Mean and Variance of a Random Variable using a Recursive Filter (Unknown Statistics) (continue - 1)

Using k+1 measurements, the estimated variance in batch form is

$$ (k+1)\,p_{k+1} = \sum_{i=1}^{k+1}\left(x_i-\hat{x}_{k+1}\right)^2 = \sum_{i=1}^{k}\left[\left(x_i-\hat{x}_k\right)-\frac{x_{k+1}-\hat{x}_k}{k+1}\right]^2 + \left(x_{k+1}-\hat{x}_{k+1}\right)^2 $$

Using $\sum_{i=1}^{k}(x_i-\hat{x}_k)=0$ and the mean recursion, which gives $x_{k+1}-\hat{x}_{k+1}=\frac{k}{k+1}\left(x_{k+1}-\hat{x}_k\right)$:

$$ (k+1)\,p_{k+1} = k\,p_k + \frac{k}{(k+1)^2}\left(x_{k+1}-\hat{x}_k\right)^2 + \frac{k^2}{(k+1)^2}\left(x_{k+1}-\hat{x}_k\right)^2 = k\,p_k + \frac{k}{k+1}\left(x_{k+1}-\hat{x}_k\right)^2 $$

Therefore the Recursive Filter form of the variance is:

$$ p_{k+1} = p_k + \frac{1}{k+1}\left[\frac{k}{k+1}\left(x_{k+1}-\hat{x}_k\right)^2 - p_k\right] $$

together with the mean recursion

$$ \hat{x}_{k+1} = \hat{x}_k + \frac{1}{k+1}\left(x_{k+1}-\hat{x}_k\right) $$
238
SOLO    Review of Probability
Estimation of the Mean and Variance of a Random Variable using a Recursive Filter (Unknown Statistics) (continue - 2)

Since the mean recursion $\hat{x}_{k+1}=\hat{x}_k+\frac{1}{k+1}(x_{k+1}-\hat{x}_k)$ implies $x_{k+1}-\hat{x}_{k+1}=\frac{k}{k+1}(x_{k+1}-\hat{x}_k)$, the variance recursion can also be written as:

$$ p_{k+1} = p_k + \frac{1}{k+1}\left[\left(x_{k+1}-\hat{x}_{k+1}\right)\left(x_{k+1}-\hat{x}_k\right) - p_k\right] $$
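The two recursions above can be checked against the batch formulas in a few lines. The sketch below (plain Python; the Gaussian test data and sample size are arbitrary illustration values, not from the slides) runs both forms on the same data and confirms they agree:

```python
import random

def batch_mean_var(xs):
    """Batch estimates: mean xhat_k = (1/k) sum x_i, variance p_k = (1/k) sum (x_i - xhat_k)^2."""
    k = len(xs)
    m = sum(xs) / k
    p = sum((x - m) ** 2 for x in xs) / k
    return m, p

def recursive_mean_var(xs):
    """One-pass recursive filter equivalent to the batch formulas above."""
    m, p = xs[0], 0.0                 # after the first sample: mean = x_1, variance = 0
    for k, x in enumerate(xs[1:], start=1):   # k samples already processed
        d = x - m                     # innovation x_{k+1} - xhat_k
        m = m + d / (k + 1)           # mean update
        p = p + ((k / (k + 1)) * d * d - p) / (k + 1)   # variance update
    return m, p

random.seed(0)
data = [random.gauss(2.0, 1.5) for _ in range(1000)]
mb, pb = batch_mean_var(data)
mr, pr = recursive_mean_var(data)
print(mb, mr, pb, pr)
```

The recursive form never stores the data, which is the point of the filter formulation.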
239
SOLO    Review of Probability
Estimation of the Mean and Variance of a Random Variable with Known Statistics Moments Using a Discrete Recursive Filter

Estimate the value of a constant x, given discrete measurements of x corrupted by an uncorrelated Gaussian noise sequence with zero mean and variance r_0. The scalar equations describing this situation are:

System: $x_{k+1} = x_k$ (general form $x_{k+1}=\Phi_k x_k+\Gamma_k w_k$ with $\Phi_k=1,\ \Gamma_k=0$)
Measurement: $z_k = x_k + v_k,\quad v_k \sim N(0,r_0)$ (general form $z_k=H_k x_k+v_k$ with $H_k=1$)

The Discrete Kalman Filter is given by:

$$ \hat{x}_{k+1}(-) = \hat{x}_k(+) $$
$$ p_{k+1}(-) = \Phi_k\,p_k(+)\,\Phi_k^T + \Gamma_k Q_k \Gamma_k^T = p_k(+) $$
$$ \hat{x}_{k+1}(+) = \hat{x}_{k+1}(-) + \underbrace{p_{k+1}(-)\left[p_{k+1}(-)+r_0\right]^{-1}}_{K_{k+1}}\left[z_{k+1}-\hat{x}_{k+1}(-)\right] $$
$$ p_{k+1}(+) = p_{k+1}(-) - p_{k+1}(-)\left[p_{k+1}(-)+r_0\right]^{-1}p_{k+1}(-) = \frac{p_{k+1}(-)\,r_0}{p_{k+1}(-)+r_0} $$
240
SOLO    Review of Probability
Estimation of the Mean and Variance of a Random Variable with Known Statistics Moments Using a Discrete Recursive Filter (continue - 1)

Estimate the value of a constant x, given discrete measurements of x corrupted by an uncorrelated Gaussian noise sequence with zero mean and variance r_0. We found that the Discrete Kalman Filter is given by:

$$ \hat{x}_{k+1}(+) = \hat{x}_k(+) + K_{k+1}\left[z_{k+1}-\hat{x}_k(+)\right] \qquad K_{k+1}=\frac{p_k(+)}{p_k(+)+r_0} $$

Since $p_{k+1}(-)=p_k(+)$, the covariance recursion $p_{k+1}(+)=\dfrac{p_k(+)\,r_0}{p_k(+)+r_0}$ gives:

$$ k=0:\quad p_1(+)=\frac{p_0\,r_0}{p_0+r_0} \qquad k=1:\quad p_2(+)=\frac{p_1(+)\,r_0}{p_1(+)+r_0}=\frac{p_0\,r_0}{2\,p_0+r_0} \qquad\cdots $$

$$ p_k(+)=\frac{p_0\,r_0}{k\,p_0+r_0}=\frac{p_0}{1+k\,p_0/r_0} $$

so the gain and filter become:

$$ K_{k+1}=\frac{p_0}{(k+1)\,p_0+r_0} \qquad\qquad \hat{x}_{k+1}(+)=\hat{x}_k(+)+\frac{p_0}{r_0+(k+1)\,p_0}\left[z_{k+1}-\hat{x}_k(+)\right] $$
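The scalar filter above is short enough to run directly. A minimal sketch (plain Python; the true value, priors, and sample count are arbitrary illustration values) runs the recursion and checks the covariance against the closed form p_k = p_0 r_0/(k p_0 + r_0) derived on this slide:

```python
import random

def kalman_constant(zs, p0, r0, x0=0.0):
    """Scalar Kalman filter for x_{k+1} = x_k, z_k = x_k + v_k, v_k ~ N(0, r0)."""
    x, p = x0, p0
    for z in zs:
        # prediction is trivial here: x(-) = x(+), p(-) = p(+)
        K = p / (p + r0)         # Kalman gain
        x = x + K * (z - x)      # measurement update
        p = p * r0 / (p + r0)    # covariance update
    return x, p

random.seed(1)
x_true, p0, r0, N = 5.0, 10.0, 2.0, 200
zs = [x_true + random.gauss(0.0, r0 ** 0.5) for _ in range(N)]
x_hat, p_N = kalman_constant(zs, p0, r0)

p_closed = p0 * r0 / (N * p0 + r0)   # closed form from the slide
print(x_hat, p_N, p_closed)
```

For large p_0 the filter is essentially the recursive sample mean of the previous slides, as the gain 1/(k+1+r_0/p_0) shows.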
241
SOLO    Review of Probability
Estimation of the Mean and Variance of a Random Variable with Known Statistics Moments Using a Continuous Recursive Filter

Estimate the value of a constant x, given continuous measurements of x corrupted by an uncorrelated Gaussian noise with zero mean and intensity r. The scalar equations describing this situation are:

System: $\dot{x}=0$ (general form $\dot{x}=A\,x+\Gamma\,w$ with $A=0,\ \Gamma=0$)
Measurement: $z = x + v,\quad v \sim N(0,r)$ (general form $z=H\,x+v$ with $H=1$)

The Continuous Kalman Filter is given by:

$$ \dot{\hat{x}}(t)=A\,\hat{x}(t)+\underbrace{p(t)\,H^T r^{-1}}_{K}\left[z(t)-\hat{x}(t)\right],\qquad \hat{x}(0)=\hat{x}_0 $$
$$ \dot{p}(t)=A\,p(t)+p(t)\,A^T+G\,Q\,G^T-p(t)\,H^T r^{-1} H\,p(t) = -p^2(t)\,r^{-1},\qquad p(0)=p_0 $$

where $p(t):=E\left\{[x(t)-\hat{x}(t)][x(t)-\hat{x}(t)]^T\right\}$. Separating variables:

$$ \int_{p_0}^{p}\frac{dp}{p^2}=-\int_0^t\frac{dt}{r} \quad\Rightarrow\quad p(t)=\frac{p_0}{1+p_0\,t/r} $$

$$ K=p(t)\,r^{-1}=\frac{p_0/r}{1+p_0\,t/r} \qquad\qquad \dot{\hat{x}}=\frac{p_0}{r+p_0\,t}\left[z-\hat{x}(t)\right] $$
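The closed-form covariance p(t) = p_0/(1 + p_0 t/r) can be sanity-checked by integrating the Riccati equation ṗ = -p²/r numerically. A minimal sketch (plain Python, forward-Euler; the constants p_0, r, T and step count are arbitrary illustration values):

```python
def integrate_p(p0, r, T, n_steps=100000):
    """Forward-Euler integration of dp/dt = -p^2 / r."""
    dt = T / n_steps
    p = p0
    for _ in range(n_steps):
        p += -p * p / r * dt
    return p

p0, r, T = 4.0, 0.5, 3.0
p_num = integrate_p(p0, r, T)
p_closed = p0 / (1.0 + p0 * T / r)   # closed form from the slide
print(p_num, p_closed)
```

With this step size the Euler error is well below 10^-3, so the two values agree to three decimals.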
242
SOLO    Review of Probability
Monte Carlo Approximation

Monte Carlo runs generate a set of P samples $x^{(L)},\ L=1,\dots,P$, that approximate the filtering distribution $p(x)$. With P samples, expectations with respect to the filtering distribution are approximated by

$$ \int f(x)\,p(x)\,dx \approx \frac{1}{P}\sum_{L=1}^{P} f\!\left(x^{(L)}\right) $$

and, in the usual way for Monte Carlo, this gives all the moments of the distribution up to some degree of approximation:

$$ \mu_1 = E[x] = \int x\,p(x)\,dx \approx \frac{1}{P}\sum_{L=1}^{P} x^{(L)} $$

$$ \mu_n = E\left[(x-\mu_1)^n\right] = \int (x-\mu_1)^n\,p(x)\,dx \approx \frac{1}{P}\sum_{L=1}^{P}\left(x^{(L)}-\mu_1\right)^n $$
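These sample-average approximations are one loop over the samples. A minimal sketch (plain Python; the Gaussian sampler and sample count are arbitrary illustration values) estimates the mean and the first central moments of N(1, 2²), for which μ₁ = 1, μ₂ = 4, μ₃ = 0:

```python
import random

def mc_moments(sampler, P, n_max=3):
    """Monte Carlo approximation of mu_1 = E[x] and central moments mu_n = E[(x - mu_1)^n]."""
    xs = [sampler() for _ in range(P)]
    mu1 = sum(xs) / P
    central = {n: sum((x - mu1) ** n for x in xs) / P for n in range(2, n_max + 1)}
    return mu1, central

random.seed(2)
mu1, central = mc_moments(lambda: random.gauss(1.0, 2.0), P=200000)
print(mu1, central[2], central[3])
```

The error of each estimate shrinks like 1/√P, which is the usual Monte Carlo rate.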
243
SOLO    Review of Probability
Types of Estimation

[Figure: three timelines showing the available measurement data relative to the estimation time]

Filtering: use all the measurement data up to the present time t to estimate at the present time t.

Smoothing: use all the measurement data up to a future time t+τ (τ > 0) to estimate at the present time t.

Prediction: use all the measurement data up to the present time t to predict the outcome at a future time t+τ (τ > 0).
244
SOLO    Review of Probability
Conditional Expectations and Their Smoothing Property

The Conditional Expectation is defined as:

$$ E[x\,|\,y] = \int_{-\infty}^{+\infty} x\,p_{x|y}(x\,|\,y)\,dx $$

Similarly, for a function g(x,y) of x and y, the Conditional Expectation is defined as:

$$ E[g(x,y)\,|\,y] = \int_{-\infty}^{+\infty} g(x,y)\,p_{x|y}(x\,|\,y)\,dx $$

The smoothing property of the Expectation states that the Expected value of the Conditional Expectation is equal to the Unconditional Expected Value:

$$ E\left\{E[x\,|\,y]\right\} = \int_{-\infty}^{+\infty}\left[\int_{-\infty}^{+\infty} x\,p_{x|y}(x\,|\,y)\,dx\right]p_y(y)\,dy = \int\!\!\int x\,p_{x|y}(x\,|\,y)\,p_y(y)\,dx\,dy = \int\!\!\int x\,p_{x,y}(x,y)\,dx\,dy = \int_{-\infty}^{+\infty} x\,p_x(x)\,dx = E[x] $$

This relation is also called the Law of Iterated Expectation, summarized as:

$$ E\left\{E[x\,|\,y]\right\} = E[x] $$
245
SOLO    Review of Probability
Gaussian Mixture Equations

A mixture is a p.d.f. given by a weighted sum of p.d.f.s with the weights summing up to unity. A Gaussian Mixture is a p.d.f. consisting of a weighted sum of Gaussian densities:

$$ p(x) = \sum_{j=1}^{n} \mathcal{N}\!\left(x;\bar{x}_j,P_j\right)p_j \qquad \text{where}\qquad \sum_{j=1}^{n} p_j = 1 $$

Denote by A_j the event that x is Gaussian distributed with mean $\bar{x}_j$ and covariance P_j, with A_j, j = 1,…,n, mutually exclusive and exhaustive:

$$ A_1\cup A_2\cup\dots\cup A_n = S,\qquad A_i\cap A_j=\varnothing\ \ \forall i\neq j,\qquad P(A_j)=p_j $$

Therefore:

$$ p(x) = \sum_{j=1}^{n}\mathcal{N}\!\left(x;\bar{x}_j,P_j\right)p_j = \sum_{j=1}^{n} p\!\left(x\,|\,A_j\right)P\!\left(A_j\right) $$
246
SOLO    Review of Probability
Gaussian Mixture Equations (continue - 1)

A Gaussian Mixture is a p.d.f. consisting of a weighted sum of Gaussian densities:

$$ p(x) = \sum_{j=1}^{n}\mathcal{N}\!\left(x;\bar{x}_j,P_j\right)p_j = \sum_{j=1}^{n} p\!\left(x\,|\,A_j\right)P\!\left(A_j\right) $$

The mean of such a mixture is:

$$ \bar{x} = E[x] = \sum_{j=1}^{n} E\left[x\,|\,A_j\right]p_j = \sum_{j=1}^{n}\bar{x}_j\,p_j $$

The covariance of the mixture is:

$$ E\left[(x-\bar{x})(x-\bar{x})^T\right] = \sum_{j=1}^{n} E\left[\left(x-\bar{x}_j+\bar{x}_j-\bar{x}\right)\left(x-\bar{x}_j+\bar{x}_j-\bar{x}\right)^T|\,A_j\right]p_j $$

$$ = \sum_{j=1}^{n} E\left[(x-\bar{x}_j)(x-\bar{x}_j)^T|\,A_j\right]p_j + \sum_{j=1}^{n}(\bar{x}_j-\bar{x})(\bar{x}_j-\bar{x})^T p_j + \underbrace{\sum_{j=1}^{n} E\left[(x-\bar{x}_j)\,|\,A_j\right](\bar{x}_j-\bar{x})^T p_j + \sum_{j=1}^{n}(\bar{x}_j-\bar{x})\,E\left[(x-\bar{x}_j)^T|\,A_j\right]p_j}_{=\,0} $$

where the cross terms vanish because $E\left[(x-\bar{x}_j)\,|\,A_j\right]=0$.
247
SOLO    Review of Probability
Gaussian Mixture Equations (continue - 2)

The covariance of the mixture is therefore:

$$ E\left[(x-\bar{x})(x-\bar{x})^T\right] = \sum_{j=1}^{n} P_j\,p_j + \sum_{j=1}^{n}(\bar{x}_j-\bar{x})(\bar{x}_j-\bar{x})^T p_j = \bar{P} + \tilde{P} $$

where

$$ \tilde{P} := \sum_{j=1}^{n}(\bar{x}_j-\bar{x})(\bar{x}_j-\bar{x})^T p_j = \sum_{j=1}^{n}\bar{x}_j\,\bar{x}_j^T p_j - \bar{x}\,\bar{x}^T $$

is the "spread of the means" term (using $\sum_j p_j=1$ and $\bar{x}=\sum_j \bar{x}_j p_j$), so that:

$$ E\left[(x-\bar{x})(x-\bar{x})^T\right] = \sum_{j=1}^{n} P_j\,p_j + \sum_{j=1}^{n}\bar{x}_j\,\bar{x}_j^T p_j - \bar{x}\,\bar{x}^T $$

Note: Since we developed only first and second moments of the mixture, those relations will still be correct even if the random variables in the mixture are not Gaussian.
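The mixture mean and covariance formulas above translate directly into a few array operations. A minimal sketch (Python with NumPy; the two-component weights, means, and covariances are arbitrary illustration values):

```python
import numpy as np

def mixture_moments(weights, means, covs):
    """Mean and covariance of a mixture:
    xbar = sum_j p_j xbar_j
    P    = sum_j p_j P_j + sum_j p_j (xbar_j - xbar)(xbar_j - xbar)^T."""
    w = np.asarray(weights, dtype=float)
    means = np.asarray(means, dtype=float)
    covs = np.asarray(covs, dtype=float)
    xbar = np.einsum('j,jd->d', w, means)
    d = means - xbar                               # component-mean deviations
    spread = np.einsum('j,jd,je->de', w, d, d)     # "spread of the means" term P~
    Pbar = np.einsum('j,jde->de', w, covs)         # weighted component covariances
    return xbar, Pbar + spread

w = [0.3, 0.7]
means = [[0.0, 0.0], [2.0, -1.0]]
covs = [np.eye(2), 2.0 * np.eye(2)]
xbar, P = mixture_moments(w, means, covs)
print(xbar, P)
```

For these numbers the hand computation gives xbar = [1.4, -0.7] and P = 1.7 I plus the spread term [[0.84, -0.42], [-0.42, 0.21]].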
248
SOLO    Recursive Bayesian Estimation
Linear Gaussian Systems

A Linear Combination of Independent Gaussian random variables is also a Gaussian random variable:

$$ S_m := a_1 X_1 + a_2 X_2 + \dots + a_m X_m $$

Proof: For the Gaussian distribution

$$ p_{X_i}(X) = \frac{1}{\sqrt{2\pi}\,\sigma_i}\exp\left[-\frac{(X-\mu_i)^2}{2\sigma_i^2}\right] $$

the characteristic (moment-generating) function is

$$ \Phi_{X_i}(\omega) := E\left[e^{j\omega X_i}\right] = \int_{-\infty}^{+\infty} e^{j\omega X}\,p_{X_i}(X)\,dX = \exp\left(j\omega\mu_i - \tfrac{1}{2}\sigma_i^2\omega^2\right) $$

Define $Y_i := a_i X_i$, so that $p_{Y_i}(Y)=\frac{1}{|a_i|}\,p_{X_i}\!\left(\frac{Y}{a_i}\right)$ and

$$ \Phi_{Y_i}(\omega) = \int_{-\infty}^{+\infty} e^{j\omega Y}\,p_{Y_i}(Y)\,dY = \int_{-\infty}^{+\infty} e^{j\omega a_i X}\,p_{X_i}(X)\,dX = \exp\left(j\omega a_i\mu_i - \tfrac{1}{2}a_i^2\sigma_i^2\omega^2\right) $$

Since the X_i (hence the Y_i) are independent, the characteristic function of the sum is the product:

$$ \Phi_{S_m}(\omega) = E\left[e^{j\omega(Y_1+\dots+Y_m)}\right] = \prod_{i=1}^{m}\Phi_{Y_i}(\omega) = \exp\left[j\omega\left(a_1\mu_1+\dots+a_m\mu_m\right) - \frac{\omega^2}{2}\left(a_1^2\sigma_1^2+\dots+a_m^2\sigma_m^2\right)\right] $$
249
SOLO    Recursive Bayesian Estimation
Linear Gaussian Systems (continue)

Proof (continue - 1): We found

$$ \Phi_{S_m}(\omega) = \exp\left[j\omega\left(a_1\mu_1+\dots+a_m\mu_m\right) - \frac{\omega^2}{2}\left(a_1^2\sigma_1^2+\dots+a_m^2\sigma_m^2\right)\right] $$

This is the characteristic function of a Gaussian. Therefore the Linear Combination of Independent Gaussian Random Variables $S_m = a_1X_1+\dots+a_mX_m$ is a Gaussian Random Variable with

$$ \mu_S = a_1\mu_1 + a_2\mu_2 + \dots + a_m\mu_m \qquad\qquad \sigma_S^2 = a_1^2\sigma_1^2 + a_2^2\sigma_2^2 + \dots + a_m^2\sigma_m^2 $$

and the S_m probability distribution is:

$$ p_{S_m}(x) = \mathcal{N}\!\left(x;\mu_S,\sigma_S^2\right) = \frac{1}{\sqrt{2\pi}\,\sigma_S}\exp\left[-\frac{(x-\mu_S)^2}{2\sigma_S^2}\right] $$
250
SOLO    Recursive Bayesian Estimation
Linear Gaussian Markov Systems

A Linear Gaussian Markov System is defined as

$$ x_k = \Phi_{k-1}\,x_{k-1} + G_{k-1}\,u_{k-1} + \Gamma_{k-1}\,w_{k-1} \qquad\qquad z_k = H_k\,x_k + v_k $$

with w_{k-1} and v_k white noises, zero mean, Gaussian, and independent:

$$ e_x(k):=x(k)-E[x(k)],\qquad E\left[e_x(k)\,e_x^T(k)\right]=P_x(k) $$
$$ e_w(k):=w(k)-\underbrace{E[w(k)]}_{0},\qquad E\left[e_w(k)\,e_w^T(l)\right]=Q(k)\,\delta_{k,l} $$
$$ e_v(k):=v(k)-\underbrace{E[v(k)]}_{0},\qquad E\left[e_v(k)\,e_v^T(l)\right]=R(k)\,\delta_{k,l} $$
$$ E\left[e_w(k)\,e_v^T(l)\right]=0 \qquad\qquad \delta_{k,l}=\begin{cases}1 & k=l\\ 0 & k\neq l\end{cases} $$

$$ p_w(w)=\mathcal{N}(w;0,Q)=\frac{1}{(2\pi)^{n/2}|Q|^{1/2}}\exp\left(-\tfrac{1}{2}w^TQ^{-1}w\right) \qquad p_v(v)=\mathcal{N}(v;0,R)=\frac{1}{(2\pi)^{p/2}|R|^{1/2}}\exp\left(-\tfrac{1}{2}v^TR^{-1}v\right) $$

$$ p_x\!\left(x_{t=0}\right)=\mathcal{N}\!\left(x_0;\bar{x}_{0|0},P_{0|0}\right)=\frac{1}{(2\pi)^{n/2}\left|P_{0|0}\right|^{1/2}}\exp\left[-\tfrac{1}{2}\left(x_0-\bar{x}_{0|0}\right)^TP_{0|0}^{-1}\left(x_0-\bar{x}_{0|0}\right)\right] $$
251
SOLO    Recursive Bayesian Estimation
Linear Gaussian Markov Systems (continue - 2)

Prediction phase (before the z_k measurement), from $x_k=\Phi_{k-1}x_{k-1}+G_{k-1}u_{k-1}+\Gamma_{k-1}w_{k-1}$:

$$ \hat{x}_{k|k-1} := E\left[x_k\,|\,Z^{1:k-1}\right] = \Phi_{k-1}\,E\left[x_{k-1}\,|\,Z^{1:k-1}\right] + G_{k-1}u_{k-1} + \Gamma_{k-1}\underbrace{E\left[w_{k-1}\,|\,Z^{1:k-1}\right]}_{0} $$

or

$$ \hat{x}_{k|k-1} = \Phi_{k-1}\,\hat{x}_{k-1|k-1} + G_{k-1}\,u_{k-1} $$

The prediction covariance is

$$ P_{k|k-1} := E\left[\left(x_k-\hat{x}_{k|k-1}\right)\left(x_k-\hat{x}_{k|k-1}\right)^T\Big|\,Z^{1:k-1}\right] = E\left\{\left[\Phi_{k-1}\left(x_{k-1}-\hat{x}_{k-1|k-1}\right)+\Gamma_{k-1}w_{k-1}\right]\left[\cdot\right]^T\Big|\,Z^{1:k-1}\right\} $$

Expanding, with $E\left[(x_{k-1}-\hat{x}_{k-1|k-1})\,w_{k-1}^T\right]=0$:

$$ P_{k|k-1} = \Phi_{k-1}\,P_{k-1|k-1}\,\Phi_{k-1}^T + \Gamma_{k-1}\,Q_{k-1}\,\Gamma_{k-1}^T $$

Since $x_k$ is a Linear Combination of Independent Gaussian Random Variables:

$$ p\left(x_k\,|\,Z^{1:k-1}\right) = \mathcal{N}\!\left(x_k;\hat{x}_{k|k-1},P_{k|k-1}\right) $$
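The prediction step above is two matrix expressions. A minimal sketch (Python with NumPy; the constant-velocity state model, time step, and all numeric values are hypothetical illustration choices, not from the slides):

```python
import numpy as np

def predict(x_hat, P, Phi, G, u, Gamma, Q):
    """Kalman time update:
    x_{k|k-1} = Phi x_{k-1|k-1} + G u_{k-1}
    P_{k|k-1} = Phi P_{k-1|k-1} Phi^T + Gamma Q Gamma^T."""
    x_pred = Phi @ x_hat + G @ u
    P_pred = Phi @ P @ Phi.T + Gamma @ Q @ Gamma.T
    return x_pred, P_pred

dt = 0.1
Phi = np.array([[1.0, dt], [0.0, 1.0]])        # position-velocity transition
G = np.zeros((2, 1)); u = np.zeros(1)          # no deterministic input
Gamma = np.array([[0.5 * dt**2], [dt]])        # acceleration-noise coupling
Q = np.array([[0.2]])                          # process-noise intensity
x_hat = np.array([1.0, 2.0]); P = np.eye(2)
x_pred, P_pred = predict(x_hat, P, Phi, G, u, Gamma, Q)
print(x_pred, P_pred)
```

Note that the predicted covariance is symmetric by construction, and it only grows in the prediction step; only the measurement update shrinks it.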
252
SOLO    Random Variables

Random Variable: a variable x determined by the outcome Ω of a random experiment: $x = x(\Omega)$.

Random Process or Stochastic Process: a function of time x determined by the outcome Ω of a random experiment: $x(t)=x(t,\Omega)$. This is a family, or an ensemble, of functions of time, in general different for each outcome Ω.

Mean or Ensemble Average of the Random Process:

$$ \bar{x}(t) := E\left[x(t,\Omega)\right] = \int_{-\infty}^{+\infty}\xi\,p_{x_t}(\xi)\,d\xi $$

Autocorrelation of the Random Process:

$$ R(t_1,t_2) := E\left[x(t_1,\Omega)\,x(t_2,\Omega)\right] = \int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty}\xi\,\eta\,p_{x_{t_1},x_{t_2}}(\xi,\eta)\,d\xi\,d\eta $$

Autocovariance of the Random Process:

$$ C(t_1,t_2) := E\left\{\left[x(t_1,\Omega)-\bar{x}(t_1)\right]\left[x(t_2,\Omega)-\bar{x}(t_2)\right]\right\} = R(t_1,t_2) - \bar{x}(t_1)\,\bar{x}(t_2) $$
253
SOLO    Random Variables
Stationarity of a Random Process

1. Wide-Sense Stationarity of a Random Process:
• The Mean Average of the Random Process is time invariant:

$$ \bar{x}(t) := E\left[x(t,\Omega)\right] = \int_{-\infty}^{+\infty}\xi\,p_{x_t}(\xi)\,d\xi = \bar{x} = const. $$

• The Autocorrelation of the Random Process is of the form:

$$ R(t_1,t_2) = R(t_1-t_2) = R(\tau),\qquad \tau:=t_1-t_2 $$

Since $R(t_1,t_2)=E[x(t_1)x(t_2)]=R(t_2,t_1)$, we have $R(\tau)=R(-\tau)$.

Power Spectrum or Power Spectral Density of a Stationary Random Process:

$$ S(\omega) := \int_{-\infty}^{+\infty} R(\tau)\,e^{-j\omega\tau}\,d\tau $$

2. Strict-Sense Stationarity of a Random Process: all probability density functions are time invariant: $p_{x_t}(\omega)=p_x(\omega)=const.$

Ergodicity: a Stationary Random Process for which Time Average = Ensemble Average:

$$ \overline{x(t,\Omega)} := \lim_{T\to\infty}\frac{1}{2T}\int_{-T}^{+T} x(t,\Omega)\,dt \;\overset{Ergodicity}{=}\; E\left[x(t,\Omega)\right] $$
254
SOLO    Random Variables
Ergodicity:

For an Ergodic Random Process define the Time Autocorrelation:

$$ R(\tau,\Omega) := \overline{x(t,\Omega)\,x(t+\tau,\Omega)} = \lim_{T\to\infty}\frac{1}{2T}\int_{-T}^{+T} x(t,\Omega)\,x(t+\tau,\Omega)\,dt $$

Finite Signal Energy Assumption:

$$ R(0,\Omega) = \lim_{T\to\infty}\frac{1}{2T}\int_{-T}^{+T} x^2(t,\Omega)\,dt < \infty $$

Define the truncated process and its autocorrelation:

$$ x_T(t,\Omega) := \begin{cases} x(t,\Omega) & -T\le t\le T\\ 0 & \text{otherwise}\end{cases} \qquad\qquad R_T(\tau) := \frac{1}{2T}\int_{-\infty}^{+\infty} x_T(t,\Omega)\,x_T(t+\tau,\Omega)\,dt $$

Splitting the integral (for τ > 0, the product $x_T(t)x_T(t+\tau)$ vanishes for $t>T-\tau$):

$$ R_T(\tau) = \frac{1}{2T}\int_{-T}^{+T} x(t,\Omega)\,x(t+\tau,\Omega)\,dt - \frac{1}{2T}\int_{T-\tau}^{T} x(t,\Omega)\,x(t+\tau,\Omega)\,dt $$

The first term tends to R(τ) as T → ∞:

$$ \lim_{T\to\infty}\frac{1}{2T}\int_{-T}^{+T} x(t,\Omega)\,x(t+\tau,\Omega)\,dt = R(\tau) $$

and the second term is bounded and vanishes:

$$ \left|\frac{1}{2T}\int_{T-\tau}^{T} x\,x\,dt\right| \le \frac{\tau}{2T}\sup_{T-\tau\le t\le T}\left|x(t,\Omega)\,x(t+\tau,\Omega)\right| \;\xrightarrow[T\to\infty]{}\; 0 $$

therefore:

$$ \lim_{T\to\infty} R_T(\tau) = R(\tau) $$

[Figure: the truncated signal x_T(t) on the interval from -T to +T]
255
SOLO    Random Variables
Ergodicity (continue - 1):

Let compute the Fourier Transform of R_T(τ):

$$ \int_{-\infty}^{+\infty} R_T(\tau)\,e^{-j\omega\tau}\,d\tau = \frac{1}{2T}\int\!\!\int x_T(t,\Omega)\,x_T(t+\tau,\Omega)\,e^{-j\omega\tau}\,dt\,d\tau = \frac{1}{2T}\left[\int x_T(t,\Omega)\,e^{+j\omega t}\,dt\right]\left[\int x_T(v,\Omega)\,e^{-j\omega v}\,dv\right] = \frac{1}{2T}\,X_T^*(\omega)\,X_T(\omega) $$

where $X_T(\omega) := \int_{-\infty}^{+\infty} x_T(v,\Omega)\,e^{-j\omega v}\,dv$ and * means complex conjugate.

Define:

$$ S(\omega) := \lim_{T\to\infty} E\left[\frac{X_T(\omega)\,X_T^*(\omega)}{2T}\right] = \lim_{T\to\infty}\int_{-\infty}^{+\infty} E\left[\frac{1}{2T}\int_{-T}^{+T} x_T(t,\Omega)\,x_T(t+\tau,\Omega)\,dt\right]e^{-j\omega\tau}\,d\tau $$

Since the Random Process is Ergodic we can use the Wide-Sense Stationarity assumption $E\left[x_T(t,\Omega)\,x_T(t+\tau,\Omega)\right]=R(\tau)$:

$$ S(\omega) = \lim_{T\to\infty}\int_{-\infty}^{+\infty}\left[\frac{1}{2T}\int_{-T}^{+T} R(\tau)\,dt\right]e^{-j\omega\tau}\,d\tau = \int_{-\infty}^{+\infty} R(\tau)\,e^{-j\omega\tau}\,d\tau $$
256
SOLO    Random Variables
Ergodicity (continue - 2):

We obtained the Wiener-Khinchine Theorem (Wiener 1930):

$$ S(\omega) = \lim_{T\to\infty} E\left[\frac{X_T(\omega)\,X_T^*(\omega)}{2T}\right] = \int_{-\infty}^{+\infty} R(\tau)\,e^{-j\omega\tau}\,d\tau $$

The Power Spectrum or Power Spectral Density S(ω) of a Stationary Random Process is the Fourier Transform of the Autocorrelation Function R(τ).

Norbert Wiener (1894 - 1964); Alexander Yakovlevich Khinchine (1894 - 1959).
257
SOLO    Random Variables
White Noise

Wide-Sense Whiteness: a (not necessarily stationary) Random Process whose Autocorrelation is zero for any two different times is called white noise in the wide sense:

$$ R(t_1,t_2) = E\left[x(t_1,\Omega)\,x(t_2,\Omega)\right] = \sigma^2(t_1)\,\delta(t_1-t_2) $$

where σ²(t₁) is the instantaneous variance.

Strict-Sense Whiteness: a (not necessarily stationary) Random Process in which the outcomes at any two different times are independent is called white noise in the strict sense.

A Stationary White Noise Random Process has the Autocorrelation:

$$ R(\tau) = E\left[x(t,\Omega)\,x(t+\tau,\Omega)\right] = \sigma^2\,\delta(\tau) $$

Note: In general whiteness requires Strict-Sense Whiteness. In practice we have only moments (typically up to second order) and thus only Wide-Sense Whiteness.
258
SOLO    Random Variables
White Noise

A Stationary White Noise Random Process has the Autocorrelation $R(\tau)=\sigma^2\,\delta(\tau)$. The Power Spectral Density is given by performing the Fourier Transform of the Autocorrelation:

$$ S(\omega) = \int_{-\infty}^{+\infty} R(\tau)\,e^{-j\omega\tau}\,d\tau = \int_{-\infty}^{+\infty}\sigma^2\,\delta(\tau)\,e^{-j\omega\tau}\,d\tau = \sigma^2 $$

We can see that the Power Spectral Density contains all frequencies at the same amplitude. This is the reason it is called White Noise.

The Power of the Noise is defined as:

$$ P := R(\tau=0) = \sigma^2 $$
259
SOLO    Random Variables
Markov Processes

A Markov Process (Andrei Andreevich Markov, 1856 - 1922) is defined by:

$$ p\left[x(\tau,\Omega)\,\big|\,x(t,\Omega),\ t\le t_1\right] = p\left[x(\tau,\Omega)\,\big|\,x(t_1,\Omega)\right] \qquad \forall\,\tau > t_1 $$

i.e. for the Random Process, the past up to any time t₁ is fully summarized by the value of the process at t₁.

Examples of Markov Processes:

1. Continuous Dynamic System:
$$ \dot{x}(t) = f\left(t,x,u,v\right) \qquad z(t) = h\left(t,x,u,w\right) $$

2. Discrete Dynamic System:
$$ x_{k+1} = f_k\left(t_k,x_k,u_k,v_k\right) \qquad z_k = h_k\left(t_k,x_k,u_k,w_k\right) $$

where:
x - state space vector (n x 1)
u - input vector (m x 1)
v - white input noise vector (n x 1)
z - measurement vector (p x 1)
w - white measurement noise vector (p x 1)
260
SOLO    Random Variables
Markov Processes

Examples of Markov Processes:

3. Continuous Linear Dynamic System:

$$ \dot{x}(t) = A\,x(t) + v(t) \qquad\qquad z(t) = C\,x(t) $$

Using the Fourier Transform we obtain:

$$ Z(\omega) = \underbrace{C\left(j\omega I - A\right)^{-1}}_{\mathcal{H}(\omega)} V(\omega) = \mathcal{H}(\omega)\,V(\omega) $$

Using the Inverse Fourier Transform (and a change of the order of integration) we obtain:

$$ z(t) = \frac{1}{2\pi}\int_{-\infty}^{+\infty}\mathcal{H}(\omega)\,V(\omega)\,e^{j\omega t}\,d\omega = \frac{1}{2\pi}\int_{-\infty}^{+\infty}\mathcal{H}(\omega)\left[\int_{-\infty}^{+\infty} v(\xi)\,e^{-j\omega\xi}\,d\xi\right]e^{j\omega t}\,d\omega = \int_{-\infty}^{+\infty} H(t-\xi)\,v(\xi)\,d\xi $$

where

$$ H(t-\xi) := \frac{1}{2\pi}\int_{-\infty}^{+\infty}\mathcal{H}(\omega)\,e^{j\omega(t-\xi)}\,d\omega $$
261
SOLO    Random Variables
Markov Processes

Examples of Markov Processes:

3. Continuous Linear Dynamic System (continue): $\dot{x}=A\,x+v,\ z=C\,x,\ z(t)=\int H(t-\xi)\,v(\xi)\,d\xi$.

The Autocorrelation of the output is:

$$ R_{zz}(\tau) = E\left[z(t)\,z^T(t+\tau)\right] = \int\!\!\int H(t-\xi_1)\,E\left[v(\xi_1)\,v^T(\xi_2)\right]H^T(t+\tau-\xi_2)\,d\xi_1\,d\xi_2 $$

With white input noise, $R_{vv}(\tau)=E\left[v(t)\,v^T(t+\tau)\right]=S_{vv}\,\delta(\tau)$, whose spectral density is constant:

$$ S_{vv}(\omega) = \int_{-\infty}^{+\infty} S_{vv}\,\delta(\tau)\,e^{-j\omega\tau}\,d\tau = S_{vv} $$

the delta collapses one integration (substituting $\zeta:=t-\xi_1$):

$$ R_{zz}(\tau) = \int_{-\infty}^{+\infty} H(\zeta)\,S_{vv}\,H^T(\zeta+\tau)\,d\zeta $$

The Power Spectral Density of the output follows by Fourier transforming and exchanging the order of integration:

$$ S_{zz}(\omega) = \int_{-\infty}^{+\infty} R_{zz}(\tau)\,e^{-j\omega\tau}\,d\tau = \left[\int H(\zeta)\,e^{j\omega\zeta}\,d\zeta\right]S_{vv}\left[\int H^T(\chi)\,e^{-j\omega\chi}\,d\chi\right] = \mathcal{H}(\omega)\,S_{vv}\,\mathcal{H}^{*T}(\omega) $$

where * means complex conjugate.
262
SOLO    Random Variables
Markov Processes

Examples of Markov Processes:

4. Continuous Linear Dynamic System, first-order scalar example:

$$ v(t)\ \longrightarrow\ \boxed{\ \mathcal{H}(\omega)=\frac{K}{1+j\omega/\omega_x}\ }\ \longrightarrow\ z(t) $$

with white input noise $R_{vv}(\tau)=\sigma_{vv}^2\,\delta(\tau)$, i.e. $S_{vv}(\omega)=\sigma_{vv}^2$.

The Power Spectral Density of the output is:

$$ S_{zz}(\omega) = \mathcal{H}(\omega)\,S_{vv}\,\mathcal{H}^{*}(\omega) = \frac{K^2\,\sigma_{vv}^2}{1+\left(\omega/\omega_x\right)^2} $$

[Figure: S_zz(ω) is flat at K²σ_vv² at low frequency and falls to K²σ_vv²/2 at ω = ω_x]

The Autocorrelation of the output is the inverse Fourier transform:

$$ R_{zz}(\tau) = \frac{1}{2\pi}\int_{-\infty}^{+\infty} S_{zz}(\omega)\,e^{j\omega\tau}\,d\omega = \frac{1}{2\pi}\int_{-\infty}^{+\infty}\frac{K^2\,\sigma_{vv}^2}{1+\left(\omega/\omega_x\right)^2}\,e^{j\omega\tau}\,d\omega $$

Evaluating by residues (the integrand, as a function of $s=j\omega$, has poles at $s=\pm\omega_x$; close the contour in the half-plane where the exponential decays: for τ > 0 take the pole at $\omega=+j\omega_x$, for τ < 0 the pole at $\omega=-j\omega_x$, the large-semicircle contribution vanishing in each case):

$$ R_{zz}(\tau) = \frac{K^2\,\sigma_{vv}^2\,\omega_x}{2}\,e^{-\omega_x|\tau|} $$

[Figure: R_zz(τ) is a two-sided decaying exponential with peak value K²σ_vv²ω_x/2 at τ = 0]
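The Fourier pair S_zz(ω) ↔ R_zz(τ) derived above can be sanity-checked numerically. A minimal sketch (Python with NumPy; the constants K, σ_v, ω_x, the frequency grid, and the tolerance are arbitrary illustration choices) approximates the inverse transform by a Riemann sum and compares it with the closed-form exponential:

```python
import numpy as np

K, sigma_v, wx = 1.5, 0.8, 2.0
w = np.linspace(-400.0, 400.0, 800001)          # frequency grid (rad/s)
dw = w[1] - w[0]
S = K**2 * sigma_v**2 / (1.0 + (w / wx) ** 2)   # S_zz(omega), a Lorentzian

def R_num(tau):
    # (1/2pi) * integral S(w) e^{j w tau} dw; S is even, so only the cosine part survives
    return (S * np.cos(w * tau)).sum() * dw / (2.0 * np.pi)

def R_closed(tau):
    return K**2 * sigma_v**2 * wx / 2.0 * np.exp(-wx * abs(tau))

for tau in (0.0, 0.5, 1.0):
    print(tau, R_num(tau), R_closed(tau))
```

The residual discrepancy comes from truncating the slowly decaying 1/ω² tail of the Lorentzian at ±400 rad/s.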
263
Random VariablesSOLO
Markov Processes
Examples of Markov Processes:
5. Continuous Linear Dynamic System with Time Variable Coefficients
( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( )21121&
:&:
tttQteteE
twEtwtetxEtxteT
ww
wx
−=
−=−=
δ
w (t) x (t)
( )tF
( )tG ∫x (t)
( ) ( ) ( ) ( ) ( ) ( )twtGtxtFtxtxtd
d +==
( ) ( ) ( ) ( ) ( )tetGtetFte wxx +=
( ) ( ) ( ) ( ) ( ) ( )∫Φ+Φ=t
t
dwGttxtttx0
,, 00 λλλλ
The solutions of the Linear System are:
where:
( ) ( ) ( ) ( ) ( ) ( ) ( )3132210000 ,,,&,&,, ttttttItttttFtttd
d Φ=ΦΦ=ΦΦ=Φ
( ) ( ) ( ) ( ) ( ) ( )∫Φ+Φ=t
t
wxx deGttettte0
,, 00 λλλλ
( ) ( ) ( ) ( ) ( ) twEtGtxEtFtxE +=
264
SOLO    Random Variables - Markov Processes

Examples of Markov Processes:

5. Continuous Linear Dynamic System with Time-Variable Coefficients (continue - 1):

Define the correlation and the variance:

$$ R_x(t_1,t_2) := E\left[e_x(t_1)\,e_x^T(t_2)\right] \qquad\qquad V_x(t) := Var\left[x(t)\right] = E\left[e_x(t)\,e_x^T(t)\right] = R_x(t,t) $$

Substituting $e_x(t_i)=\Phi(t_i,t_0)\,e_x(t_0)+\int_{t_0}^{t_i}\Phi(t_i,\lambda_i)\,G(\lambda_i)\,e_w(\lambda_i)\,d\lambda_i$:

$$ R_x(t_1,t_2) = \Phi(t_1,t_0)\,\underbrace{E\left[e_x(t_0)\,e_x^T(t_0)\right]}_{V_x(t_0)}\Phi^T(t_2,t_0) + \int_{t_0}^{t_1}\!\!\int_{t_0}^{t_2}\Phi(t_1,\lambda_1)\,G(\lambda_1)\,\underbrace{E\left[e_w(\lambda_1)\,e_w^T(\lambda_2)\right]}_{Q(\lambda_1)\delta(\lambda_1-\lambda_2)}G^T(\lambda_2)\,\Phi^T(t_2,\lambda_2)\,d\lambda_1\,d\lambda_2 $$

where the cross terms vanish because the initial state is uncorrelated with the later noise, $E\left[e_x(t_0)\,e_w^T(\lambda)\right]=0$ for $\lambda\ge t_0$. The delta collapses one integration, up to the smaller of the two times:

$$ R_x(t_1,t_2) = \Phi(t_1,t_0)\,V_x(t_0)\,\Phi^T(t_2,t_0) + \int_{t_0}^{\min(t_1,t_2)}\Phi(t_1,\lambda)\,G(\lambda)\,Q(\lambda)\,G^T(\lambda)\,\Phi^T(t_2,\lambda)\,d\lambda $$
265
SOLO    Random Variables - Markov Processes

Examples of Markov Processes:

5. Continuous Linear Dynamic System with Time-Variable Coefficients (continue - 2):

Summarizing, the correlation of the state error is:

$$ R_x(t_1,t_2) = \Phi(t_1,t_0)\,V_x(t_0)\,\Phi^T(t_2,t_0) + \int_{t_0}^{\min(t_1,t_2)}\Phi(t_1,\lambda)\,G(\lambda)\,Q(\lambda)\,G^T(\lambda)\,\Phi^T(t_2,\lambda)\,d\lambda $$

and, for $t_1=t_2=t$, the variance is:

$$ V_x(t) = R_x(t,t) = \Phi(t,t_0)\,V_x(t_0)\,\Phi^T(t,t_0) + \int_{t_0}^{t}\Phi(t,\lambda)\,G(\lambda)\,Q(\lambda)\,G^T(\lambda)\,\Phi^T(t,\lambda)\,d\lambda $$
266
SOLO    Random Variables - Markov Processes

Examples of Markov Processes:

6. Discrete Linear Dynamic System with Variable Coefficients:

$$ x(k+1) = \Phi(k)\,x(k) + \Gamma(k)\,w(k) $$

$$ e_w(k):=w(k)-E[w(k)],\quad E\left[e_w(k)\,e_w^T(l)\right]=Q_w(k)\,\delta_{k,l} \qquad e_x(k):=x(k)-E[x(k)],\quad E\left[e_x(k)\,e_x^T(k)\right]=X(k) \qquad E\left[e_x(k)\,e_w^T(l)\right]=0\ \ \forall\,l\ge k $$

Taking expectations and subtracting:

$$ E[x(k+1)] = \Phi(k)\,E[x(k)] + \Gamma(k)\,E[w(k)] \qquad\qquad e_x(k+1) = \Phi(k)\,e_x(k) + \Gamma(k)\,e_w(k) $$

Iterating:

$$ e_x(k+2) = \Phi(k+1)\,\Phi(k)\,e_x(k) + \Phi(k+1)\,\Gamma(k)\,e_w(k) + \Gamma(k+1)\,e_w(k+1) $$

$$ e_x(k+l) = \Phi(k+l,k)\,e_x(k) + \sum_{n=k}^{k+l-1}\Phi(k+l,n+1)\,\Gamma(n)\,e_w(n) $$

where we defined $\Phi(k+l,k):=\Phi(k+l-1)\cdots\Phi(k+1)\,\Phi(k)$, with $\Phi(k,k)=I$ and $\Phi(n,m)\,\Phi(m,k)=\Phi(n,k)$. Hence:

$$ E\left[e_x(k+l)\,e_x^T(k)\right] = \Phi(k+l,k)\,E\left[e_x(k)\,e_x^T(k)\right] + \sum_{n=k}^{k+l-1}\Phi(k+l,n+1)\,\Gamma(n)\,E\left[e_w(n)\,e_x^T(k)\right] $$
267
SOLO    Random Variables - Markov Processes

Examples of Markov Processes:

6. Discrete Linear Dynamic System with Variable Coefficients (continue - 1):

To evaluate $E\left[e_w(n)\,e_x^T(k)\right]$ for $n\in[k,\,k+l-1]$, write the state error at k in terms of an earlier time k - l':

$$ e_x(k) = \Phi(k,k-l')\,e_x(k-l') + \sum_{m=k-l'}^{k-1}\Phi(k,m+1)\,\Gamma(m)\,e_w(m),\qquad l'=1,2,\dots $$

Then:

$$ E\left[e_w(n)\,e_x^T(k)\right] = \underbrace{E\left[e_w(n)\,e_x^T(k-l')\right]}_{0}\Phi^T(k,k-l') + \sum_{m=k-l'}^{k-1}\underbrace{E\left[e_w(n)\,e_w^T(m)\right]}_{Q_w\,\delta_{n,m}}\Gamma^T(m)\,\Phi^T(k,m+1) = 0 $$

since $m\in[k-l',\,k-1]$ while $n\in[k,\,k+l-1]$, so $\delta_{n,m}=0$ throughout. Therefore:

$$ E\left[e_x(k+l)\,e_x^T(k)\right] = \Phi(k+l,k)\,E\left[e_x(k)\,e_x^T(k)\right] = \Phi(k+l,k)\,X(k) $$
268
SOLO    Random Variables - Markov Processes

Examples of Markov Processes:

6. Discrete Linear Dynamic System with Variable Coefficients (continue - 2):

In the same way, transposing the roles of the two times:

$$ E\left[e_x(k)\,e_x^T(k+l)\right] = E\left[e_x(k)\,e_x^T(k)\right]\Phi^T(k+l,k) + \sum_{n=k}^{k+l-1} E\left[e_x(k)\,e_w^T(n)\right]\Gamma^T(n)\,\Phi^T(k+l,n+1) $$

and by the same argument as before $E\left[e_x(k)\,e_w^T(n)\right]=0$ for $n\ge k$, so:

$$ E\left[e_x(k)\,e_x^T(k+l)\right] = E\left[e_x(k)\,e_x^T(k)\right]\Phi^T(k+l,k) = X(k)\,\Phi^T(k+l,k) $$
269
SOLO    Matrices - Trace of a Square Matrix

The trace of a square matrix is defined as

$$ trace\left(A_{n\times n}\right) := \sum_{i=1}^{n} a_{ii} = trace\left(A_{n\times n}^T\right) $$

(1)  $trace\,(BA) = trace\,(AB)$

Proof:

$$ trace\,(AB) = \sum_{i=1}^{n}\sum_{j=1}^{n} a_{ij}\,b_{ji} \qquad trace\,(BA) = \sum_{j=1}^{n}\sum_{i=1}^{n} b_{ji}\,a_{ij} = trace\,(AB) \qquad \text{q.e.d.} $$

(2)  $trace\left(A^TB^T\right) = trace\left(B^TA^T\right) = trace\left[(BA)^T\right] = trace\,(BA) = trace\,(AB)$, but in general $trace\left(A^TB\right) \neq trace\,(AB)$.

Proof:

$$ trace\left(A^TB\right) = \sum_{i=1}^{n}\sum_{j=1}^{n} a_{ij}\,b_{ij} = trace\left(AB^T\right) \neq trace\,(AB) \qquad \text{q.e.d.} $$
270
SOLO    Matrices - Trace of a Square Matrix

(3)  $trace\,(A) = trace\left(P^{-1}AP\right) = \sum_{i=1}^{n}\lambda_i$

where P is the eigenvector matrix of A related to the eigenvalue matrix Λ of A by

$$ A\,P = P\,\Lambda = P\begin{bmatrix}\lambda_1 & & 0\\ & \ddots & \\ 0 & & \lambda_n\end{bmatrix} \quad\Rightarrow\quad P^{-1}A\,P=\Lambda $$

Proof: using (1),

$$ trace\left(P^{-1}AP\right) = trace\left(A\,P\,P^{-1}\right) = trace\,(A) \qquad\text{and}\qquad trace\left(P^{-1}AP\right) = trace\,(\Lambda) = \sum_{i=1}^{n}\lambda_i \qquad \text{q.e.d.} $$
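Identities (1) and (3) are easy to confirm numerically. A minimal sketch (Python with NumPy; the random 4x4 matrices are arbitrary illustration values):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 4))
B = rng.standard_normal((4, 4))

t_ab = np.trace(A @ B)          # identity (1): trace(AB) = trace(BA)
t_ba = np.trace(B @ A)
eigs = np.linalg.eigvals(A)     # identity (3): trace(A) = sum of eigenvalues
print(t_ab, t_ba, np.trace(A), eigs.sum())
```

For a real matrix the eigenvalues come in complex-conjugate pairs, so the imaginary parts of the sum cancel to round-off.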
271
SOLO    Matrices - Trace of a Square Matrix

(4)  $\det\,e^{A} = e^{trace(A)}$

Proof: with $A=P\,\Lambda\,P^{-1}$,

$$ \det\,e^{A} = \det\left(P\,e^{\Lambda}\,P^{-1}\right) = \det P\,\det e^{\Lambda}\,\det P^{-1} = \det e^{\Lambda} = \prod_{i=1}^{n}e^{\lambda_i} = e^{\sum_i\lambda_i} = e^{trace(A)} \qquad \text{q.e.d.} $$

Definition: if $a_{ij}$ are the coefficients of the matrix $A_{n\times n}$ and z is a scalar function of the $a_{ij}$, i.e. $z=z\left(a_{ij}\right),\ i,j=1,\dots,n$, then $\dfrac{\partial z}{\partial A}$ is the n x n matrix whose (i,j) coefficient is

$$ \left[\frac{\partial z}{\partial A}\right]_{ij} := \frac{\partial z}{\partial a_{ij}},\qquad i,j=1,\dots,n $$

(see Gelb, "Applied Optimal Estimation", pg. 23)
272
SOLO    Matrices - Trace of a Square Matrix

(5)  $\dfrac{\partial\,trace(A)}{\partial A} = I_n = \dfrac{\partial\,trace\left(A^T\right)}{\partial A}$

Proof:

$$ \frac{\partial\,trace(A)}{\partial a_{ij}} = \frac{\partial}{\partial a_{ij}}\sum_{i=1}^{n} a_{ii} = \delta_{ij} = \begin{cases}1 & i=j\\ 0 & i\neq j\end{cases} \qquad \text{q.e.d.} $$

(6)  $\dfrac{\partial\,trace(ABC)}{\partial A} = \dfrac{\partial\,trace(BCA)}{\partial A} = (BC)^T = C^TB^T$, for compatible dimensions (BC ∈ R^{m×n} when A ∈ R^{n×m})

Proof:

$$ \frac{\partial\,trace(ABC)}{\partial a_{ij}} = \frac{\partial}{\partial a_{ij}}\sum_{k}\sum_{l}\sum_{p} a_{kl}\,b_{lp}\,c_{pk} = \sum_{p} b_{jp}\,c_{pi} = (BC)_{ji} = \left[(BC)^T\right]_{ij} \qquad \text{q.e.d.} $$

(7)  If A, B, C ∈ R^{n×n}, i.e. square matrices, then by the cyclic property (1) and by (6):

$$ \frac{\partial\,trace(ABC)}{\partial A} = \frac{\partial\,trace(CAB)}{\partial A} = \frac{\partial\,trace(BCA)}{\partial A} = (BC)^T = C^TB^T $$
273
SOLO    Matrices - Trace of a Square Matrix

(8)  $\dfrac{\partial\,trace\left(A^TBC\right)}{\partial A} = \dfrac{\partial\,trace\left(BCA^T\right)}{\partial A} = BC$, for compatible dimensions (follows from (2) and (7))

(9)  If A, B, C ∈ R^{n×n}, i.e. square matrices, then by (8):

$$ \frac{\partial\,trace\left(A^TBC\right)}{\partial A} = \frac{\partial\,trace\left(CA^TB\right)}{\partial A} = \frac{\partial\,trace\left(BCA^T\right)}{\partial A} = BC $$

(10)  $\dfrac{\partial\,trace\left(A^2\right)}{\partial A} = 2A^T$

Proof:

$$ \frac{\partial\,trace\left(A^2\right)}{\partial a_{ij}} = \frac{\partial}{\partial a_{ij}}\sum_{l=1}^{n}\sum_{m=1}^{n} a_{lm}\,a_{ml} = a_{ji} + a_{ji} = \left[2A^T\right]_{ij} \qquad \text{q.e.d.} $$

(11)  $\dfrac{\partial\,trace\left(A^k\right)}{\partial A} = k\left(A^{k-1}\right)^T$

Proof: applying (7) to $A^k = A\cdot A^{k-1}$, each of the k occurrences of A contributes a term $\left(A^{k-1}\right)^T$:

$$ \frac{\partial\,trace\left(A\cdot A^{k-1}\right)}{\partial A} = \left(A^{k-1}\right)^T + \dots + \left(A^{k-1}\right)^T = k\left(A^{k-1}\right)^T \qquad \text{q.e.d.} $$
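Matrix-derivative identities like (6) are straightforward to verify with central finite differences, since trace(ABC) is linear in the entries of A. A minimal sketch (Python with NumPy; the matrix sizes and random values are arbitrary illustration choices):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((3, 4))
B = rng.standard_normal((4, 2))
C = rng.standard_normal((2, 3))

f = lambda M: np.trace(M @ B @ C)   # scalar function of the matrix argument
grad_analytic = (B @ C).T           # identity (6): d trace(ABC)/dA = (BC)^T

eps = 1e-6
grad_fd = np.zeros_like(A)
for i in range(A.shape[0]):
    for j in range(A.shape[1]):
        Ap = A.copy(); Ap[i, j] += eps
        Am = A.copy(); Am[i, j] -= eps
        grad_fd[i, j] = (f(Ap) - f(Am)) / (2 * eps)   # central difference

print(np.max(np.abs(grad_fd - grad_analytic)))
```

The same finite-difference loop can be reused to check identities (8) through (14).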
274
SOLO    Matrices - Trace of a Square Matrix

(12)  $\dfrac{\partial\,trace\left(e^{A}\right)}{\partial A} = \left(e^{A}\right)^T$

Proof: using the series $e^{A}=\sum_{k=0}^{\infty}\dfrac{A^k}{k!}$ and identity (11):

$$ \frac{\partial\,trace\left(e^{A}\right)}{\partial A} = \sum_{k=1}^{\infty}\frac{1}{k!}\,k\left(A^{k-1}\right)^T = \sum_{k=1}^{\infty}\frac{\left(A^{k-1}\right)^T}{(k-1)!} = \left(e^{A}\right)^T \qquad \text{q.e.d.} $$

(13)  $\dfrac{\partial\,trace\left(ABAC\right)}{\partial A} = \left(BAC\right)^T + \left(CAB\right)^T = C^TA^TB^T + B^TA^TC^T$

Proof: differentiate with respect to each of the two occurrences of A separately, applying (6) to each:

$$ \frac{\partial\,trace\left(A\,(BAC)\right)}{\partial A}\bigg|_{1st\,A} + \frac{\partial\,trace\left(A\,(CAB)\right)}{\partial A}\bigg|_{2nd\,A} = \left(BAC\right)^T + \left(CAB\right)^T \qquad \text{q.e.d.} $$

(14)  $\dfrac{\partial\,trace\left(AA^T\right)}{\partial A} = \dfrac{\partial\,trace\left(A^TA\right)}{\partial A} = 2A$

Proof: $trace\left(AA^T\right)=\sum_{i,j}a_{ij}^2$, hence $\partial/\partial a_{ij}=2a_{ij}$. q.e.d.
275
SOLO    Functional Analysis
Inner Product

If X is a complex linear space, the Inner Product < , > between the elements $x,y,z\in X$ (a complex number) is defined by:

1.  $\langle x,y\rangle = \overline{\langle y,x\rangle}$ (conjugate symmetry; the bar denotes complex conjugation)
2.  $\langle x+y,z\rangle = \langle x,z\rangle + \langle y,z\rangle$ (distributive law)
3.  $\langle\lambda x,y\rangle = \lambda\,\langle x,y\rangle \quad \forall\,\lambda\in\mathbb{C}$
4.  $\langle x,x\rangle \ge 0,\qquad \langle x,x\rangle = 0 \Leftrightarrow x=0$

Example, for vector-valued functions $f(t)=\left(f_1(t),\dots,f_n(t)\right)^T$ and $g(t)=\left(g_1(t),\dots,g_n(t)\right)^T$, define:

$$ \langle f(t),g(t)\rangle := \int f^T(t)\,g(t)\,dt $$
276
SOLO    Signals
Signal Duration and Bandwidth

With the Fourier pair

$$ S(f) = \int_{-\infty}^{+\infty} s(t)\,e^{-i2\pi f t}\,dt \qquad\qquad s(t) = \int_{-\infty}^{+\infty} S(f)\,e^{i2\pi f t}\,df $$

[Figure: |s(t)|² plotted over t with width 2Δt about t̄, and |S(f)|² plotted over f with width 2Δf about f̄]

define:

Signal Median:
$$ \bar{t} := \frac{\int_{-\infty}^{+\infty} t\,|s(t)|^2\,dt}{\int_{-\infty}^{+\infty}|s(t)|^2\,dt} $$

Signal Duration:
$$ \Delta t := \left[\frac{\int_{-\infty}^{+\infty}\left(t-\bar{t}\right)^2|s(t)|^2\,dt}{\int_{-\infty}^{+\infty}|s(t)|^2\,dt}\right]^{1/2} $$

Frequency Median:
$$ \bar{f} := \frac{\int_{-\infty}^{+\infty} f\,|S(f)|^2\,df}{\int_{-\infty}^{+\infty}|S(f)|^2\,df} $$

Signal Bandwidth:
$$ \Delta f := \left[\frac{4\pi^2\int_{-\infty}^{+\infty}\left(f-\bar{f}\right)^2|S(f)|^2\,df}{\int_{-\infty}^{+\infty}|S(f)|^2\,df}\right]^{1/2} $$
277
SOLO    Signals
Signal Duration and Bandwidth (continue - 1)

From the Fourier pair:

$$ \int_{-\infty}^{+\infty}|s(t)|^2\,dt = \int\!\!\int\!\!\int S(f)\,S^*(f')\,e^{i2\pi(f-f')t}\,df\,df'\,dt = \int_{-\infty}^{+\infty}|S(f)|^2\,df $$

This is the Parseval Theorem. Differentiating the inverse transform:

$$ s'(t) = \frac{d\,s(t)}{dt} = \int_{-\infty}^{+\infty} i2\pi f\,S(f)\,e^{i2\pi f t}\,df $$

and applying the same Parseval argument to s'(t):

$$ \int_{-\infty}^{+\infty}|s'(t)|^2\,dt = 4\pi^2\int_{-\infty}^{+\infty} f^2\,|S(f)|^2\,df $$
278
SOLO    Signals
Signal Duration and Bandwidth (continue - 2)

The two medians can each be expressed in either domain. Substituting the Fourier pair and exchanging the order of integration, the signal median becomes

$$ \bar{t} = \frac{\int t\,|s(t)|^2\,dt}{\int|s(t)|^2\,dt} = \frac{\int\!\!\int t\,s(t)\,S^*(f)\,e^{-i2\pi f t}\,df\,dt}{\int|S(f)|^2\,df} = \frac{\dfrac{i}{2\pi}\displaystyle\int S^*(f)\,\frac{dS(f)}{df}\,df}{\displaystyle\int|S(f)|^2\,df} $$

(using $\frac{dS}{df}=\int(-i2\pi t)\,s(t)\,e^{-i2\pi f t}\,dt$), and similarly the frequency median becomes

$$ 2\pi\bar{f} = \frac{2\pi\int f\,|S(f)|^2\,df}{\int|S(f)|^2\,df} = \frac{-\,i\displaystyle\int s^*(t)\,s'(t)\,dt}{\displaystyle\int|s(t)|^2\,dt} $$

(using $s'(t)=\int i2\pi f\,S(f)\,e^{i2\pi f t}\,df$ and Parseval).
279
SOLO    Signals
Signal Duration and Bandwidth (continue - 3)

From the Schwarz Inequality

$$ \left|\int_{-\infty}^{+\infty} f(t)\,g(t)\,dt\right|^2 \le \int_{-\infty}^{+\infty} f^2(t)\,dt\,\int_{-\infty}^{+\infty} g^2(t)\,dt $$

choose $f(t):=t\,s(t)$ and $g(t):=s'(t)=\dfrac{d\,s(t)}{dt}$, so we obtain

$$ \left(\int t\,s(t)\,s'(t)\,dt\right)^2 \le \int t^2 s^2(t)\,dt\,\int s'^2(t)\,dt $$

Integrate the left side by parts, with $u=t\,s$, $dv=s'\,dt$, assuming $\lim_{t\to\infty} t\,s^2(t)=0$:

$$ \int t\,s\,s'\,dt = \underbrace{\left[t\,s^2\right]_{-\infty}^{+\infty}}_{0} - \int s\,(s+t\,s')\,dt \quad\Rightarrow\quad \int t\,s\,s'\,dt = -\frac{1}{2}\int s^2\,dt $$

Substituting this and $\int s'^2\,dt = 4\pi^2\int f^2|S(f)|^2\,df$:

$$ \frac{1}{4}\left(\int s^2\,dt\right)^2 \le \int t^2 s^2\,dt\;\cdot\;4\pi^2\int f^2|S(f)|^2\,df $$

Dividing by $\int s^2\,dt\cdot\int|S(f)|^2\,df$ (the two are equal by Parseval):

$$ \frac{1}{4} \le \frac{\int t^2 s^2(t)\,dt}{\int s^2(t)\,dt}\;\cdot\;\frac{4\pi^2\int f^2|S(f)|^2\,df}{\int|S(f)|^2\,df} $$
280
SOLO    Signals
Signal Duration and Bandwidth (continue - 4)

Changing the time and frequency scales so that $\bar{t}=0$ and $\bar{f}=0$, the two factors on the right are $\left(\Delta t\right)^2$ and $\left(\Delta f\right)^2$:

$$ \frac{1}{4} \le \left(\Delta t\right)^2\left(\Delta f\right)^2 $$

Finally we obtain:

$$ \frac{1}{2} \le \left(\Delta t\right)\left(\Delta f\right) $$

Since the Schwarz Inequality becomes an equality if and only if $g(t)=k\,f(t)$, equality holds for

$$ s'(t) = \frac{d\,s(t)}{dt} = -2\alpha\,t\,s(t) \quad\Rightarrow\quad s(t) = A\,e^{-\alpha t^2} $$

i.e. for the Gaussian pulse, for which we have $\left(\Delta t\right)\left(\Delta f\right)=\dfrac{1}{2}$.