54
© Crown copyright 2007 Page 1 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

  • View
    217

  • Download
    3

Embed Size (px)

Citation preview

Page 1: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 1

Data assimilation for systems with multiple time-scales

Mike Cullen

and Gordon Inverarity

4 September 2007

Page 2: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 2

Introduction

Page 3: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 3

Motivation

Data assimilation optimised for short-range (12 to 36 hour) operational forecasts in the extratropics.

Same techniques must be applicable globally, though maybe non-optimal in the tropics

Convective-scale data assimilation for shorter timescales will require different techniques.

Page 4: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 4

Observed behaviour of the system

Forecasting for timescales of 12-36 hrs in the extratropics requires accurate prediction of weather systems.

These are characterised by timescales greater than 1/f, where f is the Coriolis parameter expressing the Earth’s rotation. This corresponds to fluid trajectories taking more than a day to change direction by π/2.

The atmosphere contains many other motions with faster timescales.

Page 5: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 5

Implications for data assimilation

The observation network is designed to measure the geostrophic disturbances, it is not really sufficient even for this task.

Most other types of motion can only be properly observed during special field campagns.

It is important to use the observations to define the geostrophic disturbances properly.

Page 6: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 6

Basic 4D-Var

Page 7: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 7

The problem

Given a previous forecast which takes the form of a trajectory calculated using an (imperfect) numerical model. This is called the background trajectory.

Given a new set of (imperfect) observations.

Constrain the forecast trajectory using the new observations and the background trajectory to give the best possible new forecast.

Page 8: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 8

4D-Var

Page 9: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 9

4D-Var

Page 10: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 10

4D-Var

Page 11: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 11

What is 4D-Var? I

Assume a nonlinear forecast model Mi,i-1 and uncorrelated zero-mean Gaussian model error Wi with covariance Q

, 1 1 1( )i i i i iM X X W

Page 12: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 12

What is 4D-Var? II

Assume a nonlinear observation operator Hi and uncorrelated zero-mean Gaussian observation error Vi with covariance R

( )i i i iH Y X V

Page 13: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 13

4D-Var with model error

Huge minimisation state vector

0 1, , | , , 0 0( , , ) exp( ( , , ))n n n np J X X Y Y X X X X

22 2

0 0 b , 1 11 1

1 1 1( , , ) ( ) ( )

2 2 2

n n

n i i i i i i ii i

J H M

B R QX X X x Y X X X

T 1A

x x A x

Page 14: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 14

4D-Var with a perfect forecast model

Smaller minimisation state vector

0 1, , | , , 0 , 1 1 01

( , , ) ( ( )) exp( ( ))n n

n

n i i i ii

p M J

X X Y Y X X X X X

2 2

0 0 b1

1 1( ) ( )

2 2

n

i i ii

J H

B RX X x Y X

Page 15: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 15

Effect of perfect model formulation I

The observations are fitted using a model trajectory.

The trajectory is defined by its initial values, which are now the only control parameters.

The error in the background trajectory , in reality a mixture of model error and errors in previous observations, is assumed to be solely due to errors in observations.

The resulting analysis trajectory is ,0 ( )i aM x

,0 ( )i bM x

Page 16: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 16

Effect of perfect model formulation II

This solution is statistically optimal if:

• The model is perfect

• Perturbations to the model trajectory evolve according to linear dynamics

• The error in the initial value is Gaussian with zero mean

All these assumptions are seriously deficient in a real situation

Page 17: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 17

Alternative formulation

Page 18: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 18

Alternative formulation

Wish to acknowledge imperfect model, without losing requirement to fit observations with a model trajectory, thus ensuring consistency with forecast.

Page 19: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 19

Distributed-background 4D-Var

Penalise the difference between the background and analysis trajectories throughout the assimilation window

Reduces to traditional 4D-Var when =p/n, the Jacobian can accurately evolve increments and

These are the assumptions listed earlier.

2/ 1

b 0 ,0 0 ,0 b0

( ) ( ) ( )2

pi

n p

pi pii

J M M

B

X X x

T,0 b ,0 b( ) { ( )}pi pi piB M x B M x

,0piM

Page 20: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 20

Exploitation

Can now use different growth assumption based on diagnostics

Expect that non-geostrophic errors are ‘permanently saturated’. Thus the flow consists of geostrophic disturbances, the errors in which grow in time, and a background of other motions whose total energy is constant in time.

This would lead to no growth of errors in non-geostrophic motions in the assimilation window. Will show that this discourages fitting of observations with such motions.

Page 21: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 21

Diagnostic evidence of error growth

Page 22: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 22

Evaluation of assumptions

We can examine the linear growth assumption by comparing the difference of two nonlinear runs with the evolution of a linearised perturbation model.

Many of the physical processes represented in the model are highly nonlinear-limiting the accuracy demonstrated by such a test.

Page 23: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 23

Linearisation test over 6 hour period

Relative error

Page 24: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 24

Singular vectors

Even using a linear model, the growth of perturbations is not simply exponential or wave-like because of the time-dependent trajectory about which the evolution is linearised. Thus the growth has to be assessed over a given time interval.

A particular choice of perturbation is referred to as a ‘singular vector’.

Page 25: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 25

Behaviour of standard 4D-Var

xx

xx

x

* Background

If observations late in 4D-Var window, they will be fitted using the most rapidly growing singular vectors, since that gives the smallest penalty in the initial value. The growth is assessed over the assimilation window, typically 6-12 hours.

Page 26: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 26

Error growth

In the real system, error growth in the rotationally dominated (geostrophic) disturbances saturates at a point when the weather systems are completely out of phase.

This takes a time of order 2 weeks.

The errors in other motions saturate at a much lower value and on a shorter timescale (~ 1 day).

Page 27: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 27

Singular vector growth

Singular vector calculations by Corinna Klapproth and Tim Payne using operational data from July 2003 follow.

They show that the most rapid growth is in non-geostrophic modes unless the period over which the growth is calculated is greater than 36 hrs.

This is much longer than an assimilation window, so 4D-Var will tend to use other types of perturbation to fit the observations..

Page 28: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 28

Energy partition in geostrophic and ageostrophic

Page 29: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 29

Energy partition in geostrophic and ageostrophic

Page 30: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 30

Energy partition in geostrophic and ageostrophic

Page 31: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 31

Energy partition in geostrophic and ageostrophic

Page 32: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 32

Energy partition in geostrophic and ageostrophic

Page 33: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 33

Energy partition in geostrophic and ageostrophic

Page 34: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 34

Error growth due to imperfect model

The combination of the growth of differences between the truth ‘model’ and the forecast model and the growth of perturbations under the action of the forecast model can be inferred using verification figures from operational forecasts.

These suggest a linear growth (greater in first 24 hrs).

The same is true for errors in significant weather systems, such as tropical and extratropical cyclones.

Page 35: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 35

Error growth in key parameters

Rms errors in surface pressure (thick solid), 500 hpa height (dashed) and 200 hpa wind (thin solid) over N hemisphere in 1995

Page 36: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 36

Illustration using toy model

Page 37: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 37

3-body problem: Inverarity (2007)

12-dimensional Hamiltonian problem of a sun-planet-moon orbital system with fast and slow timescales

Nonlinear 4D-Var using Gauss-Newton method to solve grad J = 0

Explicit symplectic integrator with analytical Jacobian

B matrix generated using an extended Kalman filter (model error variance tuned to ensure sensible filter behaviour)

Page 38: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 38

Multiple timescales

Slow timescale associated with planet’s orbit round sun (7200 timesteps)

Fast timescale with moon’s orbit round planet (540 timesteps)

Choose assimilation period of about 50% of moon’s orbital period (corresponds to 6 hr period in real system)

Page 39: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 39

3-body orbits – Watkinson (2006)

Page 40: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 40

Perfect (truth) vs imperfect (forecast) model

Sun mass = 1.0Planet mass = 0.1Moon mass = 0.01

Sun mass = 1.0Planet mass = 0.1Moon mass = 0.011

Page 41: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 41

Issues to study

A systematically imperfect model does not satisfy the assumptions of 4D-Var.

The growth of differences between the truth model and the forecast model may dominate the growth of perturbations under the action of the forecast model.

The linear model used to describe error growth in the forecast model is not adequate.

Page 42: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 42

Model error growth

Time unit 1000 timesteps

Page 43: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 43

Linearisation tests

Show evolution of differences from nonlinear runs compared with growth of difference predicted by linear model.

Linear assumption good for sun’s position/momentum (slow dynamics); not for moon’s position/momentum (fast dynamics) as desired to match real system

Page 44: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 44

Test results-(forecast model)

Page 45: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 45

Assimilation test

Use standard 4D-Var with B derived from extended Kalman filter, would be ‘correct’ if model was perfect stochastic model.

Run further 300 steps after EKF ‘analysis’ to get background state.

Complete set of observations during assimilation window.

Also use distributed background term with two ‘nodes’, B for both taken as time average from EKF.

Page 46: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 46

Results with no assimilation-900 steps

Position

Momentum

Red=Sun, Blue=Planet Green=Moon

Page 47: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 47

Results with standard 4D-Var-900 steps

Position

Momentum

Red=Sun, Blue=Planet Green=Moon

Page 48: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 48

Results with distributed Jb-900 steps

Position

Momentum

Red=Sun, Blue=Planet Green=Moon

Page 49: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 49

Results with no assimilation-4000 steps

Position

Momentum

Red=Sun, Blue=Planet Green=Moon

Page 50: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 50

Results with standard 4D-Var-4000 steps

Position

Momentum

Red=Sun, Blue=Planet Green=Moon

Page 51: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 51

Results with distributed Jb-4000 steps

Position

Momentum

Red=Sun, Blue=Planet Green=Moon

Page 52: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 52

Summary

Demonstrated limitations of 4D-Var assumptions using diagnostics from real data.

Demonstrated possible approach to handling systematic model error within variational assimilation framework.

Allows using model as strong constraint, giving consistency between assimilation and forecast, and also allows some aspects of multiscale behaviour to be addressed.

Page 53: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 53

Accreditation

WAFCWorld Area Forecast Centre

Page 54: Page 1© Crown copyright 2007 Data assimilation for systems with multiple time-scales Mike Cullen and Gordon Inverarity 4 September 2007

© Crown copyright 2007 Page 54

Questions & Answers