9
1 cs542g-term1-2007 Notes Notes Extra class this Friday 1-2pm Assignment 2 is out Error in last lecture: quasi-Newton methods based on the secant condition: Really just Taylor series applied to the gradient function g: H Δx gx x ( ) gx () gx x ( )= gx ( )+ H Δx + O Δx 2 ( )

Notes

Embed Size (px)

DESCRIPTION

Notes. Extra class this Friday 1-2pm Assignment 2 is out Error in last lecture: quasi-Newton methods based on the secant condition: Really just Taylor series applied to the gradient function g:. Nonlinear Least Squares. One particular nonlinear problem of interest: - PowerPoint PPT Presentation

Citation preview

Page 1: Notes

1cs542g-term1-2007

NotesNotes

Extra class this Friday 1-2pm

Assignment 2 is out

Error in last lecture: quasi-Newton methods based on the secant condition:

• Really just Taylor series applied to the gradient function g:

HΔx≈g x+ Δx( )−g x( )

g x + Δx( ) =g x( ) + HΔx+O Δx2( )

Page 2: Notes

2cs542g-term1-2007

Nonlinear Least SquaresNonlinear Least Squares

One particular nonlinear problem of interest:

Can we exploit the structure of this objective?

Gradient:

Hessian:

minxb−g x( )

2

2

−2J T b − g(x)( ), Jij =∂gi∂x j

H =2J T J −2 b−g(x)( )g

∂2g∂x2

Page 3: Notes

3cs542g-term1-2007

Gauss-NewtonGauss-Newton

Assuming g is close to linear, just keep the JTJ term

Automatically positive (semi-)definite:in fact, modified Newton solve is now a linear least-squares problem!

Convergence may not be quite as fast as Newton, but more robust.

minΔx

b−g(x)( )−J Δx2

2

Page 4: Notes

4cs542g-term1-2007

Time IntegrationTime Integration

A core problem across many applications• Physics, robotics, finance, control, …

Typical statement:system of ordinary differential equations,first order

Subject to initial conditions:

Higher order equations can be reduced to a system - though not always a good thing to do

dy

dt= f y,t( )

y 0( ) =y0

Page 5: Notes

5cs542g-term1-2007

Forward EulerForward Euler

Derive from Taylor series:

Truncation error is second-order (Global) accuracy is first-order• Heuristic: to get to a fixed time T, need T/∆t

time steps

y t + Δt( ) =y t( ) + Δtdydt

t( ) +O Δt2( )

yn+1 =yn + Δt f yn,tn( )

Page 6: Notes

6cs542g-term1-2007

Analyzing Forward EulerAnalyzing Forward Euler

Euler proved convergence as ∆t goes to 0 In practice, need to pick ∆t > 0

- how close to zero do you need to go? The first and foremost tool is using a

simple model problem that we can solve exactly

dy

dt=Ay

Page 7: Notes

7cs542g-term1-2007

Solving Linear ODE’sSolving Linear ODE’s

First: what is the exact solution? For scalars:

This is in fact true for systems as well, though needs the “matrix exponential”

More elementary approach:change basis to diagonalize A(or reduce to Jordan canonical form)

dy

dt=Ay, y 0( ) =y0

⇒ y t( ) =eAty0

Page 8: Notes

8cs542g-term1-2007

The Test EquationThe Test Equation

We can thus simplify our model down to a single scalar equation, the test equation

Relate this to a full nonlinear system by taking the eigenvalues of the Jacobian of f:

• But obviously nonlinearities may cause unexpected things to happen: full nonlinear analysis is generally problem-specific, and hard or impossible

dy

dt=λy

∂f∂y

⎛⎝⎜

⎞⎠⎟x = λ x

Page 9: Notes

9cs542g-term1-2007

Solution of the Test Solution of the Test EquationEquation

Split scalar into real and imaginary parts:

For initial conditions y0=1

The magnitude only depends on a• If a<0: exponential decay (stable)

If a>0: exponential increase (unstable)If a=0: borderline-stable

λ =a + b −1

y t( ) =eλt =eat cosbt+ −1sinbt( )