Video Processing Communications Yao Wang Chapter-5-6a

Video Modeling(Basics: Camera Models and Motion Models)

Two-Dimensional Motion Estimation(Fundamentals & Basic Techniques)

Chapter 5 (parts) and Chapter 6

Outline (chapter 5)

• Camera projection• 3D motion • Projection of 3-D motion• 2D motion due to rigid object motion

– Projective mapping

• Approximation of projective mapping– Affine model– Bilinear model

Pinhole Camera

Cameracenter

Imageplane

2-Dimage

3-Dpoint

The image of an object is reversed from its 3-D position. The object appears smaller when it is farther away.

Pinhole Camera Model:Perspective Projection

ZyxZYFy

torelatedinversely are ,

All points in this ray will have the same image

Approximate Model:Orthographic Projection

images) cmicroscopi i.e. Z,Ffor is Yy and X(xobject. theof distance the tocompared

small isobject in theation withdepth vari theas long as used beCan const whenF/Z with icorthograph scaled sY,y sX,actually x

, ,)(far very isobject When the

ZZYyXx

Rigid Object Motion

zyxzyx TTT ,,:;,,:][;)]([':centerobject theon wrp. translatiandRotation

TRCTCXRX

Rigid Object Motion

],,[ zyx TTTT

zyxzyx TTT ,,:;,,:][;)]([':centerobject theon wrp. translatiandRotation

TRCTCXRX

rrrrrrrrr

Flexible Object Motion

• Two ways to describe– Decompose into multiple, but connected rigid sub-objects– Global motion plus local motion in sub-objects– Ex. Human body consists of many parts each undergo a rigid

motion– Elastic motion can’t easily decomposed– Mesh models

3-D Motion -> 2-D Motion

2-D MV

3-D MV

Sample (2D) Motion Field

Occlusion Effect

Motion is undefined in occluded regions

Typical Camera Motions

2-D Motion Corresponding to Camera Motion

Camera zoom Camera rotation around Z-axis (roll)

2-D Motion Corresponding to Rigid Object Motion

• General case:

• Projective mapping:(if not planar – planar patches)

FTZFryrxrFTZFryrxr

FTZFryrxrFTZFryrxrFx

rrrrrrrrr

Projection ePerspectiv

ycxcybxbby

ycxcyaxaax

:c)bYaX(Z object When the planar is surface

Projective Mapping

Two features of projective mapping:• Chirping: increasing perceived spatial frequency for far away objects• Converging (Keystone): parallel lines converge in distance

Affine and Bilinear Model

• Approximations to Projective Mapping– Affine (6 parameters):

• Good for mapping triangles to triangles

– Bilinear (8 parameters):

• Good for mapping blocks to quadrangles– Polynomial models (biquadratic – 12 parameters)

ybxbbyaxaa

yxdyxd

),(),(

xybybxbbxyayaxaa

yxdyxd

),(),(

Motion Field Corresponding toDifferent 2-D Motion Models

Translation

Bilinear Perspective

Affine

Outline (Chapter 6)二维运动估计

• Optical flow equation and ambiguity in motion estimation• General methodologies in motion estimation

– Motion representation– Motion estimation criterion– Optimization methods– Gradient descent methods

• Pixel-based motion estimation• Block-based motion estimation

– EBMA algorithm

2-D Motion vs. Optical Flow

On the left, a sphere is rotating under a constant ambient illumination, but the observed image does not change.

On the right, a point light source is rotating around a stationary sphere, causing the highlight point on the sphere to rotate.

• 2-D Motion: Projection of 3-D motion, depending on 3D object motion and projection operator• Optical flow: “Perceived” 2-D motion based on changes in image pattern, also depends on illumination and object surface texture

Optical Flow Equation

• When illumination condition is unknown, the best one can do it to estimate optical flow.

• Constant intensity assumption -> Optical flow equation

0or 0or 0

:equation flow optical thehave we two,above theCompare

),,(),,(

:expansion sTaylor' using But,

),,(),,(:"assumptionintensity constant "Under

tyxdtdydx

Tyxtyx

tyxtyx

Optical Flow Equation

),,( ofector gradient v spatial theis ],[

vector velocity theis )v, ( where

equation flow optic isThat

Ambiguities in Motion Estimation

• Optical flow equation only constrains the flow vector in the gradient direction

• The flow vector in the tangent direction ( ) is under-determined

• In regions with constant brightness ( ), the flow is indeterminate -> Motion estimation is unreliable in regions with flat texture, more reliable near edges

• The flow vector in the tangent direction ( Vt ) is under-determined (aperture problem)

General Considerations for Motion Estimation

• Two categories of approaches:– Feature based (more often used in object tracking, 3D

reconstruction from 2D)– Intensity based (based on constant intensity assumption)

(more often used for motion compensated prediction, required in video coding, frame interpolation) -> Our focus

• Three important questions– How to represent the motion field?– What criteria to use to estimate motion parameters?– How to search motion parameters?

Motion Representation

Global:Entire motion field is represented by a few global parameters

Pixel-based:One MV at each pixel, with some smoothness constraint between adjacent MVs.

Region-based:Entire frame is divided into regions, each region corresponding to an object or sub-object with consistent motion, represented by a few parameters.

Block-based:Entire frame is divided into blocks, and motion in each block is characterized by a few parameters.

Also mesh-based (flow of corners, approximated inside)

Notations

Anchor frame:

Target frame:

Motion parameters:

Motion vector at a pixel in the anchor frame:

Motion field:

Mapping function:

)(xdxaxd ),;(

xaxdxaxw ),;();(

TLaaa ],...,,[ 21a

Backward motion estimation

Forward motion estimation

Motion Estimation Criterion

• To minimize the displaced frame difference (DFD)

(MSE)Error SquareMean ,min)());(()(

(MAD) Difference AbsoluteMean ,min)());(()(

MSE-DFD

MAD-DFD

xaxdxa

min)()();()()(

)()( small is if 0

xxaxdxa

• To satisfy the optical flow equation

10 1 1 1 2 1

( ( ( )) ( ) ) ( ( ( ) ( )) ( ))

( ) ( )( ) ( )( )

( )( )( )( ) ( )

x x x x

v x x y x tv

y tx y y

d x x x x x

Block size: 8*8 16*16

This is homework

• Two above methods fail– Constant intensity assumption is not true everywhere (say shadows)– Flat texture: many motions satisfy the assumption (ill-posed)

• Need to impose additional smoothness constraint using regularization technique (Important in pixel- and block-based representation)

– Penalty term for smoothness, measure motion flow differences between adjacent pixels

– use weights to include a priori knowledge of importance of smoothness

min)()(

);();()(

aydaxdax y

Relation Among Different Criteria

• OF criterion is good only if motion is small. • OF criterion can often yield closed-form solution as the objective

function is quadratic in MVs.• When the motion is not small, can iterate the solution based on

the OF criterion to satisfy the DFD criterion.• Bayesian criterion can be reduced to the DFD criterion plus

motion smoothness constraint• More in the textbook

Optimization Methods

• Exhaustive search – Typically used for the DFD criterion with p=1 (MAD)– Guarantees reaching the global optimal– Computation required may be unacceptable when number of

parameters to search simultaneously is large!– Fast search algorithms reach sub-optimal solution in shorter time

• Gradient-based search– Typically used for the DFD or OF criterion with p=2 (MSE)

• the gradient can often be calculated analytically• When used with the OF criterion, closed-form solution may be obtained

– Reaches the local optimal point closest to the initial solution• Multi-resolution search

– Search from coarse to fine resolution, faster than exhaustive search– Avoid being trapped into a local minimum

Gradient Descent Method

• Iteratively update the current estimate in the direction opposite the gradient direction.

• The solution depends on the initial condition. Reaches the local minimum closest to the initial condition

• Choice of step side:– Fixed step size: Step size must be small to avoid oscillation, requires many iterations– Steepest gradient descent (adjust step size optimally)

Not a good initial

A good initial

Appropriatestepsize

Stepsize too big

Newton’s Method

• Newton’s method

– Converges faster than 1st order method (I.e. requires fewer number of iterations to reach convergence)

– Requires more calculation in each iteration– More prone to noise (gradient calculation is subject to noise, more so with 2nd order

than with 1st order)– May not converge if \alpha >=1. Should choose \alpha appropriate to reach a good

compromise between guaranteeing convergence and the convergence rate.

Newton-Raphson Method

• Newton-Ralphson method – Approximate 2nd order gradient with product of 1st order gradients– Applicable when the objective function is a sum of squared errors– Only needs to calculate 1st order gradients, yet converge at a rate similar to

Newton’s method.

Video Processing Communications Yao Wang Chapter-5-6a

Documents

Basics of Video - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL6123old/Introduction.pdf · Basics of Video Yao Wang ... – Rods: night vision, perceive brightness only

Polytechnic University Inverse SystemsDTFT, Filter …eeweb.poly.edu/~yao/EE3054/DTFT_filter_design.pdf · Proof difficult, after we ... Yao Wang, Polytechnic University 9 Filter

Ultrasound imaging ch11 - New York University …eeweb.poly.edu/~yao/EL5823/Ultrasound_imaging_ch11.pdf · EL5823 Ultrasound Imaging Yao Wang, Polytechnic U., Brooklyn 3 Ultrasound

Yao-Chin Wang, Ph.D., MBA, CHIA

S&IP Consortium Course Material Code Development Speaker: Chun-Yao Wang

Yao Wang - New York University

Video Processing Communications Yao Wang Chapter8a

Basics of Video Courtesy of Professor Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu

Lixin Huang*, Ming Yang, Xiaojun Zhou, Qi Yao and Lin Wang

Video Processing Communications Yao Wang Chapter9b-13a

Projection Radiography - NYU Tandon School of …eeweb.poly.edu/~yao/EL5823/ProjectionRadiography_ch5.pdf · Projection Radiography Yao Wang Polytechnic University, ... - Impulse

Cluster, Grid and Cloud Computing Jazz Wang Yao-Tsung Wang jazz@nchc.org.tw Jazz Wang Yao-Tsung Wang jazz@nchc.org.tw

Basics of Video - Polyyao/videobook/Introduction.pdfBasics of Video Yao Wang Polytechnic University, Brooklyn, NY11201 yao@vision.poly.edu

Physics of Radiography - NYU Tandon School of Engineeringeeweb.poly.edu/~yao/EL5823/Lect2_PhysicsRadiography_ch4.pdf · Physics of Radiography Yao Wang Polytechnic University, Brooklyn,

Introduction, Review of Signals & Systems, Image Quality ...eeweb.poly.edu/~yao/EL6813/Lect1_Ch1_2_3_JM.pdfEL6813: Introduction Yao Wang, NYU 2 Lecture Outline • Overview of several

Video Processing Communications Yao Wang Chapter9a

~yao Yao Wangeeweb.poly.edu/~yao/EE4414/digitalTV_transmission.pdfDigital TV Transmission: Channel Coding and Mod Channel Coding and Modulation ulation Yao Wang Polytechnic University,

Basics of Videoeeweb.poly.edu/~yao/EL6123old/Introduction_no_color.pdf · Yao Wang, 2013 Video Basics 22. Color TV Broadcasting and Receiving LiLuminance, Chrominance, Audio Multiplexing

Projection Radiography - NYU Tandon School of …eeweb.poly.edu/~yao/EL6813/ProjectionRadiography_ch5.pdf · Projection Radiography Yao Wang, NYU 9 Bremsstrahlung • Continuous spectrum

Spatial Domain LinearSpatial Domain Linear Filteringeeweb.poly.edu/~yao/EL5123/lecture5_filtering.pdf · Spatial Domain LinearSpatial Domain Linear Filtering Yao Wang ... Averaging