32
Multiscale Analysis of Images Gilad Lerman Math 5467 (stealing slides from Gonzalez & Woods, and Efros)

Multiscale Analysis of Images

  • Upload
    heba

  • View
    74

  • Download
    0

Embed Size (px)

DESCRIPTION

Multiscale Analysis of Images. Gilad Lerman Math 5467 (stealing slides from Gonzalez & Woods, and Efros). The Multiscale Nature of Images. Recall: Gaussian pre-filtering. G 1/8. G 1/4. Gaussian 1/2. Solution: filter the image, then subsample - PowerPoint PPT Presentation

Citation preview

Page 1: Multiscale Analysis of Images

Multiscale Analysis of Images

Gilad LermanMath 5467

(stealing slides from Gonzalez & Woods, and Efros)

Page 2: Multiscale Analysis of Images

The Multiscale Nature of Images

Page 3: Multiscale Analysis of Images

Recall: Gaussian pre-filtering

G 1/4

G 1/8

Gaussian 1/2

Solution: filter the image, then subsample• Filter size should double for each ½ size reduction.

Page 4: Multiscale Analysis of Images

Subsampling with Gaussian pre-filtering

G 1/4 G 1/8Gaussian 1/2

Solution: filter the image, then subsample• Filter size should double for each ½ size reduction.

Page 5: Multiscale Analysis of Images

Image Pyramids

Known as a Gaussian Pyramid [Burt and Adelson, 1983]• In computer graphics, a mip map [Williams, 1983]• A precursor to wavelet transform

Page 6: Multiscale Analysis of Images

A bar in the big images is a hair on the zebra’s nose; in smaller images, a stripe; in the smallest, the animal’s nose

Figure from David Forsyth

Page 7: Multiscale Analysis of Images

Gaussian pyramid construction

filter mask

Repeat• Filter• Subsample

Until minimum resolution reached • can specify desired number of levels (e.g., 3-level pyramid)

The whole pyramid is only 4/3 the size of the original image!

Page 8: Multiscale Analysis of Images

What does blurring take away? (recall)

original

Page 9: Multiscale Analysis of Images

What does blurring take away? (recall)

smoothed (5x5 Gaussian)

Page 10: Multiscale Analysis of Images

High-Pass filter

smoothed – original

Page 11: Multiscale Analysis of Images

Band-pass filtering

Laplacian Pyramid (subband images)Created from Gaussian pyramid by subtraction

Gaussian Pyramid (low-pass images)

Page 12: Multiscale Analysis of Images

Laplacian Pyramid

How can we reconstruct (collapse) this pyramid into the original image?

Need this!

Originalimage

Page 13: Multiscale Analysis of Images
Page 14: Multiscale Analysis of Images
Page 15: Multiscale Analysis of Images

Image resampling (interpolation)So far, we considered only power-of-two subsampling

• What about arbitrary scale reduction?• How can we increase the size of the image?

Recall how a digital image is formed

• It is a discrete point-sampling of a continuous function• If we could somehow reconstruct the original function, any new image could be generated, at any resolution and

scale

1 2 3 4 5

d = 1 in this example

Page 16: Multiscale Analysis of Images

Image resamplingSo far, we considered only power-of-two subsampling

• What about arbitrary scale reduction?• How can we increase the size of the image?

Recall how a digital image is formed

• It is a discrete point-sampling of a continuous function• If we could somehow reconstruct the original function, any new image could be generated, at any resolution and

scale

1 2 3 4 5

d = 1 in this example

Page 17: Multiscale Analysis of Images

Image resamplingSo what to do if we don’t know

1 2 3 4 52.5

1 d = 1 in this example

• Answer: guess an approximation• Can be done in a principled way: filtering

Image reconstruction• Convert to a continuous function

• Reconstruct by cross-correlation:

Page 18: Multiscale Analysis of Images

Resampling filtersWhat does the 2D version of this hat function look like?

Better filters give better resampled images• Bicubic is common choice

Why not use a Gaussian?

What if we don’t want whole f, but just one sample?

performs linear interpolation

(tent function) performs bilinear interpolation

Page 19: Multiscale Analysis of Images

Bilinear interpolationSmapling at f(x,y):

Page 20: Multiscale Analysis of Images

Applications (more efficient with wavelets)

•Coding High frequency coefficients need fewer bits, so one can collapse a compressed Laplacian pyramid•Denoising/Restoration Setting “most” coefficients in Laplacian pyramid to zero•Image Blending

Page 21: Multiscale Analysis of Images

Application: Pyramid Blending

0

1

0

1

0

1

Left pyramid Right pyramidblend

Page 22: Multiscale Analysis of Images

Image Blending

Page 23: Multiscale Analysis of Images

Feathering

01

01

+

=Encoding transparency

I(x,y) = (R, G, B, )

Iblend = left Ileft + right Iright

Ileft Iright

leftright

Page 24: Multiscale Analysis of Images

Affect of Window Size

0

1 left

right0

1

Page 25: Multiscale Analysis of Images

Affect of Window Size

0

1

0

1

Page 26: Multiscale Analysis of Images

Good Window Size

0

1

“Optimal” Window: smooth but not ghostedBurt and Adelson (83): Choose by pyramids…

Page 27: Multiscale Analysis of Images

Pyramid Blending

Page 28: Multiscale Analysis of Images

Pyramid Blending (Color)

Page 29: Multiscale Analysis of Images

Laplacian Pyramid: BlendingGeneral Approach:

1. Build Laplacian pyramids LA and LB from images A and B2. Build a Gaussian pyramid GR from selected region R (black

white corresponding images)3. Form a combined pyramid LS from LA and LB using nodes

of GR as weights:• LS(i,j) = GR(I,j,)*LA(I,j) + (1-GR(I,j))*LB(I,j)

4. Collapse the LS pyramid to get the final blended image

Page 30: Multiscale Analysis of Images

laplacianlevel

4

laplacianlevel

2

laplacianlevel

0

left pyramid right pyramid blended pyramid

Page 31: Multiscale Analysis of Images

Blending Regions

Page 32: Multiscale Analysis of Images

Simplification: Two-band BlendingBrown & Lowe, 2003

• Only use two bands: high freq. and low freq.• Blends low freq. smoothly• Blend high freq. with no smoothing: use binary mask

Can be explored in a project…