27
1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California at Berkeley

1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

  • View
    222

  • Download
    0

Embed Size (px)

Citation preview

Page 1: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

1

The Ecological Statistics of Grouping by Similarity

Charless Fowlkes, David Martin, Jitendra Malik

Computer Science Division

University of California at Berkeley

Page 2: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

2

Brunswik & Kamiya 1953

• Thesis: Gestalt rules reflect the structure of the natural world

• Attempted to validate the grouping rule of proximity of similars

• Brunswik was ahead of his time… we now have the tools. Egon Brunswik (1903-1955)

Page 3: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

3

P(grouping | image features)

• Proximity: Brunswik/Kamiya 53

• Good Continuation: Geisler, et. al. 01, Ren/Malik 02

• Similarity: Martin/Fowlkes/Malik 01, and this talk

P(image features)

• Edges/Filters/Coding: Ruderman 94/97, Dong/Atick 95, Olshausen/Field 96, Bell/Sejnowski 97, Hateren/Schaaf 98, Buccigrossi/Simoncelli 99, Alvarez/Gousseau/Morel 99, Huang/Mumford 99

Natural Image Statistics

Page 4: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

4

Human Segmentation Dataset

Page 5: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

5

Details…

• 30 subjects, age 19-23• 8 months• 1,458 person hours• 1,020 Corel images• 11,595 Segmentations

– 5,555 color, 5,554 gray, 486 inverted/negated

“You will be presented a photographic image. Divide the image into some number of segments, where the segments represent “things” or “parts of things” in the scene. The number of segments is up to you, as it depends on the image. Something between 2 and 30 is likely to be appropriate. It is important that all of the segments have approximately equal importance.”

Page 6: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

6

Similarity Cuesa) distanceb) patch-based similarity cuesc) edge-based cues

What image measurements allow us to gauge the probability that pixels i and j belong to the same segment?

Page 7: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

7

Brightness, Color and Texture Features

• Brightness and Color Features– CIE L*a*b* color-space

– Estimate distributions of L*, a* and b* values inside analysis window

• Texture Features– Filter image with even/odd-symmetric filters which

resemble V1 receptive fields.

– Estimate distribution of vector-quantized filter responses inside analysis window

• Compare histograms using chi-squared difference

i ii

ii

hg

hghg

22 )(

2

1),(

Page 8: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

8

Evaluate

ImageEstimated

Similarity (W)

Edge Cues

Region Cues

Learning Similarity

Human Segmentations

GroundtruthSimilarity (S)

Wij is our estimate of the probability that i and j lie in the same segment

Page 9: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

9

Classifiers for Cue Combination

• Classification Trees– Top-down splits to maximize entropy, error bounded

• Density Estimation– Adaptive bins using k-means

• Logistic Regression, 3 variants– Linear and quadratic terms– Confidence-rated generalization of AdaBoost (Schapire&Singer)

• Hierarchical Mixtures of Experts (Jordan&Jacobs)– Up to 8 experts, initialized top-down, fit with EM

• Support Vector Machines (libsvm, Chang&Lin)– Gaussian kernel, -parameterization

Range over bias, complexity, parametric/non-parametric

Page 10: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

10

Combining Similarity Cues

Brightness

Color

Texture

Contour CuesBrightness

Color

Texture

Patch Cues

Distance

∑ Wij

Logistic Regression

Page 11: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

11

Two Evaluation Methods

1. Precision-Recall of same-segment pairs– Precision is P(Sij=1 | Wij > t)

– Recall is P(Wij > t | Sij = 1)

2. Mutual Information between W and S

Groundtruth SijEstimate Wij

∫p(s,w) log p(s)p(w) / p(s,w)

Page 12: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

12

Precision-Recall Curves

Goal

FewerFalsePositives

Fewer Misses

Page 13: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

13

Individual Features

Patches Gradients

Page 14: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

14

Combining Cues

Page 15: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

15

Affinity Model vs. Humans

Page 16: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

16

Conclusions

• Both Edges and Patches are useful.

• Texture gradients can be quite powerful

• Color patches better than gradients

• Brightness gradients better than patches.

• Proximity is a result, not a cause of grouping

Page 17: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

17

The End

Page 18: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

18

Page 19: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

19

Page 20: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

20

Conclusions

1. Common Wisdom: Use patches only / Use edges only Finding : Use both.

2. Common Wisdom : Must use patches for texture Finding : Not true, texture gradient is powerful

3. Common Wisdom : Color provides lots of information Finding : True, but texture is better

4. Common Wisdom : Brightness patches are a poor cue Finding : True, shading and shadows

5. Common Wisdom : Proximity is a grouping cue Finding : Proximity is a result, not a cause of grouping

Page 21: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

21

Overview

• Gathering the Human Segmentation Dataset• Computing Similarity Cues• Modeling Pairwise Similarity• Evaluating the Relative Power of Similarity Cues

Page 22: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

22

Overview

• Gathering the Human Segmentation Dataset• Computing Similarity Cues• Modeling Pairwise Similarity

• Evaluating the Relative Power of Similarity Cues

Page 23: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

23

Pairwise similarity.

• Wij is our estimate of the probability that pixels i and j

belong to the same segment given the region and boundary information

Image

EstimatedSimilarities

Boundary

Cues

Region Cues

W

Page 24: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

24

Overview

• Gathering the Human Segmentation Dataset• Computing Similarity Cues• Modeling Pairwise Similarity

• Evaluating the Relative Power of Similarity Cues

Page 25: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

25

Overview

• Gathering the Human Segmentation Dataset• Computing Similarity Cues• Modeling Pairwise Similarity

• Evaluating the Relative Power of Similarity Cues

Page 26: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

26

Page 27: 1 The Ecological Statistics of Grouping by Similarity Charless Fowlkes, David Martin, Jitendra Malik Computer Science Division University of California

27

ROC vs. Precision/Recall

Sig

nal

Truth

P N

P TP FP

N FN TN

/

/

/

/

= Specificity

= Sensitivity

Precision = TP / (TP+FP) =

Recall = TP / (TP+FN) =

PR Curve

Hit Rate = TP / (TP+FN) =

False Alarm Rate = FP / (FP+TN) =

ROC Curve