Object Recognition Vision Class 2006-7. Object Classes

Object Recognition

Vision Class 2006-7

Object Classes

http://images.google.com/imgres?imgurl=http://www.ecomagic.org/fruition/trees-1.jpg&imgrefurl=http://www.ecomagic.org/fruition/friends.html&h=375&w=500&sz=99&tbnid=Crq2ZBkq7-kJ:&tbnh=95&tbnw=127&hl=en&start=10&prev=/images%3Fq%3Dtrees%26svnum%3D10%26hl%3Den%26lr%3D

http://images.google.com/imgres?imgurl=http://pinker.wjh.harvard.edu/photos/cambridge_boston/images/trees%2520in%2520Cambridge%2520Common.jpg&imgrefurl=http://pinker.wjh.harvard.edu/photos/cambridge_boston/pages/trees%2520in%2520Cambridge%2520Common.htm&h=600&w=900&sz=148&tbnid=aCzG9fGgmJAJ:&tbnh=96&tbnw=145&hl=en&start=1&prev=/images%3Fq%3Dtrees%26svnum%3D10%26hl%3Den%26lr%3D

http://images.google.com/imgres?imgurl=http://www.museum.state.il.us/isas/trees/whiteoak.jpeg&imgrefurl=http://www.museum.state.il.us/isas/trees/&h=344&w=515&sz=37&tbnid=bH2nsyp9QZYJ:&tbnh=85&tbnw=128&hl=en&start=14&prev=/images%3Fq%3Dtrees%26svnum%3D10%26hl%3Den%26lr%3D

http://images.google.com/imgres?imgurl=http://upload.wikimedia.org/wikipedia/en/thumb/3/3c/Birchandmaple.jpg/180px-Birchandmaple.jpg&imgrefurl=http://en.wikipedia.org/wiki/Tree&h=232&w=180&sz=29&tbnid=oYewFL9I-NcJ:&tbnh=103&tbnw=79&hl=en&start=20&prev=/images%3Fq%3Dtrees%26svnum%3D10%26hl%3Den%26lr%3D

http://images.google.com/imgres?imgurl=http://www.stellenboschwriters.com/araucaria.jpg&imgrefurl=http://www.stellenboschwriters.com/trees.html&h=400&w=283&sz=146&tbnid=F9JwGV-Zsc8J:&tbnh=120&tbnw=84&hl=en&start=37&prev=/images%3Fq%3Dtrees%26start%3D20%26svnum%3D10%26hl%3Den%26lr%3D%26sa%3DN

http://images.google.com/imgres?imgurl=http://www.jmadden.info/trees/Frosty%2520trees%25203.jpg&imgrefurl=http://www.jmadden.info/trees/trees.htm&h=416&w=312&sz=65&tbnid=nLwh0UJ6peMJ:&tbnh=122&tbnw=91&hl=en&start=51&prev=/images%3Fq%3Dtrees%26start%3D40%26svnum%3D10%26hl%3Den%26lr%3D%26sa%3DN

http://images.google.com/imgres?imgurl=http://www.uri.edu/personal/jsch5838/shoes.jpg&imgrefurl=http://www.uri.edu/personal/jsch5838/pics.html&h=564&w=451&sz=25&tbnid=845fGieS5oUJ:&tbnh=131&tbnw=104&hl=en&start=4&prev=/images%3Fq%3Dshoes%26svnum%3D10%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2005-13,GGLD:en

http://images.google.com/imgres?imgurl=http://www.top-trendy.com/images/DC%2520Shoes%2520Womens%2520Lunamm3.jpg&imgrefurl=http://www.top-trendy.com/images/&h=500&w=500&sz=39&tbnid=KbkKC9kiL_wJ:&tbnh=127&tbnw=127&hl=en&start=6&prev=/images%3Fq%3Dshoes%26svnum%3D10%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2005-13,GGLD:en

http://images.google.com/imgres?imgurl=http://www.ameliacaruso.com/pinkdotshoesweb.jpg&imgrefurl=http://www.ameliacaruso.com/shoe.htm&h=247&w=325&sz=78&tbnid=UBGvZqY30iAJ:&tbnh=86&tbnw=114&hl=en&start=27&prev=/images%3Fq%3Dshoes%26start%3D20%26svnum%3D10%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2005-13,GGLD:en%26sa%3DN

http://images.google.com/imgres?imgurl=http://www.muffys.com/images/500.JPG&imgrefurl=http://www.muffys.com/modern_traditional.html&h=480&w=640&sz=32&tbnid=gznXlksZIK8J:&tbnh=101&tbnw=135&hl=en&start=34&prev=/images%3Fq%3Dshoes%26start%3D20%26svnum%3D10%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2005-13,GGLD:en%26sa%3DN

http://images.google.com/imgres?imgurl=http://www.mydivashop.com/Shoes%2520-%2520sexy%2520black%2520open%2520toe%2520sandal.jpg&imgrefurl=http://www.mydivashop.com/winter_clearance.htm&h=320&w=320&sz=11&tbnid=Ub88RLfQ0V0J:&tbnh=113&tbnw=113&hl=en&start=57&prev=/images%3Fq%3Dshoes%26start%3D40%26svnum%3D10%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2005-13,GGLD:en%26sa%3DN

http://images.google.com/imgres?imgurl=http://www.onesmallchild-accessories.com/Shoes-maryjanes.jpg&imgrefurl=http://www.onesmallchild-accessories.com/Shoes-girl.asp&h=720&w=1200&sz=35&tbnid=K7P2imhtuk8J:&tbnh=90&tbnw=150&hl=en&start=60&prev=/images%3Fq%3Dshoes%26start%3D40%26svnum%3D10%26hl%3Den%26lr%3D%26rls%3DGGLD,GGLD:2005-13,GGLD:en%26sa%3DN

http://images.google.com/imgres?imgurl=http://www.ncbi.nlm.nih.gov/genome/guide/img/tasha_image.jpg&imgrefurl=http://www.ncbi.nlm.nih.gov/genome/guide/dog/&h=1536&w=1024&sz=237&tbnid=mqZeA11z--0J:&tbnh=150&tbnw=100&hl=en&start=17&prev=/images%3Fq%3Ddog%2B%26svnum%3D10%26hl%3Den%26lr%3D%26sa%3DG

http://images.google.com/imgres?imgurl=http://www.dogart.net/images/index.1.gif&imgrefurl=http://www.dogart.net/&h=375&w=298&sz=73&tbnid=KLRZCl5XkUEJ:&tbnh=118&tbnw=93&hl=en&start=6&prev=/images%3Fq%3Ddog%2B%26svnum%3D10%26hl%3Den%26lr%3D%26sa%3DG

http://images.google.com/imgres?imgurl=http://i7.photobucket.com/albums/y295/RachelMorris/DC01.jpg&imgrefurl=http://www.suite101.com/discussion.cfm/mixed_breed_dogs/103260&h=500&w=445&sz=30&tbnid=Hkh6KsJq4jQJ:&tbnh=127&tbnw=113&hl=en&start=34&prev=/images%3Fq%3Ddog%2B%26start%3D20%26svnum%3D10%26hl%3Den%26lr%3D%26sa%3DN

http://images.google.com/imgres?imgurl=http://animals.timduru.org/dirlist/dog/dog-Trucker.jpg&imgrefurl=http://animals.timduru.org/dirlist/dog/&h=256&w=384&sz=13&tbnid=3au8AbG1xAQJ:&tbnh=79&tbnw=119&hl=en&start=59&prev=/images%3Fq%3Ddog%2B%26start%3D40%26svnum%3D10%26hl%3Den%26lr%3D%26sa%3DN

http://images.google.com/imgres?imgurl=http://www.ezthemes.com/previews/d/dog.jpg&imgrefurl=http://rinnan.net/dog.htm&h=187&w=250&sz=40&tbnid=8_xeHYxVqTwJ:&tbnh=79&tbnw=106&hl=en&start=16&prev=/images%3Fq%3Ddog%2B%26svnum%3D10%26hl%3Den%26lr%3D%26sa%3DN

http://flickr.com/photos/turniptopia/63553871/

http://flickr.com/photos/35618275@N00/59100598/

Individual Recognition

Brief History: Recognition

Mental Rotation

Three-point alignment

Huttenlocher D. & Ullman, S. Recognizing solid objects by alignment with

an image. Int. J. Computer Vision 5(3), 195 – 212, 1990.

Object Alignment

Given three model points P1, P2, P3, and three image points p1, p2, p3, there is a unique transformation (rotation, translation, scale)

that aligns the model with the image .

(SR + d)Pi = pi

Alignment -- comments

• The projection is orthographic projection (combined with scaling).

• The 3 points are required to be non-collinear.

• The transformation is determined up to a reflection of the points about the image plane and translation in depth.

Car Recognition

Car Models

Alignment: Cars

Alignment: Mismatch

Brief History: Classification

RBC

Structural Description

G2

G4

G3

G1

G4

Above

Right-of Left-of

Touch

Classification: Current Approaches

Visual Class: Similar Arrangement of Shared Components

Optimal Class Components?

• Large features are too rare

• Small features are found

everywhere

Find features that carry the highest amount of information

Entropy

Entropy: H = -Σp(xi) log2 p(xi)

x = 0 1 H p = 0.5 0.5 ?

0.1 0.9 0.47 0.01 0.99 0.08

Mutual information

H(C) when F=1 H(C) when F=0

I(C;F) = H(C) – H(C/F)

F=1 F=0

H(C)

))(()()( xPLogxPxH

Mutual Information I

X alone: p(x) = 0.5, 0.5 H = 1.0

X given Y: Y = 0 Y = 1

p(x) = 0.8, 0.2 H = 0.72

p(x) = 0.1, 0.9H = 0.47

H(X|Y) = 0.5*0.72 + 0.5*0.47 = 0.595

H(X) – H(X|Y) = 1 – 0.595 = 0.405

I(X,Y) = 0.405

Mutual Information II

yx ypxp

yxpyxpYXI

, )()(

),(log),(),(

Computing MI from Examples

• Mutual information can be measured from examples:

100 Faces 100 Non-faces

Feature: 44 times 6 times

Mutual information: 0.1525H(C) = 1, H(C|F) = 0.8475

Mutual Info vs. Threshold

0.00 20.00 40.00

Detection threshold

Mu

tu

al In

fo

forehead

hairline

mouth

eye

nose

nosebridge

long_hairline

chin

twoeyes

Fragments Selection

• For a set of training images:• Generate candidate fragments

– Measure p(F/C), p(F/NC)

• Compute mutual information• Select optimal fragment • After k fragments: Maximizing the minimal addition in mutual

information with respect to each of the first k fragments

Highly Informative Face Fragments

Horse-class features

Car-class features

Fragment ‘Weight’

)|(

)|()(

CFP

CFPFR

Likelihood ratio:

Weight of F:

))(()( FRLogFw

Decision:

∑wi Fi > θ

Combining fragments

kkFW

w1 wkw2

D1 D2Dk

Feature detection :

Within a region

S(F,I) > Threshold

Fragment-based Classification

Leibe, Schiele 2003

Fergus, Perona, Zisserman 2003

Agarwal, Roth 2002

Recognition: ROC Curves

Training & Test Images

• Frontal faces without distinctive features (K:496,W:385)• Minimize background by cropping• Training images for extraction: 32 for each class• Training images for evaluation: 100 for each class• Test images: 253 for Western and 364 for Korean

Training – Fragment Extraction

WesternFragment

Score 0.92 0.82 0.77 0.76 0.75 0.74 0.72 0.68 0.67 0.65

Weight 3.42 2.40 1.99 2.23 1.90 2.11 6.58 4.14 4.12 6.47

KoreanFragment

Score 0.92 0.82 0.77 0.76 0.75 0.74 0.72 0.68 0.67 0.65

Weight 3.42 2.40 1.99 2.23 1.90 2.11 6.58 4.14 4.12 6.47

Extracted Fragments

Classifying novel images

Westerner

Korean

Unknown

kF

wF

Detect FragmentsCompare

Summed WeightsDecision

)w()k( FWFW

50%

60%

70%

80%

90%

100%

1 2 3 4 5 6 7 8 9 10 20 30 40 50 60 70 80 90 100

Number of fragments

Co

rre

ct -

Err

or

(%)

Eastern test set Western test setEffect of Number of Fragments

• 7 fragments: 95%, 80 fragments: 100%• Inherent redundancy of the features• Slight violation of independence assumption

Harris Corner Detection

Ix2 IxIy

IxIy

Iy2

∑

Harris Corner Operator

<Ix2> < IxIy<

< < yIxI < yI2>

H=

Averages within a neighborhood.

Corner: The two eigenvalues λ1, λ2 are large

Indirectly:

‘Corner’ = det(H) – k trace2(H)

Harris Corner Examples

SIFT descriptor

David G. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, 60, 2 (2004), pp. 91-110

Example :

4*4 sub-regions

Histogram of 8 orientations in each

V = 128 values:

g1,1,…g1,8,… …g16,1,…g16,8

Constellation of Patches Using interest points

Fegurs, Perona, Zissermann 2003

A CAPTCHATM is a program that can generate and grade tests that most humans can pass, but current computer programs can't pass.

Classification: Class Examples

Object Recognition Vision Class 2006-7. Object Classes

Documents

Semi-supervised learning and recognition of object classespeople.csail.mit.edu/fergus/papers/fergus_cogvisys_chapter.pdf · Semi-supervised learning and recognition of object classes

creating object classes

Classes & object

Computational Vision: Object Recognition Object Recognition Jeremy Wyatt

Introduction to Object Recognition CS773C Machine Intelligence Advanced Applications Spring 2008: Object Recognition

Exploiting Object Dynamics for Recognition and Control · Exploiting Object Dynamics for Recognition and ... Exploiting Object Dynamics for Recognition and Control by ... The general

Object recognition - TU Chemnitz€¦ · Object recognition !! Hierarchical models of object"!recognition! Suggested reading:! • Fukushima, K (1980) Neocognitron: A self-organizing

Object recognition

Object Class Recognition

Chapter 15 Object Recognition - USFr1k/MachineVisionBook/MachineVision.files/Machi… · Chapter 15 . Object Recognition . An object recognition system finds objects in the real world

Object Recognition. So what does object recognition involve?

Object Recognition Szeliski Chapter 14. Recognition

Visual Object Recognition Computational Models and ...klab.tch.harvard.edu/academia/classes/Neuro230/... · • Object agnosias Warrington and Shallice. Brain (1984) 107:829-854 Areas

Visual Object Recognition

Machine Learning for Object Recognition - ut · PDF fileMachine Learning for Object Recognition ... Object Recognition as a Classification Task. ... mashine_learning.pptx

Object Reading: Text Recognition for Object Recognition · object recognition, where we outperform other state-of-the-art saliency methods for object recognition on the PASCAL VOC

Visual Object Category Recognition

Multispectral Image Analysis for Object Recognition and ... · Multispectral Image Analysis for Object Recognition and Classification ... Object recognition can be accomplished with

Iccv2009 recognition and learning object categories p2 c01 - recognizing a large number of object classes

Supervised object recognition, unsupervised object ...courses.csail.mit.edu/6.869/lectnotes/lect18/lect18-slides.pdfSupervised object recognition, unsupervised object recognition then