CS664 Lecture #21: SIFT, object recognition, dynamic programming

Some material taken from:

Sebastian Thrun, Stanford (http://cs223b.stanford.edu/)

Yuri Boykov, Western Ontario

David Lowe, UBC (http://www.cs.ubc.ca/~lowe/keypoints/)

Announcements

Paper report due on 11/15
Next quiz Tuesday 11/15
– Coverage through next lecture
PS#2 due today (November 8)
– Code is due today; you can hand in the writeup without penalty until 11:59PM Thursday (November 10)

There will be a (short) PS3, due on the last day of classes.

Invariant local features

– Invariant to affine transformations, or changes in camera gain and bias

SIFT Features

Keypoint detection

Laplacian is a center-surround filter
– Very high response at a dark point surrounded by bright stuff
• Very low response at the opposite
In practice, often computed as a difference of Gaussians (DOG) filter:
– $(I * h_{\sigma_1}) - (I * h_{\sigma_2})$, where $*$ is convolution, $h_\sigma$ is a Gaussian of width $\sigma$, and $\sigma_1/\sigma_2$ is around 2
– Scale parameter $\sigma$ is important

Keypoints are maxima (minima) of DOG that occur at multiple scales
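
To make the center-surround behaviour concrete, here is a minimal sketch of the DOG filter on a 1D signal. The 1D setting, the 3-sigma kernel truncation, the clamped borders, and the function names are all assumptions made for illustration; real SIFT works on 2D images at many scales.

```python
import math

def gaussian_kernel(sigma):
    """Return a normalized 1D Gaussian kernel truncated at 3 sigma."""
    radius = int(math.ceil(3 * sigma))
    weights = [math.exp(-(i * i) / (2.0 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def blur(signal, sigma):
    """Convolve a 1D signal with a Gaussian, clamping indices at the borders."""
    kernel = gaussian_kernel(sigma)
    radius = len(kernel) // 2
    n = len(signal)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = min(max(i + k - radius, 0), n - 1)
            acc += w * signal[j]
        out.append(acc)
    return out

def dog(signal, sigma1, sigma2):
    """Difference of Gaussians: (I * h_sigma1) - (I * h_sigma2)."""
    wide = blur(signal, sigma1)
    narrow = blur(signal, sigma2)
    return [a - b for a, b in zip(wide, narrow)]

# A dark point surrounded by bright values (illustrative data).
signal = [1.0] * 21
signal[10] = 0.0
response = dog(signal, sigma1=2.0, sigma2=1.0)   # sigma1 / sigma2 = 2
peak_index = max(range(len(response)), key=lambda i: response[i])
```

With this input the strongest positive response sits at the dark point (index 10), and the response is zero far from it, where the signal is constant.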

Scale-space pyramid

All scales must be examined to identify scale-invariant features
DOG pyramid (Burt & Adelson, 1983)
[Figure: DOG pyramid construction, with Resample and Subtract steps]
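
The resample and subtract steps can be sketched roughly as follows, again in 1D. The 5-tap binomial smoothing kernel, the factor-of-2 resampling, and the function names are assumptions for illustration, not necessarily the exact construction on the slide.

```python
def smooth(signal):
    """Blur with the 5-tap binomial kernel [1, 4, 6, 4, 1] / 16 (borders clamped)."""
    kernel = [1 / 16, 4 / 16, 6 / 16, 4 / 16, 1 / 16]
    n = len(signal)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = min(max(i + k - 2, 0), n - 1)
            acc += w * signal[j]
        out.append(acc)
    return out

def dog_pyramid(signal, levels):
    """Return a list of band-pass (DOG-like) levels, finest first."""
    bands = []
    current = list(signal)
    for _ in range(levels):
        blurred = smooth(current)
        # Subtract: difference between two smoothing scales at this level.
        bands.append([c - b for c, b in zip(current, blurred)])
        # Resample: keep every second sample of the blurred signal.
        current = blurred[::2]
    return bands

signal = [0.0] * 16 + [1.0] * 16   # a step edge (illustrative data)
pyramid = dog_pyramid(signal, levels=3)
level_lengths = [len(band) for band in pyramid]
```

Each level halves the resolution, so a feature that is too large to stand out at one level can still produce an extremum at a coarser one.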

Rotation invariance

Create histogram of local gradient directions computed at selected scale
Assign canonical orientation at peak of smoothed histogram
Each key specifies stable 2D coordinates (x, y, scale, orientation)
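
A minimal sketch of the orientation assignment, under several assumptions: the patch is a small list of rows, gradients are central differences, the histogram has 36 bins weighted by gradient magnitude, and smoothing is a circular [1, 2, 1] / 4 filter. None of these specifics come from the slide.

```python
import math

def canonical_orientation(patch, num_bins=36):
    """Return the peak gradient direction of a 2D patch, in degrees."""
    rows, cols = len(patch), len(patch[0])
    width = 360.0 / num_bins
    hist = [0.0] * num_bins
    for y in range(1, rows - 1):
        for x in range(1, cols - 1):
            dx = patch[y][x + 1] - patch[y][x - 1]
            dy = patch[y + 1][x] - patch[y - 1][x]
            magnitude = math.hypot(dx, dy)
            if magnitude == 0.0:
                continue
            angle = math.degrees(math.atan2(dy, dx)) % 360.0
            # Bins are centred on multiples of `width` degrees.
            hist[int(round(angle / width)) % num_bins] += magnitude
    # Smooth the circular histogram; hist[i - 1] wraps around at i = 0.
    smoothed = [(hist[i - 1] + 2 * hist[i] + hist[(i + 1) % num_bins]) / 4.0
                for i in range(num_bins)]
    peak = max(range(num_bins), key=lambda i: smoothed[i])
    return peak * width

# Intensity increases along x, so the gradient points along +x.
patch = [[float(x) for x in range(5)] for _ in range(5)]
orientation = canonical_orientation(patch)
```

Rotating the patch rotates the peak by the same angle, which is what lets the descriptor be expressed relative to the canonical orientation.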

SIFT feature vector

Note: this is somewhat simplified; there are a number of somewhat ad hoc steps, but the whole thing works pretty well

Hough transform

Motivation: find global features

Example: vanishing points

From edges to lines

An edge should “vote” for all lines that go (roughly) through it
– Find the line with lots of votes
– A line is parameterized by slope m and intercept b (y = mx + b)
• This is actually a lousy choice, as it turns out

Hough transform for lines
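
A minimal sketch of the voting scheme. It swaps the (m, b) parameterization for the commonly used (ρ, θ) form, ρ = x cos θ + y sin θ, which stays bounded for vertical lines where the slope m does not. The bin sizes and function names are assumptions for illustration.

```python
import math

def hough_lines(points, num_thetas=180, rho_step=1.0):
    """Accumulate votes for lines rho = x*cos(theta) + y*sin(theta)."""
    votes = {}
    for x, y in points:
        for t in range(num_thetas):
            theta = math.pi * t / num_thetas
            rho = x * math.cos(theta) + y * math.sin(theta)
            key = (t, int(round(rho / rho_step)))
            votes[key] = votes.get(key, 0) + 1
    return votes

def best_line(points, num_thetas=180, rho_step=1.0):
    """Return (theta in degrees, rho, vote count) of the strongest line."""
    votes = hough_lines(points, num_thetas, rho_step)
    (t, r), count = max(votes.items(), key=lambda kv: kv[1])
    return math.degrees(math.pi * t / num_thetas), r * rho_step, count

# Ten edge points on the vertical line x = 5, plus two outliers.
points = [(5, y) for y in range(0, 100, 10)] + [(20, 33), (40, 7)]
theta_deg, rho, count = best_line(points)
```

Here the vertical line, which has no finite slope, is found as θ = 0 and ρ = 5 with all ten inliers voting for it.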

SIFT-based recognition

Given: a database of features
– Computed from model library
We want to probe for the features we see in the image
Use an approximate nearest-neighbor scheme
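
The probing step can be sketched as below. This uses exact brute-force nearest-neighbor search as a stand-in for the approximate scheme; the 4-element descriptors, labels, and function names are made up for illustration.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two descriptors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def match_features(image_features, database):
    """For each image descriptor, return (label, squared distance) of its
    nearest database entry. `database` is a list of (label, descriptor)."""
    matches = []
    for descriptor in image_features:
        label, model_descriptor = min(
            database,
            key=lambda entry: squared_distance(descriptor, entry[1]))
        matches.append((label, squared_distance(descriptor, model_descriptor)))
    return matches

# Hypothetical database computed from a model library.
database = [
    ("mug", [0.9, 0.1, 0.0, 0.2]),
    ("book", [0.1, 0.8, 0.3, 0.0]),
    ("phone", [0.0, 0.2, 0.9, 0.7]),
]
probe = [[0.85, 0.15, 0.05, 0.2], [0.05, 0.25, 0.8, 0.75]]
matches = match_features(probe, database)
labels = [label for label, _ in matches]
```

Brute force costs one distance computation per database entry per probe, which is why a large database calls for an approximate scheme.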

SIFT is quite robust

SIFT DEMO!

Recognition

Classical recognition (Roberts, 1962)
– http://www.packet.cc/files/mach-per-3D-solids.html
• Influenced by J. J. Gibson
Given: set of objects of known fixed shape
Find: position and pose (“placement”)
Match model features to image features
Models and/or image can be 2D or 3D
– 2D to 2D example: OCR
– Common case is 3D model, 2D image

Face recognition

Extensively studied special case
Approaches: intensities or features
– Intensities: SSD (squared L2 distance) or variants
– Features: extract eyes, nose, chin, etc.
Intensities seem to work more reliably
– Images need to be registered
– Famous application of PCA: eigenfaces
Nothing really works with serious changes in lighting, profile, appearance
– FERET database has good evaluation metrics

Combinatorial search

Possible formulation of recognition: match each model feature to an image feature
– Some model features can be occluded
This leads to an intractable problem with lots of backtracking
– “Interpretation tree” search
– Especially bad with unreliable features
The methods that work tend to avoid explicit search over matchings
– Robust to feature unreliability
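
To see where the backtracking comes from, here is a rough sketch of an interpretation-tree search over 2D point features. Using agreement of pairwise distances as the consistency check, and representing an occluded feature as `None`, are assumptions for illustration rather than the formulation used in lecture.

```python
import math

def interpretation_tree(model, image, tol=1e-6):
    """Depth-first search over assignments of model points to image points.

    Each model point gets an image point index or None (occluded). A branch
    is kept only while pairwise distances between matched points agree.
    Returns the assignment with the most matches and the nodes visited.
    """
    best = {"assignment": [], "matched": -1}
    visited = [0]

    def consistent(assignment, candidate):
        i = len(assignment)               # model point being assigned
        for k, j in enumerate(assignment):
            if j is None:
                continue
            if j == candidate:            # image point already used
                return False
            d_model = math.dist(model[i], model[k])
            d_image = math.dist(image[candidate], image[j])
            if abs(d_model - d_image) > tol:
                return False
        return True

    def search(assignment):
        visited[0] += 1
        if len(assignment) == len(model):
            matched = sum(j is not None for j in assignment)
            if matched > best["matched"]:
                best["assignment"], best["matched"] = list(assignment), matched
            return
        for candidate in list(range(len(image))) + [None]:
            if candidate is None or consistent(assignment, candidate):
                search(assignment + [candidate])

    search([])
    return best["assignment"], visited[0]

model = [(0, 0), (3, 0), (0, 4)]
image = [(10, 10), (13, 10), (10, 14), (50, 50)]   # shifted model plus clutter
assignment, nodes_visited = interpretation_tree(model, image)
```

Without pruning, the tree has (n + 1)^m leaves for m model features and n image features, and unreliable features force a loose tolerance that prunes far fewer branches.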

Distance-based matching

Intuition: all points (features) in model should be close to some point in image
– We will assume binary features, usually edges
– All points assumption means no occlusions
– Many image points will be unmatched
Naively posed, this is very hard
– For each point in the model, find the distance to the nearest point in the image
– Do this for each placement of the model
– How can we make this fast?
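
The naive formulation can be written down directly. This sketch restricts placements to integer translations and sums the nearest-point distances; both choices, and the function names, are assumptions for illustration.

```python
import math

def match_cost(model, image, dx, dy):
    """Sum, over model points shifted by (dx, dy), of the distance to the
    nearest image point."""
    total = 0.0
    for mx, my in model:
        px, py = mx + dx, my + dy
        total += min(math.hypot(px - ix, py - iy) for ix, iy in image)
    return total

def best_translation(model, image, search_range):
    """Try every translation in the range; return (cost, dx, dy) of the best."""
    best = None
    for dx in search_range:
        for dy in search_range:
            cost = match_cost(model, image, dx, dy)
            if best is None or cost < best[0]:
                best = (cost, dx, dy)
    return best

model = [(0, 0), (1, 0), (2, 0), (2, 1)]
# The model shifted by (4, 3), plus two unmatched clutter points.
image = [(4, 3), (5, 3), (6, 3), (6, 4), (0, 9), (9, 0)]
cost, dx, dy = best_translation(model, image, range(0, 8))
```

The work is proportional to (number of placements) × (model points) × (image points), and the inner nearest-point search is repeated from scratch for every placement.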

Dynamic programming

General technique to speed up computations by re-using results
– Many successful applications in vision
Canonical examples:
– Shortest paths (Dijkstra’s algorithm)
• Many applications in vision (curves)
– Integral images
• Efficiently compute the sum of any quantity over an arbitrary rectangle
• Useful for image smoothing, stereo, face detection, etc.

Shortest paths via DP

Dijkstra’s algorithm

[Figure legend]
- processed nodes (distance to A is known)
- active nodes (front)
- active node with the smallest distance value
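
A minimal sketch of Dijkstra's algorithm using the same vocabulary as the legend: a set of processed nodes and a priority queue holding the active front. The adjacency-list graph format and the example graph are assumptions for illustration.

```python
import heapq

def dijkstra(graph, source):
    """Shortest distances from source in a graph given as
    {node: [(neighbor, weight), ...]} with non-negative weights."""
    dist = {source: 0}
    processed = set()          # nodes whose distance is final
    front = [(0, source)]      # active nodes, ordered by tentative distance
    while front:
        d, node = heapq.heappop(front)   # active node with smallest distance
        if node in processed:
            continue                     # stale queue entry
        processed.add(node)
        for neighbor, weight in graph.get(node, []):
            new_d = d + weight
            if new_d < dist.get(neighbor, float("inf")):
                dist[neighbor] = new_d
                heapq.heappush(front, (new_d, neighbor))
    return dist

graph = {
    "A": [("B", 2), ("C", 5)],
    "B": [("C", 1), ("D", 4)],
    "C": [("D", 1)],
    "D": [],
}
distances = dijkstra(graph, "A")   # {"A": 0, "B": 2, "C": 3, "D": 4}
```

The re-use is that each node's final distance is computed once and then extended to its neighbors, rather than re-evaluating whole paths.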

Integral images via DP

Suppose we want to compute the sum in D
– At each pixel (x,y), compute the sum in the rectangle [(0,0),(x,y)]
– Gives: A, A+B, A+C, A+B+C+D
– (A+B+C+D) - (A+C) - (A+B) + A = D
– Can compute rectangle sums by same trick
• Row major scan

A B
C D
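
A minimal sketch of the integral image and the four-corner rectangle sum. Padding the table with a zero row and column, and the function names, are implementation choices assumed here to avoid special cases at the image border.

```python
def integral_image(image):
    """Return S with S[y][x] = sum of image over rows < y and columns < x."""
    rows, cols = len(image), len(image[0])
    S = [[0] * (cols + 1) for _ in range(rows + 1)]
    for y in range(rows):              # row-major scan
        for x in range(cols):
            # Same trick as for D: add the two overlapping sums and
            # subtract the doubly counted corner.
            S[y + 1][x + 1] = image[y][x] + S[y][x + 1] + S[y + 1][x] - S[y][x]
    return S

def rect_sum(S, top, left, bottom, right):
    """Sum of image[top..bottom][left..right], inclusive, in constant time:
    (A+B+C+D) - (A+B) - (A+C) + A."""
    return (S[bottom + 1][right + 1] - S[top][right + 1]
            - S[bottom + 1][left] + S[top][left])

image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]
S = integral_image(image)
d_sum = rect_sum(S, 1, 1, 2, 2)   # 5 + 6 + 8 + 9 = 28
```

Building the table takes one pass over the image, after which any rectangle sum costs four lookups regardless of its size.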
