CAP5415 - Computer Vision Lecture 21 - Deformable Part Model (DPM)

Guest Lecturer: Sarfaraz Husein

Motivation
• Object category detection:
– Detect all objects of the same category in an image
• For example, horse detection:

Classification Task

At least one bus

Detection Task

Predicted bounding box should overlap by at least 50% with ground truth!

Detections: "near misses"

Pascal VOC Challenge - The Object Classes

Images retrieved from the Flickr website.

Successful Detection Method

• Joint winner in 2009 Pascal VOC challenge with the Oxford Method.

• Award of "lifetime achievement" in 2010.
• Mixture of deformable part models.
• Each component has global template + deformable parts
– HOG feature templates.

• Fully trained from bounding boxes alone.

(P. Felzenszwalb et al.)

Recap: Deformable Models

Challenge: handling variation of appearance within object classes (non-rigid objects)

Idea: Consider objects as a deformed version of a template!

What is DPM?
• The Deformable Part Model is a discriminatively trained, multi-scale model for object detection. Its training methods aim to make possible the effective use of more latent information, such as hierarchical (grammar) models and models involving latent three-dimensional pose.

DPM
• Definition:
– Root: captures the rough appearance of the object
– Part: captures the local appearance of the object
– Spring: spatial connections between parts

• Displacement:
– Minimize an energy function to find the optimal displacement

[1] Pictorial Structures for Object Recognition, Daniel P. Huttenlocher, 1973
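
A minimal sketch of the displacement search, assuming a precomputed part response map and a quadratic spring cost. The name `best_displacement`, the `radius` search window, and the coefficient tuple `deform` are illustrative assumptions, not taken from the slides. The energy here is the deformation cost minus the appearance response, so a lower value means a better placement.

```python
import numpy as np

def best_displacement(response, anchor, deform, radius=4):
    """Return (energy, (dy, dx)) minimizing deformation cost minus appearance response."""
    ay, ax = anchor
    d1, d2, d3, d4 = deform  # hypothetical linear and quadratic spring coefficients
    height, width = response.shape
    best_energy, best_shift = np.inf, (0, 0)
    # Exhaustive search over a small window around the anchor position.
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            y, x = ay + dy, ax + dx
            if not (0 <= y < height and 0 <= x < width):
                continue  # skip placements that fall outside the response map
            cost = d1 * dx + d2 * dy + d3 * dx * dx + d4 * dy * dy
            energy = cost - response[y, x]
            if energy < best_energy:
                best_energy, best_shift = energy, (dy, dx)
    return float(best_energy), best_shift
```

A strong response two pixels from the anchor wins only if it outweighs the spring cost of moving there; with a weaker response the part stays at its anchor.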

Example: (a) person detection, (b1) coarse template, (b2) part templates, (b3) spatial model

The deformable model includes both a coarse global template covering an entire object and higher resolution part templates. The templates represent histogram of gradient features.

Modeling Spatial Relations with DPM

Spring-based models

[Fischler & Elschlager, '73], [Ramanan et al, '07], [Felzenszwalb et al, '05, '09], [Kumar et al, '09]

Parts Based Model

Vertices - Local Appearance

Edges - Spatial Relationship

Goal: Assign model parts to image regions preserving both local appearance and spatial relationships

Matching Problem

Parts based models - Inference Problem

Inference problem: What is the best scoring assignment f?

[Scoring function: local appearance term + pairwise spatial relationship term]

Inference is NP-hard for general graphs

For trees, belief propagation gives an exact solution in polynomial time
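
For a star-shaped tree (one root connected to every part), exact inference reduces to a per-part maximization for each candidate root location. The sketch below assumes the score of an assignment is the sum of unary (local appearance) terms and pairwise (spatial relationship) terms over a small discrete set of locations. The array layout and the name `star_inference` are hypothetical.

```python
import numpy as np

def star_inference(unary_root, unary_parts, pairwise):
    """Exact max-scoring assignment for a star model.

    unary_root:  shape (L,)      appearance score of the root at each location
    unary_parts: shape (P, L)    appearance score of each part at each location
    pairwise:    shape (P, L, L) pairwise[p, r, l] = spatial score of part p
                                 at location l given the root at location r
    """
    # For each part and root location, add the part's appearance score
    # to the spatial score, then keep the best part location.
    totals = pairwise + unary_parts[:, None, :]        # (P, L, L)
    best_part_score = totals.max(axis=2)               # (P, L)
    best_part_loc = totals.argmax(axis=2)              # (P, L)
    # The root collects the best message from every part.
    root_scores = unary_root + best_part_score.sum(axis=0)
    root = int(root_scores.argmax())
    return float(root_scores[root]), root, best_part_loc[:, root].tolist()
```

With L locations and P parts this costs on the order of P·L² operations, compared with L^(P+1) for enumerating every assignment.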

Parts based models - Learning Problem

Linear models:

[Linear scoring function: local appearance term + pairwise spatial relationship term]

Convex max-margin objective

Positive examples on one side

Negative examples on the other

[Kumar et al,’09]

Learning linear models: Find weight vectors that best separate positive and negative examples. E.g.,
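
As one possible illustration of max-margin learning for a linear model, the sketch below runs batch subgradient descent on the regularized hinge loss. It is not the training procedure from the cited work; the function name `train_linear_svm` and the hyperparameters `C`, `lr`, and `epochs` are assumptions chosen for the example.

```python
import numpy as np

def train_linear_svm(X, y, C=1.0, lr=0.01, epochs=200):
    """Minimize 0.5*||w||^2 + C * sum(max(0, 1 - y_i * (w.x_i + b)))."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)  # labels must be +1 or -1
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(epochs):
        margins = y * (X @ w + b)
        viol = margins < 1  # examples inside the margin or misclassified
        # Subgradient: regularizer pulls w toward zero, violators push it apart.
        grad_w = w - C * (y[viol][:, None] * X[viol]).sum(axis=0)
        grad_b = -C * y[viol].sum()
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b
```

After training, `np.sign(X @ w + b)` places positive examples on one side of the hyperplane and negative examples on the other when the data are separable.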

Finding Motorbike

Human Tracking

Human Pose Estimation

HoG: Histogram of Gradients

Histogram of Gradient
1. Compute the gradient at every pixel
2. Group 8*8 pixels into a cell, and 2*2 cells into a block. Build a histogram for each cell over 9 orientation bins (0°~180°)

Histogram of Gradient
3. A block contains a 4*9 feature vector for local information, and a bounding box that contains n blocks has an n*4*9 feature vector

4. Training by SVM
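
The steps above can be sketched in NumPy as follows. This is a simplified illustration, not a full HOG implementation: it uses hard binning without interpolation, `np.gradient` for the derivatives, and L2 normalization of each block, all of which are assumptions. A block is taken to be 4 cells (2 by 2), which gives the 4*9 = 36 values per block stated above. The function names are hypothetical.

```python
import numpy as np

def cell_histograms(image, cell=8, bins=9):
    """Per-cell histograms of unsigned gradient orientation (0 to 180 degrees)."""
    image = np.asarray(image, dtype=float)
    gy, gx = np.gradient(image)                      # derivatives along rows, columns
    mag = np.hypot(gx, gy)                           # gradient magnitude per pixel
    ang = np.degrees(np.arctan2(gy, gx)) % 180.0     # unsigned orientation
    bin_idx = np.minimum((ang / (180.0 / bins)).astype(int), bins - 1)
    rows, cols = image.shape[0] // cell, image.shape[1] // cell
    hist = np.zeros((rows, cols, bins))
    for r in range(rows):
        for c in range(cols):
            sl = (slice(r * cell, (r + 1) * cell), slice(c * cell, (c + 1) * cell))
            # Each pixel votes for its orientation bin, weighted by magnitude.
            hist[r, c] = np.bincount(bin_idx[sl].ravel(),
                                     weights=mag[sl].ravel(), minlength=bins)
    return hist

def block_descriptor(hist, r, c):
    """Concatenate the 2x2 cells starting at cell (r, c) and L2-normalize."""
    block = hist[r:r + 2, c:c + 2].ravel()           # 4 cells * 9 bins = 36 values
    return block / (np.linalg.norm(block) + 1e-6)
```

Concatenating `block_descriptor` over all n blocks of a window yields the n*4*9 vector that is then passed to the SVM.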

Filters

Object Hypothesis

Training

Learned Models

Training Models
• Latent SVM is used to learn the models from the training images
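
A latent SVM scores an example by maximizing a linear score over latent values (for DPM, the unobserved part placements), then applies the usual hinge loss to that maximized score. The sketch below assumes each example is given as a small matrix of candidate feature vectors, one row per latent value; this representation and the function names are illustrative assumptions.

```python
import numpy as np

def latent_score(w, candidates):
    """Return (max_z w . Phi(x, z), argmax z) over the rows of `candidates`."""
    scores = np.asarray(candidates, dtype=float) @ w
    z = int(np.argmax(scores))
    return float(scores[z]), z

def latent_hinge_objective(w, examples, labels, C=1.0):
    """0.5*||w||^2 + C * sum of hinge losses on the latent-maximized scores."""
    total = 0.5 * float(w @ w)
    for candidates, y in zip(examples, labels):
        score, _ = latent_score(w, candidates)
        total += C * max(0.0, 1.0 - y * score)
    return total
```

Because the score is a maximum over latent values, the objective is convex in `w` for negative examples and becomes convex for positives once their latent values are fixed, which is why training typically alternates between choosing latent values and updating `w`.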

Experiment Result
• PASCAL VOC 2006, 2007, 2008 comp3 challenge datasets
• Some statistics:
– It takes 2 seconds to evaluate a model on one image (4952 images in the test dataset)
– It takes 4 hours to train a model
– MUCH faster than most systems.
– All of the experiments were done on a 2.8 GHz 8-core Intel Xeon Mac Pro computer running Mac OS X 10.5.

Experiment Result
• Measurement: predicted bounding box is correct if it overlaps more than 50 percent with ground truth bounding box; otherwise, considered false positive
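
The overlap in this criterion is measured as intersection over union (IoU) of the two boxes. A minimal sketch, assuming boxes are given as `(x_min, y_min, x_max, y_max)` in continuous coordinates; the function names and the box layout are assumptions for illustration.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes (x_min, y_min, x_max, y_max)."""
    inter_w = max(0.0, min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]))
    inter_h = max(0.0, min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]))
    inter = inter_w * inter_h
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def is_correct_detection(predicted, ground_truth, threshold=0.5):
    """A detection counts as correct when its overlap exceeds the threshold."""
    return iou(predicted, ground_truth) > threshold
```

For example, two 2-by-2 boxes shifted by one unit share half their area but have an IoU of only 1/3, so the detection would count as a false positive.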

Experiment Result

Experiment Result
• Best Average Precision score in 9 out of 20, second in 8

Experiment Result

Slide Credits and References
• Jonathan Huang, Tomasz Malisiewicz, 2009.
• P. Felzenszwalb, D. McAllester, D. Ramanan. A Discriminatively Trained, Multiscale, Deformable Part Model. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008.

• P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan. Object Detection with Discriminatively Trained Part Based Models. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 32, No. 9, Sep. 2010.

• Level Image Representation for Scene Classification and Semantic Feature Sparsification. Proceedings of the Neural Information Processing Systems (NIPS), 2010.
