38
Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team http://www.eng.cam.ac.uk/~cipolla/peop le.html

Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Embed Size (px)

Citation preview

Page 1: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Looking at people and

Image-based LocalisationRoberto Cipolla

Department of Engineering

Research team http://www.eng.cam.ac.uk/~cipolla/people.html

Page 2: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

1. Real-time hand detection and tracking

Page 3: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Why is it hard?

• Highly articulated object, 27 model parameters

• Shape variation and self-occlusions

• Unreliable point features

• Ambiguities in single viewlead to multi-modal distributions (local minima)

Page 4: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Why is it hard?

• Background clutter

• Potentially fast motion

• Lighting changes

• Partial / full occlusion

Page 5: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

A Solved Problem?

3D tracking, 6/7 DOF• Model: 3D quadrics• Cost Function:

Edges or colour-edges • Tracking: Unscented

Kalman filtering• Single or dual view• Single hypothesis

filter, no recovery strategy

Page 6: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

A Robust Tracker

• Should work in scenes with complex backgroundand varying illumination– Important: Cost function design– Optimization strategy

• Should handle multi-modality– Examples: Particle filters, multi-hypotheses filters

• Should have a recovery strategy when track is lost– Trigger search algorithm

Page 7: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

3D Pose Recovery

3D hand model constructed from cones and ellipsoids Contour projection, handling self-occlusions 27 motion parameters

Page 8: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Hierarchy of classifiers

Page 9: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Likelihood : Edges

Edge Detection Projected Contours

Robust EdgeMatching

Input Image 3D Model

Page 10: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Chamfer Matching

Input image Canny edges

Distance transform Projected Contours

Page 11: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Likelihood : Colour

Skin Colour ModelProjected Silhouette

Input Image 3D Model

Template Matching

Page 12: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Tree-based bayesian filtering

Page 13: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Matching Multiple Templates

Use tree structure to efficiently match many templates (>50,000) Arrange templates in tree based on their similarity Traverse tree using breadth-first search, several ‘active’ leaves possible

Search TreeGrid-based partitioning of parameter space

Page 14: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Bayesian-Tree

• The search-tree is brought into a Bayesian framework by adding the prior knowledge from previous frame.

• The Bayesian-Tree can be thought as approximating the posterior probability at different resolutions.

State space partitioning Estimation of posterior pdf

Page 15: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Experiments

Global Motion3D motions limited to hemisphere

Dynamics: First-order Gaussian process

3 level tree with 16,000 templates at leaf level

5 scales, divisions of 15 degrees in 3D rotation and

divisions of 10 degrees in image plane rotation

Translation search at 20, 5, 2-pixel resolution

Page 16: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Tracking Results

Page 17: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Tracking Results

Page 18: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Experiments

Finger Articulation

• Opening and closing of thumb and fingers approximated by 2 parameters

• Global motion restricted to smaller range, but still with 6 DOF

• 35,000 templates at the leaf level

Page 19: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Opening and closing

Page 20: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Hand detection system

Page 21: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Ongoing work

Large number of templates requiredExamples shown here show only constrained motionNumber of templates required for fully articulated motion?

Tracking rates at 5 fps to 0.2 fps For 400 - 35,000 templates (on a 2.4 GHz Pentium IV)

Error introduced by geometric model No palm deformation, no skin deformation, no arm model

Page 22: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Detecting people

Page 23: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

2. Building 3D models of cities

Page 24: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Trumpington Street Data

Page 25: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Camera pose determination

Page 26: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

3D reconstruction

Page 27: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Reconstruction texture mapped

Page 28: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

3. Where am I?

Page 29: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Image-based localisation

......

Page 30: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Image-based localisation

Page 31: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Image-based localisation

……

Page 32: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Image-based localisation

Page 33: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Image-based localisation

Page 34: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Image-based localisation

Page 35: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Image-based localisation

Page 36: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Image-based localisation

Page 37: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Image-based localisation

Page 38: Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team cipolla/people.html

Summary and deliverables

• Realtime hand detection in clutter

• 3D models from uncalibrated images

• Image-based localisation for augmented reality