CS5670: Computer Vision - Cornell University · x C(x, y, d); the disparity space image (DSI) d....

Stereo

CS5670: Computer VisionNoah Snavely

Single image stereogram, by Niklas Een

Announcements

• Project 3 code due this Thursday, 3/28 at 11:59pm

– Artifact due Friday, 3/29 at 11:59pm

Mark Twain at Pool Table", no date, UCR Museum of Photography

Stereo

• Given two images from different viewpoints– How can we compute the depth of each point in the image?– Based on how much each pixel moves between the two images

epipolarlines

Epipolar geometry

(x1, y1) (x2, y1)

x2 -x1 = the disparity of pixel (x1, y1)

Two images captured by a purely horizontal translating camera(rectified stereo pair)

Your basic stereo matching algorithm

• Match Pixels in Conjugate Epipolar Lines

– Assume brightness constancy

– This is a challenging problem

– Hundreds of approaches

• A good survey and evaluation: http://www.middlebury.edu/stereo/

Your basic stereo algorithm

For each epipolar line

For each pixel in the left image

• compare with every pixel on same epipolar line in right image

• pick pixel with minimum match cost

Improvement: match windows

Stereo matching based on SSD

dmin dBest matching disparity

Window size

– Smaller window

– Larger window

W = 3 W = 20

Better results with adaptive window• T. Kanade and M. Okutomi, A Stereo Matching Algorithm

with an Adaptive Window: Theory and Experiment,, Proc. International Conference on Robotics and Automation, 1991.

• D. Scharstein and R. Szeliski. Stereo matching with nonlinear diffusion. International Journal of Computer Vision, 28(2):155-174, July 1998

Effect of window size

Stereo results

– Data from University of Tsukuba

– Similar results on other images without ground truth

Ground truthScene

Results with window search

Window-based matching

(best window size)

Ground truth

Better methods exist...

State of the art methodBoykov et al., Fast Approximate Energy Minimization via Graph Cuts,

International Conference on Computer Vision, September 1999.

Ground truth

For the latest and greatest: http://www.middlebury.edu/stereo/

Stereo as energy minimization

• What defines a good stereo correspondence?1. Match quality

• Want each pixel to find a good match in the other image

2. Smoothness• If two pixels are adjacent, they should (usually) move about

the same amount

• Find disparity map d that minimizes an energy function

• Simple pixel / window matching

SSD distance between windows I(x, y) and J(x + d(x,y), y)=

I(x, y) J(x, y)

y = 141

C(x, y, d); the disparity space image (DSI)x

y = 141

Simple pixel / window matching: choose the minimum of each column in the DSI independently:

Greedy selection of best match

• Better objective function

match cost smoothness cost

Want each pixel to find a good match in the other image

Adjacent pixels should (usually) move about the same amount

match cost:

smoothness cost:

4-connected neighborhood

8-connected neighborhood

: set of neighboring pixels

Smoothness cost

“Potts model”

L1 distance

How do we choose V?

Dynamic programming

• Can minimize this independently per scanline using dynamic programming (DP)

• Basic idea: incrementally build a table of costs D one column at a time

: minimum cost of solution such that d(x,y) = i

Recurrence:

Base case: (L = max disparity)

Dynamic programming

• Finds “smooth”, low-cost path through DPI from left to right

y = 141

Dynamic Programming

Dynamic programming

• Can we apply this trick in 2D as well?

• No: dx,y-1 and dx-1,y may depend on different values of dx-1,y-1

Slide credit: D. Huttenlocher

Stereo as a minimization problem

• The 2D problem has many local minima– Gradient descent doesn’t work well

• And a large search space– n x m image w/ k disparities has knm possible solutions

– Finding the global minimum is NP-hard in general

• Good approximations exist… we’ll see this soon

Questions?

Depth from disparity

x x’

baseline

C C’

Real-time stereo

• Used for robot navigation (and other tasks)– Several real-time stereo techniques have been

developed (most based on simple discrete search)

Nomad robot searches for meteorites in Antarticahttp://www.frc.ri.cmu.edu/projects/meteorobot/index.html

• Camera calibration errors

• Poor image resolution

• Occlusions

• Violations of brightness constancy (specular reflections)

• Large motions

• Low-contrast image regions

Stereo reconstruction pipeline

• Steps– Calibrate cameras– Rectify images– Compute disparity– Estimate depth

What will cause errors?

Active stereo with structured light

• Project “structured” light patterns onto the object– simplifies the correspondence problem– basis for active depth sensors, such as Kinect and iPhone X (using IR)

camera 2

camera 1

projector

camera 1

projector

Li Zhang’s one-shot stereo

Active stereo with structured light

https://ios.gadgethacks.com/news/watch-iphone-xs-30k-ir-dots-scan-your-face-0180944/

Laser scanning

• Optical triangulation– Project a single stripe of laser light

– Scan it across the surface of the object

– This is a very precise version of structured light scanning

Digital Michelangelo Projecthttp://graphics.stanford.edu/projects/mich/

Laser scanned models

The Digital Michelangelo Project, Levoy et al.

Questions?

CS5670: Computer Vision - Cornell University · x C(x, y, d); the disparity space image (DSI) d....

Documents

D D D D X X X X X X X X X X X X GGXX · PDF fileRCRCRC ASASAS KFKF D D D D X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X XX G G GG G G G G Licensed to: Magna Vista High

ZERBITZU GABEKO SUKALDEKO LANGILEEN BEHIN … · esteban lujambio enrique x x d/p etxebarria mendiburu amets x x d/p etxeberria etxeberria amaia x x d/p ezeiza jauregi jon x x d/p

Implementation Completion and Results Report (ICR) Document€¦ · d > k& ked ed^ d ^, d x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x

Castello - Cloudinary · Castello Pergola 10033546 10033548 10033549. 3 D EN TEILE / PARTS x x x x x x x x x x x x x x x. 4 D EN ZUSAMENBAU / ASSEMBLY 1. 5 D EN 2. 6 D EN HERSTELLER

Our rocks reveal the story of change h - Metlink › wp-content › uploads › 2013 › 11 › ... · 0 100 km50 n s w e x x d d x sd x eo x x g os c c c d x c sd d d x d s d d s

LA GESTION CULTURAL EN EL MÃ XICO DEL SIGLO XXI · ^> ' ^d/me h>dhz > e > d y/ k > ^/'>k yy/ u d /k^ w z /'d _ Ë1',&( ,1752'8&&,Ï1 x x x x x x x x x x x x x x x x x x x x x x x

bijlage 1 verkoopsbundel Anemoonproject 1... · /:> ' í r s z e dkkewzk: d î l í ô í skkz^d >>/e' wzk: d x x x x x x x x x x x x x x x x x x x x x x

CS5670: Computer Vision - Cornell University

Strain and Deformation γfast10.vsb.cz/lausova/lesson01.pdf · 5 / 33 Geometrical equations x y MA B C C’ B’ A’ M’ dy d x dx’ v u x u x x u x x u x u x x x x ∂ d d d d

CS5670 : Computer Vision

ZERBITZU GABEKO GARBIKETAKO LANGILEEN …...VICHO LOPEZ DIEGO X X D/P VIDAL CASTRESANA ARKAITZ X X D/P VIDAL CASTRESANA URKO X X D/P VIDAURRE GONZALEZ MARIA ELENA X X D/P VIEIRA GODOY

x o X CD o o o D X o D x D D -o o o O o x D D D D X E s -o x o D x E s x X x o o s -c x o 01 o o D -o o o s x D D X -c D o -o s O D o D D o o s E o o -a D D x x o D X

>/^d WZ /K^ > /E >d de precios... · >/^d wz /k^ > /e >d /e / w ' ^kz/k^ dd e zk u /e z >hd/e/k x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x x í

HEZKUNTZA SAILA DEPARTAMENTO DE EDUCACIÓN …...lemos hoyos jennifer andrea x x d/p lencero galan francisca x x d/p leniz ortuondo amaya x x d/p ... llarena prieto yeray x x d/p llarena

D hZ/ /K ' >> 'K Z E'/&K í í ñ ñ í õ ô D Eh > > : E ZK s ...bibliotecadigital.usb.edu.co/bitstream/10819/3703/... · d > ked e/ k >/^d d > ^ x x x x x x x x x x x x x x x x

h1 > ^dh / Ed W Z > y D E KDW> y/sK - FACULTAD DE ...fci.utm.edu.ec/wp-content/uploads/documentos/guia-del-estudiante... · ï x z d z1^d/ ^ > y d e kdw> y/sk x x x x x x x x x x

Observations and Experiments on the Food Habits of ...D-02 XXX X D-04 XX XX I D-06 X XXX D -08 X X X X X X X D -20 XXXX D-21 XXXX D -27 XXXX D-28 IXXX I X. Food Habits of Aplysia-WINKLERand

CS5670: Computer Vision - Cornell University · CS5670: Computer Vision Noah Snavely, ZhengqiLi Single image stereogram, by NiklasEen. Mark Twain at Pool Table", no date, UCR Museum

I F i gu r e3: Ex s t nC o d, R aT l- k y Sp M m Pa123.g.akamai.net/7/123/11558/abc123/forestservic.download.akamai... · !D!D!9!9!D!D!i!i!i!i!i!D!D!i X X X X X X X X X X X E E E

DPP SMPTE BVE IMF Presentationv0.2wihoutnotes 25 24 localisation processes ... ss dx uu u s d u s d x d u s d s us x s u x d u x d x s d d s x u u x x u x sd x d x s s u u s x u x