47
Computer Vision, CS766 Staff Instructor: Li Zhang [email protected] TA: Yu-Chi Lai [email protected]

Computer Vision, CS766 Staff Instructor: Li Zhang [email protected] TA: Yu-Chi Lai [email protected]

Embed Size (px)

Citation preview

Page 1: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Computer Vision, CS766

Staff

Instructor: Li [email protected]

TA: Yu-Chi [email protected]

Page 2: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Today

Introduction

Administrative Stuff

Overview of the Course

Page 3: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

About Me

• Li Zhang (张力 )– Last name pronounced as Jung

• New Faculty– PhD 2005, U of Washington– Research Scientist 06-07, Columbia U

• Research– Vision and Graphics

• Teaching– CS766 Computer Visoin– CS559 Computer Graphics

Page 4: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Previous Research Focus

• 3D shape reconstruction

Four examples of recovered 3D shapes of a moving face from six video streams

Page 5: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Previous Research Focus

• 3D shape reconstruction

• Application

Licensed by SONY for Games

Used by VA Hospital for Prosthetics

Page 6: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Please tell me about you

Page 7: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Prerequisites

• Prerequisites—these are essential!– Data structures– A good working knowledge of C and C++

programming• (or willingness/time to pick it up quickly!)

– Linear algebra – Vector calculus

• Course does not assume prior imaging experience– no image processing, graphics, etc.

Page 8: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Administrative Stuff

• 4 programming projects – 15%, 2-3 weeks each

• 1 final project – 40%, 5 weeks, open ended of your choosing, but needs – project proposal after 1 week– progress report after 3 weeks– Final presentation after 5 weeks

• Computer account: – Everyone registered in this class will get a Computer

Systems Lab account to do project assignments.

• Email list: – [email protected]

Page 9: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Questions?

Page 10: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Every picture tells a story

Goal of computer vision is to write computer programs that can interpret images

Page 11: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Can computer match human perception?

• Yes and no (but mostly no!)– computers can be better at “easy” things

Page 12: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Can computer match human perception?

• Yes and no (but mostly no!)– computers can be better at “easy” things– humans are much better at “hard” things

Page 13: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Computer Vision vs Human Vision

• Can do amazing things like:

• Recognize people and objects• Navigate through obstacles• Understand mood in the scene• Imagine stories

• But still is not perfect:

• Suffers from Illusions• Ignores many details• Ambiguous description of the world• Doesn’t care about accuracy of world

Srinivasa Narasimhan’s slide

Page 14: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Computer vision vs Human Vision

What we see What a computer sees

Srinivasa Narasimhan’s slide

Page 15: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Components of a computer vision system

Lighting

Scene

Camera

Computer

Scene Interpretation

Srinivasa Narasimhan’s slide

Page 16: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Topics Covered

Page 17: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Cameras and their optics

Today’s Digital Cameras

The Camera Obscura

Srinivasa Narasimhan’s slide

Page 18: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Biological vision

Human Eye Mosquito Eye

Srinivasa Narasimhan’s slide

Page 19: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Project 1: High Dynamic Range Imaging

• Cameras have limited dynamic range

Short Exposure Long Exposure

Desired Image Shree Nayar’s slide

Page 20: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Project 1: High Dynamic Range ImagingLo

w D

ynam

ic R

ange

Exp

osur

es

+

Combination Yields High Dynamic Range

Shree Nayar’s slide

Page 21: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Image Processing

Fourier TransformSampling, Convolution

Image enhancement Feature detection

Srinivasa Narasimhan’s slide

Page 22: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Camera Projection

Page 23: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Image Transformation

Steve Seitz and Chuck Dyer, View Morphing, SIGGRAPH 1996

Page 24: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Project 2: Panoramic Imaging

Input images:

Output Image:

Steve Seitz’s slide

Page 25: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Projective Geometry

Page 26: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Single View Metrology

• https://research.microsoft.com/vision/cambridge/3d/3dart.htm

Page 27: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Single View Metrology

• https://research.microsoft.com/vision/cambridge/3d/3dart.htm

Page 28: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Shading and Photometric Stereo

http://www.eecs.harvard.edu/~zickler/helmholtz.html

Page 29: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Texture Modeling

repeated stochastic

“Semi-stochastic” structures

radishes rocks yogurt

Alexei Efros’ slide

Page 30: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Project 3: Texture Synthesis

Input

Output

Image Quilting, Efros and Freeman., SIGGRAPH 2002.

Page 31: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Project 3: Texture Synthesis

Graphcut Textures, Kwatra et al., SIGGRAPH 2003.

Input images:

Output Image:

Page 32: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Multi-view Geometry

• Binocular Stereo (2 classes)• Multiview Stereo (1 class)• Structure from Motion (2 classes)

http://phototour.cs.washington.edu/

Page 33: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Face Detection and Recognition

Page 34: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Project 4: EigenFaces

Face detection and recognition

Page 35: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Motion Estimation

Hidden Dragon Crouching Tiger

Page 36: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Motion Estimation

Application

Andy Serkis, Gollum, Lord of the Rings

Page 37: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Segmentation

http://www.eecs.berkeley.edu/Research/Projects/CS/vision/bsds/

Page 38: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Segmentation

Application

Medical Image Processing

Page 39: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Matting

Input Matting Composition

Page 40: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Light, Color, and Reflection

Page 41: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Capturing Light Field

Camera Arrays, Graphics Lab, Stanford University

Page 42: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Capturing Light Field

Applications

Page 43: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Structured Light and Ranging Scanning

http://graphics.stanford.edu/projects/mich/

Page 44: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Structured Light and Ranging Scanning

http://graphics.stanford.edu/projects/mich/

Page 45: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Structured Light and Ranging Scanning

http://graphics.stanford.edu/projects/mich/

Page 46: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Novel Cameras and Displays

http://www1.cs.columbia.edu/CAVE/projects/cc.htm

Page 47: Computer Vision, CS766 Staff Instructor: Li Zhang lizhang@cs.wisc.edu TA: Yu-Chi Lai yu-chi@cs.wisc.edu

Course Info

http://www.cs.wisc.edu/~cs766-1/