45
Humans & Machines collaborating on vision Pietro Perona California Institute of Technology NSF Workshop - Frontiers in Vision Cambridge, 23 Aug 2011 Friday, August 26, 2011

Fcv hum mach_perona

  • Upload
    zukun

  • View
    210

  • Download
    0

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: Fcv hum mach_perona

Humans & Machinescollaborating on vision

Pietro PeronaCalifornia Institute of Technology

NSF Workshop - Frontiers in VisionCambridge, 23 Aug 2011

Friday, August 26, 2011

Page 2: Fcv hum mach_perona

“Collaborative vision’’ ?

Pietro PeronaCalifornia Institute of Technology

NSF Workshop - Frontiers in VisionCambridge, 23 Aug 2011

Friday, August 26, 2011

Page 3: Fcv hum mach_perona

Objectives

• Sketch new area of research

• Sampler of initial work

• Drawing lessons

• Brainstorm: potential, way forward

Friday, August 26, 2011

Page 4: Fcv hum mach_perona

Plan

• Define area (10’)

• Presentations (50’): Perona, Geman, Grauman, Berg, Belongie

• Discussion (15’)

Friday, August 26, 2011

Page 5: Fcv hum mach_perona

Definition

Friday, August 26, 2011

Page 6: Fcv hum mach_perona

6

Friday, August 26, 2011

Page 7: Fcv hum mach_perona

?

6

Friday, August 26, 2011

Page 8: Fcv hum mach_perona

7

Friday, August 26, 2011

Page 9: Fcv hum mach_perona

Friday, August 26, 2011

Page 10: Fcv hum mach_perona

Friday, August 26, 2011

Page 11: Fcv hum mach_perona

Friday, August 26, 2011

Page 12: Fcv hum mach_perona

9

Friday, August 26, 2011

Page 13: Fcv hum mach_perona

Lessons:

• Visual queries

• Easy for humans

• Difficult for machines

• Much information is available on line

• Pictures are digital dark matter

• Experts not providing visual knowledge

10

Friday, August 26, 2011

Page 14: Fcv hum mach_perona

Unsupervised learning?

[Fergus et al., CVPR03] 11

Friday, August 26, 2011

Page 15: Fcv hum mach_perona

Unsupervised learning?

[Fergus et al., CVPR03] 11

Friday, August 26, 2011

Page 16: Fcv hum mach_perona

12

Friday, August 26, 2011

Page 17: Fcv hum mach_perona

Friday, August 26, 2011

Page 18: Fcv hum mach_perona

Throat

Friday, August 26, 2011

Page 19: Fcv hum mach_perona

Throat

Friday, August 26, 2011

Page 20: Fcv hum mach_perona

Visual knowledge

Task-oriented (practitioners)Categorical (experts) 14

Friday, August 26, 2011

Page 21: Fcv hum mach_perona

Annotators Automata

ExpertsShared

knowledgeUsers

World

Querie

s

Answ

ers

Education

Models

ObservationObserv

ation

Science,expertise

Imageannotations

Machine visionscientists15

Friday, August 26, 2011

Page 22: Fcv hum mach_perona

Annotators Automata

ExpertsShared

knowledgeUsers

World

Querie

s

Answ

ers

Education

Models

ObservationObserv

ation

Science,expertise

Imageannotations

Machine visionscientists15

Friday, August 26, 2011

Page 23: Fcv hum mach_perona

Annotators Automata

ExpertsShared

knowledgeUsers

World

Querie

s

Answ

ers

Education

Models

ObservationObserv

ation

Science,expertise

Imageannotations

Machine visionscientists15

Friday, August 26, 2011

Page 24: Fcv hum mach_perona

Annotators Automata

ExpertsShared

knowledgeUsers

World

Querie

s

Answ

ers

Education

Models

ObservationObserv

ation

Science,expertise

Imageannotations

Machine visionscientists15

Friday, August 26, 2011

Page 25: Fcv hum mach_perona

Some progress...

Friday, August 26, 2011

Page 26: Fcv hum mach_perona

Waterbirds

Mallard American Black Duck

Canada Goose Red Necked Grebe Clutter

DUCKS

Friday, August 26, 2011

Page 27: Fcv hum mach_perona

x1i

x2i

xi = (x1i , x

2i )

p(xi | zi = 1)

p(xi | zi = 0)

Multidimensional signals and annotators

Friday, August 26, 2011

Page 28: Fcv hum mach_perona

x1i

x2i

xi = (x1i , x

2i )

p(xi | zi = 1)

p(xi | zi = 0)

Multidimensional signals and annotators

Friday, August 26, 2011

Page 29: Fcv hum mach_perona

x1i

x2i

xi = (x1i , x

2i )

p(xi | zi = 1)

p(xi | zi = 0)

Multidimensional signals and annotators

wj = (w1j , w

2j )

τj

Friday, August 26, 2011

Page 30: Fcv hum mach_perona

lijxi

N

M

ij

σj

yij

θz

zi

Ji

βwj τj

γα

images

annotators

labels |Lij |

Full model

[Welinder et al., NIPS2010]Friday, August 26, 2011

Page 31: Fcv hum mach_perona

Is there a duck in the image?

x1

i

x2

i

Friday, August 26, 2011

Page 32: Fcv hum mach_perona

Is there a duck in the image?

x1

i

x2

i

Friday, August 26, 2011

Page 33: Fcv hum mach_perona

Is there a duck in the image?

x1

i

x2

i

Friday, August 26, 2011

Page 34: Fcv hum mach_perona

Is there a duck in the image?

x1

i

x2

i

Friday, August 26, 2011

Page 35: Fcv hum mach_perona

Is there a duck in the image?

x1

i

x2

i

Friday, August 26, 2011

Page 36: Fcv hum mach_perona

Is there a duck in the image?

x1

i

x2

i

Friday, August 26, 2011

Page 37: Fcv hum mach_perona

Is there a duck in the image?

x1

i

x2

i

Friday, August 26, 2011

Page 38: Fcv hum mach_perona

Concluding...

Friday, August 26, 2011

Page 39: Fcv hum mach_perona

Aut

omat

ion

Performance

100%

100%

100%

0%

Collaborative vision

Friday, August 26, 2011

Page 40: Fcv hum mach_perona

Aut

omat

ion

Performance

100%

100%

100%

0%

Collaborative vision

Friday, August 26, 2011

Page 41: Fcv hum mach_perona

Aut

omat

ion

Performance

100%

100%

100%

0%

Collaborative vision

Friday, August 26, 2011

Page 42: Fcv hum mach_perona

Aut

omat

ion

Performance

100%

100%

100%

0%

Collaborative vision

Friday, August 26, 2011

Page 43: Fcv hum mach_perona

Aut

omat

ion

Performance

100%

100%

100%

0%

Collaborative vision

+ApplicationsTraining data

-ComplexityCost

Friday, August 26, 2011

Page 44: Fcv hum mach_perona

Annotators Automata

ExpertsShared

knowledgeUsers

World

Querie

s

Answ

ers

Education

Models

ObservationObserv

ation

Science,expertise

Imageannotations

Machine visionscientists24

Friday, August 26, 2011

Page 45: Fcv hum mach_perona

New research directions• Incremental learning

• Models of human vision, decision, attention

• Systems composed of machines and humans

• Performance bounds (humans, machines)

• Representations (human-machine-friendly)

• Extracting visual knowledge from experts

Friday, August 26, 2011