57
CAP 6412 Advanced Computer Vision Website: http://www.cs.ucf.edu/~bgong/CAP6412.html Jan 14, 2016

CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

  • Upload
    others

  • View
    10

  • Download
    0

Embed Size (px)

Citation preview

Page 1: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

CAP6412AdvancedComputerVision

Website:http://www.cs.ucf.edu/~bgong/CAP6412.html

Jan14,2016

Page 2: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Today

• Administrivia• Neuralnetworks&backpropagation (PartI)• FundamentalsofConvolutionalNeuralNetworks(CNN),byFareeha

Page 3: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Webcourse vs.Coursehomepage

• Webcourse:https://webcourses.ucf.edu/

• Announcements• CheckyourUCFemail!

• Homeworksubmission

• Coursehomepage:http://www.cs.ucf.edu/~bgong/CAP6412.html

• Alltheothers• Lecturenotes,papers,linkstoresources,syllabus,etc.• Bookmarkandcheckregularly

Page 4: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Topicsyouhavechosen

01234567

Page 5: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

TentativescheduleWeek2 CNNvisualization&objectrecognition

Week3 CNN&objectlocalization

Week4 CNN &transferlearning

Week5 CNN &segmentation,super-resolution

Week6 CNN&videos(opticalflow,pose)

Week7 Imagecaptioning&attentionmodel

Week8 Visualquestionanswering

Week9 Attentionmodel,aligningbookswithmovies

Week10--16 Video:tracking,action,surveillanceHuman-centered CV3DCVLow-levelCV,etc.

Page 6: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Nextweek:CNNvisualizatin &objectrecognition

Tuesday(01/19)

[ILSVRC] Russakovsky, Olga, Jia Deng, Hao Su, Jonathan Krause,Sanjeev Satheesh, Sean Ma, Zhiheng Huang et al. “Imagenet largescale visual recognition challenge.” International Journal of ComputerVision (2014): 1-42.[152 layers] He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun.“Deep Residual Learning for Image Recognition.” arXiv preprintarXiv:1512.03385 (2015).

Thursday(01/21)

[Visualization] Zeiler, Matthew D., and Rob Fergus. “Visualizing andunderstanding convolutional networks.” In Computer Vision–ECCV2014, pp. 818-833. Springer International Publishing, 2014.Zhou, Bolei, Aditya Khosla, Agata Lapedriza, Aude Oliva, and AntonioTorralba. “Object detectors emerge in deep scene cnns.” arXivpreprint arXiv:1412.6856 (2014).

Page 7: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Link willbesenttoyourUCFemails

Page 8: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Today

• Administrivia• Neuralnetworks&backpropagation (PartI)• FundamentalsofConvolutionalNeuralNetworks(CNN),byFareeha

Page 9: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Biologicalneurons

• Humanbrainshasabout10billionnuerons• Eachconnectedto10Kotherneurons• Aneuronfiresifthesumofelectrochemicalinputsexceedssomethreshold

Imagecredit:cs.stanford.edu/people/eroberts

Page 10: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Artificialneurons--- perceptrons

• IntroducedbyRosenblattin1958• Thebasicbuildingblocks for(notall)neuralnetworks

Imagecredit:www.hiit.fi/u/ahonkela/dippa/node41.html

y = '(nX

i=1

wixi + b)

= '(wTx+ b)

'(·) : activation function

Page 11: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Popularactivationfunctions

-10 -5 0 5 10-1

-0.5

0

0.5

1Binary step

-10 -5 0 5 10-1

-0.5

0

0.5

1Logistic

-10 -5 0 5 10-1

-0.5

0

0.5

1TanH

-10 -5 0 5 100

2

4

6

8

10Rectified Linear Unit (ReLU)

'(x) =

(0 if x < 0

1 if x � 0'(x) =

1

1 + exp(�x)

'(x) = tanh(x)

=

exp(x)� exp(�x)

exp(x) + exp(�x)

'(x) =

(0 if x < 0

x if x � 0

Page 12: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Artificialneurons--- perceptrons

• SupportVectorMachines• Logisticregression• AND• OR• NOT• XOR?

• Linearregression

Imagecredit:www.hiit.fi/u/ahonkela/dippa/node41.html

Page 13: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Buildingneuralnetworksfromperceptrons

• NextTuesday

Page 14: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Today

• Administrivia• Neuralnetworks&backpropagation (PartI)• FundamentalsofConvolutionalNeuralNetworks(CNN),byFareeha

Page 15: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Convolutional Neural

Networks

Fareeha Irfan

Page 16: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Outline ❏ Background❏ Applications: Convnets for object recognition and language❏ How to design convolutional layers❏ How to design pooling layers❏ How to integrate back-propagation in Convnets❏ How to build convnets in torch❏ AlexNet

Page 17: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Background❏ Complex classification tasks❏ Object Recognition in Images:

❏ grayscale: 32 x 32 = 1024 pixels

❏ rbg: 32 x 32 x 3 = 3072 pixels

❏ Fully-connected NN becomes computationally intensive

Algorithm that mimics the brain..

● Neural connections● Neurons activated during learning

Page 18: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Convnet Applications● Image/Object Recognition: Can predict who is in the image, what pose are they

in.● Natural Language Processing: Predict sentiments about sentences to classify

tweets. Extract summaries by finding sentences that are most predictive.● Drug Discovery: Predicting the interaction between molecules and biological

proteins can be used to identify potential treatments.

Some Common Libraries:

● Caffe : Supports both CPU & GPU. Developed in C++● Torch framework: Written in C● Cuda-convnet: Implementation in CUDA

Page 19: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

A Simple Neural Network

Activation Functions:

● Sigmoid ● Hyperbolic● Tangent● ReLU (Rectified Linear Unit)

Page 20: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Neural Network

1

Layer 1Layer 2

Layer 3

2

Page 21: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Convnet Overview

Page 22: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Neural Network Layer 1 (C1)parameters:

(32*32+1)*(28*28+1)*6= 4827750

ConvNetLayer 1 (C1)parameters:(5*5+1)*6

= 156

Page 23: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Convolutional Layery : Output of the convolutionx : Map with K channelsK′ : Total filters, generating a K′ dimensional map y

Page 24: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Back-propagation

Page 25: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Back-propagation for Conv Layer

Page 26: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Pooling Layer

A pooling operator operates on individual feature channels, coalescing nearby feature values into one by the application of a suitable operator.

Common choices include max-pooling (using the max operator) or sum-pooling (using summation).

Max-pooling is defined as:

Page 27: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Pooling Layer

Page 28: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Convnet

Page 29: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 30: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 31: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 32: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 33: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 34: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 35: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 36: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 37: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 38: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 39: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

● 60 million parameters ● 650,000 neurons● 5 convolutional layers ( followed by

max-pooling layers)● 3 fully-connected layers with ● a 1000-way softmax final layer

Page 40: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 41: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 42: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 43: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 44: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 45: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 46: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 47: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 48: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 49: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 50: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Reduces the top-1 error rate by over 1%

Page 51: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 52: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 53: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

TrainingUsing stochastic gradient descent and the backpropagation algorithm (repeated application of the chain rule)

Start with some initialized weights

Optimize so the correct label is predicted

Propagate errors back, and update weights to take a small step in the direction that minimizes the error

http://image-net.org/challenges/LSVRC/2012/supervision.pdf

Page 54: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 55: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)

Stochastic Gradient Descent Learning

Page 56: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)
Page 57: CAP 6412 Advanced Computer Vision - CS Departmentbgong/CAP6412/lec2.pdfHuman-centered CV 3D CV Low-level CV, etc. Next week: CNN visualizatin & object recognition Tuesday (01/19)