63
Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute, Kolkata, India

Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

  • Upload
    others

  • View
    17

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Deep Learning in Computer Vision

Bikash SantraSenior Research Fellow

Electronics and Communication Sciences Unit

Indian Statistical Institute, Kolkata, India

Page 2: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

What Computers “See”

Page 3: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Images Are Numbers

Page 4: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Images Are Numbers

Page 5: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Images Are Numbers

Page 6: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

What is Image Processing?

1. Photometric Transformations

a) Sharpening

b) Smoothing

c) Contrast enhancement

d) Stretching

2. Geometric Transformations

1. Scaling

2. Rotations

3. Translation

3. Image Compression

4. And many more

Page 7: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

What is Computer Vision?

1. Emulates human vision

2. Goal is to understand

images and its contents

Page 8: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Tasks in Computer Vision

Feature

Extraction

Page 9: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

High Level Feature Detection

Page 10: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Manual Feature Extraction

Page 11: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Manual Feature Extraction

Page 12: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Alexander Amini, MIT

Question 1

Page 13: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Question 2

Page 14: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Yes, we can! Using deep learning

Page 15: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Neural NetworksDeep

Page 16: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Neural Networks: Architectures

Slide Credit: Fei-Fei Li et al., Stanford University

Page 17: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Alexander Amini, MIT

Example Problem

Page 18: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Alexander Amini, MIT

Example Problem: Will I pass this class?

Page 19: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Alexander Amini, MIT

Example Problem: Will I pass this class?

Page 20: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Alexander Amini, MIT

Example Problem: Will I pass this class?

Page 21: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Alexander Amini, MIT

Example Problem: Will I pass this class?

Page 22: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Alexander Amini, MIT

Quantifying Loss

Page 23: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Alexander Amini, MIT

Empirical Loss

Page 24: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Alexander Amini, MIT

Binary Cross Entropy Loss

Page 25: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Alexander Amini, MIT

Mean Squared Error Loss

Page 26: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Alexander Amini, MIT

Mean Squared Error Loss

Page 27: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Alexander Amini, MIT

Loss Optimization

Solved using

gradient descent

Page 28: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Convolutional Neural Network

Page 29: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Abin - Roozgard

Introduction

Convolutional neural networks

Signal processing, Image processing

improvement over the multilayer perceptron

performance, accuracy and some degree of

invariance to distortions in the input images

Page 30: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

FCNN for Image Processing

Page 31: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Using Spatial Structure

Page 32: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Using Spatial Structure

Page 33: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Applying Filters to Extract Features

Page 34: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Feature Extraction with Convolution

Page 35: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Fei-Fei Li et al., Stanford

Convolutional Neural Networks

CNN is evolved basically to deal with images.

Page 36: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Components of CNN

Page 37: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 38: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 39: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 40: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 41: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 42: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 43: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 44: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 45: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 46: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 47: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 48: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 49: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 50: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 51: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 52: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 53: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 54: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Image Source: http://cs231n.github.io/convolutional-networks/

Convolution

Operation

Demo

Page 55: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Fei-Fei Li et al., Stanford

Convolution Layer

Page 56: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

Introducing Non-Linearity

Page 57: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Fei-Fei Li et al., Stanford

Pooling Layer

Page 58: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Fei-Fei Li et al., Stanford

Pooling Layer

Page 59: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

CNNs for Classification: Feature Learning

Page 60: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

CNNs for Classification: Class Probabilities

Page 61: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Slide Credit: Ava Soleimany, MIT

CNNs: Training with Backpropagation

Page 62: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Deep Learning Resources

Name Language Link Note

Pylearn2 Python http://deeplearning.net/software/pylearn2/ A machine learning library built on Theano

Theano Python http://deeplearning.net/software/theano/ A python deep learning library

Keras Python https://keras.io/ A python deep learning library

Caffe C++ http://caffe.berkeleyvision.org/ A deep learning framework by Berkeley

Torch Lua http://torch.ch/ An open source machine learning framework

Overfeat Lua http://cilvr.nyu.edu/doku.php?id=code:start A convolutional network image processor

Deeplearning4j Java http://deeplearning4j.org/ A commercial grade deep learning library

Word2vec C https://code.google.com/p/word2vec/ Word embedding framework

GloVe C http://nlp.stanford.edu/projects/glove/ Word embedding framework

Doc2vec Chttps://radimrehurek.com/gensim/models/d

oc2vec.htmlLanguage model for paragraphs and documents

StanfordNLP Java http://nlp.stanford.edu/ A deep learning-based NLP package

TensorFlow Python http://www.tensorflow.org A deep learning based python library

PyTorch Python https://pytorch.org/ A deep learning based python library

Page 63: Deep Learning in Computer Vision · Deep Learning in Computer Vision Bikash Santra Senior Research Fellow Electronics and Communication Sciences Unit Indian Statistical Institute,

Thank you