59
Deep Neural Networks: Part I What are DNN and Deep Learning Yuan-Kai Wang, 2016 Web site of this course: http://pattern-recognition.weebly.com source: Deep learning: a birds-eye view, by R. Pieters, 2015.

02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Deep Neural Networks:

Part I What are DNN and Deep Learning

Yuan-KaiWang, 2016

Web site of this course: http://pattern-recognition.weebly.com source: Deep learning: a birds-eye view, by R. Pieters, 2015.

Page 2: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

2

Machine Learning vs. Deep Learning

Page 3: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Representation learning Attempts to automatically learn

good features or representations

Deep learning Attempt to learn multiple levels of representation of increasing

complexity/abstraction

Deep Learning vs. Representation Learning

3

Page 4: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Data

ProgramOutput

Data ProgramOutput

Machine Learning:

(labels)(“weights”/model)

Computer

Computer

4

Traditional Programming vs. Machine Learning

Traditional Programming:

Page 5: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Machine Learning

• Most machine learning methods work well because of human-designed/hand-engineered features (representations)

machine learning -> optimising weights to best make a final prediction

5

Page 6: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Neural NetTypical ML Regression

Deep Learning: Why?

Page 7: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Problem definition

Raw data

Preprocessed data

Hypothesis

Knowledge/Predictive model

One pa此of thedata mlnlngprocess

• Each step generates many questions :

Data generation : data types,samplesize,online/offline ...

Preprocessing: normalization,missing values,featureselection/extraction ...

Machine learning: hypothesis,choice of learningparadigm/algorithm ...

Hypothesis validation: cross-validation,model deploymen t...

Machine Learning → Deep Learning

Page 8: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

DEEP NET (DEEP LEARNING)

Machine Learning → Deep Learning

Page 9: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

1 neuron

2 + 2 neurons

10 +10 neurons

Machine Learning → Deep LearningDeep Learning: Why?

Page 10: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Deep Learning is everywhere…

Page 11: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Google

XBOX ! BRINCMEA PIE!

Page 12: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Deep Learning in the News

Page 13: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

(source: Google Trends)

19

Page 14: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

(source: arXiv bookworm, Culturomics)

ScientificArticles:

Page 15: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts
Page 16: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts
Page 17: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts
Page 18: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Why Now?

18

• Inspired by the architectural depth of the brain, researchers wanted for decades to train deep multi-layer neural networks.

No successful attempts were reported before 2006… Exception:convolutional neural networks, LeCun 1998

SVM: Vapnik and his co-workers developed the Support Vector Machine (1993)(shallow architecture).

Breakthrough in 2006!

Page 19: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Renewed Interest: 1990s

19

But…• Neural network learning is very slow and

inefficient• Researchers turn to SVMs, random

forests, ...

Neural networks can• Learn multiple layers with "back propagation"• Learn any function "theoretically"

Page 20: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

2006 Breakthrough

20

• More data

• Faster hardware: GPU’s, multi-core CPU’s

• Working ideas on how to train deep architectures

Page 21: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

2006 Breakthrough

21

• More data

• Faster hardware: GPU’s, multi-core CPU’s

• Working ideas on how to train deep architectures

Page 22: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts
Page 23: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

2006 Breakthrough

Growth of datasets

23

The Big Data Era

Page 24: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

2006 Breakthrough

24

• More data

• Faster hardware: GPU’s, multi-core CPU’s

• Working ideas on how to train deep architectures

Page 25: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

2006 Breakthrough

32

vs

Page 26: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

7#

Page 27: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

2006 Breakthrough

27

• More data

• Faster hardware: GPU’s, multi-core CPU’s

• Working ideas on how to train deep architectures

Page 28: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

2006 Breakthrough

28

Stacked Restricted Boltzman Machines* (RBM)Hinton, G. E, Osindero, S., and Teh, Y. W. (2006).A fast learning algorithm for deep belief nets. Neural Computation, 18:1527-1554.

Stacked Autoencoders (AE)

Bengio, Y., Lamblin, P., Popovici, P., Larochelle, H. (2007). Greedy Layer-Wise Training of Deep Networks,Advances in Neural Information Processing Systems 19

* Called Deep Belief Networks (DBN)

Page 29: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Deep Learning for the Win!

Page 30: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

• 1.2M images with 1000 object categories

• AlexNet of uni Toronto: 15% error rate vs 26% for 2th placed (traditional CV)

Impact on Computer Vision

ImageNet Challenge 2012

Page 31: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

(from Clarifai)

Impact on Computer Vision

Page 32: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts
Page 33: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Classification results on ImageNet 2012

Detection results

source: Szegedy et al. Going deeper with convolutions (GoogLeNet ), ILSVRC2014, 19 Sep 2014 33

Team Year Place Error (top-5) Uses external data

SuperVision 2012 - 16.4% no

SuperVision 2012 1st 15.3% ImageNet 22k

Clarifai 2013 - 11.7% no

Clarifai 2013 1st 11.2% ImageNet 22k

MSRA 2014 3rd 7.35% no

VGG 2014 2nd 7.32% no

GoogLeNet 2014 1st 6.67% no

Team Year Place mAP e x t e r n a ldata

ensemble c o n t e x t u a lmodel

approach

UvA-Euvision 2013 1st 22.6% none ? yes F i s h e r vectors

Deep Insight 2014 3rd 40.5% I L S V R C 1 2Classification+ Localization

3 models yes ConvNet

C U H KDeepID-Net

2014 2nd 40.7% I L S V R C 1 2Classification+ Localization

? no ConvNet

GoogLeNet 2014 1st 43.9% I L S V R C 1 2Classification 6 models no ConvNet

Page 34: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

ConvolutionPoolingSoftmax Other

Winners of:Large Scale Visual Recognition Challenge 2014

(ILSVRC2014)19 September 2014

GoogLeNet

source: Szegedy et al. Going deeper with convolutions (GoogLeNet ), ILSVRC2014, 19 Sep 2014 34

Page 35: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Inception256

source: Szegedy et al. Going deeper with convolutions (GoogLeNet ), ILSVRC2014, 19 Sep 2014 35

Width of inception modules ranges from 256 filters (in early modules) to 1024 in top inception modules.

Can remove fully connected layers on top completely

Number of parameters is reduced to 5 million

480 480512 512 512

832 832 1024

Computional cost is increased by less than 2X compared to Krizhevsky’s network. (<1.5Bn operations/evaluation)

Page 36: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Impact on Computer Vision

Latest State of the Art:

Page 37: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Computer Vision: Current State of theArt

Page 38: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Impact on Audio Processing

First public Breakthrough with Deep Learning in 2010

Dahl et al. (2010)

38

Page 39: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Impact on Audio Processing

First public Breakthrough with Deep Learning in 2010

Dahl et al. (2010)

-33%! -32%!

39

Page 40: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Impact on Audio ProcessingSpeech Recognition

40

Page 41: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Impact on Audio ProcessingTIMITSpeech Recognition

(from: Clarifai)41

Page 42: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts
Page 43: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Pos: Toutanova et al. 2003)Ner: Ando & Zhang 2005

C&W 2011

C&W 2011

Impact on Natural Language Processing

Page 44: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Impact on Natural Language Processing

Named Entity Recognition:

Page 45: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Deep Learning: Who’s to blame?

Page 46: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts
Page 47: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Deep Architectures can be representationally efficient• Fewer computational units for same function

Deep Representations might allow for a hierarchy or representation

Allows non-local generalisation Comprehensibility

Multiple levels of latent variables allow combinatorial sharing of statistical strength

Deep Learning: Why?

47

Page 48: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Biological Justification

Deep Learning = Brain “inspired”Audio/Visual Cortex has multiple stages == Hierarchical

Page 49: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Different Levels ofAbstraction

57

Page 50: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Hierarchical Learning• • Natural progression

from low level to high level structure as seen in natural complexity

• A good lower level representation can be used for many distinct tasks

Different Levels ofAbstraction

Feature Representation

50

Page 51: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts
Page 52: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts
Page 53: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts
Page 54: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

HierarchiesDistributed

EfficientSharin

g Generalization

Unsupervised*

Major PWNAGE!

Black Box

Training Time

Much Data

Why go Deep ?

65

Page 55: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

No More Handcrafted Features !

66

Page 56: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

[Kudos to Richard Socher, for this eloquent summary :) ]

• Manually designed features are often over-specified, incomplete and take a long time to design and validateLearned Features are easy to adapt, fast to learn•

• Deep learning provides a very flexible, (almost?) universal, learnable framework for representing world, visual and linguistic information.

• Deep learning can learn unsupervised (from raw text/audio/ images/whatever content) and supervised (with specific labels like positive/negative)

Why Deep Learning ?

Page 57: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

Deep Learning: Future Developments

57

Currently an explosion of developments••••

• Hessian-Free networks (2010)• Long Short Term Memory (2011)• Large Convolutional nets, max-pooling (2011)• Nesterov’s Gradient Descent (2013)

Currently state of the art but...•••

No way of doing logical inference (extrapolation) No easy integration of abstract knowledge Hypothetic space bias might not conform with reality

Page 58: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

• TensorFlow – Google Open Source.https://www.tensorflow.org/• A playground for neural network

• Theano - CPU/GPU symbolic expression compiler in python (from LISA lab at University of Montreal). http://deeplearning.net/software/theano/

• Torch -Matlab-like environment for state-of-the-art machine learning algorithms in lua (from Ronan Collobert, Clement Farabet and Koray Kavukcuoglu) http://torch.ch/

• More info: http://deeplearning.net/software links/

Wanna Play ? General Deep Learning

58

Page 59: 02-Tutorial15-Deep Learning-a (non-techy) birds-eye view [已修復]pattern-recognition.weebly.com/uploads/4/0/1/0/40109209/... · 2019. 8. 1. · Representation learning Attempts

• cuda-convnet2 (Alex Krizhevsky, Toronto) (c++/ CUDA, optimized for GTX 580) https://code.google.com/p/cuda-convnet2/

• Caffe (Berkeley) (Cuda/OpenCL,Theano, Python) http://caffe.berkeleyvision.org/

• OverFeat (NYU)• http://cilvr.nyu.edu/doku.php?id=code:start

Wanna Play ? Computer Vision

59