Introduction to Deep Learning
Convolutional Neural Networks (1)
Prof. Songhwai Oh, ECE, SNU
Prof. Songhwai Oh (ECE, SNU) Introduction to Deep Learning 1
ALEXNET
Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton. "ImageNet classification with deep convolutional neural networks." NIPS, 2012.
ImageNet Large‐Scale Visual Recognition Challenge, 2012
Tasks:
• Decide whether a given image contains a particular type of object or not. For example, a contestant might decide that there are cars in this image but no tigers.
• Find a particular object and draw a box around it. For example, a contestant might decide that there is a screwdriver at a certain position with a width of 50 pixels and a height of 30 pixels.

Dataset:
• 1,000 different categories
• Over 1 million images
• Training set: 456,567 images
Year    Winning Error Rate
2010    28.2%
2011    25.8%
2012    16.4% (2nd place: 25.2%)
2013    11.2%
2014    6.7%
2015    3.57%
Human   about 5.1%
ImageNet Large Scale Visual Recognition Challenge. Russakovsky et al. arXiv preprint arXiv:1409.0575. URL: http://arxiv.org/abs/1409.0575v1
ImageNet Dataset
Source: https://cs.stanford.edu/people/karpathy/cnnembed/cnn_embed_full_1k.jpg
AlexNet on ImageNet
Architecture
• 5 convolutional layers, 3 fully-connected layers

Key ideas:
• Rectified Linear Unit (ReLU): an activation function
• GPU implementation (2 GPUs)
• Local response normalization, overlapping pooling
• Data augmentation, dropout

[Figure: learned 11x11x3 first-layer filters]
2D Convolution

[Figure: a 3x3 filter K slides across a 7x7 image I one position at a time; each placement of K produces one entry of the 5x5 output, K * I = Output.]
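The sliding-window operation in the figure can be sketched in NumPy (a minimal illustration, not the course's code; the filter values here are an arbitrary averaging kernel chosen for the example):

```python
import numpy as np

def conv2d(image, kernel):
    """Naive 2D convolution (cross-correlation, as used in CNNs):
    slide the kernel over every valid position and take the dot product."""
    kh, kw = kernel.shape
    oh, ow = image.shape[0] - kh + 1, image.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(image[i:i+kh, j:j+kw] * kernel)
    return out

I = np.arange(49, dtype=float).reshape(7, 7)  # 7x7 image
K = np.ones((3, 3)) / 9                       # 3x3 averaging filter
print(conv2d(I, K).shape)                     # (5, 5)
```

With a 7x7 image and a 3x3 filter there are exactly 5 valid horizontal and 5 valid vertical placements, which is why the output is 5x5.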
RGB Image Convolution

Convolving a 32x32x3 image (a tensor) with a single 5x5x3 filter produces a 28x28x1 feature map.
Applying four 5x5x3 filters to the same 32x32x3 image produces a 28x28x4 feature map: one 28x28 channel per filter.
Convolutional Neural Network

Stacking convolutional layers:
32x32x3 input → CONV (4 5x5x3 filters) + ReLU → 28x28x4 → CONV (10 5x5x4 filters) + ReLU → 24x24x10

ReLU: Rectified Linear Unit
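The two-layer stack above can be sketched in NumPy to verify the shapes (a minimal illustration with random filter weights, not trained parameters):

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def conv_relu(x, filters):
    """Valid convolution of an H x W x C input with filters of shape
    (F, k, k, C), followed by ReLU."""
    k = filters.shape[1]
    # patches: (H-k+1, W-k+1, C, k, k), one k x k window per output position
    patches = sliding_window_view(x, (k, k), axis=(0, 1))
    out = np.einsum('hwcij,fijc->hwf', patches, filters)
    return np.maximum(out, 0)  # ReLU

rng = np.random.default_rng(0)
x = rng.standard_normal((32, 32, 3))
conv1 = rng.standard_normal((4, 5, 5, 3))    # 4 filters of size 5x5x3
conv2 = rng.standard_normal((10, 5, 5, 4))   # 10 filters of size 5x5x4

h1 = conv_relu(x, conv1)
h2 = conv_relu(h1, conv2)
print(h1.shape, h2.shape)  # (28, 28, 4) (24, 24, 10)
```

Note how the channel depth of each layer's filters (3, then 4) must match the depth of its input, while the number of filters sets the depth of its output.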
Stride

With a 7x7 input and a 3x3 filter:
• stride 1 ⇒ 5x5 output
• stride 2 ⇒ 3x3 output
Output Size
• N = input size
• F = filter size
• S = stride
• Output size = (N − F) / S + 1
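The formula can be checked against the stride examples above (a small helper written for this illustration):

```python
def conv_output_size(n, f, s):
    """Output size of a valid convolution: (N - F) / S + 1."""
    assert (n - f) % s == 0, "filter placements do not tile the input evenly"
    return (n - f) // s + 1

print(conv_output_size(7, 3, 1))  # 5  (stride 1)
print(conv_output_size(7, 3, 2))  # 3  (stride 2)
```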
Zero Padding

7x7 input, zero-padded with a 1-pixel border, 3x3 filter ⇒ 7x7 output.
Output Size
• N = input size
• F = filter size
• S = stride
• P = padding size
• Output size = (N + 2P − F) / S + 1
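Extending the helper with padding reproduces the zero-padding example, and as an extra check, AlexNet's first layer (input commonly cited as 227x227, 11x11 filters, stride 4) gives the expected 55x55 output:

```python
def conv_output_size(n, f, s, p=0):
    """Output size with padding: (N + 2P - F) / S + 1."""
    return (n + 2 * p - f) // s + 1

# 7x7 input, 3x3 filter, stride 1, 1-pixel zero padding -> output stays 7x7
print(conv_output_size(7, 3, 1, p=1))   # 7
# AlexNet conv1: 227x227 input, 11x11 filter, stride 4 -> 55x55
print(conv_output_size(227, 11, 4))     # 55
```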
1x1 Convolution

A 256x256x128 feature map convolved with 32 1x1x128 filters yields a 256x256x32 output.
• Dimension (channel) reduction
• Same spatial output size (H x W)
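A 1x1 convolution is just a linear map over channels applied independently at every pixel, which can be sketched with a single einsum (random weights, for illustration only):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal((256, 256, 128))   # H x W x 128 feature map
w = rng.standard_normal((32, 128))         # 32 filters, each of size 1x1x128

# Mixes channels (128 -> 32) while leaving the spatial size H x W unchanged.
y = np.einsum('hwc,fc->hwf', x, w)
print(y.shape)  # (256, 256, 32)
```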
ReLU Activation
• Preserves desirable properties of linear models:
– easy to optimize with gradient descent
– good generalization
– large and consistent gradients for positive inputs
• Mitigates the vanishing gradient problem
Other Activation Functions
Sigmoid/Logistic: σ(x) = 1 / (1 + e^(−x))
tanh (hyperbolic tangent): f(x) = tanh(x)
Leaky ReLU: f(x) = max(αx, x), with a small slope α (e.g., 0.01)
maxout: f(x) = max(w₁ᵀx + b₁, w₂ᵀx + b₂)
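These activations are one-liners in NumPy (a sketch; the 0.01 Leaky ReLU slope is one common default, not the only choice):

```python
import numpy as np

def relu(x):
    return np.maximum(0, x)

def leaky_relu(x, alpha=0.01):
    # Passes positive values through; scales negative values by alpha.
    return np.maximum(alpha * x, x)

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

# tanh is available directly as np.tanh

x = np.array([-2.0, 0.0, 3.0])
print(relu(x))        # [0. 0. 3.]
print(leaky_relu(x))  # [-0.02  0.    3.  ]
print(sigmoid(0.0))   # 0.5
```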
Pooling

Max pooling with a 2x2 filter, stride 2:

1 2 1 0
5 0 0 3        5  3
8 0 0 5   ⇒    8  5
0 2 2 0

Average pooling with a 2x2 filter, stride 2:

1 2 1 0
5 0 0 3        2    1
8 0 0 5   ⇒    2.5  1.75
0 2 2 0

• Pooling makes features invariant to small local translations of the input
• Dimension reduction
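Both pooling variants above can be reproduced with one small function (written for this illustration, using the non-overlapping 2x2/stride-2 case from the slide):

```python
import numpy as np

def pool2d(x, size=2, stride=2, mode='max'):
    """Pool each size x size window of a 2D array (max or average)."""
    oh = (x.shape[0] - size) // stride + 1
    ow = (x.shape[1] - size) // stride + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            win = x[i*stride:i*stride+size, j*stride:j*stride+size]
            out[i, j] = win.max() if mode == 'max' else win.mean()
    return out

x = np.array([[1, 2, 1, 0],
              [5, 0, 0, 3],
              [8, 0, 0, 5],
              [0, 2, 2, 0]], dtype=float)
print(pool2d(x, mode='max'))  # [[5. 3.] [8. 5.]]
print(pool2d(x, mode='avg'))  # [[2. 1.] [2.5 1.75]]
```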