
CS376 Computer Vision Lecture 20: Deep Learning Basics

Qixing Huang

April 10th 2019

Slide material from: http://cs231n.stanford.edu/slides/2018/cs231n_2018_lecture04.pdf

http://cs231n.stanford.edu/slides/2018/cs231n_2018_lecture05.pdf

Today’s Topic

• Neural Networks

– Activation functions

– Fully connected layers

– Convolutional neural networks

• Optimization

• Back-propagation

Neural Networks

From linear to multi-layer

Activation functions
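As a reference for this slide, here is a minimal NumPy sketch of three standard activation functions (sigmoid, tanh, ReLU); the function names and the use of NumPy are illustrative, not tied to any particular framework.

```python
import numpy as np

def sigmoid(x):
    # squashes inputs to (0, 1); saturates for large |x|
    return 1.0 / (1.0 + np.exp(-x))

def tanh(x):
    # zero-centered squashing to (-1, 1)
    return np.tanh(x)

def relu(x):
    # max(0, x); the default choice in modern ConvNets
    return np.maximum(0.0, x)
```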

Neural Networks: Architectures

Summary

• We arrange neurons into fully-connected layers

• The abstraction of a layer has the nice property that it allows us to use efficient vectorized code (e.g. matrix multiplies); see the sketch after this summary

• Neural networks are not really neural
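To make the vectorization point above concrete, here is a minimal sketch (in NumPy, with made-up layer sizes) of a 2-layer fully-connected network whose forward pass is just two matrix multiplies with a nonlinearity in between.

```python
import numpy as np

# hypothetical sizes: 3072-dim input, 100 hidden units, 10 classes
W1 = np.random.randn(100, 3072) * 0.01
b1 = np.zeros(100)
W2 = np.random.randn(10, 100) * 0.01
b2 = np.zeros(10)

def forward(x):
    # first fully-connected layer followed by a ReLU nonlinearity
    h = np.maximum(0.0, W1 @ x + b1)
    # second fully-connected layer produces the class scores
    return W2 @ h + b2

scores = forward(np.random.randn(3072))  # vector of 10 class scores
```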

Convolutional Neural Network

Convolutional Neural Networks

• A bit of history

Recall fully connected layers

Convolution layer

ConvNet

• ConvNet is a sequence of Convolution Layers, interspersed with activation functions
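As an illustration of this "sequence of convolution layers interspersed with activation functions", here is a small PyTorch sketch; the filter counts and the 32x32 input size are arbitrary choices for the example.

```python
import torch
import torch.nn as nn

# hypothetical ConvNet: CONV -> ReLU repeated, acting on 32x32 RGB images
convnet = nn.Sequential(
    nn.Conv2d(3, 6, kernel_size=5),    # 32x32x3 -> 28x28x6
    nn.ReLU(),
    nn.Conv2d(6, 10, kernel_size=5),   # 28x28x6 -> 24x24x10
    nn.ReLU(),
    nn.Conv2d(10, 16, kernel_size=5),  # 24x24x10 -> 20x20x16
    nn.ReLU(),
)

x = torch.randn(1, 3, 32, 32)   # a batch containing one image
print(convnet(x).shape)         # torch.Size([1, 16, 20, 20])
```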

A typical ConvNet

A closer look at spatial dimensions

Activations

Output size

Padding is necessary

Padding

Note that

An example

General form of a Conv layer
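The spatial arithmetic in the preceding slides comes down to one formula: for input width W, filter size F, stride S, and zero-padding P, the output width is (W - F + 2P)/S + 1. A tiny sketch with two worked examples, using the common 32x32 input and 5x5 filter as illustrative values:

```python
def conv_output_size(W, F, S=1, P=0):
    """Spatial output size of a conv (or pooling) layer."""
    assert (W - F + 2 * P) % S == 0, "filter does not tile the input evenly"
    return (W - F + 2 * P) // S + 1

print(conv_output_size(32, 5))        # 28: no padding shrinks the map
print(conv_output_size(32, 5, P=2))   # 32: padding with (F-1)/2 preserves the size
```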

An extreme case

The brain/neuron view of CONV layer

Reminder: Fully Connected Layer

Pooling layer

• makes the representations smaller and more manageable

• operates over each activation map independently

Max pooling
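A minimal sketch of max pooling in PyTorch, using the common 2x2 window with stride 2; the activation volume shape is a made-up example. Each activation map is downsampled independently, so the depth is unchanged.

```python
import torch
import torch.nn as nn

pool = nn.MaxPool2d(kernel_size=2, stride=2)

x = torch.randn(1, 64, 224, 224)  # hypothetical activation volume
y = pool(x)
print(y.shape)  # torch.Size([1, 64, 112, 112]): spatial size halved, depth unchanged
```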

General form

Fully Connected Layer (FC Layer)

• Contains neurons that connect to the entire input volume, as in ordinary Neural Networks
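To see what "connect to the entire input volume" means for parameter counts, here is a back-of-the-envelope sketch with illustrative sizes: one fully-connected neuron on a 32x32x3 volume needs a weight for every input value, whereas a 5x5x3 conv filter only looks at a local region.

```python
# hypothetical input volume: 32 x 32 x 3
fc_weights_per_neuron = 32 * 32 * 3     # 3072: one weight per input value
conv_weights_per_filter = 5 * 5 * 3     # 75: local 5x5 receptive field across depth
print(fc_weights_per_neuron, conv_weights_per_filter)
```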

Summary

• ConvNets stack CONV, POOL, FC layers

• Trend towards smaller filters and deeper architectures

• Trend towards getting rid of POOL/FC layers (just CONV)

• Typical architectures look like [(CONV-RELU)*N-POOL?]*M-(FC-RELU)*K, SOFTMAX, where N is usually up to ~5, M is large, and 0 <= K <= 2 (a concrete sketch follows this list)

– but recent advances such as ResNet/GoogLeNet challenge this paradigm
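A hypothetical instantiation of the [(CONV-RELU)*N-POOL?]*M-(FC-RELU)*K, SOFTMAX pattern with N=2, M=2, K=1, sketched in PyTorch; all filter counts and the 32x32x3 input assumption are illustrative.

```python
import torch.nn as nn

# [(CONV-RELU)*2 - POOL] * 2 - (FC-RELU) - FC; the softmax is applied by the loss
model = nn.Sequential(
    nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
    nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(),
    nn.MaxPool2d(2),                      # 32x32 -> 16x16
    nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
    nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(),
    nn.MaxPool2d(2),                      # 16x16 -> 8x8
    nn.Flatten(),
    nn.Linear(64 * 8 * 8, 128), nn.ReLU(),
    nn.Linear(128, 10),                   # class scores; nn.CrossEntropyLoss adds the softmax
)
```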

Optimization

Let us go back to the linear case

Optimization

Gradient descent
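A minimal runnable sketch of vanilla gradient descent, here minimizing a toy least-squares loss for a linear model with NumPy; the data, step size, and iteration count are illustrative choices, not values from the lecture.

```python
import numpy as np

# toy data for a linear model: y is approximately X @ w_true
X = np.random.randn(100, 3)
y = X @ np.array([1.0, -2.0, 0.5])

w = np.zeros(3)        # initial weights
step_size = 0.1        # a.k.a. learning rate

for _ in range(200):
    grad = 2.0 / len(X) * X.T @ (X @ w - y)  # gradient of the mean squared error
    w -= step_size * grad                    # step opposite the gradient

print(w)  # approaches [1.0, -2.0, 0.5]
```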

Computational graphs

Back-propagation

A toy example

Back-propagation

• recursive application of the chain rule along a computational graph to compute the gradients of all inputs / parameters / intermediates

• implementations maintain a graph structure, where the nodes implement the forward() / backward() API (sketched after this list)

• forward: compute result of an operation and save any intermediates needed for gradient computation in memory

• backward: apply the chain rule to compute the gradient of the loss function with respect to the inputs
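A minimal sketch of the forward()/backward() node API described above, using a multiply gate in the spirit of the toy examples; the class and method names are illustrative, not a specific library's API.

```python
class MultiplyGate:
    """One node in a computational graph: z = x * y."""

    def forward(self, x, y):
        # compute the result and cache the inputs needed for the backward pass
        self.x, self.y = x, y
        return x * y

    def backward(self, dz):
        # chain rule: dL/dx = dL/dz * dz/dx = dz * y, and symmetrically for y
        dx = dz * self.y
        dy = dz * self.x
        return dx, dy

gate = MultiplyGate()
z = gate.forward(3.0, -4.0)    # forward pass: z = -12
dx, dy = gate.backward(1.0)    # backward pass with upstream gradient 1
print(z, dx, dy)               # -12.0 -4.0 3.0
```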
