Understanding RNN and LSTM


RNN and LSTM (Oct 12, 2016)

YANG Jiancheng

Outline

• I. Vanilla RNN

• II. LSTM

• III. GRU and Other Structures

• I. Vanilla RNN

In theory, RNNs are absolutely capable of handling such “long-term dependencies.” A human could carefully pick parameters for them to solve toy problems of this form. Sadly, in practice, RNNs don’t seem to be able to learn them. 

GREAT Intro: Understanding LSTM Networks

• I. Vanilla RNN

WILDML has a series of articles introducing RNNs (4 articles, 2 GitHub repos).
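
To make the recurrence concrete, here is a minimal sketch of a vanilla RNN forward pass in NumPy. It is not from the slides; the weight names (Wxh, Whh, Why) and the layer sizes are illustrative:

import numpy as np

# Minimal vanilla RNN forward pass. Weight names and sizes are illustrative.
def rnn_forward(xs, h0, Wxh, Whh, Why, bh, by):
    """xs: sequence of input vectors; h0: initial hidden state."""
    h, hs, ys = h0, [], []
    for x in xs:
        h = np.tanh(Wxh @ x + Whh @ h + bh)   # h_t = tanh(Wxh x_t + Whh h_{t-1} + bh)
        hs.append(h)
        ys.append(Why @ h + by)               # y_t = Why h_t + by
    return hs, ys

# Usage with random weights: input size 3, hidden size 4, output size 2.
rng = np.random.default_rng(0)
Wxh, Whh, Why = rng.normal(size=(4, 3)), rng.normal(size=(4, 4)), rng.normal(size=(2, 4))
bh, by = np.zeros(4), np.zeros(2)
xs = [rng.normal(size=3) for _ in range(5)]
hs, ys = rnn_forward(xs, np.zeros(4), Wxh, Whh, Why, bh, by)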

• I. Vanilla RNN • Back Prop Through Time (BPTT)
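
A sketch of BPTT for the vanilla RNN above, under a simplifying assumption of mine (not from the slides): a squared-error loss applied only to the final output. The weight names follow the earlier sketch:

import numpy as np

# Back-propagation through time for the vanilla RNN above.
# Simplifying assumption: squared-error loss on the final output only.
def bptt(xs, target, h0, Wxh, Whh, Why, bh, by):
    # forward pass, caching every hidden state
    hs = [h0]
    for x in xs:
        hs.append(np.tanh(Wxh @ x + Whh @ hs[-1] + bh))
    y = Why @ hs[-1] + by
    # backward pass, walking the time steps in reverse
    dy = y - target                          # dL/dy for 0.5 * ||y - target||^2
    dWhy = np.outer(dy, hs[-1])
    dh = Why.T @ dy                          # gradient entering the last hidden state
    dWxh, dWhh = np.zeros_like(Wxh), np.zeros_like(Whh)
    for t in reversed(range(len(xs))):
        draw = (1.0 - hs[t + 1] ** 2) * dh   # back through the tanh nonlinearity
        dWxh += np.outer(draw, xs[t])
        dWhh += np.outer(draw, hs[t])
        dh = Whh.T @ draw                    # pass gradient to the previous time step
    return dWxh, dWhh, dWhy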

• I. Vanilla RNN • Gradient Vanishing Problem

tanh and its derivative. Source: http://nn.readthedocs.org/en/rtd/transfer/

Unrolled through time, RNNs tend to be very deep, so gradients are repeatedly multiplied by tanh derivatives smaller than 1 and tend to vanish.
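
A toy demonstration of why that depth hurts: the backward-flowing gradient is multiplied once per time step by Whh transposed and by the tanh derivative (at most 1), so with modest recurrent weights its norm shrinks roughly geometrically. The sizes, scale, and step count below are made up for the illustration:

import numpy as np

# Toy vanishing-gradient demonstration (weights, sizes and the 0.25 scale are made up).
rng = np.random.default_rng(1)
Whh = rng.normal(scale=0.25, size=(4, 4))    # small recurrent weights
hs = [rng.normal(size=4)]
for _ in range(50):                          # forward: unroll 50 time steps
    hs.append(np.tanh(Whh @ hs[-1]))
dh = np.ones(4)                              # gradient injected at the last step
for t in reversed(range(50)):                # backward through time
    dh = Whh.T @ ((1.0 - hs[t + 1] ** 2) * dh)
    if t % 10 == 0:
        print(f"step {t:2d}: ||dh|| = {np.linalg.norm(dh):.2e}")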

• II. LSTM • Differences between LSTM and Vanilla RNN

• II. LSTM • Core Idea Behind LSTMs

• Cell state
• Gates

• II. LSTM • Step-by-Step Walk Through

Each gate is a sigmoid layer, so its activations lie in the range 0 ~ 1: a value of 0 blocks information completely, a value of 1 lets it all through.
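
The walk-through can be summarised in a few lines of NumPy. This is a sketch of one LSTM step following the standard equations from Understanding LSTM Networks; the dict-of-weights layout and the gate names are my own convention, not the slides':

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# One LSTM step. W and b are dicts of weight matrices and biases keyed by gate name.
def lstm_step(x, h_prev, c_prev, W, b):
    z = np.concatenate([h_prev, x])           # concatenated [h_{t-1}, x_t]
    f = sigmoid(W["f"] @ z + b["f"])          # forget gate, values in 0 ~ 1
    i = sigmoid(W["i"] @ z + b["i"])          # input gate, values in 0 ~ 1
    c_tilde = np.tanh(W["c"] @ z + b["c"])    # candidate cell-state values
    c = f * c_prev + i * c_tilde              # keep part of the old cell state, add part of the new
    o = sigmoid(W["o"] @ z + b["o"])          # output gate, values in 0 ~ 1
    h = o * np.tanh(c)                        # new hidden state
    return h, c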

• III. GRU and Other Structures • Gated Recurrent Unit (GRU)

• Combines the forget and input gates into a single "update gate"
• Merges the cell state and hidden state
• Other changes (see the sketch below)
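
A minimal sketch of one GRU step, with illustrative weight names, showing the single update gate and the merged hidden/cell state described above:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# One GRU step. The update gate z plays the role of the LSTM's forget and
# input gates, and there is no separate cell state: everything lives in h.
def gru_step(x, h_prev, W, b):
    v = np.concatenate([h_prev, x])
    z = sigmoid(W["z"] @ v + b["z"])                              # update gate
    r = sigmoid(W["r"] @ v + b["r"])                              # reset gate
    h_tilde = np.tanh(W["h"] @ np.concatenate([r * h_prev, x]) + b["h"])
    return (1.0 - z) * h_prev + z * h_tilde                       # merged state update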

• III. GRU and Other Structures • Variants on Long Short-Term Memory

Greff et al. (2015) do a nice comparison of popular variants, finding that they're all about the same.

Thanks for listening!
