
Layer-wise Asynchronous Training of Neural Network with Synthetic Gradient

Xupeng Tong, Hao Wang, Ning Dong, Griffin Adams

CONTENTS

BACKGROUND

INTRODUCTION

SYNCHRONOUS TRAINING

ASYNCHRONOUS TRAINING

INSIGHTS

Part One: BACKGROUND

1 BACKGROUND

Asynchronous SGD

Downpour SGD

Dean, Jeffrey, et al. "Large scale distributed deep networks." Advances in neural information processing systems. 2012.
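To make the Downpour idea concrete, here is a minimal single-process sketch in which Python threads stand in for workers and a shared parameter server holds the global weights. The class and function names (ParameterServer, worker_loop) are illustrative assumptions, not code from Dean et al.

```python
# Minimal sketch of Downpour-style asynchronous SGD: threads stand in for
# workers, each training on its own data shard against a shared parameter server.
import threading
import numpy as np

class ParameterServer:
    """Holds the global parameters; workers fetch stale copies and push
    gradient updates whenever they finish a step."""
    def __init__(self, dim, lr=0.1):
        self.w = np.zeros(dim)
        self.lr = lr
        self._lock = threading.Lock()

    def fetch(self):
        return self.w.copy()              # possibly stale, which async SGD tolerates

    def push(self, grad):
        with self._lock:
            self.w -= self.lr * grad      # apply the (possibly stale) gradient

def worker_loop(server, X, y, steps=200):
    """One worker: repeatedly fetch parameters, compute a stochastic gradient
    on its shard, and push the update without waiting for other workers."""
    for _ in range(steps):
        w = server.fetch()
        i = np.random.randint(len(X))
        grad = (X[i] @ w - y[i]) * X[i]   # gradient of 0.5 * (x.w - y)^2
        server.push(grad)

# Toy usage: linear regression with the data sharded across 4 workers.
rng = np.random.default_rng(0)
X = rng.normal(size=(400, 5))
y = X @ np.arange(5.0)
server = ParameterServer(dim=5)
threads = [threading.Thread(target=worker_loop, args=(server, X[s], y[s]))
           for s in np.array_split(np.arange(400), 4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
```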

Part Two: INTRODUCTION

2-1 OVERVIEW

Jaderberg, Max, et al. "Decoupled neural interfaces using synthetic gradients." arXiv preprint arXiv:1608.05343 (2016).
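As a rough sketch of the decoupled-neural-interface idea from Jaderberg et al.: each layer gets a small auxiliary module M that predicts the gradient of the loss with respect to the layer's activation h, so the layer can update immediately instead of waiting for the true backward pass. The linear form of both the layer and M below, and all names, are simplifying assumptions for illustration.

```python
# Illustrative sketch of a decoupled layer with a synthetic gradient module M.
import numpy as np

class DecoupledLayer:
    def __init__(self, d_in, d_out, lr=0.01, rng=None):
        rng = rng or np.random.default_rng(0)
        self.W = rng.normal(scale=0.1, size=(d_in, d_out))  # layer weights
        self.M = np.zeros((d_out, d_out))                    # synthetic gradient module
        self.lr = lr

    def forward(self, x):
        self.x = x
        self.h = x @ self.W          # layer activation h
        return self.h

    def update_with_synthetic_gradient(self):
        """Update immediately using the predicted gradient delta_hat = h @ M,
        without waiting for the true gradient from the layers above."""
        delta_hat = self.h @ self.M
        self.W -= self.lr * (self.x.T @ delta_hat)
        return delta_hat

    def train_M(self, delta_true):
        """When the true gradient delta eventually arrives, regress M toward it
        by minimizing ||delta - delta_hat||^2 (the loss L_M on the next slides)."""
        delta_hat = self.h @ self.M
        grad_M = self.h.T @ (delta_hat - delta_true)
        self.M -= self.lr * grad_M

# Toy usage: one batch flows forward, the layer updates from delta_hat right away,
# and M is refined later once the true gradient becomes available.
layer = DecoupledLayer(d_in=16, d_out=8)
h = layer.forward(np.random.default_rng(1).normal(size=(4, 16)))
delta_hat = layer.update_with_synthetic_gradient()
layer.train_M(delta_true=np.zeros_like(delta_hat))   # placeholder true gradient
```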

2-2

Part Three: SYNCHRONOUS TRAINING

3-1

$$L_{\mathcal{M}} = \sum_i \left\| \delta_i - \hat{\delta}_i \right\|_2^2$$

$$L_{\mathcal{I}} = \sum_i \left\| h_i - \hat{h}_i \right\|_2^2$$

The synthetic input and synthetic gradient losses are minimized simultaneously with the overall network loss:

$$L(w_1, b_1, \dots, w_i, b_i, \dots, w_n, b_n) = L(W, B)$$
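One way to read "minimized simultaneously" is as a single joint objective; the weighting coefficients $\lambda_{\mathcal{M}}$ and $\lambda_{\mathcal{I}}$ below are an illustrative assumption, not something stated on the slide:

$$L_{\text{total}} = L(W, B) + \lambda_{\mathcal{M}} \sum_i \left\| \delta_i - \hat{\delta}_i \right\|_2^2 + \lambda_{\mathcal{I}} \sum_i \left\| h_i - \hat{h}_i \right\|_2^2$$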

Part Four: ASYNCHRONOUS TRAINING

4-1 ONE POSSIBLE ARCHITECTURE

4-1 INFRASTRUCTURE - BASIC

4-2 INFRASTRUCTURE - LIGHT

4-2 SLAVE

4-3 MASTER
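A hypothetical sketch of how the master/slave split could be wired: each slave thread owns one layer and its synthetic gradient module, updates the layer locally as soon as an activation arrives, and forwards the activation downstream; the master only creates the queues and feeds input batches. The names and the queue-based protocol are assumptions, not the presenters' implementation.

```python
# Hypothetical master/slave sketch for layer-wise asynchronous training.
import queue
import threading
import numpy as np

def slave(W, M, inbox, outbox, n_batches, lr=0.01):
    """One slave owns one layer (weights W) and its synthetic gradient module M.
    It updates W locally from the predicted gradient delta_hat = h @ M, then forwards h."""
    for _ in range(n_batches):
        x = inbox.get()                 # activation from the upstream slave
        h = x @ W                       # forward pass of this layer
        delta_hat = h @ M               # synthetic gradient for h
        W -= lr * (x.T @ delta_hat)     # local update, no waiting on backprop
        outbox.put(h)                   # hand the activation downstream

def master(layer_shapes, data):
    """Create one queue per link, one thread per slave, and feed input batches."""
    rng = np.random.default_rng(0)
    queues = [queue.Queue() for _ in range(len(layer_shapes) + 1)]
    workers = []
    for i, (d_in, d_out) in enumerate(layer_shapes):
        W = rng.normal(scale=0.1, size=(d_in, d_out))
        M = rng.normal(scale=0.01, size=(d_out, d_out))  # stand-in module; would be
        workers.append(threading.Thread(                 # trained against true gradients
            target=slave, args=(W, M, queues[i], queues[i + 1], len(data))))
    for w in workers:
        w.start()
    for x in data:
        queues[0].put(x)                # feed the first slave
    for w in workers:
        w.join()
    return [queues[-1].get() for _ in range(len(data))]  # last-layer outputs, in order

# Toy usage: three layers, four batches of 8 examples with 16 features.
outputs = master([(16, 32), (32, 32), (32, 10)],
                 [np.random.default_rng(i).normal(size=(8, 16)) for i in range(4)])
```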

Part Five: INSIGHTS

5-1

• Synthetic gradients and synthetic inputs serve as a new alternative to batch normalization / dropout

• The M and I auxiliary networks introduce noise into the input/gradient of each layer

• No exact update is required!

• The model learns to reduce the variance within each batch while keeping the flavor of that specific batch

Thank You
