
Rigging The Lottery: Making All Tickets Winners (RigL)

Efficient and accurate training for sparse networks.

Utku Evci, Trevor Gale, Jacob Menick, Pablo Samuel Castro, Erich Elsen

Google AI Residency, Brain Montreal

go/brain-montreal

Sparse networks and their advantages

Motivation


Sparse Networks

➔ On-device training/inference: reduces FLOPs and network size drastically without harming performance.

➔ Reduced memory footprint: we can fit wider/deeper networks in memory and get better performance from the same hardware.

➔ Architecture search: causal relationships? Interpretability?

Figure: a dense network (sparsity 0%) next to a sparse network (sparsity 60%).

Efficient Neural Audio Synthesis. Kalchbrenner, N., Elsen, E., Simonyan, K., Noury, S., Casagrande, N., Lockhart, E., Stimberg, F., Oord, A., Dieleman, S., and Kavukcuoglu, K., 2018.

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers. Zhuohan Li, Eric Wallace, Sheng Shen, Kevin Lin, Kurt Keutzer, Dan Klein, Joseph E. Gonzalez, 2020.


Sparse networks perform better for the same parameter count.

Accelerating sparsity is difficult, but possible.

➔ Block Sparse Kernels: Efficient sparse operations.

➔ Efficient WaveRNN: Text-to-Speech

➔ Optimizing Speech Recognition for the Edge: Speech Recognition

➔ Fast Sparse Convnets: Fast mobile inference for vision models with sparsity (1.3 - 2.4x faster).

….and more to come.


➔ Pruning methods require dense training: (1) this limits the biggest sparse network we can train; (2) it is not efficient.

➔ Training from scratch performs much worse.

➔ Lottery* initialization doesn't help.

Figure: test accuracy of ResNet-50 networks trained on the ImageNet-2012 dataset at different sparsity levels*.

How do we find sparse networks?

* The Difficulty of Training Sparse Neural Networks. Utku Evci, Fabian Pedregosa, Aidan Gomez, Erich Elsen, 2019.


* The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks. Jonathan Frankle, Michael Carbin, ICLR 2019.

Can we train sparse neural networks end-to-end?

(without ever needing the dense parameterization)

(as good as the dense-to-sparse methods, and even surpassing pruning performance)

YES. RigL!

Rigging The Lottery: Making All Tickets Winners

The Algorithm

➔ Start from a random sparse network.

➔ Train the sparse network.

➔ Every N steps, update the connectivity (see the sketch below):
◆ Drop the least-magnitude connections.
◆ Grow new ones using gradient information.
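Below is a minimal NumPy sketch of this per-layer connectivity update, assuming gradients are available for the inactive connections as well at update time. The function and argument names are ours for illustration, not the paper's implementation (the official code is linked at the end of the deck).

```python
import numpy as np

def rigl_update(weights, mask, grads, drop_fraction=0.3):
    """One RigL-style connectivity update for a single layer (illustrative sketch).

    weights, grads: dense arrays with the layer's parameters and their gradients
                    (gradients taken w.r.t. all connections, active or not).
    mask:           boolean array, True where a connection is currently active.
    drop_fraction:  fraction of active connections to replace in this update.
    """
    flat_w, flat_g = weights.ravel(), grads.ravel()
    old_mask = mask.ravel()
    new_mask = old_mask.copy()
    k = int(drop_fraction * old_mask.sum())

    # Drop: deactivate the k active connections with the smallest weight magnitude.
    active = np.flatnonzero(old_mask)
    dropped = active[np.argsort(np.abs(flat_w[active]))[:k]]
    new_mask[dropped] = False

    # Grow: activate the k connections with the largest gradient magnitude among
    # those that were inactive before this update (just-dropped ones excluded).
    inactive = np.flatnonzero(~old_mask)
    grown = inactive[np.argsort(-np.abs(flat_g[inactive]))[:k]]
    new_mask[grown] = True

    # Newly grown connections start at zero; dropped connections are zeroed out.
    new_w = np.where(new_mask, flat_w, 0.0)
    new_w[grown] = 0.0

    return new_w.reshape(weights.shape), new_mask.reshape(mask.shape)
```

Growing only among connections that were inactive before the drop keeps the just-dropped weights out of the grow candidates, and initializing grown connections to zero means they do not change the network's output at the moment they are added.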

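The fraction of connections replaced is not constant: the paper anneals it with a cosine schedule and stops updating connectivity after a chosen step. A small self-contained sketch of that schedule follows; the values of alpha, t_end, and delta_t here are illustrative defaults, not the settings of any particular experiment.

```python
import math

def update_fraction(step, alpha=0.3, t_end=75_000):
    """Cosine-annealed fraction of active connections to replace at `step`.

    alpha (initial fraction) and t_end (step after which connectivity is
    frozen) are illustrative defaults.
    """
    if step >= t_end:
        return 0.0  # connectivity updates stop; training continues on a fixed mask
    return 0.5 * alpha * (1.0 + math.cos(math.pi * step / t_end))

# Connectivity is only touched every delta_t steps; in between, training is
# ordinary SGD on the currently active weights.
delta_t = 100
for step in (0, 25_000, 50_000, 74_900, 80_000):
    if step % delta_t == 0:
        print(step, round(update_fraction(step), 4))
```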

Evaluation of Connections

Figure: evaluation of first-layer connections during MNIST MLP training (panels from before training onward).

ResNet-50: RigL matches pruning performance using significantly fewer resources.

Figure: ResNet-50 results comparing RigL, Static, Small-Dense, Prune, SET, and SNFS.

Exceeding Pruning Performance

➔ RigL:
◆ outperforms pruning.

➔ ERK sparsity distribution:
◆ greater performance,
◆ more FLOPs.

14x fewer FLOPs and parameters.

Figure: the 80% ERK sparsity distribution on ResNet-50 across stages 1-5, and a comparison of RigL (ERK), RigL, Static, Small-Dense, and Prune.
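As a rough sketch of how an ERK (Erdős-Rényi-Kernel) distribution assigns per-layer densities: each layer's density scales with the sum of its dimensions divided by its parameter count, and a global scale is chosen so the overall sparsity matches the target. The helper below is our simplification; a full implementation also re-solves the scale whenever a layer's density gets clipped at 1.

```python
import numpy as np

def erk_densities(layer_shapes, target_density):
    """Per-layer densities under an ERK-style scaling (simplified sketch).

    layer_shapes:   weight shapes, e.g. (k_h, k_w, c_in, c_out) for conv layers
                    or (n_in, n_out) for fully connected layers.
    target_density: desired overall fraction of non-zero weights, e.g. 0.2.
    """
    n_params = np.array([float(np.prod(s)) for s in layer_shapes])
    # ERK score: (sum of dimensions) / (product of dimensions) -- layers with
    # many parameters relative to their width end up sparser.
    scores = np.array([sum(s) / float(np.prod(s)) for s in layer_shapes])
    # Global scale so the expected number of kept parameters matches the target.
    scale = target_density * n_params.sum() / (scores * n_params).sum()
    return np.clip(scale * scores, 0.0, 1.0)

# Example: a small conv net; layers with few parameters stay denser.
shapes = [(3, 3, 3, 16), (3, 3, 16, 32), (3, 3, 32, 64), (1024, 10)]
print(erk_densities(shapes, target_density=0.2))
```

Under this scaling, layers with few parameters stay relatively dense while the largest layers are sparsified most aggressively.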

Sparse MobileNets

At the same parameter/FLOP count: 4.3% absolute improvement in top-1 accuracy.


➔ MobileNets are difficult to prune.

➔ Much better results with RigL.

Character Level Language Modelling on WikiText-103


➔ Similar to WaveRNN.

➔ RigL falls short of matching the pruning performance.

Bad Local Minima and RigL

Static training gets stuck in a suboptimal basin. RigL helps it escape.

More results in our workshop paper: https://arxiv.org/pdf/1906.10732.pdf

Summary of Contributions

- Training randomly initialized sparse networks without needing the dense parametrization is possible.

- Sparse networks found by RigL can exceed the performance of pruning.

- Non-uniform sparsity distribution like ERK brings better performance.

- RigL can help us with feature selection.


Limitations

- RigL requires more iterations to converge.

- Dynamic sparse training methods seem to perform suboptimally when training RNNs.

- Sparse kernels that can utilize sparsity during training are not widely available (yet).

Thank you!

● Sparse networks are promising.

● End-to-end sparse training is possible and has the potential to replace dense-to-sparse training.

https://github.com/google-research/rigl

@eriche @psc @jmenick @tgale @evcu

