



Learning to Teach with Dynamic Loss Functions

1Lijun Wu, 2Fei Tian, 2Yingce Xia, 3Yang Fan, 2Tao Qin, 1Jianhuang Lai and 2Tie-Yan Liu

1 Sun Yat-sen University 2 Microsoft Research 3 University of Science and Technology of China

Contact: [email protected], [email protected]


1. Machine Learning

Conventional machine learning training keeps three ingredients fixed:

• Fixed data order
• Fixed model space
• Fixed loss function

2. Loss Function Teaching

• Goal: automatically discover the optimal loss functions for student model training.

3. Teaching Requirements

• Requirements of loss function teaching: the taught loss should be adaptive and dynamic.

• Parameterized loss: $L_\phi(f_\omega(x), y)$, with $\phi$ as its coefficients; concretely, $L_\phi = \sigma\!\left(-\log p(x)^{\top} W \,\vec{y} + b\right)$ with $\phi = \{W, b\}$ (sketched in code below).
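A minimal PyTorch sketch of this parameterized loss, assuming $\sigma$ is the logistic sigmoid, $p(x)$ are the student's softmax probabilities, and $\vec{y}$ is the one-hot label; the function name and batching are illustrative choices, not taken from the paper:

```python
import torch
import torch.nn.functional as F

def parameterized_loss(logits, targets, W, b):
    """Sketch of L_phi = sigma(-log p(x)^T W y_vec + b), averaged over a batch.

    phi = {W, b} are the teacher-provided coefficients. With W = identity,
    b = 0 and sigma replaced by the identity map, the term inside sigma is
    the ordinary cross-entropy.
    """
    num_classes = logits.size(-1)
    log_p = F.log_softmax(logits, dim=-1)             # log p(x), shape (B, C)
    y_vec = F.one_hot(targets, num_classes).float()   # one-hot y, shape (B, C)
    bilinear = -((log_p @ W) * y_vec).sum(dim=-1)     # -log p(x)^T W y_vec, shape (B,)
    return torch.sigmoid(bilinear + b).mean()         # sigma(. + b), batch mean
```

For example, calling it with `W = torch.eye(num_classes)` and `b = torch.tensor(0.)` makes the inner term exactly the per-example cross-entropy.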

• Teacher model: $\phi = \mu_\theta$ (a network sketch follows below).

• Adaptive: $\phi_t = \mu_\theta(s_t)$, i.e. the coefficients are computed from the current training state $s_t$.

• Dynamic: $\phi_t$ changes across training steps $t$.

• Reward: dev-set measure of the student.
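One plausible realization of $\mu_\theta$ is a small network that maps state features $s_t$ (e.g. training progress and recent losses) to the coefficients $\phi_t = \{W_t, b_t\}$; the state features, layer sizes, and activation below are assumptions for illustration, not the paper's exact architecture:

```python
import torch
import torch.nn as nn

class Teacher(nn.Module):
    """Sketch of the teacher mu_theta: maps a 1-D training-state vector s_t
    to loss coefficients phi_t = {W_t, b_t} for the parameterized loss."""

    def __init__(self, state_dim, num_classes, hidden_dim=32):
        super().__init__()
        self.num_classes = num_classes
        self.body = nn.Sequential(nn.Linear(state_dim, hidden_dim), nn.Tanh())
        self.to_W = nn.Linear(hidden_dim, num_classes * num_classes)  # flattened W_t
        self.to_b = nn.Linear(hidden_dim, 1)                          # scalar bias b_t

    def forward(self, state):
        h = self.body(state)
        W = self.to_W(h).view(self.num_classes, self.num_classes)
        b = self.to_b(h).squeeze(-1)
        return W, b

# Illustrative usage: s_t could collect quantities such as the fraction of
# training finished and the recent training loss.
teacher = Teacher(state_dim=2, num_classes=10)
W_t, b_t = teacher(torch.tensor([0.3, 1.25]))
```

The teacher's parameters $\theta$ are then updated so that the student trained under $L_{\phi_t}$ scores well on the dev measure, as stated in the reward above.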

4. Challenge & Algorithm

• Gradient-based optimization for the teacher model (toy sketch below)

• Algorithm / structure
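The central challenge is that the dev-set performance depends on the teacher parameters $\theta$ only through the student's optimization trajectory. The following toy, self-contained sketch shows the underlying idea, differentiating a dev loss through one student gradient step into a single scalar teacher coefficient; the linear student, squared losses, and single-step horizon are simplifications for illustration, not the paper's actual algorithm:

```python
import torch

torch.manual_seed(0)
x_tr, y_tr = torch.randn(8, 4), torch.randn(8)     # toy training data
x_dev, y_dev = torch.randn(8, 4), torch.randn(8)   # toy dev data

omega = torch.zeros(4, requires_grad=True)         # student parameters
w_phi = torch.tensor(1.0, requires_grad=True)      # teacher-provided loss coefficient
lr_student, lr_teacher = 0.1, 0.05

for step in range(50):
    # Teacher-shaped training loss (here phi simply scales a squared error).
    train_loss = w_phi * ((x_tr @ omega - y_tr) ** 2).mean()
    grad_omega, = torch.autograd.grad(train_loss, omega, create_graph=True)
    omega_new = omega - lr_student * grad_omega     # one differentiable student update

    # Dev objective; its gradient flows back into w_phi through omega_new.
    dev_loss = ((x_dev @ omega_new - y_dev) ** 2).mean()
    grad_phi, = torch.autograd.grad(dev_loss, w_phi)

    with torch.no_grad():
        w_phi -= lr_teacher * grad_phi              # gradient step on the teacher
        omega.copy_(omega_new)                      # commit the student update
```

In the full method the same idea must span many student steps and the whole coefficient set $\phi = \{W, b\}$, and the dev measure itself may be non-differentiable (e.g. accuracy or BLEU), which is why a dedicated algorithm is needed; see the paper for the actual procedure.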

5. Experiments

• Image Classification Task

• Neural Machine Translation Task

• Student model: $f_\omega: x \to y$, trained by minimizing $L(f_\omega, D_{\mathrm{train}}) = \sum_{(x,y)\in D_{\mathrm{train}}} l(f_\omega(x), y)$; $m(f_\omega(x), y)$ denotes the evaluation measure.

• Teacher model: $\mu_\theta$, trained to maximize the student's dev performance, i.e. $\max_\theta m(f_\omega, D_{\mathrm{dev}})$.
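Taken together, the student and teacher objectives form a bilevel problem. The following LaTeX restates the definitions above compactly; the explicit $\omega^{*}(\theta)$ nesting is an added clarification, not a formula copied from the poster:

```latex
% Bilevel view of loss-function teaching (notation summarized from the
% definitions above; the omega^*(theta) nesting is an added clarification).
\max_{\theta}\; m\!\left(f_{\omega^{*}(\theta)},\, D_{\mathrm{dev}}\right)
\quad \text{s.t.} \quad
\omega^{*}(\theta) \in \arg\min_{\omega} \sum_{(x,y)\in D_{\mathrm{train}}}
    L_{\phi}\!\left(f_{\omega}(x),\, y\right),
\qquad \phi = \mu_{\theta}.
```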