
Demystifying Deep Learning in Networking

Ying Zheng, Ziyu Liu, Xinyu You, Yuedong Xu, Junchen Jiang

Why Deep Neural Nets in Networking?

Application                   Example gains
Cloud job scheduling          32.4% less job slowdown
Adaptive-bitrate streaming    12-25% better QoE
Routing                       23-50% less congestion
Cellular traffic scheduling   14.7% higher utilization

Great promise of DNNs: double-digit improvements across multiple applications

Many more papers are coming!

So what’s wrong with DNNs in networking?

Traditional: System state → white-box logic → Action

Deep learning: System state → DNN → Action

The DNN is a “black box”. Problems:
• Hard to infer causality
• Hard to trust
• Agnostic to domain knowledge

Black-Box Problem #1: Hard to infer causality

Sequence of incoming jobs (lengths) → DNN → picked job (length)

5, 1, 3, 10, …  → DNN → 1
9, 2, 7, 15, …  → DNN → 2
20, 11, 13, 14  → DNN → Wait (not 11!)

So the DNN is probably picking the shortest job

But …

Correlation ≠ Causality

Problem #2: Hard to trust

(Figure: avg. job completion time (sec) across experiments, comparing rule-based logic (shortest-job-first) with DNN-based logic; lower is better.)

“The rule is suboptimal, but we know how it works”

The DNN is better on average, but we don’t know when or why it will fail

System engineers may favor certainty over performance

Problem #3: Agnostic to domain-specific knowledge

Empirical requirement: resource utilization should stay below 80%.

Job sequence → DNN → scheduling decision

It is hard to imbue a domain-specific rule into a black box.

Roadmap to white-box DNNs

•How is a decision made?

•When will it fail?

•Can domain-specific knowledge be integrated?

Outline

•Why try to interpret deep learning in networking

•An attempt to explain DNN in networking: A case study

•Examples of worst-case performance of DNN

• Improving performance by domain-specific knowledge

First attempt: use saliency map to understand DNNs

• Calculate the gradient vector $w_i = \frac{\partial y_i}{\partial x}$
  • x: input vector
  • y: output vector
• The $j^{\text{th}}$ element of $w_i$ shows how much a small change in the $j^{\text{th}}$ feature of x changes how likely the $i^{\text{th}}$ action is to be picked (see the sketch below).
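A minimal sketch of this gradient computation in Python/PyTorch (the policy network, its sizes, and the input below are hypothetical placeholders, not the actual DeepRM model):

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for a scheduling policy: state vector in, action probabilities out.
policy = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 5), nn.Softmax(dim=-1))

x = torch.randn(20, requires_grad=True)   # input state x
y = policy(x)                             # output vector y of action probabilities

# w_i = dy_i/dx: how a small change in each feature of x changes the probability of action i.
i = int(y.argmax())                       # inspect the action the policy favors
w_i, = torch.autograd.grad(y[i], x)
print(w_i)                                # saliency map for action i
```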

DeepRM Input/Output: A Case Study

• Input: remaining resources, resource requirements of waiting jobs
• Output: action probability over possible actions (a small sketch follows below)

(Figure: input state (CPU and MEM occupancy of the cluster and of Job 1, Job 2, Job 3) → Deep Neural Network (DNN) → action space.)

• Request time of jobs (outstanding duration)
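As an illustration of this input/output interface (the sizes, layout, and network below are assumptions, not the exact DeepRM encoding), the policy can be viewed as a flattened state image mapped to a probability per action, i.e. schedule one of the visible jobs or wait:

```python
import torch
import torch.nn as nn

NUM_JOB_SLOTS = 3       # assumed: Job 1..3 visible in the state
STATE_FEATURES = 200    # assumed size of the flattened CPU/MEM occupancy + job-demand image

# Hypothetical DeepRM-style policy: state image -> probabilities over (schedule job i) or (wait).
policy = nn.Sequential(
    nn.Linear(STATE_FEATURES, 32),
    nn.ReLU(),
    nn.Linear(32, NUM_JOB_SLOTS + 1),   # +1 for the "wait" action
    nn.Softmax(dim=-1),
)

state = torch.rand(STATE_FEATURES)      # remaining resources + per-job CPU/MEM requirements
action_probs = policy(state)
action = int(torch.multinomial(action_probs, 1))   # sample an action during training
```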

Which features drive DeepRM’s decision?

• Saliency map

• CPU/RAM occupancy has a small impact on the output
• High-level features connect to the intermediate layer and affect the final output

Relationship between intermediate output and decision

(Figure: hidden neuron activation value as a function of job length, for job lengths 0-20.)
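One way to produce such a plot is to sweep a single input feature and record an intermediate layer's activation. A small sketch, reusing the hypothetical policy layout assumed earlier (the feature index and probed neuron are also assumptions):

```python
import torch
import torch.nn as nn

STATE_FEATURES = 200      # assumed flattened state size, as in the sketch above
JOB_LENGTH_FEATURE = 0    # hypothetical index of the job-length feature in the state
PROBED_NEURON = 5         # hypothetical hidden neuron to inspect

policy = nn.Sequential(nn.Linear(STATE_FEATURES, 32), nn.ReLU(),
                       nn.Linear(32, 4), nn.Softmax(dim=-1))

# Capture the first hidden layer's output on every forward pass.
activations = {}
policy[0].register_forward_hook(lambda mod, inp, out: activations.update(hidden=out.detach()))

base_state = torch.rand(STATE_FEATURES)
for job_length in range(0, 21, 4):        # sweep job length 0..20
    state = base_state.clone()
    state[JOB_LENGTH_FEATURE] = float(job_length)
    policy(state)                         # forward pass fills activations["hidden"]
    print(job_length, float(activations["hidden"][PROBED_NEURON]))
```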

Outline

•Why try to interpret deep learning in networking

•Reverse-engineer DNN in networking: A case study

•Examples of worst-case performance of DNN

• Improving performance by domain-specific knowledge

When the DNN fails on a different workload than the one it was trained on

Comparison objective: average job slowdown

Testing distribution      DNN     Rule-based algorithm
Training distribution     3.73    4.51
                          3.25    4.87
                          3.93    5.37
                          3.03    4.71
Other distribution        10.57   9.69
                          11.44   10.23
                          12.02   10.75
                          11.48   10.22
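For reference, average job slowdown follows the usual DeepRM-style definition: the mean ratio of a job's completion time (arrival to finish) to its ideal duration. A minimal sketch, assuming jobs are given as (completion_time, duration) pairs:

```python
def average_job_slowdown(jobs):
    """Mean slowdown over finished jobs: slowdown = completion_time / duration."""
    slowdowns = [completion / duration for completion, duration in jobs]
    return sum(slowdowns) / len(slowdowns)

# Example: one job delayed by queueing, one served immediately.
print(average_job_slowdown([(4.0, 2.0), (3.0, 3.0)]))   # -> 1.5
```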

When the DNN fails *even* on the same workload it was trained on

Comparison objective: average job slowdown

Test data              DeepRM   Rule-based algorithm
Normal sequence        1.84     2.01
Adversarial sequence   1.80     1.46

Outline

•Why try to interpret deep learning in networking

•Reverse-engineer DNN in networking: A case study

•Examples of worst-case performance of DNN

• Improving performance by domain-specific knowledge

Domain-specific knowledge might provide a cure

•Better transferability

•Better robustness

•Better training efficiency

A standard way to incorporate domain knowledge

(Figure: teacher-student setup: the rule acts as the teacher and DeepRM as the student, connected through a projection step and a loss function trained via back-propagation.)
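A minimal sketch of this teacher-student idea (the rule acts as a teacher whose action distribution regularizes the student DNN through an extra loss term; the network, the rule, and the weighting below are illustrative assumptions, not the exact method on the slide):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

NUM_ACTIONS = 4       # assumed: schedule one of 3 jobs, or wait
STATE_FEATURES = 200  # assumed flattened state size

student = nn.Sequential(nn.Linear(STATE_FEATURES, 32), nn.ReLU(), nn.Linear(32, NUM_ACTIONS))
optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)

def rule_teacher(state):
    """Hypothetical rule teacher (shortest-job-first) as a one-hot action distribution."""
    target = torch.zeros(NUM_ACTIONS)
    target[int(state[:NUM_ACTIONS].argmin())] = 1.0  # assumes job lengths sit in the first features
    return target

def training_step(state, task_loss_fn, rule_weight=0.5):
    log_probs = F.log_softmax(student(state), dim=-1)
    # Cross-entropy toward the rule's distribution pulls the student toward the domain rule...
    rule_loss = -(rule_teacher(state) * log_probs).sum()
    # ...while the usual task loss (e.g. a policy-gradient term) keeps optimizing performance.
    loss = task_loss_fn(log_probs) + rule_weight * rule_loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```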

Some takeaways; a lot more needs to be done

• DNNs in networking achieve superior performance

• DNNs’ limitations compel us to at least try to understand them.

• Our preliminary findings:

a) One can explain to some extent how networking DNNs work

b) DNNs may fail: they fit only the training distribution and are vulnerable to adversarial noise

Thank you for listening!
