Upload
others
View
11
Download
0
Embed Size (px)
Citation preview
Deep Learning ExplainedDolev Pomeranz, Chief Architect
Trax @ BGU
2017
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
2
Intro to Trax
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
3
Traxs’ Business Application
3
Manual Audit
Slow and ExpensiveInconsistent untraceable
Fast and CheapConsistentTraceable
Trax Automatic Recognition Audit
‘Big Data’ for retail
AVAILABILITY SHARE OFSHELF
PRICING PROMOTIONALACTIVATIONS
COMPETITVEINSIGHTS
PLANOGRAMCOMPLIANCE
SHELF STANDARDS
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
4
Trax unlocks ‘Big Data’ for the retail industry
Scale of coverage Scale of the Data
• Visits• Scenes
• Images• Products
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
v
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
5
Welcome to Trax Universe
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
6
Trax’s Visual Challenges
# Classes
Fine-Grained Classification
Crowded Scene
Dynamic Classes
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
7
Trax AI Retail InfrastructureImages Actionable Insights
Trax AI Retail Infrastructure
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
8
Computational
power
Data Science
Models
Data Science
Engine
Deep Fine-Grained
Recognition Engine
Deep
Learning
Computational
power
Retail ‘Big Data’
Inside the Trax AI Retail Infrastructure
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
9
AI – Quest for learning
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
10
What can we learn
CV NLP SR RL
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
11
The AI revolution
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
12
The AI revolution
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
AI & Machine learning
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
What is learning?
𝐿𝑒𝑎𝑟𝑛𝑖𝑛𝑔 → 𝐺𝑒𝑛𝑒𝑟𝑎𝑙𝑖𝑧𝑎𝑡𝑖𝑜𝑛
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
The power of Math
Source http://www.eric-kim.net/eric-kim-net/posts/1/kernel_trick.html
→𝐺𝑒𝑛𝑒𝑟𝑎𝑙𝑖𝑧𝑎𝑡𝑖𝑜𝑛 𝐿𝑜𝑐𝑎𝑙𝑖𝑡𝑦 → 𝐷𝑖𝑠𝑡𝑎𝑛𝑐𝑒
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
v
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
16
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
17
Introduction to ‘Neural networks’
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
Biologically inspired?
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
Biologically inspired?
The real Neuron
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
Biologically inspired?
Simple and Complex cells (1981 Nobel – Hubel, Wiesel)
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
Biologically inspired?
Place and Grid cells (2014 Nobel - O'Keefe, Moser, Moser)
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
Biologically inspired?
Concept cell – Luke Skywalker cell
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
History of Neural Networks
• 1940’s – Complex behavior from a network of simple units [Hebb]
• 1950’s – Perceptron [Rosenblatt]
– Can implement NAND (universal gate)
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
Neural Networks
• The Neuron
𝑧 ∈ ℝ ֜ 𝐿𝑒𝑎𝑟𝑛𝑖𝑛𝑔
𝑧 = 𝑓 𝑊 ∙ 𝑋 + 𝑏
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
Neural Networks
Activation functions
𝑧 =1
1 + 𝑒− 𝑊∙𝑋+𝑏
𝑆𝑖𝑔𝑚𝑜𝑖𝑑
𝑧 = tanh 𝑊 ∙ 𝑋 + 𝑏
𝑇𝑎𝑛ℎ
𝑧 = max 0,𝑊 ∙ 𝑋 + 𝑏
𝑅𝑒𝑙𝑢
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
Neural Networks
• MLP - multilayer perceptron (fully connected)
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
27
Deep learning explained
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
28
Why should we explain?
Source: https://www.eff.org/ai/metrics
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
29
Why should we explain?
Source: https://www.eff.org/ai/metrics
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
30
State of CV and AI just before DL
Andrej Karpathy blog
http://karpathy.github.io/ 2012/10/22etats/-fo-retupmoc-noisiv/
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
31
Why should we explain?
Source: https://www.eff.org/ai/metrics
Deep Learning
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
32
Why should we explain?
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
33
Why should we explain?
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
Main properties
Deep & Huge
Architecture
Single
Internet & MobileAugmentation
Backpropagation
SGD
GPU
Transfer learning
Model
Data
Training
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
35
Data
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
36
Data – Internet & Mobile
Pope Francis Pope Benedict
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
37
Data – Underappreciated
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
38
Data – Augmentation + Synthesis
https://www.cs.tau.ac.il/~wolf/papers/repcounticcv.pdfhttps://sites.google.com/site/mrsdproject201415teamg/document
/softwarehttps://github.com/udacity/self-driving-car-sim
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
39
Data – Transfer Learning
https://medium.com/merantix/applying-deep-learning-to-real-world-problems-
ba2d86ac5837
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
40
Model
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
41
Model – Architecture
Convolutional layer
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
42
Model – Architecture
Representation learning
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
43
Model – Architecture
Pooling Layer
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
44
Model – Deep & Huge
Remember MLP
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
45
Model – Deep & Huge
LeNet [1989]
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
46
Model – Deep & Huge
AlexNet [2012]
(SuperVision)
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
47
Model – Deep & Huge
GoogLeNet [2014]
(Inception)
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
48
Model – Deep & Huge
http://www.topbots.com/a-brief-history-of-neural-network-architectures/
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
49
Model – Single VS Pipeline
• Traditional
• Deep
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
50
Training
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
51
Training – Backpropagation
How to minimize?
𝑓 𝑥 = 𝑦
argmin𝑥
𝑓 𝑥
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
52
Training – Backpropagation
Method 1 – Random search
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
53
Training – Backpropagation
Method 1 – Random search
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
54
Training – Backpropagation
Method 2 – Approx. Gradient Decent
𝑑𝑓 𝑥
𝑑𝑥= lim
ℎ→0
𝑓 𝑥 + ℎ − 𝑓 𝑥
ℎ
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
55
Training – Backpropagation
Method 2 – Approx. Gradient Decent
𝑑𝑓 𝑥
𝑑𝑥≈𝑓 𝑥 + ℎ − 𝑓 𝑥 − ℎ
2ℎ
Approximating the gradient – numerical analysis
𝐸𝑟𝑟𝑜𝑟:
𝑂 ℎ2
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
56
Training – Backpropagation
Method 2 – Approx. Gradient Decent
𝐿 =1
𝑁
𝑖=1
𝑁
𝐿𝑖 𝑥𝑖 , 𝑦𝑖 ,𝑊 + 𝜆𝑅 𝑊
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
57
Training – Backpropagation
Method 2 – Approx. Gradient Decent
𝛻𝑊𝐿 =𝑑𝐿 𝑤1𝑑𝑤1
, ⋯ ,𝑑𝐿 𝑤𝑚𝑑𝑤𝑚𝐺𝑟𝑎𝑑𝑖𝑒𝑛𝑡
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
58
Training – Backpropagation
Method 2 – Approx. Gradient Decent
Back to Neural nets
AlexNet → 𝑚 = 65,000,000
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
59
Training – Backpropagation
Method 2 – Approx. Gradient Decent
Remember the huge data set
→ 𝑁 = 14,000,000
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
60
Training – Backpropagation
Running time
𝐼𝑡𝑒𝑟𝑎𝑡𝑖𝑜𝑛 𝑆𝑝𝑒𝑒𝑑 = 𝐹𝑜𝑟𝑤𝑎𝑟𝑑 𝑝𝑎𝑠𝑠 𝑠𝑝𝑒𝑒𝑑 ∗ 𝑁𝑒𝑤 𝑤𝑒𝑖𝑔ℎ𝑡𝑠 𝑝𝑎𝑠𝑠𝑒𝑠 ∗ 𝐷𝑎𝑡𝑎 𝑠𝑒𝑡 𝑠𝑖𝑧𝑒
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
61
Training – Backpropagation
Summary so far
Random Approx. GD
Code Simple Simple
New weights
passes
Fast 𝑂(1) Slow 𝑂(𝑚)
Iteration speed Slow 𝑂(𝑁) Slow 𝑂(𝑚 ∗ 𝑁)
Gradient None Approx.
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
62
Training – Backpropagation
Can we do better?
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
63
Training – Backpropagation
Calculus recap
The sums, products, and compositions of analytic functions are analytic
Any analytic function is infinitely differentiable
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
64
Training – Backpropagation
Neural network is a composition of analytical functions
= tanh 𝑊2,1 ∙ tanh 𝑊1,1 ∙ 𝑋 + 𝑏1,1 , tanh 𝑊1,2 ∙ 𝑋 + 𝑏1,2 , ⋯ + 𝑏2,1
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
65
Training – Backpropagation
Gradients using the Chain rule
𝑐
𝜕𝐿
𝜕𝑐
𝜕𝐿
𝜕𝑎=𝜕𝐿
𝜕𝑐
𝜕𝑐
𝜕𝑎
𝑎
𝑏
𝜕𝐿
𝜕𝑏=𝜕𝐿
𝜕𝑐
𝜕𝑐
𝜕𝑏
Assume known
Then:
And:
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
66
Training – Backpropagation
• Analytic functions– Exploiting structure
• Chain rule – Gradient based only on following neighbor neurons
• Backward pass– Propagating the gradient, caching calculations
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
67
Training – Backpropagation
Dynamic programming
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
68
Training – Backpropagation
𝐵𝑎𝑐𝑘𝑝𝑟𝑜𝑝𝑎𝑔𝑎𝑡𝑖𝑜𝑛 = 𝐶ℎ𝑎𝑖𝑛 𝑅𝑢𝑙𝑒 + 𝐷𝑦𝑛𝑎𝑚𝑖𝑐 𝑝𝑟𝑜𝑔𝑟𝑎𝑚𝑚𝑖𝑛𝑔
𝑚𝑎𝑡ℎ 𝑐𝑜𝑚𝑝𝑢𝑡𝑒𝑟 𝑠𝑐𝑖𝑒𝑛𝑐𝑒
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
69
Training – SGD
Great improvement but still slow
Remember the huge data set
→ 𝑁 = 14,000,000
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
70
Training – SGD
Simple solution − Stochastic Gradient Decent
• Each epoch permutated the data– stochastic
• Each iteration take a small constant subset (i.e., mini-batch)– gradient approximation
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
71
Training – SGD
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
72
Training – SGD
Algorithms comparison
Random Approx. GD Analytic SGD
Code Simple Simple Complex
New weights
passes
Fast 𝑂(1) Slow 𝑂(𝑚) Fast 𝑂(1)
Iteration speed Slow 𝑂(𝑁) Slow 𝑂(𝑚 ∗ 𝑁) Fast 𝑂(1)
Gradient None Approx. Exact
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
73
Training – Intermediate summary
Can we do even better?
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
74
Training – Intermediate summary
Simple network classifying MNIST digits
https://cloud.google.com/blog/big-data/ 2017/01nrael/-wolfrosnet-dna-
peed-gninrael-tuohtiw-a-dhp
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
75
Training – Intermediate summary
Working on mini-batches of 100 images
https://cloud.google.com/blog/big-data/ 2017/01nrael/-wolfrosnet-dna-
peed-gninrael-tuohtiw-a-dhp
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
76
Training – Intermediate summary
Working on mini-batches of 100 images
https://cloud.google.com/blog/big-data/ 2017/01nrael/-wolfrosnet-dna-
peed-gninrael-tuohtiw-a-dhp
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
77
Training – Intermediate summary
Using algebra matrix notations
https://cloud.google.com/blog/big-data/ 2017/01nrael/-wolfrosnet-dna-
peed-gninrael-tuohtiw-a-dhp
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
78
Training – GPU
Someone already knows how to multiply matrices fast!
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
79
Training – GPU
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
80
Training – GPU
One GPU to rule them all
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
81
Training – Summary
𝐼𝑡𝑒𝑟𝑎𝑡𝑖𝑜𝑛 𝑆𝑝𝑒𝑒𝑑 = 𝐹𝑜𝑟𝑤𝑎𝑟𝑑 𝑝𝑎𝑠𝑠 𝑠𝑝𝑒𝑒𝑑 ∗ 𝑁𝑒𝑤 𝑤𝑒𝑖𝑔ℎ𝑡𝑠 𝑝𝑎𝑠𝑠𝑒𝑠 ∗ 𝐷𝑎𝑡𝑎 𝑠𝑒𝑡 𝑠𝑖𝑧𝑒
BackpropagationGPU SGD
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
82
Recent research
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
83
One Model To Learn Them All
[16/6/2017] L. Kaiser, A. N. Gomez, N. Shazeer, A. Vaswani, N. Parmar, L. Jones, J. Uszkoreit
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
84
See, Hear, and Read: Deep Aligned Representations
[3/6/2017] Y. Aytar, C. Vondrick, A. Torralba
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
85
Distill – Research Debt
http://distill.pub/2017/research-debt/
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
86
Open source
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
v
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
87
Open source libraries
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
v
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
88
We capture retail today. So you can see the opportunity of tomorrowIntellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved
Intellectual Property Information © 2016 Trax Image Recognition. All Rights Reserved.
This document and the information contained herein is confidential. You must hold this document and any oral information provided
in connection with this document in strict confidence; Any reproduction, retransmission, republication, translation, or other use of, all
or part of this document is expressly prohibited, unless prior written permission has been granted by Trax.
This document is provided for information purposes only for the exclusive use of the recipients to whom it is addressed and the
contents hereof are subject to change without notice. Whilst the information contained herein has been prepared in good faith, it is
not warranted to be error-free, nor subject to any other warranties or conditions, whether expressed orally or implied in law,
including implied warranties and conditions of merchantability, fitness for a particular purpose and/or in relation to the accuracy,
completeness or reliability of the information contained in this document or any oral information provided in connection herewith, or
any data it generates. Trax accepts no responsibility, obligation or liability (whether direct or indirect, in contract, tort or otherwise) in
relation to any of such information. Trax expressly disclaim any and all liability may be based on this document and any errors
therein or omissions therefrom; and no contractual obligations are formed either directly or indirectly by this document.
Trax, the Trax logo and other all other Trax trademarks, logos and service marks used in this document are the trademarks or
service marks of Trax and its affiliates. All other marks contained herein are the property of their respective owners. Trax has
intellectual property rights relating to technology that is described in this document.
We capture retail today,
So you can see the opportunity of tomorrow