Upload
others
View
8
Download
0
Embed Size (px)
Citation preview
Collaborative Deep Learning for Recommender Systems
Hao Wang
Naiyan Wang
Dit-Yan Yeung
1
•Motivation
•Stacked Denoising Autoencoders
•Probabilistic Matrix Factorization
•Collaborative Deep Learning
•Experiments
•Summary
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 2
Recommender Systems
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 3
Observed preferences:
To predict: Matrix completion
Rating matrix:
Recommender Systems with Content
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 4
Content information: Plots, directors, actors, etc.
Modeling the Content Information
Handcrafted features Automatically learn features
Automatically learn features and
adapt for ratings
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 5
1. Powerful features for content information
Deep learning
Modeling the Content Information
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 6
2. Feedback from rating information Non-i.i.d.
Collaborative deep learning
Deep Learning
Stacked denoising autoencoders
Convolutional neural networks
Recurrent neural networks
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 7
Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction.
Bengio et al. 2015
Deep Learning
Stacked denoising autoencoders
Convolutional neural networks
Recurrent neural networks
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Typically for i.i.d. data 8
1. Powerful features for content information
Deep learning
2. Feedback from rating information Non-i.i.d.
Collaborative deep learning (CDL)
Modeling the Content Information
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 9
Contribution
Collaborative deep learning:
* deep learning for non-i.i.d. data
* joint representation learning and
collaborative filtering
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 10
Contribution
Collaborative deep learning
Complex target:
* beyond targets like classification and regression
* to complete a low-rank matrix
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 11
Contribution
Collaborative deep learning
Complex target
First hierarchical Bayesian models for
hybrid deep recommender system
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 12
Contribution
Collaborative deep learning
Complex target
First hierarchical Bayesian models for
hybrid deep recommender system
Significantly advance the state of the art
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 13
•Motivation
•Stacked Denoising Autoencoders
•Probabilistic Matrix Factorization
•Collaborative Deep Learning
•Experiments
•Summary
14
Stacked Denoising Autoencoders (SDAE)
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Corrupted input Clean input
Vincent et al. 2010
15
•Motivation
•Stacked Denoising Autoencoders
•Probabilistic Matrix Factorization
•Collaborative Deep Learning
•Experiments
•Summary
16
Probabilistic Matrix Factorization (PMF)
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Graphical model:
Generative process:
Objective function if using MAP:
latent vector of item j
latent vector of user i
rating of item j from user i
Notation:
Salakhutdinov et al. 2008
17
•Motivation
•Stacked Denoising Autoencoders
•Probabilistic Matrix Factorization
•Collaborative Deep Learning
•Experiments
•Summary
18
Probabilistic SDAE
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Generalized SDAE
Graphical model:
Generative process:
corrupted input
clean input
weights and biases
Notation:
19
Collaborative Deep Learning
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Graphical model:
Collaborative deep learning SDAE
corrupted input
clean input
weights and biases
content representation
rating of item j from user i
latent vector of item j
latent vector of user i
Notation: Two-way interaction
•More powerful representation •Infer missing ratings from content •Infer missing content from ratings
20
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Neural network representation for degenerated CDL
Collaborative Deep Learning
21
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Collaborative Deep Learning
Information flows from ratings to content
22
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Collaborative Deep Learning
Information flows from content to ratings
23
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Collaborative Deep Learning
Reciprocal: representation and recommendation
24
Learning maximizing the posterior probability is equivalent to maximizing the joint log-likelihood
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 25
Learning
Prior (regularization) for user latent vectors, weights, and biases
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 26
Learning
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Generating item latent vectors from content representation with Gaussian offset
27
Learning
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
‘Generating’ clean input from the output of probabilistic SDAE with Gaussian offset
28
Learning
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Generating the input of Layer l from the output of Layer l-1 with Gaussian offset
29
Learning
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
measures the error of predicted ratings
30
Learning If goes to infinity, the likelihood becomes ¸s¸s
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 31
Update Rules For U and V, use block coordinate descent:
For W and b, use a modified version of backpropagation:
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 32
•Motivation
•Stacked Denoising Autoencoders
•Probabilistic Matrix Factorization
•Collaborative Deep Learning
•Experiments
•Summary
33
Datasets
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Content information
Titles and abstracts Titles and abstracts Movie plots
Wang et al. 2011 Wang et al. 2013
34
Evaluation Metrics
Recall:
Mean Average Precision (mAP):
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Higher recall and mAP indicate better recommendation performance
35
Comparing Methods
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Hybrid methods using BOW and ratings
Loosely coupled; interaction is not two-way
PMF+LDA
36
Recall@M
citeulike-t, sparse setting
citeulike-t, dense setting
Netflix, sparse setting
Netflix, dense setting
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
When the ratings are very sparse:
When the ratings are dense:
37
Mean Average Precision (mAP)
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
Exactly the same as Oord et al. 2013, we set the cutoff point at 500 for each user.
A relative performance boost of about 50%
38
Number of Layers Sparse Setting
Dense Setting
Motivation Stacked DAE PMF Collaborative DL Experiments Summary
The best performance is achieved when the number of layers is 2 or 3 (4 or 6 layers of generalized neural networks).
39
Example User
Moonstruck
True Romance
Romance Movies
Precision: 30% VS 20% Motivation Stacked DAE PMF Collaborative DL Experiments Summary 40
Example User
Johnny English
American Beauty
Action & Drama Movies
Precision: 50% VS 20% Motivation Stacked DAE PMF Collaborative DL Experiments Summary 41
Example User
Precision: 90% VS 50% Motivation Stacked DAE PMF Collaborative DL Experiments Summary 42
•Motivation
•Stacked Denoising Autoencoders
•Probabilistic Matrix Factorization
•Collaborative Deep Learning
•Experiments
•Summary
43
Summary
Non-i.i.d (collaborative) deep learning
With a complex target
First hierarchical Bayesian models for
hybrid deep recommender system
Significantly advance the state of the art
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 44
Word2vec, tf-idf
Sampling-based, variational inference
Tagging information, networks
Summary
Motivation Stacked DAE PMF Collaborative DL Experiments Summary 45