Factorization Machine 2016/12

© ü b Factorization Machine - GitHub Pages · 2019-01-19 · • Rendle S. Factorization Machines[C]. international conference on data mining, 2010. • Blondel M, Fujino A, Ueda



Factorization Machine

2016/12


Outline

• FM

• vs

• vs

• vs

• FM FFM FM


FM


• sigmoid


• w_ = 0.5 y =

• CTR etc

gbdt DNN deep-width model


• 0-18 19-25 25-30 30+

• onehot

• eg: =20: [0, 1, 0, 0] [1, 1, 0, 0]

• gini
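The bucketing and one-hot step on this slide can be sketched as follows. This is a minimal illustration, not the author's code: the bucket boundaries are read off the slide (0-18, 19-25, 25-30, 30+), the value 25 is assumed to fall in the second bucket because the slide's ranges overlap there, and the second vector in the slide's example, `[1, 1, 0, 0]`, is interpreted as a cumulative encoding. The function names are hypothetical.

```python
from bisect import bisect_left

# Inclusive upper bounds of the first three buckets; the last bucket is open-ended.
# Assumption: a value of exactly 25 belongs to the second bucket.
UPPER_BOUNDS = [18, 25, 30]

def bucket_index(value):
    """Return the index of the bucket that contains value."""
    return bisect_left(UPPER_BOUNDS, value)

def one_hot(value):
    """One-hot encoding: a single 1 at the bucket index."""
    vec = [0] * (len(UPPER_BOUNDS) + 1)
    vec[bucket_index(value)] = 1
    return vec

def cumulative(value):
    """Cumulative encoding: 1 for every bucket up to and including the value's bucket."""
    idx = bucket_index(value)
    return [1 if i <= idx else 0 for i in range(len(UPPER_BOUNDS) + 1)]

print(one_hot(20), cumulative(20))
```

With the value 20 from the slide, the two encodings reproduce the two vectors shown in the example bullet.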


• + = += etc

• + +


• SVM, d=2

• xi xj != 0

• xi xj

• CTR xi xj


• k << n

• W

• 0
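For reference, the factorization idea (k << n, with the pairwise weight matrix W replaced by inner products of k-dimensional latent vectors) is the second-order model from Rendle (2010). The sketch below is an illustration under that assumption; the function and variable names are hypothetical. It computes the prediction twice, once pair by pair and once with the linear-time rewrite, and checks that both agree.

```python
import random

def fm_predict_naive(x, w0, w, V):
    """y = w0 + sum_i w_i x_i + sum_{i<j} <v_i, v_j> x_i x_j, computed pair by pair."""
    n = len(x)
    y = w0 + sum(w[i] * x[i] for i in range(n))
    for i in range(n):
        for j in range(i + 1, n):
            dot = sum(a * b for a, b in zip(V[i], V[j]))
            y += dot * x[i] * x[j]
    return y

def fm_predict_fast(x, w0, w, V):
    """Same value in O(k*n): 0.5 * sum_f [(sum_i v_if x_i)^2 - sum_i (v_if x_i)^2]."""
    n, k = len(x), len(V[0])
    y = w0 + sum(w[i] * x[i] for i in range(n))
    for f in range(k):
        s = sum(V[i][f] * x[i] for i in range(n))
        s_sq = sum((V[i][f] * x[i]) ** 2 for i in range(n))
        y += 0.5 * (s * s - s_sq)
    return y

# Random parameters, only to compare the two formulas on one sparse sample.
rng = random.Random(0)
n, k = 6, 2
x = [1.0, 0.0, 1.0, 0.0, 0.0, 1.0]
w0 = 0.1
w = [rng.uniform(-1, 1) for _ in range(n)]
V = [[rng.uniform(-1, 1) for _ in range(k)] for _ in range(n)]

diff = abs(fm_predict_naive(x, w0, w, V) - fm_predict_fast(x, w0, w, V))
print(diff < 1e-9)
```

Because each feature only needs its own latent vector, a pair (x_i, x_j) that never co-occurs in training still gets a non-trivial interaction weight.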


• Rendle S. Factorization Machines[C]. International Conference on Data Mining, 2010.

• Rendle S. Factorization Machines with libFM[J]. ACM Transactions on Intelligent Systems and Technology, 2012, 3(3).


• ( )

• 0


• SGD

• ALS

• MCMC

• libFM

• Spark FM DMSPA

• difacto DMLC PS FM

• tensorflow FM https://github.com/kopopt/fast_tffm

Rendle S. Factorization Machines with libFM[J]. ACM Transactions on Intelligent Systems and Technology, 2012, 3(3).
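Of the learning methods listed on this slide, SGD is the simplest to sketch. The code below is a minimal illustration assuming a squared loss and no regularization; it is not the libFM implementation, and the toy data, learning rate, and function names are made up for the example.

```python
import random

def fm_predict(x, w0, w, V):
    """Second-order FM on a sparse sample x = {feature index: value}."""
    k = len(V[0])
    y = w0 + sum(w[i] * xi for i, xi in x.items())
    # sums[f] = sum_i v_if * x_i, reused by the gradient below
    sums = [sum(V[i][f] * xi for i, xi in x.items()) for f in range(k)]
    for f in range(k):
        squares = sum((V[i][f] * xi) ** 2 for i, xi in x.items())
        y += 0.5 * (sums[f] ** 2 - squares)
    return y, sums

def sgd_step(x, target, w0, w, V, lr=0.05):
    """One SGD update for the loss 0.5 * (y - target)^2; returns the new w0."""
    y, sums = fm_predict(x, w0, w, V)
    err = y - target
    w0 -= lr * err
    for i, xi in x.items():
        w[i] -= lr * err * xi
        for f in range(len(V[i])):
            # dy/dv_if = x_i * sum_j v_jf x_j - v_if * x_i^2
            grad = xi * sums[f] - V[i][f] * xi * xi
            V[i][f] -= lr * err * grad
    return w0

def total_loss(data, w0, w, V):
    return sum(0.5 * (fm_predict(x, w0, w, V)[0] - t) ** 2 for x, t in data)

# Toy data (made up): the label depends on which pair of features is active.
data = [({0: 1.0, 2: 1.0}, 1.0), ({0: 1.0, 3: 1.0}, 0.0),
        ({1: 1.0, 2: 1.0}, 0.0), ({1: 1.0, 3: 1.0}, 1.0)]

rng = random.Random(0)
n, k = 4, 2
w0, w = 0.0, [0.0] * n
V = [[rng.uniform(-0.1, 0.1) for _ in range(k)] for _ in range(n)]

loss_before = total_loss(data, w0, w, V)
for _ in range(200):
    for x, t in data:
        w0 = sgd_step(x, t, w0, w, V)
loss_after = total_loss(data, w0, w, V)
print(loss_before, loss_after)
```

Only the parameters of the non-zero features are touched in each step, which is what keeps SGD cheap on sparse CTR-style data.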


• movielens


FM vs


review CF

• FM


CF FM

• (userID, itemID) label score

• f(userID, itemID) = userID one-hot ++ itemID one-hot

Example: userID = 123, itemID = 45, score = 3

userID one-hot: [0,0,…,1,0,…,0]    itemID one-hot: [0,0,…,1,0,…,0]

Concatenated: [0,0,…,1,0,…,1,0,…,0]    label: 3
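The encoding above can be sketched in Python. With exactly two non-zero entries, the second-order FM prediction collapses to a global bias, a user bias, an item bias, and one inner product, which is the familiar biased matrix-factorization form. The sizes, zero-based IDs, random parameters, and function names below are assumptions made for the illustration.

```python
import random

def encode(user_id, item_id, n_users, n_items):
    """Concatenate the user one-hot and the item one-hot into one vector."""
    x = [0] * (n_users + n_items)
    x[user_id] = 1
    x[n_users + item_id] = 1
    return x

def fm_predict(x, w0, w, V):
    """Second-order FM, looping only over the non-zero features."""
    nz = [i for i, xi in enumerate(x) if xi != 0]
    y = w0 + sum(w[i] * x[i] for i in nz)
    for a in range(len(nz)):
        for b in range(a + 1, len(nz)):
            i, j = nz[a], nz[b]
            y += sum(p * q for p, q in zip(V[i], V[j])) * x[i] * x[j]
    return y

def mf_predict(user_id, item_id, n_users, w0, w, V):
    """Biased matrix factorization: w0 + b_user + b_item + <v_user, v_item>."""
    u, it = user_id, n_users + item_id
    return w0 + w[u] + w[it] + sum(p * q for p, q in zip(V[u], V[it]))

rng = random.Random(0)
n_users, n_items, k = 200, 100, 4   # hypothetical sizes
w0 = 3.0
w = [rng.uniform(-1, 1) for _ in range(n_users + n_items)]
V = [[rng.uniform(-1, 1) for _ in range(k)] for _ in range(n_users + n_items)]

x = encode(123, 45, n_users, n_items)
print(abs(fm_predict(x, w0, w, V) - mf_predict(123, 45, n_users, w0, w, V)) < 1e-12)
```

Appending further columns to `x` (context, user attributes, item attributes) is what lets FM go beyond plain CF without changing the model.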


CF FM

FM

• FM

• poi


FM vs


FM vs

• FM

x1=1

x2=1


100

40 60

40

x1=1 x1=0

x2=1 x2=0

x2=1

! FMFM


FM vs


• Hinton bottleneck dark knowledge

dark knowledge


embedding

• FM embedding

• word2vec FM

• FM embedding


deep & wide

Cheng H T, Koc L, Harmsen J, et al. Wide & deep learning for recommender systems[C]//Proceedings of the 1st Workshop on Deep Learning for Recommender Systems. ACM, 2016: 7-10.


deep & wide

Cheng H T, Koc L, Harmsen J, et al. Wide & deep learning for recommender systems[C]//Proceedings of the 1st Workshop on Deep Learning for Recommender Systems. ACM, 2016: 7-10.


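FFM

• (Field)

• FM FFM

FM: x_i has a single latent vector w_i and x_j has a single latent vector w_j; the interaction weight is (w_i · w_j)

FFM: x_i has one latent vector per field, w_i1, w_i2, …, w_if, and x_j has w_j1, w_j2, …, w_jf; the interaction weight is (w_i,fj · w_j,fi)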
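The field-aware interaction can be sketched as follows. This is a minimal illustration of the pairwise term only (no bias or linear terms, as in the formulation of Juan et al.), not the libFFM implementation; the data layout, the names `field_of` and `W`, and the toy numbers are assumptions.

```python
def ffm_predict(x, field_of, W):
    """Pairwise FFM term.

    x:        sparse sample, {feature index: value}
    field_of: field_of[i] is the field that feature i belongs to
    W:        W[i][f] is the latent vector feature i uses against field f
    """
    idx = sorted(x)
    y = 0.0
    for a in range(len(idx)):
        for b in range(a + 1, len(idx)):
            i, j = idx[a], idx[b]
            w_i = W[i][field_of[j]]   # w_i,fj
            w_j = W[j][field_of[i]]   # w_j,fi
            y += sum(p * q for p, q in zip(w_i, w_j)) * x[i] * x[j]
    return y

# Toy example: two features, two fields, latent dimension 2 (numbers made up).
field_of = [0, 1]
W = [
    [[1.0, 0.0], [2.0, 3.0]],   # feature 0: vectors against field 0 and field 1
    [[4.0, 5.0], [0.0, 1.0]],   # feature 1: vectors against field 0 and field 1
]
x = {0: 1.0, 1: 1.0}
print(ffm_predict(x, field_of, W))   # (2*4 + 3*5) * 1 * 1 = 23.0
```

If every feature used the same vector for all fields, the sum would reduce to the FM pairwise term, which is one way to see FM as a special case of FFM.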


FFM

M. Jahrer, A. Töscher, J.-Y. Lee, J. Deng, H. Zhang, and J. Spoelstra, “Ensemble of collaborative filtering and feature engineered model for click through rate prediction,” in KDD Cup 2012 Workshop, ACM, 2012.


FFM

• O(k·n0^2), n0 = number of non-zero features

• k_ffm << k_fm, n0 k_ffm 10

• libFFM ffm Spark DMSPA

Juan Y, Zhuang Y, Chin W, et al. Field-aware Factorization Machines for CTR Prediction[C]. Conference on Recommender Systems, 2016: 43-50.


KDD CUP 2012

• KDD Cup 2012 CTR

• 234M

• AUC

M. Jahrer, A. Töscher, J.-Y. Lee, J. Deng, H. Zhang, and J. Spoelstra, “Ensemble of collaborative filtering and feature engineered model for click through rate prediction,” in KDD Cup 2012 Workshop, ACM, 2012.


• CTR Criteo Avazu Kaggle

Juan Y, Zhuang Y, Chin W, et al. Field-aware Factorization Machines for CTR Prediction[C]. Conference on Recommender Systems, 2016: 43-50.


FM

• d-way FM d

• High order FM d 2,3...,d-1

• Rendle S. Factorization Machines[C]. International Conference on Data Mining, 2010.

• Blondel M, Fujino A, Ueda N, et al. Higher-Order Factorization Machines[C]. Neural Information Processing Systems, 2016: 3351-3359.


FM

• FM

• Blondel M, Fujino A, Ueda N, et al. Higher-Order Factorization Machines[C]. Neural Information Processing Systems, 2016: 3351-3359.

ANOVA kernel

ANOVA kernel recurrence relation
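A minimal sketch of the recurrence, assuming the standard definition of the degree-t ANOVA kernel as the sum over all index sets j1 < … < jt of the products p_j x_j: writing a[j][t] for the kernel over the first j features, a[j][t] = a[j-1][t] + p_j x_j · a[j-1][t-1], with a[j][0] = 1. The code keeps a single row and updates it in place; the function names are hypothetical, and a brute-force version is included only as a cross-check.

```python
from itertools import combinations
from math import prod

def anova_kernel(p, x, degree):
    """Degree-`degree` ANOVA kernel via the recurrence, in O(n * degree) time."""
    a = [1.0] + [0.0] * degree          # a[t] holds the degree-t kernel so far
    for pj, xj in zip(p, x):
        z = pj * xj
        # Go from high to low degree so a[t - 1] is still the value
        # computed without the current feature.
        for t in range(degree, 0, -1):
            a[t] += z * a[t - 1]
    return a[degree]

def anova_kernel_brute(p, x, degree):
    """Direct sum over all index combinations; exponential, for checking only."""
    z = [pj * xj for pj, xj in zip(p, x)]
    return sum(prod(c) for c in combinations(z, degree))

p = [1.0, 2.0, 3.0, 4.0]
x = [1.0, 1.0, 1.0, 1.0]
print(anova_kernel(p, x, 2), anova_kernel_brute(p, x, 2))   # 35.0 35.0
```

For degree 2 this is the FM pairwise term for one latent dimension, so summing the kernel over the k latent dimensions recovers the second-order interaction.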


©2016-2017 XX