15
Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Gui llermo Sapiro Journal of Machine Learning Research 2010

Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Embed Size (px)

Citation preview

Page 1: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Online Learning for Matrix Factorization and Sparse Coding

Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro

Journal of Machine Learning Research 2010

Page 2: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Introduction

• This paper focuses on the large scale matrix factorization problem, including– Dictionary learning for sparse coding– Non-negative matrix factorization (NMF)– Sparse principal component analysis (SPCA)

• Contributions of this paper:– An iterative online algorithm is proposed for large scale matrix

factorization– This algorithm is proved to converge almost surely to a stationary

point of the objective function– This algorithm is shown to be much faster than previous methods

in the experiment.

Page 3: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Problem Statement

• Classical dictionary learning problem Given a finite training set , the objective is

to optimize the following function

where

• Online Learning

This algorithm process one sample (or a mini-batch) at a time and sequentially minimize the following function:

Page 4: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Basic Algorithm

Page 5: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Dictionary Update

Page 6: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Optimizing the Algorithm

• Handling fixed-sized data sets

• Scaling the “past” data

• Mini-batch extension

Page 7: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Proof of Convergence

• Assumptions:

• Main results

Page 8: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Extensions to Matrix Factorization

• Non-negative matrix factorization (NMF)

• Non-negative sparse coding (NNSC)

• Sparse principal component analysis (SPCA)

Page 9: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Data for Experiment

• 1.25 million patches from Pascal VOC’06 image database

Page 10: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Online VS. Batch

• Training data size: 1 million• OL1:• OL2:• OL3:

Page 11: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Comparison with NMF and NNSC• NMF

• NNSC

Page 12: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Face Results

Page 13: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Image Patches Results

Page 14: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Inpainting Results

• Image size: 12-Megapixel• Dictionary with 256 elements• Training data: 7 million 12 by 12 color patches

Page 15: Online Learning for Matrix Factorization and Sparse Coding Julien Mairal, Francis Bach, Jean Ponce and Guillermo Sapiro Journal of Machine Learning Research

Conclusion

• A new online algorithm for learning dictionaries adapted to sparse coding tasks, and proven its convergence.

• Experiments demonstrate that this algorithm is significantly faster than existing batch methods.

• This algorithm can be extended to other matrix factorization problems such as non-negative matrix factorization and sparse principal component analysis.