35
Informatics and Mathematical Modelling / Intelligent Signal Processing Non-negative matrix factorization with Gaussian process priors Mikkel N. Schmidt Mikkel N. Schmidt Mikkel N. Schmidt Mikkel N. Schmidt Technical University of Denmark Technical University of Denmark Technical University of Denmark Technical University of Denmark

Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

  • Upload
    others

  • View
    10

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Non-negative matrix factorization with Gaussian process priors

Mikkel N. SchmidtMikkel N. SchmidtMikkel N. SchmidtMikkel N. Schmidt

Technical University of DenmarkTechnical University of DenmarkTechnical University of DenmarkTechnical University of Denmark

Page 2: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Outline

� Non-negative matrix factorization (NMF)

Examples of applications– DNA microarray analysis

– Monaural audio separation

� Gaussian processes (GP)

� NMF with GP-priors

Example of application– Chemical shift brain imaging

Page 3: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Non-negative matrix factorization

� Non-negative bi-linear decomposition

� In matrix notation

D≈≈≈≈ ××××M

N

M

KN

KX H

Page 4: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Why non-negativity?

� Many signals are non-negative by nature– Pixel intensities

– Amplitude spectra

– Occurrence counts

– Discrete probabilities

– etc.

� Non-subtractive model– No terms cancel out

– Parts-based: The whole is modeled as a sum of parts

Page 5: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

NMF, PCA, and VQ

NMF

PCA

VQ

××××

××××

××××=

=

=

Page 6: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Over-complete decomposition

256 × 20

256 × 400

256 × 400400 × 500

400 × 500

20 × 500256 × 500

256 × 500

256 × 500

××××

××××

××××=

=

=

Under-complete

Over-complete

Sparse and over-complete

Page 7: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

DNA microarray analysis

� DNA microarray technology enables parallel analysis of thousands of genes

� Data can be represented in non-negative matrixe.g. gene × experiment

Page 8: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Two types of leukemia

38 samples

7129 genes

7129 genes

38 samples

X ≈ D × H

2 classes

2 classes

Class 1 Class 227 samples 11 samples

Page 9: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Monaural audio separation

Audio Separation

� Problem: Separate audio sources using only one-microphone recording of mixture

Page 10: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Amplitude spectrogram

� Audio represented as a non-negative matrix

50 100 150 200 250 300 350 400

50

100

150

200

250

20 40 60 80 100 120

50

100

150

200

250

5 0 1 0 0 1 5 0 2 0 0 2 5 0 3 0 0 3 5 0 4 0 0

2 0

4 0

6 0

8 0

1 0 0

1 2 0

Spectrogram Basis

Activation

Time

Frequency

Page 11: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Audio separation with NMF

1. Learn a basis for each source

2. Compute activation for mixture

3. Reconstruct each source separately

Page 12: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Subspace illustration

Training data

Basis vectors

Observed mixture

Mappings onto subspaces

.......

��

Page 13: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Audio demonstration

Page 14: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Computing NMF

� Minimize divergence between XXXX and DHDHDHDH, possibly plus some regularization

� Constrained minimization problem: A variety of algorithms for different divergence measures and regularizations

� Choice of divergence measure corresponds to assumptions about the data/noise/factor distribution

Page 15: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Maximum likelihood NMF

Example: Gaussian i.i.d. noise

� Likelihood function

� Negative log-likelihood (serves as divergence)

Page 16: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Maximum a posteriori NMF

� Posterior distribution

� Negative log posterior

Likelihood

(divergence)

Prior

(regularization)

Page 17: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Bayesian NMF

� Posterior distribution

� In general, integrals over the posterior are intractable � approximate, e.g., using MCMC.

Page 18: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Probabilistic model of factors

� Standard NMF

– Factors constrained to be non-negative

– No other assumptions about the factors

� Prior distribution over factors

– Prior distribution captures non-negativity as well as other properties, such as sparseness, smoothness, symmetries, etc.

Page 19: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Which prior distributions to use?

� Distribution over non-negative reals– Rectified Gaussian: L2 norm regularization

– One-sided exponential: L1 norm regularization

� Gaussian process mapped to the non-negative reals– Flexible, principled, and practical approach

– Sparseness, smoothness, symmetries, etc.

Page 20: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Gaussian Processes

� A stochastic process which generates samples, xi , such that any linear combination of xi is Gaussian

� Characterized by its mean and covariance function

� Defines a distribution over functions

Page 21: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Example of Gaussian process

� Mean function

� Covariance function

-5 -4 -3 -2 -1 0 1 2 3 4 5-4

-2

0

2

4

x

f(x)

-5 -4 -3 -2 -1 0 1 2 3 4 5-4

-2

0

2

4

x

f(x)

-5 -4 -3 -2 -1 0 1 2 3 4 5-4

-2

0

2

4

x

f(x)

σ = 1

k(x1, x2) = exp

(

−(x1 − x2)

2

2σ2

)

m(x) = 0

σ = 0.5

σ = 0.1

Page 22: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

NMF with GP priors

� GDand G

Hare Gaussian processes

� Link functions– Strictly increasing

– Maps the reals to the non-negative reals -2 -1 0 1 20

0.5

1

1.5

2

x

f(x)

Page 23: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Link Function

� Map marginal distribution of Gaussian process to desired marginal distribution

� Example: Gaussian-to-Exponential

Page 24: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Change of Variable

� New variables δδδδ and ηηηη

– CCCC is Cholesky decomposition of covariance matrix

– Variables δδδδ and ηηηη are i.i.d. Gaussian

�MAP estimate: p(δδδδ, ηηηη ||||XXXX)– More robust and less local minima

– Unconstrained optimization (use e.g. conj. grad.)

Page 25: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Illustration of NMF with GP priors

� Full matrix data

Page 26: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Illustration of NMF with GP priors

� Full matrix data

� Missing values

Page 27: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Illustration of NMF with GP priors

� Full matrix data

� Missing values

� General bi-linear form

Page 28: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Toy Example

Page 29: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Toy ExampleCNMF

CNMF

CNMF

CNMF

GPP

GPP

GPP

GPP-- -- NMF

NMF

NMF

NMF

Underlying data

Underlying data

Underlying data

Underlying data Columns of Columns of Columns of Columns of DDDD Rows of Rows of Rows of Rows of HHHH

Page 30: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Chemical Shift Brain Imaging

� Data: 369 point chemical shift spectra measured at 512 positions (8×8×8 grid) in human skull.

� Task: Distinguish between brain and muscle tissue

� Prior for spectra: Smooth, exponentially distributed.

� Prior for activations in skull: Smooth in 3D, exponentially distributed, left-to-right symmetric.

Page 31: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Chemical Shift Brain Imaging

GPP

GPP

GPP

GPP-- -- NMF

NMF

NMF

NMF

CNMF

CNMF

CNMF

CNMF

Muscle TissueMuscle TissueMuscle TissueMuscle Tissue

Muscle TissueMuscle TissueMuscle TissueMuscle Tissue

Brain TissueBrain TissueBrain TissueBrain Tissue

Brain TissueBrain TissueBrain TissueBrain Tissue

CNMF

CNMF

CNMF

CNMF

GPP

GPP

GPP

GPP-- -- NMF

NMF

NMF

NMF

Page 32: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Conclusions

� NMF– General and versatile method

– Can be used to analyze a variety of problems including

� DNA microarray analysis

� Audio signal separation

� NMF with GP-priors– Extends the NMF framework by adding prior information

– Can improve the quality of non-negative factorizations

Page 33: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

Future work

� Full Bayesian treatment of the model (MCMC)

� Learn parameters of kernel function

� Learn link functions from data

� Learn number of components (nonparametric Bayes)

Page 34: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

References

Non-negative Matrix Factorization� D. D. Lee and H. S. Seung. Learning the parts of objects by non-negative matrix factorization. Nature, 401(6755):788–791, 1999.

� P. Paatero and U. Tapper. Positive matrix factorization: A nonnegative factor model with optimal utilization of error-estimates of data values. Environmetrics, 5:111–126, 1994.

� Michael W. Berry, Murray Browne, Amy N. Langville, V. Paul Pauca, and Robert J. Plemmons. Algorithms and applications for approximate nonnegative matrix factorization. Computational Statistics and Data Analysis, 2006.

DNA Microarray Analysis� Jean-Philippe Brunet, Pablo Tamayo, Todd R. Golub, and Jill P. Mesirov. Metagenes and molecular pattern discovery using matrix factorization. Proceedings of the National Academy of Sciences

(PNAS), 101(12):4164–4169, Mar 2004.

Page 35: Non-negative matrix factorization with Gaussian process priorsmikkelschmidt.dk/presentations/nmfgppcam.pdf · 2015. 4. 15. · Non-negative Matrix Factorization D. D. Lee and H. S

Informatics and Mathematical Modelling / Intelligent Signal Processing

References

Speech Separation with NMF� Mikkel N. Schmidt and Rasmus K. Olsson, Linear Regression on Sparse Features for Single-Channel Speech Separation. Applications of Signal Processing to Audio and Acoustics, IEEE Workshop on, 2007.

� Mikkel N. Schmidt and Rasmus K. Olsson, Single-Channel Speech Separation using Sparse Non-Negative Matrix Factorization. Spoken Language Processing, ISCA International Conference on, 2006.

Gaussian Processes� Carl Edward Rasmussen and Christopher K. I. Williams, Gaussian Processes for Machine Learning. MIT Press, 2006.

NMF with GP-priors� Mikkel N. Schmidt and Hans Laurberg, Non-negative matrix factorization with Gaussian process priors. Computational Intelligence and Neuroscience, 2008.