Recursive Unsupervised Learning of Finite Mixture Models


Recursive Unsupervised Learning of Finite Mixture Models

Zoran Zivkovic and Ferdinand van der Heijden, Netherlands – PAMI 2004

Presented by: Janaka

Introduction

• Sample data -> mixture model parameters
• EM - maximum likelihood estimation of parameters
• Variations:
  – Fixed vs. variable number of components
  – Batch vs. online (recursive)

ML and MAP

• Estimate population parameter θ from samples (x)

• Maximum Likelihood (ML)

• When a prior distribution g over θ exists – Maximum a posteriori (MAP)
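
The estimator formulas on this slide did not survive extraction; as a sketch, assuming the slide showed the usual definitions, the two estimators are:

  θ̂_ML  = argmax_θ p(x; θ)
  θ̂_MAP = argmax_θ p(x; θ) g(θ)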

Introduction

• Using a prior with EM [3][6]
• Recursive parameter estimation [5][13][15] – approximates batch processing
• Connecting the two above – coming up with a heuristic:
  – Randomly initialize M components
  – Search for the MAP estimate using an iterative procedure (e.g. EM)
  – Let the prior drive irrelevant components to extinction

EM algorithm

Definition: iteratively reach the best set of parameters that model the observed data, in the presence of some unobserved (missing) parameters/data.

• Applied to mixture models:
  – Unobserved data – the component each data point belongs to
  – Parameters – the parameters of each component

Repeat until convergence!

EM Algorithm

How to classify points and estimate parameters of the models in a mixture at the same time?

(A chicken-and-egg problem)

• Expectation step: Use current parameters (and observations) to reconstruct hidden structure

• Maximization step: Use that hidden structure (and observations) to reestimate parameters
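
As a minimal illustration of these two steps for a Gaussian mixture (standard batch EM, not yet the paper's recursive variant; all names are illustrative), a sketch in Python/NumPy:

```python
import numpy as np

def em_gmm(X, M, n_iter=100, seed=0):
    """Minimal batch EM for a Gaussian mixture (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    t, d = X.shape
    pi = np.full(M, 1.0 / M)                             # mixing weights
    mu = X[rng.choice(t, M, replace=False)]              # means from random points
    C = np.stack([np.cov(X.T) + 1e-6 * np.eye(d)] * M)   # covariances

    for _ in range(n_iter):
        # E-step: ownerships o[i, m] = P(component m | x_i, current params)
        o = np.empty((t, M))
        for m in range(M):
            diff = X - mu[m]
            quad = np.einsum('ij,jk,ik->i', diff, np.linalg.inv(C[m]), diff)
            logdet = np.linalg.slogdet(C[m])[1]
            o[:, m] = pi[m] * np.exp(-0.5 * (quad + logdet + d * np.log(2 * np.pi)))
        o /= o.sum(axis=1, keepdims=True)

        # M-step: re-estimate parameters from the ownerships
        Nm = o.sum(axis=0)
        pi = Nm / t
        mu = (o.T @ X) / Nm[:, None]
        for m in range(M):
            diff = X - mu[m]
            C[m] = (o[:, m, None] * diff).T @ diff / Nm[m] + 1e-6 * np.eye(d)
    return pi, mu, C
```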

Mixture Models

• x is a d-dimensional random variable with mixture density

  p(x; θ) = Σ_{m=1..M} π_m p_m(x; θ_m),   π_m ≥ 0,  Σ_m π_m = 1

• Given data X = {x^(1), ..., x^(t)}, the ML estimate is given by

  θ̂ = argmax_θ log p(X; θ)

• EM searches for a local maximum of the log-likelihood function (i.e. the ML estimate)

EM for Mixture Models

• For each x^(i), the missing data y^(i) is the component label – multinomially distributed
• Set of unobserved data Y = {y^(1), ..., y^(t)}
• Estimate θ̂^(k) in the kth iteration:
  – E-step: compute the ownerships o_m(x^(i)) = P(y^(i) = m | x^(i), θ̂^(k-1))
  – M-step: re-estimate θ̂^(k) from the data weighted by the ownerships

Differences with EM

• EM must know M, the number of components, in advance, and needs all the data at the same time
• This paper: apply MAP (ML with a prior) within EM – a prior biased towards compact models
• Data is processed one sample at a time

Prior

• Criterion to increase: J = log-likelihood + log prior
• Batch approach: find the ML estimate for different M's (by EM) and pick the M with the highest J
• Here a simple prior is used instead
• The prior is over the distribution of the parameters θ

Prior in EM

• Select a prior p(θ) biased towards compact models
• Start from the standard EM quantities:
  – Ownerships o_m(x^(i))
  – ML estimate of the weights: π̂_m = (1/t) Σ_{i=1..t} o_m(x^(i))
• MAP estimate using the prior, combining the two:

  π̂_m = ( Σ_{i=1..t} o_m(x^(i)) − c ) / ( t − Mc ),

  where c = N/2 and N is the number of parameters per component

EM + Prior iterations

• Keep the bias from the prior fixed:
  – The exact bias c/t decreases with t
  – It gives negative weight updates for small t
• Approximate c/t by a fixed constant c_T
• Update equation for the weights (with α ≈ 1/t):

  π̂_m ← π̂_m + α ( o_m(x^(t)) − π̂_m ) − α c_T

• The prior only influences the weights – a component is removed when its weight becomes negative (see the sketch below)
• The other parameters are updated as in EM
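
A minimal sketch of this weight update and pruning rule, assuming the simplified fixed-bias form above (function and argument names are illustrative, not the authors' code):

```python
import numpy as np

def update_weights(pi, o, alpha, c_T):
    """One recursive weight update with the fixed prior bias.

    pi    : current mixing weights, shape (M,)
    o     : ownerships of the new sample, shape (M,), sums to 1
    alpha : influence of the new sample (~1/t, or fixed at 1/T)
    c_T   : fixed bias from the prior (pushes weak components to zero)
    """
    pi = pi + alpha * (o - pi) - alpha * c_T
    keep = pi > 0                     # discard components driven negative
    pi = pi[keep] / pi[keep].sum()    # renormalize the survivors
    return pi, keep
```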

EM for GMM

• The other parameters are updated as in standard EM
• Recursive updates for the mean and covariance matrix (with α ≈ 1/t):

  μ̂_m ← μ̂_m + α ( o_m(x^(t)) / π̂_m ) ( x^(t) − μ̂_m )
  Ĉ_m ← Ĉ_m + α ( o_m(x^(t)) / π̂_m ) ( ( x^(t) − μ̂_m )( x^(t) − μ̂_m )^T − Ĉ_m )

Practical Algorithm (RuEM)

• Fix the influence of new samples to α = 1/T:
  – The 1/t factor is unstable for small t
  – A fixed α rapidly forgets the past
• Apply to GMMs (see the parameter-count check below):
  – Start with a large M
  – For d-dimensional data, N = d + d(d+1)/2 parameters per component (mean plus full covariance matrix)
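
As a small worked check of that parameter count (illustrative helper, not from the paper):

```python
def params_per_component(d):
    """N for a d-dimensional Gaussian: d mean entries plus
    d*(d+1)/2 free entries of the symmetric covariance matrix."""
    return d + d * (d + 1) // 2

# e.g. d = 2 gives N = 5, so the prior bias is c = N/2 = 2.5
```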

RuEM

RuEM( x^(t), θ̂^(t−1) ) {
  compute ownerships o_m^(t)( x^(t) )
  update weights π̂_m^(t)
  if ( π̂_m^(t) < 0 ) then discard the mth component, M ← M − 1
  update μ̂_m^(t) and Ĉ_m^(t)
  return θ̂^(t)
}
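
A runnable sketch of one such update step for a GMM, assuming the simplified fixed-bias updates above (the name ruem_step and the exact update forms are reconstructions, not the authors' code):

```python
import numpy as np

def ruem_step(x, pi, mu, C, alpha, c_T):
    """One recursive update of a Gaussian mixture from a single sample x:
    compute ownerships, update weights with the fixed prior bias, discard
    components with negative weight, then update means and covariances."""
    M, d = mu.shape
    # ownerships o_m proportional to pi_m * N(x; mu_m, C_m)
    o = np.empty(M)
    for m in range(M):
        diff = x - mu[m]
        quad = diff @ np.linalg.solve(C[m], diff)
        logdet = np.linalg.slogdet(C[m])[1]
        o[m] = pi[m] * np.exp(-0.5 * (quad + logdet + d * np.log(2 * np.pi)))
    o /= o.sum()

    # weight update with the fixed bias from the prior
    pi = pi + alpha * (o - pi) - alpha * c_T

    # discard components driven to negative weight, renormalize the rest
    keep = pi > 0
    pi, mu, C, o = pi[keep], mu[keep].copy(), C[keep].copy(), o[keep]
    pi /= pi.sum()

    # mean and covariance updates for the surviving components
    for m in range(len(pi)):
        w = alpha * o[m] / pi[m]
        diff = x - mu[m]
        mu[m] = mu[m] + w * diff
        C[m] = C[m] + w * (np.outer(diff, diff) - C[m])
    return pi, mu, C
```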

Experiments

1. Apply to standard problems (Gaussian mixtures):
   – Three Gaussians – 2D, 900 samples
   – Iris – three classes, 4D, 150 samples
   – Shrinking spiral – 3D, 900 samples
   – Enzyme – 1D, 245 samples

2. Comparison with batch algorithms:
   – Carefully initialized EM
   – Split-and-Merge EM
   – Greedy EM – starts with one component
   – Polished RuEM – learning rate + EM

Three Gaussians

• Mixture of 3 Gaussians – 2D
• 900 samples
• EM needs 200 iterations over all 900 samples (180,000 sample updates)
• RuEM needs 9,000 single-sample updates (the 900 samples applied repeatedly)
• About 20 times faster (180,000 / 9,000 = 20)

• Iris, Shrinking Spiral, Enzyme

[Figure: ML estimates (mean and variance) for each dataset]

[Figure: effect of the learning rate on the final M – Three Gaussians and Shrinking Spiral]

Discussion
