The Infinite Hierarchical Factor Regression Model
Piyush Rai and Hal Daume III, NIPS 2008
Presented by Bo Chen, March 26, 2009
Outline
• Introduction
• The Infinite Hierarchical Factor Regression Model
• Indian Buffet Process and Beta Process
• Experiment
• Summary
Introduction
• Benefits of the latent factor representation:
  1. discovering the latent process underlying the data;
  2. simpler predictive modeling through a compact data representation.
• The "large P, small N" setting (vs. the usual rule of thumb N >= 10 · d · C).
• Fundamental advantages over the standard FA model:
  1. does not assume a known number of factors;
  2. does not assume the factors are independent;
  3. does not assume all features are relevant to the factor analysis.
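The compactness argument above can be sketched in a few lines. This is a minimal synthetic example, not the paper's model: the sizes (50 genes, 20 samples, 4 factors) and the linear-Gaussian form are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

P, N, K = 50, 20, 4              # hypothetical: 50 genes, 20 samples, 4 factors

Lam = rng.normal(size=(P, K))    # factor loading matrix (genes x factors)
F = rng.normal(size=(K, N))      # factor score matrix (factors x samples)
X = Lam @ F + 0.1 * rng.normal(size=(P, N))   # observed "large P, small N" data

# predictive modeling then works on the compact K-dimensional representation
w = rng.normal(size=K)
y = w @ F + 0.05 * rng.normal(size=N)         # responses regressed on factors, not genes
print(X.shape, F.shape, y.shape)
```

The regression weight vector has K = 4 entries instead of P = 50, which is the "simpler predictive modeling" benefit the slide refers to.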
Algorithm Model
Graphical Model
T is used to eliminate spurious genes or noise features, so T_p determines whether the p-th customer enters the restaurant to eat any dish.
Indian Buffet Process: from latent classes to latent features
• For a finite feature model: π_k | α ~ Beta(α/K, 1), z_ik | π_k ~ Bernoulli(π_k)
(Tom Griffiths, 2006)
• Taking K → ∞ gives an Indian restaurant with countably infinite dishes
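The finite beta-Bernoulli model can be simulated directly to see why the infinite limit is well behaved: as K grows, the expected number of active features per object stays near α rather than blowing up. The values of α and N below are arbitrary choices for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
alpha, N = 2.0, 100            # assumed values, for illustration only

def finite_feature_model(K):
    # pi_k | alpha ~ Beta(alpha/K, 1);  z_ik | pi_k ~ Bernoulli(pi_k)
    pi = rng.beta(alpha / K, 1.0, size=K)
    return rng.random((N, K)) < pi

# as K grows, the mean number of active features per object stays near alpha
for K in (10, 100, 1000):
    Z = finite_feature_model(K)
    print(K, Z.sum(axis=1).mean())
```

In expectation each object activates K · (α/K)/(α/K + 1) = α/(1 + α/K) features, which tends to α as K → ∞; this is the limit the IBP makes exact.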
Differences between DP and IBP
• DP class matrix: each object belongs to exactly one class (one active entry per row)
• IBP 'class' matrix: each object can possess several latent features (multiple active entries per row)
Different styles match different problems: 1. latent features; 2. clustering; 3. others.
Two-Parameter Finite Model
• the first customer samples Poisson(α) dishes
• the i-th customer samples a previously sampled dish k with probability m_k / (β + i - 1),
  then samples Poisson(αβ / (β + i - 1)) new dishes
(Z. Ghahramani et al., 2006)
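The culinary process above translates almost line for line into a sampler. This is a sketch of the two-parameter IBP generative process, not the paper's inference code; setting β = 1 recovers the one-parameter IBP.

```python
import numpy as np

rng = np.random.default_rng(2)

def two_param_ibp(N, alpha, beta):
    """Sample a binary feature matrix from the two-parameter IBP culinary process."""
    counts = []                       # counts[k] = customers who have taken dish k
    rows = []
    for i in range(1, N + 1):
        row = []
        for k in range(len(counts)):
            # previously sampled dish k taken with probability m_k / (beta + i - 1)
            take = rng.random() < counts[k] / (beta + i - 1)
            row.append(take)
            counts[k] += int(take)
        # then Poisson(alpha * beta / (beta + i - 1)) brand-new dishes
        k_new = rng.poisson(alpha * beta / (beta + i - 1))
        counts.extend([1] * k_new)
        row.extend([True] * k_new)
        rows.append(row)
    Z = np.zeros((N, len(counts)), dtype=int)
    for i, row in enumerate(rows):
        Z[i, :len(row)] = row
    return Z

Z = two_param_ibp(20, alpha=3.0, beta=1.0)   # beta = 1 gives the one-parameter IBP
print(Z.shape)
```

Every column of the resulting matrix is a dish that at least one customer took, so the number of columns (active features) is random, which is exactly the "unknown number of factors" property used in the model.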
Beta Process vs. IBP
• Beta Process (with mass parameter γ and concentration c):
  the first customer samples Poisson(γ) dishes
  the i-th customer samples a previously sampled dish k with probability m_k / (c + i - 1),
  then samples Poisson(γc / (c + i - 1)) new dishes
Hierarchical Factor Prior
• Kingman's Coalescent: a distribution over the genealogy of a countably infinite set of individuals; used to construct the tree structure.
• Brownian diffusion: a Markov process that encodes a message (mean and covariance) at each node of the above tree.
Y. W. Teh, H. Daume III, and D. M. Roy. Bayesian Agglomerative Clustering with Coalescents. In NIPS, 2008.
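A finite restriction of Kingman's coalescent is easy to simulate: with k lineages alive, every one of the k(k-1)/2 pairs merges at rate 1, so the next merge arrives after an Exp(k(k-1)/2) waiting time and the merging pair is chosen uniformly. This sketch only builds the genealogy; the Brownian message passing on top of it is not shown.

```python
import numpy as np

rng = np.random.default_rng(3)

def kingman_coalescent(n):
    """Simulate one genealogy over n leaves under Kingman's coalescent."""
    lineages = [(i,) for i in range(n)]
    t, merges = 0.0, []
    while len(lineages) > 1:
        k = len(lineages)
        # each of the k*(k-1)/2 pairs merges at rate 1
        t += rng.exponential(2.0 / (k * (k - 1)))
        i, j = sorted(rng.choice(k, size=2, replace=False))
        a, b = lineages[j], lineages[i]
        del lineages[j], lineages[i]          # remove the larger index first
        lineages.append(a + b)
        merges.append((t, a + b))
    return merges

tree = kingman_coalescent(5)
print(len(tree))   # n - 1 = 4 merges build a binary genealogy
```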
Feature Selection Prior
• Some genes are spurious: before selecting dishes, these 'spurious' customers should leave the restaurant.
Provided by Piyush Rai
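The effect of the gate variable T can be shown with a toy mask. The Bernoulli prior probability ρ and the matrix sizes below are illustrative assumptions, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(4)
P, K = 50, 6                   # hypothetical: 50 genes, 6 factors
rho = 0.8                      # assumed prior probability that a gene is relevant
T = rng.random(P) < rho        # T_p = 0 means gene p is spurious
Z = rng.random((P, K)) < 0.3   # stand-in IBP-style binary loading pattern
Z = Z & T[:, None]             # spurious customers leave before selecting any dish
print(int(Z[~T].sum()))        # rows with T_p = 0 are forced to all zeros -> 0
```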
Experimental results
E-coli data: 100 samples, 50 genes, 8 underlying factors
Breast cancer data: 251 samples, 226 genes, 5 underlying factors
1. The hierarchy can be used to find factors in order of their prominence.
2. Hierarchical modeling results in better predictive performance for the factor regression task.
3. The factor hierarchy leads to faster convergence, since most unlikely configurations are never visited: they are constrained away by the hierarchy.
The Comparison of Factor Loading Matrices Learned from Different Methods
Panels: Ground Truth; NIPS Method; Sparse BPFA on factor loadings (VB); Sparse BPFA on factor scores (VB)
Factor Regression
Training and test data are combined, and test responses are treated as missing values to be imputed.
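A point-estimate caricature of this transductive setup: recover factor scores from the combined data, fit the regression on the training responses only, then impute the held-out responses. This is a least-squares sketch under the simplifying assumption that the loadings are known; the paper instead does full Bayesian inference.

```python
import numpy as np

rng = np.random.default_rng(5)
P, K, Ntr, Nte = 40, 3, 30, 10          # hypothetical sizes

Lam = rng.normal(size=(P, K))
F = rng.normal(size=(K, Ntr + Nte))
X = Lam @ F + 0.1 * rng.normal(size=(P, Ntr + Nte))   # train + test combined
w = rng.normal(size=K)
y = w @ F                                             # test responses treated as missing

F_hat, *_ = np.linalg.lstsq(Lam, X, rcond=None)       # scores from the combined data
w_hat, *_ = np.linalg.lstsq(F_hat[:, :Ntr].T, y[:Ntr], rcond=None)
y_imputed = w_hat @ F_hat[:, Ntr:]                    # impute the held-out responses
print(np.abs(y_imputed - y[Ntr:]).mean())
```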
The Existing Similar FA Models
• Putting the binary matrix on the factor score matrix:
David Knowles and Zoubin Ghahramani. Infinite Sparse Factor Analysis and Infinite Independent Components Analysis. ICA 2007.
John Paisley et al. Nonparametric Factor Analysis with Beta Process Priors. In submission, 2009.
Summary: 1. For 'large P, small N' problems, the first approach is faster, since it only needs to learn the small K x N factor score matrix; with an MCMC solution, it is difficult for the second approach to handle problems with tens of thousands of genes. 2. The second approach can explain the relationship between a gene and a factor (pathway).
• Putting the binary matrix on the factor loading matrix:
Piyush Rai and Hal Daume III. The Infinite Hierarchical Factor Regression Model. NIPS 2008.
The New Developments of IBP
F. Doshi, K. T. Miller, J. Van Gael, and Y. W. Teh. Variational Inference for the Indian Buffet Process. AISTATS 2009.
J. Van Gael, Y. W. Teh, and Z. Ghahramani. The Infinite Factorial Hidden Markov Model. NIPS 2008.
K. A. Heller and Z. Ghahramani. A Nonparametric Bayesian Approach to Modeling Overlapping Clusters. AISTATS 2007.