English

Nonparametric Bayesian sparse factor models with application to gene expression modeling

Applications 2011-07-29 v2 Artificial Intelligence Machine Learning

Abstract

A nonparametric Bayesian extension of Factor Analysis (FA) is proposed where observed data Y\mathbf{Y} is modeled as a linear superposition, G\mathbf{G}, of a potentially infinite number of hidden factors, X\mathbf{X}. The Indian Buffet Process (IBP) is used as a prior on G\mathbf{G} to incorporate sparsity and to allow the number of latent features to be inferred. The model's utility for modeling gene expression data is investigated using randomly generated data sets based on a known sparse connectivity matrix for E. Coli, and on three biological data sets of increasing complexity.

Keywords

Cite

@article{arxiv.1011.6293,
  title  = {Nonparametric Bayesian sparse factor models with application to gene expression modeling},
  author = {David Knowles and Zoubin Ghahramani},
  journal= {arXiv preprint arXiv:1011.6293},
  year   = {2011}
}

Comments

Published in at http://dx.doi.org/10.1214/10-AOAS435 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

R2 v1 2026-06-21T16:50:27.839Z