English

On Estimation and Selection for Topic Models

Applications 2011-12-30 v3

Abstract

This article describes posterior maximization for topic models, identifying computational and conceptual gains from inference under a non-standard parametrization. We then show that fitted parameters can be used as the basis for a novel approach to marginal likelihood estimation, via block-diagonal approximation to the information matrix,that facilitates choosing the number of latent topics. This likelihood-based model selection is complemented with a goodness-of-fit analysis built around estimated residual dispersion. Examples are provided to illustrate model selection as well as to compare our estimation against standard alternative techniques.

Keywords

Cite

@article{arxiv.1109.4518,
  title  = {On Estimation and Selection for Topic Models},
  author = {Matthew A. Taddy},
  journal= {arXiv preprint arXiv:1109.4518},
  year   = {2011}
}

Comments

Scheduled to appear in the proceedings of AISTATS 2012

R2 v1 2026-06-21T19:08:11.629Z