On Estimation and Selection for Topic Models
Applications
2011-12-30 v3
Abstract
This article describes posterior maximization for topic models, identifying computational and conceptual gains from inference under a non-standard parametrization. We then show that fitted parameters can be used as the basis for a novel approach to marginal likelihood estimation, via block-diagonal approximation to the information matrix,that facilitates choosing the number of latent topics. This likelihood-based model selection is complemented with a goodness-of-fit analysis built around estimated residual dispersion. Examples are provided to illustrate model selection as well as to compare our estimation against standard alternative techniques.
Keywords
Cite
@article{arxiv.1109.4518,
title = {On Estimation and Selection for Topic Models},
author = {Matthew A. Taddy},
journal= {arXiv preprint arXiv:1109.4518},
year = {2011}
}
Comments
Scheduled to appear in the proceedings of AISTATS 2012