English

Efficient Learning for Undirected Topic Models

Machine Learning 2015-06-25 v1 Computation and Language Information Retrieval Machine Learning

Abstract

Replicated Softmax model, a well-known undirected topic model, is powerful in extracting semantic representations of documents. Traditional learning strategies such as Contrastive Divergence are very inefficient. This paper provides a novel estimator to speed up the learning based on Noise Contrastive Estimate, extended for documents of variant lengths and weighted inputs. Experiments on two benchmarks show that the new estimator achieves great learning efficiency and high accuracy on document retrieval and classification.

Keywords

Cite

@article{arxiv.1506.07477,
  title  = {Efficient Learning for Undirected Topic Models},
  author = {Jiatao Gu and Victor O. K. Li},
  journal= {arXiv preprint arXiv:1506.07477},
  year   = {2015}
}

Comments

Accepted by ACL-IJCNLP 2015 short paper. 6 pages

R2 v1 2026-06-22T09:59:37.259Z