Related papers: Combinatorial Topic Models using Small-Variance As…

200 papers

Recent advances have made it feasible to apply the stochastic variational paradigm to a collapsed representation of latent Dirichlet allocation (LDA). While the stochastic variational paradigm has successfully been applied to an uncollapsed…

Machine Learning · Computer Science 2013-12-03 Arnim Bleier

Recently, topic-grounded dialogue systems have attracted significant attention due to their effectiveness in predicting the next topic to yield better responses via the historical context and given topic sequence. However, almost all existing…

Computation and Language · Computer Science 2022-10-18 Xiaofei Wen, Wei Wei, Xian-Ling Mao

The expectation-maximization (EM) algorithm can compute the maximum-likelihood (ML) or maximum a posteriori (MAP) point estimate of mixture models or latent variable models such as latent Dirichlet allocation (LDA), which has been one of…

Machine Learning · Computer Science 2015-12-08 Jia Zeng, Zhi-Qiang Liu, Xiao-Qin Cao

Topic models have been widely explored as probabilistic generative models of documents. Traditional inference methods have sought closed-form derivations for updating the models; however, as the expressiveness of these models grows, so does…

Computation and Language · Computer Science 2018-05-23 Yishu Miao, Edward Grefenstette, Phil Blunsom

Topic models, such as LDA, are widely used in Natural Language Processing. Making their output interpretable is an important area of research with applications to areas such as the enhancement of exploratory search interfaces and the…

Computation and Language · Computer Science 2019-04-01 Areej Alokaili, Nikolaos Aletras, Mark Stevenson

As one of the most powerful topic models, Latent Dirichlet Allocation (LDA) has been used in a vast range of tasks, including document understanding, information retrieval and peer-reviewer assignment. Despite its tremendous popularity, the…

Machine Learning · Computer Science 2021-04-13 Qi Zhou, Haipeng Chen, Yitao Zheng, Zhen Wang

Supervised topic models are often sought to balance prediction quality and interpretability. However, when models are (inevitably) misspecified, standard approaches rarely deliver on both. We introduce a novel approach, the…

Machine Learning · Computer Science 2020-03-04 Jason Ren, Russell Kunes, Finale Doshi-Velez

Modeling topics effectively in short texts, such as tweets and news snippets, is crucial to capturing rapidly evolving social trends. Existing topic models often struggle to accurately capture the underlying semantic patterns of short…

Computation and Language · Computer Science 2025-02-18 Shuyu Chang, Rui Wang, Peng Ren, Qi Wang, Haiping Huang

Ontologies provide features like a common vocabulary, reusability, and machine-readable content, and also allow for semantic search, facilitate agent interaction, and support ordering and structuring of knowledge for the Semantic Web (Web 3.0)…

Information Retrieval · Computer Science 2017-09-08 Monika Rani, Amit Kumar Dhar, O. P. Vyas

Topic models are some of the most popular ways to represent textual data in an interpretable manner. Recently, advances in deep generative models, specifically auto-encoding variational Bayes (AEVB), have led to the introduction of…

Information Retrieval · Computer Science 2022-04-08 Jeffrey Chiu, Rajat Mittal, Neehal Tumma, Abhishek Sharma, Finale Doshi-Velez

We present a novel method for hierarchical topic detection where topics are obtained by clustering documents in multiple ways. Specifically, we model document collections using a class of graphical models called hierarchical latent tree…

Computation and Language · Computer Science 2016-12-22 Peixian Chen, Nevin L. Zhang, Tengfei Liu, Leonard K. M. Poon, Zhourong Chen, Farhan Khawar

Unsupervised learning aims at the discovery of hidden structure that drives the observations in the real world. It is essential for success in modern machine learning. Latent variable models are versatile in unsupervised learning and have…

Machine Learning · Computer Science 2016-06-13 Furong Huang

We consider the problem of sampling from constrained distributions, which has posed significant challenges to both non-asymptotic analysis and algorithmic design. We propose a unified framework, which is inspired by the classical mirror…

Machine Learning · Computer Science 2021-01-01 Ya-Ping Hsieh, Ali Kavis, Paul Rolland, Volkan Cevher

Topic modeling is a well-established technique for exploring text corpora. Conventional topic models (e.g., LDA) represent topics as bags of words that often require "reading the tea leaves" to interpret; additionally, they offer users…

Computation and Language · Computer Science 2024-04-03 Chau Minh Pham, Alexander Hoyle, Simeng Sun, Philip Resnik, Mohit Iyyer

Rating prediction is a basic problem in recommender systems, and one of the most widely used methods is Factorization Machines (FM). However, traditional matrix factorization methods fail to utilize the benefit of implicit feedback, which has…

Machine Learning · Computer Science 2014-10-30 Xudong Liu, Bin Zhang, Ting Zhang, Chang Liu

For organizing large text corpora, topic modeling provides useful tools. A widely used method is Latent Dirichlet Allocation (LDA), a generative probabilistic model which models single texts in a collection of texts as mixtures of latent…

Computation and Language · Computer Science 2020-04-02 Jonas Rieger, Lars Koppers, Carsten Jentsch, Jörg Rahnenführer

Topic models, and more specifically the class of Latent Dirichlet Allocation (LDA), are widely used for probabilistic modeling of text. MCMC sampling from the posterior distribution is typically performed using a collapsed Gibbs sampler. We…

Machine Learning · Statistics 2017-08-16 Måns Magnusson, Leif Jonsson, Mattias Villani, David Broman

The recent advancement of large language models has spurred a growing trend of integrating pre-trained language model (PLM) embeddings into topic models, fundamentally reshaping how topics capture semantic structure. Classical models such…

Computation and Language · Computer Science 2026-03-12 Hanlin Xiao, Mauricio A. Álvarez, Rainer Breitling

Topic modeling is a widely used technique for uncovering thematic structures from large text corpora. However, most topic modeling approaches, e.g., Latent Dirichlet Allocation (LDA), struggle to capture nuanced semantics and contextual…

Information Retrieval · Computer Science 2024-09-25 Satya Kapoor, Alex Gil, Sreyoshi Bhaduri, Anshul Mittal, Rutu Mulkar

It is increasingly common to solve combinatorial optimisation problems that are partially-specified. We survey the case where the objective function or the relations between variables are not known or are only partially specified. The…

Machine Learning · Computer Science 2022-05-23 Stefano Teso, Laurens Bliek, Andrea Borghesi, Michele Lombardi, Neil Yorke-Smith, Tias Guns, Andrea Passerini