English
Related papers

Related papers: Combinatorial Topic Models using Small-Variance As…

200 papers

Hierarchical latent tree analysis (HLTA) is recently proposed as a new method for topic detection. It differs fundamentally from the LDA-based methods in terms of topic definition, topic-document relationship, and learning method. It has…

Machine Learning · Computer Science 2015-08-06 Peixian Chen , Nevin L. Zhang , Leonard K. M. Poon , Zhourong Chen

Topic models provide a useful method for dimensionality reduction and exploratory data analysis in large text corpora. Most approaches to topic model inference have been based on a maximum likelihood objective. Efficient algorithms exist…

Machine Learning · Computer Science 2012-12-20 Sanjeev Arora , Rong Ge , Yoni Halpern , David Mimno , Ankur Moitra , David Sontag , Yichen Wu , Michael Zhu

Variational Bayes (VB) applied to latent Dirichlet allocation (LDA) has become the most popular algorithm for aspect modeling. While sufficiently successful in text topic extraction from large corpora, VB is less successful in identifying…

Machine Learning · Computer Science 2022-08-22 Rebecca M. C. Taylor , Johan A. du Preez

As one of the most popular linear subspace learning methods, the Linear Discriminant Analysis (LDA) method has been widely studied in machine learning community and applied to many scientific applications. Traditional LDA minimizes the…

Machine Learning · Computer Science 2019-07-02 Feiping Nie , Hua Wang , Zheng Wang , Heng Huang

Linear Discriminant Analysis (LDA) is a well-known method for dimensionality reduction and classification. Previous studies have also extended the binary-class case into multi-classes. However, many applications, such as object detection…

Machine Learning · Computer Science 2013-09-24 Gang Chen

Latent Dirichlet Allocation models discrete data as a mixture of discrete distributions, using Dirichlet beliefs over the mixture weights. We study a variation of this concept, in which the documents' mixture weight beliefs are replaced…

Machine Learning · Computer Science 2011-10-24 Philipp Hennig , David Stern , Ralf Herbrich , Thore Graepel

Topic modeling seeks to uncover latent semantic structure in text, with LDA providing a foundational probabilistic framework. While recent methods often incorporate external knowledge (e.g., pre-trained embeddings), such reliance limits…

Machine Learning · Computer Science 2026-04-01 Tal Ishon , Yoav Goldberg , Uri Shaham

In this paper, we present the Polylingual Labeled Topic Model, a model which combines the characteristics of the existing Polylingual Topic Model and Labeled LDA. The model accounts for multiple languages with separate topic distributions…

Computation and Language · Computer Science 2017-05-03 Lisa Posch , Arnim Bleier , Philipp Schaer , Markus Strohmaier

Correlated topic modeling has been limited to small model and problem sizes due to their high computational cost and poor scaling. In this paper, we propose a new model which learns compact topic embeddings and captures topic correlations…

Machine Learning · Computer Science 2017-07-04 Junxian He , Zhiting Hu , Taylor Berg-Kirkpatrick , Ying Huang , Eric P. Xing

We introduce a non-parametric hierarchical Bayesian approach for open-ended 3D object categorization, named the Local Hierarchical Dirichlet Process (Local-HDP). This method allows an agent to learn independent topics for each category…

Computer Vision and Pattern Recognition · Computer Science 2021-04-13 H. Ayoobi , H. Kasaei , M. Cao , R. Verbrugge , B. Verheij

Existing topic modeling methods, from LDA to recent neural and LLM-based approaches, which focus mainly on statistical coherence, often produce redundant or off-target topics that miss the user's underlying intent. We introduce…

Artificial Intelligence · Computer Science 2026-04-15 Rui Wang , Yi Zheng , Dongxin Wang , Haiping Huang , Yuanzhi Yao , Yuxiang Zhou , Jialin Yu , Philip Torr

This paper studies the distributed optimization problem with possibly nonidentical local constraints, where its global objective function is composed of $N$ convex functions. The aim is to solve the considered optimization problem in a…

Optimization and Control · Mathematics 2022-08-26 Hongzhe Liu , Wenwu Yu , Guanghui Wen , Wei Xing Zheng

Advances in topic modeling have yielded effective methods for characterizing the latent semantics of textual data. However, applying standard topic modeling approaches to sentence-level tasks introduces a number of challenges. In this…

Computation and Language · Computer Science 2016-07-21 Ruey-Cheng Chen , Reid Swanson , Andrew S. Gordon

We propose a new method of estimation in topic models, that is not a variation on the existing simplex finding algorithms, and that estimates the number of topics K from the observed data. We derive new finite sample minimax lower bounds…

Machine Learning · Statistics 2019-09-06 Xin Bing , Florentina Bunea , Marten Wegkamp

A conversational system needs to know how to switch between topics to continue the conversation for a more extended period. For this topic detection from dialogue corpus has become an important task for a conversation and accurate…

Information Retrieval · Computer Science 2020-06-08 Haider Khalid , Vincent Wade

Latent Dirichlet allocation (LDA) is widely used for unsupervised topic modelling on sets of documents. No temporal information is used in the model. However, there is often a relationship between the corresponding topics of consecutive…

Audio and Speech Processing · Electrical Eng. & Systems 2022-06-30 Werner van der Merwe , Herman Kamper , Johan du Preez

Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to perform scene recognition and annotation. Recently, a new type of topic model called the Document Neural Autoregressive Distribution Estimator…

Computer Vision and Pattern Recognition · Computer Science 2013-05-24 Yin Zheng , Yu-Jin Zhang , Hugo Larochelle

This work studies the theoretical rules of feature selection in linear discriminant analysis (LDA), and a new feature selection method is proposed for sparse linear discriminant analysis. An $l_1$ minimization method is used to select the…

Methodology · Statistics 2013-04-23 Cheng Wang , Longbing Cao , Baiqi Miao

Solving optimization problems with unknown parameters often requires learning a predictive model to predict the values of the unknown parameters and then solving the problem using these values. Recent work has shown that including the…

Machine Learning · Computer Science 2020-10-23 Kai Wang , Bryan Wilder , Andrew Perrault , Milind Tambe

Statistical topic models are increasingly and popularly used by Digital Humanities scholars to perform distant reading tasks on literary data. It allows us to estimate what people talk about. Especially Latent Dirichlet Allocation (LDA) has…

Computation and Language · Computer Science 2019-09-26 Thomas N. Haider