Related papers: Continuous-time Infinite Dynamic Topic Models

200 papers

Topic models have proven to be a useful tool for discovering latent structures in document collections. However, most document collections come as temporal streams and thus several aspects of the latent structure such as the number of…

Information Retrieval · Computer Science 2012-03-19 Amr Ahmed, Eric P. Xing

For extracting meaningful topics from texts, their structures should be considered properly. In this paper, we aim to analyze structured time-series documents such as a collection of news articles and a series of scientific papers, wherein…

Computation and Language · Computer Science 2018-05-08 Rem Hida, Naoya Takeishi, Takehisa Yairi, Koichi Hori

Topic modeling analyzes documents to learn meaningful patterns of words. For documents collected in sequence, dynamic topic models capture how these patterns vary over time. We develop the dynamic embedded topic model (D-ETM), a generative…

Computation and Language · Computer Science 2019-10-14 Adji B. Dieng, Francisco J. R. Ruiz, David M. Blei

In this paper we describe a novel framework for the discovery of the topical content of a data corpus, and the tracking of its complex structural changes across the temporal dimension. In contrast to previous work our model does not impose…

Information Retrieval · Computer Science 2015-12-29 Adham Beykikhoshk, Ognjen Arandjelovic, Dinh Phung, Svetha Venkatesh

Topic models and all their variants analyse text by learning meaningful representations through word co-occurrences. As pointed out by Williamson et al. (2010), such models implicitly assume that the probability of a topic to be active and…

Computation and Language · Computer Science 2023-01-27 Kostadin Cvejoski, Ramsés J. Sánchez, César Ojeda

This paper proposes a new methodology to study sequential corpora by implementing a two-stage algorithm that learns time-based topics with respect to a scale of document positions and introduces the concept of Topic Scaling which ranks…

Information Retrieval · Computer Science 2021-04-05 Sami Diaf, Ulrich Fritsche

In this paper we describe a novel framework for the discovery of the topical content of a data corpus, and the tracking of its complex structural changes across the temporal dimension. In contrast to previous work our model does not impose…

Information Retrieval · Computer Science 2015-02-10 Adham Beykikhoshk, Ognjen Arandjelovic, Dinh Phung, Svetha Venkatesh

Certain types of documents, such as tweets, are collected by specifying a set of keywords. As topics of interest change with time, it is beneficial to adjust keywords dynamically. The challenge is that these need to be specified ahead of…

Machine Learning · Statistics 2020-01-23 Xingyu Wang, Lida Zhang, Diego Klabjan

This paper proposes a modeling framework for dynamic topic evolution based on temporal large language models. The method first uses a large language model to obtain contextual embeddings of text and then introduces a temporal decay function…

Computation and Language · Computer Science 2025-11-04 Di Wu, Shuaidong Pan

We study the problem of topic modeling in corpora whose documents are organized in a multi-level hierarchy. We explore a parametric approach to this problem, assuming that the number of topics is known or can be estimated by…

Machine Learning · Statistics 2015-04-14 Do-kyum Kim, Geoffrey M. Voelker, Lawrence K. Saul

Recently, considerable research effort has been devoted to developing deep architectures for topic models to learn topic structures. Although several deep models have been proposed to learn better topic proportions of documents, how to…

Information Retrieval · Computer Science 2018-11-05 He Zhao, Lan Du, Wray Buntine, Mingyuan Zhou

Dynamic topic models (DTMs) model the evolution of prevalent themes in literature, online media, and other forms of text over time. DTMs assume that word co-occurrence statistics change continuously and therefore impose continuous…

Machine Learning · Statistics 2018-03-22 Patrick Jähnichen, Florian Wenzel, Marius Kloft, Stephan Mandt

Abundant sequential documents, such as online archives, social media, and news feeds, are updated as streams, where each chunk of documents incorporates smoothly evolving yet dependent topics. Such digital texts have attracted…

Information Retrieval · Computer Science 2021-06-28 Jinjin Guo, Longbing Cao, Zhiguo Gong

Dynamic topic modeling is widely used to analyze evolving trends in scientific literature, medical records, and social media. Traditional topic models represent each topic through a single probability vector on the multinomial simplex and…

Machine Learning · Computer Science 2026-05-28 Hanjia Gao, Hanwen Ye, Qing Nie, Annie Qu

Topic modeling seeks to uncover latent semantic structure in text corpora with minimal supervision. Neural approaches achieve strong performance but require extensive tuning and struggle with lifelong learning due to catastrophic forgetting…

Computation and Language · Computer Science 2026-04-20 Karthik Singaravadivelan, Anant Gupta, Zekun Wang, Christopher J. MacLellan

In this paper, we develop the continuous time dynamic topic model (cDTM). The cDTM is a dynamic topic model that uses Brownian motion to model the latent topics through a sequential collection of documents, where a "topic" is a pattern of…

Information Retrieval · Computer Science 2015-05-19 Chong Wang, David Blei, David Heckerman

Academic researchers often face a large collection of research papers in the literature. This problem may be even worse for postgraduate students who are new to a field and may not know where to start. To address this problem,…

Computation and Language · Computer Science 2016-09-30 Leonard K. M. Poon, Nevin L. Zhang

Latent Dirichlet analysis, or topic modeling, is a flexible latent variable framework for modeling high-dimensional sparse count data. Various learning algorithms have been developed in recent years, including collapsed Gibbs sampling,…

Machine Learning · Computer Science 2012-05-14 Arthur Asuncion, Max Welling, Padhraic Smyth, Yee Whye Teh

Topic modeling analyzes documents to learn meaningful patterns of words. However, existing topic models fail to learn interpretable topics when working with large and heavy-tailed vocabularies. To this end, we develop the Embedded Topic…

Information Retrieval · Computer Science 2019-07-12 Adji B. Dieng, Francisco J. R. Ruiz, David M. Blei

Lifelong learning has recently attracted attention in building machine learning systems that continually accumulate and transfer knowledge to help future learning. Unsupervised topic modeling has been popularly used to discover topics from…

Computation and Language · Computer Science 2023-06-28 Pankaj Gupta, Yatin Chaudhary, Thomas Runkler, Hinrich Schütze