English
Related papers

Related papers: Combinatorial Topic Models using Small-Variance As…

200 papers

Topic Modelling (TM) is from the research branches of natural language understanding (NLU) and natural language processing (NLP) that is to facilitate insightful analysis from large documents and datasets, such as a summarisation of main…

Computation and Language · Computer Science 2023-04-19 Bernadeta Griciūtė , Lifeng Han , Goran Nenadic

A common task in many political institutions (i.e. Parliament) is to find politicians who are experts in a particular field. In order to tackle this problem, the first step is to obtain politician profiles which include their interests, and…

Information Retrieval · Computer Science 2024-01-22 Luis M. de Campos , Juan M. Fernández-Luna , Juan F. Huete , Luis Redondo-Expósito

Topic modeling analyzes documents to learn meaningful patterns of words. For documents collected in sequence, dynamic topic models capture how these patterns vary over time. We develop the dynamic embedded topic model (D-ETM), a generative…

Computation and Language · Computer Science 2019-10-14 Adji B. Dieng , Francisco J. R. Ruiz , David M. Blei

Background: Unstructured and textual data is increasing rapidly and Latent Dirichlet Allocation (LDA) topic modeling is a popular data analysis methods for it. Past work suggests that instability of LDA topics may lead to systematic errors.…

Computation and Language · Computer Science 2018-09-04 Mika Mäntylä , Maëlick Claes , Umar Farooq

Topic models are a popular tool for understanding text collections, but their evaluation has been a point of contention. Automated evaluation metrics such as coherence are often used, however, their validity has been questioned for neural…

Computation and Language · Computer Science 2024-02-21 Zongxia Li , Andrew Mao , Daniel Stephens , Pranav Goel , Emily Walpole , Alden Dima , Juan Fung , Jordan Boyd-Graber

The Predict-Then-Optimize framework uses machine learning models to predict unknown parameters of an optimization problem from exogenous features before solving. This setting is common to many real-world decision processes, and recently it…

Machine Learning · Computer Science 2024-09-10 James Kotary , Vincenzo Di Vito , Jacob Cristopher , Pascal Van Hentenryck , Ferdinando Fioretto

The growing need to analyze large collections of documents has led to great developments in topic modeling. Since documents are frequently associated with other related variables, such as labels or ratings, much interest has been placed on…

Machine Learning · Statistics 2018-08-20 Filipe Rodrigues , Mariana Lourenço , Bernardete Ribeiro , Francisco Pereira

We consider the problem of parameter estimation using weakly supervised datasets, where a training sample consists of the input and a partially specified annotation, which we refer to as the output. The missing information in the annotation…

Machine Learning · Computer Science 2012-06-22 M. Pawan Kumar , Ben Packer , Daphne Koller

Social media users have finite attention which limits the number of incoming messages from friends they can process. Moreover, they pay more attention to opinions and recommendations of some friends more than others. In this paper, we…

Social and Information Networks · Computer Science 2013-01-29 Jeon-Hyung Kang , Kristina Lerman , Lise Getoor

Topic models are a useful analysis tool to uncover the underlying themes within document collections. The dominant approach is to use probabilistic topic models that posit a generative story, but in this paper we propose an alternative way…

Computation and Language · Computer Science 2020-10-08 Suzanna Sia , Ayush Dalmia , Sabrina J. Mielke

We propose a neural network based approach for learning topics from text and image datasets. The model makes no assumptions about the conditional distribution of the observed features given the latent topics. This allows us to perform topic…

Machine Learning · Computer Science 2017-03-01 Gaurav Pandey , Ambedkar Dukkipati

The exponential growth of online social network platforms and applications has led to a staggering volume of user-generated textual content, including comments and reviews. Consequently, users often face difficulties in extracting valuable…

Computation and Language · Computer Science 2023-08-23 Anusuya Krishnan

Traditional neural topic models are typically optimized by reconstructing the document's Bag-of-Words (BoW) representations, overlooking contextual information and struggling with data sparsity. In this work, we propose a novel approach to…

Computation and Language · Computer Science 2026-02-23 Raymond Li , Amirhossein Abaskohi , Chuyuan Li , Gabriel Murray , Giuseppe Carenini

Given the progress in image recognition with recent data driven paradigms, it's still expensive to manually label a large training data to fit a convolutional neural network (CNN) model. This paper proposes a hybrid supervised-unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2017-04-11 Kai Zhen , Mridul Birla , David Crandall , Bingjing Zhang , Judy Qiu

Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to deal with multimodal data, such as in image annotation tasks. Another popular approach to model the multimodal data is through deep neural networks,…

Computer Vision and Pattern Recognition · Computer Science 2016-01-01 Yin Zheng , Yu-Jin Zhang , Hugo Larochelle

Speech recognition systems are often highly domain dependent, a fact widely reported in the literature. However the concept of domain is complex and not bound to clear criteria. Hence it is often not evident if data should be considered to…

Computation and Language · Computer Science 2015-09-23 Mortaza Doulaty , Oscar Saz , Thomas Hain

In recent years, pre-trained large language models (LLMs) have demonstrated remarkable efficiency in achieving an inference-time few-shot learning capability known as in-context learning. However, existing literature has highlighted the…

Computation and Language · Computer Science 2024-02-14 Xinyi Wang , Wanrong Zhu , Michael Saxon , Mark Steyvers , William Yang Wang

Language models are typically applied at the sentence level, without access to the broader document context. We present a neural language model that incorporates document context in the form of a topic model-like architecture, thus…

Computation and Language · Computer Science 2017-10-16 Jey Han Lau , Timothy Baldwin , Trevor Cohn

This paper proposes a nonparametric Bayesian method for exploratory data analysis and feature construction in continuous time series. Our method focuses on understanding shared features in a set of time series that exhibit significant…

Machine Learning · Statistics 2010-08-13 Suchi Saria , Daphne Koller , Anna Penn

Recently, discrete latent variable models have received a surge of interest in both Natural Language Processing (NLP) and Computer Vision (CV), attributed to their comparable performance to the continuous counterparts in representation…

Computation and Language · Computer Science 2022-11-08 Erxin Yu , Lan Du , Yuan Jin , Zhepei Wei , Yi Chang