Related papers: Term-community-based topic detection with variable…

Efficient Topic Extraction via Graph-Based Labeling: A Lightweight Alternative to Deep Models

Extracting topics from text has become an essential task, especially with the rapid growth of unstructured textual data. Most existing works rely on highly computational methods to address this challenge. In this paper, we argue that…

Computation and Language · Computer Science 2025-11-07 Salma Mekaoui , Hiba Sofyan , Imane Amaaz , Imane Benchrif , Arsalane Zarghili , Ilham Chaker , Nikola S. Nikolov

A network approach to topic models

One of the main computational and scientific challenges in the modern age is to extract useful information from unstructured texts. Topic models are one popular machine-learning approach which infers the latent topical structure of a…

Machine Learning · Statistics 2018-07-20 Martin Gerlach , Tiago P. Peixoto , Eduardo G. Altmann

A Survey of Community Detection Approaches: From Statistical Modeling to Deep Learning

Community detection, a fundamental task for network analysis, aims to partition a network into multiple sub-structures to help reveal their latent functions. Community detection has been extensively studied in and broadly applied to many…

Social and Information Networks · Computer Science 2021-08-17 Di Jin , Zhizhi Yu , Pengfei Jiao , Shirui Pan , Dongxiao He , Jia Wu , Philip S. Yu , Weixiong Zhang

Topic segmentation via community detection in complex networks

Many real systems have been modelled in terms of network concepts, and written texts are a particular example of information networks. In recent years, the use of network methods to analyze language has allowed the discovery of several…

Computation and Language · Computer Science 2016-06-28 Henrique F. de Arruda , Luciano da F. Costa , Diego R. Amancio

A Human Word Association based model for topic detection in social networks

With the widespread use of social networks, detecting the topics discussed on these platforms has become a significant challenge. Current approaches primarily rely on frequent pattern mining or semantic relations, often neglecting the…

Computation and Language · Computer Science 2024-08-22 Mehrdad Ranjbar Khadivi , Shahin Akbarpour , Mohammad-Reza Feizi-Derakhshi , Babak Anari

Exploratory topic modeling with distributional semantics

As we continue to collect and store textual data in a multitude of domains, we are regularly confronted with material whose largely unknown thematic structure we want to uncover. With unsupervised, exploratory analysis, no prior knowledge…

Information Retrieval · Computer Science 2015-07-20 Samuel Rönnqvist

Temporal Identification of Latent Communities on Twitter

User communities in social networks are usually identified by considering explicit structural social connections between users. While such communities can reveal important information about their members such as family or friendship ties…

Social and Information Networks · Computer Science 2015-09-15 Hossein Fani , Fattane Zarrinkalam , Xin Zhao , Yue Feng , Ebrahim Bagheri , Weichang Du

Discovering Discrete Latent Topics with Neural Variational Inference

Topic models have been widely explored as probabilistic generative models of documents. Traditional inference methods have sought closed-form derivations for updating the models, however as the expressiveness of these models grows, so does…

Computation and Language · Computer Science 2018-05-23 Yishu Miao , Edward Grefenstette , Phil Blunsom

Topics in Contextualised Attention Embeddings

Contextualised word vectors obtained via pre-trained language models encode a variety of knowledge that has already been exploited in applications. Complementary to these language models are probabilistic topic models that learn thematic…

Computation and Language · Computer Science 2023-01-12 Mozhgan Talebpour , Alba Garcia Seco de Herrera , Shoaib Jameel

Graph-Community Detection for Cross-Document Topic Segment Relationship Identification

In this paper we propose a graph-community detection approach to identify cross-document relationships at the topic segment level. Given a set of related documents, we automatically find these relationships by clustering segments with…

Computation and Language · Computer Science 2016-06-14 Pedro Mota , Maxine Eskenazi , Luisa Coheur

Learning Topic Models by Neighborhood Aggregation

Topic models are frequently used in machine learning owing to their high interpretability and modular structure. However, extending a topic model to include a supervisory signal, to incorporate pre-trained word embedding vectors and to…

Machine Learning · Statistics 2019-09-17 Ryohei Hisano

Topics in the Haystack: Extracting and Evaluating Topics beyond Coherence

Extracting and identifying latent topics in large text corpora has gained increasing importance in Natural Language Processing (NLP). Most models, whether probabilistic models similar to Latent Dirichlet Allocation (LDA) or neural topic…

Computation and Language · Computer Science 2023-03-31 Anton Thielmann , Quentin Seifert , Arik Reuter , Elisabeth Bergherr , Benjamin Säfken

Complex networks based word embeddings

Most of the time, the first step to learn word embeddings is to build a word co-occurrence matrix. As such matrices are equivalent to graphs, complex networks theory can naturally be used to deal with such data. In this paper, we consider…

Computation and Language · Computer Science 2019-10-04 Nicolas Dugué , Victor Connes

Topic-Selective Graph Network for Topic-Focused Summarization

Due to the success of the pre-trained language model (PLM), existing PLM-based summarization models show their powerful generative capability. However, these models are trained on general-purpose summarization datasets, leading to generated…

Computation and Language · Computer Science 2023-02-28 Shi Zesheng , Zhou Yucheng

Improving Topic Models with Latent Feature Word Representations

Probabilistic topic models are widely used to discover latent topics in document collections, while latent feature vector representations of words have been used to obtain high performance in many NLP tasks. In this paper, we extend two…

Computation and Language · Computer Science 2018-10-16 Dat Quoc Nguyen , Richard Billingsley , Lan Du , Mark Johnson

Latent Tree Models for Hierarchical Topic Detection

We present a novel method for hierarchical topic detection where topics are obtained by clustering documents in multiple ways. Specifically, we model document collections using a class of graphical models called hierarchical latent tree…

Computation and Language · Computer Science 2016-12-22 Peixian Chen , Nevin L. Zhang , Tengfei Liu , Leonard K. M. Poon , Zhourong Chen , Farhan Khawar

Changepoint Analysis of Topic Proportions in Temporal Text Data

Changepoint analysis deals with unsupervised detection and/or estimation of time-points in time-series data, when the distribution generating the data changes. In this article, we consider \emph{offline} changepoint detection in the context…

Computation and Language · Computer Science 2021-12-03 Avinandan Bose , Soumendu Sundar Mukherjee

Understanding User Topic Preferences across Multiple Social Networks

In recent years, social networks have shown diversity in function and applications. People begin to use multiple online social networks simultaneously for different demands. The ability to uncover a user's latent topic and social network…

Social and Information Networks · Computer Science 2021-09-13 Ziqing Zhu , Jiuxin Cao , Tao Zhou , Huiyu Min , Bo Liu

Topics as Entity Clusters: Entity-based Topics from Large Language Models and Graph Neural Networks

Topic models aim to reveal latent structures within a corpus of text, typically through the use of term-frequency statistics over bag-of-words representations from documents. In recent years, conceptual entities -- interpretable,…

Computation and Language · Computer Science 2024-08-27 Manuel V. Loureiro , Steven Derby , Tri Kurniawan Wijaya

Expertise localization discovered through correlation of key term distribution and community detection in co-author networks

We present an efficient and effective automatic method for determining the research focus of scientific communities found in co-authorship networks. It utilizes bibliographic data from a database to form the network, followed by fastgreedy…

Social and Information Networks · Computer Science 2014-07-17 Joe Durante , Tyler Whitehouse , F. G. Serpa , Artjay Javier