English
Related papers

Related papers: Deep NMF Topic Modeling

200 papers

Topic models have been extensively used to organize and interpret the contents of large, unstructured corpora of text documents. Although topic models often perform well on traditional training vs. test set evaluations, it is often the case…

Computation and Language · Computer Science 2017-07-04 Kelsey MacMillan , James D. Wilson

Non-negative matrix factorization (NMF) based topic modeling is widely used in natural language processing (NLP) to uncover hidden topics of short text documents. Usually, training a high-quality topic model requires large amount of textual…

Computation and Language · Computer Science 2022-05-27 Shijing Si , Jianzong Wang , Ruiyi Zhang , Qinliang Su , Jing Xiao

Topic Modeling is an approach used for automatic comprehension and classification of data in a variety of settings, and perhaps the canonical application is in uncovering thematic structure in a corpus of documents. A number of foundational…

Machine Learning · Computer Science 2012-04-13 Sanjeev Arora , Rong Ge , Ankur Moitra

We introduce a new method based on nonnegative matrix factorization, Neural NMF, for detecting latent hierarchical structure in data. Datasets with hierarchical structure arise in a wide variety of fields, such as document classification,…

Machine Learning · Computer Science 2023-03-02 Tyler Will , Runyu Zhang , Eli Sadovnik , Mengdi Gao , Joshua Vendrow , Jamie Haddock , Denali Molitor , Deanna Needell

Nonnegative matrix factorization (NMF) has found many applications including topic modeling and document analysis. Hierarchical NMF (HNMF) variants are able to learn topics at various levels of granularity and illustrate their hierarchical…

Machine Learning · Computer Science 2022-02-16 Joshua Vendrow , Jamie Haddock , Deanna Needell

Fully unsupervised topic models have found fantastic success in document clustering and classification. However, these models often suffer from the tendency to learn less-than-meaningful or even redundant topics when the data is biased…

Machine Learning · Computer Science 2021-02-08 Joshua Vendrow , Jamie Haddock , Elizaveta Rebrova , Deanna Needell

As the amount of text data continues to grow, topic modeling is serving an important role in understanding the content hidden by the overwhelming quantity of documents. One popular topic modeling approach is non-negative matrix…

Information Retrieval · Computer Science 2022-08-23 Maksim E. Eren , Nick Solovyev , Manish Bhattarai , Kim Rasmussen , Charles Nicholas , Boian S. Alexandrov

Non-negative matrix factorization (NMF) is a common method for generating topic models from text data. NMF is widely accepted for producing good results despite its relative simplicity of implementation and ease of computation. One…

Machine Learning · Computer Science 2016-08-09 Brendan Gavin , Vijay Gadepally , Jeremy Kepner

Classification and topic modeling are popular techniques in machine learning that extract information from large-scale datasets. By incorporating a priori information such as labels or important features, methods have been developed to…

Machine Learning · Computer Science 2022-02-01 Pengyu Li , Christine Tseng , Yaxuan Zheng , Joyce A. Chew , Longxiu Huang , Benjamin Jarman , Deanna Needell

Topic modeling is a technique for organizing and extracting themes from large collections of unstructured text. Non-negative matrix factorization (NMF) is a common unsupervised approach that decomposes a term frequency-inverse document…

Machine Learning · Computer Science 2024-07-30 Selma Wanna , Ryan Barron , Nick Solovyev , Maksim E. Eren , Manish Bhattarai , Kim Rasmussen , Boian S. Alexandrov

We utilize a recently developed topic modeling method called SeNMFk, extending the standard Non-negative Matrix Factorization (NMF) methods by incorporating the semantic structure of the text, and adding a robust system for determining the…

Digital Libraries · Computer Science 2022-01-04 Valentin Stanev , Erik Skau , Ichiro Takeuchi , Boian S. Alexandrov

Nonsmooth Nonnegative Matrix Factorization (nsNMF) is capable of producing more localized, less overlapped feature representations than other variants of NMF while keeping satisfactory fit to data. However, nsNMF as well as other existing…

Computer Vision and Pattern Recognition · Computer Science 2018-03-21 Jinshi Yu , Guoxu Zhou , Andrzej Cichocki , Shengli Xie

We propose new semi-supervised nonnegative matrix factorization (SSNMF) models for document classification and provide motivation for these models as maximum likelihood estimators. The proposed SSNMF models simultaneously provide both a…

Topic modeling is widely studied for the dimension reduction and analysis of documents. However, it is formulated as a difficult optimization problem. Current approximate solutions also suffer from inaccurate model- or data-assumptions. To…

Machine Learning · Computer Science 2019-10-25 Jianyu Wang , Xiao-Lei Zhang

Non-negative matrix factorization (NMF) is a fundamental matrix decomposition technique that is used primarily for dimensionality reduction and is increasing in popularity in the biological domain. Although finding a unique NMF is generally…

Information Theory · Computer Science 2021-08-23 Rami Nasser , Yonina C. Eldar , Roded Sharan

In this work, we apply topic modeling using Non-Negative Matrix Factorization (NMF) on the COVID-19 Open Research Dataset (CORD-19) to uncover the underlying thematic structure and its evolution within the extensive body of COVID-19…

Computation and Language · Computer Science 2025-03-25 Divya Patel , Vansh Parikh , Om Patel , Agam Shah , Bhaskar Chaudhury

Nonnegative matrix factorization (NMF) has become a widely used tool for the analysis of high-dimensional data as it automatically extracts sparse and meaningful features from a set of nonnegative data vectors. We first illustrate this…

Machine Learning · Statistics 2014-12-10 Nicolas Gillis

Topic models often fail to capture low-prevalence, domain-critical themes, so-called minority topics, such as mental health themes in online comments. While some existing methods can incorporate domain knowledge, such as expected topical…

Machine Learning · Computer Science 2025-05-23 Seyedeh Fatemeh Ebrahimi , Jaakko Peltonen

Non-negative matrix factorization (NMF) is a technique for finding latent representations of data. The method has been applied to corpora to construct topic models. However, NMF has likelihood assumptions which are often violated by real…

Machine Learning · Statistics 2017-11-22 Skyler Seto , Sarah Tan , Giles Hooker , Martin T. Wells

The importance of unsupervised clustering and topic modeling is well recognized with ever-increasing volumes of text data. In this paper, we propose a fast method for hierarchical clustering and topic modeling called HierNMF2. Our method is…

Machine Learning · Computer Science 2015-10-05 Da Kuang , Barry Drake , Haesun Park
‹ Prev 1 2 3 10 Next ›