Related papers: Semantic Word Clusters Using Signed Normalized Gra…

Graph-based Clustering for Detecting Semantic Change Across Time and Languages

Despite the predominance of contextualized embeddings in NLP, approaches to detect semantic change relying on these embeddings and clustering methods underperform simpler counterparts based on static word embeddings. This stems from the…

Computation and Language · Computer Science · 2024-02-05 · Xianghe Ma, Michael Strube, Wei Zhao

Thresholding of Semantic Similarity Networks using a Spectral Graph Based Technique

Semantic similarity measures (SSMs) refer to a set of algorithms used to quantify the similarity of two or more terms belonging to the same ontology. Ontology terms may be associated with concepts, for instance in computational biology gene…

Molecular Networks · Quantitative Biology · 2013-05-22 · Pietro Hiram Guzzi, Simone Truglia, Pierangelo Veltri, Mario Cannataro

Word-Centered Semantic Graphs for Interpretable Diachronic Sense Tracking

We propose an interpretable, graph-based framework for analyzing semantic shift in diachronic corpora. For each target word and time slice, we induce a word-centered semantic network that integrates distributional similarity from diachronic…

Computation and Language · Computer Science · 2026-02-02 · Imene Kolli, Kai-Robin Lange, Jonas Rieger, Carsten Jentsch

Semantic Word Clouds with Background Corpus Normalization and t-distributed Stochastic Neighbor Embedding

Many word clouds provide no semantics to the word placement, but use a random layout optimized solely for aesthetic purposes. We propose a novel approach to model word significance and word affinity within a document, and in comparison to a…

Information Retrieval · Computer Science · 2017-08-14 · Erich Schubert, Andreas Spitz, Michael Weiler, Johanna Geiß, Michael Gertz

Neural Text Classification by Jointly Learning to Cluster and Align

Distributional text clustering delivers semantically informative representations and captures the relevance between each word and semantic clustering centroids. We extend the neural text clustering approach to text classification tasks by…

Computation and Language · Computer Science · 2020-11-25 · Yekun Chai, Haidong Zhang, Shuo Jin

Scalable Multilevel and Memetic Signed Graph Clustering

In this study, we address the complex issue of graph clustering in signed graphs, which are characterized by positive and negative weighted edges representing attraction and repulsion among nodes, respectively. The primary objective is to…

Data Structures and Algorithms · Computer Science · 2024-07-10 · Felix Hausberger, Marcelo Fonseca Faraj, Christian Schulz

Mimicking Human Process: Text Representation via Latent Semantic Clustering for Classification

Considering that words with different characteristics in the text have different importance for classification, grouping them together separately can strengthen the semantic expression of each part. Thus, we propose a new text representation…

Computation and Language · Computer Science · 2019-06-19 · Xiaoye Tan, Rui Yan, Chongyang Tao, Mingrui Wu

Using Curvature and Markov Clustering in Graphs for Lexical Acquisition and Word Sense Discrimination

We introduce two different approaches for clustering semantically similar words. We accommodate ambiguity by allowing a word to belong to several clusters. Both methods use a graph-theoretic representation of words and their paradigmatic…

Other Condensed Matter · Physics · 2009-09-29 · Beate Dorow, Dominic Widdows, Katarina Ling, Jean-Pierre Eckmann, Danilo Sergi, Elisha Moses

Analysis of Word Embeddings Using Fuzzy Clustering

In data dominated systems and applications, a concept of representing words in a numerical format has gained a lot of attention. There are a few approaches used to generate such a representation. An interesting issue that should be…

Computation and Language · Computer Science · 2020-12-08 · Shahin Atakishiyev, Marek Z. Reformat

Word Embeddings and Validity Indexes in Fuzzy Clustering

In the new era of internet systems and applications, a concept of detecting distinguished topics from huge amounts of text has gained a lot of attention. These methods use representation of text in a numerical format -- called embeddings --…

Computation and Language · Computer Science · 2022-05-16 · Danial Toufani-Movaghar, Mohammad-Reza Feizi-Derakhshi

Computing Word Classes Using Spectral Clustering

Clustering a lexicon of words is a well-studied problem in natural language processing (NLP). Word clusters are used to deal with sparse data in statistical language processing, as well as features for solving various NLP tasks (text…

Computation and Language · Computer Science · 2018-08-17 · Effi Levi, Saggy Herman, Ari Rappoport

Semantic Document Clustering on Named Entity Features

Keyword-based information processing has limitations due to simple treatment of words. In this paper, we introduce named entities as objectives into document clustering, which are the key elements defining document semantics and in many…

Information Retrieval · Computer Science · 2018-07-23 · Tru H. Cao, Vuong M. Ngo, Dung T. Hong, Tho T. Quan

Aligning Linguistic Words and Visual Semantic Units for Image Captioning

Image captioning attempts to generate a sentence composed of several linguistic words, which are used to describe objects, attributes, and interactions in an image, denoted as visual semantic units in this paper. Based on this view, we…

Computer Vision and Pattern Recognition · Computer Science · 2019-08-07 · Longteng Guo, Jing Liu, Jinhui Tang, Jiangwei Li, Wei Luo, Hanqing Lu

From Image to Text Classification: A Novel Approach based on Clustering Word Embeddings

In this paper, we propose a novel approach for text classification based on clustering word embeddings, inspired by the bag of visual words model, which is widely used in computer vision. After each word in a collection of documents is…

Computation and Language · Computer Science · 2017-07-26 · Andrei M. Butnaru, Radu Tudor Ionescu

Spectral Clustering of Attributed Multi-relational Graphs

Graph clustering aims at discovering a natural grouping of the nodes such that similar nodes are assigned to a common cluster. Many different algorithms have been proposed in the literature: for simple graphs, for graphs with attributes…

Machine Learning · Computer Science · 2023-11-06 · Ylli Sadikaj, Yllka Velaj, Sahar Behzadi, Claudia Plant

Correlation Clustering for General Graphs

Correlation clustering provides a method for separating the vertices of a signed graph into the optimum number of clusters without specifying that number in advance. The main goal in this type of clustering is to minimize the number of…

Combinatorics · Mathematics · 2025-07-15 · Leila Parsaei-Majd

Text-Guided Image Clustering

Image clustering divides a collection of images into meaningful groups, typically interpreted post-hoc via human-given annotations. Those are usually in the form of text, begging the question of using text as an abstraction for image…

Machine Learning · Computer Science · 2024-02-20 · Andreas Stephan, Lukas Miklautz, Kevin Sidak, Jan Philip Wahle, Bela Gipp, Claudia Plant, Benjamin Roth

Language Modeling by Clustering with Word Embeddings for Text Readability Assessment

We present a clustering-based language model using word embeddings for text readability prediction. Presumably, a Euclidean semantic space hypothesis holds true for word embeddings whose training is done by observing word co-occurrences.…

Computation and Language · Computer Science · 2017-09-07 · Miriam Cha, Youngjune Gwon, H. T. Kung

Constrained Hierarchical Clustering via Graph Coarsening and Optimal Cuts

Motivated by extracting and summarizing relevant information in short sentence settings, such as satisfaction questionnaires, hotel reviews, and X/Twitter, we study the problem of clustering words in a hierarchical fashion. In particular,…

Machine Learning · Computer Science · 2023-12-08 · Eliabelle Mauduit, Andrea Simonetto

On Layered Area-Proportional Rectangle Contact Representations

Semantic word clouds visualize the semantic relatedness between the words of a text by placing pairs of related words close to each other. Formally, the problem of drawing semantic word clouds corresponds to drawing a rectangle contact…

Computational Geometry · Computer Science · 2026-02-05 · Carolina Haase, Philipp Kindermann