Related papers: Self-Supervised Graph Embedding Clustering

Manifold Clustering with Schatten p-norm Maximization

Manifold clustering, with its exceptional ability to capture complex data structures, holds a pivotal position in cluster analysis. However, existing methods often focus only on finding the optimal combination between K-means and manifold…

Machine Learning · Computer Science 2025-04-30 Fangfang Li , Quanxue Gao

Structured Graph Learning for Clustering and Semi-supervised Classification

Graphs have become increasingly popular in modeling structures and interactions in a wide variety of problems during the last decade. Graph-based clustering and semi-supervised classification techniques have shown impressive performance.…

Machine Learning · Computer Science 2020-09-01 Zhao Kang , Chong Peng , Qiang Cheng , Xinwang Liu , Xi Peng , Zenglin Xu , Ling Tian

Unified Spectral Clustering with Optimal Graph

Spectral clustering has found extensive use in many areas. Most traditional spectral clustering algorithms work in three separate steps: similarity graph construction; continuous labels learning; discretizing the learned labels by k-means…

Machine Learning · Computer Science 2017-11-15 Zhao Kang , Chong Peng , Qiang Cheng , Zenglin Xu

Differentiable Deep Clustering with Cluster Size Constraints

Clustering is a fundamental unsupervised learning approach. Many clustering algorithms -- such as $k$-means -- rely on the Euclidean distance as a similarity measure, which is often not the most relevant metric for high-dimensional data…

Machine Learning · Computer Science 2019-10-22 Aude Genevay , Gabriel Dulac-Arnold , Jean-Philippe Vert

No More Than 6ft Apart: Robust K-Means via Radius Upper Bounds

Centroid based clustering methods such as k-means, k-medoids and k-centers are heavily applied as a go-to tool in exploratory data analysis. In many cases, those methods are used to obtain representative centroids of the data manifold for…

Machine Learning · Computer Science 2022-06-16 Ahmed Imtiaz Humayun , Randall Balestriero , Anastasios Kyrillidis , Richard Baraniuk

Structured Graph Learning for Scalable Subspace Clustering: From Single-view to Multi-view

Graph-based subspace clustering methods have exhibited promising performance. However, they still suffer some of these drawbacks: encounter the expensive time overhead, fail in exploring the explicit clusters, and cannot generalize to…

Machine Learning · Computer Science 2021-02-23 Zhao Kang , Zhiping Lin , Xiaofeng Zhu , Wenbo Xu

Unsupervised cryo-EM data clustering through adaptively constrained K-means algorithm

In single-particle cryo-electron microscopy (cryo-EM), the K-means clustering algorithm is widely used in unsupervised 2D classification of projection images of biological macromolecules. 3D ab initio reconstruction requires accurate…

Quantitative Methods · Quantitative Biology 2017-01-03 Yaofang Xu , Jiayi Wu , Chang-Cheng Yin , Youdong Mao

Unsupervised Feature Learning with C-SVDDNet

In this paper, we investigate the problem of learning feature representation from unlabeled data using a single-layer K-means network. A K-means network maps the input data into a feature representation by finding the nearest centroid for…

Computer Vision and Pattern Recognition · Computer Science 2015-06-01 Dong Wang , Xiaoyang Tan

Unsupervised Learning: Comparative Analysis of Clustering Techniques on High-Dimensional Data

This paper presents a comprehensive comparative analysis of prominent clustering algorithms K-means, DBSCAN, and Spectral Clustering on high-dimensional datasets. We introduce a novel evaluation framework that assesses clustering…

Machine Learning · Computer Science 2025-07-31 Vishnu Vardhan Baligodugula , Fathi Amsaad

K-means Clustering Based Feature Consistency Alignment for Label-free Model Evaluation

The label-free model evaluation aims to predict the model performance on various test sets without relying on ground truths. The main challenge of this task is the absence of labels in the test data, unlike in classical supervised model…

Machine Learning · Computer Science 2023-04-20 Shuyu Miao , Lin Zheng , Jingjing Liu , Hong Jin

A semi-supervised sparse K-Means algorithm

We consider the problem of data clustering with unidentified feature quality and when a small amount of labelled data is provided. An unsupervised sparse clustering method can be employed in order to detect the subgroup of features…

Machine Learning · Computer Science 2020-10-20 Avgoustinos Vouros , Eleni Vasilaki

Unsupervised Machine Learning to Classify the Confinement of Waves in Periodic Superstructures

We employ unsupervised machine learning to enhance the accuracy of our recently presented scaling method for wave confinement analysis [1]. We employ the standard k-means++ algorithm as well as our own model-based algorithm. We investigate…

Optics · Physics 2023-12-05 Marek Kozoň , Rutger Schrijver , Matthias Schlottbom , Jaap J. W. van der Vegt , Willem L. Vos

The Laplacian K-modes algorithm for clustering

In addition to finding meaningful clusters, centroid-based clustering algorithms such as K-means or mean-shift should ideally find centroids that are valid patterns in the input space, representative of data in their cluster. This is…

Machine Learning · Computer Science 2014-06-17 Weiran Wang , Miguel Á. Carreira-Perpiñán

One-step Multi-view Clustering with Diverse Representation

Multi-view clustering has attracted broad attention due to its capacity to utilize consistent and complementary information among views. Although tremendous progress has been made recently, most existing methods undergo high complexity,…

Machine Learning · Computer Science 2023-06-28 Xinhang Wan , Jiyuan Liu , Xinwang Liu , Siwei Wang , Yi Wen , Tianjiao Wan , Li Shen , En Zhu

Spatial Transformer K-Means

K-means defines one of the most employed centroid-based clustering algorithms with performances tied to the data's embedding. Intricate data embeddings have been designed to push $K$-means performances at the cost of reduced theoretical…

Machine Learning · Computer Science 2022-02-17 Romain Cosentino , Randall Balestriero , Yanis Bahroun , Anirvan Sengupta , Richard Baraniuk , Behnaam Aazhang

Sample-and-Search: An Effective Algorithm for Learning-Augmented k-Median Clustering in High dimensions

In this paper, we investigate the learning-augmented $k$-median clustering problem, which aims to improve the performance of traditional clustering algorithms by preprocessing the point set with a predictor of error rate $\alpha \in [0,1)$.…

Data Structures and Algorithms · Computer Science 2026-03-12 Kangke Cheng , Shihong Song , Guanlin Mo , Hu Ding

Fast k-means based on KNN Graph

In the era of big data, k-means clustering has been widely adopted as a basic processing tool in various contexts. However, its computational cost could be prohibitively high as the data size and the cluster number are large. It is well…

Machine Learning · Computer Science 2017-05-05 Cheng-Hao Deng , Wan-Lei Zhao

Embed and Conquer: Scalable Embeddings for Kernel k-Means on MapReduce

The kernel $k$-means is an effective method for data clustering which extends the commonly-used $k$-means algorithm to work on a similarity matrix over complex data structures. The kernel $k$-means algorithm is however computationally very…

Machine Learning · Computer Science 2014-01-30 Ahmed Elgohary , Ahmed K. Farahat , Mohamed S. Kamel , Fakhri Karray

An Approach Towards Learning K-means-friendly Deep Latent Representation

Clustering is a long-standing problem area in data mining. The centroid-based classical approaches to clustering mainly face difficulty in the case of high dimensional inputs such as images. With the advent of deep neural networks, a common…

Machine Learning · Computer Science 2024-12-02 Debapriya Roy

Distributed Kernel K-Means for Large Scale Clustering

Clustering samples according to an effective metric and/or vector space representation is a challenging unsupervised learning task with a wide spectrum of applications. Among several clustering algorithms, k-means and its kernelized version…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-10-10 Marco Jacopo Ferrarotti , Sergio Decherchi , Walter Rocchia