Related papers: CAST: A Correlation-based Adaptive Spectral Cluste…

Spectral Clustering with Smooth Tiny Clusters

Spectral clustering is one of the most prominent clustering approaches. The distance-based similarity is the most widely used method for spectral clustering. However, people have already noticed that this is not suitable for multi-scale…

Machine Learning · Computer Science 2020-09-11 Hengrui Wang, Yubo Zhang, Mingzhi Chen, Tong Yang

Correlation Adaptive Subspace Segmentation by Trace Lasso

This paper studies the subspace segmentation problem. Given a set of data points drawn from a union of subspaces, the goal is to partition them into the underlying subspaces they were drawn from. The spectral clustering method is used as…

Computer Vision and Pattern Recognition · Computer Science 2015-01-20 Canyi Lu, Jiashi Feng, Zhouchen Lin, Shuicheng Yan

Spectral Clustering: An empirical study of Approximation Algorithms and its Application to the Attrition Problem

Clustering is the problem of separating a set of objects into groups (called clusters) so that objects within the same cluster are more similar to each other than to those in different clusters. Spectral clustering is a now well-known…

Machine Learning · Computer Science 2012-11-16 B. Cung, T. Jin, J. Ramirez, A. Thompson, C. Boutsidis, D. Needell

Research on Clustering Performance of Sparse Subspace Clustering

Recently, sparse subspace clustering has been a valid tool to deal with high-dimensional data. There are two essential steps in the framework of sparse subspace clustering. One is solving the coefficient matrix of data, and the other is…

Computer Vision and Pattern Recognition · Computer Science 2019-12-24 Wen-Jin Fu, Xiao-Jun Wu, He-Feng Yin, Wen-Bo Hu

Scalable Spectral Clustering with Nystrom Approximation: Practical and Theoretical Aspects

Spectral clustering techniques are valuable tools in signal processing and machine learning for partitioning complex data sets. The effectiveness of spectral clustering stems from constructing a non-linear embedding based on creating a…

Machine Learning · Computer Science 2021-02-02 Farhad Pourkamali-Anaraki

Contraction Clustering (RASTER): A Very Fast Big Data Algorithm for Sequential and Parallel Density-Based Clustering in Linear Time, Constant Memory, and a Single Pass

Clustering is an essential data mining tool for analyzing and grouping similar objects. In big data applications, however, many clustering algorithms are infeasible due to their high memory requirements and/or unfavorable runtime…

Data Structures and Algorithms · Computer Science 2026-01-27 Gregor Ulm, Simon Smith, Adrian Nilsson, Emil Gustavsson, Mats Jirstrand

Factor Adjusted Spectral Clustering for Mixture Models

This paper studies a factor modeling-based approach for clustering high-dimensional data generated from a mixture of strongly correlated variables. Statistical modeling with correlated structures pervades modern applications in economics,…

Statistics Theory · Mathematics 2024-08-23 Shange Tang, Soham Jana, Jianqing Fan

Toward Efficient and Incremental Spectral Clustering via Parametric Spectral Clustering

Spectral clustering is a popular method for effectively clustering nonlinearly separable data. However, computational limitations, memory requirements, and the inability to perform incremental learning challenge its widespread application.…

Machine Learning · Computer Science 2023-11-15 Jo-Chun Chen, Hung-Hsuan Chen

Sparse-Dense Subspace Clustering

Subspace clustering refers to the problem of clustering high-dimensional data into a union of low-dimensional subspaces. Current subspace clustering approaches are usually based on a two-stage framework. In the first stage, an affinity…

Machine Learning · Computer Science 2019-10-22 Shuai Yang, Wenqi Zhu, Yuesheng Zhu

Construction of the similarity matrix for the spectral clustering method: numerical experiments

Spectral clustering is a powerful method for finding structure in a dataset through the eigenvectors of a similarity matrix. It often outperforms traditional clustering algorithms such as $k$-means when the structure of the individual…

Numerical Analysis · Mathematics 2019-04-26 Paola Favati, Grazia Lotti, Ornella Menchi, Francesco Romani

On some spectral properties of stochastic similarity matrices for data clustering

Clustering in image analysis is a central technique that allows elements of an image to be classified. We describe a simple clustering technique that uses the method of similarity matrices. We expand upon recent results in spectral analysis for…

Statistics Theory · Mathematics 2022-03-23 Denis Gaidashev, Ralf Pihlström, Martin Ryner

A fast and Accurate Similarity-constrained Subspace Clustering Framework for Unsupervised Hyperspectral Image Classification

Accurate land cover segmentation of spectral images is challenging and has drawn widespread attention in remote sensing due to its inherent complexity. Although significant efforts have been made for developing a variety of methods, most of…

Image and Video Processing · Electrical Eng. & Systems 2021-11-30 Carlos Hinojosa, Esteban Vera, Henry Arguello

Clustering by Mining Density Distributions and Splitting Manifold Structure

Spectral clustering requires the time-consuming decomposition of the Laplacian matrix of the similarity graph, thus limiting its applicability to large datasets. To improve the efficiency of spectral clustering, a top-down approach was…

Machine Learning · Computer Science 2024-12-19 Zhichang Xu, Zhiguo Long, Hua Meng

Large Scale Spectral Clustering Using Approximate Commute Time Embedding

Spectral clustering is a novel clustering method which can detect complex shapes of data clusters. However, it requires the eigen decomposition of the graph Laplacian matrix, whose cost is proportional to $O(n^3)$ and thus is not suitable for…

Machine Learning · Computer Science 2013-07-02 Nguyen Lu Dang Khoa, Sanjay Chawla

Scalable Co-Clustering for Large-Scale Data through Dynamic Partitioning and Hierarchical Merging

Co-clustering simultaneously clusters rows and columns, revealing more fine-grained groups. However, existing co-clustering methods suffer from poor scalability and cannot handle large-scale data. This paper presents a novel and scalable…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-20 Zihan Wu, Zhaoke Huang, Hong Yan

Efficient Clustering of Correlated Variables and Variable Selection in High-Dimensional Linear Models

In this paper, we introduce the Adaptive Cluster Lasso (ACL) method for variable selection in high-dimensional sparse regression models with strongly correlated variables. To handle correlated variables, the concept of clustering or grouping…

Machine Learning · Statistics 2016-03-14 Niharika Gauraha, Swapan K. Parui

Robust Two-Layer Partition Clustering of Sparse Multivariate Functional Data

A novel elastic time distance for sparse multivariate functional data is proposed and used to develop a robust distance-based two-layer partition clustering method. With this proposed distance, the new approach not only can detect correct…

Methodology · Statistics 2023-03-21 Zhuo Qu, Wenlin Dai, Marc G. Genton

Robust Subspace Clustering via Thresholding

The problem of clustering noisy and incompletely observed high-dimensional data points into a union of low-dimensional subspaces and a set of outliers is considered. The number of subspaces, their dimensions, and their orientations are…

Machine Learning · Statistics 2015-08-24 Reinhard Heckel, Helmut Bölcskei

Clustering Plotted Data by Image Segmentation

Clustering algorithms are one of the main analytical methods to detect patterns in unlabeled data. Existing clustering methods typically treat samples in a dataset as points in a metric space and compute distances to group together similar…

Machine Learning · Computer Science 2021-10-12 Tarek Naous, Srinjay Sarkar, Abubakar Abid, James Zou

Scalable and Robust Sparse Subspace Clustering Using Randomized Clustering and Multilayer Graphs

Sparse subspace clustering (SSC) is one of the current state-of-the-art methods for partitioning data points into the union of subspaces, with strong theoretical guarantees. However, it is not practical for large data sets as it requires…

Computer Vision and Pattern Recognition · Computer Science 2019-08-06 Maryam Abdolali, Nicolas Gillis, Mohammad Rahmati