Related papers: Robust Continuous Co-Clustering

Scalable Co-Clustering for Large-Scale Data through Dynamic Partitioning and Hierarchical Merging

Co-clustering simultaneously clusters rows and columns, revealing more fine-grained groups. However, existing co-clustering methods suffer from poor scalability and cannot handle large-scale data. This paper presents a novel and scalable…

Distributed, Parallel, and Cluster Computing · Computer Science · 2025-03-20 · Zihan Wu, Zhaoke Huang, Hong Yan

Non-Exhaustive, Overlapping Co-Clustering: An Extended Analysis

The goal of co-clustering is to simultaneously identify a clustering of the rows as well as the columns of a two-dimensional data matrix. A number of co-clustering techniques have been proposed including information-theoretic co-clustering and the…

Machine Learning · Computer Science · 2020-04-27 · Joyce Jiyoung Whang, Inderjit S. Dhillon

Efficient Dynamic Clustering: Capturing Patterns from Historical Cluster Evolution

Clustering aims to group unlabeled objects based on similarity inherent among them into clusters. It is important for many tasks such as anomaly detection, database sharding, record linkage, and others. Some clustering methods are taken as…

Databases · Computer Science · 2024-12-02 · Binbin Gu, Saeed Kargar, Faisal Nawab

Relational Multi-Manifold Co-Clustering

Co-clustering aims to group the samples (e.g., documents, users) and the features (e.g., words, ratings) simultaneously. It employs the dual relation and the bilateral information between the samples and features. In many real-world…

Machine Learning · Computer Science · 2016-11-18 · Ping Li, Jiajun Bu, Chun Chen, Zhanying He, Deng Cai

Predictive Overlapping Co-Clustering

In the past few years, co-clustering has emerged as an important data mining tool for two-way data analysis. Co-clustering has several advantages over traditional one-dimensional clustering, such as the ability to find highly…

Machine Learning · Computer Science · 2014-12-02 · Chandrima Sarkar, Jaideep Srivastava

Scalable Robust Bayesian Co-Clustering with Compositional ELBOs

Co-clustering exploits the duality of instances and features to simultaneously uncover meaningful groups in both dimensions, often outperforming traditional clustering in high-dimensional or sparse data settings. Although recent deep…

Machine Learning · Computer Science · 2025-04-10 · Ashwin Vinod, Chandrajit Bajaj

Co-Clustering Under the Maximum Norm

Co-clustering, that is, partitioning a numerical matrix into homogeneous submatrices, has many applications ranging from bioinformatics to election analysis. Many interesting variants of co-clustering are NP-hard. We focus on the basic…

Discrete Mathematics · Computer Science · 2019-06-17 · Laurent Bulteau, Vincent Froese, Sepp Hartung, Rolf Niedermeier

Convex Biclustering

In the biclustering problem, we seek to simultaneously group observations and features. While biclustering has applications in a wide array of domains, ranging from text mining to collaborative filtering, the problem of identifying…

Methodology · Statistics · 2018-06-07 · Eric C. Chi, Genevera I. Allen, Richard G. Baraniuk

Co-clustering through Optimal Transport

In this paper, we present a novel method for co-clustering, an unsupervised learning approach that aims at discovering homogeneous groups of data instances and features by grouping them simultaneously. The proposed method uses the entropy…

Machine Learning · Statistics · 2017-05-22 · Charlotte Laclau, Ievgen Redko, Basarab Matei, Younès Bennani, Vincent Brault

Clustering with Obstacles in Spatial Databases

Clustering large spatial databases is an important problem: it tries to find the densely populated regions in a spatial area, to be used in data mining, knowledge discovery, or efficient information retrieval. However, most algorithms have…

Databases · Computer Science · 2009-09-25 · Mohamed A. El-Zawawy, Mohamed E. El-Sharkawi

Rethinking Feature Discrimination and Polymerization for Large-scale Recognition

Feature matters. How to train a deep network to acquire discriminative features across categories and polymerized features within classes has always been at the core of many computer vision tasks, especially for large-scale recognition…

Computer Vision and Pattern Recognition · Computer Science · 2017-10-31 · Yu Liu, Hongyang Li, Xiaogang Wang

Convex Hierarchical Clustering for Graph-Structured Data

Convex clustering is a recent stable alternative to hierarchical clustering. It formulates the recovery of progressively coalescing clusters as a regularized convex problem. While convex clustering was originally designed for handling…

Applications · Statistics · 2019-12-12 · Claire Donnat, Susan Holmes

Mining frequency-based sequential trajectory co-clusters

Co-clustering is a specific type of clustering that addresses the problem of finding groups of objects without necessarily considering all attributes. This technique has been shown to give more consistent results in high-dimensional sparse data…

Machine Learning · Computer Science · 2021-10-28 · Yuri Santos, Jônata Tyska, Vania Bogorny

Improving Unsupervised Image Clustering With Robust Learning

Unsupervised image clustering methods often introduce alternative objectives to indirectly train the model and are subject to faulty predictions and overconfident results. To overcome these challenges, the current research proposes an…

Computer Vision and Pattern Recognition · Computer Science · 2021-03-30 · Sungwon Park, Sungwon Han, Sundong Kim, Danu Kim, Sungkyu Park, Seunghoon Hong, Meeyoung Cha

Robust Hierarchical Clustering

One of the most widely used techniques for data clustering is agglomerative clustering. Such algorithms have long been used across many different fields, ranging from computational biology to social sciences to computer vision, in part…

Machine Learning · Computer Science · 2014-07-15 · Maria-Florina Balcan, Yingyu Liang, Pramod Gupta

PASCO (PArallel Structured COarsening): an overlay to speed up graph clustering algorithms

Clustering the nodes of a graph is a cornerstone of graph analysis and has been extensively studied. However, some popular methods are not suitable for very large graphs: e.g., spectral clustering requires the computation of the spectral…

Machine Learning · Computer Science · 2025-06-13 · Etienne Lasalle, Rémi Vaudaine, Titouan Vayer, Pierre Borgnat, Rémi Gribonval, Paulo Gonçalves, Màrton Karsai

CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data

We study the problem of applying spectral clustering to cluster multi-scale data, which is data whose clusters are of various sizes and densities. Traditional spectral clustering techniques discover clusters by processing a similarity…

Machine Learning · Computer Science · 2020-06-09 · Xiang Li, Ben Kao, Caihua Shan, Dawei Yin, Martin Ester

Spectral Clustering with Smooth Tiny Clusters

Spectral clustering is one of the most prominent clustering approaches. The distance-based similarity is the most widely used method for spectral clustering. However, people have already noticed that this is not suitable for multi-scale…

Machine Learning · Computer Science · 2020-09-11 · Hengrui Wang, Yubo Zhang, Mingzhi Chen, Tong Yang

A Hash-based Co-Clustering Algorithm for Categorical Data

Many real-life data are described by categorical attributes without a pre-classification. A common data mining method used to extract information from this type of data is clustering. This method groups together the samples from the data…

Machine Learning · Computer Science · 2014-07-30 · Fabricio Olivetti de França

DAOC: Stable Clustering of Large Networks

Clustering is a crucial component of many data mining systems involving the analysis and exploration of various data. Data diversity calls for clustering algorithms to be accurate while providing stable (i.e., deterministic and robust)…

Social and Information Networks · Computer Science · 2019-12-19 · Artem Lutov, Mourad Khayati, Philippe Cudré-Mauroux