Related papers: Constraint-Based Clustering Selection

Semi-supervised clustering methods

Cluster analysis methods seek to partition a data set into homogeneous subgroups. It is useful in a wide variety of applications, including document processing and modern genetics. Conventional clustering methods are unsupervised, meaning…

Methodology · Statistics 2014-07-11 Eric Bair

Semi-Supervised Constrained Clustering: An In-Depth Overview, Ranked Taxonomy and Future Research Directions

Clustering is a well-known unsupervised machine learning approach capable of automatically grouping discrete sets of instances with similar characteristics. Constrained clustering is a semi-supervised extension to this process that can be…

Machine Learning · Computer Science 2023-03-02 Germán González-Almagro , Daniel Peralta , Eli De Poorter , José-Ramón Cano , Salvador García

Semi-supervised Clustering Ensemble by Voting

Clustering ensemble is one of the most recent advances in unsupervised learning. It aims to combine the clustering results obtained using different algorithms or from different runs of the same clustering algorithm for the same data set,…

Machine Learning · Computer Science 2012-08-22 Ashraf Mohammed Iqbal , Abidalrahman Moh'd , Zahoor Khan

Active Clustering with Model-Based Uncertainty Reduction

Semi-supervised clustering seeks to augment traditional clustering methods by incorporating side information provided via human expertise in order to increase the semantic meaningfulness of the resulting clusters. However, most current…

Machine Learning · Computer Science 2014-02-17 Caiming Xiong , David Johnson , Jason J. Corso

COBRAS-TS: A new approach to Semi-Supervised Clustering of Time Series

Clustering is ubiquitous in data analysis, including analysis of time series. It is inherently subjective: different users may prefer different clusterings for a particular dataset. Semi-supervised clustering addresses this by allowing the…

Machine Learning · Statistics 2018-05-03 Toon Van Craenendonck , Wannes Meert , Sebastijan Dumancic , Hendrik Blockeel

Seeking the Truth Beyond the Data. An Unsupervised Machine Learning Approach

Clustering is an unsupervised machine learning methodology where unlabeled elements/objects are grouped together aiming to the construction of well-established clusters that their elements are classified according to their similarity. The…

Machine Learning · Statistics 2023-10-20 Dimitrios Saligkaras , Vasileios E. Papageorgiou

Semi-Supervised Information-Maximization Clustering

Semi-supervised clustering aims to introduce prior knowledge in the decision process of a clustering algorithm. In this paper, we propose a novel semi-supervised clustering algorithm based on the information-maximization principle. The…

Machine Learning · Computer Science 2013-05-02 Daniele Calandriello , Gang Niu , Masashi Sugiyama

Ensemble Method for Cluster Number Determination and Algorithm Selection in Unsupervised Learning

Unsupervised learning, and more specifically clustering, suffers from the need for expertise in the field to be of use. Researchers must make careful and informed decisions on which algorithm to use with which set of hyperparameters for a…

Machine Learning · Computer Science 2021-12-28 Antoine Zambelli

Constrained Clustering and Multiple Kernel Learning without Pairwise Constraint Relaxation

Clustering under pairwise constraints is an important knowledge discovery tool that enables the learning of appropriate kernels or distance metrics to improve clustering performance. These pairwise constraints, which come in the form of…

Machine Learning · Computer Science 2022-03-24 Benedikt Boecking , Vincent Jeanselme , Artur Dubrawski

A Classification-Based Approach to Semi-Supervised Clustering with Pairwise Constraints

In this paper, we introduce a neural network framework for semi-supervised clustering (SSC) with pairwise (must-link or cannot-link) constraints. In contrast to existing approaches, we decompose SSC into two simpler classification…

Machine Learning · Computer Science 2020-01-22 Marek Śmieja , Łukasz Struski , Mário A. T. Figueiredo

A semi-supervised sparse K-Means algorithm

We consider the problem of data clustering with unidentified feature quality and when a small amount of labelled data is provided. An unsupervised sparse clustering method can be employed in order to detect the subgroup of features…

Machine Learning · Computer Science 2020-10-20 Avgoustinos Vouros , Eleni Vasilaki

Learning with Clustering Structure

We study supervised learning problems using clustering constraints to impose structure on either features or samples, seeking to help both prediction and interpretation. The problem of clustering features arises naturally in text…

Machine Learning · Computer Science 2016-09-20 Vincent Roulet , Fajwel Fogel , Alexandre d'Aspremont , Francis Bach

Fairness, Semi-Supervised Learning, and More: A General Framework for Clustering with Stochastic Pairwise Constraints

Metric clustering is fundamental in areas ranging from Combinatorial Optimization and Data Mining, to Machine Learning and Operations Research. However, in a variety of situations we may have additional requirements or knowledge, distinct…

Machine Learning · Computer Science 2021-03-04 Brian Brubach , Darshan Chakrabarti , John P. Dickerson , Aravind Srinivasan , Leonidas Tsepenekas

Semi-Supervised Clustering via Structural Entropy with Different Constraints

Semi-supervised clustering techniques have emerged as valuable tools for leveraging prior information in the form of constraints to improve the quality of clustering outcomes. Despite the proliferation of such methods, the ability to…

Machine Learning · Computer Science 2023-12-19 Guangjie Zeng , Hao Peng , Angsheng Li , Zhiwei Liu , Runze Yang , Chunyang Liu , Lifang He

ConstraintMatch for Semi-constrained Clustering

Constrained clustering allows the training of classification models using pairwise constraints only, which are weak and relatively easy to mine, while still yielding full-supervision-level model performance. While they perform well even in…

Machine Learning · Computer Science 2023-11-28 Jann Goschenhofer , Bernd Bischl , Zsolt Kira

A Computational Theory and Semi-Supervised Algorithm for Clustering

A computational theory for clustering and a semi-supervised clustering algorithm is presented. Clustering is defined to be the obtainment of groupings of data such that each group contains no anomalies with respect to a chosen grouping…

Machine Learning · Computer Science 2025-07-17 Nassir Mohammad

Unsupervised clustering analysis: a multiscale complex networks approach

Unsupervised clustering, also known as natural clustering, stands for the classification of data according to their similarities. Here we study this problem from the perspective of complex networks. Mapping the description of data…

Data Analysis, Statistics and Probability · Physics 2012-08-22 Clara Granell , Sergio Gomez , Alex Arenas

Constrained Clustering: General Pairwise and Cardinality Constraints

We study constrained clustering, where constraints guide the clustering process. In existing works, two categories of constraints have been widely explored, namely pairwise and cardinality constraints. Pairwise constraints enforce the…

Machine Learning · Computer Science 2023-01-30 Adel Bibi , Ali Alqahtani , Bernard Ghanem

SACA: Selective Attention-Based Clustering Algorithm

Clustering algorithms are fundamental tools across many fields, with density-based methods offering particular advantages in identifying arbitrarily shaped clusters and handling noise. However, their effectiveness is often limited by the…

Machine Learning · Computer Science 2025-12-01 Meysam Shirdel Bilehsavar , Razieh Ghaedi , Samira Seyed Taheri , Xinqi Fan , Christian O'Reilly

Hierarchical Clustering with Structural Constraints

Hierarchical clustering is a popular unsupervised data analysis method. For many real-world applications, we would like to exploit prior information about the data that imposes constraints on the clustering hierarchy, and is not captured by…

Data Structures and Algorithms · Computer Science 2018-07-17 Vaggos Chatziafratis , Rad Niazadeh , Moses Charikar