Related papers: Query-augmented Active Metric Learning

Constrained Clustering and Multiple Kernel Learning without Pairwise Constraint Relaxation

Clustering under pairwise constraints is an important knowledge discovery tool that enables the learning of appropriate kernels or distance metrics to improve clustering performance. These pairwise constraints, which come in the form of…

Machine Learning · Computer Science 2022-03-24 Benedikt Boecking, Vincent Jeanselme, Artur Dubrawski

Information-Theoretic Active Correlation Clustering

Correlation clustering is a flexible framework for partitioning data based solely on pairwise similarity or dissimilarity information, without requiring the number of clusters as input. However, in many practical scenarios, these pairwise…

Machine Learning · Computer Science 2025-12-11 Linus Aronsson, Morteza Haghir Chehreghani

A3S: A General Active Clustering Method with Pairwise Constraints

Active clustering aims to boost the clustering performance by integrating human-annotated pairwise constraints through strategic querying. Conventional approaches with semi-supervised clustering schemes encounter high query costs when…

Machine Learning · Computer Science 2024-07-16 Xun Deng, Junlong Liu, Han Zhong, Fuli Feng, Chen Shen, Xiangnan He, Jieping Ye, Zheng Wang

Correlation Clustering with Active Learning of Pairwise Similarities

Correlation clustering is a well-known unsupervised learning setting that deals with positive and negative pairwise similarities. In this paper, we study the case where the pairwise similarities are not given in advance and must be queried…

Machine Learning · Computer Science 2024-02-14 Linus Aronsson, Morteza Haghir Chehreghani

Active metric learning and classification using similarity queries

Active learning is commonly used to train label-efficient models by adaptively selecting the most informative queries. However, most active learning strategies are designed to either learn a representation of the data (e.g., embedding or…

Machine Learning · Computer Science 2022-02-07 Namrata Nadagouda, Austin Xu, Mark A. Davenport

Bounded Memory Active Learning through Enriched Queries

The explosive growth of easily-accessible unlabeled data has led to growing interest in active learning, a paradigm in which data-hungry learning algorithms adaptively select informative examples in order to lower prohibitively expensive…

Machine Learning · Computer Science 2021-02-11 Max Hopkins, Daniel Kane, Shachar Lovett, Michal Moshkovitz

Bayesian Active Distance Metric Learning

Distance metric learning is an important component for many tasks, such as statistical classification and content-based image retrieval. Existing approaches for learning distance metrics from pairwise constraints typically suffer from two…

Machine Learning · Computer Science 2012-06-26 Liu Yang, Rong Jin, Rahul Sukthankar

Subspace Clustering with Active Learning

Subspace clustering is a growing field of unsupervised learning that has gained much popularity in the computer vision community. Applications can be found in areas such as motion segmentation and face clustering. It assumes that data…

Machine Learning · Statistics 2019-11-12 Hankui Peng, Nicos G. Pavlidis

Active Learning in Incomplete Label Multiple Instance Multiple Label Learning

In multiple instance multiple label learning, each sample, a bag, consists of multiple instances. To alleviate labeling complexity, each sample is associated with a set of bag-level labels leaving instances within the bag unlabeled. This…

Machine Learning · Computer Science 2021-07-28 Tam Nguyen, Raviv Raich

Active Learning for Regression with Aggregated Outputs

Due to the privacy protection or the difficulty of data collection, we cannot observe individual outputs for each instance, but we can observe aggregated outputs that are summed over multiple instances in a set in some real-world…

Machine Learning · Statistics 2022-10-05 Tomoharu Iwata

Mitigating shortage of labeled data using clustering-based active learning with diversity exploration

In this paper, we proposed a new clustering-based active learning framework, namely Active Learning using a Clustering-based Sampling (ALCS), to address the shortage of labeled data. ALCS employs a density-based clustering approach to…

Machine Learning · Computer Science 2022-07-08 Xuyang Yan, Shabnam Nazmi, Biniam Gebru, Mohd Anwar, Abdollah Homaifar, Mrinmoy Sarkar, Kishor Datta Gupta

Cost-Accuracy Aware Adaptive Labeling for Active Learning

Conventional active learning algorithms assume a single labeler that produces noiseless labels at a given, fixed cost, and aim to achieve the best generalization performance for a given classifier under a budget constraint. However, in many…

Machine Learning · Computer Science 2021-05-25 Ruijiang Gao, Maytal Saar-tsechansky

MaxiMin Active Learning in Overparameterized Model Classes

Generating labeled training datasets has become a major bottleneck in Machine Learning (ML) pipelines. Active ML aims to address this issue by designing learning algorithms that automatically and adaptively select the most informative…

Machine Learning · Computer Science 2020-04-29 Mina Karzand, Robert D. Nowak

Active Statistical Inference

Inspired by the concept of active learning, we propose active inference–a methodology for statistical inference with machine-learning-assisted data collection. Assuming a budget on the number of labels that can be collected,…

Machine Learning · Statistics 2026-04-09 Tijana Zrnic, Emmanuel J. Candès

Exploring dual information in distance metric learning for clustering

Distance metric learning algorithms aim to appropriately measure similarities and distances between data points. In the context of clustering, metric learning is typically applied with the assistance of side-information provided by experts,…

Machine Learning · Computer Science 2021-05-27 Rodrigo Randel, Daniel Aloise, Alain Hertz

Active clustering for labeling training data

Gathering training data is a key step of any supervised learning task, and it is both critical and expensive. Critical, because the quantity and quality of the training data has a high impact on the performance of the learned function.…

Data Structures and Algorithms · Computer Science 2021-10-28 Quentin Lutz, Élie de Panafieu, Alex Scott, Maya Stein

Active Learning and Transfer Learning for Anomaly Detection in Time-Series Data

This paper examines the effectiveness of combining active learning and transfer learning for anomaly detection in cross-domain time-series data. Our results indicate that there is an interaction between clustering and active learning and in…

Machine Learning · Computer Science 2025-08-07 John D. Kelleher, Matthew Nicholson, Rahul Agrahari, Clare Conran

ConstraintMatch for Semi-constrained Clustering

Constrained clustering allows the training of classification models using pairwise constraints only, which are weak and relatively easy to mine, while still yielding full-supervision-level model performance. While they perform well even in…

Machine Learning · Computer Science 2023-11-28 Jann Goschenhofer, Bernd Bischl, Zsolt Kira

Combining MixMatch and Active Learning for Better Accuracy with Fewer Labels

We propose using active learning based techniques to further improve the state-of-the-art semi-supervised learning MixMatch algorithm. We provide a thorough empirical evaluation of several active-learning and baseline methods, which…

Machine Learning · Computer Science 2019-12-04 Shuang Song, David Berthelot, Afshin Rostamizadeh

Active Learning Using Uncertainty Information

Many active learning methods belong to the retraining-based approaches, which select one unlabeled instance, add it to the training set with its possible labels, retrain the classification model, and evaluate the criteria that we base our…

Machine Learning · Statistics 2017-03-01 Yazhou Yang, Marco Loog