Related papers: Frugal Algorithm Selection

Active learning to optimise time-expensive algorithm selection

Hard optimisation problems such as Boolean Satisfiability typically have long solving times and can usually be solved by many algorithms, although the performance can vary widely in practice. Research has shown that no single algorithm…

Machine Learning · Computer Science 2019-09-10 Riccardo Volpato, Guangyan Song

Limitations of Assessing Active Learning Performance at Runtime

Classification algorithms aim to predict an unknown label (e.g., a quality class) for a new instance (e.g., a product). Therefore, training samples (instances and labels) are used to deduce classification hypotheses. Often, it is relatively…

Machine Learning · Computer Science 2019-01-30 Daniel Kottke, Jim Schellinger, Denis Huseljic, Bernhard Sick

Cost-Accuracy Aware Adaptive Labeling for Active Learning

Conventional active learning algorithms assume a single labeler that produces a noiseless label at a given, fixed cost, and aim to achieve the best generalization performance for a given classifier under a budget constraint. However, in many…

Machine Learning · Computer Science 2021-05-25 Ruijiang Gao, Maytal Saar-tsechansky

Frugal Reinforcement-based Active Learning

Most of the existing learning models, particularly deep neural networks, are reliant on large datasets whose hand-labeling is expensive and time demanding. A current trend is to make the learning of these models frugal and less dependent on…

Computer Vision and Pattern Recognition · Computer Science 2022-12-12 Sebastien Deschamps, Hichem Sahbi

A Data Management Approach for Dataset Selection Using Human Computation

As the number of applications that use machine learning algorithms increases, the need for labeled data useful for training such algorithms intensifies. Getting labels typically involves employing humans to do the annotation, which directly…

Machine Learning · Computer Science 2013-07-16 Alexandros Ntoulas, Omar Alonso, Vasilis Kandylas

Active clustering for labeling training data

Gathering training data is a key step of any supervised learning task, and it is both critical and expensive. Critical, because the quantity and quality of the training data have a high impact on the performance of the learned function.…

Data Structures and Algorithms · Computer Science 2021-10-28 Quentin Lutz, Élie de Panafieu, Alex Scott, Maya Stein

Nearly Optimal Active Preference Learning and Its Application to LLM Alignment

Aligning large language models (LLMs) depends on high-quality datasets of human preference labels, which are costly to collect. Although active learning has been studied to improve sample efficiency relative to passive collection, many…

Machine Learning · Computer Science 2026-02-03 Yao Zhao, Kwang-Sung Jun

Automated Algorithm Selection: Survey and Perspectives

It has long been observed that for practically any computational problem that has been intensely studied, different instances are best solved using different algorithms. This is particularly pronounced for computationally hard problems,…

Machine Learning · Computer Science 2018-11-29 Pascal Kerschke, Holger H. Hoos, Frank Neumann, Heike Trautmann

The Cost of Replicability in Active Learning

Active learning aims to reduce the number of labeled data points required by machine learning algorithms by selectively querying labels from initially unlabeled data. Ensuring replicability, where an algorithm produces consistent outcomes…

Machine Learning · Computer Science 2026-03-24 Rupkatha Hira, Dominik Kau, Jessica Sorrell

Practical Active Learning with Model Selection for Small Data

Active learning is of great interest for many practical applications, especially in industry and the physical sciences, where there is a strong need to minimize the number of costly experiments necessary to train predictive models. However,…

Machine Learning · Computer Science 2021-12-23 Maryam Pardakhti, Nila Mandal, Anson W. K. Ma, Qian Yang

Improving Automated Algorithm Selection by Advancing Fitness Landscape Analysis

Optimization is ubiquitous in our daily lives. In the past, (sub-)optimal solutions to any problem have been derived by trial and error, sheer luck, or the expertise of knowledgeable individuals. In our contemporary age, there thankfully…

Neural and Evolutionary Computing · Computer Science 2023-12-07 Raphael Patrick Prager

Hitting the Target: Stopping Active Learning at the Cost-Based Optimum

Active learning allows machine learning models to be trained using fewer labels while retaining similar performance to traditional supervised learning. An active learner selects the most informative data points, requests their labels, and…

Machine Learning · Computer Science 2023-11-22 Zac Pullar-Strecker, Katharina Dost, Eibe Frank, Jörg Wicker

Auditing: Active Learning with Outcome-Dependent Query Costs

We propose a learning setting in which unlabeled data is free, and the cost of a label depends on its value, which is not known in advance. We study binary classification in an extreme case, where the algorithm only pays for negative…

Machine Learning · Computer Science 2015-07-14 Sivan Sabato, Anand D. Sarwate, Nathan Srebro

Ranking Algorithms by Performance

A common way of doing algorithm selection is to train a machine learning model and predict the best algorithm from a portfolio to solve a particular problem. While this method has been highly successful, choosing only a single algorithm has…

Artificial Intelligence · Computer Science 2013-11-19 Lars Kotthoff

Radically Lower Data-Labeling Costs for Visually Rich Document Extraction Models

A key bottleneck in building automatic extraction models for visually rich documents like invoices is the cost of acquiring the several thousand high-quality labeled documents that are needed to train a model with acceptable accuracy. We…

Computation and Language · Computer Science 2022-11-01 Yichao Zhou, James B. Wendt, Navneet Potti, Jing Xie, Sandeep Tata

Automatic model training under restrictive time constraints

We develop a hyperparameter optimisation algorithm, Automated Budget Constrained Training (AutoBCT), which balances the quality of a model with the computational cost required to tune it. The relationship between hyperparameters, model…

Machine Learning · Statistics 2024-02-06 Lukas Cironis, Jan Palczewski, Georgios Aivaliotis

Algorithm Selection on a Meta Level

The problem of selecting an algorithm that appears most suitable for a specific instance of an algorithmic problem class, such as the Boolean satisfiability problem, is called instance-specific algorithm selection. Over the past decade, the…

Machine Learning · Computer Science 2021-07-21 Alexander Tornede, Lukas Gehring, Tanja Tornede, Marcel Wever, Eyke Hüllermeier

Reducing Confusion in Active Learning for Part-Of-Speech Tagging

Active learning (AL) uses a data selection algorithm to select useful training samples to minimize annotation cost. This is now an essential tool for building low-resource syntactic analyzers such as part-of-speech (POS) taggers. Existing…

Computation and Language · Computer Science 2020-11-24 Aditi Chaudhary, Antonios Anastasopoulos, Zaid Sheikh, Graham Neubig

Active Model Selection

Classical learning assumes the learner is given a labeled data sample, from which it learns a model. The field of Active Learning deals with the situation where the learner begins not with a training sample, but instead with resources that…

Machine Learning · Computer Science 2012-07-19 Omid Madani, Daniel J. Lizotte, Russell Greiner

Indexing Cost Sensitive Prediction

Predictive models are often used for real-time decision making. However, typical machine learning techniques ignore feature evaluation cost, and focus solely on the accuracy of the machine learning models obtained utilizing all the features…

Machine Learning · Computer Science 2014-08-19 Leilani Battle, Edward Benson, Aditya Parameswaran, Eugene Wu