Related papers: Machine Learning for Online Algorithm Selection un…

Run2Survive: A Decision-theoretic Approach to Algorithm Selection based on Survival Analysis

Algorithm selection (AS) deals with the automatic selection of an algorithm from a fixed set of candidate algorithms most suitable for a specific instance of an algorithmic problem class, where "suitability" often refers to an algorithm's…

Machine Learning · Computer Science 2020-07-13 Alexander Tornede , Marcel Wever , Stefan Werner , Felix Mohr , Eyke Hüllermeier

Reinforcement Learning Algorithm Selection

This paper formalises the problem of online algorithm selection in the context of Reinforcement Learning. The setup is as follows: given an episodic task and a finite number of off-policy RL algorithms, a meta-algorithm has to decide which…

Machine Learning · Statistics 2017-11-16 Romain Laroche , Raphael Feraud

Continuous-Time Multi-Armed Bandits with Controlled Restarts

Time-constrained decision processes have been ubiquitous in many fundamental applications in physics, biology and computer science. Recently, restart strategies have gained significant attention for boosting the efficiency of…

Machine Learning · Computer Science 2020-07-02 Semih Cayci , Atilla Eryilmaz , R. Srikant

Algorithm Selection as a Bandit Problem with Unbounded Losses

Algorithm selection is typically based on models of algorithm performance, learned during a separate offline training sequence, which can be prohibitively expensive. In recent work, we adopted an online approach, in which a performance…

Artificial Intelligence · Computer Science 2013-01-31 Matteo Gagliolo , Juergen Schmidhuber

Online and Scalable Model Selection with Multi-Armed Bandits

Many online applications running on live traffic are powered by machine learning models, for which training, validation, and hyper-parameter tuning are conducted on historical data. However, it is common for models demonstrating strong…

Machine Learning · Computer Science 2021-01-27 Jiayi Xie , Michael Tashman , John Hoffman , Lee Winikor , Rouzbeh Gerami

Towards Meta-Algorithm Selection

Instance-specific algorithm selection (AS) deals with the automatic selection of an algorithm from a fixed set of candidates most suitable for a specific instance of an algorithmic problem class, where "suitability" often refers to an…

Machine Learning · Computer Science 2020-11-18 Alexander Tornede , Marcel Wever , Eyke Hüllermeier

Extreme Algorithm Selection With Dyadic Feature Representation

Algorithm selection (AS) deals with selecting an algorithm from a fixed set of candidate algorithms most suitable for a specific instance of an algorithmic problem, e.g., choosing solvers for SAT problems. Benchmark suites for AS usually…

Machine Learning · Computer Science 2020-10-23 Alexander Tornede , Marcel Wever , Eyke Hüllermeier

When Switching Algorithms Helps: A Theoretical Study of Online Algorithm Selection

Online algorithm selection (OAS) aims to adapt the optimization process to changes in the fitness landscape and is expected to outperform any single algorithm from a given portfolio. Although this expectation is supported by numerous…

Neural and Evolutionary Computing · Computer Science 2026-04-10 Denis Antipov , Carola Doerr

Incorporating Behavioral Constraints in Online AI Systems

AI systems that learn through reward feedback about the actions they take are increasingly deployed in domains that have significant impact on our daily life. However, in many cases the online rewards should not be the only guiding…

Artificial Intelligence · Computer Science 2018-09-18 Avinash Balakrishnan , Djallel Bouneffouf , Nicholas Mattei , Francesca Rossi

Neural Model-based Optimization with Right-Censored Observations

In many fields of study, we only observe lower bounds on the true response value of some experiments. When fitting a regression model to predict the distribution of the outcomes, we cannot simply drop these right-censored observations, but…

Artificial Intelligence · Computer Science 2020-09-30 Katharina Eggensperger , Kai Haase , Philipp Müller , Marius Lindauer , Frank Hutter

Optimal Recommendation to Users that React: Online Learning for a Class of POMDPs

We describe and study a model for an Automated Online Recommendation System (AORS) in which a user's preferences can be time-dependent and can also depend on the history of past recommendations and play-outs. The three key features of the…

Machine Learning · Computer Science 2016-03-31 Rahul Meshram , Aditya Gopalan , D. Manjunath

Online Learning and Bandits with Queried Hints

We consider the classic online learning and stochastic multi-armed bandit (MAB) problems, when at each step, the online policy can probe and find out which of a small number ($k$) of choices has better reward (or loss) before making its…

Data Structures and Algorithms · Computer Science 2022-11-08 Aditya Bhaskara , Sreenivas Gollapudi , Sungjin Im , Kostas Kollias , Kamesh Munagala

Multi-armed Bandits with Cost Subsidy

In this paper, we consider a novel variant of the multi-armed bandit (MAB) problem, MAB with cost subsidy, which models many real-life applications where the learning agent has to pay to select an arm and is concerned about optimizing…

Machine Learning · Computer Science 2021-03-16 Deeksha Sinha , Karthik Abinav Sankararama , Abbas Kazerouni , Vashist Avadhanula

Trading-off price for data quality to achieve fair online allocation

We consider the problem of online allocation subject to a long-term fairness penalty. Contrary to existing works, however, we do not assume that the decision-maker observes the protected attributes -- which is often unrealistic in practice.…

Machine Learning · Computer Science 2023-12-05 Mathieu Molina , Nicolas Gast , Patrick Loiseau , Vianney Perchet

Safe Linear Thompson Sampling with Side Information

The design and performance analysis of bandit algorithms in the presence of stage-wise safety or reliability constraints has recently garnered significant interest. In this work, we consider the linear stochastic bandit problem under…

Machine Learning · Computer Science 2020-03-03 Ahmadreza Moradipari , Sanae Amani , Mahnoosh Alizadeh , Christos Thrampoulidis

Satisficing in Time-Sensitive Bandit Learning

Much of the recent literature on bandit learning focuses on algorithms that aim to converge on an optimal action. One shortcoming is that this orientation does not account for time sensitivity, which can play a crucial role when learning an…

Machine Learning · Computer Science 2020-01-09 Daniel Russo , Benjamin Van Roy

Online Uniform Sampling: Randomized Learning-Augmented Approximation Algorithms with Application to Digital Health

Motivated by applications in digital health, this work studies the novel problem of online uniform sampling (OUS), where the goal is to distribute a sampling budget uniformly across unknown decision times. In the OUS problem, the algorithm…

Machine Learning · Computer Science 2024-10-22 Xueqing Liu , Kyra Gan , Esmaeil Keyvanshokooh , Susan Murphy

Linear Bandits with Stochastic Delayed Feedback

Stochastic linear bandits are a natural and well-studied model for structured exploration/exploitation problems and are widely used in applications such as online marketing and recommendation. One of the main challenges faced by…

Machine Learning · Statistics 2020-03-03 Claire Vernade , Alexandra Carpentier , Tor Lattimore , Giovanni Zappella , Beyza Ermis , Michael Brueckner

Active Learning with Safety Constraints

Active learning methods have shown great promise in reducing the number of samples necessary for learning. As automated learning systems are adopted into real-time, real-world decision-making pipelines, it is increasingly important that…

Machine Learning · Computer Science 2022-06-23 Romain Camilleri , Andrew Wagenmaker , Jamie Morgenstern , Lalit Jain , Kevin Jamieson

Semi-Bandit Learning for Monotone Stochastic Optimization

Stochastic optimization is a widely used approach for optimization under uncertainty, where uncertain input parameters are modeled by random variables. Exact or approximation algorithms have been obtained for several fundamental problems in…

Machine Learning · Computer Science 2025-08-14 Arpit Agarwal , Rohan Ghuge , Viswanath Nagarajan , Zhengjia Zhuo