Related papers: Reinforcement-based Simultaneous Algorithm and its…

Auto-WEKA: Combined Selection and Hyperparameter Optimization of Classification Algorithms

Many different machine learning algorithms exist; taking into account each algorithm's hyperparameters, there is a staggeringly large number of possible alternatives overall. We consider the problem of simultaneously selecting a learning…

Machine Learning · Computer Science 2013-03-08 Chris Thornton , Frank Hutter , Holger H. Hoos , Kevin Leyton-Brown

Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization

Performance of machine learning algorithms depends critically on identifying a good set of hyperparameters. While recent approaches use Bayesian optimization to adaptively select configurations, we focus on speeding up random search through…

Machine Learning · Computer Science 2018-06-20 Lisha Li , Kevin Jamieson , Giulia DeSalvo , Afshin Rostamizadeh , Ameet Talwalkar

Algorithm Design and Stronger Guarantees for the Improving Multi-Armed Bandits Problem

The improving multi-armed bandits problem is a formal model for allocating effort under uncertainty, motivated by scenarios such as investing research effort into new technologies, performing clinical trials, and hyperparameter selection…

Machine Learning · Computer Science 2026-05-22 Avrim Blum , Marten Garicano , Kavya Ravichandran , Dravyansh Sharma

An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization

The evaluation of hyperparameters, neural architectures, or data augmentation policies becomes a critical model selection problem in advanced deep learning with a large hyperparameter search space. In this paper, we propose an efficient and…

Machine Learning · Statistics 2020-12-17 Yimin Huang , Yujun Li , Hanrong Ye , Zhenguo Li , Zhihua Zhang

Bayesian Optimization for Categorical and Category-Specific Continuous Inputs

Many real-world functions are defined over both categorical and category-specific continuous variables and thus cannot be optimized by traditional Bayesian optimization (BO) methods. To optimize such functions, we propose a new method that…

Machine Learning · Computer Science 2019-12-02 Dang Nguyen , Sunil Gupta , Santu Rana , Alistair Shilton , Svetha Venkatesh

Robust and Performance Incentivizing Algorithms for Multi-Armed Bandits with Strategic Agents

Motivated by applications such as online labor markets we consider a variant of the stochastic multi-armed bandit problem where we have a collection of arms representing strategic agents with different performance characteristics. The…

Computer Science and Game Theory · Computer Science 2025-03-11 Seyed A. Esmaeili , Suho Shin , Aleksandrs Slivkins

Towards Autonomous Reinforcement Learning: Automatic Setting of Hyper-parameters using Bayesian Optimization

With the increase of machine learning usage by industries and scientific communities in a variety of tasks such as text mining, image recognition and self-driving cars, automatic setting of hyper-parameter in learning algorithms is a key…

Artificial Intelligence · Computer Science 2018-05-15 Juan Cruz Barsce , Jorge A. Palombarini , Ernesto C. Martínez

The Max $K$-Armed Bandit: A PAC Lower Bound and tighter Algorithms

We consider the Max $K$-Armed Bandit problem, where a learning agent is faced with several sources (arms) of items (rewards), and interested in finding the best item overall. At each time step the agent chooses an arm, and obtains a random…

Machine Learning · Statistics 2015-08-25 Yahel David , Nahum Shimkin

Reinforcement Learning Methods for Neighborhood Selection in Local Search

Reinforcement learning has recently gained traction as a means to improve combinatorial optimization methods, yet its effectiveness within local search metaheuristics specifically remains comparatively underexamined. In this study, we…

Machine Learning · Computer Science 2026-01-14 Yannick Molinghen , Augustin Delecluse , Renaud De Landtsheer , Stefano Michelini

HAMLET -- A Learning Curve-Enabled Multi-Armed Bandit for Algorithm Selection

Automated algorithm selection and hyperparameter tuning facilitates the application of machine learning. Traditional multi-armed bandit strategies look to the history of observed rewards to identify the most promising arms for optimizing…

Machine Learning · Computer Science 2020-05-29 Mischa Schmidt , Julia Gastinger , Sébastien Nicolas , Anett Schülke

Combinatorial Multi-armed Bandits: Arm Selection via Group Testing

This paper considers the problem of combinatorial multi-armed bandits with semi-bandit feedback and a cardinality constraint on the super-arm size. Existing algorithms for solving this problem typically involve two key sub-routines: (1) a…

Machine Learning · Computer Science 2025-08-14 Arpan Mukherjee , Shashanka Ubaru , Keerthiram Murugesan , Karthikeyan Shanmugam , Ali Tajer

Non-stochastic Best Arm Identification and Hyperparameter Optimization

Motivated by the task of hyperparameter optimization, we introduce the non-stochastic best-arm identification problem. Within the multi-armed bandit literature, the cumulative regret objective enjoys algorithms and analyses for both the…

Machine Learning · Computer Science 2015-03-02 Kevin Jamieson , Ameet Talwalkar

A Bandit Framework for Optimal Selection of Reinforcement Learning Agents

Deep Reinforcement Learning has been shown to be very successful in complex games, e.g. Atari or Go. These games have clearly defined rules, and hence allow simulation. In many practical applications, however, interactions with the…

Machine Learning · Computer Science 2019-02-12 Andreas Merentitis , Kashif Rasul , Roland Vollgraf , Abdul-Saboor Sheikh , Urs Bergmann

An Algorithm for Fixed Budget Best Arm Identification with Combinatorial Exploration

We consider the best arm identification (BAI) problem in the $K-$armed bandit framework with a modification - the agent is allowed to play a subset of arms at each time slot instead of one arm. Consequently, the agent observes the sample…

Machine Learning · Computer Science 2026-01-30 Siddhartha Parupudi , Gourab Ghatak

Best Arm Identification in Linked Bandits

We consider the problem of best arm identification in a variant of multi-armed bandits called linked bandits. In a single interaction with linked bandits, multiple arms are played sequentially until one of them receives a positive reward.…

Machine Learning · Computer Science 2019-01-29 Anant Gupta

Asynchronous Parallel Empirical Variance Guided Algorithms for the Thresholding Bandit Problem

This paper considers the multi-armed thresholding bandit problem -- identifying all arms whose expected rewards are above a predefined threshold via as few pulls (or rounds) as possible -- proposed by Locatelli et al. [2016] recently.…

Machine Learning · Statistics 2017-07-11 Jie Zhong , Yijun Huang , Ji Liu

Neural Dueling Bandits: Preference-Based Optimization with Human Feedback

Contextual dueling bandit is used to model the bandit problems, where a learner's goal is to find the best arm for a given context using observed noisy human preference feedback over the selected arms for the past contexts. However,…

Machine Learning · Computer Science 2025-04-17 Arun Verma , Zhongxiang Dai , Xiaoqiang Lin , Patrick Jaillet , Bryan Kian Hsiang Low

Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit

An automatic machine learning (AutoML) task is to select the best algorithm and its hyper-parameters simultaneously. Previously, the hyper-parameters of all algorithms are joint as a single search space, which is not only huge but also…

Machine Learning · Computer Science 2019-06-03 Yi-Qi Hu , Yang Yu , Jun-Da Liao

Best Arm Identification in Batched Multi-armed Bandit Problems

Recently multi-armed bandit problem arises in many real-life scenarios where arms must be sampled in batches, due to limited time the agent can wait for the feedback. Such applications include biological experimentation and online…

Machine Learning · Statistics 2023-12-22 Shengyu Cao , Simai He , Ruoqing Jiang , Jin Xu , Hongsong Yuan

HyperArm Bandit Optimization: A Novel approach to Hyperparameter Optimization and an Analysis of Bandit Algorithms in Stochastic and Adversarial Settings

This paper explores the application of bandit algorithms in both stochastic and adversarial settings, with a focus on theoretical analysis and practical applications. The study begins by introducing bandit problems, distinguishing between…

Machine Learning · Computer Science 2025-03-14 Samih Karroum , Saad Mazhar