Related papers: Bandits attack function optimization

Bayesian Optimization for Categorical and Category-Specific Continuous Inputs

Many real-world functions are defined over both categorical and category-specific continuous variables and thus cannot be optimized by traditional Bayesian optimization (BO) methods. To optimize such functions, we propose a new method that…

Machine Learning · Computer Science 2019-12-02 Dang Nguyen , Sunil Gupta , Santu Rana , Alistair Shilton , Svetha Venkatesh

Simulation optimization: A review of algorithms and applications

Simulation Optimization (SO) refers to the optimization of an objective function subject to constraints, both of which can be evaluated through a stochastic simulation. To address specific features of a particular simulation---discrete or…

Data Structures and Algorithms · Computer Science 2017-06-28 Satyajith Amaran , Nikolaos V. Sahinidis , Bikram Sharda , Scott J. Bury

The best of both worlds: stochastic and adversarial bandits

We present a new bandit algorithm, SAO (Stochastic and Adversarial Optimal), whose regret is, essentially, optimal both for adversarial rewards and for stochastic rewards. Specifically, SAO combines the square-root worst-case regret of Exp3…

Machine Learning · Computer Science 2012-02-22 Sebastien Bubeck , Aleksandrs Slivkins

Stochastic simultaneous optimistic optimization

We study the problem of global maximization of a function f given a finite number of evaluations perturbed by noise. We consider a very weak assumption on the function, namely that it is locally smooth (in some precise sense) with respect…

Machine Learning · Computer Science 2026-04-28 Michal Valko , Alexandra Carpentier , Rémi Munos

Continuum-armed Bandit Optimization with Batch Pairwise Comparison Oracles

This paper studies a bandit optimization problem where the goal is to maximize a function $f(x)$ over $T$ periods for some unknown strongly concave function $f$. We consider a new pairwise comparison oracle, where the decision-maker chooses…

Machine Learning · Computer Science 2025-05-29 Xiangyu Chang , Xi Chen , Yining Wang , Zhiyi Zeng

Offline Local Search for Online Stochastic Bandits

Combinatorial multi-armed bandits provide a fundamental online decision-making environment where a decision-maker interacts with an environment across $T$ time steps, each time selecting an action and learning the cost of that action. The…

Machine Learning · Computer Science 2026-04-13 Gerdus Benadè , Rathish Das , Thomas Lavastida

Divide and Learn: Multi-Objective Combinatorial Optimization at Scale

Multi-objective combinatorial optimization seeks Pareto-optimal solutions over exponentially large discrete spaces, yet existing methods sacrifice generality, scalability, or theoretical guarantees. We reformulate it as an online learning…

Machine Learning · Computer Science 2026-02-13 Esha Singh , Dongxia Wu , Chien-Yi Yang , Tajana Rosing , Rose Yu , Yi-An Ma

Multi-Objective Simultaneous Optimistic Optimization

Optimistic methods have been applied with success to single-objective optimization. Here, we attempt to bridge the gap between optimistic methods and multi-objective optimization. In particular, this paper is concerned with solving…

Optimization and Control · Mathematics 2016-12-28 Abdullah Al-Dujaili , S. Suresh

Semi-Bandit Learning for Monotone Stochastic Optimization

Stochastic optimization is a widely used approach for optimization under uncertainty, where uncertain input parameters are modeled by random variables. Exact or approximation algorithms have been obtained for several fundamental problems in…

Machine Learning · Computer Science 2025-08-14 Arpit Agarwal , Rohan Ghuge , Viswanath Nagarajan , Zhengjia Zhuo

Continuous-Time Multi-Armed Bandits with Controlled Restarts

Time-constrained decision processes have been ubiquitous in many fundamental applications in physics, biology and computer science. Recently, restart strategies have gained significant attention for boosting the efficiency of…

Machine Learning · Computer Science 2020-07-02 Semih Cayci , Atilla Eryilmaz , R. Srikant

Semi-bandit Optimization in the Dispersed Setting

The goal of data-driven algorithm design is to obtain high-performing algorithms for specific application domains using machine learning and data. Across many fields in AI, science, and engineering, practitioners will often fix a family of…

Machine Learning · Computer Science 2020-12-22 Maria-Florina Balcan , Travis Dick , Wesley Pegden

Communication-Efficient Federated Non-Linear Bandit Optimization

Federated optimization studies the problem of collaborative function optimization among multiple clients (e.g. mobile devices or organizations) under the coordination of a central server. Since the data is collected separately by each…

Machine Learning · Computer Science 2023-11-06 Chuanhao Li , Chong Liu , Yu-Xiang Wang

Matroid Bandits: Fast Combinatorial Optimization with Learning

A matroid is a notion of independence in combinatorial optimization which is closely related to computational efficiency. In particular, it is well known that the maximum of a constrained modular function can be found greedily if and only…

Machine Learning · Computer Science 2015-04-15 Branislav Kveton , Zheng Wen , Azin Ashkan , Hoda Eydgahi , Brian Eriksson

Selecting the Best Optimizing System

We formulate selecting the best optimizing system (SBOS) problems and provide solutions for those problems. In an SBOS problem, a finite number of systems are contenders. Inside each system, a continuous decision variable affects the…

Methodology · Statistics 2025-11-04 Nian Si , Yifu Tang , Zeyu Zheng

HyperArm Bandit Optimization: A Novel approach to Hyperparameter Optimization and an Analysis of Bandit Algorithms in Stochastic and Adversarial Settings

This paper explores the application of bandit algorithms in both stochastic and adversarial settings, with a focus on theoretical analysis and practical applications. The study begins by introducing bandit problems, distinguishing between…

Machine Learning · Computer Science 2025-03-14 Samih Karroum , Saad Mazhar

An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization

The evaluation of hyperparameters, neural architectures, or data augmentation policies becomes a critical model selection problem in advanced deep learning with a large hyperparameter search space. In this paper, we propose an efficient and…

Machine Learning · Statistics 2020-12-17 Yimin Huang , Yujun Li , Hanrong Ye , Zhenguo Li , Zhihua Zhang

Algorithm Selection as a Bandit Problem with Unbounded Losses

Algorithm selection is typically based on models of algorithm performance, learned during a separate offline training sequence, which can be prohibitively expensive. In recent work, we adopted an online approach, in which a performance…

Artificial Intelligence · Computer Science 2013-01-31 Matteo Gagliolo , Juergen Schmidhuber

Bayesian Optimization -- Multi-Armed Bandit Problem

In this report, we survey Bayesian Optimization methods focussed on the Multi-Armed Bandit Problem. We take the help of the paper "Portfolio Allocation for Bayesian Optimization". We report a small literature survey on the acquisition…

Machine Learning · Computer Science 2020-12-16 Abhilash Nandy , Chandan Kumar , Deepak Mewada , Soumya Sharma

Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization

Performance of machine learning algorithms depends critically on identifying a good set of hyperparameters. While recent approaches use Bayesian optimization to adaptively select configurations, we focus on speeding up random search through…

Machine Learning · Computer Science 2018-06-20 Lisha Li , Kevin Jamieson , Giulia DeSalvo , Afshin Rostamizadeh , Ameet Talwalkar

The Minimal Search Space for Conditional Causal Bandits

Causal knowledge can be used to support decision-making problems. This has been recognized in the causal bandits literature, where a causal (multi-armed) bandit is characterized by a causal graphical model and a target variable. The arms…

Machine Learning · Computer Science 2025-10-14 Francisco N. F. Q. Simoes , Itai Feigenbaum , Mehdi Dastani , Thijs van Ommen