Related papers: Multi-Armed Bandits in Metric Spaces

Bandits and Experts in Metric Spaces

In a multi-armed bandit problem, an online algorithm chooses from a set of strategies in a sequence of trials so as to maximize the total payoff of the chosen strategies. While the performance of bandit algorithms with a small finite…

Data Structures and Algorithms · Computer Science 2019-04-16 Robert Kleinberg, Aleksandrs Slivkins, Eli Upfal

Contextual Bandits with Similarity Information

In a multi-armed bandit (MAB) problem, an online algorithm makes a sequence of choices. In each round it chooses from a time-invariant set of alternatives and receives the payoff associated with this alternative. While the case of small…

Data Structures and Algorithms · Computer Science 2014-05-21 Aleksandrs Slivkins

Sharp Dichotomies for Regret Minimization in Metric Spaces

The Lipschitz multi-armed bandit (MAB) problem generalizes the classical multi-armed bandit problem by assuming one is given side information consisting of a priori upper bounds on the difference in expected payoff between certain pairs of…

Data Structures and Algorithms · Computer Science 2009-11-09 Robert Kleinberg, Aleksandrs Slivkins

Multi-armed Bandits with Cost Subsidy

In this paper, we consider a novel variant of the multi-armed bandit (MAB) problem, MAB with cost subsidy, which models many real-life applications where the learning agent has to pay to select an arm and is concerned about optimizing…

Machine Learning · Computer Science 2021-03-16 Deeksha Sinha, Karthik Abinav Sankararama, Abbas Kazerouni, Vashist Avadhanula

A Survey on Practical Applications of Multi-Armed and Contextual Bandits

In recent years, the multi-armed bandit (MAB) framework has attracted a lot of attention in various applications, from recommender systems and information retrieval to healthcare and finance, due to its stellar performance combined with certain…

Machine Learning · Computer Science 2019-04-24 Djallel Bouneffouf, Irina Rish

Multi-Armed Bandits in Brain-Computer Interfaces

The multi-armed bandit (MAB) problem models a decision-maker that optimizes its actions based on current and acquired new knowledge to maximize its reward. This type of online decision is prominent in many procedures of Brain-Computer…

Artificial Intelligence · Computer Science 2022-11-10 Frida Heskebeck, Carolina Bergeling, Bo Bernhardsson

Multiplayer Information Asymmetric Bandits in Metric Spaces

In this paper we study the Lipschitz bandit problem applied to the multiplayer information asymmetric problem studied in \cite{chang2022online, chang2023optimal}. More…

Machine Learning · Computer Science 2025-03-14 William Chang, Aditi Karthik

Solving Multi-Arm Bandit Using a Few Bits of Communication

The multi-armed bandit (MAB) problem is an active learning framework that aims to select the best among a set of actions by sequentially observing rewards. Recently, it has become popular for a number of applications over wireless networks,…

Machine Learning · Computer Science 2021-11-12 Osama A. Hanna, Lin F. Yang, Christina Fragouli

Optimal Algorithms for Range Searching over Multi-Armed Bandits

This paper studies a multi-armed bandit (MAB) version of the range-searching problem. In its basic form, range searching considers as input a set of points (on the real line) and a collection of (real) intervals. Here, with each specified…

Machine Learning · Computer Science 2021-05-05 Siddharth Barman, Ramakrishnan Krishnamurthy, Saladi Rahul

Stochastic Multi-Objective Multi-Armed Bandits: Regret Definition and Algorithm

Multi-armed bandit (MAB) problems are widely applied to online optimization tasks that require balancing exploration and exploitation. In practical scenarios, these tasks often involve multiple conflicting objectives, giving rise to…

Machine Learning · Computer Science 2025-06-17 Mansoor Davoodi, Setareh Maghsudi

A Gang of Bandits

Multi-armed bandit problems are receiving a great deal of attention because they adequately formalize the exploration-exploitation trade-offs arising in several industrially relevant applications, such as online advertisement and, more…

Machine Learning · Computer Science 2013-11-05 Nicolò Cesa-Bianchi, Claudio Gentile, Giovanni Zappella

Adversarial Bandit over Bandits: Hierarchical Bandits for Online Configuration Management

Motivated by dynamic parameter optimization in finite, but large action (configurations) spaces, this work studies the nonstochastic multi-armed bandit (MAB) problem in metric action spaces with oblivious Lipschitz adversaries. We propose…

Machine Learning · Computer Science 2025-05-27 Chen Avin, Zvi Lotker, Shie Mannor, Gil Shabat, Hanan Shteingart, Roey Yadgar

Best Arm Identification in Batched Multi-armed Bandit Problems

Recently, the multi-armed bandit problem has arisen in many real-life scenarios where arms must be sampled in batches, due to the limited time the agent can wait for feedback. Such applications include biological experimentation and online…

Machine Learning · Statistics 2023-12-22 Shengyu Cao, Simai He, Ruoqing Jiang, Jin Xu, Hongsong Yuan

Multi-armed Bandit Learning on a Graph

The multi-armed bandit (MAB) problem is a simple yet powerful framework that has been extensively studied in the context of decision-making under uncertainty. In many real-world applications, such as robotic applications, selecting an arm…

Machine Learning · Computer Science 2023-03-21 Tianpeng Zhang, Kasper Johansson, Na Li

Robust Batched Bandits

The batched multi-armed bandit (MAB) problem, in which rewards are collected in batches, is crucial for applications such as clinical trials. Existing research predominantly assumes light-tailed reward distributions, yet many real-world…

Machine Learning · Computer Science 2026-03-24 Yunwen Guo, Yunlun Shu, Gongyi Zhuo, Tianyu Wang

Multiarmed Bandits Problem Under the Mean-Variance Setting

The classical multi-armed bandit (MAB) problem involves a learner and a collection of K independent arms, each with its own ex ante unknown independent reward distribution. At each one of a finite number of rounds, the learner selects one…

Optimization and Control · Mathematics 2024-05-07 Hongda Hu, Arthur Charpentier, Mario Ghossoub, Alexander Schied

Speed Up the Cold-Start Learning in Two-Sided Bandits with Many Arms

Multi-armed bandit (MAB) algorithms are efficient approaches to reduce the opportunity cost of online experimentation and are used by companies to find the best product from periodically refreshed product catalogs. However, these algorithms…

Machine Learning · Computer Science 2024-12-19 Mohsen Bayati, Junyu Cao, Wanning Chen

Multi-Armed Bandits with Minimum Aggregated Revenue Constraints

We examine a multi-armed bandit problem with contextual information, where the objective is to ensure that each arm receives a minimum aggregated reward across contexts while simultaneously maximizing the total cumulative reward. This…

Machine Learning · Computer Science 2025-10-15 Ahmed Ben Yahmed, Hafedh El Ferchichi, Marc Abeille, Vianney Perchet

Dynamic Multi-Arm Bandit Game Based Multi-Agents Spectrum Sharing Strategy Design

For a wireless avionics communication system, a multi-arm bandit game is mathematically formulated, which includes channel states, strategies, and rewards. The simple case includes only two agents sharing the spectrum which is fully studied…

Signal Processing · Electrical Eng. & Systems 2017-11-15 Jingyang Lu, Lun Li, Dan Shen, Genshe Chen, Bin Jia, Erik Blasch, Khanh Pham

Introduction to Multi-Armed Bandits

Multi-armed bandits are a simple but very powerful framework for algorithms that make decisions over time under uncertainty. An enormous body of work has accumulated over the years, covered in several books and surveys. This book provides a…

Machine Learning · Computer Science 2024-04-05 Aleksandrs Slivkins