Related papers: An Efficient Bandit Algorithm for Realtime Multiva…

Enhancing Evolutionary Conversion Rate Optimization via Multi-armed Bandit Algorithms

Conversion rate optimization means designing web interfaces such that more visitors perform a desired action (such as register or purchase) on the site. One promising approach, implemented in Sentient Ascend, is to optimize the design using…

Neural and Evolutionary Computing · Computer Science 2018-11-19 Xin Qiu , Risto Miikkulainen

Adaptive Budget Optimization for Multichannel Advertising Using Combinatorial Bandits

Effective budget allocation is crucial for optimizing the performance of digital advertising campaigns. However, the development of practical budget allocation algorithms remain limited, primarily due to the lack of public datasets and…

Machine Learning · Computer Science 2025-02-06 Briti Gangopadhyay , Zhao Wang , Alberto Silvio Chiappa , Shingo Takamatsu

An Algorithmic Framework to Control Bias in Bandit-based Personalization

Personalization is pervasive in the online space as it leads to higher efficiency and revenue by allowing the most relevant content to be served to each user. However, recent studies suggest that personalization methods can propagate…

Machine Learning · Computer Science 2018-02-26 L. Elisa Celis , Sayash Kapoor , Farnood Salehi , Nisheeth K. Vishnoi

A Contextual-Bandit Approach to Personalized News Article Recommendation

Personalized web services strive to adapt their services (advertisements, news articles, etc) to individual users by making use of both content and user information. Despite a few recent advances, this problem remains challenging for at…

Machine Learning · Computer Science 2012-03-05 Lihong Li , Wei Chu , John Langford , Robert E. Schapire

Offline Local Search for Online Stochastic Bandits

Combinatorial multi-armed bandits provide a fundamental online decision-making environment where a decision-maker interacts with an environment across $T$ time steps, each time selecting an action and learning the cost of that action. The…

Machine Learning · Computer Science 2026-04-13 Gerdus Benadè , Rathish Das , Thomas Lavastida

Low-Rank Bandit Methods for High-Dimensional Dynamic Pricing

We consider dynamic pricing with many products under an evolving but low-dimensional demand model. Assuming the temporal variation in cross-elasticities exhibits low-rank structure based on fixed (latent) features of the products, we show…

Machine Learning · Computer Science 2019-09-12 Jonas Mueller , Vasilis Syrgkanis , Matt Taddy

Contextual Bandits with Budgeted Information Reveal

Contextual bandit algorithms are commonly used in digital health to recommend personalized treatments. However, to ensure the effectiveness of the treatments, patients are often requested to take actions that have no immediate benefit to…

Machine Learning · Computer Science 2024-03-14 Kyra Gan , Esmaeil Keyvanshokooh , Xueqing Liu , Susan Murphy

Optimizing Ranking Systems Online as Bandits

Ranking system is the core part of modern retrieval and recommender systems, where the goal is to rank candidate items given user contexts. Optimizing ranking systems online means that the deployed system can serve user requests, e.g.,…

Information Retrieval · Computer Science 2021-10-13 Chang Li

A Contextual Bandit Bake-off

Contextual bandit algorithms are essential for solving many real-world interactive machine learning problems. Despite multiple recent successes on statistically and computationally efficient methods, the practical behavior of these…

Machine Learning · Statistics 2021-06-08 Alberto Bietti , Alekh Agarwal , John Langford

Divide and Learn: Multi-Objective Combinatorial Optimization at Scale

Multi-objective combinatorial optimization seeks Pareto-optimal solutions over exponentially large discrete spaces, yet existing methods sacrifice generality, scalability, or theoretical guarantees. We reformulate it as an online learning…

Machine Learning · Computer Science 2026-02-13 Esha Singh , Dongxia Wu , Chien-Yi Yang , Tajana Rosing , Rose Yu , Yi-An Ma

Online Variance Reduction for Stochastic Optimization

Modern stochastic optimization methods often rely on uniform sampling which is agnostic to the underlying characteristics of the data. This might degrade the convergence by yielding estimates that suffer from a high variance. A possible…

Machine Learning · Statistics 2018-06-07 Zalán Borsos , Andreas Krause , Kfir Y. Levy

Dynamic Matching Bandit For Two-Sided Online Markets

Two-sided online matching platforms are employed in various markets. However, agents' preferences in the current market are usually implicit and unknown, thus needing to be learned from data. With the growing availability of dynamic side…

Machine Learning · Computer Science 2024-05-30 Yuantong Li , Chi-hua Wang , Guang Cheng , Will Wei Sun

Interactive Recommendations for Optimal Allocations in Markets with Constraints

Recommendation systems when employed in markets play a dual role: they assist users in selecting their most desired items from a large pool and they help in allocating a limited number of items to the users who desire them the most. Despite…

Machine Learning · Computer Science 2022-08-01 Yigit Efe Erginbas , Soham Phade , Kannan Ramchandran

Productization Challenges of Contextual Multi-Armed Bandits

Contextual Multi-Armed Bandits is a well-known and accepted online optimization algorithm, that is used in many Web experiences to tailor content or presentation to users' traffic. Much has been published on theoretical guarantees (e.g.…

Information Retrieval · Computer Science 2019-07-12 David Abensur , Ivan Balashov , Shaked Bar , Ronny Lempel , Nurit Moscovici , Ilan Orlov , Danny Rosenstein , Ido Tamir

A Gang of Bandits

Multi-armed bandit problems are receiving a great deal of attention because they adequately formalize the exploration-exploitation trade-offs arising in several industrially relevant applications, such as online advertisement and, more…

Machine Learning · Computer Science 2013-11-05 Nicolò Cesa-Bianchi , Claudio Gentile , Giovanni Zappella

Adaptively Learning to Select-Rank in Online Platforms

Ranking algorithms are fundamental to various online platforms across e-commerce sites to content streaming services. Our research addresses the challenge of adaptively ranking items from a candidate pool for heterogeneous users, a key…

Machine Learning · Computer Science 2024-06-10 Jingyuan Wang , Perry Dong , Ying Jin , Ruohan Zhan , Zhengyuan Zhou

Semi-Bandit Learning for Monotone Stochastic Optimization

Stochastic optimization is a widely used approach for optimization under uncertainty, where uncertain input parameters are modeled by random variables. Exact or approximation algorithms have been obtained for several fundamental problems in…

Machine Learning · Computer Science 2025-08-14 Arpit Agarwal , Rohan Ghuge , Viswanath Nagarajan , Zhengjia Zhuo

Applying Multi-armed Bandit Algorithms to Computational Advertising

Over the last two decades, we have seen extensive industrial research in the area of computational advertising. In this paper, our goal is to study the performance of various online learning algorithms to identify and display the best…

Information Retrieval · Computer Science 2020-11-24 Kazem Jahanbakhsh

Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization

Performance of machine learning algorithms depends critically on identifying a good set of hyperparameters. While recent approaches use Bayesian optimization to adaptively select configurations, we focus on speeding up random search through…

Machine Learning · Computer Science 2018-06-20 Lisha Li , Kevin Jamieson , Giulia DeSalvo , Afshin Rostamizadeh , Ameet Talwalkar

A Contextual-bandit-based Approach for Informed Decision-making in Clinical Trials

Clinical trials involving multiple treatments utilize randomization of the treatment assignments to enable the evaluation of treatment efficacies in an unbiased manner. Such evaluation is performed in post hoc studies that usually use…

Artificial Intelligence · Computer Science 2018-09-10 Yogatheesan Varatharajah , Brent Berry , Sanmi Koyejo , Ravishankar Iyer