Related papers: Characterizing Truthful Multi-Armed Bandit Mechani…

Multi-Armed Bandit Mechanisms for Multi-Slot Sponsored Search Auctions

In pay-per click sponsored search auctions which are currently extensively used by search engines, the auction for a keyword involves a certain number of advertisers (say k) competing for available slots (say m) to display their ads. This…

Computer Science and Game Theory · Computer Science 2010-01-17 Akash Das Sarma , Sujit Gujar , Y. Narahari

Designing Truthful Contextual Multi-Armed Bandits based Sponsored Search Auctions

For sponsored search auctions, we consider contextual multi-armed bandit problem in the presence of strategic agents. In this setting, at each round, an advertising platform (center) runs an auction to select the best-suited ads relevant to…

Computer Science and Game Theory · Computer Science 2020-02-27 Kumar Abhishek , Shweta Jain , Sujit Gujar

Fairness of Exposure in Stochastic Bandits

Contextual bandit algorithms have become widely used for recommendation in online systems (e.g. marketplaces, music streaming, news), where they now wield substantial influence on which items get exposed to the users. This raises questions…

Machine Learning · Computer Science 2021-09-14 Lequn Wang , Yiwei Bai , Wen Sun , Thorsten Joachims

No-Regret Online Autobidding Algorithms in First-price Auctions

Automated bidding to optimize online advertising with various constraints, e.g. ROI constraints and budget constraints, is widely adopted by advertisers. A key challenge lies in designing algorithms for non-truthful mechanisms with ROI…

Computer Science and Game Theory · Computer Science 2025-10-21 Yuan Deng , Yilin Li , Wei Tang , Hanrui Zhang

Truthful Auctions for Automated Bidding in Online Advertising

Automated bidding, an emerging intelligent decision making paradigm powered by machine learning, has become popular in online advertising. Advertisers in automated bidding evaluate the cumulative utilities and have private financial…

Computer Science and Game Theory · Computer Science 2023-08-22 Yidan Xing , Zhilin Zhang , Zhenzhe Zheng , Chuan Yu , Jian Xu , Fan Wu , Guihai Chen

Truthful Reverse Auctions for Adaptive Selection via Contextual Multi-Armed Bandits

We study the problem of selecting large language models (LLMs) for user queries in settings where multiple LLM providers submit the cost of solving a query. From the users' perspective, choosing an optimal model is a sequential,…

Computer Science and Game Theory · Computer Science 2026-02-17 Pronoy Patra , Sankarshan Damle , Manisha Padala , Sujit Gujar

Truthfulness with Value-Maximizing Bidders

In markets such as digital advertising auctions, bidders want to maximize value rather than payoff. This is different to the utility functions typically assumed in auction theory and leads to different strategies and outcomes. We refer to…

Computer Science and Game Theory · Computer Science 2016-07-14 Salman Fadaei , Martin Bichler

Truthful Learning Mechanisms for Multi-Slot Sponsored Search Auctions with Externalities

Sponsored search auctions constitute one of the most successful applications of microeconomic mechanisms. In mechanism design, auctions are usually designed to incentivize advertisers to bid their truthful valuations and to assure both the…

Computer Science and Game Theory · Computer Science 2014-05-13 Nicola Gatti , Alessandro Lazaric , Marco Rocco , Francesco Trovò

Multi-agent Multi-armed Bandits with Minimum Reward Guarantee Fairness

We investigate the problem of maximizing social welfare while ensuring fairness in a multi-agent multi-armed bandit (MA-MAB) setting. In this problem, a centralized decision-maker takes actions over time, generating random rewards for…

Machine Learning · Computer Science 2025-06-23 Piyushi Manupriya , Himanshu , SakethaNath Jagarlapudi , Ganesh Ghalme

Response Prediction for Low-Regret Agents

Companies like Google and Microsoft run billions of auctions every day to sell advertising opportunities. Any change to the rules of these auctions can have a tremendous effect on the revenue of the company and the welfare of the…

Computer Science and Game Theory · Computer Science 2019-11-07 Saeed Alaei , Ashwinkumar Badanidiyuru , Mohammad Mahdian , Sadra Yazdanbod

A Dominant Strategy Truthful, Deterministic Multi-Armed Bandit Mechanism with Logarithmic Regret

Stochastic multi-armed bandit (MAB) mechanisms are widely used in sponsored search auctions, crowdsourcing, online procurement, etc. Existing stochastic MAB mechanisms with a deterministic payment rule, proposed in the literature,…

Computer Science and Game Theory · Computer Science 2020-06-01 Divya Padmanabhan , Satyanath Bhat , Prabuchandran K. J. , Shirish Shevade , Y. Narahari

Autobidders with Budget and ROI Constraints: Efficiency, Regret, and Pacing Dynamics

We study a game between autobidding algorithms that compete in an online advertising platform. Each autobidder is tasked with maximizing its advertiser's total value over multiple rounds of a repeated auction, subject to budget and…

Computer Science and Game Theory · Computer Science 2024-12-03 Brendan Lucier , Sarath Pattathil , Aleksandrs Slivkins , Mengxiao Zhang

Mechanism Design with Moral Bidders

A rapidly growing literature on lying in behavioral economics and psychology shows that individuals often do not lie even when lying maximizes their utility. In this work, we attempt to incorporate these findings into the theory of…

Computer Science and Game Theory · Computer Science 2021-11-23 Shahar Dobzinski , Sigal Oren

Auctions Between Regret-Minimizing Agents

We analyze a scenario in which software agents implemented as regret-minimizing algorithms engage in a repeated auction on behalf of their users. We study first-price and second-price auctions, as well as their generalized versions (e.g.,…

Computer Science and Game Theory · Computer Science 2022-03-28 Yoav Kolumbus , Noam Nisan

On the Robustness of Epoch-Greedy in Multi-Agent Contextual Bandit Mechanisms

Efficient learning in multi-armed bandit mechanisms such as pay-per-click (PPC) auctions typically involves three challenges: 1) inducing truthful bidding behavior (incentives), 2) using personalization in the users (context), and 3)…

Machine Learning · Computer Science 2023-07-18 Yinglun Xu , Bhuvesh Kumar , Jacob Abernethy

VCG Mechanism Design with Unknown Agent Values under Stochastic Bandit Feedback

We study a multi-round welfare-maximising mechanism design problem in instances where agents do not know their values. On each round, a mechanism first assigns an allocation each to a set of agents and charges them a price; at the end of…

Machine Learning · Statistics 2022-01-25 Kirthevasan Kandasamy , Joseph E. Gonzalez , Michael I. Jordan , Ion Stoica

Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting

We consider the classical multi-armed bandit problem, but with strategic arms. In this context, each arm is characterized by a bounded support reward distribution and strategically aims to maximize its own utility by potentially retaining a…

Machine Learning · Computer Science 2025-01-28 Ahmed Ben Yahmed , Clément Calauzènes , Vianney Perchet

No-Regret Algorithms in non-Truthful Auctions with Budget and ROI Constraints

Advertisers increasingly use automated bidding to optimize their ad campaigns on online advertising platforms. Autobidding optimizes an advertiser's objective subject to various constraints, e.g. average ROI and budget constraints. In this…

Computer Science and Game Theory · Computer Science 2024-04-16 Gagan Aggarwal , Giannis Fikioris , Mingfei Zhao

Budget Pacing in Repeated Auctions: Regret and Efficiency without Convergence

We study the aggregate welfare and individual regret guarantees of dynamic \emph{pacing algorithms} in the context of repeated auctions with budgets. Such algorithms are commonly used as bidding agents in Internet advertising platforms,…

Computer Science and Game Theory · Computer Science 2026-01-06 Jason Gaitonde , Yingkai Li , Bar Light , Brendan Lucier , Aleksandrs Slivkins

Trading off rewards and errors in multi-armed bandits

In multi-armed bandits, the most-explored arms are the most informative, while reward maximization typically pulls only the best arm. We study the tradeoff between identifying arm means accurately and accumulating reward, and present an…

Machine Learning · Computer Science 2026-05-04 Akram Erraqabi , Alessandro Lazaric , Michal Valko , Emma Brunskill , Yun-En Liu