Related papers: Distributed No-Regret Learning in Multi-Agent Syst…

Distributed No-Regret Learning for Multi-Stage Systems with End-to-End Bandit Feedback

This paper studies multi-stage systems with end-to-end bandit feedback. In such systems, each job needs to go through multiple stages, each managed by a different agent, before generating an outcome. Each agent can only control its own…

Machine Learning · Computer Science 2024-08-20 I-Hong Hou

Distributed learning in congested environments with partial information

How can non-communicating agents learn to share congested resources efficiently? This is a challenging task when the agents can access the same resource simultaneously (in contrast to multi-agent multi-armed bandit problems) and the…

Multiagent Systems · Computer Science 2021-05-13 Tomer Boyarski , Amir Leshem , Vikram Krishnamurthy

Regret Minimization in Population Network Games: Vanishing Heterogeneity and Convergence to Equilibria

Understanding and predicting the behavior of large-scale multi-agents in games remains a fundamental challenge in multi-agent systems. This paper examines the role of heterogeneity in equilibrium formation by analyzing how smooth…

Computer Science and Game Theory · Computer Science 2025-07-24 Die Hu , Shuyue Hu , Chunjiang Mu , Shiqi Fan , Chen Chu , Jinzhuo Liu , Zhen Wang

Do LLM Agents Have Regret? A Case Study in Online Learning and Games

Large language models (LLMs) have been increasingly employed for (interactive) decision-making, via the development of LLM-based autonomous agents. Despite their emerging successes, the performance of LLM agents in decision-making has not…

Machine Learning · Computer Science 2025-10-16 Chanwoo Park , Xiangyu Liu , Asuman Ozdaglar , Kaiqing Zhang

On the Convergence of No-Regret Learning Dynamics in Time-Varying Games

Most of the literature on learning in games has focused on the restrictive setting where the underlying repeated game does not change over time. Much less is known about the convergence of no-regret learning algorithms in dynamic multiagent…

Machine Learning · Computer Science 2023-10-19 Ioannis Anagnostides , Ioannis Panageas , Gabriele Farina , Tuomas Sandholm

Learning to Recommend in Unknown Games

We study preference learning through recommendations in multi-agent game settings, where a moderator repeatedly interacts with agents whose utility functions are unknown. In each round, the moderator issues action recommendations and…

Computer Science and Game Theory · Computer Science 2026-03-06 Arwa Alanqary , Zakaria Baba , Manxi Wu , Alexandre M. Bayen

No-Regret Learning in Unknown Games with Correlated Payoffs

We consider the problem of learning to play a repeated multi-agent game with an unknown reward function. Single player online learning algorithms attain strong regret bounds when provided with full information feedback, which unfortunately…

Machine Learning · Computer Science 2019-10-29 Pier Giuseppe Sessa , Ilija Bogunovic , Maryam Kamgarpour , Andreas Krause

Multi-Agent Learning in Contextual Games under Unknown Constraints

We consider the problem of learning to play a repeated contextual game with unknown reward and unknown constraints functions. Such games arise in applications where each agent's action needs to belong to a feasible set, but the feasible set…

Computer Science and Game Theory · Computer Science 2024-05-27 Anna M. Maddux , Maryam Kamgarpour

Is Learning in Games Good for the Learners?

We consider a number of questions related to tradeoffs between reward and regret in repeated gameplay between two agents. To facilitate this, we introduce a notion of $\textit{generalized equilibrium}$ which allows for asymmetric regret…

Computer Science and Game Theory · Computer Science 2023-12-19 William Brown , Jon Schneider , Kiran Vodrahalli

Distributed Estimation of Dynamic Parameters : Regret Analysis

This paper addresses the estimation of a time- varying parameter in a network. A group of agents sequentially receive noisy signals about the parameter (or moving target), which does not follow any particular dynamics. The parameter is not…

Optimization and Control · Mathematics 2016-03-03 Shahin Shahrampour , Alexander Rakhlin , Ali Jadbabaie

Adaptive Learning in Continuous Games: Optimal Regret Bounds and Convergence to Nash Equilibrium

In game-theoretic learning, several agents are simultaneously following their individual interests, so the environment is non-stationary from each player's perspective. In this context, the performance of a learning algorithm is often…

Computer Science and Game Theory · Computer Science 2021-10-19 Yu-Guan Hsieh , Kimon Antonakopoulos , Panayotis Mertikopoulos

Bandit Learning in General Open Multi-agent Systems

Recent developments in digital platforms have highlighted the prevalence of open systems, where agents can arrive and depart over time. While bandit learning in open systems has recently received initial attention, existing work imposes…

Machine Learning · Computer Science 2026-05-08 Mengfan Xu

Learning in Time-Varying Monotone Network Games with Dynamic Populations

In this paper, we present a framework for multi-agent learning in a nonstationary dynamic network environment. More specifically, we examine projected gradient play in smooth monotone repeated network games in which the agents'…

Computer Science and Game Theory · Computer Science 2024-08-13 Feras Al Taha , Kiran Rokade , Francesca Parise

Learning not to Regret

The literature on game-theoretic equilibrium finding predominantly focuses on single games or their repeated play. Nevertheless, numerous real-world scenarios feature playing a game sampled from a distribution of similar, but not identical…

Computer Science and Game Theory · Computer Science 2024-02-21 David Sychrovský , Michal Šustr , Elnaz Davoodi , Michael Bowling , Marc Lanctot , Martin Schmid

A Regret Minimization Approach to Multi-Agent Control

We study the problem of multi-agent control of a dynamical system with known dynamics and adversarial disturbances. Our study focuses on optimal control without centralized precomputed policies, but rather with adaptive control policies for…

Optimization and Control · Mathematics 2022-07-27 Udaya Ghai , Udari Madhushani , Naomi Leonard , Elad Hazan

No-Regret and Incentive-Compatible Online Learning

We study online learning settings in which experts act strategically to maximize their influence on the learning algorithm's predictions by potentially misreporting their beliefs about a sequence of binary events. Our goal is twofold.…

Machine Learning · Computer Science 2020-07-02 Rupert Freeman , David M. Pennock , Chara Podimata , Jennifer Wortman Vaughan

Algorithms in Multi-Agent Systems: A Holistic Perspective from Reinforcement Learning and Game Theory

Deep reinforcement learning (RL) has achieved outstanding results in recent years, which has led a dramatic increase in the number of methods and applications. Recent works are exploring learning beyond single-agent scenarios and…

Computer Science and Game Theory · Computer Science 2020-02-03 Yunlong Lu , Kai Yan

Multi-agent learning under uncertainty: Recurrence vs. concentration

In this paper, we examine the convergence landscape of multi-agent learning under uncertainty. Specifically, we analyze two stochastic models of regularized learning in continuous games -- one in continuous and one in discrete time with the…

Computer Science and Game Theory · Computer Science 2025-12-10 Kyriakos Lotidis , Panayotis Mertikopoulos , Nicholas Bambos , Jose Blanchet

Distributed Computing with Adaptive Heuristics

We use ideas from distributed computing to study dynamic environments in which computational nodes, or decision makers, follow adaptive heuristics (Hart 2005), i.e., simple and unsophisticated rules of behavior, e.g., repeatedly "best…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-10-13 Aaron D. Jaggard , Michael Schapira , Rebecca N. Wright

No-Regret Learning in Games with Noisy Feedback: Faster Rates and Adaptivity via Learning Rate Separation

We examine the problem of regret minimization when the learner is involved in a continuous game with other optimizing agents: in this case, if all players follow a no-regret algorithm, it is possible to achieve significantly lower regret…

Computer Science and Game Theory · Computer Science 2023-03-20 Yu-Guan Hsieh , Kimon Antonakopoulos , Volkan Cevher , Panayotis Mertikopoulos