Related papers: Distributed learning in congested environments wit…

Distributed Online Learning for Joint Regret with Communication Constraints

We consider distributed online learning for joint regret with communication constraints. In this setting, there are multiple agents that are connected in a graph. Each round, an adversary first activates one of the agents to issue a…

Machine Learning · Computer Science 2021-10-26 Dirk van der Hoeven , Hédi Hadiji , Tim van Erven

Distributed Algorithms for Learning and Cognitive Medium Access with Logarithmic Regret

The problem of distributed learning and channel access is considered in a cognitive network with multiple secondary users. The availability statistics of the channels are initially unknown to the secondary users and are estimated using…

Networking and Internet Architecture · Computer Science 2016-11-17 Animashree Anandkumar , Nithin Michael , Ao Kevin Tang , Ananthram Swami

Order-Optimal Regret in Distributed Kernel Bandits using Uniform Sampling with Shared Randomness

We consider distributed kernel bandits where $N$ agents aim to collaboratively maximize an unknown reward function that lies in a reproducing kernel Hilbert space. Each agent sequentially queries the function to obtain noisy observations at…

Machine Learning · Computer Science 2024-02-21 Nikola Pavlovic , Sudeep Salgia , Qing Zhao

Online Learning in Decentralized Multiuser Resource Sharing Problems

In this paper, we consider the general scenario of resource sharing in a decentralized system when the resource rewards/qualities are time-varying and unknown to the users, and using the same resource by multiple users leads to reduced…

Machine Learning · Computer Science 2012-10-23 Cem Tekin , Mingyan Liu

Distributed No-Regret Learning in Multi-Agent Systems

In this tutorial article, we give an overview of new challenges and representative results on distributed no-regret learning in multi-agent systems modeled as repeated unknown games. Four emerging game characteristics---dynamicity,…

Computer Science and Game Theory · Computer Science 2020-02-24 Xiao Xu , Qing Zhao

Distributed Stochastic Bandit Learning with Delayed Context Observation

We consider the problem where M agents collaboratively interact with an instance of a stochastic K-armed contextual bandit, where K>>M. The goal of the agents is to simultaneously minimize the cumulative regret over all the agents over a…

Machine Learning · Computer Science 2022-11-16 Jiabin Lin , Shana Moothedath

Collaborative Learning of Stochastic Bandits over a Social Network

We consider a collaborative online learning paradigm, wherein a group of agents connected through a social network are engaged in playing a stochastic multi-armed bandit game. Each time an agent takes an action, the corresponding reward is…

Machine Learning · Computer Science 2016-07-12 Ravi Kumar Kolla , Krishna Jagannathan , Aditya Gopalan

Distributed Resource Allocation for Human-Autonomy Teaming under Coupled Constraints

This paper studies the optimal resource allocation problem within a multi-agent network composed of both autonomous agents and humans. The main challenge lies in the globally coupled constraints that link the decisions of autonomous agents…

Systems and Control · Electrical Eng. & Systems 2025-04-04 Yichen Yao , Ryan Mbagna Nanko , Yue Wang , Xuan Wang

Distributed Learning in the Presence of Disturbances

We consider a problem where multiple agents must learn an action profile that maximises the sum of their utilities in a distributed manner. The agents are assumed to have no knowledge of either the utility functions or the actions and…

Systems and Control · Computer Science 2016-03-31 Chithrupa Ramesh , Marius Schmitt , John Lygeros

A Communication-Efficient Adaptive Algorithm for Federated Learning under Cumulative Regret

We consider the problem of online stochastic optimization in a distributed setting with $M$ clients connected through a central server. We develop a distributed online learning algorithm that achieves order-optimal cumulative regret with…

Machine Learning · Computer Science 2023-06-07 Sudeep Salgia , Qing Zhao , Tamir Gabay , Kobi Cohen

Communication-Efficient Distributed Online Nonconvex Optimization with Time-Varying Constraints

This paper considers distributed online nonconvex optimization with time-varying inequality constraints over a network of agents, where the nonconvex local loss and convex local constraint functions can vary arbitrarily across iterations.…

Optimization and Control · Mathematics 2025-11-19 Kunpeng Zhang , Lei Xu , Xinlei Yi , Guanghui Wen , Ming Cao , Karl H. Johansson , Tianyou Chai , Tao Yang

Distributed Online Optimization with Long-Term Constraints

We consider distributed online convex optimization problems, where the distributed system consists of various computing units connected through a time-varying communication graph. In each time step, each computing unit selects a constrained…

Machine Learning · Computer Science 2019-12-23 Deming Yuan , Alexandre Proutiere , Guodong Shi

Online distributed algorithms for mixed equilibrium problems in dynamic environments

In this paper, the mixed equilibrium problem with coupled inequality constraints in dynamic environments is solved by employing a multi-agent system, where each agent only has access to its own bifunction, its own constraint function, and…

Systems and Control · Electrical Eng. & Systems 2024-12-30 Hang Xu , Kaihong Lu , Yu-Long Wang , Qixin Zhu

Optimization and Learning in Open Multi-Agent Systems

Modern artificial intelligence relies on networks of agents that collect data, process information, and exchange it with neighbors to collaboratively solve optimization and learning problems. This article introduces a novel distributed…

Optimization and Control · Mathematics 2026-01-15 Diego Deplano , Nicola Bastianello , Mauro Franceschelli , Karl H. Johansson

Federated Learning for Heterogeneous Bandits with Unobserved Contexts

We study the problem of federated stochastic multi-arm contextual bandits with unknown contexts, in which M agents are faced with different bandits and collaborate to learn. The communication model consists of a central server and the…

Machine Learning · Computer Science 2024-01-31 Jiabin Lin , Shana Moothedath

Cooperative Multi-agent Bandits: Distributed Algorithms with Optimal Individual Regret and Constant Communication Costs

Recently, there has been extensive study of cooperative multi-agent multi-armed bandits where a set of distributed agents cooperatively play the same multi-armed bandit game. The goal is to develop bandit algorithms with the optimal group…

Machine Learning · Computer Science 2023-08-09 Lin Yang , Xuchuang Wang , Mohammad Hajiesmaili , Lijun Zhang , John C. S. Lui , Don Towsley

Distributed No-Regret Learning for Multi-Stage Systems with End-to-End Bandit Feedback

This paper studies multi-stage systems with end-to-end bandit feedback. In such systems, each job needs to go through multiple stages, each managed by a different agent, before generating an outcome. Each agent can only control its own…

Machine Learning · Computer Science 2024-08-20 I-Hong Hou

Distributed Linear Bandits under Communication Constraints

We consider distributed linear bandits where $M$ agents learn collaboratively to minimize the overall cumulative regret incurred by all agents. Information exchange is facilitated by a central server, and both the uplink and downlink…

Machine Learning · Computer Science 2025-11-17 Sudeep Salgia , Qing Zhao

No-regret distributed learning in subnetwork zero-sum games

In this paper, we consider a distributed learning problem in a subnetwork zero-sum game, where agents are competing in different subnetworks. These agents are connected through time-varying graphs where each agent has its own cost function…

Optimization and Control · Mathematics 2021-08-05 Shijie Huang , Jinlong Lei , Yiguang Hong , Uday V. Shanbhag , Jie Chen

Regret vs. Communication: Distributed Stochastic Multi-Armed Bandits and Beyond

In this paper, we consider the distributed stochastic multi-armed bandit problem, where a global arm set can be accessed by multiple players independently. The players are allowed to exchange their history of observations with each other at…

Machine Learning · Computer Science 2020-02-13 Shuang Liu , Cheng Chen , Zhihua Zhang