Related papers: Continuous-Time Distributed Dynamic Programming fo…

$QD$-Learning: A Collaborative Distributed Strategy for Multi-Agent Reinforcement Learning Through Consensus + Innovations

The paper considers a class of multi-agent Markov decision processes (MDPs), in which the network agents respond differently (as manifested by the instantaneous one-stage random costs) to a global controlled state and the control actions of…

Machine Learning · Statistics 2015-06-04 Soummya Kar, José M. F. Moura, H. Vincent Poor

Distributed Planning in Hierarchical Factored MDPs

We present a principled and efficient planning algorithm for collaborative multiagent dynamical systems. All computation, during both the planning and the execution phases, is distributed among the agents; each agent only needs to model and…

Artificial Intelligence · Computer Science 2013-01-07 Carlos E. Guestrin, Geoffrey Gordon

Decentralized Planning Using Probabilistic Hyperproperties

Multi-agent planning under stochastic dynamics is usually formalised using decentralized (partially observable) Markov decision processes (MDPs) and reachability or expected reward specifications. In this paper, we propose a different…

Logic in Computer Science · Computer Science 2025-02-20 Francesco Pontiggia, Filip Macák, Roman Andriushchenko, Michele Chiari, Milan Češka

Distributed Differential Dynamic Programming Architectures for Large-Scale Multi-Agent Control

In this paper, we propose two novel decentralized optimization frameworks for multi-agent nonlinear optimal control problems in robotics. The aim of this work is to suggest architectures that inherit the computational efficiency and…

Systems and Control · Electrical Eng. & Systems 2022-08-09 Augustinos D. Saravanos, Yuichiro Aoyama, Hongchang Zhu, Evangelos A. Theodorou

Dynamic Programming for POMDP with Jointly Discrete and Continuous State-Spaces

In this work, we study dynamic programming (DP) algorithms for partially observable Markov decision processes with jointly continuous and discrete state-spaces. We consider a class of stochastic systems which have coupled discrete and…

Optimization and Control · Mathematics 2019-03-07 Donghwan Lee, Niao He, Jianghai Hu

Communication-Based Decomposition Mechanisms for Decentralized MDPs

Multi-agent planning in stochastic environments can be framed formally as a decentralized Markov decision problem. Many real-life distributed problems that arise in manufacturing, multi-robot coordination and information gathering scenarios…

Artificial Intelligence · Computer Science 2011-11-02 Claudia V. Goldman, Shlomo Zilberstein

Policy Dispersion in Non-Markovian Environment

Markov Decision Process (MDP) presents a mathematical framework to formulate the learning processes of agents in reinforcement learning. MDP is limited by the Markovian assumption that a reward only depends on the immediate state and…

Machine Learning · Computer Science 2024-06-04 Bohao Qu, Xiaofeng Cao, Jielong Yang, Hechang Chen, Chang Yi, Ivor W. Tsang, Yew-Soon Ong

Scalable spectral representations for multi-agent reinforcement learning in network MDPs

Network Markov Decision Processes (MDPs), a popular model for multi-agent control, pose a significant challenge to efficient learning due to the exponential growth of the global state-action space with the number of agents. In this work,…

Multiagent Systems · Computer Science 2024-11-19 Zhaolin Ren, Runyu Zhang, Bo Dai, Na Li

Towards time-varying proximal dynamics in Multi-Agent Network Games

Distributed decision making in multi-agent networks has recently attracted significant research attention thanks to its wide applicability, e.g. in the management and optimization of computer networks, power systems, robotic teams, sensor…

Optimization and Control · Mathematics 2018-11-13 Carlo Cenedese, Yu Kawano, Sergio Grammatico, Ming Cao

Distributed Online Planning for Min-Max Problems in Networked Markov Games

Min-max problems are important in multi-agent sequential decision-making because they improve the performance of the worst-performing agent in the network. However, solving the multi-agent min-max problem is challenging. We propose a…

Multiagent Systems · Computer Science 2024-05-31 Alexandros E. Tzikas, Jinkyoo Park, Mykel J. Kochenderfer, Ross E. Allen

A Multi-Scale Method for Distributed Convex Optimization with Constraints

This paper proposes a multi-scale method to design a continuous-time distributed algorithm for constrained convex optimization problems by using multi-agents with Markov switched network dynamics and noisy inter-agent communications. Unlike…

Optimization and Control · Mathematics 2021-03-02 Wei Ni, Xiaoli Wang

Approximate Linear Programming for Decentralized Policy Iteration in Cooperative Multi-agent Markov Decision Processes

In this work, we consider a cooperative multi-agent Markov decision process (MDP) involving m agents. At each decision epoch, all the m agents independently select actions in order to maximize a common long-term objective. In the policy…

Machine Learning · Computer Science 2024-05-01 Lakshmi Mandal, Chandrashekar Lakshminarayanan, Shalabh Bhatnagar

Optimal Coordinated Planning Amongst Self-Interested Agents with Private State

Consider a multi-agent system in a dynamic and uncertain environment. Each agent's local decision problem is modeled as a Markov decision process (MDP) and agents must coordinate on a joint action in each period, which provides a reward to…

Computer Science and Game Theory · Computer Science 2012-07-02 Ruggiero Cavallo, David C. Parkes, Satinder Singh

Decentralized Decision-Making Over Multi-Task Networks

In important applications involving multi-task networks with multiple objectives, agents in the network need to decide between these multiple objectives and reach an agreement about which single objective to follow for the network. In this…

Optimization and Control · Mathematics 2018-12-27 Sahar Khawatmi, Abdelhak M. Zoubir, Ali H. Sayed

Multi-agent Multi-target Path Planning in Markov Decision Processes

Missions for autonomous systems often require agents to visit multiple targets in complex operating conditions. This work considers the problem of visiting a set of targets in minimum time by a team of non-communicating agents in a Markov…

Optimization and Control · Mathematics 2023-06-21 Farhad Nawaz, Melkior Ornik

Distributed Model Predictive Control Design for Multi-agent Systems via Bayesian Optimization

This paper introduces a new approach that leverages Multi-agent Bayesian Optimization (MABO) to design Distributed Model Predictive Control (DMPC) schemes for multi-agent systems. The primary objective is to learn optimal DMPC schemes even…

Systems and Control · Electrical Eng. & Systems 2025-05-21 Hossein Nejatbakhsh Esfahani, Kai Liu, Javad Mohammadpour Velni

Feature Markov Decision Processes

General purpose intelligent learning agents cycle through (complex, non-MDP) sequences of observations, actions, and rewards. On the other hand, reinforcement learning is well-developed for small finite state Markov Decision Processes…

Artificial Intelligence · Computer Science 2009-12-30 Marcus Hutter

A Best-of-Both-Worlds Algorithm for Constrained MDPs with Long-Term Constraints

We study online learning in episodic constrained Markov decision processes (CMDPs), where the learner aims at collecting as much reward as possible over the episodes, while satisfying some long-term constraints during the learning process.…

Machine Learning · Computer Science 2024-08-30 Jacopo Germano, Francesco Emanuele Stradi, Gianmarco Genalti, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

A Design Method of Distributed Algorithms via Discrete-time Blended Dynamics Theorem

We develop a discrete-time version of the blended dynamics theorem for use in designing distributed computation algorithms. The blended dynamics theorem makes it possible to predict the behavior of heterogeneous multi-agent systems. Therefore,…

Systems and Control · Electrical Eng. & Systems 2023-12-01 Jeong Woo Kim, Jin Gyu Lee, Donggil Lee, Hyungbo Shim

Primal-Dual Distributed Temporal Difference Learning

The goal of this paper is to study a distributed version of the gradient temporal-difference (GTD) learning algorithm for a class of multi-agent Markov decision processes (MDPs). The temporal-difference (TD) learning is a reinforcement…

Optimization and Control · Mathematics 2020-04-29 Donghwan Lee, Jianghai Hu