A Communication-Efficient Decentralized Actor-Critic Algorithm

Xiaoxing Ren; Nicola Bastianello; Thomas Parisini; Andreas A. Malikopoulos

Machine Learning 2025-10-23 v1 Optimization and Control

Abstract

In this paper, we study the problem of reinforcement learning in multi-agent systems where communication among agents is limited. We develop a decentralized actor-critic learning framework in which each agent performs several local updates of its policy and value function, where the latter is approximated by a multi-layer neural network, before exchanging information with its neighbors. This local training strategy substantially reduces the communication burden while maintaining coordination across the network. We establish a finite-time convergence analysis for the algorithm under Markovian sampling. Specifically, to attain an $\varepsilon$-accurate stationary point, the sample complexity is of order $\mathcal{O}(\varepsilon^{-3})$ and the communication complexity is of order $\mathcal{O}(\varepsilon^{-1}\tau^{-1})$, where $\tau$ denotes the number of local training steps. We also show how the final error bound depends on the neural network's approximation quality. Numerical experiments in a cooperative control setting illustrate and validate the theoretical findings.
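The communication pattern described in the abstract (several local updates, then one exchange with neighbors) can be illustrated with a toy sketch. This is not the paper's algorithm: the actor and critic updates are replaced by gradient steps on a scalar quadratic, the ring topology and uniform mixing weights are assumptions, and the names `ring_mixing_matrix` and `run_local_then_mix` are hypothetical. It only shows that with $\tau$ local steps per round, the number of communications is the number of local steps divided by $\tau$.

```python
import random


def ring_mixing_matrix(n):
    """Symmetric, doubly stochastic weights for a ring (assumed topology):
    each agent averages its own value with its two neighbors."""
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in (i - 1, i, i + 1):
            W[i][j % n] += 1.0 / 3.0
    return W


def run_local_then_mix(num_agents=5, tau=4, rounds=50, step=0.1, seed=0):
    """Each round: tau local steps per agent, then one neighbor exchange.

    The local objective 0.5 * (x - target)^2 is a stand-in for the
    actor-critic updates, which this sketch does not implement.
    """
    rng = random.Random(seed)
    targets = [rng.uniform(-1.0, 1.0) for _ in range(num_agents)]
    x = [0.0] * num_agents
    W = ring_mixing_matrix(num_agents)
    communications = 0
    local_steps = 0

    for _ in range(rounds):
        # Local phase: tau gradient steps, no communication.
        for _ in range(tau):
            x = [xi - step * (xi - ti) for xi, ti in zip(x, targets)]
            local_steps += 1
        # Communication phase: one weighted average with ring neighbors.
        x = [sum(W[i][j] * x[j] for j in range(num_agents))
             for i in range(num_agents)]
        communications += 1

    return x, targets, local_steps, communications


if __name__ == "__main__":
    # Same number of local steps (200), different communication budgets.
    for tau in (1, 4):
        _, _, steps, comms = run_local_then_mix(tau=tau, rounds=200 // tau)
        print(f"tau={tau}: local steps={steps}, communications={comms}")
```

Because the assumed mixing matrix is doubly stochastic, each exchange preserves the network average, so in this toy problem the average of the agents' values approaches the average of the targets regardless of $\tau$. The individual agents' values need not agree exactly, which is the kind of effect a finite-time analysis has to account for.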

Keywords

multi-agent reinforcement learning; reinforcement learning; multi-agent systems

Cite

@article{arxiv.2510.19199,
  title  = {A Communication-Efficient Decentralized Actor-Critic Algorithm},
  author = {Xiaoxing Ren and Nicola Bastianello and Thomas Parisini and Andreas A. Malikopoulos},
  journal= {arXiv preprint arXiv:2510.19199},
  year   = {2025}
}
