Decentralized Deterministic Multi-Agent Reinforcement Learning

Antoine Grosnit; Desmond Cai; Laura Wynter

Machine Learning 2021-02-22 v1

Authors: Antoine Grosnit, Desmond Cai, Laura Wynter

Abstract

[Zhang, ICML 2018] provided the first decentralized actor-critic algorithm for multi-agent reinforcement learning (MARL) with convergence guarantees. In that work, policies are stochastic and defined on finite action spaces. We extend those results with a provably convergent decentralized actor-critic algorithm for learning deterministic policies on continuous action spaces. Deterministic policies are important in real-world settings. To handle the lack of exploration inherent in deterministic policies, we consider both off-policy and on-policy settings. We provide the expression of a local deterministic policy gradient, decentralized deterministic actor-critic algorithms, and convergence guarantees for linearly approximated value functions. This work will help enable decentralized MARL in high-dimensional action spaces and pave the way for more widespread use of MARL.
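
To make the ingredients named in the abstract concrete, the following is a minimal illustrative sketch and not the paper's algorithm. It assumes a linear deterministic policy per agent, a critic that is linear in the hypothetical features [s, a, a**2], and a consensus-style averaging of critic weights over a communication matrix C. The names policy, q_grad_action, consensus, and actor_step are invented for this illustration.

```python
import numpy as np


def policy(theta_i, s):
    """Deterministic linear policy for one agent: a_i = theta_i . s (assumed form)."""
    return float(theta_i @ s)


def q_grad_action(w, s, a, i):
    """Derivative w.r.t. a_i of the assumed linear critic Q_w(s, a) = w . [s, a, a**2]."""
    d, n = s.size, a.size
    return w[d + i] + 2.0 * w[d + n + i] * a[i]


def consensus(W, C):
    """Mix critic weights across agents; row i of C holds agent i's mixing weights."""
    return C @ W


def actor_step(thetas, W, s, lr):
    """One gradient-ascent step per agent, each using only its own critic copy W[i]."""
    a = np.array([policy(th, s) for th in thetas])
    new_thetas = thetas.copy()
    for i in range(len(thetas)):
        # Chain rule: grad_theta_i Q = grad_theta_i mu_i(s) * dQ/da_i,
        # and grad_theta_i mu_i(s) = s for the linear policy assumed here.
        new_thetas[i] = thetas[i] + lr * s * q_grad_action(W[i], s, a, i)
    return new_thetas
```

In this sketch, thetas has shape (N, d) and W has shape (N, d + 2N) for N agents and a d-dimensional state. Each agent updates only its own policy parameters from its local critic copy, and the consensus step is the only place where information is exchanged.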

Keywords

multi-agent reinforcement learning; reinforcement learning; multi-agent systems

Cite

@article{arxiv.2102.09745,
  title  = {Decentralized Deterministic Multi-Agent Reinforcement Learning},
  author = {Antoine Grosnit and Desmond Cai and Laura Wynter},
  journal= {arXiv preprint arXiv:2102.09745},
  year   = {2021}
}
