English

Foolproof Cooperative Learning

Computer Science and Game Theory 2020-10-16 v3 Artificial Intelligence Machine Learning

Abstract

This paper extends the notion of learning equilibrium in game theory from matrix games to stochastic games. We introduce Foolproof Cooperative Learning (FCL), an algorithm that converges to a Tit-for-Tat behavior. It allows cooperative strategies when played against itself while being not exploitable by selfish players. We prove that in repeated symmetric games, this algorithm is a learning equilibrium. We illustrate the behavior of FCL on symmetric matrix and grid games, and its robustness to selfish learners.

Keywords

Cite

@article{arxiv.1906.09831,
  title  = {Foolproof Cooperative Learning},
  author = {Alexis Jacq and Julien Perolat and Matthieu Geist and Olivier Pietquin},
  journal= {arXiv preprint arXiv:1906.09831},
  year   = {2020}
}
R2 v1 2026-06-23T10:01:42.328Z