Related papers: A Strategic Learning Algorithm for State-based Gam…

Decentralized Learning for Optimality in Stochastic Dynamic Teams and Games with Local Control and Global State Information

Stochastic dynamic teams and games are rich models for decentralized systems and challenging testing grounds for multi-agent learning. Previous work that guaranteed team optimality assumed stateless dynamics, or an explicit coordination…

Optimization and Control · Mathematics 2024-03-28 Bora Yongacoglu , Gürdal Arslan , Serdar Yüksel

Strategic Teaching and Learning in Games

It is known that there are uncoupled learning heuristics leading to Nash equilibrium in all finite games. Why should players use such learning heuristics and where could they come from? We show that there is no uncoupled learning heuristic…

Computer Science and Game Theory · Computer Science 2015-04-27 Burkhard C. Schipper

A Heuristic Search Algorithm Using the Stability of Learning Algorithms in Certain Scenarios as the Fitness Function: An Artificial General Intelligence Engineering Approach

This paper presents a non-manual design engineering method based on heuristic search algorithm to search for candidate agents in the solution space which formed by artificial intelligence agents modeled on the base of bionics.Compared with…

Artificial Intelligence · Computer Science 2018-07-30 Zengkun Li

Improved Trial and Error Learning for Random Games

When a game involves many agents or when communication between agents is not possible, it is useful to resort to distributed learning where each agent acts in complete autonomy without any information on the other agents' situations.…

Optimization and Control · Mathematics 2025-09-24 Jérôme Taupin , Xavier Leturc , Christophe J. Le Martret

Priority Based Synchronization for Faster Learning in Games

Learning in games has been widely used to solve many cooperative multi-agent problems such as coverage control, consensus, self-reconfiguration or vehicle-target assignment. One standard approach in this domain is to formulate the problem…

Systems and Control · Electrical Eng. & Systems 2022-09-07 Abbasali Koochakzadeh , Yasin Yazıcıoğlu

Convergence of Heterogeneous Learning Dynamics in Zero-sum Stochastic Games

This paper presents new families of algorithms for the repeated play of two-agent (near) zero-sum games and two-agent zero-sum stochastic games. For example, the family includes fictitious play and its variants as members. Commonly, the…

Computer Science and Game Theory · Computer Science 2023-11-03 Yuksel Arslantas , Ege Yuceel , Yigit Yalin , Muhammed O. Sayin

Independent Policy Gradient Methods for Competitive Reinforcement Learning

We obtain global, non-asymptotic convergence guarantees for independent learning algorithms in competitive reinforcement learning settings with two agents (i.e., zero-sum stochastic games). We consider an episodic setting where in each…

Machine Learning · Computer Science 2021-01-13 Constantinos Daskalakis , Dylan J. Foster , Noah Golowich

Learning in games with continuous action sets and unknown payoff functions

This paper examines the convergence of no-regret learning in games with continuous action sets. For concreteness, we focus on learning via "dual averaging", a widely used class of no-regret learning schemes where players take small steps…

Optimization and Control · Mathematics 2018-01-17 Panayotis Mertikopoulos , Zhengyuan Zhou

Statistical-mechanics approach to a reinforcement learning model with memory

We introduce a two-player model of reinforcement learning with memory. Past actions of an iterated game are stored in a memory and used to determine player's next action. To examine the behaviour of the model some approximate methods are…

Statistical Mechanics · Physics 2009-11-13 Adam Lipowski , Krzysztof Gontarek , Marcel Ausloos

Learning to Play Stochastic Two-player Perfect-Information Games without Knowledge

In this paper, we extend the Descent framework, which enables learning and planning in the context of two-player games with perfect information, to the framework of stochastic games. We propose two ways of doing this, the first way…

Artificial Intelligence · Computer Science 2023-02-10 Quentin Cohen-Solal , Tristan Cazenave

On Learning with Finite Memory

We consider an infinite collection of agents who make decisions, sequentially, about an unknown underlying binary state of the world. Each agent, prior to making a decision, receives an independent private signal whose distribution depends…

Computer Science and Game Theory · Computer Science 2012-09-07 Kimon Drakopoulos , Asuman Ozdaglar , John Tsitsiklis

Balancing Two-Player Stochastic Games with Soft Q-Learning

Within the context of video games the notion of perfectly rational agents can be undesirable as it leads to uninteresting situations, where humans face tough adversarial decision makers. Current frameworks for stochastic games and…

Artificial Intelligence · Computer Science 2019-01-09 Jordi Grau-Moya , Felix Leibfried , Haitham Bou-Ammar

Robust Learning Equilibrium

We introduce robust learning equilibrium. The idea of learning equilibrium is that learning algorithms in multi-agent systems should themselves be in equilibrium rather than only lead to equilibrium. That is, learning equilibrium is immune…

Computer Science and Game Theory · Computer Science 2012-07-02 Itai Ashlagi , Dov Monderer , Moshe Tennenholtz

Independent Learning in Stochastic Games

Reinforcement learning (RL) has recently achieved tremendous successes in many artificial intelligence applications. Many of the forefront applications of RL involve multiple agents, e.g., playing chess and Go games, autonomous driving, and…

Computer Science and Game Theory · Computer Science 2021-11-24 Asuman Ozdaglar , Muhammed O. Sayin , Kaiqing Zhang

Constant-Memory Strategies in Stochastic Games: Best Responses and Equilibria

Stochastic games have become a prevalent framework for studying long-term multi-agent interactions, especially in the context of multi-agent reinforcement learning. In this work, we comprehensively investigate the concept of constant-memory…

Computer Science and Game Theory · Computer Science 2025-10-16 Fengming Zhu , Fangzhen Lin

Efficient Competitive Self-Play Policy Optimization

Reinforcement learning from self-play has recently reported many successes. Self-play, where the agents compete with themselves, is often used to generate training data for iterative policy improvement. In previous work, heuristic rules are…

Machine Learning · Computer Science 2020-09-15 Yuanyi Zhong , Yuan Zhou , Jian Peng

No-Regret Learning in Dynamic Stackelberg Games

In a Stackelberg game, a leader commits to a randomized strategy, and a follower chooses their best strategy in response. We consider an extension of a standard Stackelberg game, called a discrete-time dynamic Stackelberg game, that has an…

Computer Science and Game Theory · Computer Science 2022-02-11 Niklas Lauffer , Mahsa Ghasemi , Abolfazl Hashemi , Yagiz Savas , Ufuk Topcu

The equivalence of dynamic and strategic stability under regularized learning in games

In this paper, we examine the long-run behavior of regularized, no-regret learning in finite games. A well-known result in the field states that the empirical frequencies of no-regret play converge to the game's set of coarse correlated…

Computer Science and Game Theory · Computer Science 2023-11-07 Victor Boone , Panayotis Mertikopoulos

On the robustness of learning in games with stochastically perturbed payoff observations

Motivated by the scarcity of accurate payoff feedback in practical applications of game theory, we examine a class of learning dynamics where players adjust their choices based on past payoff observations that are subject to noise and…

Optimization and Control · Mathematics 2016-06-03 Mario Bravo , Panayotis Mertikopoulos

Limiting dynamics for Q-learning with memory one in symmetric two-player, two-action games

We develop a method based on computer algebra systems to represent the mutual pure strategy best-response dynamics of symmetric two-player, two-action repeated games played by players with a one-period memory. We apply this method to the…

Dynamical Systems · Mathematics 2022-10-04 Janusz M Meylahn , Lars Janssen