Related papers: Dynamic mean field programming

Discrete-Time Mean Field Control with Environment States

Multi-agent reinforcement learning methods have shown remarkable potential in solving complex multi-agent problems but mostly lack theoretical guarantees. Recently, mean field control and mean field games have been established as a…

Machine Learning · Computer Science 2021-12-20 Kai Cui , Anam Tahir , Mark Sinzger , Heinz Koeppl

A Theoretical Connection Between Statistical Physics and Reinforcement Learning

Sequential decision making in the presence of uncertainty and stochastic dynamics gives rise to distributions over state/action trajectories in reinforcement learning (RL) and optimal control problems. This observation has led to a variety…

Machine Learning · Computer Science 2021-09-30 Jad Rahme , Ryan P. Adams

Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods

We investigate reinforcement learning in the setting of Markov decision processes for a large number of exchangeable agents interacting in a mean field manner. Applications include, for example, the control of a large number of robots…

Optimization and Control · Mathematics 2025-04-30 René Carmona , Mathieu Laurière , Zongjun Tan

Mean-Field Reinforcement Learning without Synchrony

Mean-field reinforcement learning (MF-RL) scales multi-agent RL to large populations by reducing each agent's dependence on others to a single summary statistic -- the mean action. However, this reduction requires every agent to act at…

Multiagent Systems · Computer Science 2026-02-23 Shan Yang

Learning Deep Mean Field Games for Modeling Large Population Behavior

We consider the problem of representing collective behavior of large populations and predicting the evolution of a population distribution over a discrete state space. A discrete time mean field game (MFG) is motivated as an interpretable…

Machine Learning · Computer Science 2018-04-24 Jiachen Yang , Xiaojing Ye , Rakshit Trivedi , Huan Xu , Hongyuan Zha

Distributional Bellman Operators over Mean Embeddings

We propose a novel algorithmic framework for distributional reinforcement learning, based on learning finite-dimensional mean embeddings of return distributions. We derive several new algorithms for dynamic programming and…

Machine Learning · Statistics 2024-03-05 Li Kevin Wenliang , Grégoire Delétang , Matthew Aitchison , Marcus Hutter , Anian Ruoss , Arthur Gretton , Mark Rowland

Deterministic mean field games with control on the acceleration

In the present work, we study deterministic mean field games (MFGs) with finite time horizon in which the dynamics of a generic agent is controlled by the acceleration. They are described by a system of PDEs coupling a continuity equation…

Analysis of PDEs · Mathematics 2020-07-29 Yves Achdou , Paola Mannucci , Claudio Marchi , Nicoletta Tchou

Master equations for finite state mean field games with nonlinear activations

We formulate a class of mean field games on a finite state space with variational principles resembling those in continuous-state mean field games. We construct a controlled continuity equation featuring a nonlinear activation function on…

Optimization and Control · Mathematics 2023-10-10 Yuan Gao , Wuchen Li , Jian-Guo Liu

Mean field optimal stopping with uncontrolled state

We study a specific class of finite-horizon mean field optimal stopping problems by means of the dynamic programming approach. In particular, we consider problems where the state process is not affected by the stopping time. Such problems…

Optimization and Control · Mathematics 2025-03-07 Andrea Cosso , Laura Perelli

Budgeted Reinforcement Learning in Continuous State Space

A Budgeted Markov Decision Process (BMDP) is an extension of a Markov Decision Process to critical applications requiring safety constraints. It relies on a notion of risk implemented in the shape of a cost signal constrained to lie below…

Machine Learning · Computer Science 2019-05-29 Nicolas Carrara , Edouard Leurent , Romain Laroche , Tanguy Urvoy , Odalric-Ambrym Maillard , Olivier Pietquin

Mean Field Games and Applications: Numerical Aspects

The theory of mean field games aims at studying deterministic or stochastic differential games (Nash equilibria) as the number of agents tends to infinity. Since very few mean field games have explicit or semi-explicit solutions, numerical…

Optimization and Control · Mathematics 2020-03-11 Yves Achdou , Mathieu Laurière

Model-Free Mean-Field Reinforcement Learning: Mean-Field MDP and Mean-Field Q-Learning

We study infinite horizon discounted Mean Field Control (MFC) problems with common noise through the lens of Mean Field Markov Decision Processes (MFMDP). We allow the agents to use actions that are randomized not only at the individual…

Optimization and Control · Mathematics 2021-10-14 René Carmona , Mathieu Laurière , Zongjun Tan

Dynamic programming equation for the mean field optimal stopping problem

We study the optimal stopping problem of McKean-Vlasov diffusions when the criterion is a function of the law of the stopped process. A remarkable new feature in this setting is that the stopping time also impacts the dynamics of the…

Probability · Mathematics 2023-01-18 Mehdi Talbi , Nizar Touzi , Jianfeng Zhang

On Reward Structures of Markov Decision Processes

A Markov decision process can be parameterized by a transition kernel and a reward function. Both play essential roles in the study of reinforcement learning as evidenced by their presence in the Bellman equations. In our inquiry of various…

Machine Learning · Computer Science 2023-09-04 Falcon Z. Dai

Actor-Critic learning for mean-field control in continuous time

We study policy gradient for mean-field control in continuous time in a reinforcement learning setting. By considering randomised policies with entropy regularisation, we derive a gradient expectation representation of the value function,…

Machine Learning · Statistics 2023-03-14 Noufel Frikha , Maximilien Germain , Mathieu Laurière , Huyên Pham , Xuanye Song

Solving Continuous Mean Field Games: Deep Reinforcement Learning for Non-Stationary Dynamics

Mean field games (MFGs) have emerged as a powerful framework for modeling interactions in large-scale multi-agent systems. Despite recent advancements in reinforcement learning (RL) for MFGs, existing methods are typically limited to finite…

Machine Learning · Computer Science 2025-10-28 Lorenzo Magnino , Kai Shao , Zida Wu , Jiacheng Shen , Mathieu Laurière

Mean Field Multi-Agent Reinforcement Learning

Existing multi-agent reinforcement learning methods are limited typically to a small number of agents. When the agent number increases largely, the learning becomes intractable due to the curse of the dimensionality and the exponential…

Multiagent Systems · Computer Science 2020-12-16 Yaodong Yang , Rui Luo , Minne Li , Ming Zhou , Weinan Zhang , Jun Wang

Generalized Dual Dynamic Programming for Infinite Horizon Problems in Continuous State and Action Spaces

We describe a nonlinear generalization of dual dynamic programming theory and its application to value function estimation for deterministic control problems over continuous state and action spaces, in a discrete-time infinite horizon…

Optimization and Control · Mathematics 2018-10-05 Joseph Warrington , Paul N. Beuchat , John Lygeros

Approximation of deterministic mean field games with control-affine dynamics

We consider deterministic mean field games where the dynamics of a typical agent is non-linear with respect to the state variable and affine with respect to the control variable. Particular instances of the problem considered here are mean…

Optimization and Control · Mathematics 2022-12-21 Justina Gianatti , Francisco J. Silva

Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach

Traditional reinforcement learning (RL) assumes the agents make decisions based on Markov decision processes (MDPs) with one-step transition models. In many real-world applications, such as energy management and stock investment, agents can…

Machine Learning · Computer Science 2025-10-22 Chenbei Lu , Zaiwei Chen , Tongxin Li , Chenye Wu , Adam Wierman