Related papers: Computing Approximate Nash Equilibria and Robust B…

Monte Carlo Continual Resolving for Online Strategy Computation in Imperfect Information Games

Online game playing algorithms produce high-quality strategies with a fraction of memory and computation required by their offline alternatives. Continual Resolving (CR) is a recent theoretically sound approach to online game playing that…

Computer Science and Game Theory · Computer Science 2019-03-11 Michal Sustr , Vojtech Kovarik , Viliam Lisy

Accelerating Nash Equilibrium Convergence in Monte Carlo Settings Through Counterfactual Value Based Fictitious Play

Counterfactual Regret Minimization (CFR) and its variants are widely recognized as effective algorithms for solving extensive-form imperfect information games. Recently, many improvements have been focused on enhancing the convergence speed…

Artificial Intelligence · Computer Science 2024-10-29 Ju Qi , Falin Hei , Ting Feng , Dengbing Yi , Zhemei Fang , Yunfeng Luo

Convergence of Monte Carlo Tree Search in Simultaneous Move Games

We study Monte Carlo tree search (MCTS) in zero-sum extensive-form games with perfect information and simultaneous moves. We present a general template of MCTS algorithms for these games, which can be instantiated by various selection…

Computer Science and Game Theory · Computer Science 2013-12-16 Viliam Lisý , Vojtěch Kovařík , Marc Lanctot , Branislav Bošanský

Robust Deep Monte Carlo Counterfactual Regret Minimization: Addressing Theoretical Risks in Neural Fictitious Self-Play

Monte Carlo Counterfactual Regret Minimization (MCCFR) has emerged as a cornerstone algorithm for solving extensive-form games, but its integration with deep neural networks introduces scale-dependent challenges that manifest differently…

Artificial Intelligence · Computer Science 2026-01-29 Zakaria El Jaafari

Monte Carlo Tree Search with Sampled Information Relaxation Dual Bounds

Monte Carlo Tree Search (MCTS), most famously used in game-play artificial intelligence (e.g., the game of Go), is a well-known strategy for constructing approximate solutions to sequential decision problems. Its primary innovation is the…

Optimization and Control · Mathematics 2017-04-21 Daniel R. Jiang , Lina Al-Kanj , Warren B. Powell

Stochastic Regret Minimization in Extensive-Form Games

Monte-Carlo counterfactual regret minimization (MCCFR) is the state-of-the-art algorithm for solving sequential games that are too large for full tree traversals. It works by using gradient estimates that can be computed via sampling.…

Computer Science and Game Theory · Computer Science 2020-02-21 Gabriele Farina , Christian Kroer , Tuomas Sandholm

Monte Carlo *-Minimax Search

This paper introduces Monte Carlo *-Minimax Search (MCMS), a Monte Carlo search algorithm for turned-based, stochastic, two-player, zero-sum games of perfect information. The algorithm is designed for the class of of densely stochastic…

Computer Science and Game Theory · Computer Science 2013-04-23 Marc Lanctot , Abdallah Saffidine , Joel Veness , Christopher Archibald , Mark H. M. Winands

Monte Carlo Neural Fictitious Self-Play: Approach to Approximate Nash equilibrium of Imperfect-Information Games

Researchers on artificial intelligence have achieved human-level intelligence in large-scale perfect-information games, but it is still a challenge to achieve (nearly) optimal results (in other words, an approximate Nash Equilibrium) in…

Artificial Intelligence · Computer Science 2019-04-09 Li Zhang , Wei Wang , Shijian Li , Gang Pan

Analysis of Hannan Consistent Selection for Monte Carlo Tree Search in Simultaneous Move Games

Monte Carlo Tree Search (MCTS) has recently been successfully used to create strategies for playing imperfect-information games. Despite its popularity, there are no theoretic results that guarantee its convergence to a well-defined…

Computer Science and Game Theory · Computer Science 2015-09-02 Vojtěch Kovařík , Viliam Lisý

Multiple Tree for Partially Observable Monte-Carlo Tree Search

We propose an algorithm for computing approximate Nash equilibria of partially observable games using Monte-Carlo tree search based on recent bandit methods. We obtain experimental results for the game of phantom tic-tac-toe, showing that…

Computer Science and Game Theory · Computer Science 2011-02-09 David Auger

Meta-Learning in Self-Play Regret Minimization

Regret minimization is a general approach to online optimization which plays a crucial role in many algorithms for approximating Nash equilibria in two-player zero-sum games. The literature mainly focuses on solving individual games in…

Computer Science and Game Theory · Computer Science 2025-04-29 David Sychrovský , Martin Schmid , Michal Šustr , Michael Bowling

D2CFR: Minimize Counterfactual Regret with Deep Dueling Neural Network

Counterfactual Regret Minimization (CFR)} is the popular method for finding approximate Nash equilibrium in two-player zero-sum games with imperfect information. CFR solves games by travsersing the full game tree iteratively, which limits…

Artificial Intelligence · Computer Science 2022-01-04 Huale Li , Xuan Wang , Zengyue Guo , Jiajia Zhang , Shuhan Qi

Proof Number Based Monte-Carlo Tree Search

This paper proposes a new game-search algorithm, PN-MCTS, which combines Monte-Carlo Tree Search (MCTS) and Proof-Number Search (PNS). These two algorithms have been successfully applied for decision making in a range of domains. We define…

Artificial Intelligence · Computer Science 2024-05-30 Jakub Kowalski , Elliot Doe , Mark H. M. Winands , Daniel Górski , Dennis J. N. J. Soemers

Neural Architecture Search by Learning a Hierarchical Search Space

Monte-Carlo Tree Search (MCTS) is a powerful tool for many non-differentiable search related problems such as adversarial games. However, the performance of such approach highly depends on the order of the nodes that are considered at each…

Computer Vision and Pattern Recognition · Computer Science 2025-03-28 Mehraveh Javan Roshtkhari , Matthew Toews , Marco Pedersoli

Almost Optimal Algorithms for Two-player Zero-Sum Linear Mixture Markov Games

We study reinforcement learning for two-player zero-sum Markov games with simultaneous moves in the finite-horizon setting, where the transition kernel of the underlying Markov games can be parameterized by a linear function over the…

Machine Learning · Computer Science 2022-04-21 Zixiang Chen , Dongruo Zhou , Quanquan Gu

Last-Iterate Convergence in Adaptive Regret Minimization for Approximate Extensive-Form Perfect Equilibrium

The Nash Equilibrium (NE) assumes rational play in imperfect-information Extensive-Form Games (EFGs) but fails to ensure optimal strategies for off-equilibrium branches of the game tree, potentially leading to suboptimal outcomes in…

Computer Science and Game Theory · Computer Science 2025-08-12 Hang Ren , Xiaozhen Sun , Tianzi Ma , Jiajia Zhang , Xuan Wang

Approximating Nash Equilibria in General-Sum Games via Meta-Learning

Nash equilibrium is perhaps the best-known solution concept in game theory. Such a solution assigns a strategy to each player which offers no incentive to unilaterally deviate. While a Nash equilibrium is guaranteed to always exist, the…

Computer Science and Game Theory · Computer Science 2025-04-29 David Sychrovský , Christopher Solinas , Revan MacQueen , Kevin Wang , James R. Wright , Nathan R. Sturtevant , Michael Bowling

Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?

We study multi-player general-sum Markov games with one of the players designated as the leader and the other players regarded as followers. In particular, we focus on the class of games where the followers are myopic, i.e., they aim to…

Machine Learning · Computer Science 2021-12-28 Han Zhong , Zhuoran Yang , Zhaoran Wang , Michael I. Jordan

Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree search

Bayes-optimal behavior, while well-defined, is often difficult to achieve. Recent advances in the use of Monte-Carlo tree search (MCTS) have shown that it is possible to act near-optimally in Markov Decision Processes (MDPs) with very large…

Artificial Intelligence · Computer Science 2012-02-20 John Asmuth , Michael L. Littman

Solving Large Imperfect Information Games Using CFR+

Counterfactual Regret Minimization and variants (e.g. Public Chance Sampling CFR and Pure CFR) have been known as the best approaches for creating approximate Nash equilibrium solutions for imperfect information games such as poker. This…

Computer Science and Game Theory · Computer Science 2014-07-21 Oskari Tammelin