Related papers: Polygames: Improved Zero Learning

Minimax Strikes Back

Deep Reinforcement Learning reaches a superhuman level of play in many complete information games. The state of the art algorithm for learning with zero knowledge is AlphaZero. We take another approach, Ath\'enan, which uses a different,…

Artificial Intelligence · Computer Science 2026-04-24 Quentin Cohen-Solal , Tristan Cazenave

Neurohex: A Deep Q-learning Hex Agent

DeepMind's recent spectacular success in using deep convolutional neural nets and machine learning to build superhuman level agents --- e.g. for Atari games via deep Q-learning and for the game of Go via Reinforcement Learning --- raises…

Artificial Intelligence · Computer Science 2016-04-27 Kenny Young , Ryan Hayward , Gautham Vasan

Self-Play Learning Without a Reward Metric

The AlphaZero algorithm for the learning of strategy games via self-play, which has produced superhuman ability in the games of Go, chess, and shogi, uses a quantitative reward function for game outcomes, requiring the users of the…

Machine Learning · Computer Science 2019-12-17 Dan Schmidt , Nick Moran , Jonathan S. Rosenfeld , Jonathan Rosenthal , Jonathan Yedidia

Multiplayer AlphaZero

The AlphaZero algorithm has achieved superhuman performance in two-player, deterministic, zero-sum games where perfect information of the game state is available. This success has been demonstrated in Chess, Shogi, and Go where learning…

Artificial Intelligence · Computer Science 2019-12-10 Nick Petosa , Tucker Balch

Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations

Reinforcement Learning (RL) has been widely used in many applications, particularly in gaming, which serves as an excellent training ground for AI models. Google DeepMind has pioneered innovations in this field, employing reinforcement…

Artificial Intelligence · Computer Science 2026-02-12 Abdelrhman Shaheen , Anas Badr , Ali Abohendy , Hatem Alsaadawy , Nadine Alsayad , Ehab H. El-Shazly

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

The game of chess is the most widely-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific adaptations, and handcrafted evaluation…

Artificial Intelligence · Computer Science 2017-12-06 David Silver , Thomas Hubert , Julian Schrittwieser , Ioannis Antonoglou , Matthew Lai , Arthur Guez , Marc Lanctot , Laurent Sifre , Dharshan Kumaran , Thore Graepel , Timothy Lillicrap , Karen Simonyan , Demis Hassabis

Improved Reinforcement Learning with Curriculum

Humans tend to learn complex abstract concepts faster if examples are presented in a structured manner. For instance, when learning how to play a board game, usually one of the first concepts learned is how the game ends, i.e. the actions…

Machine Learning · Computer Science 2019-06-11 Joseph West , Frederic Maire , Cameron Browne , Simon Denman

Bridging the Human-AI Knowledge Gap: Concept Discovery and Transfer in AlphaZero

Artificial Intelligence (AI) systems have made remarkable progress, attaining super-human performance across various domains. This presents us with an opportunity to further human knowledge and improve human expert performance by leveraging…

Artificial Intelligence · Computer Science 2023-10-26 Lisa Schut , Nenad Tomasev , Tom McGrath , Demis Hassabis , Ulrich Paquet , Been Kim

Playing Hex and Counter Wargames using Reinforcement Learning and Recurrent Neural Networks

Hex and Counter Wargames are adversarial two-player simulations of real military conflicts requiring complex strategic decision-making. Unlike classical board games, these games feature intricate terrain/unit interactions, unit stacking,…

Machine Learning · Computer Science 2025-02-20 Guilherme Palma , Pedro A. Santos , João Dias

Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex

AlphaZero, an approach to reinforcement learning that couples neural networks and Monte Carlo tree search (MCTS), has produced state-of-the-art strategies for traditional board games like chess, Go, shogi, and Hex. While researchers and…

Artificial Intelligence · Computer Science 2022-11-29 Charles Lovering , Jessica Zosa Forde , George Konidaris , Ellie Pavlick , Michael L. Littman

Combining Deep Reinforcement Learning and Search for Imperfect-Information Games

The combination of deep reinforcement learning and search at both training and test time is a powerful paradigm that has led to a number of successes in single-agent settings and perfect-information games, best exemplified by AlphaZero.…

Computer Science and Game Theory · Computer Science 2020-12-01 Noam Brown , Anton Bakhtin , Adam Lerer , Qucheng Gong

Mastering the Game of Go with Self-play Experience Replay

The game of Go has long served as a benchmark for artificial intelligence, demanding sophisticated strategic reasoning and long-term planning. Previous approaches such as AlphaGo and its successors, have predominantly relied on model-based…

Artificial Intelligence · Computer Science 2026-01-08 Jingbin Liu , Xuechun Wang

Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games

Zero-sum games have long guided artificial intelligence research, since they possess both a rich strategy space of best-responses and a clear evaluation metric. What's more, competition is a vital mechanism in many real-world multi-agent…

Computer Science and Game Theory · Computer Science 2020-03-03 Edward Hughes , Thomas W. Anthony , Tom Eccles , Joel Z. Leibo , David Balduzzi , Yoram Bachrach

Deep Learning for General Game Playing with Ludii and Polygames

Combinations of Monte-Carlo tree search and Deep Neural Networks, trained through self-play, have produced state-of-the-art results for automated game-playing in many board games. The training and search algorithms are not game-specific,…

Artificial Intelligence · Computer Science 2021-01-26 Dennis J. N. J. Soemers , Vegard Mella , Cameron Browne , Olivier Teytaud

Warm-Start AlphaZero Self-Play Search Enhancements

Recently, AlphaZero has achieved landmark results in deep reinforcement learning, by providing a single self-play architecture that learned three different games at super human level. AlphaZero is a large and complicated system with many…

Artificial Intelligence · Computer Science 2021-01-11 Hui Wang , Mike Preuss , Aske Plaat

Score vs. Winrate in Score-Based Games: which Reward for Reinforcement Learning?

In the last years, the DeepMind algorithm AlphaZero has become the state of the art to efficiently tackle perfect information two-player zero-sum games with a win/lose outcome. However, when the win/lose outcome is decided by a final score…

Artificial Intelligence · Computer Science 2023-01-10 Luca Pasqualini , Gianluca Amato , Marco Fantozzi , Rosa Gini , Alessandro Marchetti , Carlo Metta , Francesco Morandin , Maurizio Parton

Train on Small, Play the Large: Scaling Up Board Games with AlphaZero and GNN

Playing board games is considered a major challenge for both humans and AI researchers. Because some complicated board games are quite hard to learn, humans usually begin with playing on smaller boards and incrementally advance to master…

Machine Learning · Computer Science 2021-07-20 Shai Ben-Assayag , Ran El-Yaniv

Impartial Games: A Challenge for Reinforcement Learning

AlphaZero-style reinforcement learning (RL) algorithms have achieved superhuman performance in many complex board games such as Chess, Shogi, and Go. However, we showcase that these algorithms encounter significant and fundamental…

Machine Learning · Computer Science 2026-01-22 Bei Zhou , Søren Riis

Fast and Knowledge-Free Deep Learning for General Game Playing (Student Abstract)

We develop a method of adapting the AlphaZero model to General Game Playing (GGP) that focuses on faster model generation and requires less knowledge to be extracted from the game rules. The dataset generation uses MCTS playing instead of…

Artificial Intelligence · Computer Science 2023-12-22 Michał Maras , Michał Kępa , Jakub Kowalski , Marek Szykuła

SAI: a Sensible Artificial Intelligence that plays with handicap and targets high scores in 9x9 Go (extended version)

We develop a new model that can be applied to any perfect information two-player zero-sum game to target a high score, and thus a perfect play. We integrate this model into the Monte Carlo tree search-policy iteration learning pipeline…

Artificial Intelligence · Computer Science 2019-11-28 Francesco Morandin , Gianluca Amato , Marco Fantozzi , Rosa Gini , Carlo Metta , Maurizio Parton