Related papers: Bayesian Optimization in AlphaGo

Towards Assessing the Impact of Bayesian Optimization's Own Hyperparameters

Bayesian Optimization (BO) is a common approach for hyperparameter optimization (HPO) in automated machine learning. Although it is well-accepted that HPO is crucial to obtain well-performing machine learning models, tuning BO's own…

Machine Learning · Computer Science 2019-08-20 Marius Lindauer , Matthias Feurer , Katharina Eggensperger , André Biedenkapp , Frank Hutter

Accelerating and Improving AlphaZero Using Population Based Training

AlphaZero has been very successful in many games. Unfortunately, it still consumes a huge amount of computing resources, the majority of which is spent in self-play. Hyperparameter tuning exacerbates the training cost since each…

Artificial Intelligence · Computer Science 2020-03-16 Ti-Rong Wu , Ting-Han Wei , I-Chen Wu

Bayesian Optimization-based Search for Agent Control in Automated Game Testing

This work introduces an automated testing approach that employs agents controlling game characters to detect potential bugs within a game level. Harnessing the power of Bayesian Optimization (BO) to execute sample-efficient search, the…

Artificial Intelligence · Computer Science 2025-08-19 Carlos Celemin

Accelerating Self-Play Learning in Go

By introducing several improvements to the AlphaZero process and architecture, we greatly accelerate self-play learning in Go, achieving a 50x reduction in computation over comparable methods. Like AlphaZero and replications such as ELF…

Machine Learning · Computer Science 2020-11-10 David J. Wu

Bayesian Optimization for Iterative Learning

The performance of deep (reinforcement) learning systems crucially depends on the choice of hyperparameters. Their tuning is notoriously expensive, typically requiring an iterative training process to run for numerous steps to convergence.…

Machine Learning · Computer Science 2021-01-19 Vu Nguyen , Sebastian Schulze , Michael A Osborne

Hyper-Parameter Sweep on AlphaZero General

Since AlphaGo and AlphaGo Zero have achieved breakground successes in the game of Go, the programs have been generalized to solve other tasks. Subsequently, AlphaZero was developed to play Go, Chess and Shogi. In the literature, the…

Machine Learning · Computer Science 2019-03-20 Hui Wang , Michael Emmerich , Mike Preuss , Aske Plaat

Adversarial Policies Beat Superhuman Go AIs

We attack the state-of-the-art Go-playing AI system KataGo by training adversarial policies against it, achieving a >97% win rate against KataGo running at superhuman settings. Our adversaries do not win by playing Go well. Instead, they…

Machine Learning · Computer Science 2023-07-14 Tony T. Wang , Adam Gleave , Tom Tseng , Kellin Pelrine , Nora Belrose , Joseph Miller , Michael D. Dennis , Yawen Duan , Viktor Pogrebniak , Sergey Levine , Stuart Russell

ELF OpenGo: An Analysis and Open Reimplementation of AlphaZero

The AlphaGo, AlphaGo Zero, and AlphaZero series of algorithms are remarkable demonstrations of deep reinforcement learning's capabilities, achieving superhuman performance in the complex game of Go with progressively increasing autonomy.…

Artificial Intelligence · Computer Science 2022-06-06 Yuandong Tian , Jerry Ma , Qucheng Gong , Shubho Sengupta , Zhuoyuan Chen , James Pinkerton , C. Lawrence Zitnick

How Does Artificial Intelligence Improve Human Decision-Making? Evidence from the AI-Powered Go Program

We study how humans learn from AI, leveraging an introduction of an AI-powered Go program (APG) that unexpectedly outperformed the best professional player. We compare the move quality of professional players to APG's superior solutions…

General Economics · Economics 2025-01-13 Sukwoong Choi , Hyo Kang , Namil Kim , Junsik Kim

Automated Configuration and Usage of Strategy Portfolios for Bargaining

Bargaining can be used to resolve mixed-motive games in multi-agent systems. Although there is an abundance of negotiation strategies implemented in automated negotiating agents, most agents are based on single fixed strategies, while it is…

Multiagent Systems · Computer Science 2022-12-21 Bram M. Renting , Holger H. Hoos , Catholijn M. Jonker

Human vs. Computer Go: Review and Prospect

The Google DeepMind challenge match in March 2016 was a historic achievement for computer Go development. This article discusses the development of computational intelligence (CI) and its relative strength in comparison with human…

Artificial Intelligence · Computer Science 2019-04-15 Chang-Shing Lee , Mei-Hui Wang , Shi-Jim Yen , Ting-Han Wei , I-Chen Wu , Ping-Chiang Chou , Chun-Hsun Chou , Ming-Wan Wang , Tai-Hsiung Yang

Automated Configuration of Negotiation Strategies

Bidding and acceptance strategies have a substantial impact on the outcome of negotiations in scenarios with linear additive and nonlinear utility functions. Over the years, it has become clear that there is no single best strategy for all…

Multiagent Systems · Computer Science 2020-09-15 Bram M. Renting , Holger H. Hoos , Catholijn M. Jonker

Measuring Human Adaptation to AI in Decision Making: Application to Evaluate Changes after AlphaGo

Across a growing number of domains, human experts are expected to learn from and adapt to AI with superior decision making abilities. But how can we quantify such human adaptation to AI? We develop a simple measure of human adaptation to AI…

Human-Computer Interaction · Computer Science 2021-02-02 Minkyu Shin , Jin Kim , Minkyung Kim

Very simple statistical evidence that AlphaGo has exceeded human limits in playing GO game

Deep learning technology is making great progress in solving the challenging problems of artificial intelligence, hence machine learning based on artificial neural networks is in the spotlight again. In some areas, artificial intelligence…

Artificial Intelligence · Computer Science 2020-02-27 Okyu Kwon

Bayesian statistics approach to chess engines optimization

We develop a new method for stochastic optimization using the Bayesian statistics approach. More precisely, we optimize parameters of chess engines as those data are available to us, but the method should apply to all situations where we…

Optimization and Control · Mathematics 2022-07-06 Ivan Ivec , Ivana Vojnović

Towards Autonomous Reinforcement Learning: Automatic Setting of Hyper-parameters using Bayesian Optimization

With the increase of machine learning usage by industries and scientific communities in a variety of tasks such as text mining, image recognition and self-driving cars, automatic setting of hyper-parameter in learning algorithms is a key…

Artificial Intelligence · Computer Science 2018-05-15 Juan Cruz Barsce , Jorge A. Palombarini , Ernesto C. Martínez

Alpha-Mini: Minichess Agent with Deep Reinforcement Learning

We train an agent to compete in the game of Gardner minichess, a downsized variation of chess played on a 5x5 board. We motivated and applied a SOTA actor-critic method Proximal Policy Optimization with Generalized Advantage Estimation. Our…

Machine Learning · Computer Science 2021-12-28 Michael Sun , Robert Tan

Data-efficient Auto-tuning with Bayesian Optimization: An Industrial Control Study

Bayesian optimization is proposed for automatic learning of optimal controller parameters from experimental data. A probabilistic description (a Gaussian process) is used to model the unknown function from controller parameters to a…

Systems and Control · Computer Science 2019-01-24 Matthias Neumann-Brosig , Alonso Marco , Dieter Schwarzmann , Sebastian Trimpe

Elo Ratings for Large Tournaments of Software Agents in Asymmetric Games

The Elo rating system has been used world wide for individual sports and team sports, as exemplified by the European Go Federation (EGF), International Chess Federation (FIDE), International Federation of Association Football (FIFA), and…

Artificial Intelligence · Computer Science 2021-05-04 Ben Wise

Optimising Game Tactics for Football

In this paper we present a novel approach to optimise tactical and strategic decision making in football (soccer). We model the game of football as a multi-stage game which is made up from a Bayesian game to model the pre-match decisions…

Artificial Intelligence · Computer Science 2020-03-24 Ryan Beal , Georgios Chalkiadakis , Timothy J. Norman , Sarvapali D. Ramchurn