English
Related papers

Related papers: Bayesian Optimization in AlphaGo

200 papers

Bayesian Optimization (BO) is a common approach for hyperparameter optimization (HPO) in automated machine learning. Although it is well-accepted that HPO is crucial to obtain well-performing machine learning models, tuning BO's own…

Machine Learning · Computer Science 2019-08-20 Marius Lindauer , Matthias Feurer , Katharina Eggensperger , André Biedenkapp , Frank Hutter

AlphaZero has been very successful in many games. Unfortunately, it still consumes a huge amount of computing resources, the majority of which is spent in self-play. Hyperparameter tuning exacerbates the training cost since each…

Artificial Intelligence · Computer Science 2020-03-16 Ti-Rong Wu , Ting-Han Wei , I-Chen Wu

This work introduces an automated testing approach that employs agents controlling game characters to detect potential bugs within a game level. Harnessing the power of Bayesian Optimization (BO) to execute sample-efficient search, the…

Artificial Intelligence · Computer Science 2025-08-19 Carlos Celemin

By introducing several improvements to the AlphaZero process and architecture, we greatly accelerate self-play learning in Go, achieving a 50x reduction in computation over comparable methods. Like AlphaZero and replications such as ELF…

Machine Learning · Computer Science 2020-11-10 David J. Wu

The performance of deep (reinforcement) learning systems crucially depends on the choice of hyperparameters. Their tuning is notoriously expensive, typically requiring an iterative training process to run for numerous steps to convergence.…

Machine Learning · Computer Science 2021-01-19 Vu Nguyen , Sebastian Schulze , Michael A Osborne

Since AlphaGo and AlphaGo Zero have achieved breakground successes in the game of Go, the programs have been generalized to solve other tasks. Subsequently, AlphaZero was developed to play Go, Chess and Shogi. In the literature, the…

Machine Learning · Computer Science 2019-03-20 Hui Wang , Michael Emmerich , Mike Preuss , Aske Plaat

We attack the state-of-the-art Go-playing AI system KataGo by training adversarial policies against it, achieving a >97% win rate against KataGo running at superhuman settings. Our adversaries do not win by playing Go well. Instead, they…

The AlphaGo, AlphaGo Zero, and AlphaZero series of algorithms are remarkable demonstrations of deep reinforcement learning's capabilities, achieving superhuman performance in the complex game of Go with progressively increasing autonomy.…

Artificial Intelligence · Computer Science 2022-06-06 Yuandong Tian , Jerry Ma , Qucheng Gong , Shubho Sengupta , Zhuoyuan Chen , James Pinkerton , C. Lawrence Zitnick

We study how humans learn from AI, leveraging an introduction of an AI-powered Go program (APG) that unexpectedly outperformed the best professional player. We compare the move quality of professional players to APG's superior solutions…

General Economics · Economics 2025-01-13 Sukwoong Choi , Hyo Kang , Namil Kim , Junsik Kim

Bargaining can be used to resolve mixed-motive games in multi-agent systems. Although there is an abundance of negotiation strategies implemented in automated negotiating agents, most agents are based on single fixed strategies, while it is…

Multiagent Systems · Computer Science 2022-12-21 Bram M. Renting , Holger H. Hoos , Catholijn M. Jonker

The Google DeepMind challenge match in March 2016 was a historic achievement for computer Go development. This article discusses the development of computational intelligence (CI) and its relative strength in comparison with human…

Artificial Intelligence · Computer Science 2019-04-15 Chang-Shing Lee , Mei-Hui Wang , Shi-Jim Yen , Ting-Han Wei , I-Chen Wu , Ping-Chiang Chou , Chun-Hsun Chou , Ming-Wan Wang , Tai-Hsiung Yang

Bidding and acceptance strategies have a substantial impact on the outcome of negotiations in scenarios with linear additive and nonlinear utility functions. Over the years, it has become clear that there is no single best strategy for all…

Multiagent Systems · Computer Science 2020-09-15 Bram M. Renting , Holger H. Hoos , Catholijn M. Jonker

Across a growing number of domains, human experts are expected to learn from and adapt to AI with superior decision making abilities. But how can we quantify such human adaptation to AI? We develop a simple measure of human adaptation to AI…

Human-Computer Interaction · Computer Science 2021-02-02 Minkyu Shin , Jin Kim , Minkyung Kim

Deep learning technology is making great progress in solving the challenging problems of artificial intelligence, hence machine learning based on artificial neural networks is in the spotlight again. In some areas, artificial intelligence…

Artificial Intelligence · Computer Science 2020-02-27 Okyu Kwon

We develop a new method for stochastic optimization using the Bayesian statistics approach. More precisely, we optimize parameters of chess engines as those data are available to us, but the method should apply to all situations where we…

Optimization and Control · Mathematics 2022-07-06 Ivan Ivec , Ivana Vojnović

With the increase of machine learning usage by industries and scientific communities in a variety of tasks such as text mining, image recognition and self-driving cars, automatic setting of hyper-parameter in learning algorithms is a key…

Artificial Intelligence · Computer Science 2018-05-15 Juan Cruz Barsce , Jorge A. Palombarini , Ernesto C. Martínez

We train an agent to compete in the game of Gardner minichess, a downsized variation of chess played on a 5x5 board. We motivated and applied a SOTA actor-critic method Proximal Policy Optimization with Generalized Advantage Estimation. Our…

Machine Learning · Computer Science 2021-12-28 Michael Sun , Robert Tan

Bayesian optimization is proposed for automatic learning of optimal controller parameters from experimental data. A probabilistic description (a Gaussian process) is used to model the unknown function from controller parameters to a…

Systems and Control · Computer Science 2019-01-24 Matthias Neumann-Brosig , Alonso Marco , Dieter Schwarzmann , Sebastian Trimpe

The Elo rating system has been used world wide for individual sports and team sports, as exemplified by the European Go Federation (EGF), International Chess Federation (FIDE), International Federation of Association Football (FIFA), and…

Artificial Intelligence · Computer Science 2021-05-04 Ben Wise

In this paper we present a novel approach to optimise tactical and strategic decision making in football (soccer). We model the game of football as a multi-stage game which is made up from a Bayesian game to model the pre-match decisions…

Artificial Intelligence · Computer Science 2020-03-24 Ryan Beal , Georgios Chalkiadakis , Timothy J. Norman , Sarvapali D. Ramchurn
‹ Prev 1 2 3 10 Next ›