Related papers: Evaluation Function Approximation for Scrabble

First Results from Using Game Refinement Measure and Learning Coefficient in Scrabble

This paper explores the entertainment experience and learning experience in Scrabble. It proposes a new measure from the educational point of view, which we call learning coefficient, based on the balance between the learner's skill and the…

Artificial Intelligence · Computer Science 2017-11-13 Kananat Suwanviwatana , Hiroyuki Iida

Monte-Carlo Tree Search for Simulation-based Strategy Analysis

Games are often designed to shape player behavior in a desired way; however, it can be unclear how design decisions affect the space of behaviors in a game. Designers usually explore this space through human playtesting, which can be…

Artificial Intelligence · Computer Science 2019-08-06 Alexander Zook , Brent Harrison , Mark O. Riedl

Approximating Auction Equilibria with Reinforcement Learning

Traditional methods for computing equilibria in auctions become computationally intractable as auction complexity increases, particularly in multi-item and dynamic auctions. This paper introduces a self-play based reinforcement learning…

General Economics · Economics 2024-10-21 Pranjal Rawat

A Sampling-Based Approach to Computing Equilibria in Succinct Extensive-Form Games

A central task of artificial intelligence is the design of artificial agents that act towards specified goals in partially observed environments. Since such environments frequently include interaction over time with other agents with their…

Computer Science and Game Theory · Computer Science 2012-05-14 Miroslav Dudik , Geoffrey Gordon

Learning Local Stackelberg Equilibria from Repeated Interactions with a Learning Agent

Motivated by the question of how a principal can maximize its utility in repeated interactions with a learning agent, we study repeated games between an principal and an agent employing a mean-based learning algorithm. Prior work has shown…

Computer Science and Game Theory · Computer Science 2025-10-28 Nivasini Ananthakrishnan , Yuval Dagan , Kunhe Yang

Improving Search with Supervised Learning in Trick-Based Card Games

In trick-taking card games, a two-step process of state sampling and evaluation is widely used to approximate move values. While the evaluation component is vital, the accuracy of move value estimates is also fundamentally linked to how…

Artificial Intelligence · Computer Science 2019-09-12 Christopher Solinas , Douglas Rebstock , Michael Buro

Approximate exploitability: Learning a best response in large games

Researchers have demonstrated that neural networks are vulnerable to adversarial examples and subtle environment changes, both of which one can view as a form of distribution shift. To humans, the resulting errors can look like blunders,…

Machine Learning · Computer Science 2022-11-07 Finbarr Timbers , Nolan Bard , Edward Lockhart , Marc Lanctot , Martin Schmid , Neil Burch , Julian Schrittwieser , Thomas Hubert , Michael Bowling

Improved Trial and Error Learning for Random Games

When a game involves many agents or when communication between agents is not possible, it is useful to resort to distributed learning where each agent acts in complete autonomy without any information on the other agents' situations.…

Optimization and Control · Mathematics 2025-09-24 Jérôme Taupin , Xavier Leturc , Christophe J. Le Martret

Active learning for efficiently training emulators of computationally expensive mathematical models

An emulator is a fast-to-evaluate statistical approximation of a detailed mathematical model (simulator). When used in lieu of simulators, emulators can expedite tasks that require many repeated evaluations, such as sensitivity analyses,…

Methodology · Statistics 2020-01-06 Alexandra G. Ellis , Rowan Iskandar , Christopher H. Schmid , John B. Wong , Thomas A. Trikalinos

ApproxED: Approximate exploitability descent via learned best responses

There has been substantial progress on finding game-theoretic equilibria. Most of that work has focused on games with finite, discrete action spaces. However, many games involving space, time, money, and other fine-grained quantities have…

Computer Science and Game Theory · Computer Science 2025-10-28 Carlos Martin , Tuomas Sandholm

Approximating Poker Probabilities with Deep Learning

Many poker systems, whether created with heuristics or machine learning, rely on the probability of winning as a key input. However calculating the precise probability using combinatorics is an intractable problem, so instead we approximate…

Artificial Intelligence · Computer Science 2018-08-24 Brandon Da Silva

Supervised and Reinforcement Learning from Observations in Reconnaissance Blind Chess

In this work, we adapt a training approach inspired by the original AlphaGo system to play the imperfect information game of Reconnaissance Blind Chess. Using only the observations instead of a full description of the game state, we first…

Artificial Intelligence · Computer Science 2022-08-04 Timo Bertram , Johannes Fürnkranz , Martin Müller

Solving Strongly Convex and Smooth Stackelberg Games Without Modeling the Follower

Stackelberg games have been widely used to model interactive decision-making problems in a variety of domains such as energy systems, transportation, cybersecurity, and human-robot interaction. However, existing algorithms for solving…

Optimization and Control · Mathematics 2023-03-14 Yansong Li , Shuo Han

Independent Learning in Constrained Markov Potential Games

Constrained Markov games offer a formal mathematical framework for modeling multi-agent reinforcement learning problems where the behavior of the agents is subject to constraints. In this work, we focus on the recently introduced class of…

Machine Learning · Computer Science 2024-02-29 Philip Jordan , Anas Barakat , Niao He

Statistical mechanics for Scrabble predicts strategy, entropy and language

The crossword-like patterns of tiles in Scrabble form connected graphs of occupied sites on a square lattice. We find the most structureless description that reproduces means and covariances observed in real Scrabble games by adapting a…

Biological Physics · Physics 2026-05-04 Olivier Witteveen , Marianne Bauer

Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers

We consider interactive learning in the realizable setting and develop a general framework to handle problems ranging from best arm identification to active classification. We begin our investigation with the observation that agnostic…

Machine Learning · Computer Science 2021-11-10 Julian Katz-Samuels , Blake Mason , Kevin Jamieson , Rob Nowak

A Monte Carlo AIXI Approximation

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents.…

Artificial Intelligence · Computer Science 2010-12-30 Joel Veness , Kee Siong Ng , Marcus Hutter , William Uther , David Silver

Scalable agent alignment via reward modeling: a research direction

One obstacle to applying reinforcement learning algorithms to real-world problems is the lack of suitable reward functions. Designing such reward functions is difficult in part because the user only has an implicit understanding of the task…

Machine Learning · Computer Science 2018-11-20 Jan Leike , David Krueger , Tom Everitt , Miljan Martic , Vishal Maini , Shane Legg

Efficient Empowerment

Empowerment quantifies the influence an agent has on its environment. This is formally achieved by the maximum of the expected KL-divergence between the distribution of the successor state conditioned on a specific action and a distribution…

Machine Learning · Statistics 2015-09-29 Maximilian Karl , Justin Bayer , Patrick van der Smagt

On Value Functions and the Agent-Environment Boundary

When function approximation is deployed in reinforcement learning (RL), the same problem may be formulated in different ways, often by treating a pre-processing step as a part of the environment or as part of the agent. As a consequence,…

Machine Learning · Computer Science 2020-06-02 Nan Jiang