Related papers: Learning Provably Correct Distributed Protocols Wi…

Transformer Based Planning in the Observation Space with Applications to Trick Taking Card Games

Traditional search algorithms have issues when applied to games of imperfect information where the number of possible underlying states and trajectories are very large. This challenge is particularly evident in trick-taking card games.…

Artificial Intelligence · Computer Science 2024-04-23 Douglas Rebstock , Christopher Solinas , Nathan R. Sturtevant , Michael Buro

Learning to Give Checkable Answers with Prover-Verifier Games

Our ability to know when to trust the decisions made by machine learning systems has not kept up with the staggering improvements in their performance, limiting their applicability in high-stakes domains. We introduce Prover-Verifier Games…

Machine Learning · Computer Science 2021-08-30 Cem Anil , Guodong Zhang , Yuhuai Wu , Roger Grosse

Generalization Analysis for Game-Theoretic Machine Learning

For Internet applications like sponsored search, cautions need to be taken when using machine learning to optimize their mechanisms (e.g., auction) since self-interested agents in these applications may change their behaviors (and thus the…

Machine Learning · Computer Science 2014-10-14 Haifang Li , Fei Tian , Wei Chen , Tao Qin , Tie-Yan Liu

Monte Carlo Search Algorithm Discovery for One Player Games

Much current research in AI and games is being devoted to Monte Carlo search (MCS) algorithms. While the quest for a single unified MCS algorithm that would perform well on all problems is of major interest for AI, practitioners often know…

Artificial Intelligence · Computer Science 2015-03-20 Francis Maes , David Lupien St-Pierre , Damien Ernst

Distributed Semidefinite Programming with Application to Large-scale System Analysis

Distributed algorithms for solving coupled semidefinite programs (SDPs) commonly require many iterations to converge. They also put high computational demand on the computational agents. In this paper we show that in case the coupled…

Optimization and Control · Mathematics 2015-04-30 Sina Khoshfetrat Pakazad , Anders Hansson , Martin S. Andersen , Anders Rantzer

Exploring Explainable Multi-agent MCTS-minimax Hybrids in Board Game Using Process Mining

Monte-Carlo Tree Search (MCTS) is a family of sampling-based search algorithms widely used for online planning in sequential decision-making domains and at the heart of many recent advances in artificial intelligence. Understanding the…

Artificial Intelligence · Computer Science 2025-09-25 Yiyu Qian , Tim Miller , Zheng Qian , Liyuan Zhao

Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning

Standard planners for sequential decision making (including Monte Carlo planning, tree search, dynamic programming, etc.) are constrained by an implicit sequential planning assumption: The order in which a plan is constructed is the same in…

Machine Learning · Computer Science 2020-04-27 Giambattista Parascandolo , Lars Buesing , Josh Merel , Leonard Hasenclever , John Aslanides , Jessica B. Hamrick , Nicolas Heess , Alexander Neitz , Theophane Weber

Uncertainty-aware Planning with Inaccurate Models for Robotized Liquid Handling

Physics-based simulations and learning-based models are vital for complex robotics tasks like deformable object manipulation and liquid handling. However, these models often struggle with accuracy due to epistemic uncertainty or the…

Robotics · Computer Science 2025-07-29 Marco Faroni , Carlo Odesco , Andrea Zanchettin , Paolo Rocco

MASTER: A Multi-Agent System with LLM Specialized MCTS

Large Language Models (LLM) are increasingly being explored for problem-solving tasks. However, their strategic planning capability is often viewed with skepticism. Recent studies have incorporated the Monte Carlo Tree Search (MCTS)…

Artificial Intelligence · Computer Science 2025-02-05 Bingzheng Gan , Yufan Zhao , Tianyi Zhang , Jing Huang , Yusu Li , Shu Xian Teo , Changwang Zhang , Wei Shi

Decentralized MCTS via Learned Teammate Models

Decentralized online planning can be an attractive paradigm for cooperative multi-agent systems, due to improved scalability and robustness. A key difficulty of such approach lies in making accurate predictions about the decisions of other…

Artificial Intelligence · Computer Science 2020-11-11 Aleksander Czechowski , Frans A. Oliehoek

Toward Template-Free Explainability for Monte Carlo Tree Search

Probabilistic search algorithms, such as Monte Carlo Tree Search (MCTS), have proven very effective in solving sequential decision-making tasks under uncertainty. However, interpreting asymmetric search trees that incorporate bandit-based…

Human-Computer Interaction · Computer Science 2026-05-21 Siqi Lu , Mirsaleh Bahavarnia , Hiba Baroud , Yixuan Zhang , Hemant Purohit , Ayan Mukhopadhyay

Decision trees compensate for model misspecification

The best-performing models in ML are not interpretable. If we can explain why they outperform, we may be able to replicate these mechanisms and obtain both interpretability and performance. One example are decision trees and their…

Machine Learning · Statistics 2023-02-09 Hugh Panton , Gavin Leech , Laurence Aitchison

A Scalable Game Theoretic Approach for Coordination of Multiple Dynamic Systems

Learning in games provides a powerful framework to design control policies for self-interested agents that may be coupled through their dynamics, costs, or constraints. We consider the case where the dynamics of the coupled system can be…

Systems and Control · Electrical Eng. & Systems 2024-09-18 Mostafa M. Shibl , Vijay Gupta

Probable Approximate Coordination

We study the problem of how to coordinate the actions of independent agents in a distributed system where message arrival times are unbounded, but are determined by an exponential probability distribution. Asynchronous protocols executed in…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-11-10 Ariel Livshits , Yoram Moses

Ordinal Monte Carlo Tree Search

In many problem settings, most notably in game playing, an agent receives a possibly delayed reward for its actions. Often, those rewards are handcrafted and not naturally given. Even simple terminal-only rewards, like winning equals 1 and…

Artificial Intelligence · Computer Science 2020-12-09 Tobias Joppen , Johannes Fürnkranz

Safety Guarantees for Planning Based on Iterative Gaussian Processes

Gaussian Processes (GPs) are widely employed in control and learning because of their principled treatment of uncertainty. However, tracking uncertainty for iterative, multi-step predictions in general leads to an analytically intractable…

Machine Learning · Computer Science 2020-09-08 Kyriakos Polymenakos , Luca Laurenti , Andrea Patane , Jan-Peter Calliess , Luca Cardelli , Marta Kwiatkowska , Alessandro Abate , Stephen Roberts

Anytime Discovery of a Diverse Set of Patterns with Monte Carlo Tree Search

The discovery of patterns that accurately discriminate one class label from another remains a challenging data mining task. Subgroup discovery (SD) is one of the frameworks that enables to elicit such interesting hypotheses from labeled…

Data Structures and Algorithms · Computer Science 2017-12-07 Guillaume Bosc , Jean-François Boulicaut , Chedy Raïssi , Mehdi Kaytoue

GPT as a Monte Carlo Language Tree: A Probabilistic Perspective

Large Language Models (LLMs), such as GPT, are considered to learn the latent distributions within large-scale web-crawl datasets and accomplish natural language processing (NLP) tasks by predicting the next token. However, this mechanism…

Computation and Language · Computer Science 2025-02-04 Kun-Peng Ning , Jia-Yu Yao , Yu-Yang Liu , Mu-Nan Ning , Li Yuan

Verifiable Learning for Robust Tree Ensembles

Verifying the robustness of machine learning models against evasion attacks at test time is an important research problem. Unfortunately, prior work established that this problem is NP-hard for decision tree ensembles, hence bound to be…

Machine Learning · Computer Science 2023-11-14 Stefano Calzavara , Lorenzo Cazzaro , Giulio Ermanno Pibiri , Nicola Prezza

Optimal Transport for Domain Adaptation through Gaussian Mixture Models

Machine learning systems operate under the assumption that training and test data are sampled from a fixed probability distribution. However, this assumptions is rarely verified in practice, as the conditions upon which data was acquired…

Machine Learning · Computer Science 2025-07-09 Eduardo Fernandes Montesuma , Fred Maurice Ngolè Mboula , Antoine Souloumiac