Related papers: A Tree Search Algorithm for Sequence Labeling

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

Recent methodologies in LLM self-training mostly rely on LLM generating responses and filtering those with correct output answers as training data. This approach often yields a low-quality fine-tuning training set (e.g., incorrect plans or…

Computation and Language · Computer Science 2024-11-19 Dan Zhang , Sining Zhoubian , Ziniu Hu , Yisong Yue , Yuxiao Dong , Jie Tang

An Efficient Dynamic Sampling Policy For Monte Carlo Tree Search

We consider the popular tree-based search strategy within the framework of reinforcement learning, the Monte Carlo Tree Search (MCTS), in the context of finite-horizon Markov decision process. We propose a dynamic sampling tree policy that…

Artificial Intelligence · Computer Science 2023-05-09 Gongbo Zhang , Yijie Peng , Yilong Xu

MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search

We introduce MCTS-RAG, a novel approach that enhances the reasoning capabilities of small language models on knowledge-intensive tasks by leveraging retrieval-augmented generation (RAG) to provide relevant context and Monte Carlo Tree…

Computation and Language · Computer Science 2025-10-09 Yunhai Hu , Yilun Zhao , Chen Zhao , Arman Cohan

Formally-Sharp DAgger for MCTS: Lower-Latency Monte Carlo Tree Search using Data Aggregation with Formal Methods

We study how to efficiently combine formal methods, Monte Carlo Tree Search (MCTS), and deep learning in order to produce high-quality receding horizon policies in large Markov Decision processes (MDPs). In particular, we use model-checking…

Artificial Intelligence · Computer Science 2023-08-16 Debraj Chakraborty , Damien Busatto-Gaston , Jean-François Raskin , Guillermo A. Pérez

Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

We introduce an approach aimed at enhancing the reasoning capabilities of Large Language Models (LLMs) through an iterative preference learning process inspired by the successful strategy employed by AlphaZero. Our work leverages Monte…

Artificial Intelligence · Computer Science 2024-06-19 Yuxi Xie , Anirudh Goyal , Wenyue Zheng , Min-Yen Kan , Timothy P. Lillicrap , Kenji Kawaguchi , Michael Shieh

Optimizing Prompt Sequences using Monte Carlo Tree Search for LLM-Based Optimization

Large language models (LLMs) have demonstrated remarkable capabilities in code generation and structured reasoning; however, their performance often degrades on complex tasks that require consistent multi-step planning. Recent work has…

Machine Learning · Computer Science 2025-08-11 Fei Xu Yu , Gina Adam , Nathaniel D. Bastian , Tian Lan

MBT: A Memory-Based Part of Speech Tagger-Generator

We introduce a memory-based approach to part of speech tagging. Memory-based learning is a form of supervised learning based on similarity-based reasoning. The part of speech tag of a word in a particular context is extrapolated from the…

cmp-lg · Computer Science 2008-02-03 Walter Daelemans , Jakub Zavrel , Peter Berck , Steven Gillis

Non-Asymptotic Analysis of Monte Carlo Tree Search

In this work, we consider the popular tree-based search strategy within the framework of reinforcement learning, the Monte Carlo Tree Search (MCTS), in the context of infinite-horizon discounted cost Markov Decision Process (MDP). While…

Machine Learning · Statistics 2020-01-14 Devavrat Shah , Qiaomin Xie , Zhi Xu

I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search

Recent advancements in large language models (LLMs) have shown remarkable potential in automating machine learning tasks. However, existing LLM-based agents often struggle with low-diversity and suboptimal code generation. While recent work…

Computation and Language · Computer Science 2026-01-26 Zujie Liang , Feng Wei , Wujiang Xu , Lin Chen , Yuxi Qian , Xinhui Wu

Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling

Despite their outstanding capabilities, large language models (LLMs) are prone to hallucination and producing factually incorrect information. This challenge has spurred efforts in attributed text generation, which prompts LLMs to generate…

Computation and Language · Computer Science 2025-06-23 Junyi Li , Hwee Tou Ng

Follow The Rules: Online Signal Temporal Logic Tree Search for Guided Imitation Learning in Stochastic Domains

Seamlessly integrating rules in Learning-from-Demonstrations (LfD) policies is a critical requirement to enable the real-world deployment of AI agents. Recently, Signal Temporal Logic (STL) has been shown to be an effective language for…

Robotics · Computer Science 2025-03-06 Jasmine Jerry Aloor , Jay Patrikar , Parv Kapoor , Jean Oh , Sebastian Scherer

Enhancing Reinforcement Learning Through Guided Search

With the aim of improving performance in Markov Decision Problem in an Off-Policy setting, we suggest taking inspiration from what is done in Offline Reinforcement Learning (RL). In Offline RL, it is a common practice during policy learning…

Artificial Intelligence · Computer Science 2024-10-29 Jérôme Arjonilla , Abdallah Saffidine , Tristan Cazenave

RPM-MCTS: Knowledge-Retrieval as Process Reward Model with Monte Carlo Tree Search for Code Generation

Tree search-based methods have made significant progress in enhancing the code generation capabilities of large language models. However, due to the difficulty in effectively evaluating intermediate algorithmic steps and the inability to…

Artificial Intelligence · Computer Science 2025-12-18 Yuanyuan Lin , Xiangyu Ouyang , Teng Zhang , Kaixin Sui

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Despite recent advances in large language models, open-source models often struggle to consistently perform well on complex reasoning tasks. Existing ensemble methods, whether applied at the token or output levels, fail to address these…

Computation and Language · Computer Science 2024-12-23 Sungjin Park , Xiao Liu , Yeyun Gong , Edward Choi

Improving Monte Carlo Tree Search for Symbolic Regression

Symbolic regression aims to discover concise, interpretable mathematical expressions that satisfy desired objectives, such as fitting data, posing a highly combinatorial optimization problem. While genetic programming has been the dominant…

Machine Learning · Computer Science 2025-09-25 Zhengyao Huang , Daniel Zhengyu Huang , Tiannan Xiao , Dina Ma , Zhenyu Ming , Hao Shi , Yuanhui Wen

Deep Reinforcement Learning Aided Monte Carlo Tree Search for MIMO Detection

This paper proposes a novel multiple-input multiple-output (MIMO) symbol detector that incorporates a deep reinforcement learning (DRL) agent into the Monte Carlo tree search (MCTS) detection algorithm. We first describe how the MCTS…

Signal Processing · Electrical Eng. & Systems 2021-02-02 Tz-Wei Mo , Ronald Y. Chang , Te-Yi Kan

Feedback-Based Tree Search for Reinforcement Learning

Inspired by recent successes of Monte-Carlo tree search (MCTS) in a number of artificial intelligence (AI) application domains, we propose a model-based reinforcement learning (RL) technique that iteratively applies MCTS on batches of…

Artificial Intelligence · Computer Science 2018-05-16 Daniel R. Jiang , Emmanuel Ekwedike , Han Liu

Tree-OPO: Off-policy Monte Carlo Tree-Guided Advantage Optimization for Multistep Reasoning

Recent advances in reasoning with large language models (LLMs) have shown the effectiveness of Monte Carlo Tree Search (MCTS) for generating high quality intermediate trajectories, particularly in math and symbolic domains. Inspired by…

Artificial Intelligence · Computer Science 2025-12-23 Bingning Huang , Tu Nguyen , Matthieu Zimmer

Optimized Monte Carlo Tree Search for Enhanced Decision Making in the FrozenLake Environment

Monte Carlo Tree Search (MCTS) is a powerful algorithm for solving complex decision-making problems. This paper presents an optimized MCTS implementation applied to the FrozenLake environment, a classic reinforcement learning task…

Artificial Intelligence · Computer Science 2024-09-26 Esteban Aldana Guerra

Top-down Tree Long Short-Term Memory Networks

Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have been successfully applied to a variety of sequence modeling tasks. In this paper we develop Tree Long Short-Term Memory…

Computation and Language · Computer Science 2016-04-05 Xingxing Zhang , Liang Lu , Mirella Lapata