Related papers: A Tree Search Algorithm for Sequence Labeling

200 papers

Recent methodologies in LLM self-training mostly rely on LLM generating responses and filtering those with correct output answers as training data. This approach often yields a low-quality fine-tuning training set (e.g., incorrect plans or…

Computation and Language · Computer Science 2024-11-19 Dan Zhang, Sining Zhoubian, Ziniu Hu, Yisong Yue, Yuxiao Dong, Jie Tang

We consider the popular tree-based search strategy within the framework of reinforcement learning, the Monte Carlo Tree Search (MCTS), in the context of finite-horizon Markov decision process. We propose a dynamic sampling tree policy that…

Artificial Intelligence · Computer Science 2023-05-09 Gongbo Zhang, Yijie Peng, Yilong Xu

We introduce MCTS-RAG, a novel approach that enhances the reasoning capabilities of small language models on knowledge-intensive tasks by leveraging retrieval-augmented generation (RAG) to provide relevant context and Monte Carlo Tree…

Computation and Language · Computer Science 2025-10-09 Yunhai Hu, Yilun Zhao, Chen Zhao, Arman Cohan

We study how to efficiently combine formal methods, Monte Carlo Tree Search (MCTS), and deep learning in order to produce high-quality receding horizon policies in large Markov Decision processes (MDPs). In particular, we use model-checking…

Artificial Intelligence · Computer Science 2023-08-16 Debraj Chakraborty, Damien Busatto-Gaston, Jean-François Raskin, Guillermo A. Pérez

We introduce an approach aimed at enhancing the reasoning capabilities of Large Language Models (LLMs) through an iterative preference learning process inspired by the successful strategy employed by AlphaZero. Our work leverages Monte…

Artificial Intelligence · Computer Science 2024-06-19 Yuxi Xie, Anirudh Goyal, Wenyue Zheng, Min-Yen Kan, Timothy P. Lillicrap, Kenji Kawaguchi, Michael Shieh

Large language models (LLMs) have demonstrated remarkable capabilities in code generation and structured reasoning; however, their performance often degrades on complex tasks that require consistent multi-step planning. Recent work has…

Machine Learning · Computer Science 2025-08-11 Fei Xu Yu, Gina Adam, Nathaniel D. Bastian, Tian Lan

We introduce a memory-based approach to part of speech tagging. Memory-based learning is a form of supervised learning based on similarity-based reasoning. The part of speech tag of a word in a particular context is extrapolated from the…

cmp-lg · Computer Science 2008-02-03 Walter Daelemans, Jakub Zavrel, Peter Berck, Steven Gillis

In this work, we consider the popular tree-based search strategy within the framework of reinforcement learning, the Monte Carlo Tree Search (MCTS), in the context of infinite-horizon discounted cost Markov Decision Process (MDP). While…

Machine Learning · Statistics 2020-01-14 Devavrat Shah, Qiaomin Xie, Zhi Xu

Recent advancements in large language models (LLMs) have shown remarkable potential in automating machine learning tasks. However, existing LLM-based agents often struggle with low-diversity and suboptimal code generation. While recent work…

Computation and Language · Computer Science 2026-01-26 Zujie Liang, Feng Wei, Wujiang Xu, Lin Chen, Yuxi Qian, Xinhui Wu

Despite their outstanding capabilities, large language models (LLMs) are prone to hallucination and producing factually incorrect information. This challenge has spurred efforts in attributed text generation, which prompts LLMs to generate…

Computation and Language · Computer Science 2025-06-23 Junyi Li, Hwee Tou Ng

Seamlessly integrating rules in Learning-from-Demonstrations (LfD) policies is a critical requirement to enable the real-world deployment of AI agents. Recently, Signal Temporal Logic (STL) has been shown to be an effective language for…

Robotics · Computer Science 2025-03-06 Jasmine Jerry Aloor, Jay Patrikar, Parv Kapoor, Jean Oh, Sebastian Scherer

With the aim of improving performance in Markov Decision Problem in an Off-Policy setting, we suggest taking inspiration from what is done in Offline Reinforcement Learning (RL). In Offline RL, it is a common practice during policy learning…

Artificial Intelligence · Computer Science 2024-10-29 Jérôme Arjonilla, Abdallah Saffidine, Tristan Cazenave

Tree search-based methods have made significant progress in enhancing the code generation capabilities of large language models. However, due to the difficulty in effectively evaluating intermediate algorithmic steps and the inability to…

Artificial Intelligence · Computer Science 2025-12-18 Yuanyuan Lin, Xiangyu Ouyang, Teng Zhang, Kaixin Sui

Despite recent advances in large language models, open-source models often struggle to consistently perform well on complex reasoning tasks. Existing ensemble methods, whether applied at the token or output levels, fail to address these…

Computation and Language · Computer Science 2024-12-23 Sungjin Park, Xiao Liu, Yeyun Gong, Edward Choi

Symbolic regression aims to discover concise, interpretable mathematical expressions that satisfy desired objectives, such as fitting data, posing a highly combinatorial optimization problem. While genetic programming has been the dominant…

Machine Learning · Computer Science 2025-09-25 Zhengyao Huang, Daniel Zhengyu Huang, Tiannan Xiao, Dina Ma, Zhenyu Ming, Hao Shi, Yuanhui Wen

This paper proposes a novel multiple-input multiple-output (MIMO) symbol detector that incorporates a deep reinforcement learning (DRL) agent into the Monte Carlo tree search (MCTS) detection algorithm. We first describe how the MCTS…

Signal Processing · Electrical Eng. & Systems 2021-02-02 Tz-Wei Mo, Ronald Y. Chang, Te-Yi Kan

Inspired by recent successes of Monte-Carlo tree search (MCTS) in a number of artificial intelligence (AI) application domains, we propose a model-based reinforcement learning (RL) technique that iteratively applies MCTS on batches of…

Artificial Intelligence · Computer Science 2018-05-16 Daniel R. Jiang, Emmanuel Ekwedike, Han Liu

Recent advances in reasoning with large language models (LLMs) have shown the effectiveness of Monte Carlo Tree Search (MCTS) for generating high quality intermediate trajectories, particularly in math and symbolic domains. Inspired by…

Artificial Intelligence · Computer Science 2025-12-23 Bingning Huang, Tu Nguyen, Matthieu Zimmer

Monte Carlo Tree Search (MCTS) is a powerful algorithm for solving complex decision-making problems. This paper presents an optimized MCTS implementation applied to the FrozenLake environment, a classic reinforcement learning task…

Artificial Intelligence · Computer Science 2024-09-26 Esteban Aldana Guerra

Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have been successfully applied to a variety of sequence modeling tasks. In this paper we develop Tree Long Short-Term Memory…

Computation and Language · Computer Science 2016-04-05 Xingxing Zhang, Liang Lu, Mirella Lapata