Related papers: StateFlow: Enhancing LLM Task-Solving through Stat…

200 papers

Despite recent advancements in large language models (LLMs), their performance on complex reasoning problems requiring multi-step thinking and combining various skills is still limited. To address this, we propose a novel framework HDFlow…

Computation and Language · Computer Science 2024-09-27 Wenlin Yao, Haitao Mi, Dong Yu

Large language models (LLMs) are increasingly used as autonomous agents, tackling tasks from robotics to web navigation. Their performance depends on the underlying base agent. Existing methods, however, struggle with long-context reasoning…

Artificial Intelligence · Computer Science 2025-04-09 Nikolai Rozanov, Marek Rei

In high-stakes domains such as healthcare and finance, effective decision-making demands not just accurate outcomes but transparent and explainable reasoning. However, current language models often lack the structured deliberation needed…

Computation and Language · Computer Science 2025-08-26 Xiusi Chen, Shanyong Wang, Cheng Qian, Hongru Wang, Peixuan Han, Heng Ji

We introduce Simulation Streams, a programming paradigm designed to efficiently control and leverage Large Language Models (LLMs) for complex, dynamic simulations and agentic workflows. Our primary goal is to create a minimally interfering…

Artificial Intelligence · Computer Science 2025-02-03 Peter Sunehag, Joel Z. Leibo

In an Information Retrieval (IR) system, reranking plays a critical role by sorting candidate passages according to their relevance to a specific query. This process demands a nuanced understanding of the variations among passages linked to…

Information Retrieval · Computer Science 2026-04-20 Can Jin, Hongwu Peng, Anxiang Zhang, Nuo Chen, Jiahui Zhao, Xi Xie, Kuangzheng Li, Shuya Feng, Kai Zhong, Caiwen Ding, Dimitris N. Metaxas

Reinforcement learning (RL) shows promise for enhancing LLM agentic reasoning, yet sparse terminal rewards hinder fine-grained optimization. Process reward modeling offers an alternative but incurs high computational costs, reward hacking…

Artificial Intelligence · Computer Science 2026-05-29 Xiao Feng, Bo Han, Zhanke Zhou, Jiaqi Fan, Jiangchao Yao, Ka Ho Li, Dahai Yu, Michael Kwok-Po Ng

Recently, large language models (LLMs) have achieved widespread application across various fields. Despite their impressive capabilities, LLMs suffer from a lack of structured reasoning ability, particularly for complex tasks requiring…

Artificial Intelligence · Computer Science 2025-09-03 Boqi Chen, Kua Chen, José Antonio Hernández López, Gunter Mussbacher, Dániel Varró, Amir Feizpour

Legal practitioners, particularly those early in their careers, face complex, high-stakes tasks that require adaptive, context-sensitive reasoning. While AI holds promise in supporting legal work, current datasets and models are narrowly…

While Long Chain-of-Thought (CoT) reasoning significantly improves the performance of Large Language Models (LLMs) on complex reasoning tasks, the substantial computational and memory costs of generating long CoT sequences limit their efficiency…

Artificial Intelligence · Computer Science 2026-02-03 Liang Zhang, Yu Zhao, Longyue Wang, Tianqi Shi, Weihua Luo, Kaifu Zhang, Jinsong Su

Robust workflow composition is critical for effective agent performance, yet progress in Large Language Model (LLM) planning and reasoning is hindered by a scarcity of scalable evaluation data. This work introduces NL2Flow, a fully…

Artificial Intelligence · Computer Science 2025-10-16 Jungkoo Kang

Recent advancements in Large Language Models (LLMs) have shown significant progress in understanding complex natural language. One important application of LLMs is the LLM-based AI agent, which leverages the ability of LLMs as well as external…

Computation and Language · Computer Science 2024-07-19 Zelong Li, Shuyuan Xu, Kai Mei, Wenyue Hua, Balaji Rama, Om Raheja, Hao Wang, He Zhu, Yongfeng Zhang

Robotic Process Automation (RPA) systems face challenges in handling complex processes and diverse screen layouts that require advanced human-like decision-making capabilities. These systems typically rely on pixel-level encoding through…

Robotics · Computer Science 2024-05-22 Arushi Jain, Shubham Paliwal, Monika Sharma, Lovekesh Vig, Gautam Shroff

Supervised fine-tuning (SFT) is a common method to enhance the tool calling capabilities of Large Language Models (LLMs), with the training data often being synthesized. The current data synthesis process generally involves sampling a set…

Computation and Language · Computer Science 2025-03-18 Zezhong Wang, Xingshan Zeng, Weiwen Liu, Liangyou Li, Yasheng Wang, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong

The rapidly growing demand for high-quality data in Large Language Models (LLMs) has intensified the need for scalable, reliable, and semantically rich data preparation pipelines. However, current practices remain dominated by ad-hoc…

Augmenting large language models (LLMs) with external tools has emerged as a promising approach to extend their utility, enabling them to solve practical tasks. Previous methods manually parse tool documentation and create in-context…

Computation and Language · Computer Science 2025-03-05 Zhengliang Shi, Shen Gao, Lingyong Yan, Yue Feng, Xiuyi Chen, Zhumin Chen, Dawei Yin, Suzan Verberne, Zhaochun Ren

Agent systems based on large language models (LLMs) have shown great potential in complex reasoning tasks, but building efficient and generalizable workflows remains a major challenge. Most existing approaches rely on manually designed…

Computation and Language · Computer Science 2025-10-01 Yanbo Wang, Zixiang Xu, Yue Huang, Xiangqi Wang, Zirui Song, Lang Gao, Chenxi Wang, Xiangru Tang, Yue Zhao, Arman Cohan, Xiangliang Zhang, Xiuying Chen

Despite their powerful text generation capabilities, large language models (LLMs) still struggle to effectively utilize external tools to solve complex tasks, a challenge known as tool learning. Existing methods primarily rely on supervised…

Computation and Language · Computer Science 2025-08-19 Yuanqing Yu, Zhefan Wang, Weizhi Ma, Shuai Wang, Chuhan Wu, Zhiqiang Guo, Min Zhang

Reinforcement learning (RL) is increasingly used to improve the reasoning, coding, and tool-use capabilities of large language models, but agentic RL remains prohibitively expensive. Scaling RL to agentic LLMs requires supporting complex…

Machine Learning · Computer Science 2026-05-18 Haizhong Zheng, Yizhuo Di, Jiahui Wang, Shuowei Jin, Xueshen Liu, Yongji Wu, Z. Morley Mao, Ion Stoica, Jiawei Zhao, Beidi Chen

Modern software systems require code that is not only functional but also maintainable and well-structured. Although Large Language Models (LLMs) are increasingly used to automate software development, most studies focus on isolated,…

Software Engineering · Computer Science 2025-11-14 Wasique Islam Shafin, Md Nakhla Rafi, Zhenhao Li, Tse-Hsun Chen

We propose integration of reasoning into speech large language models (speechLLMs) for the end-to-end slot-filling task. Inspired by the recent development of reasoning LLMs, we use a chain-of-thought framework to decompose the slot-filling…

Computation and Language · Computer Science 2026-02-04 Kadri Hacioglu, Manjunath K E, Andreas Stolcke