Related papers: StateFlow: Enhancing LLM Task-Solving through Stat…

HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows

Despite recent advancements in large language models (LLMs), their performance on complex reasoning problems requiring multi-step thinking and combining various skills is still limited. To address this, we propose a novel framework HDFlow…

Computation and Language · Computer Science 2024-09-27 Wenlin Yao , Haitao Mi , Dong Yu

StateAct: Enhancing LLM Base Agents via Self-prompting and State-tracking

Large language models (LLMs) are increasingly used as autonomous agents, tackling tasks from robotics to web navigation. Their performance depends on the underlying base agent. Existing methods, however, struggle with long-context reasoning…

Artificial Intelligence · Computer Science 2025-04-09 Nikolai Rozanov , Marek Rei

DecisionFlow: Advancing Large Language Model as Principled Decision Maker

In high-stakes domains such as healthcare and finance, effective decision-making demands not just accurate outcomes but transparent and explainable reasoning. However, current language models often lack the structured deliberation needed…

Computation and Language · Computer Science 2025-08-26 Xiusi Chen , Shanyong Wang , Cheng Qian , Hongru Wang , Peixuan Han , Heng Ji

Simulation Streams: A Programming Paradigm for Controlling Large Language Models and Building Complex Systems with Generative AI

We introduce Simulation Streams, a programming paradigm designed to efficiently control and leverage Large Language Models (LLMs) for complex, dynamic simulations and agentic workflows. Our primary goal is to create a minimally interfering…

Artificial Intelligence · Computer Science 2025-02-03 Peter Sunehag , Joel Z. Leibo

RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models

In an Information Retrieval (IR) system, reranking plays a critical role by sorting candidate passages according to their relevance to a specific query. This process demands a nuanced understanding of the variations among passages linked to…

Information Retrieval · Computer Science 2026-04-20 Can Jin , Hongwu Peng , Anxiang Zhang , Nuo Chen , Jiahui Zhao , Xi Xie , Kuangzheng Li , Shuya Feng , Kai Zhong , Caiwen Ding , Dimitris N. Metaxas

RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models

Reinforcement learning (RL) shows promise for enhancing LLM agentic reasoning, yet sparse terminal rewards hinder fine-grained optimization. Process reward modeling offers an alternative but incurs high computational costs, reward hacking…

Artificial Intelligence · Computer Science 2026-05-29 Xiao Feng , Bo Han , Zhanke Zhou , Jiaqi Fan , Jiangchao Yao , Ka Ho Li , Dahai Yu , Michael Kwok-Po Ng

SHERPA: A Model-Driven Framework for Large Language Model Execution

Recently, large language models (LLMs) have achieved widespread application across various fields. Despite their impressive capabilities, LLMs suffer from a lack of structured reasoning ability, particularly for complex tasks requiring…

Artificial Intelligence · Computer Science 2025-09-03 Boqi Chen , Kua Chen , José Antonio Hernández López , Gunter Mussbacher , Dániel Varró , Amir Feizpour

LawFlow: Collecting and Simulating Lawyers' Thought Processes on Business Formation Case Studies

Legal practitioners, particularly those early in their careers, face complex, high-stakes tasks that require adaptive, context-sensitive reasoning. While AI holds promise in supporting legal work, current datasets and models are narrowly…

Computation and Language · Computer Science 2025-09-04 Debarati Das , Khanh Chi Le , Ritik Sachin Parkar , Karin De Langis , Brendan Madson , Chad M. Berryman , Robin M. Willis , Daniel H. Moses , Brett McDonnell , Daniel Schwarcz , Dongyeop Kang

A State-Transition Framework for Efficient LLM Reasoning

While Long Chain-of-Thought (CoT) reasoning significantly improves Large Language Models (LLMs) performance on complex reasoning tasks, the substantial computational and memory costs of generating long CoT sequences limit their efficiency…

Artificial Intelligence · Computer Science 2026-02-03 Liang Zhang , Yu Zhao , Longyue Wang , Tianqi Shi , Weihua Luo , Kaifu Zhang , Jinsong Su

Scaling LLM Planning: NL2FLOW for Parametric Problem Generation and Rigorous Evaluation

Robust workflow composition is critical for effective agent performance, yet progress in Large Language Model (LLM) planning and reasoning is hindered by a scarcity of scalable evaluation data. This work introduces NL2Flow, a fully…

Artificial Intelligence · Computer Science 2025-10-16 Jungkoo Kang

AutoFlow: Automated Workflow Generation for Large Language Model Agents

Recent advancements in Large Language Models (LLMs) have shown significant progress in understanding complex natural language. One important application of LLM is LLM-based AI Agent, which leverages the ability of LLM as well as external…

Computation and Language · Computer Science 2024-07-19 Zelong Li , Shuyuan Xu , Kai Mei , Wenyue Hua , Balaji Rama , Om Raheja , Hao Wang , He Zhu , Yongfeng Zhang

SmartFlow: Robotic Process Automation using LLMs

Robotic Process Automation (RPA) systems face challenges in handling complex processes and diverse screen layouts that require advanced human-like decision-making capabilities. These systems typically rely on pixel-level encoding through…

Robotics · Computer Science 2024-05-22 Arushi Jain , Shubham Paliwal , Monika Sharma , Lovekesh Vig , Gautam Shroff

ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis

Supervised fine-tuning (SFT) is a common method to enhance the tool calling capabilities of Large Language Models (LLMs), with the training data often being synthesized. The current data synthesis process generally involves sampling a set…

Computation and Language · Computer Science 2025-03-18 Zezhong Wang , Xingshan Zeng , Weiwen Liu , Liangyou Li , Yasheng Wang , Lifeng Shang , Xin Jiang , Qun Liu , Kam-Fai Wong

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

The rapidly growing demand for high-quality data in Large Language Models (LLMs) has intensified the need for scalable, reliable, and semantically rich data preparation pipelines. However, current practices remain dominated by ad-hoc…

Machine Learning · Computer Science 2025-12-19 Hao Liang , Xiaochen Ma , Zhou Liu , Zhen Hao Wong , Zhengyang Zhao , Zimo Meng , Runming He , Chengyu Shen , Qifeng Cai , Zhaoyang Han , Meiyi Qiang , Yalin Feng , Tianyi Bai , Zewei Pan , Ziyi Guo , Yizhen Jiang , Jingwen Deng , Qijie You , Peichao Lai , Tianyu Guo , Chi Hsu Tsai , Hengyi Feng , Rui Hu , Wenkai Yu , Junbo Niu , Bohan Zeng , Ruichuan An , Lu Ma , Jihao Huang , Yaowei Zheng , Conghui He , Linpeng Tang , Bin Cui , Weinan E , Wentao Zhang

Tool Learning in the Wild: Empowering Language Models as Automatic Tool Agents

Augmenting large language models (LLMs) with external tools has emerged as a promising approach to extend their utility, enabling them to solve practical tasks. Previous methods manually parse tool documentation and create in-context…

Computation and Language · Computer Science 2025-03-05 Zhengliang Shi , Shen Gao , Lingyong Yan , Yue Feng , Xiuyi Chen , Zhumin Chen , Dawei Yin , Suzan Verberne , Zhaochun Ren

DyFlow: Dynamic Workflow Framework for Agentic Reasoning

Agent systems based on large language models (LLMs) have shown great potential in complex reasoning tasks, but building efficient and generalizable workflows remains a major challenge. Most existing approaches rely on manually designed…

Computation and Language · Computer Science 2025-10-01 Yanbo Wang , Zixiang Xu , Yue Huang , Xiangqi Wang , Zirui Song , Lang Gao , Chenxi Wang , Xiangru Tang , Yue Zhao , Arman Cohan , Xiangliang Zhang , Xiuying Chen

StepTool: Enhancing Multi-Step Tool Usage in LLMs via Step-Grained Reinforcement Learning

Despite their powerful text generation capabilities, large language models (LLMs) still struggle to effectively utilize external tools to solve complex tasks, a challenge known as tool learning. Existing methods primarily rely on supervised…

Computation and Language · Computer Science 2025-08-19 Yuanqing Yu , Zhefan Wang , Weizhi Ma , Shuai Wang , Chuhan Wu , Zhiqiang Guo , Min Zhang

AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs

Reinforcement learning (RL) is increasingly used to improve the reasoning, coding, and tool-use capabilities of large language models, but agentic RL remains prohibitively expensive. Scaling RL to agentic LLMs requires supporting complex…

Machine Learning · Computer Science 2026-05-18 Haizhong Zheng , Yizhuo Di , Jiahui Wang , Shuowei Jin , Xueshen Liu , Yongji Wu , Z. Morley Mao , Ion Stoica , Jiawei Zhao , Beidi Chen

Evaluating Software Process Models for Multi-Agent Class-Level Code Generation

Modern software systems require code that is not only functional but also maintainable and well-structured. Although Large Language Models (LLMs) are increasingly used to automate software development, most studies focus on isolated,…

Software Engineering · Computer Science 2025-11-14 Wasique Islam Shafin , Md Nakhla Rafi , Zhenhao Li , Tse-Hsun Chen

Slot Filling as a Reasoning Task for SpeechLLMs

We propose integration of reasoning into speech large language models (speechLLMs) for the end-to-end slot-filling task. Inspired by the recent development of reasoning LLMs, we use a chain-of-thought framework to decompose the slot-filling…

Computation and Language · Computer Science 2026-02-04 Kadri Hacioglu , Manjunath K E , Andreas Stolcke