Related papers: Divide-Then-Aggregate: An Efficient Tool Learning …

Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees

Tool-augmented large language models (LLMs) leverage tools, often in the form of APIs, to improve their reasoning capabilities on complex tasks. This enables them to act as intelligent agents interacting with the real world. The recently…

Computation and Language · Computer Science 2025-03-24 Sijia Chen, Yibo Wang, Yi-Feng Wu, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, Lijun Zhang

Divide-Then-Align: Honest Alignment based on the Knowledge Boundary of RAG

Large language models (LLMs) augmented with retrieval systems have significantly advanced natural language processing tasks by integrating external knowledge sources, enabling more accurate and contextually rich responses. To improve the…

Computation and Language · Computer Science 2025-05-28 Xin Sun, Jianan Xie, Zhongqi Chen, Qiang Liu, Shu Wu, Yuehe Chen, Bowen Song, Weiqiang Wang, Zilei Wang, Liang Wang

DART-LLM: Dependency-Aware Multi-Robot Task Decomposition and Execution using Large Language Models

Large Language Models (LLMs) have demonstrated promising reasoning capabilities in robotics; however, their application in multi-robot systems remains limited, particularly in handling task dependencies. This paper introduces DART-LLM, a…

Robotics · Computer Science 2025-03-05 Yongdong Wang, Runze Xiao, Jun Younes Louhi Kasahara, Ryosuke Yajima, Keiji Nagatani, Atsushi Yamashita, Hajime Asama

Try, Check and Retry: A Divide-and-Conquer Framework for Boosting Long-context Tool-Calling Performance of LLMs

Tool-calling empowers Large Language Models (LLMs) to interact with external environments. However, current methods often struggle to handle massive and noisy candidate tools in long-context tool-calling tasks, limiting their real-world…

Computation and Language · Computer Science 2026-03-13 Kunfeng Chen, Qihuang Zhong, Juhua Liu, Bo Du, Dacheng Tao

Dynamic Parallel Tree Search for Efficient LLM Reasoning

Tree of Thoughts (ToT) enhances Large Language Model (LLM) reasoning by structuring problem-solving as a spanning tree. However, recent methods focus on search accuracy while overlooking computational efficiency. The challenges of…

Artificial Intelligence · Computer Science 2025-02-28 Yifu Ding, Wentao Jiang, Shunyu Liu, Yongcheng Jing, Jinyang Guo, Yingjie Wang, Jing Zhang, Zengmao Wang, Ziwei Liu, Bo Du, Xianglong Liu, Dacheng Tao

AREAL-DTA: Dynamic Tree Attention for Efficient Reinforcement Learning of Large Language Models

Reinforcement learning (RL) based post-training for large language models (LLMs) is computationally expensive, as it generates many rollout sequences that could frequently share long token prefixes. Existing RL frameworks usually process…

Machine Learning · Computer Science 2026-02-03 Jiarui Zhang, Yuchen Yang, Ran Yan, Zhiyu Mei, Liyuan Zhang, Daifeng Li, Wei Fu, Jiaxuan Gao, Shusheng Xu, Yi Wu, Binhang Yuan

Align, Don't Divide: Revisiting the LoRA Architecture in Multi-Task Learning

Parameter-Efficient Fine-Tuning (PEFT) is essential for adapting Large Language Models (LLMs). In practice, LLMs are often required to handle a diverse set of tasks from multiple domains, a scenario naturally addressed by multi-task…

Computation and Language · Computer Science 2025-08-08 Jinda Liu, Bo Cheng, Yi Chang, Yuan Wu

DELTA: Decomposed Efficient Long-Term Robot Task Planning using Large Language Models

Recent advancements in Large Language Models (LLMs) have sparked a revolution across many research fields. In robotics, the integration of common-sense knowledge from LLMs into task and motion planning has drastically advanced the field by…

Robotics · Computer Science 2025-04-02 Yuchen Liu, Luigi Palmieri, Sebastian Koch, Ilche Georgievski, Marco Aiello

DynTaskMAS: A Dynamic Task Graph-driven Framework for Asynchronous and Parallel LLM-based Multi-Agent Systems

The emergence of Large Language Models (LLMs) in Multi-Agent Systems (MAS) has opened new possibilities for artificial intelligence, yet current implementations face significant challenges in resource management, task coordination, and…

Multiagent Systems · Computer Science 2025-12-03 Junwei Yu, Yepeng Ding, Hiroyuki Sato

AdaMTL: Adaptive Input-dependent Inference for Efficient Multi-Task Learning

Modern augmented reality applications require performing multiple tasks on each input frame simultaneously. Multi-task learning (MTL) represents an effective approach where multiple tasks share an encoder to extract representative features…

Computer Vision and Pattern Recognition · Computer Science 2023-04-19 Marina Neseem, Ahmed Agiza, Sherief Reda

Temporal-Aware GPU Resource Allocation for Distributed LLM Inference via Reinforcement Learning

The rapid growth of large language model (LLM) services imposes increasing demands on distributed GPU inference infrastructure. Most existing scheduling systems follow a reactive paradigm, relying solely on the current system state to make…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-09-17 Chengze Du, Zhiwei Yu, Heng Xu, Haojie Wang, Bo Liu, Jialong Li

RaTA-Tool: Retrieval-based Tool Selection with Multimodal Large Language Models

Tool learning with foundation models aims to endow AI systems with the ability to invoke external resources -- such as APIs, computational utilities, and specialized models -- to solve complex tasks beyond the reach of standalone language…

Computer Vision and Pattern Recognition · Computer Science 2026-04-17 Gabriele Mattioli, Evelyn Turri, Sara Sarto, Lorenzo Baraldi, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

Tree-Planner: Efficient Close-loop Task Planning with Large Language Models

This paper studies close-loop task planning, which refers to the process of generating a sequence of skills (a plan) to accomplish a specific goal while adapting the plan based on real-time observations. Recently, prompting Large Language…

Computation and Language · Computer Science 2024-07-25 Mengkang Hu, Yao Mu, Xinmiao Yu, Mingyu Ding, Shiguang Wu, Wenqi Shao, Qiguang Chen, Bin Wang, Yu Qiao, Ping Luo

DeltaLLM: A Training-Free Framework Exploiting Temporal Sparsity for Efficient Edge LLM Inference

Deploying Large Language Models (LLMs) on edge devices remains challenging due to their quadratically increasing computations with the sequence length. Existing studies for dynamic attention pruning are designed for hardware with massively…

Artificial Intelligence · Computer Science 2025-07-29 Jiawen Qi, Chang Gao, Zhaochun Ren, Qinyu Chen

Division-of-Thoughts: Harnessing Hybrid Language Model Synergy for Efficient On-Device Agents

The rapid expansion of web content has made on-device AI assistants indispensable for helping users manage the increasing complexity of online tasks. The emergent reasoning ability in large language models offers a promising path for…

Computation and Language · Computer Science 2025-02-10 Chenyang Shao, Xinyuan Hu, Yutang Lin, Fengli Xu

MetaToolAgent: Towards Generalizable Tool Usage in LLMs through Meta-Learning

Tool learning is increasingly important for large language models (LLMs) to effectively coordinate and utilize a diverse set of tools in order to solve complex real-world tasks. By selecting and integrating appropriate tools, LLMs extend…

Machine Learning · Computer Science 2026-01-21 Zheng Fang, Wolfgang Mayer, Zeyu Zhang, Jian Wang, Hong-Yu Zhang, Wanli Li, Zaiwen Feng

Unlocking Graph Structure Learning with Tree-Guided Large Language Models

Recently, the emergence of large language models (LLMs) has motivated integrating language descriptions into graphs, forming text-attributed graphs (TAGs) that enhance model encoding capabilities from a data-centric perspective. A review of…

Machine Learning · Computer Science 2026-02-03 Zhihan Zhang, Xunkai Li, Lei Zhu, Guang Zeng, Bowen Fan, Yanzhe Wen, Hongchao Qin, Rong-Hua Li, Guoren Wang

Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge

Large language models (LLMs) demonstrate strong task-specific capabilities through fine-tuning, but merging multiple fine-tuned models often leads to degraded performance due to overlapping instruction-following components. Task Arithmetic…

Computation and Language · Computer Science 2025-02-28 Yan-Lun Chen, Yi-Ru Wei, Chia-Yi Hsu, Chia-Mu Yu, Chun-Ying Huang, Ying-Dar Lin, Yu-Sung Wu, Wei-Bin Lee

An LLM-Tool Compiler for Fused Parallel Function Calling

State-of-the-art sequential reasoning in Large Language Models (LLMs) has expanded the capabilities of Copilots beyond conversational tasks to complex function calling, managing thousands of API calls. However, the tendency of compositional…

Programming Languages · Computer Science 2024-05-29 Simranjit Singh, Andreas Karatzas, Michael Fore, Iraklis Anagnostopoulos, Dimitrios Stamoulis

Parallelization Strategies for Dense LLM Deployment: Navigating Through Application-Specific Tradeoffs and Bottlenecks

Breakthroughs in the generative AI domain have fueled an explosion of large language model (LLM)-powered applications, whose workloads fundamentally consist of sequences of inferences through transformer architectures. Within this rapidly…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-03-09 Burak Topcu, Musa Oguzhan Cim, Poovaiah Palangappa, Meena Arunachalam, Mahmut Taylan Kandemir