English
Related papers

Related papers: Divide-Then-Aggregate: An Efficient Tool Learning …

200 papers

Tool-augmented large language models (LLMs) leverage tools, often in the form of APIs, to improve their reasoning capabilities on complex tasks. This enables them to act as intelligent agents interacting with the real world. The recently…

Computation and Language · Computer Science 2025-03-24 Sijia Chen , Yibo Wang , Yi-Feng Wu , Qing-Guo Chen , Zhao Xu , Weihua Luo , Kaifu Zhang , Lijun Zhang

Large language models (LLMs) augmented with retrieval systems have significantly advanced natural language processing tasks by integrating external knowledge sources, enabling more accurate and contextually rich responses. To improve the…

Computation and Language · Computer Science 2025-05-28 Xin Sun , Jianan Xie , Zhongqi Chen , Qiang Liu , Shu Wu , Yuehe Chen , Bowen Song , Weiqiang Wang , Zilei Wang , Liang Wang

Large Language Models (LLMs) have demonstrated promising reasoning capabilities in robotics; however, their application in multi-robot systems remains limited, particularly in handling task dependencies. This paper introduces DART-LLM, a…

Tool-calling empowers Large Language Models (LLMs) to interact with external environments. However, current methods often struggle to handle massive and noisy candidate tools in long-context tool-calling tasks, limiting their real-world…

Computation and Language · Computer Science 2026-03-13 Kunfeng Chen , Qihuang Zhong , Juhua Liu , Bo Du , Dacheng Tao

Tree of Thoughts (ToT) enhances Large Language Model (LLM) reasoning by structuring problem-solving as a spanning tree. However, recent methods focus on search accuracy while overlooking computational efficiency. The challenges of…

Artificial Intelligence · Computer Science 2025-02-28 Yifu Ding , Wentao Jiang , Shunyu Liu , Yongcheng Jing , Jinyang Guo , Yingjie Wang , Jing Zhang , Zengmao Wang , Ziwei Liu , Bo Du , Xianglong Liu , Dacheng Tao

Reinforcement learning (RL) based post-training for large language models (LLMs) is computationally expensive, as it generates many rollout sequences that could frequently share long token prefixes. Existing RL frameworks usually process…

Machine Learning · Computer Science 2026-02-03 Jiarui Zhang , Yuchen Yang , Ran Yan , Zhiyu Mei , Liyuan Zhang , Daifeng Li , Wei Fu , Jiaxuan Gao , Shusheng Xu , Yi Wu , Binhang Yuan

Parameter-Efficient Fine-Tuning (PEFT) is essential for adapting Large Language Models (LLMs). In practice, LLMs are often required to handle a diverse set of tasks from multiple domains, a scenario naturally addressed by multi-task…

Computation and Language · Computer Science 2025-08-08 Jinda Liu , Bo Cheng , Yi Chang , Yuan Wu

Recent advancements in Large Language Models (LLMs) have sparked a revolution across many research fields. In robotics, the integration of common-sense knowledge from LLMs into task and motion planning has drastically advanced the field by…

Robotics · Computer Science 2025-04-02 Yuchen Liu , Luigi Palmieri , Sebastian Koch , Ilche Georgievski , Marco Aiello

The emergence of Large Language Models (LLMs) in Multi-Agent Systems (MAS) has opened new possibilities for artificial intelligence, yet current implementations face significant challenges in resource management, task coordination, and…

Multiagent Systems · Computer Science 2025-12-03 Junwei Yu , Yepeng Ding , Hiroyuki Sato

Modern Augmented reality applications require performing multiple tasks on each input frame simultaneously. Multi-task learning (MTL) represents an effective approach where multiple tasks share an encoder to extract representative features…

Computer Vision and Pattern Recognition · Computer Science 2023-04-19 Marina Neseem , Ahmed Agiza , Sherief Reda

The rapid growth of large language model (LLM) services imposes increasing demands on distributed GPU inference infrastructure. Most existing scheduling systems follow a reactive paradigm, relying solely on the current system state to make…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-09-17 Chengze Du , Zhiwei Yu , Heng Xu , Haojie Wang , Bo liu , Jialong Li

Tool learning with foundation models aims to endow AI systems with the ability to invoke external resources -- such as APIs, computational utilities, and specialized models -- to solve complex tasks beyond the reach of standalone language…

Computer Vision and Pattern Recognition · Computer Science 2026-04-17 Gabriele Mattioli , Evelyn Turri , Sara Sarto , Lorenzo Baraldi , Marcella Cornia , Lorenzo Baraldi , Rita Cucchiara

This paper studies close-loop task planning, which refers to the process of generating a sequence of skills (a plan) to accomplish a specific goal while adapting the plan based on real-time observations. Recently, prompting Large Language…

Computation and Language · Computer Science 2024-07-25 Mengkang Hu , Yao Mu , Xinmiao Yu , Mingyu Ding , Shiguang Wu , Wenqi Shao , Qiguang Chen , Bin Wang , Yu Qiao , Ping Luo

Deploying Large Language Models (LLMs) on edge devices remains challenging due to their quadratically increasing computations with the sequence length. Existing studies for dynamic attention pruning are designed for hardware with massively…

Artificial Intelligence · Computer Science 2025-07-29 Jiawen Qi , Chang Gao , Zhaochun Ren , Qinyu Chen

The rapid expansion of web content has made on-device AI assistants indispensable for helping users manage the increasing complexity of online tasks. The emergent reasoning ability in large language models offer a promising path for…

Computation and Language · Computer Science 2025-02-10 Chenyang Shao , Xinyuan Hu , Yutang Lin , Fengli Xu

Tool learning is increasingly important for large language models (LLMs) to effectively coordinate and utilize a diverse set of tools in order to solve complex real-world tasks. By selecting and integrating appropriate tools, LLMs extend…

Machine Learning · Computer Science 2026-01-21 Zheng Fang , Wolfgang Mayer , Zeyu Zhang , Jian Wang , Hong-Yu Zhang , Wanli Li , Zaiwen Feng

Recently, the emergence of large language models (LLMs) has motivated integrating language descriptions into graphs, forming text-attributed graphs (TAGs) that enhance model encoding capabilities from a data-centric perspective. A review of…

Machine Learning · Computer Science 2026-02-03 Zhihan Zhang , Xunkai Li , Lei Zhu , Guang Zeng , Bowen Fan , Yanzhe Wen , Hongchao Qin , Rong-Hua Li , Guoren Wang

Large language models (LLMs) demonstrate strong task-specific capabilities through fine-tuning, but merging multiple fine-tuned models often leads to degraded performance due to overlapping instruction-following components. Task Arithmetic…

Computation and Language · Computer Science 2025-02-28 Yan-Lun Chen , Yi-Ru Wei , Chia-Yi Hsu , Chia-Mu Yu , Chun-Ying Huang , Ying-Dar Lin , Yu-Sung Wu , Wei-Bin Lee

State-of-the-art sequential reasoning in Large Language Models (LLMs) has expanded the capabilities of Copilots beyond conversational tasks to complex function calling, managing thousands of API calls. However, the tendency of compositional…

Programming Languages · Computer Science 2024-05-29 Simranjit Singh , Andreas Karatzas , Michael Fore , Iraklis Anagnostopoulos , Dimitrios Stamoulis

Breakthroughs in the generative AI domain have fueled an explosion of large language model (LLM)-powered applications, whose workloads fundamentally consist of sequences of inferences through transformer architectures. Within this rapidly…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-03-09 Burak Topcu , Musa Oguzhan Cim , Poovaiah Palangappa , Meena Arunachalam , Mahmut Taylan Kandemir
‹ Prev 1 2 3 10 Next ›