English
Related papers

Related papers: TaskGen: A Task-Based, Memory-Infused Agentic Fram…

200 papers

Agentic systems, AI architectures that autonomously execute multi-step workflows to achieve complex goals, are often built using repeated large language model (LLM) calls for closed-set decision tasks such as routing, shortlisting, gating,…

Computation and Language · Computer Science 2026-02-19 Ido Levy , Eilam Shapira , Yinon Goldshtein , Avi Yaeli , Nir Mashkif , Segev Shlomov

AutoGen is an open-source framework that allows developers to build LLM applications via multiple agents that can converse with each other to accomplish tasks. AutoGen agents are customizable, conversable, and can operate in various modes…

The evolution of Large Language Models (LLMs) from static instruction-followers to autonomous agents necessitates operating within complex, stateful environments to achieve precise state-transition objectives. However, this paradigm is…

Artificial Intelligence · Computer Science 2026-03-03 Yucheng Zeng , Weipeng Lu , Linyun Liu , Shupeng Li , Zitian Qu , Chenghao Zhu , Shaofei Li , Zhengdong Tan , Mengyue Liu , Haotian Zhao , Zhe Zhou , Jianmin Wu

Most existing language model agentic systems today are built and optimized for large language models (e.g., GPT, Claude, Gemini) via API calls. While powerful, this approach faces several limitations including high token costs and privacy…

Computation and Language · Computer Science 2026-02-03 Gaurav Srivastava , Aafiya Hussain , Chi Wang , Yingyan Celine Lin , Xuan Wang

Large language model (LLM) agents often suffer from high reasoning overhead, excessive token consumption, unstable execution, and inability to reuse past experiences in complex tasks like business queries, tool use, and workflow…

Machine Learning · Computer Science 2026-04-23 Ruocan Wei , Shufeng Wang , Ziwei Shi

Extracting structured information from scientific literature is critical for accelerating discovery, yet Large Language Models (LLMs) often struggle in specialized domains that require expert knowledge and generalize poorly across tasks. We…

Computation and Language · Computer Science 2026-05-22 Tek Raj Chhetri , Yibei Chen , Puja Trivedi , Dorota Jarecka , Saif Haobsh , Patrick Ray , Lydia Ng , Satrajit S. Ghosh

Code generation models based on large language models (LLMs) have gained wide adoption, but challenges remain in ensuring safety, accuracy, and controllability, especially for complex tasks. Existing methods often lack dynamic integration…

Software Engineering · Computer Science 2025-10-13 Aofan Liu , Haoxuan Li , Bin Wang , Ao Yang , Hui Li

Retrieval-Augmented Generation (RAG) utilizes external knowledge to augment Large Language Models' (LLMs) reliability. For flexibility, agentic RAG employs autonomous, multi-round retrieval and reasoning to resolve queries. Although recent…

Information Retrieval · Computer Science 2025-11-10 Chao Zhang , Yuhao Wang , Derong Xu , Haoxin Zhang , Yuanjie Lyu , Yuhao Chen , Shuochen Liu , Tong Xu , Xiangyu Zhao , Yan Gao , Yao Hu , Enhong Chen

Existing benchmarks for tool-augmented language models (TaLMs) lack fine-grained control over task difficulty and remain vulnerable to data contamination. We present FuncBenchGen, a unified, contamination-free framework that evaluates TaLMs…

Computation and Language · Computer Science 2026-02-10 Seiji Maekawa , Jackson Hassell , Pouya Pezeshkpour , Tom Mitchell , Estevam Hruschka

The advancement of LLM agents with tool-use capabilities requires diverse and complex training corpora. Existing data generation methods, which predominantly follow a paradigm of random sampling and shallow generation, often yield simple…

The emergence of Large Language Models (LLMs) like ChatGPT has inspired the development of LLM-based agents capable of addressing complex, real-world tasks. However, these agents often struggle during task execution due to methodological…

Computation and Language · Computer Science 2025-01-22 Yaoxiang Wang , Zhiyong Wu , Junfeng Yao , Jinsong Su

We present NetGent, an AI-agent framework for automating complex application workflows to generate realistic network traffic datasets. Developing generalizable ML models for networking requires data collection from network environments with…

Artificial Intelligence · Computer Science 2025-11-17 Jaber Daneshamooz , Eugene Vuong , Laasya Koduru , Sanjay Chandrasekaran , Arpit Gupta

Recent agent frameworks and inference-time algorithms often struggle with complex planning problems due to limitations in verifying generated plans or reasoning and varying complexity of instances within a single task. Many existing methods…

Active learning (AL) has demonstrated remarkable potential in reducing the annotation effort required for training machine learning models. However, despite the surging popularity of natural language generation (NLG) tasks in recent years,…

We introduce Adaptive Procedural Task Generation (APT-Gen), an approach to progressively generate a sequence of tasks as curricula to facilitate reinforcement learning in hard-exploration problems. At the heart of our approach, a task…

Machine Learning · Computer Science 2021-03-19 Kuan Fang , Yuke Zhu , Silvio Savarese , Li Fei-Fei

Executing complex terminal tasks remains a significant challenge for open-weight LLMs, constrained by two fundamental limitations. First, high-fidelity, executable training environments are scarce: environments synthesized from real-world…

Artificial Intelligence · Computer Science 2026-02-10 Kaijie Zhu , Yuzhou Nie , Yijiang Li , Yiming Huang , Jialian Wu , Jiang Liu , Ximeng Sun , Zhenfei Yin , Lun Wang , Zicheng Liu , Emad Barsoum , William Yang Wang , Wenbo Guo

Large Language Model-based agents have garnered significant attention and are becoming increasingly popular. Furthermore, planning ability is a crucial component of an LLM-based agent, which generally entails achieving a desired goal from…

Computation and Language · Computer Science 2025-02-07 Mengkang Hu , Pu Zhao , Can Xu , Qingfeng Sun , Jianguang Lou , Qingwei Lin , Ping Luo , Saravan Rajmohan

The relation extraction (RE) in complex scenarios faces challenges such as diverse relation types and ambiguous relations between entities within a single sentence, leading to the poor performance of pure "text-in, text-out" language models…

Computation and Language · Computer Science 2024-09-04 Yuchen Shi , Guochao Jiang , Tian Qiu , Deqing Yang

Simulated environments play an essential role in embodied AI, functionally analogous to test cases in software engineering. However, existing environment generation methods often emphasize visual realism (e.g., object diversity and layout…

Robotics · Computer Science 2026-01-21 Jianan Wang , Siyang Zhang , Bin Li , Juan Chen , Jingtao Qi , Zhuo Zhang , Chen Qian

Large language models are increasingly deployed as multi-agent systems, where specialized roles communicate and collaborate through structured interactions to solve complex tasks that often exceed the capacity of a single agent. However,…

Computation and Language · Computer Science 2026-01-28 Yimeng Wang , Jiaxing Zhao , Hongbin Xie , Hexing Ma , Yuzhen Lei , Shuangxue Liu , Xuan Song , Zichen Zhang , Haoran Zhang
‹ Prev 1 2 3 10 Next ›