Related papers: TaskGen: A Task-Based, Memory-Infused Agentic Fram…

TabAgent: A Framework for Replacing Agentic Generative Components with Tabular-Textual Classifiers

Agentic systems, AI architectures that autonomously execute multi-step workflows to achieve complex goals, are often built using repeated large language model (LLM) calls for closed-set decision tasks such as routing, shortlisting, gating,…

Computation and Language · Computer Science 2026-02-19 Ido Levy , Eilam Shapira , Yinon Goldshtein , Avi Yaeli , Nir Mashkif , Segev Shlomov

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

AutoGen is an open-source framework that allows developers to build LLM applications via multiple agents that can converse with each other to accomplish tasks. AutoGen agents are customizable, conversable, and can operate in various modes…

Artificial Intelligence · Computer Science 2023-10-05 Qingyun Wu , Gagan Bansal , Jieyu Zhang , Yiran Wu , Beibin Li , Erkang Zhu , Li Jiang , Xiaoyun Zhang , Shaokun Zhang , Jiale Liu , Ahmed Hassan Awadallah , Ryen W White , Doug Burger , Chi Wang

LOGIGEN: Logic-Driven Generation of Verifiable Agentic Tasks

The evolution of Large Language Models (LLMs) from static instruction-followers to autonomous agents necessitates operating within complex, stateful environments to achieve precise state-transition objectives. However, this paradigm is…

Artificial Intelligence · Computer Science 2026-03-03 Yucheng Zeng , Weipeng Lu , Linyun Liu , Shupeng Li , Zitian Qu , Chenghao Zhu , Shaofei Li , Zhengdong Tan , Mengyue Liu , Haotian Zhao , Zhe Zhou , Jianmin Wu

EffGen: Enabling Small Language Models as Capable Autonomous Agents

Most existing language model agentic systems today are built and optimized for large language models (e.g., GPT, Claude, Gemini) via API calls. While powerful, this approach faces several limitations including high token costs and privacy…

Computation and Language · Computer Science 2026-02-03 Gaurav Srivastava , Aafiya Hussain , Chi Wang , Yingyan Celine Lin , Xuan Wang

WorkflowGen:an adaptive workflow generation mechanism driven by trajectory experience

Large language model (LLM) agents often suffer from high reasoning overhead, excessive token consumption, unstable execution, and inability to reuse past experiences in complex tasks like business queries, tool use, and workflow…

Machine Learning · Computer Science 2026-04-23 Ruocan Wei , Shufeng Wang , Ziwei Shi

STRUCTSENSE: A Task-Agnostic Agentic Framework for Structured Information Extraction with Human-In-The-Loop Evaluation and Benchmarking

Extracting structured information from scientific literature is critical for accelerating discovery, yet Large Language Models (LLMs) often struggle in specialized domains that require expert knowledge and generalize poorly across tasks. We…

Computation and Language · Computer Science 2026-05-22 Tek Raj Chhetri , Yibei Chen , Puja Trivedi , Dorota Jarecka , Saif Haobsh , Patrick Ray , Lydia Ng , Satrajit S. Ghosh

RA-Gen: A Controllable Code Generation Framework Using ReAct for Multi-Agent Task Execution

Code generation models based on large language models (LLMs) have gained wide adoption, but challenges remain in ensuring safety, accuracy, and controllability, especially for complex tasks. Existing methods often lack dynamic integration…

Software Engineering · Computer Science 2025-10-13 Aofan Liu , Haoxuan Li , Bin Wang , Ao Yang , Hui Li

TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework

Retrieval-Augmented Generation (RAG) utilizes external knowledge to augment Large Language Models' (LLMs) reliability. For flexibility, agentic RAG employs autonomous, multi-round retrieval and reasoning to resolve queries. Although recent…

Information Retrieval · Computer Science 2025-11-10 Chao Zhang , Yuhao Wang , Derong Xu , Haoxin Zhang , Yuanjie Lyu , Yuhao Chen , Shuochen Liu , Tong Xu , Xiangyu Zhao , Yan Gao , Yao Hu , Enhong Chen

Towards Reliable Benchmarking: A Contamination Free, Controllable Evaluation Framework for Multi-step LLM Function Calling

Existing benchmarks for tool-augmented language models (TaLMs) lack fine-grained control over task difficulty and remain vulnerable to data contamination. We present FuncBenchGen, a unified, contamination-free framework that evaluates TaLMs…

Computation and Language · Computer Science 2026-02-10 Seiji Maekawa , Jackson Hassell , Pouya Pezeshkpour , Tom Mitchell , Estevam Hruschka

From Failure to Mastery: Generating Hard Samples for Tool-use Agents

The advancement of LLM agents with tool-use capabilities requires diverse and complex training corpora. Existing data generation methods, which predominantly follow a paradigm of random sampling and shallow generation, often yield simple…

Computation and Language · Computer Science 2026-01-06 Bingguang Hao , Zengzhuang Xu , Yuntao Wen , Xinyi Xu , Yang Liu , Tong Zhao , Maolin Wang , Long Chen , Dong Wang , Yicheng Chen , Cunyin Peng , Xiangyu Zhao , Chenyi Zhuang , Ji Zhang

TDAG: A Multi-Agent Framework based on Dynamic Task Decomposition and Agent Generation

The emergence of Large Language Models (LLMs) like ChatGPT has inspired the development of LLM-based agents capable of addressing complex, real-world tasks. However, these agents often struggle during task execution due to methodological…

Computation and Language · Computer Science 2025-01-22 Yaoxiang Wang , Zhiyong Wu , Junfeng Yao , Jinsong Su

NetGent: Agent-Based Automation of Network Application Workflows

We present NetGent, an AI-agent framework for automating complex application workflows to generate realistic network traffic datasets. Developing generalizable ML models for networking requires data collection from network environments with…

Artificial Intelligence · Computer Science 2025-11-17 Jaber Daneshamooz , Eugene Vuong , Laasya Koduru , Sanjay Chandrasekaran , Arpit Gupta

PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving

Recent agent frameworks and inference-time algorithms often struggle with complex planning problems due to limitations in verifying generated plans or reasoning and varying complexity of instances within a single task. Many existing methods…

Artificial Intelligence · Computer Science 2025-02-25 Mihir Parmar , Xin Liu , Palash Goyal , Yanfei Chen , Long Le , Swaroop Mishra , Hossein Mobahi , Jindong Gu , Zifeng Wang , Hootan Nakhost , Chitta Baral , Chen-Yu Lee , Tomas Pfister , Hamid Palangi

ATGen: A Framework for Active Text Generation

Active learning (AL) has demonstrated remarkable potential in reducing the annotation effort required for training machine learning models. However, despite the surging popularity of natural language generation (NLG) tasks in recent years,…

Computation and Language · Computer Science 2025-07-01 Akim Tsvigun , Daniil Vasilev , Ivan Tsvigun , Ivan Lysenko , Talgat Bektleuov , Aleksandr Medvedev , Uliana Vinogradova , Nikita Severin , Mikhail Mozikov , Andrey Savchenko , Rostislav Grigorev , Ramil Kuleev , Fedor Zhdanov , Artem Shelmanov , Ilya Makarov

Adaptive Procedural Task Generation for Hard-Exploration Problems

We introduce Adaptive Procedural Task Generation (APT-Gen), an approach to progressively generate a sequence of tasks as curricula to facilitate reinforcement learning in hard-exploration problems. At the heart of our approach, a task…

Machine Learning · Computer Science 2021-03-19 Kuan Fang , Yuke Zhu , Silvio Savarese , Li Fei-Fei

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Executing complex terminal tasks remains a significant challenge for open-weight LLMs, constrained by two fundamental limitations. First, high-fidelity, executable training environments are scarce: environments synthesized from real-world…

Artificial Intelligence · Computer Science 2026-02-10 Kaijie Zhu , Yuzhou Nie , Yijiang Li , Yiming Huang , Jialian Wu , Jiang Liu , Ximeng Sun , Zhenfei Yin , Lun Wang , Zicheng Liu , Emad Barsoum , William Yang Wang , Wenbo Guo

AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation

Large Language Model-based agents have garnered significant attention and are becoming increasingly popular. Furthermore, planning ability is a crucial component of an LLM-based agent, which generally entails achieving a desired goal from…

Computation and Language · Computer Science 2025-02-07 Mengkang Hu , Pu Zhao , Can Xu , Qingfeng Sun , Jianguang Lou , Qingwei Lin , Ping Luo , Saravan Rajmohan

AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction

The relation extraction (RE) in complex scenarios faces challenges such as diverse relation types and ambiguous relations between entities within a single sentence, leading to the poor performance of pure "text-in, text-out" language models…

Computation and Language · Computer Science 2024-09-04 Yuchen Shi , Guochao Jiang , Tian Qiu , Deqing Yang

LogicEnvGen: Task-Logic Driven Generation of Diverse Simulated Environments for Embodied AI

Simulated environments play an essential role in embodied AI, functionally analogous to test cases in software engineering. However, existing environment generation methods often emphasize visual realism (e.g., object diversity and layout…

Robotics · Computer Science 2026-01-21 Jianan Wang , Siyang Zhang , Bin Li , Juan Chen , Jingtao Qi , Zhuo Zhang , Chen Qian

MetaGen: Self-Evolving Roles and Topologies for Multi-Agent LLM Reasoning

Large language models are increasingly deployed as multi-agent systems, where specialized roles communicate and collaborate through structured interactions to solve complex tasks that often exceed the capacity of a single agent. However,…

Computation and Language · Computer Science 2026-01-28 Yimeng Wang , Jiaxing Zhao , Hongbin Xie , Hexing Ma , Yuzhen Lei , Shuangxue Liu , Xuan Song , Zichen Zhang , Haoran Zhang