Related papers: Autonomous Data Processing using Meta-Agents

From Intent to AI Pipelines: A Controlled Agentic Framework for Non-AI Expert Scientists

Artificial Intelligence (AI) pipelines have become integral to modern research, supporting fields such as Medical Sciences, Agriculture, and Social Sciences, and enabling large-scale data analysis, predictive modeling, and the automation of…

Information Retrieval · Computer Science 2026-05-20 Hyacinth Ali, Jessie Galasso-Carbonnel, Houari Sahraoui

Meta-Agent: From Task Descriptions to Verified Multi-Agent Systems

AI agents are increasingly used to solve complex, multi-step tasks, but existing multi-agent frameworks remain brittle as workflows grow in scale and depth. Small errors at intermediate stages can propagate through agent interactions, while…

Artificial Intelligence · Computer Science 2026-05-26 Andy Xu, Yu-Wing Tai

Data Agent: A Holistic Architecture for Orchestrating Data+AI Ecosystems

Traditional Data+AI systems utilize data-driven techniques to optimize performance, but they rely heavily on human experts to orchestrate system pipelines, enabling them to adapt to changes in data, queries, tasks, and environments. For…

Databases · Computer Science 2025-07-03 Zhaoyan Sun, Jiayi Wang, Xinyang Zhao, Jiachi Wang, Guoliang Li

Adaptive Multimodal Agents-Based Framework for Automatic Workflow Execution

Modern information systems require autonomous agents capable of navigating complex workflows, yet current methodologies often struggle with the transition from structured metadata parsing to general environmental perception. While the…

Artificial Intelligence · Computer Science 2026-05-28 Susanna Cifani, Mario Luca Bernardi, Marta Cimitile

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Public research results on large-scale supervised finetuning of AI agents remain relatively rare, since the collection of agent training data presents unique challenges. In this work, we argue that the bottleneck is not a lack of underlying…

Computation and Language · Computer Science 2026-03-05 Yueqi Song, Ketan Ramaneti, Zaid Sheikh, Ziru Chen, Boyu Gou, Tianbao Xie, Yiheng Xu, Danyang Zhang, Apurva Gandhi, Fan Yang, Joseph Liu, Tianyue Ou, Zhihao Yuan, Frank Xu, Shuyan Zhou, Xingyao Wang, Xiang Yue, Tao Yu, Huan Sun, Yu Su, Graham Neubig

DeepPrep: An LLM-Powered Agentic System for Autonomous Data Preparation

Data preparation, which aims to transform heterogeneous and noisy raw tables into analysis-ready data, remains a major bottleneck in data science. Recent approaches leverage large language models (LLMs) to automate data preparation from…

Databases · Computer Science 2026-02-10 Meihao Fan, Ju Fan, Yuxin Zhang, Shaolei Zhang, Xiaoyong Du, Jie Song, Peng Li, Fuxin Jiang, Tieying Zhang, Jianjun Chen

Governing Cloud Data Pipelines with Agentic AI

Cloud data pipelines increasingly operate under dynamic workloads, evolving schemas, cost constraints, and strict governance requirements. Despite advances in cloud-native orchestration frameworks, most production pipelines rely on static…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-01-01 Aswathnarayan Muthukrishnan Kirubakaran, Adithya Parthasarathy, Nitin Saksena, Ram Sekhar Bodala, Akshay Deshpande, Suhas Malempati, Shiva Carimireddy, Abhirup Mazumder

The Importance of Out-of-Band Metadata for Safe Autonomous Agents: The Redpanda Agentic Data Plane

AI agents are increasingly expected to operate as digital employees: accessing enterprise data, making decisions, and taking actions autonomously. But agents are simultaneously less predictable than humans -- prone to hallucination,…

Artificial Intelligence · Computer Science 2026-05-29 Tyler Akidau, Tyler Rockwood, Johannes Brüderl, Marc Millstone

A Multi-AI Agent System for Autonomous Optimization of Agentic AI Solutions via Iterative Refinement and LLM-Driven Feedback Loops

Agentic AI systems use specialized agents to handle tasks within complex workflows, enabling automation and efficiency. However, optimizing these systems often requires labor-intensive, manual adjustments to refine roles, tasks, and…

Computation and Language · Computer Science 2024-12-24 Kamer Ali Yuksel, Hassan Sawaf

Towards Next Generation Data Engineering Pipelines

Data engineering pipelines are a widespread way to provide high-quality data for all kinds of data science applications. However, numerous challenges still remain in the composition and operation of such pipelines. Data engineering…

Databases · Computer Science 2025-07-30 Kevin M. Kramer, Valerie Restat, Sebastian Strasser, Uta Störl, Meike Klettke

TO-Agents: A Multi-Agent AI Pipeline for Preference-Guided Topology Optimization

Topology optimization can generate efficient structures, but designers often must manually translate qualitative intent, such as desired visual style, product experience, or manufacturability into solver settings that are not directly tied…

Artificial Intelligence · Computer Science 2026-05-22 Isabella A. Stewart, Hongrui Chen, Faez Ahmed

Autonomous Data Agents: A New Opportunity for Smart Data

As data continues to grow in scale and complexity, preparing, transforming, and analyzing it remains labor-intensive, repetitive, and difficult to scale. Since data contains knowledge and AI learns knowledge from it, the alignment between…

Artificial Intelligence · Computer Science 2025-10-07 Yanjie Fu, Dongjie Wang, Wangyang Ying, Xinyuan Wang, Xiangliang Zhang, Huan Liu, Jian Pei

MADP: A Multi-Agent Pipeline for Sustainable Document Processing with Human-in-the-Loop

Document processing automation remains a critical challenge in enterprise environments, where traditional manual approaches are labor-intensive and error-prone. We present MADP, a multi-agent architecture that addresses the challenge of…

Artificial Intelligence · Computer Science 2026-05-19 Diego Gosmar, Giovanni Zenezini

CoDA: Agentic Systems for Collaborative Data Visualization

Deep research has revolutionized data analysis, yet data scientists still devote substantial time to manually crafting visualizations, highlighting the need for robust automation from natural language queries. However, current systems…

Artificial Intelligence · Computer Science 2025-10-06 Zichen Chen, Jiefeng Chen, Sercan Ö. Arik, Misha Sra, Tomas Pfister, Jinsung Yoon

Simple Agents Outperform Experts in Biomedical Imaging Workflow Optimization

Adapting production-level computer vision tools to bespoke scientific datasets is a critical "last mile" bottleneck. Current solutions are impractical: fine-tuning requires large annotated datasets scientists often lack, while manual code…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Xuefei Wang, Kai A. Horstmann, Ethan Lin, Jonathan Chen, Alexander R. Farhang, Sophia Stiles, Atharva Sehgal, Jonathan Light, David Van Valen, Yisong Yue, Jennifer J. Sun

PE-MA: Parameter-Efficient Co-Evolution of Multi-Agent Systems

Multi-Agent Systems have recently emerged as a promising paradigm for collaborative reasoning and solving complex tasks. However, the design of collaborative learning algorithms in multi-agent systems faces several challenges, including…

Multiagent Systems · Computer Science 2025-08-27 Yingfan Deng, Anhao Zhou, Yuan Yuan, Xiao Zhang, Yifei Zou, Dongxiao Yu

LLM-AutoDP: Automatic Data Processing via LLM Agents for Model Fine-tuning

Large Language Models (LLMs) can be fine-tuned on domain-specific data to enhance their performance in specialized fields. However, such data often contains numerous low-quality samples, necessitating effective data processing (DP). In…

Machine Learning · Computer Science 2026-05-08 Wei Huang, Anda Cheng, Yinggui Wang, Lei Wang, Tao Wei

GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning

Autonomous agents powered by large language models (LLMs) have shown impressive capabilities in tool manipulation for complex task-solving. However, existing paradigms such as ReAct rely on sequential reasoning and execution, failing to…

Artificial Intelligence · Computer Science 2025-10-30 Jiaqi Wu, Qinlao Zhao, Zefeng Chen, Kai Qin, Yifei Zhao, Xueqian Wang, Yuhang Yao

Automated data processing and feature engineering for deep learning and big data applications: a survey

Modern approach to artificial intelligence (AI) aims to design algorithms that learn directly from data. This approach has achieved impressive results and has contributed significantly to the progress of AI, particularly in the sphere of…

Machine Learning · Computer Science 2024-03-20 Alhassan Mumuni, Fuseini Mumuni

Autonomous Deep Agent

This technical brief introduces Deep Agent, an advanced autonomous AI system designed to manage complex multi-phase tasks through a novel hierarchical task management architecture. The system's foundation is built on our Hierarchical Task…

Artificial Intelligence · Computer Science 2025-02-12 Amy Yu, Erik Lebedev, Lincoln Everett, Xiaoxin Chen, Terry Chen