Related papers: ProfiliTable: Profiling-Driven Tabular Data Proces…

TableMind: An Autonomous Programmatic Agent for Tool-Augmented Table Reasoning

Table reasoning requires models to jointly perform comprehensive semantic understanding and precise numerical operations. Although recent large language model (LLM)-based methods have achieved promising results, most of them still rely on a…

Artificial Intelligence · Computer Science 2025-12-23 Chuang Jiang , Mingyue Cheng , Xiaoyu Tao , Qingyang Mao , Jie Ouyang , Qi Liu

Agentic CLEAR: Automating Multi-Level Evaluation of LLM Agents

Agentic systems are becoming more capable: agents define strategies, take actions, and interact with different environments. This autonomy poses serious challenges for overseeing and assessing agent behavior. Most current tools are limited,…

Computation and Language · Computer Science 2026-05-22 Asaf Yehudai , Lilach Eden , Michal Shmueli-Scheuer

AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions

Data science tasks involving tabular data present complex challenges that require sophisticated problem-solving approaches. We propose AutoKaggle, a powerful and user-centric framework that assists data scientists in completing daily data…

Artificial Intelligence · Computer Science 2024-11-07 Ziming Li , Qianbo Zang , David Ma , Jiawei Guo , Tuney Zheng , Minghao Liu , Xinyao Niu , Yue Wang , Jian Yang , Jiaheng Liu , Wanjun Zhong , Wangchunshu Zhou , Wenhao Huang , Ge Zhang

A Multi-AI Agent System for Autonomous Optimization of Agentic AI Solutions via Iterative Refinement and LLM-Driven Feedback Loops

Agentic AI systems use specialized agents to handle tasks within complex workflows, enabling automation and efficiency. However, optimizing these systems often requires labor-intensive, manual adjustments to refine roles, tasks, and…

Computation and Language · Computer Science 2024-12-24 Kamer Ali Yuksel , Hassan Sawaf

LLM driven Text-to-Table Generation through Sub-Tasks Guidance and Iterative Refinement

Transforming unstructured text into structured data is a complex task, requiring semantic understanding, reasoning, and structural comprehension. While Large Language Models (LLMs) offer potential, they often struggle with handling…

Computation and Language · Computer Science 2025-08-13 Rajmohan C , Sarthak Harne , Arvind Agarwal

FABRIC: Framework for Agent-Based Realistic Intelligence Creation

Large language models (LLMs) are increasingly deployed as agents, expected to decompose goals, invoke tools, and verify results in dynamic environments. Realizing these capabilities requires access to agentic data-structured interaction…

Artificial Intelligence · Computer Science 2025-10-22 Abhigya Verma , Seganrasan Subramanian , Nandhakumar Kandasamy , Naman Gupta

AgenticControl: An Automated Control Design Framework Using Large Language Models

Traditional control system design, reliant on expert knowledge and precise models, struggles with complex, nonlinear, or uncertain dynamics. This paper introduces AgenticControl, a novel multi-agent framework that automates controller…

Systems and Control · Electrical Eng. & Systems 2025-06-25 Mohammad Narimani , Seyyed Ali Emami

DeepPrep: An LLM-Powered Agentic System for Autonomous Data Preparation

Data preparation, which aims to transform heterogeneous and noisy raw tables into analysis-ready data, remains a major bottleneck in data science. Recent approaches leverage large language models (LLMs) to automate data preparation from…

Databases · Computer Science 2026-02-10 Meihao Fan , Ju Fan , Yuxin Zhang , Shaolei Zhang , Xiaoyong Du , Jie Song , Peng Li , Fuxin Jiang , Tieying Zhang , Jianjun Chen

AnnotatedTables: A Large Tabular Dataset with Language Model Annotations

Tabular data is ubiquitous in real-world applications and abundant on the web, yet its annotation has traditionally required human labor, posing a significant scalability bottleneck for tabular machine learning. Our methodology can…

Machine Learning · Computer Science 2024-06-25 Yaojie Hu , Ilias Fountalis , Jin Tian , Nikolaos Vasiloglou

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Deep research systems are widely used for multi-step web research, analysis, and cross-source synthesis, yet their evaluation remains challenging. Existing benchmarks often require annotation-intensive task construction, rely on static…

Computation and Language · Computer Science 2026-01-15 Yibo Wang , Lei Wang , Yue Deng , Keming Wu , Yao Xiao , Huanjin Yao , Liwei Kang , Hai Ye , Yongcheng Jing , Lidong Bing

Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics

The rapid advancement of LLMs has led to the creation of diverse agentic systems in data analysis, utilizing LLMs' capabilities to improve insight generation and visualization. In this paper, we present an agentic system that automates the…

Artificial Intelligence · Computer Science 2025-05-30 Ran Zhang , Mohannad Elhamod

An Automatic Prompt Generation System for Tabular Data Tasks

Efficient processing of tabular data is important in various industries, especially when working with datasets containing a large number of columns. Large language models (LLMs) have demonstrated their ability on several tasks through…

Machine Learning · Computer Science 2024-08-22 Ashlesha Akella , Abhijit Manatkar , Brij Chavda , Hima Patel

Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning

Despite the remarkable capabilities of large language models (LLMs) in various reasoning tasks, they still struggle with table reasoning tasks, particularly in maintaining consistency throughout multi-step reasoning processes. While…

Artificial Intelligence · Computer Science 2025-05-26 Peiying Yu , Guoxin Chen , Jingjing Wang

TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

We introduce TableLLM, a robust large language model (LLM) with 8 billion parameters, purpose-built for proficiently handling tabular data manipulation tasks, whether they are embedded within documents or spreadsheets, catering to…

Computation and Language · Computer Science 2025-02-18 Xiaokang Zhang , Sijia Luo , Bohan Zhang , Zeyao Ma , Jing Zhang , Yang Li , Guanlin Li , Zijun Yao , Kangli Xu , Jinchang Zhou , Daniel Zhang-Li , Jifan Yu , Shu Zhao , Juanzi Li , Jie Tang

DiffPrep: Differentiable Data Preprocessing Pipeline Search for Learning over Tabular Data

Data preprocessing is a crucial step in the machine learning process that transforms raw data into a more usable format for downstream ML models. However, it can be costly and time-consuming, often requiring the expertise of domain experts.…

Databases · Computer Science 2023-08-23 Peng Li , Zhiyi Chen , Xu Chu , Kexin Rong

MultiTab: A Scalable Foundation for Multitask Learning on Tabular Data

Tabular data is the most abundant data type in the world, powering systems in finance, healthcare, e-commerce, and beyond. As tabular datasets grow and span multiple related targets, there is an increasing need to exploit shared task…

Machine Learning · Computer Science 2025-11-14 Dimitrios Sinodinos , Jack Yi Wei , Narges Armanfard

ToolSelf: Unifying Task Execution and Self-Reconfiguration via Tool-Driven Intrinsic Adaptation

Agentic systems powered by Large Language Models (LLMs) have demonstrated remarkable potential in tackling complex, long-horizon tasks. However, their efficacy is fundamentally constrained by static configurations governing agent behaviors,…

Artificial Intelligence · Computer Science 2026-02-24 Jingqi Zhou , Sheng Wang , DeZhao Deng , Junwen Lu , Junwei Su , Qintong Li , Jiahui Gao , Hao Wu , Jiyue Jiang , Lingpeng Kong , Chuan Wu

TableGPT-R1: Advancing Tabular Reasoning Through Reinforcement Learning

Tabular data serves as the backbone of modern data analysis and scientific research. While Large Language Models (LLMs) fine-tuned via Supervised Fine-Tuning (SFT) have significantly improved natural language interaction with such…

Machine Learning · Computer Science 2025-12-29 Saisai Yang , Qingyi Huang , Jing Yuan , Liangyu Zha , Kai Tang , Yuhang Yang , Ning Wang , Yucheng Wei , Liyao Li , Wentao Ye , Hao Chen , Tao Zhang , Junlin Zhou , Haobo Wang , Gang Chen , Junbo Zhao

Auto-Pipeline: Synthesizing Complex Data Pipelines By-Target Using Reinforcement Learning and Search

Recent work has made significant progress in helping users to automate single data preparation steps, such as string-transformations and table-manipulation operators (e.g., Join, GroupBy, Pivot, etc.). We in this work propose to automate…

Databases · Computer Science 2021-08-05 Junwen Yang , Yeye He , Surajit Chaudhuri

TablePilot: Recommending Human-Preferred Tabular Data Analysis with Large Language Models

Tabular data analysis is crucial in many scenarios, yet efficiently identifying the most relevant data analysis queries and results for a new table remains a significant challenge. The complexity of tabular data, diverse analytical…

Computation and Language · Computer Science 2025-04-01 Deyin Yi , Yihao Liu , Lang Cao , Mengyu Zhou , Haoyu Dong , Shi Han , Dongmei Zhang