Related papers: ProfiliTable: Profiling-Driven Tabular Data Proces…

200 papers

Table reasoning requires models to jointly perform comprehensive semantic understanding and precise numerical operations. Although recent large language model (LLM)-based methods have achieved promising results, most of them still rely on a…

Artificial Intelligence · Computer Science 2025-12-23 Chuang Jiang, Mingyue Cheng, Xiaoyu Tao, Qingyang Mao, Jie Ouyang, Qi Liu

Agentic systems are becoming more capable: agents define strategies, take actions, and interact with different environments. This autonomy poses serious challenges for overseeing and assessing agent behavior. Most current tools are limited,…

Computation and Language · Computer Science 2026-05-22 Asaf Yehudai, Lilach Eden, Michal Shmueli-Scheuer

Data science tasks involving tabular data present complex challenges that require sophisticated problem-solving approaches. We propose AutoKaggle, a powerful and user-centric framework that assists data scientists in completing daily data…

Agentic AI systems use specialized agents to handle tasks within complex workflows, enabling automation and efficiency. However, optimizing these systems often requires labor-intensive, manual adjustments to refine roles, tasks, and…

Computation and Language · Computer Science 2024-12-24 Kamer Ali Yuksel, Hassan Sawaf

Transforming unstructured text into structured data is a complex task, requiring semantic understanding, reasoning, and structural comprehension. While Large Language Models (LLMs) offer potential, they often struggle with handling…

Computation and Language · Computer Science 2025-08-13 Rajmohan C, Sarthak Harne, Arvind Agarwal

Large language models (LLMs) are increasingly deployed as agents, expected to decompose goals, invoke tools, and verify results in dynamic environments. Realizing these capabilities requires access to agentic data: structured interaction…

Artificial Intelligence · Computer Science 2025-10-22 Abhigya Verma, Seganrasan Subramanian, Nandhakumar Kandasamy, Naman Gupta

Traditional control system design, reliant on expert knowledge and precise models, struggles with complex, nonlinear, or uncertain dynamics. This paper introduces AgenticControl, a novel multi-agent framework that automates controller…

Systems and Control · Electrical Eng. & Systems 2025-06-25 Mohammad Narimani, Seyyed Ali Emami

Data preparation, which aims to transform heterogeneous and noisy raw tables into analysis-ready data, remains a major bottleneck in data science. Recent approaches leverage large language models (LLMs) to automate data preparation from…

Databases · Computer Science 2026-02-10 Meihao Fan, Ju Fan, Yuxin Zhang, Shaolei Zhang, Xiaoyong Du, Jie Song, Peng Li, Fuxin Jiang, Tieying Zhang, Jianjun Chen

Tabular data is ubiquitous in real-world applications and abundant on the web, yet its annotation has traditionally required human labor, posing a significant scalability bottleneck for tabular machine learning. Our methodology can…

Machine Learning · Computer Science 2024-06-25 Yaojie Hu, Ilias Fountalis, Jin Tian, Nikolaos Vasiloglou

Deep research systems are widely used for multi-step web research, analysis, and cross-source synthesis, yet their evaluation remains challenging. Existing benchmarks often require annotation-intensive task construction, rely on static…

Computation and Language · Computer Science 2026-01-15 Yibo Wang, Lei Wang, Yue Deng, Keming Wu, Yao Xiao, Huanjin Yao, Liwei Kang, Hai Ye, Yongcheng Jing, Lidong Bing

The rapid advancement of LLMs has led to the creation of diverse agentic systems in data analysis, utilizing LLMs' capabilities to improve insight generation and visualization. In this paper, we present an agentic system that automates the…

Artificial Intelligence · Computer Science 2025-05-30 Ran Zhang, Mohannad Elhamod

Efficient processing of tabular data is important in various industries, especially when working with datasets containing a large number of columns. Large language models (LLMs) have demonstrated their ability on several tasks through…

Machine Learning · Computer Science 2024-08-22 Ashlesha Akella, Abhijit Manatkar, Brij Chavda, Hima Patel

Despite the remarkable capabilities of large language models (LLMs) in various reasoning tasks, they still struggle with table reasoning tasks, particularly in maintaining consistency throughout multi-step reasoning processes. While…

Artificial Intelligence · Computer Science 2025-05-26 Peiying Yu, Guoxin Chen, Jingjing Wang

We introduce TableLLM, a robust large language model (LLM) with 8 billion parameters, purpose-built for proficiently handling tabular data manipulation tasks, whether they are embedded within documents or spreadsheets, catering to…

Computation and Language · Computer Science 2025-02-18 Xiaokang Zhang, Sijia Luo, Bohan Zhang, Zeyao Ma, Jing Zhang, Yang Li, Guanlin Li, Zijun Yao, Kangli Xu, Jinchang Zhou, Daniel Zhang-Li, Jifan Yu, Shu Zhao, Juanzi Li, Jie Tang

Data preprocessing is a crucial step in the machine learning process that transforms raw data into a more usable format for downstream ML models. However, it can be costly and time-consuming, often requiring the expertise of domain experts.…

Databases · Computer Science 2023-08-23 Peng Li, Zhiyi Chen, Xu Chu, Kexin Rong

Tabular data is the most abundant data type in the world, powering systems in finance, healthcare, e-commerce, and beyond. As tabular datasets grow and span multiple related targets, there is an increasing need to exploit shared task…

Machine Learning · Computer Science 2025-11-14 Dimitrios Sinodinos, Jack Yi Wei, Narges Armanfard

Agentic systems powered by Large Language Models (LLMs) have demonstrated remarkable potential in tackling complex, long-horizon tasks. However, their efficacy is fundamentally constrained by static configurations governing agent behaviors,…

Artificial Intelligence · Computer Science 2026-02-24 Jingqi Zhou, Sheng Wang, DeZhao Deng, Junwen Lu, Junwei Su, Qintong Li, Jiahui Gao, Hao Wu, Jiyue Jiang, Lingpeng Kong, Chuan Wu

Tabular data serves as the backbone of modern data analysis and scientific research. While Large Language Models (LLMs) fine-tuned via Supervised Fine-Tuning (SFT) have significantly improved natural language interaction with such…

Recent work has made significant progress in helping users automate single data preparation steps, such as string transformations and table-manipulation operators (e.g., Join, GroupBy, Pivot). In this work, we propose to automate…

Databases · Computer Science 2021-08-05 Junwen Yang, Yeye He, Surajit Chaudhuri

Tabular data analysis is crucial in many scenarios, yet efficiently identifying the most relevant data analysis queries and results for a new table remains a significant challenge. The complexity of tabular data, diverse analytical…

Computation and Language · Computer Science 2025-04-01 Deyin Yi, Yihao Liu, Lang Cao, Mengyu Zhou, Haoyu Dong, Shi Han, Dongmei Zhang