Related papers: SQL-Factory: A Multi-Agent Framework for High-Qual…

Cooperative SQL Generation for Segmented Databases By Using Multi-functional LLM Agents

Text-to-SQL task aims to automatically yield SQL queries according to user text questions. To address this problem, we propose a Cooperative SQL Generation framework based on Multi-functional Agents (CSMA) through information interaction…

Computation and Language · Computer Science 2024-12-10 Zhiguang Wu , Fengbin Zhu , Xuequn Shang , Yupei Zhang , Pan Zhou

DataFactory: Collaborative Multi-Agent Framework for Advanced Table Question Answering

Table Question Answering (TableQA) enables natural language interaction with structured tabular data. However, existing large language model (LLM) approaches face critical limitations: context length constraints that restrict data handling…

Artificial Intelligence · Computer Science 2026-03-11 Tong Wang , Chi Jin , Yongkang Chen , Huan Deng , Xiaohui Kuang , Gang Zhao

SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent Collaboration

While fine-tuned large language models (LLMs) excel in generating grammatically valid SQL in Text-to-SQL parsing, they often struggle to ensure semantic accuracy in queries, leading to user confusion and diminished system usability. To…

Computation and Language · Computer Science 2025-05-20 Jipeng Cen , Jiaxin Liu , Zhixu Li , Jingjing Wang

Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL

The data-centric paradigm has emerged as a pivotal direction in artificial intelligence (AI), emphasizing the role of high-quality training data. This shift is especially critical in the Text-to-SQL task, where the scarcity, limited…

Computation and Language · Computer Science 2026-02-11 Qifeng Cai , Hao Liang , Chang Xu , Tao Xie , Wentao Zhang , Bin Cui

Semantic Parsing with Syntax- and Table-Aware SQL Generation

We present a generative model to map natural language questions into SQL queries. Existing neural network based approaches typically generate a SQL query word-by-word, however, a large portion of the generated results are incorrect or not…

Computation and Language · Computer Science 2018-04-24 Yibo Sun , Duyu Tang , Nan Duan , Jianshu Ji , Guihong Cao , Xiaocheng Feng , Bing Qin , Ting Liu , Ming Zhou

XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL

To leverage the advantages of LLM in addressing challenges in the Text-to-SQL task, we present XiYan-SQL, an innovative framework effectively generating and utilizing multiple SQL candidates. It consists of three components: 1) a Schema…

Computation and Language · Computer Science 2026-04-07 Yifu Liu , Yin Zhu , Yingqi Gao , Zhiling Luo , Xiaoxia Li , Xiaorong Shi , Yuntao Hong , Jinyang Gao , Yu Li , Bolin Ding , Jingren Zhou

CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL

In tackling the challenges of large language model (LLM) performance for Text-to-SQL tasks, we introduce CHASE-SQL, a new framework that employs innovative strategies, using test-time compute in multi-agent modeling to improve candidate…

Machine Learning · Computer Science 2024-10-04 Mohammadreza Pourreza , Hailong Li , Ruoxi Sun , Yeounoh Chung , Shayan Talaei , Gaurav Tarlok Kakkar , Yu Gan , Amin Saberi , Fatma Ozcan , Sercan O. Arik

Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems

Multi-Agent Systems (MAS) built on large language models typically solve complex tasks by coordinating multiple agents through workflows. Existing approaches generates workflows either at task level or query level, but their relative costs…

Artificial Intelligence · Computer Science 2026-01-19 Zixu Wang , Bingbing Xu , Yige Yuan , Huawei Shen , Xueqi Cheng

MAG-SQL: Multi-Agent Generative Approach with Soft Schema Linking and Iterative Sub-SQL Refinement for Text-to-SQL

Recent In-Context Learning based methods have achieved remarkable success in Text-to-SQL task. However, there is still a large gap between the performance of these models and human performance on datasets with complex database schema and…

Computation and Language · Computer Science 2024-11-08 Wenxuan Xie , Gaochen Wu , Bowen Zhou

Structure Guided Large Language Model for SQL Generation

Recent advancements in large language models (LLMs) have shown promise in bridging the gap between natural language queries and database management systems, enabling users to interact with databases without the background of SQL. However,…

Databases · Computer Science 2025-07-11 Qinggang Zhang , Hao Chen , Junnan Dong , Shengyuan Chen , Feiran Huang , Xiao Huang

Data Augmentation with Hierarchical SQL-to-Question Generation for Cross-domain Text-to-SQL Parsing

Data augmentation has attracted a lot of research attention in the deep learning era for its ability in alleviating data sparseness. The lack of labeled data for unseen evaluation databases is exactly the major challenge for cross-domain…

Computation and Language · Computer Science 2022-11-16 Kun Wu , Lijie Wang , Zhenghua Li , Ao Zhang , Xinyan Xiao , Hua Wu , Min Zhang , Haifeng Wang

Chain-of-Query: Unleashing the Power of LLMs in SQL-Aided Table Understanding via Multi-Agent Collaboration

Table understanding requires structured, multi-step reasoning. Large Language Models (LLMs) struggle with it due to the structural complexity of tabular data. Recently, multi-agent frameworks for SQL generation have shown promise in…

Computation and Language · Computer Science 2025-12-02 Songyuan Sui , Hongyi Liu , Serena Liu , Li Li , Soo-Hyun Choi , Rui Chen , Xia Hu

SQLAgent: Learning to Explore Before Generating as a Data Engineer

Large Language Models have recently shown impressive capabilities in reasoning and code generation, making them promising tools for natural language interfaces to relational databases. However, existing approaches often fail to generalize…

Databases · Computer Science 2026-02-03 Wenjia Jiang , Yiwei Wang , Boyan Han , Joey Tianyi Zhou , Chi Zhang

Multi-Agent Collaborative Framework For Math Problem Generation

Automatic question generation (AQG) for mathematics education remains an elusive goal for Intelligent Tutoring Systems and educators. While pre-trained transformer-based language models have significantly advanced natural language…

Multiagent Systems · Computer Science 2025-11-07 Kia Karbasi , Kevin Hong , Mohammad Amin Samadi , Gregory Pottie

MSc-SQL: Multi-Sample Critiquing Small Language Models For Text-To-SQL Translation

Text-to-SQL generation enables non-experts to interact with databases via natural language. Recent advances rely on large closed-source models like GPT-4 that present challenges in accessibility, privacy, and latency. To address these…

Computation and Language · Computer Science 2025-02-18 Satya Krishna Gorti , Ilan Gofman , Zhaoyan Liu , Jiapeng Wu , Noël Vouitsis , Guangwei Yu , Jesse C. Cresswell , Rasa Hosseinzadeh

A Multi-agent Text2SQL Framework using Small Language Models and Execution Feedback

Text2SQL, the task of generating SQL queries from natural language text, is a critical challenge in data engineering. Recently, Large Language Models (LLMs) have demonstrated superior performance for this task due to their advanced…

Databases · Computer Science 2025-12-23 Thanh Dat Hoang , Thanh Trung Huynh , Matthias Weidlich , Thanh Tam Nguyen , Tong Chen , Hongzhi Yin , Quoc Viet Hung Nguyen

Interactive Text-to-SQL Generation via Editable Step-by-Step Explanations

Relational databases play an important role in business, science, and more. However, many users cannot fully unleash the analytical power of relational databases, because they are not familiar with database languages such as SQL. Many…

Databases · Computer Science 2024-01-08 Yuan Tian , Zheng Zhang , Zheng Ning , Toby Jia-Jun Li , Jonathan K. Kummerfeld , Tianyi Zhang

MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL

Recent LLM-based Text-to-SQL methods usually suffer from significant performance degradation on "huge" databases and complex user questions that require multi-step reasoning. Moreover, most existing methods neglect the crucial significance…

Computation and Language · Computer Science 2025-03-19 Bing Wang , Changyu Ren , Jian Yang , Xinnian Liang , Jiaqi Bai , LinZheng Chai , Zhao Yan , Qian-Wen Zhang , Di Yin , Xing Sun , Zhoujun Li

CHESS: Contextual Harnessing for Efficient SQL Synthesis

Translating natural language questions into SQL queries, known as text-to-SQL, is a long-standing research problem. Effective text-to-SQL synthesis can become very challenging due to (i) the extensive size of database catalogs (descriptions…

Machine Learning · Computer Science 2024-11-27 Shayan Talaei , Mohammadreza Pourreza , Yu-Chen Chang , Azalia Mirhoseini , Amin Saberi

Facilitating SQL Query Composition and Analysis

Formulating efficient SQL queries requires several cycles of tuning and execution, particularly for inexperienced users. We examine methods that can accelerate and improve this interaction by providing insights about SQL queries prior to…

Databases · Computer Science 2020-02-24 Zainab Zolaktaf , Mostafa Milani , Rachel Pottinger