Related papers: PExA: Parallel Exploration Agent for Complex Text-…

Multilingual Text-to-SQL: Benchmarking the Limits of Language Models with Collaborative Language Agents

Text-to-SQL enables natural access to databases, yet most benchmarks are English-only, limiting multilingual progress. We introduce MultiSpider 2.0, extending Spider 2.0 to eight languages (English, German, French, Spanish, Portuguese,…

Computation and Language · Computer Science 2025-09-30 Khanh Trinh Pham , Thu Huong Nguyen , Jun Jo , Quoc Viet Hung Nguyen , Thanh Tam Nguyen

PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistency

Recent advancements in Text-to-SQL (Text2SQL) emphasize stimulating the large language models (LLM) on in-context learning, achieving significant results. Nevertheless, they face challenges when dealing with verbose database information and…

Computation and Language · Computer Science 2024-06-04 Zhishuai Li , Xiang Wang , Jingjing Zhao , Sun Yang , Guoqing Du , Xiaoru Hu , Bin Zhang , Yuxiao Ye , Ziyue Li , Rui Zhao , Hangyu Mao

APEX-SQL: Talking to the data via Agentic Exploration for Text-to-SQL

Text-to-SQL systems powered by Large Language Models have excelled on academic benchmarks but struggle in complex enterprise environments. The primary limitation lies in their reliance on static schema representations, which fails to…

Databases · Computer Science 2026-02-20 Bowen Cao , Weibin Liao , Yushi Sun , Dong Fang , Haitao Li , Wai Lam

Solid-SQL: Enhanced Schema-linking based In-context Learning for Robust Text-to-SQL

Recently, large language models (LLMs) have significantly improved the performance of text-to-SQL systems. Nevertheless, many state-of-the-art (SOTA) approaches have overlooked the critical aspect of system robustness. Our experiments…

Computation and Language · Computer Science 2024-12-18 Geling Liu , Yunzhi Tan , Ruichao Zhong , Yuanzhen Xie , Lingchen Zhao , Qian Wang , Bo Hu , Zang Li

Tool-Assisted Agent on SQL Inspection and Refinement in Real-World Scenarios

Recent Text-to-SQL methods leverage large language models (LLMs) by incorporating feedback from the database management system. While these methods effectively address execution errors in SQL queries, they struggle with database mismatches…

Computation and Language · Computer Science 2024-09-02 Zhongyuan Wang , Richong Zhang , Zhijie Nie , Jaein Kim

MARS-SQL: A multi-agent reinforcement learning framework for Text-to-SQL

Large Language Models (LLMs) often struggle with the precise logic and schema alignment required for complex Text-to-SQL tasks. While current methods rely heavily on static prompting, they lack the ability to dynamically adapt and…

Computation and Language · Computer Science 2026-05-12 Haolin Yang , Jipeng Zhang , Zhitao He , Alexander Zhou , Yi R. Fung

FLEX: Expert-level False-Less EXecution Metric for Reliable Text-to-SQL Benchmark

Text-to-SQL systems have become crucial for translating natural language into SQL queries in various industries, enabling non-technical users to perform complex data operations. The need for accurate evaluation methods has increased as…

Computation and Language · Computer Science 2024-10-29 Heegyu Kim , Taeyang Jeon , Seunghwan Choi , Seungtaek Choi , Hyunsouk Cho

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Real-world enterprise text-to-SQL workflows often involve complex cloud or local data across various database systems, multiple SQL queries in various dialects, and diverse operations from data transformation to analytics. We introduce…

Computation and Language · Computer Science 2025-03-18 Fangyu Lei , Jixuan Chen , Yuxiao Ye , Ruisheng Cao , Dongchan Shin , Hongjin Su , Zhaoqing Suo , Hongcheng Gao , Wenjing Hu , Pengcheng Yin , Victor Zhong , Caiming Xiong , Ruoxi Sun , Qian Liu , Sida Wang , Tao Yu

AgentSM: Semantic Memory for Agentic Text-to-SQL

Recent advances in LLM-based Text-to-SQL have achieved remarkable gains on public benchmarks such as BIRD and Spider. Yet, these systems struggle to scale in realistic enterprise settings with large, complex schemas, diverse SQL dialects,…

Artificial Intelligence · Computer Science 2026-01-23 Asim Biswal , Chuan Lei , Xiao Qin , Aodong Li , Balakrishnan Narayanaswamy , Tim Kraska

Leveraging Prior Experience: An Expandable Auxiliary Knowledge Base for Text-to-SQL

Large Language Models (LLMs) exhibit impressive problem-solving skills across many tasks, but they still underperform compared to humans in various downstream applications, such as text-to-SQL. On the BIRD benchmark leaderboard, human…

Computation and Language · Computer Science 2024-11-21 Zhibo Chu , Zichong Wang , Qitao Qin

Enhancing LLM Fine-tuning for Text-to-SQLs by SQL Quality Measurement

Text-to-SQLs enables non-expert users to effortlessly retrieve desired information from relational databases using natural language queries. While recent advancements, particularly with Large Language Models (LLMs) like GPT and T5, have…

Databases · Computer Science 2024-10-04 Shouvon Sarker , Xishuang Dong , Xiangfang Li , Lijun Qian

Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation

Large language models (LLMs) have emerged as a new paradigm for Text-to-SQL task. However, the absence of a systematical benchmark inhibits the development of designing effective, efficient and economic LLM-based Text-to-SQL solutions. To…

Databases · Computer Science 2023-11-21 Dawei Gao , Haibin Wang , Yaliang Li , Xiuyu Sun , Yichen Qian , Bolin Ding , Jingren Zhou

MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL

Recent LLM-based Text-to-SQL methods usually suffer from significant performance degradation on "huge" databases and complex user questions that require multi-step reasoning. Moreover, most existing methods neglect the crucial significance…

Computation and Language · Computer Science 2025-03-19 Bing Wang , Changyu Ren , Jian Yang , Xinnian Liang , Jiaqi Bai , LinZheng Chai , Zhao Yan , Qian-Wen Zhang , Di Yin , Xing Sun , Zhoujun Li

ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Consensus Enforcement, and Column Exploration

We present ReFoRCE, a Text-to-SQL agent that tops the Spider 2.0 leaderboard--a challenging benchmark reflecting complex, real-world Text-to-SQL scenarios. While Text-to-SQL systems enable natural language queries over structured databases,…

Computation and Language · Computer Science 2025-06-05 Minghang Deng , Ashwin Ramachandran , Canwen Xu , Lanxiang Hu , Zhewei Yao , Anupam Datta , Hao Zhang

End-to-End Text-to-SQL with Dataset Selection: Leveraging LLMs for Adaptive Query Generation

Text-to-SQL bridges the gap between natural language and structured database language, thus allowing non-technical users to easily query databases. Traditional approaches model text-to-SQL as a direct translation task, where a given Natural…

Machine Learning · Computer Science 2025-08-12 Anurag Tripathi , Vaibhav Patle , Abhinav Jain , Ayush Pundir , Sairam Menon , Ajeet Kumar Singh , Dorien Herremans

Both Ends Count! Just How Good are LLM Agents at "Text-to-Big SQL"?

Text-to-SQL and Big Data are both extensively benchmarked fields, yet there is limited research that evaluates them jointly. In the real world, Text-to-SQL systems are often embedded with Big Data workflows, such as large-scale data…

Databases · Computer Science 2026-04-30 Germán T. Eizaguirre , Lars Tissen , Marc Sánchez-Artigas

X-SQL: Expert Schema Linking and Understanding of Text-to-SQL with Multi-LLMs

With Large Language Models' (LLMs) emergent abilities on code generation tasks, Text-to-SQL has become one of the most popular downstream applications. Despite the strong results of multiple recent LLM-based Text-to-SQL frameworks, the…

Machine Learning · Computer Science 2025-09-09 Dazhi Peng

Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness

Neural text-to-SQL models have achieved remarkable performance in translating natural language questions into SQL queries. However, recent studies reveal that text-to-SQL models are vulnerable to task-specific perturbations. Previous…

Computation and Language · Computer Science 2023-01-31 Shuaichen Chang , Jun Wang , Mingwen Dong , Lin Pan , Henghui Zhu , Alexander Hanbo Li , Wuwei Lan , Sheng Zhang , Jiarong Jiang , Joseph Lilien , Steve Ash , William Yang Wang , Zhiguo Wang , Vittorio Castelli , Patrick Ng , Bing Xiang

Enhancing Text-to-SQL Translation for Financial System Design

Text-to-SQL, the task of translating natural language questions into SQL queries, is part of various business processes. Its automation, which is an emerging challenge, will empower software practitioners to seamlessly interact with…

Software Engineering · Computer Science 2024-01-10 Yewei Song , Saad Ezzini , Xunzhu Tang , Cedric Lothritz , Jacques Klein , Tegawendé Bissyandé , Andrey Boytsov , Ulrick Ble , Anne Goujon

FlexSQL: Flexible Exploration and Execution Make Better Text-to-SQL Agents

Text-to-SQL over large analytical databases requires navigating complex schemas, resolving ambiguous queries, and grounding decisions in actual data. Most current systems follow a fixed pipeline where schema elements are retrieved once…

Computation and Language · Computer Science 2026-05-05 Quang Hieu Pham , Yang He , Ping Nie , Canwen Xu , Davood Rafiei , Yuepeng Wang , Xi Ye , Jocelyn Qiaochu Chen