Related papers

Related papers: CodeS: Towards Building Open-source Language Model…

200 papers

Recently, large language models (LLMs) have significantly improved the performance of text-to-SQL systems. Nevertheless, many state-of-the-art (SOTA) approaches have overlooked the critical aspect of system robustness. Our experiments…

Computation and Language · Computer Science 2024-12-18 Geling Liu, Yunzhi Tan, Ruichao Zhong, Yuanzhen Xie, Lingchen Zhao, Qian Wang, Bo Hu, Zang Li

In addressing the pivotal role of translating natural language queries into SQL commands, we propose a suite of compact, fine-tuned models and self-refine mechanisms to democratize data access and analysis for non-expert users, mitigating…

Artificial Intelligence · Computer Science 2024-09-25 Lixia Wu, Peng Li, Junhong Lou, Lei Fu

Text-to-SQL enables non-expert users to effortlessly retrieve desired information from relational databases using natural language queries. While recent advancements, particularly with Large Language Models (LLMs) like GPT and T5, have…

Databases · Computer Science 2024-10-04 Shouvon Sarker, Xishuang Dong, Xiangfang Li, Lijun Qian

Text-to-SQL parsing, which aims at converting natural language instructions into executable SQLs, has gained increasing attention in recent years. In particular, Codex and ChatGPT have shown impressive results in this task. However, most of…

We perform an empirical evaluation of Text-to-SQL capabilities of the Codex language model. We find that, without any finetuning, Codex is a strong baseline on the Spider benchmark; we also analyze the failure modes of Codex in this…

Computation and Language · Computer Science 2022-04-04 Nitarshan Rajkumar, Raymond Li, Dzmitry Bahdanau

Large language models (LLMs) have emerged as a new paradigm for the Text-to-SQL task. However, the absence of a systematic benchmark inhibits the development of effective, efficient and economic LLM-based Text-to-SQL solutions. To…

Databases · Computer Science 2023-11-21 Dawei Gao, Haibin Wang, Yaliang Li, Xiuyu Sun, Yichen Qian, Bolin Ding, Jingren Zhou

Despite the success of large language models (LLMs) in Text-to-SQL tasks, open-source LLMs encounter challenges in contextual understanding and response coherence. To tackle these issues, we present \ours, a systematic methodology tailored…

Computation and Language · Computer Science 2024-05-14 Xiaojun Chen, Tianle Wang, Tianhao Qiu, Jianbin Qin, Min Yang

The conversion of natural language into SQL language for querying databases (Text-to-SQL) has broad application prospects and has attracted widespread attention. At present, the mainstream Text-to-SQL methods are mainly divided into…

Computation and Language · Computer Science 2025-02-18 Lei Sheng, Shuai-Shuai Xu, Wei Xie

Real-world enterprise text-to-SQL workflows often involve complex cloud or local data across various database systems, multiple SQL queries in various dialects, and diverse operations from data transformation to analytics. We introduce…

Despite recent progress in text-to-SQL parsing, current semantic parsers are still not accurate enough for practical use. In this paper, we investigate how to build automatic text-to-SQL error correction models. Noticing that token-level…

Computation and Language · Computer Science 2023-05-30 Ziru Chen, Shijie Chen, Michael White, Raymond Mooney, Ali Payani, Jayanth Srinivasa, Yu Su, Huan Sun

The previous state-of-the-art (SOTA) method achieved a remarkable execution accuracy on the Spider dataset, which is one of the largest and most diverse datasets in the Text-to-SQL domain. However, during our reproduction of the business…

Artificial Intelligence · Computer Science 2023-11-01 Guanghu Sui, Zhishuai Li, Ziyue Li, Sun Yang, Jingqing Ruan, Hangyu Mao, Rui Zhao

Text-to-SQL systems facilitate smooth interaction with databases by translating natural language queries into Structured Query Language (SQL), bridging the gap between non-technical users and complex database management systems. This survey…

Artificial Intelligence · Computer Science 2025-01-24 Aditi Singh, Akash Shetty, Abul Ehtesham, Saket Kumar, Tala Talaei Khoei

This study investigates various approaches to using Large Language Models (LLMs) for Text-to-SQL program synthesis, focusing on the outcomes and insights derived. Employing the popular Text-to-SQL dataset, Spider, the goal was to input a…

Artificial Intelligence · Computer Science 2024-01-24 Richard Roberson, Gowtham Kaki, Ashutosh Trivedi

Neural text-to-SQL models have achieved remarkable performance in translating natural language questions into SQL queries. However, recent studies reveal that text-to-SQL models are vulnerable to task-specific perturbations. Previous…

A practical text-to-SQL system should generalize well on a wide variety of natural language questions, unseen database schemas, and novel SQL query structures. To comprehensively evaluate text-to-SQL systems, we introduce a UNIfied…

There is currently a significant gap between the performance of fine-tuned models and prompting approaches using Large Language Models (LLMs) on the challenging task of text-to-SQL, as evaluated on datasets such as Spider. To improve the…

Computation and Language · Computer Science 2023-11-06 Mohammadreza Pourreza, Davood Rafiei

Text-to-SQL generation enables non-experts to interact with databases via natural language. Recent advances rely on large closed-source models like GPT-4 that present challenges in accessibility, privacy, and latency. To address these…

Computation and Language · Computer Science 2025-02-18 Satya Krishna Gorti, Ilan Gofman, Zhaoyan Liu, Jiapeng Wu, Noël Vouitsis, Guangwei Yu, Jesse C. Cresswell, Rasa Hosseinzadeh

Text-to-SQL generation aims to translate natural language questions into SQL statements. In Text-to-SQL based on large language models, schema linking is a widely adopted strategy to streamline the input for LLMs by selecting only relevant…

Computation and Language · Computer Science 2024-11-27 Zhenbiao Cao, Yuanlei Zheng, Zhihao Fan, Xiaojin Zhang, Wei Chen, Xiang Bai

Text-to-SQL enables natural access to databases, yet most benchmarks are English-only, limiting multilingual progress. We introduce MultiSpider 2.0, extending Spider 2.0 to eight languages (English, German, French, Spanish, Portuguese,…

Computation and Language · Computer Science 2025-09-30 Khanh Trinh Pham, Thu Huong Nguyen, Jun Jo, Quoc Viet Hung Nguyen, Thanh Tam Nguyen

Large language models (LLMs) have demonstrated strong performance in translating natural language questions into SQL queries (Text-to-SQL). In contrast, small language models (SLMs) ranging from 0.5B to 1.5B parameters currently…

Computation and Language · Computer Science 2025-07-31 Lei Sheng, Shuai-Shuai Xu