English
Related papers

Related papers: EGREFINE: An Execution-Grounded Optimization Frame…

200 papers

Text-to-SQL is a technology that converts natural language queries into the structured query language SQL. A novel research approach that has recently gained attention focuses on methods based on the complexity of SQL queries, achieving…

Computation and Language · Computer Science 2024-06-14 Jiawen Yi , Guo Chen , Zixiang Shen

Despite the remarkable performance of large language models (LLMs) in text-to-SQL (SQL generation), correctly producing SQL queries remains challenging during initial generation. The SQL refinement task is subsequently introduced to correct…

Computation and Language · Computer Science 2026-03-05 Zijin Hong , Hao Chen , Zheng Yuan , Qinggang Zhang , Luyao Zhuang , Qing Liao , Feiran Huang , Yangqiu Song , Xiao Huang

We present ReFoRCE, a Text-to-SQL agent that tops the Spider 2.0 leaderboard--a challenging benchmark reflecting complex, real-world Text-to-SQL scenarios. While Text-to-SQL systems enable natural language queries over structured databases,…

Computation and Language · Computer Science 2025-06-05 Minghang Deng , Ashwin Ramachandran , Canwen Xu , Lanxiang Hu , Zhewei Yao , Anupam Datta , Hao Zhang

Translating Natural Language Queries into Structured Query Language (Text-to-SQL or NLQ-to-SQL) is a critical task extensively studied by both the natural language processing and database communities, aimed at providing a natural language…

Computation and Language · Computer Science 2025-01-29 Hasan Alp Caferoğlu , Özgür Ulusoy

The task of converting natural language queries into SQL queries is intricate, necessitating a blend of precise techniques for an accurate translation. The DIN-SQL (Decomposed-In-Context SQL) methodology represents a significant development…

Databases · Computer Science 2024-03-05 Shai Volvovsky , Marco Marcassa , Mustafa Panbiharwala

In the context of the Text-to-SQL task, table and column descriptions are crucial for bridging the gap between natural language and database schema. This report proposes a method for automatically generating effective database descriptions…

Artificial Intelligence · Computer Science 2025-03-03 Yingqi Gao , Zhiling Luo

Large language models (LLMs) have advanced Text-to-SQL, yet existing solutions still fall short of system-level reliability. The limitation is not merely in individual modules -- e.g., schema linking, reasoning, and verification -- but more…

Databases · Computer Science 2026-04-06 Boyan Li , Chong Chen , Zhujun Xue , Yinan Mei , Yuyu Luo

Robust text-to-SQL over complex, real-world databases remains brittle even with modern LLMs: iterative refinement often introduces syntactic and semantic drift, corrections tend to be non-transferable across queries, and naive use of large…

Databases · Computer Science 2026-01-13 Isabelle Mohr , Joao Gandarela , John Dujany , Andre Freitas

Enterprise data pipelines, characterized by complex transformations across multiple programming languages, often cause a semantic disconnect between original metadata and downstream data. This "semantic drift" compromises data…

Computation and Language · Computer Science 2025-08-12 Jiaqi Yin , Yi-Wei Chen , Meng-Lung Lee , Xiya Liu

SQL query rewriting aims to reformulate a query into a more efficient form while preserving equivalence. Most existing methods rely on predefined rewrite rules. However, such rule-based approaches face fundamental limitations: (1) fixed…

Databases · Computer Science 2025-08-18 Dongjie Xu , Yue Cui , Weijie Shi , Qingzhi Ma , Hanghui Guo , Jiaming Li , Yao Zhao , Ruiyuan Zhang , Shimin Di , Jia Zhu , Kai Zheng , Jiajie Xu

Schema linking -- the process of aligning natural language questions with database schema elements -- is a critical yet underexplored component of Text-to-SQL systems. While recent methods have focused primarily on improving SQL generation,…

Computation and Language · Computer Science 2026-01-28 Md Mahadi Hasan Nahid , Davood Rafiei , Weiwei Zhang , Yong Zhang

Text-to-SQL parsers typically struggle with databases unseen during the train time. Adapting parsers to new databases is a challenging problem due to the lack of natural language queries in the new schemas. We present ReFill, a framework…

Computation and Language · Computer Science 2022-11-01 Abhijeet Awasthi , Ashutosh Sathe , Sunita Sarawagi

Schema linking is a crucial step in Text-to-SQL pipelines. Its goal is to retrieve the relevant tables and columns of a target database for a user's query while disregarding irrelevant ones. However, imperfect schema linking can often…

Computation and Language · Computer Science 2024-08-20 Karime Maamari , Fadhil Abubaker , Daniel Jaroslawicz , Amine Mhedhbi

The generalizability to new databases is of vital importance to Text-to-SQL systems which aim to parse human utterances into SQL statements. Existing works achieve this goal by leveraging the exact matching method to identify the lexical…

Computation and Language · Computer Science 2022-08-09 Aiwei Liu , Xuming Hu , Li Lin , Lijie Wen

To access data stored in relational databases, users need to understand the database schema and write a query using a query language such as SQL. To simplify this task, text-to-SQL models attempt to translate a user's natural language…

Computation and Language · Computer Science 2020-11-05 Amol Kelkar , Rohan Relan , Vaishali Bhardwaj , Saurabh Vaichal , Chandra Khatri , Peter Relan

Text revision refers to a family of natural language generation tasks, where the source and target sequences share moderate resemblance in surface form but differentiate in attributes, such as text formality and simplicity. Current…

Computation and Language · Computer Science 2022-04-18 Jingjing Li , Zichao Li , Tao Ge , Irwin King , Michael R. Lyu

Real-world RAG applications often encounter long-context input scenarios, where redundant information and noise results in higher inference costs and reduced performance. To address these challenges, we propose LongRefiner, an efficient…

Computation and Language · Computer Science 2025-05-16 Jiajie Jin , Xiaoxi Li , Guanting Dong , Yuyao Zhang , Yutao Zhu , Yongkang Wu , Zhonghua Li , Qi Ye , Zhicheng Dou

Retrieval augmented generation (RAG) pipelines are commonly used in tasks such as question-answering (QA), relying on retrieving relevant documents from a vector store computed using a pretrained embedding model. However, if the retrieved…

Computation and Language · Computer Science 2024-10-18 Ambuje Gupta , Mrinal Rawat , Andreas Stolcke , Roberto Pieraccini

Text-to-SQL converts natural language questions into executable SQL queries, enabling non-technical users to access relational databases for analytics and intelligent data services. In real-world scenarios, performance is often constrained…

Computation and Language · Computer Science 2026-05-25 Tianhao Qiu , Xiaojun Chen

Schema linking is a difficult and important step in large-scale Text-to-SQL, where systems must identify a compact yet sufficient schema context from large and ambiguous databases. Existing methods often treat schema linking as…

Computation and Language · Computer Science 2026-05-29 Huawei Zheng , Sen Yang , Zhaorui Yang , Yuhui Zhang , Haozhe Feng , Haoxuan Li , Xuan Yi , Chao Hu , Defeng Xie , Chen Hou , Danqing Huang , Wei Chen , Yingcai Wu , Peng Chen , Dazhen Deng
‹ Prev 1 2 3 10 Next ›