Related papers: Knapsack Optimization-based Schema Linking for LLM…

200 papers

Text-to-SQL generation aims to translate natural language questions into SQL statements. In Text-to-SQL based on large language models, schema linking is a widely adopted strategy to streamline the input for LLMs by selecting only relevant…

Computation and Language · Computer Science 2024-11-27 Zhenbiao Cao, Yuanlei Zheng, Zhihao Fan, Xiaojin Zhang, Wei Chen, Xiang Bai

Sophisticated existing Text-to-SQL methods exhibit errors in various proportions, including schema-linking errors (incorrect columns, tables, or extra columns), join errors, nested errors, and group-by errors. Consequently, there is a…

Databases · Computer Science 2024-05-17 Sun Yang, Qiong Su, Zhishuai Li, Ziyue Li, Hangyu Mao, Chenxi Liu, Rui Zhao

This work, for the first time, introduces two constant factor approximation algorithms with linear query complexity for non-monotone submodular maximization over a ground set of size $n$ subject to a knapsack constraint, $\mathsf{DLA}$ and…

Data Structures and Algorithms · Computer Science 2023-07-11 Canh V. Pham, Tan D. Tran, Dung T. K. Ha, My T. Thai

Schema linking -- the process of aligning natural language questions with database schema elements -- is a critical yet underexplored component of Text-to-SQL systems. While recent methods have focused primarily on improving SQL generation,…

Computation and Language · Computer Science 2026-01-28 Md Mahadi Hasan Nahid, Davood Rafiei, Weiwei Zhang, Yong Zhang

Schema linking is a crucial step in Text-to-SQL pipelines. Its goal is to retrieve the relevant tables and columns of a target database for a user's query while disregarding irrelevant ones. However, imperfect schema linking can often…

Computation and Language · Computer Science 2024-08-20 Karime Maamari, Fadhil Abubaker, Daniel Jaroslawicz, Amine Mhedhbi

Recently, large language models (LLMs) have significantly improved the performance of text-to-SQL systems. Nevertheless, many state-of-the-art (SOTA) approaches have overlooked the critical aspect of system robustness. Our experiments…

Computation and Language · Computer Science 2024-12-18 Geling Liu, Yunzhi Tan, Ruichao Zhong, Yuanzhen Xie, Lingchen Zhao, Qian Wang, Bo Hu, Zang Li

Schema linking is a critical bottleneck in applying existing Text-to-SQL models to real-world, large-scale, multi-database environments. Through error analysis, we identify two major challenges in schema linking: (1) Database Retrieval:…

Computation and Language · Computer Science 2025-09-09 Yihan Wang, Peiyu Liu, Xin Yang

It is challenging to convert natural language (NL) questions into executable structured query language (SQL) queries for text-to-SQL tasks due to the vast number of database schemas with redundancy, which interferes with semantic learning,…

Databases · Computer Science 2025-02-11 Zhuopan Yang, Yuanzhen Xie, Ruichao Zhong, Yunzhi Tan, Enjie Liu, Zhenguo Yang, Mochi Gao, Bo Hu, Zang Li

Text-to-SQL, which translates a natural language question into an SQL query, has advanced with in-context learning of Large Language Models (LLMs). However, existing methods show little improvement in performance compared to randomly chosen…

Artificial Intelligence · Computer Science 2025-07-23 Jihyung Lee, Jin-Seop Lee, Jaehoon Lee, YunSeok Choi, Jee-Hyong Lee

Schema linking is a critical step in the Text-to-SQL task, aiming to accurately predict the table names and column names required for the SQL query based on the given question. However, current fine-tuning approaches for schema linking models…

Artificial Intelligence · Computer Science 2025-06-16 Wuzhenghong Wen, Su Pan, Yuwei Sun

Despite advances in large language model (LLM)-based natural language interfaces for databases, scaling to enterprise-level data catalogs remains an under-explored challenge. Prior works addressing this challenge rely on domain-specific…

Computation and Language · Computer Science 2025-08-01 Jeffrey Eben, Aitzaz Ahmad, Stephen Lau

The problem of non-monotone $k$-submodular maximization under a knapsack constraint ($\mathsf{kSMK}$) over a ground set of size $n$ has been raised in many applications in machine learning, such as data summarization, information propagation, etc.…

Data Structures and Algorithms · Computer Science 2023-09-22 Dung T. K. Ha, Canh V. Pham, Tan D. Tran, Huan X. Hoang

With Large Language Models' (LLMs) emergent abilities on code generation tasks, Text-to-SQL has become one of the most popular downstream applications. Despite the strong results of multiple recent LLM-based Text-to-SQL frameworks, the…

Machine Learning · Computer Science 2025-09-09 Dazhi Peng

Text-to-SQL prompt strategies based on Large Language Models (LLMs) achieve remarkable performance on well-known benchmarks. However, when applied to real-world databases, their performance is significantly lower than on these benchmarks,…

Text-to-SQL systems translate natural language questions into executable SQL queries, and recent progress with large language models (LLMs) has driven substantial improvements in this task. Schema linking remains a critical component in…

Computation and Language · Computer Science 2025-05-27 AmirHossein Safdarian, Milad Mohammadi, Ehsan Jahanbakhsh, Mona Shahamat Naderi, Heshaam Faili

Large language models (LLMs) have demonstrated significant potential in code generation tasks. However, there remains a performance gap between open-source and closed-source models. To address this gap, existing approaches typically…

Computation and Language · Computer Science 2025-04-18 Weijie Lv, Xuan Xia, Sheng-Jun Huang

Schema matching is essential for integrating heterogeneous data sources and enhancing dataset discovery, yet it remains a complex and resource-intensive problem. We introduce SCHEMORA, a schema matching framework that combines large…

Databases · Computer Science 2025-07-22 Osman Erman Gungor, Derak Paulsen, William Kang

Generalizability to new databases is of vital importance to Text-to-SQL systems, which aim to parse human utterances into SQL statements. Existing works achieve this goal by leveraging the exact matching method to identify the lexical…

Computation and Language · Computer Science 2022-08-09 Aiwei Liu, Xuming Hu, Li Lin, Lijie Wen

In this paper, we investigate the possibility of improving the performance of multi-objective optimization solution approaches using machine learning techniques. Specifically, we focus on multi-objective binary linear programs and employ…

Machine Learning · Statistics 2019-01-31 Alvaro Sierra-Altamiranda, Hadi Charkhgard, Iman Dayarian, Ali Eshragh, Sorna Javadi

A search engine's ability to retrieve desirable datasets is important for data sharing and reuse. Existing dataset search engines typically rely on matching queries to dataset descriptions. However, a user may not have enough prior…

Information Retrieval · Computer Science 2020-01-29 Zhiyu Chen, Haiyan Jia, Jeff Heflin, Brian D. Davison