English
Related papers

Related papers: Data Generation for Testing and Grading SQL Querie…

200 papers

Generation of sample data for testing SQL queries has been an important task for many years, with applications such as testing of SQL queries used for data analytics and in application software, as well as student SQL queries. More…

Databases · Computer Science 2024-09-30 Sunanda Somwase , Parismita Das , S. Sudarshan

Formulating efficient SQL queries requires several cycles of tuning and execution, particularly for inexperienced users. We examine methods that can accelerate and improve this interaction by providing insights about SQL queries prior to…

Databases · Computer Science 2020-02-24 Zainab Zolaktaf , Mostafa Milani , Rachel Pottinger

We propose a novel approach for generating complex outputs that significantly improves accuracy in text-to-SQL tasks. Our method leverages execution results to select the most semantically consistent query from multiple candidates, enabling…

Computation and Language · Computer Science 2025-04-01 Łukasz Borchmann , Marek Wydmuch

The ability of generative language models (GLMs) to generate text has improved considerably in the last few years, enabling their use for generative data augmentation. In this work, we propose CONDA, an approach to further improve GLMs'…

Computation and Language · Computer Science 2022-10-26 Dheeraj Mekala , Tu Vu , Timo Schick , Jingbo Shang

Question Generation (QG), as a challenging Natural Language Processing task, aims at generating questions based on given answers and context. Existing QG methods mainly focus on building or training models for specific QG datasets. These…

Computation and Language · Computer Science 2022-12-06 Wei Yuan , Hongzhi Yin , Tieke He , Tong Chen , Qiufeng Wang , Lizhen Cui

Question Answering (QA) systems require a large amount of annotated data which is costly and time-consuming to gather. Converting datasets of existing QA benchmarks are challenging due to different formats and complexities. To address these…

Computation and Language · Computer Science 2022-10-14 Saptarashmi Bandyopadhyay , Shraman Pal , Hao Zou , Abhranil Chandra , Jordan Boyd-Graber

Grading SQL queries can be a time-consuming, tedious and challenging task, especially as the number of student submissions increases. Several systems have been introduced in an attempt to mitigate these challenges, but those systems have…

Computers and Society · Computer Science 2024-06-25 Donald R. Schwartz , Pablo Rivas

Automatic SQL generation has been an active research area, aiming at streamlining the access to databases by writing natural language with the given intent instead of writing SQL. Current SOTA methods for semantic parsing depend on LLMs to…

Machine Learning · Computer Science 2022-09-22 Samuel Arcadinho , David Aparício , Hugo Veiga , António Alegria

Database systems are widely used to store and query data. Test oracles have been proposed to find logic bugs in such systems, that is, bugs that cause the database system to compute an incorrect result. To realize a fully automated testing…

Software Engineering · Computer Science 2024-01-01 Jinsheng Ba , Manuel Rigger

Data analysts use SQL queries to access and manipulate data on their databases. However, these queries are often challenging to write, and small mistakes can lead to unexpected data output. Recent work has explored several ways to…

Grading student SQL queries manually is a tedious and error-prone process. Earlier work on testing correctness of student SQL queries, such as the XData system, can be used to test correctness of a student query. However, in case a student…

Databases · Computer Science 2019-12-20 Bikash Chandra , Ananyo Banerjee , Udbhas Hazra , Mathew Joseph , S. Sudarshan

Database Management System (DBMS) plays a core role in modern software from mobile apps to online banking. It is critical that DBMS should provide correct data to all applications. When the DBMS returns incorrect data, a correctness bug is…

Software Engineering · Computer Science 2021-07-09 Yushan Zhang , Peisen Yao , Rongxin Wu , Charles Zhang

Having access to realistic workloads for a given database instance is extremely important to enable stress and vulnerability testing, as well as to optimize for cost and performance. Recent advances in learned cost models have shown that…

Existing question answering (QA) systems owe much of their success to large, high-quality training data. Such annotation efforts are costly, and the difficulty compounds in the cross-lingual setting. Therefore, prior cross-lingual QA work…

Computation and Language · Computer Science 2023-10-18 Bryan Li , Chris Callison-Burch

We present a generative model to map natural language questions into SQL queries. Existing neural network based approaches typically generate a SQL query word-by-word, however, a large portion of the generated results are incorrect or not…

Computation and Language · Computer Science 2018-04-24 Yibo Sun , Duyu Tang , Nan Duan , Jianshu Ji , Guihong Cao , Xiaocheng Feng , Bing Qin , Ting Liu , Ming Zhou

Query-document relevance prediction is a critical problem in Information Retrieval systems. This problem has increasingly been tackled using (pretrained) transformer-based models which are finetuned using large collections of labeled data.…

Information Retrieval · Computer Science 2023-06-21 Aditi Chaudhary , Karthik Raman , Krishna Srinivasan , Kazuma Hashimoto , Mike Bendersky , Marc Najork

Question and answer generation is a data augmentation method that aims to improve question answering (QA) models given the limited amount of human labeled data. However, a considerable gap remains between synthetic and human-generated…

Computation and Language · Computer Science 2020-02-25 Raul Puri , Ryan Spring , Mostofa Patwary , Mohammad Shoeybi , Bryan Catanzaro

Quantum machine learning integrates the strengths of quantum computing and machine learning, enabling models to learn complex features using fewer parameters than their classical counterparts. Due to the increasing complexity of quantum…

Quantum Physics · Physics 2026-05-04 Emma Andrews , Prabhat Mishra

We study how to learn a semantic parser of state-of-the-art accuracy with less supervised training data. We conduct our study on WikiSQL, the largest hand-annotated semantic parsing dataset to date. First, we demonstrate that question…

Computation and Language · Computer Science 2018-08-28 Daya Guo , Yibo Sun , Duyu Tang , Nan Duan , Jian Yin , Hong Chi , James Cao , Peng Chen , Ming Zhou

SQL Injection (SQLi) continues to pose a significant threat to the security of web applications, enabling attackers to manipulate databases and access sensitive information without authorisation. Although advancements have been made in…

Cryptography and Security · Computer Science 2025-02-10 Naga Sai Dasari , Atta Badii , Armin Moin , Ahmed Ashlam
‹ Prev 1 2 3 10 Next ›