English
Related papers

Related papers: Sema: A High-performance System for LLM-based Sema…

200 papers

Relational databases excel at structured data analysis, but real-world queries increasingly require capabilities beyond standard SQL, such as semantically matching entities across inconsistent names, extracting information not explicitly…

Databases · Computer Science 2026-05-15 Yin Lin , Tianjing Zeng , Zhongjun Ding , Rong Zhu , Bolin Ding , H. V. Jagadish , Jingren Zhou

LLMs enable an exciting new class of data processing applications over large collections of unstructured documents. Several new programming frameworks have enabled developers to build these applications by composing them out of semantic…

Structured Query Language (SQL) has remained the standard query language for databases. SQL is highly optimized for processing structured data laid out in relations. Meanwhile, in the present application development landscape, it is highly…

Databases · Computer Science 2026-04-24 Udesh Kumarasinghe , Tyler Liu , Ahmed R. Mahmood , Chunwei Liu , Walid G. Aref

With the increasing use of multi-modal data, semantic query has become more and more demanded in data management systems, which is an important way to access and analyze multi-modal data. As unstructured data, most information of…

Databases · Computer Science 2026-03-03 Ruyu Li , Tinghui Zhang , Haodi Ma , Daisy Zhe Wang , Yifan Wang

With advances in large language models (LLMs), researchers are creating new systems that can perform AI-driven analytics over large unstructured datasets. Recent work has explored executing such analytics queries using semantic operators --…

Artificial Intelligence · Computer Science 2025-09-04 Matthew Russo , Tim Kraska

Contemporary database systems, while effective, suffer severe issues related to complexity and usability, especially among individuals who lack technical expertise but are unfamiliar with query languages like Structured Query Language…

Databases · Computer Science 2025-07-25 M. Tedeschi , S. Rizwan , C. Shringi , V. Devram Chandgir , S. Belich

The advent of Large Language Models (LLMs) provides an opportunity to change the way queries are processed, moving beyond the constraints of conventional SQL-based database systems. However, using an LLM to answer a prediction query is…

Information Retrieval · Computer Science 2024-09-04 Ziyu Li , Wenjie Zhao , Asterios Katsifodimos , Rihan Hai

Recent database systems have introduced semantic operators that leverage large language models (LLMs) to filter, join, and project over structured data using natural language predicates. In practice, these operators are combined with…

Traditional query processing relies on engines that are carefully optimized and engineered by many experts. However, new techniques and user requirements evolve rapidly, and existing systems often cannot keep pace. At the same time, these…

Databases · Computer Science 2026-03-03 Jiale Lao , Immanuel Trummer

Generating accurate SQL from users' natural language questions (text-to-SQL) remains a long-standing challenge due to the complexities involved in user question understanding, database schema comprehension, and SQL generation. Traditional…

Computation and Language · Computer Science 2025-11-25 Zijin Hong , Zheng Yuan , Qinggang Zhang , Hao Chen , Junnan Dong , Feiran Huang , Xiao Huang

Large Language Models (LLMs) have made significant progress in assisting users to query databases in natural language. While LLM-based techniques provide state-of-the-art results on many standard benchmarks, their performance significantly…

Artificial Intelligence · Computer Science 2024-07-09 Nina Narodytska , Shay Vargaftik

When complex SQL queries suffer slow executions despite query optimization, DBAs typically invoke automated query rewriting tools to recommend ``lean'' equivalents that are conducive to faster execution. The rewritings are usually achieved…

Databases · Computer Science 2025-09-03 Sriram Dharwada , Himanshu Devrani , Jayant Haritsa , Harish Doraiswamy

While recent advances in large language models have significantly improved Text-to-SQL and table question answering systems, most existing approaches assume that all query-relevant information is explicitly represented in structured…

Databases · Computer Science 2026-04-06 Nima Shahbazi , Seiji Maekawa , Nikita Bhutani , Estevam Hruschka

Recent advancements in Text-to-SQL (Text2SQL) emphasize stimulating the large language models (LLM) on in-context learning, achieving significant results. Nevertheless, they face challenges when dealing with verbose database information and…

Computation and Language · Computer Science 2024-06-04 Zhishuai Li , Xiang Wang , Jingjing Zhao , Sun Yang , Guoqing Du , Xiaoru Hu , Bin Zhang , Yuxiao Ye , Ziyue Li , Rui Zhao , Hangyu Mao

Semantic query processing engines often support semantic joins, enabling users to match rows that satisfy conditions specified in natural language. Such join conditions can be evaluated using large language models (LLMs) that solve novel…

Databases · Computer Science 2025-10-10 Immanuel Trummer

Large Language Models (LLMs) have shown great promise in automating data analytics tasks by interpreting natural language queries and generating multi-operation execution plans. However, existing LLM-agent-based analytics frameworks operate…

Artificial Intelligence · Computer Science 2025-11-03 Haichao Ji , Zibo Wang , Cheng Pan , Meng Han , Yifei Zhu , Dan Wang , Zhu Han

Large Language Models (LLMs) exhibit strong capabilities in text processing, and recent research has augmented SQL and DataFrame with LLM-powered semantic operators for data analysis. However, LLM-based data processing is hindered by slower…

Databases · Computer Science 2026-03-10 Chao Hui , Weizheng Lu , Yanjie Gao , Lingfeng Xiong , Yunhai Wang , Yueguo Chen

Despite the remarkable capabilities of Language Models (LMs) across diverse tasks, no single model consistently outperforms others, necessitating efficient methods to combine their strengths without expensive retraining. Existing model…

Computation and Language · Computer Science 2025-05-27 Jian Gu , Aldeida Aleti , Chunyang Chen , Hongyu Zhang

Automatic SQL generation has been an active research area, aiming at streamlining the access to databases by writing natural language with the given intent instead of writing SQL. Current SOTA methods for semantic parsing depend on LLMs to…

Machine Learning · Computer Science 2022-09-22 Samuel Arcadinho , David Aparício , Hugo Veiga , António Alegria

The integration of heterogeneous databases into a unified querying framework remains a critical challenge, particularly in resource-constrained environments. This paper presents a novel Small Language Model(SLM)-driven system that…

Databases · Computer Science 2025-05-27 Teng Lin
‹ Prev 1 2 3 10 Next ›