Related papers: UQE: A Query Engine for Unstructured Databases

200 papers

Nowadays, the explosion of unstructured data presents immense analytical value. Leveraging the remarkable capability of large language models (LLMs) in extracting attributes of structured tables from unstructured data, researchers are…

The rise of Large Language Models (LLMs) has accelerated the long-standing goal of enabling natural language querying over complex, hybrid databases. Yet, this ambition exposes a dual challenge: reasoning jointly over structured,…

Databases · Computer Science 2025-10-22 Aymane Hassini

While most conversational agents are grounded on either free-text or structured knowledge, many knowledge corpora consist of hybrid sources. This paper presents the first conversational agent that supports the full generality of hybrid data…

Computation and Language · Computer Science 2024-03-14 Shicheng Liu, Jialiang Xu, Wesley Tjangnaka, Sina J. Semnani, Chen Jie Yu, Monica S. Lam

With the increasing use of multi-modal data, semantic querying is in growing demand in data management systems as an important way to access and analyze such data. As unstructured data, most information of…

Databases · Computer Science 2026-03-03 Ruyu Li, Tinghui Zhang, Haodi Ma, Daisy Zhe Wang, Yifan Wang

Most recently, researchers have started building data systems powered by large language models (LLMs) that let users analyze unstructured text documents as if working with a database, because LLMs are very effective at extracting attributes…

Databases · Computer Science 2025-07-14 Zhaoze Sun, Qiyan Deng, Chengliang Chai, Kaisen Jin, Xinyu Guo, Han Han, Ye Yuan, Guoren Wang, Lei Cao

The advent of Large Language Models (LLMs) provides an opportunity to change the way queries are processed, moving beyond the constraints of conventional SQL-based database systems. However, using an LLM to answer a prediction query is…

Information Retrieval · Computer Science 2024-09-04 Ziyu Li, Wenjie Zhao, Asterios Katsifodimos, Rihan Hai

Querying tables with unstructured data is challenging due to the presence of text (or image), either embedded in the table or in external paragraphs, which traditional SQL struggles to process, especially for tasks requiring semantic…

Artificial Intelligence · Computer Science 2025-09-25 Rohit Khoja, Devanshu Gupta, Yanjie Fu, Dan Roth, Vivek Gupta

Modern information retrieval must reconcile short, ambiguous queries with increasingly diverse and dynamic corpora. Query expansion (QE) remains a core technique for mitigating vocabulary mismatch, but its design space has been reshaped by…

Information Retrieval · Computer Science 2026-05-08 Minghan Li, Xinxuan Lv, Junjie Zou, Tongna Chen, Chao Zhang, Suchao An, Ercong Nie, Guodong Zhou

In many use-cases, information is stored in text but not available in structured data. However, extracting data from natural language text to precisely fit a schema, and thus enable querying, is a challenging task. With the rise of…

Databases · Computer Science 2023-10-26 Mohammed Saeed, Nicola De Cao, Paolo Papotti

With the breakthroughs in large language models (LLMs), query generation techniques that expand documents and queries with related terms are becoming increasingly popular in the information retrieval field. Such techniques have been shown…

Information Retrieval · Computer Science 2025-07-16 Adam Yang, Gustavo Penha, Enrico Palumbo, Hugues Bouchard

Large Language Models (LLMs) have demonstrated remarkable capabilities across various tasks due to large training datasets and powerful transformer architecture. However, the reliability of responses from LLMs remains a question.…

Computation and Language · Computer Science 2025-02-26 Tiejin Chen, Xiaoou Liu, Longchao Da, Jia Chen, Vagelis Papalexakis, Hua Wei

Unstructured data, in the form of text, images, video, and audio, is produced at exponentially higher rates. In tandem, machine learning (ML) methods have become increasingly powerful at analyzing unstructured data. Modern ML methods can…

Databases · Computer Science 2025-11-06 Daniel Kang

This paper presents an opinion on the potential of using large language models to query both unstructured and structured data. It also outlines some research challenges related to the topic of building question-answering systems for both…

Databases · Computer Science 2023-07-07 Wang-Chiew Tan

Recent knowledge editing methods have primarily focused on modifying structured knowledge in large language models. However, this task setting overlooks the fact that a significant portion of real-world knowledge is stored in an…

Computation and Language · Computer Science 2025-02-26 Jingcheng Deng, Zihao Wei, Liang Pang, Hanxing Ding, Huawei Shen, Xueqi Cheng

Query optimization, which finds the optimized execution plan for a given query, is a complex planning and decision-making problem within the exponentially growing plan space in database management systems (DBMS). Traditional optimizers…

Databases · Computer Science 2025-02-11 Jie Tan, Kangfei Zhao, Rui Li, Jeffrey Xu Yu, Chengzhi Piao, Hong Cheng, Helen Meng, Deli Zhao, Yu Rong

Table understanding requires structured, multi-step reasoning. Large Language Models (LLMs) struggle with it due to the structural complexity of tabular data. Recently, multi-agent frameworks for SQL generation have shown promise in…

Computation and Language · Computer Science 2025-12-02 Songyuan Sui, Hongyi Liu, Serena Liu, Li Li, Soo-Hyun Choi, Rui Chen, Xia Hu

Designing effective data manipulation methods is a long-standing problem in data lakes. Traditional methods, which rely on rules or machine learning models, require extensive human effort for training data collection and model tuning.…

Artificial Intelligence · Computer Science 2024-05-13 Yichen Qian, Yongyi He, Rong Zhu, Jintao Huang, Zhijian Ma, Haibin Wang, Yaohua Wang, Xiuyu Sun, Defu Lian, Bolin Ding, Jingren Zhou

The popularity of data science as a discipline and its importance in the emerging economy and industrial progress dictate that machine learning be democratized for the masses. This also means that the current practice of workforce training…

Machine Learning · Computer Science 2024-05-28 Hasan M Jamil

Large Language Models (LLMs) excel in text generation, reasoning, and decision-making, enabling their adoption in high-stakes domains such as healthcare, law, and transportation. However, their reliability is a major concern, as they often…

Computation and Language · Computer Science 2025-06-05 Xiaoou Liu, Tiejin Chen, Longchao Da, Chacha Chen, Zhen Lin, Hua Wei

Querying, conversing with, and controlling search and information-seeking interfaces using natural language are fast becoming ubiquitous with the rise and adoption of large language models (LLMs). In this position paper, we describe a generic…

Information Retrieval · Computer Science 2023-06-29 Avishek Anand, Venktesh V, Abhijit Anand, Vinay Setty