Related papers: Object-Relational Database Representations for Tex…

On the Scalability of Multidimensional Databases

It is commonly accepted in the practice of on-line analytical processing of databases that the multidimensional database organization is less scalable than the relational one. It is easy to see that the size of the multidimensional…

Databases · Computer Science 2011-04-27 István Szépkúti

The Curse of Dense Low-Dimensional Information Retrieval for Large Index Sizes

Information Retrieval using dense low-dimensional representations recently became popular and showed out-performance to traditional sparse-representations like BM25. However, no previous work investigated how dense representations perform…

Information Retrieval · Computer Science 2021-06-10 Nils Reimers , Iryna Gurevych

Multidimensional or Relational? / How to Organize an On-line Analytical Processing Database

In the past few years, the number of OLAP applications increased quickly. These applications use two significantly different DB structures: multidimensional (MD) and table-based. One can show that the traditional model of relational…

Databases · Computer Science 2011-04-21 István Szépkúti

Serving Deep Learning Models with Deduplication from Relational Databases

There are significant benefits to serve deep learning models from relational databases. First, features extracted from databases do not need to be transferred to any decoupled deep learning systems for inferences, and thus the system…

Databases · Computer Science 2022-10-24 Lixi Zhou , Jiaqing Chen , Amitabh Das , Hong Min , Lei Yu , Ming Zhao , Jia Zou

Deductive Optimization of Relational Data Storage

Optimizing the physical data storage and retrieval of data are two key database management problems. In this paper, we propose a language that can express a wide range of physical database layouts, going well beyond the row- and…

Programming Languages · Computer Science 2020-02-07 John K. Feser , Samuel Madden , Nan Tang , Armando Solar-Lezama

Hippo: A Fast, yet Scalable, Database Indexing Approach

Even though existing database indexes (e.g., B+-Tree) speed up the query execution, they suffer from two main drawbacks: (1) A database index usually yields 5% to 15% additional storage overhead which results in non-ignorable dollar cost in…

Databases · Computer Science 2016-04-13 Jia Yu , Mohamed Sarwat

Object-Oriented Knowledge Representation and Data Storage Using Inhomogeneous Classes

This paper contains analysis of concept of a class within different object-oriented knowledge representation models. The main attention is paid to structure of the class and its efficiency in the context of data storage, using…

Artificial Intelligence · Computer Science 2018-11-02 Dmytro Terletskyi

Size-l Object Summaries for Relational Keyword Search

A previously proposed keyword search paradigm produces, as a query result, a ranked list of Object Summaries (OSs). An OS is a tree structure of related tuples that summarizes all data held in a relational database about a particular Data…

Databases · Computer Science 2011-12-01 Georgios J. Fakas , Zhi Cai , Nikos Mamoulis

Aggregation and Ordering in Factorised Databases

A common approach to data analysis involves understanding and manipulating succinct representations of data. In earlier work, we put forward a succinct representation system for relational data called factorised databases and reported on…

Databases · Computer Science 2013-07-02 Nurzhan Bakibayev , Tomáš Kočiský , Dan Olteanu , Jakub Závodný

Fast, Incremental Inverted Indexing in Main Memory for Web-Scale Collections

For text retrieval systems, the assumption that all data structures reside in main memory is increasingly common. In this context, we present a novel incremental inverted indexing algorithm for web-scale collections that directly constructs…

Information Retrieval · Computer Science 2013-05-06 Nima Asadi , Jimmy Lin

Fast and Simple Relational Processing of Uncertain Data

This paper introduces U-relations, a succinct and purely relational representation system for uncertain databases. U-relations support attribute-level uncertainty using vertical partitioning. If we consider positive relational algebra…

Databases · Computer Science 2007-07-12 Lyublena Antova , Thomas Jansen , Christoph Koch , Dan Olteanu

Minimizing the Number of Matching Queries for Object Retrieval

To increase the computational efficiency of interest-point based object retrieval, researchers have put remarkable research efforts into improving the efficiency of kNN-based feature matching, pursuing to match thousands of features against…

Computer Vision and Pattern Recognition · Computer Science 2015-08-19 Johannes Niedermayer , Peer Kröger

Towards Adaptive Storage Views in Virtual Memory

Traditionally, DBMSs separate their storage layer from their indexing layer. While the storage layer physically materializes the database and provides low-level access methods to it, the indexing layer on top enables a faster locating of…

Databases · Computer Science 2022-12-07 Felix Schuhknecht , Justus Henneberg

Difference-Huffman Coding of Multidimensional Databases

A new compression method called difference-Huffman coding (DHC) is introduced in this paper. It is verified empirically that DHC results in a smaller multidimensional physical representation than those for other previously published…

Databases · Computer Science 2011-04-21 István Szépkúti

The OverRelational Manifesto

The OverRelational Manifesto (below ORM) proposes a possible approach to creation of data storage systems of the next generation. ORM starts from the requirement that information in a relational database is represented by a set of relation…

Databases · Computer Science 2007-05-23 Evgeniy Grigoriev

MaskSearch: Querying Image Masks at Scale

Machine learning tasks over image databases often generate masks that annotate image content (e.g., saliency maps, segmentation maps, depth maps) and enable a variety of applications (e.g., determine if a model is learning spurious…

Databases · Computer Science 2024-01-09 Dong He , Jieyu Zhang , Maureen Daum , Alexander Ratner , Magdalena Balazinska

Odysseus/DFS: Integration of DBMS and Distributed File System for Transaction Processing of Big Data

The relational DBMS (RDBMS) has been widely used since it supports various high-level functionalities such as SQL, schemas, indexes, and transactions that do not exist in the O/S file system. But, a recent advent of big data technology…

Databases · Computer Science 2014-06-03 Jun-Sung Kim , Kyu-Young Whang , Hyuk-Yoon Kwon , Il-Yeol Song

FDB: A Query Engine for Factorised Relational Databases

Factorised databases are relational databases that use compact factorised representations at the physical layer to reduce data redundancy and boost query performance. This paper introduces FDB, an in-memory query engine for…

Databases · Computer Science 2012-03-14 Nurzhan Bakibayev , Dan Olteanu , Jakub Závodný

The Case for Instance-Optimized LLMs in OLAP Databases

Large Language Models (LLMs) can enhance analytics systems with powerful data summarization, cleaning, and semantic transformation capabilities. However, deploying LLMs at scale -- processing millions to billions of rows -- remains…

Databases · Computer Science 2025-07-08 Bardia Mohammadi , Laurent Bindschaedler

Efficient Immediate-Access Dynamic Indexing

In a dynamic retrieval system, documents must be ingested as they arrive, and be immediately findable by queries. Our purpose in this paper is to describe an index structure and processing regime that accommodates that requirement for…

Information Retrieval · Computer Science 2023-01-12 Alistair Moffat , Joel Mackenzie