Related papers: Approximate Query Processing via Tuple Bubbles

ADQUEX: Adaptive Processing of Federated Queries over Linked Data based on Tuple Routing

Due to the distribution of linked data across the web, the methods that process federated queries through a distributed approach are more attractive to the users and have gained more prosperity. In distributed processing of federated…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-05-20 Amin Beiranvand , Nasser Ghadiri

EntropyDB: A Probabilistic Approach to Approximate Query Processing

We present EntropyDB, an interactive data exploration system that uses a probabilistic approach to generate a small, query-able summary of a dataset. Departing from traditional summarization techniques, we use the Principle of Maximum…

Databases · Computer Science 2019-11-13 Laurel Orr , Magdalena Balazinska , Dan Suciu

Query Answering under Volume-Based Diversity Functions

When query evaluation produces too many tuples, a new approach in query answering is to retrieve a diverse subset of them. The standard approach for measuring the diversity of a set of tuples is to use a distance function between tuples,…

Databases · Computer Science 2025-09-23 Marcelo Arenas , Timo Camillo Merkl , Reinhard Pichler , Cristian Riveros

Active Ordinal Querying for Tuplewise Similarity Learning

Many machine learning tasks such as clustering, classification, and dataset search benefit from embedding data points in a space where distances reflect notions of relative similarity as perceived by humans. A common way to construct such…

Machine Learning · Statistics 2019-11-25 Gregory Canal , Stefano Fenu , Christopher Rozell

Factorised Representations of Query Results

Query tractability has been traditionally defined as a function of input database and query sizes, or of both input and output sizes, where the query result is represented as a bag of tuples. In this report, we introduce a framework that…

Databases · Computer Science 2011-04-06 Dan Olteanu , Jakub Zavodny

Approximate Queries and Representations for Large Data Sequences

Many new database application domains such as experimental sciences and medicine are characterized by large sequences as their main form of data. Using approximate representation can significantly reduce the required storage and search…

Databases · Computer Science 2019-04-22 Hagit Shatkay , Stanley B. Zdonik

ML-AQP: Query-Driven Approximate Query Processing based on Machine Learning

As more and more organizations rely on data-driven decision making, large-scale analytics become increasingly important. However, an analyst is often stuck waiting for an exact result. As such, organizations turn to Cloud providers that…

Databases · Computer Science 2020-03-17 Fotis Savva , Christos Anagnostopoulos , Peter Triantafillou

Intelligent Database Flexible Querying System by Approximate Query Processing

Database flexible querying is an alternative to the classic one for users. The use of Formal Concepts Analysis (FCA) makes it possible to make approximate answers that those turned over by a classic DataBase Management System (DBMS). Some…

Databases · Computer Science 2012-04-17 Oussama Tlili , Minyar Sassi , Habib Ounelli

Scalable Package Queries in Relational Database Systems

Traditional database queries follow a simple model: they define constraints that each tuple in the result must satisfy. This model is computationally efficient, as the database system can evaluate the query conditions on each tuple…

Databases · Computer Science 2015-12-17 Matteo Brucato , Juan Felipe Beltran , Azza Abouzied , Alexandra Meliou

Advancing Fact Attribution for Query Answering: Aggregate Queries and Novel Algorithms

In this paper, we introduce a novel approach to computing the contribution of input tuples to the result of the query, quantified by the Banzhaf and Shapley values. In contrast to prior algorithmic work that focuses on…

Databases · Computer Science 2025-06-23 Omer Abramovich , Daniel Deutch , Nave Frost , Ahmet Kara , Dan Olteanu

Finding Favourite Tuples on Data Streams with Provably Few Comparisons

One of the most fundamental tasks in data science is to assist a user with unknown preferences in finding high-utility tuples within a large database. To accurately elicit the unknown user preferences, a widely-adopted way is by asking the…

Databases · Computer Science 2023-07-07 Guangyi Zhang , Nikolaj Tatti , Aristides Gionis

Scaling Package Queries to a Billion Tuples via Hierarchical Partitioning and Customized Optimization

A package query returns a package - a multiset of tuples - that maximizes or minimizes a linear objective function subject to linear constraints, thereby enabling in-database decision support. Prior work has established the equivalence of…

Databases · Computer Science 2023-11-16 Anh L. Mai , Pengyu Wang , Azza Abouzied , Matteo Brucato , Peter J. Haas , Alexandra Meliou

An Algebraic Approach for Approximity

Comparison to traditionally accurate computing, approximate computing focuses on the rapidity of the satisfactory solution, but not the unnecessary accuracy of the solution. Approximate bisimularity is the approximate one corresponding to…

Logic in Computer Science · Computer Science 2015-12-01 Yong Wang

Approximate Query Processing for Group-By Queries based on Conditional Generative Models

The Group-By query is an important kind of query, which is common and widely used in data warehouses, data analytics, and data visualization. Approximate query processing is an effective way to increase the querying efficiency on big data.…

Databases · Computer Science 2021-01-11 Meifan Zhang , Hongzhi Wang

On an Empirical Likelihood based Solution to the Approximate Bayesian Computation Problem

Approximate Bayesian Computation (ABC) methods are applicable to statistical models specified by generative processes with analytically intractable likelihoods. These methods try to approximate the posterior density of a model parameter by…

Methodology · Statistics 2024-03-11 Sanjay Chaudhuri , Subhroshekhar Ghosh , Kim Cuc Pham

Scheduling of Intermittent Query Processing

Stream processing is usually done either on a tuple-by-tuple basis or in micro-batches. There are many applications where tuples over a predefined duration/window must be processed within certain deadlines. Processing such queries using…

Databases · Computer Science 2024-09-23 Saranya Chandrasekaran , S. Sudarshan

Combining Aggregation and Sampling (Nearly) Optimally for Approximate Query Processing

Sample-based approximate query processing (AQP) suffers from many pitfalls such as the inability to answer very selective queries and unreliable confidence intervals when sample sizes are small. Recent research presented an intriguing…

Databases · Computer Science 2021-03-31 Xi Liang , Stavros Sintos , Zechao Shang , Sanjay Krishnan

Simple Methods to Represent Shapes with Sample Spheres

Representing complex shapes with simple primitives in high accuracy is important for a variety of applications in computer graphics and geometry processing. Existing solutions may produce suboptimal samples or are complex to implement. We…

Graphics · Computer Science 2020-12-04 Li-Yi Wei , Arjun V Anand , Shally Kumar , Tarun Beri

Tuple spaces implementations and their efficiency

Among the paradigms for parallel and distributed computing, the one popularized with Linda, and based on tuple spaces, is one of the least used, despite the fact of being intuitive, easy to understand and to use. A tuple space is a…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-12-12 Vitaly Buravlev , Rocco De Nicola , Claudio Antares Mezzina

A Scheme for Approximating Probabilistic Inference

This paper describes a class of probabilistic approximation algorithms based on bucket elimination which offer adjustable levels of accuracy and efficiency. We analyze the approximation for several tasks: finding the most probable…

Artificial Intelligence · Computer Science 2013-02-08 Rina Dechter , Irina Rish