Related papers: Making massive probabilistic databases practical

Independence in Infinite Probabilistic Databases

Probabilistic databases (PDBs) model uncertainty in data. The current standard is to view PDBs as finite probability spaces over relational database instances. Since many attributes in typical databases have infinite domains, such as…

Databases · Computer Science 2022-06-01 Martin Grohe , Peter Lindner

Probabilistic Query Evaluation with Bag Semantics

We study the complexity of evaluating queries on probabilistic databases under bag semantics. We focus on self-join free conjunctive queries, and probabilistic databases where occurrences of different facts are independent, which is the…

Databases · Computer Science 2023-07-18 Martin Grohe , Peter Lindner , Christoph Standke

Tuple-Independent Representations of Infinite Probabilistic Databases

Probabilistic databases (PDBs) are probability spaces over database instances. They provide a framework for handling uncertainty in databases, as occurs due to data integration, noisy data, data from unreliable sources or randomized…

Databases · Computer Science 2022-04-20 Nofar Carmeli , Martin Grohe , Peter Lindner , Christoph Standke

Querying Incomplete Numerical Data: Between Certain and Possible Answers

Queries with aggregation and arithmetic operations, as well as incomplete data, are common in real-world database, but we lack a good understanding of how they should interact. On the one hand, systems based on SQL provide ad-hoc rules for…

Databases · Computer Science 2022-11-02 Marco Console , Leonid Libkin , Liat Peterfreund

On Constrained Open-World Probabilistic Databases

Increasing amounts of available data have led to a heightened need for representing large-scale probabilistic knowledge bases. One approach is to use a probabilistic database, a model with strong assumptions that allow for efficiently…

Artificial Intelligence · Computer Science 2019-04-04 Tal Friedman , Guy Van den Broeck

Probabilistic Databases with an Infinite Open-World Assumption

Probabilistic databases (PDBs) introduce uncertainty into relational databases by specifying probabilities for several possible instances. Traditionally, they are finite probability spaces over database instances. Such finite PDBs…

Databases · Computer Science 2019-04-12 Martin Grohe , Peter Lindner

The Consistency of Probabilistic Databases with Independent Cells

A probabilistic database with attribute-level uncertainty consists of relations where cells of some attributes may hold probability distributions rather than deterministic content. Such databases arise, implicitly or explicitly, in the…

Databases · Computer Science 2022-12-26 Amir Gilad , Aviram Imber , Benny Kimelfeld

Computing expected multiplicities for bag-TIDBs with bounded multiplicities

In this work, we study the problem of computing a tuple's expected multiplicity over probabilistic databases with bag semantics (where each tuple is associated with a multiplicity) exactly and approximately. We consider bag-TIDBs where we…

Databases · Computer Science 2022-07-04 Su Feng , Boris Glavic , Aaron Huber , Oliver Kennedy , Atri Rudra

Bayes Networks for Supporting Query Processing Over Incomplete Autonomous Databases

As the information available to lay users through autonomous data sources continues to increase, mediators become important to ensure that the wealth of information available is tapped effectively. A key challenge that these information…

Databases · Computer Science 2012-08-29 Rohit Raghunathan , Sushovan De , Subbarao Kambhampati

Distinctiveness Maximization in Datasets Assemblage

In this paper, given a user's query set and budget, we aim to use the limited budget to help users assemble a set of datasets that can enrich a base dataset by introducing the maximum number of distinct tuples (i.e., maximizing…

Databases · Computer Science 2025-02-28 Tingting Wang , Shixun Huang , Zhifeng Bao , J. Shane Culpepper , Volkan Dedeoglu , Reza Arablouei

A Method for Implementing a Probabilistic Model as a Relational Database

This paper discusses a method for implementing a probabilistic inference system based on an extended relational data model. This model provides a unified approach for a variety of applications such as dynamic programming, solving sparse…

Artificial Intelligence · Computer Science 2013-02-21 Michael S. K. M. Wong , C. J. Butz , Yang Xiang

Infinite Probabilistic Databases

Probabilistic databases (PDBs) are used to model uncertainty in data in a quantitative way. In the standard formal framework, PDBs are finite probability spaces over relational database instances. It has been argued convincingly that this…

Databases · Computer Science 2020-01-09 Martin Grohe , Peter Lindner

Infinite Probabilistic Databases

Probabilistic databases (PDBs) model uncertainty in data in a quantitative way. In the established formal framework, probabilistic (relational) databases are finite probability spaces over relational database instances. This finiteness can…

Databases · Computer Science 2023-06-22 Martin Grohe , Peter Lindner

Semantics and Evaluation of Top-k Queries in Probabilistic Databases

We study here fundamental issues involved in top-k query evaluation in probabilistic databases. We consider simple probabilistic databases in which probabilities are associated with individual tuples, and general probabilistic databases in…

Databases · Computer Science 2009-06-08 Xi Zhang , Jan Chomicki

Probabilistic Relational Model Benchmark Generation

The validation of any database mining methodology goes through an evaluation process where benchmarks availability is essential. In this paper, we aim to randomly generate relational database benchmarks that allow to check probabilistic…

Machine Learning · Computer Science 2016-03-03 Mouna Ben Ishak , Rajani Chulyadyo , Philippe Leray

Model Counting of Query Expressions: Limitations of Propositional Methods

Query evaluation in tuple-independent probabilistic databases is the problem of computing the probability of an answer to a query given independent probabilities of the individual tuples in a database instance. There are two main approaches…

Databases · Computer Science 2013-12-17 Paul Beame , Jerry Li , Sudeepa Roy , Dan Suciu

Probabilistic Data with Continuous Distributions

Statistical models of real world data typically involve continuous probability distributions such as normal, Laplace, or exponential distributions. Such distributions are supported by many probabilistic modelling formalisms, including…

Databases · Computer Science 2021-03-08 Martin Grohe , Benjamin Lucien Kaminski , Joost-Pieter Katoen , Peter Lindner

Fully Dynamic Data Structure for Top-k Queries on Uncertain Data

Top-$k$ queries allow end-users to focus on the most important (top-$k$) answers amongst those which satisfy the query. In traditional databases, a user defined score function assigns a score value to each tuple and a top-$k$ query returns…

Databases · Computer Science 2010-07-30 Manish Patil , Rahul Shah , Sharma V. Thankachan

A Unified Approach to Ranking in Probabilistic Databases

The dramatic growth in the number of application domains that naturally generate probabilistic, uncertain data has resulted in a need for efficiently supporting complex querying and decision-making over such data. In this paper, we present…

Databases · Computer Science 2010-12-17 Jian Li , Barna Saha , Amol Deshpande

Defining and Mining Functional Dependencies in Probabilistic Databases

Functional dependencies -- traditional, approximate and conditional are of critical importance in relational databases, as they inform us about the relationships between attributes. They are useful in schema normalization, data…

Databases · Computer Science 2010-12-14 Sushovan De , Subbarao Kambhampati