Related papers: Probabilistic Databases with MarkoViews

Probabilistic Query Evaluation with Bag Semantics

We study the complexity of evaluating queries on probabilistic databases under bag semantics. We focus on self-join free conjunctive queries, and probabilistic databases where occurrences of different facts are independent, which is the…

Databases · Computer Science 2023-07-18 Martin Grohe , Peter Lindner , Christoph Standke

Semantics and Evaluation of Top-k Queries in Probabilistic Databases

We study here fundamental issues involved in top-k query evaluation in probabilistic databases. We consider simple probabilistic databases in which probabilities are associated with individual tuples, and general probabilistic databases in…

Databases · Computer Science 2009-06-08 Xi Zhang , Jan Chomicki

Making massive probabilistic databases practical

Existence of incomplete and imprecise data has moved the database paradigm from deterministic to proba- babilistic information. Probabilistic databases contain tuples that may or may not exist with some probability. As a result, the number…

Databases · Computer Science 2013-07-04 Andrei Todor , Alin Dobra , Tamer Kahveci , Christopher Dudley

Scalable Probabilistic Databases with Factor Graphs and MCMC

Probabilistic databases play a crucial role in the management and understanding of uncertain data. However, incorporating probabilities into the semantics of incomplete databases has posed many challenges, forcing systems to sacrifice…

Databases · Computer Science 2015-03-17 Michael Wick , Andrew McCallum , Gerome Miklau

Independence in Infinite Probabilistic Databases

Probabilistic databases (PDBs) model uncertainty in data. The current standard is to view PDBs as finite probability spaces over relational database instances. Since many attributes in typical databases have infinite domains, such as…

Databases · Computer Science 2022-06-01 Martin Grohe , Peter Lindner

Extracting Multiple Viewpoint Models from Relational Databases

Much time in process mining projects is spent on finding and understanding data sources and extracting the event data needed. As a result, only a fraction of time is spent actually applying techniques to discover, control and predict the…

Databases · Computer Science 2020-01-09 Alessandro Berti , Wil van der Aalst

Symbolic Querying of Vector Spaces: Probabilistic Databases Meets Relational Embeddings

We propose unifying techniques from probabilistic databases and relational embedding models with the goal of performing complex queries on incomplete and uncertain data. We formalize a probabilistic database model with respect to which all…

Artificial Intelligence · Computer Science 2020-06-30 Tal Friedman , Guy Van den Broeck

Multi-view predictive partitioning in high dimensions

Many modern data mining applications are concerned with the analysis of datasets in which the observations are described by paired high-dimensional vectorial representations or "views". Some typical examples can be found in web mining and…

Machine Learning · Statistics 2012-02-07 Brian McWilliams , Giovanni Montana

Challenges for Efficient Query Evaluation on Structured Probabilistic Data

Query answering over probabilistic data is an important task but is generally intractable. However, a new approach for this problem has recently been proposed, based on structural decompositions of input databases, following, e.g., tree…

Databases · Computer Science 2019-08-28 Antoine Amarilli , Silviu Maniu , Mikaël Monet

Fast and Simple Relational Processing of Uncertain Data

This paper introduces U-relations, a succinct and purely relational representation system for uncertain databases. U-relations support attribute-level uncertainty using vertical partitioning. If we consider positive relational algebra…

Databases · Computer Science 2007-07-12 Lyublena Antova , Thomas Jansen , Christoph Koch , Dan Olteanu

Materialized View Replacement using Markovs Analysis

Materialized view is used in large data centric applications to expedite query processing. The efficiency of materialized view depends on degree of result found against the queries over the existing materialized views. Materialized views…

Databases · Computer Science 2014-02-12 Partha Ghosh , Soumya Sen

Cost-Driven Ontology-Based Data Access (Extended Version)

In ontology-based data access (OBDA), users are provided with a conceptual view of a (relational) data source that abstracts away details about data storage. This conceptual view is realized through an ontology that is connected to the data…

Databases · Computer Science 2018-02-05 Davide Lanti , Guohui Xiao , Diego Calvanese

Discriminative Probabilistic Models for Relational Data

In many supervised learning tasks, the entities to be labeled are related to each other in complex ways and their labels are not independent. For example, in hypertext classification, the labels of linked pages are highly correlated. A…

Machine Learning · Computer Science 2013-01-07 Ben Taskar , Pieter Abbeel , Daphne Koller

Computing expected multiplicities for bag-TIDBs with bounded multiplicities

In this work, we study the problem of computing a tuple's expected multiplicity over probabilistic databases with bag semantics (where each tuple is associated with a multiplicity) exactly and approximately. We consider bag-TIDBs where we…

Databases · Computer Science 2022-07-04 Su Feng , Boris Glavic , Aaron Huber , Oliver Kennedy , Atri Rudra

MVDLite: a Fast Validation Algorithm for Model View Definition Rules

Model View Definition (MVD) is the standard methodology to define the data exchange requirements and rule constraints for Building Information Models (BIMs). In this paper, the MVDLite algorithm is proposed for the fast validation of MVD…

Databases · Computer Science 2023-12-29 Han Liu , Ge Gao , Hehua Zhang , Yu-Shen Liu , Yan Song , Ming Gu

A Visual Query Language for Complex-Value Databases

In this paper, a visual language, VCP, for queries on complex-value databases is proposed. The main strength of the new language is that it is purely visual: (i) It has no notion of variable, quantification, partiality, join, pattern…

Databases · Computer Science 2009-09-29 Christoph Koch

Tuple-Independent Representations of Infinite Probabilistic Databases

Probabilistic databases (PDBs) are probability spaces over database instances. They provide a framework for handling uncertainty in databases, as occurs due to data integration, noisy data, data from unreliable sources or randomized…

Databases · Computer Science 2022-04-20 Nofar Carmeli , Martin Grohe , Peter Lindner , Christoph Standke

BayesDB: A probabilistic programming system for querying the probable implications of data

Is it possible to make statistical inference broadly accessible to non-statisticians without sacrificing mathematical rigor or inference quality? This paper describes BayesDB, a probabilistic programming platform that aims to enable users…

Artificial Intelligence · Computer Science 2015-12-17 Vikash Mansinghka , Richard Tibbetts , Jay Baxter , Pat Shafto , Baxter Eaves

Scalable Package Queries in Relational Database Systems

Traditional database queries follow a simple model: they define constraints that each tuple in the result must satisfy. This model is computationally efficient, as the database system can evaluate the query conditions on each tuple…

Databases · Computer Science 2015-12-17 Matteo Brucato , Juan Felipe Beltran , Azza Abouzied , Alexandra Meliou

View Selection in Semantic Web Databases

We consider the setting of a Semantic Web database, containing both explicit data encoded in RDF triples, and implicit data, implied by the RDF semantics. Based on a query workload, we address the problem of selecting a set of views to be…

Databases · Computer Science 2011-11-01 François Goasdoué , Konstantinos Karanasos , Julien Leblay , Ioana Manolescu