Related papers: Database Queries that Explain their Work

Provenance Traces

Provenance is information about the origin, derivation, ownership, or history of an object. It has recently been studied extensively in scientific databases and other settings due to its importance in helping scientists judge data validity,…

Programming Languages · Computer Science 2008-12-03 James Cheney, Umut Acar, Amal Ahmed

A Core Calculus for Provenance

Provenance is an increasing concern due to the ongoing revolution in sharing and processing scientific data on the Web and in other computer systems. It is proposed that many computer systems will need to become provenance-aware in order to…

Programming Languages · Computer Science 2014-01-06 Umut A. Acar, Amal Ahmed, James Cheney, Roly Perera

Language-integrated provenance in Haskell

Scientific progress increasingly depends on data management, particularly to clean and curate data so that it can be systematically analyzed and reused. A wealth of techniques for managing and curating data (and its provenance) have been…

Programming Languages · Computer Science 2018-03-28 Jan Stolarek, James Cheney

Provenance as Dependency Analysis

Provenance is information recording the source, derivation, or history of some information. Provenance tracking has been studied in a variety of settings; however, although many design points have been explored, the mathematical or semantic…

Databases · Computer Science 2009-12-22 James Cheney, Amal Ahmed, Umut Acar

Language-integrated provenance by trace analysis

Language-integrated provenance builds on language-integrated query techniques to make provenance information explaining query results readily available to programmers. In previous work we have explored language-integrated approaches to…

Programming Languages · Computer Science 2019-05-07 Stefan Fehrenbach, James Cheney

Query By Provenance

To assist non-specialists in formulating database queries, multiple frameworks that automatically infer queries from a set of examples have been proposed. While highly useful, a shortcoming of the approach is that if users can only provide…

Databases · Computer Science 2016-05-17 Daniel Deutch, Amir Gilad

Explaining Natural Language Query Results

Multiple lines of research have developed Natural Language (NL) interfaces for formulating database queries. We build upon this work, but focus on presenting a highly detailed form of the answers in NL. The answers that we present are…

Databases · Computer Science 2020-07-10 Daniel Deutch, Nave Frost, Amir Gilad

You Say 'What', I Hear 'Where' and 'Why': (Mis-)Interpreting SQL to Derive Fine-Grained Provenance

SQL declaratively specifies what the desired output of a query is. This work shows that a non-standard interpretation of the SQL semantics can, instead, disclose where a piece of the output originated in the input and why that piece found…

Databases · Computer Science 2018-07-04 Tobias Müller, Benjamin Dietrich, Torsten Grust

Language-integrated provenance

Provenance, or information about the origin or derivation of data, is important for assessing the trustworthiness of data and identifying and correcting mistakes. Most prior implementations of data provenance have involved heavyweight…

Programming Languages · Computer Science 2017-08-23 Stefan Fehrenbach, James Cheney

Towards Inferring Queries from Simple and Partial Provenance Examples

The field of query-by-example aims at inferring queries from output examples given by non-expert users, by finding the underlying logic that binds the examples. However, for a very small set of examples, it is difficult to correctly infer…

Databases · Computer Science 2020-08-21 Amir Gilad, Yuval Moskovitch

Validation and Inference of Schema-Level Workflow Data-Dependency Annotations

An advantage of scientific workflow systems is their ability to collect runtime provenance information as an execution trace. Traces include the computation steps invoked as part of the workflow run along with the corresponding data…

Databases · Computer Science 2018-07-27 Shawn Bowers, Timothy McPhillips, Bertram Ludäscher

Provenance for Aggregate Queries

We study in this paper provenance information for queries with aggregation. Provenance information was studied in the context of various query languages that do not allow for aggregation, and recent work has suggested to capture provenance…

Databases · Computer Science 2015-03-17 Yael Amsterdamer, Daniel Deutch, Val Tannen

Hypothetical Reasoning via Provenance Abstraction

Data analytics often involves hypothetical reasoning: repeatedly modifying the data and observing the induced effect on the computation result of a data-centric application. Previous work has shown that fine-grained data provenance can help…

Databases · Computer Science 2020-07-13 Daniel Deutch, Yuval Moskovitch, Noam Rinetzky

Efficiently Computing Provenance Graphs for Queries with Negation

Explaining why an answer is in the result of a query or why it is missing from the result is important for many applications including auditing, debugging data and queries, and answering hypothetical questions about data. Both types of…

Databases · Computer Science 2017-01-23 Seokki Lee, Sven Koehler, Bertram Ludaescher, Boris Glavic

Scientific Workflows and Provenance: Introduction and Research Opportunities

Scientific workflows are becoming increasingly popular for compute-intensive and data-intensive scientific applications. The vision and promise of scientific workflows includes rapid, easy workflow design, reuse, scalable execution, and…

Databases · Computer Science 2013-11-26 Víctor Cuevas-Vicenttín, Saumen Dey, Sven Köhler, Sean Riddle, Bertram Ludäscher

ProvSQL: A General System for Keeping Track of the Provenance and Probability of Data

We present the data model, design choices, and performance of ProvSQL, a general and easy-to-deploy provenance tracking and probabilistic database system implemented as a PostgreSQL extension. ProvSQL's data and query models closely reflect…

Databases · Computer Science 2025-12-18 Aryak Sen, Silviu Maniu, Pierre Senellart

A Provenance Tracking Model for Data Updates

For data-centric systems, provenance tracking is particularly important when the system is open and decentralised, such as the Web of Linked Data. In this paper, a concise but expressive calculus which models data updates is presented. The…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-08-24 Gabriel Ciobanu, Ross Horne

Provenance for SPARQL queries

Determining trust of data available in the Semantic Web is fundamental for applications and users, in particular for linked open data obtained from SPARQL endpoints. There exist several proposals in the literature to annotate SPARQL query…

Databases · Computer Science 2015-03-20 C. V. Damásio, A. Analyti, G. Antoniou

Putting Things into Context: Rich Explanations for Query Answers using Join Graphs (extended version)

In many data analysis applications, there is a need to explain why a surprising or interesting result was produced by a query. Previous approaches to explaining results have directly or indirectly used data provenance (input tuples…

Databases · Computer Science 2021-03-30 Chenjie Li, Zhengjie Miao, Qitian Zeng, Boris Glavic, Sudeepa Roy

Causality and the Semantics of Provenance

Provenance, or information about the sources, derivation, custody or history of data, has been studied recently in a number of contexts, including databases, scientific workflows and the Semantic Web. Many provenance mechanisms have been…

Logic in Computer Science · Computer Science 2010-06-09 James Cheney