Related papers: Toward Temporal Attribution Analytics in Dataflows

200 papers

In temporal interaction networks, vertices correspond to entities, which exchange data quantities (e.g., money, bytes, messages) over time. Tracking the origin of data that have reached a given vertex at any time can help data analysts to…

Databases · Computer Science 2021-10-12 Chrysanthi Kosyfaki, Nikos Mamoulis

Provenance refers to the documentation of an object's lifecycle. This documentation (often represented as a graph) should include all the information necessary to reproduce a certain piece of data or the process that led to it. In a dynamic…

Databases · Computer Science 2012-11-22 Seyed-Mehdi-Reza Beheshti, Hamid Reza Motahari-Nezhad, Boualem Benatallah

In the world of science, new technologies have opened up the possibility to rely on advanced computational methods and models to conduct and produce scientific research. An important aspect of scientific and business workflows is provenance -…

Software Engineering · Computer Science 2025-04-01 Ludwig Stage, Julia Dahlberg, Dimka Karastoyanova

Demand is growing for more accountability regarding the technological systems that increasingly occupy our world. However, the complexity of many of these systems - often systems-of-systems - poses accountability challenges. A key reason…

Computers and Society · Computer Science 2019-11-18 Jatinder Singh, Jennifer Cobbe, Chris Norval

Provenance is information recording the source, derivation, or history of some information. Provenance tracking has been studied in a variety of settings; however, although many design points have been explored, the mathematical or semantic…

Databases · Computer Science 2009-12-22 James Cheney, Amal Ahmed, Umut Acar

Data provenance is a valuable tool for detecting and preventing cyber attacks, providing insight into the nature of suspicious events. For example, an administrator can use provenance to identify the perpetrator of a data leak, track an…

Cryptography and Security · Computer Science 2016-09-02 Adam Bates, Kevin Butler, Alin Dobra, Brad Reaves, Patrick Cable, Thomas Moyer, Nabil Schear

Workflow provenance typically assumes that each module is a "black-box", so that each output depends on all inputs (coarse-grained dependencies). Furthermore, it does not model the internal state of a module, which can change between…

Databases · Computer Science 2012-01-04 Yael Amsterdamer, Susan B. Davidson, Daniel Deutch, Tova Milo, Julia Stoyanovich, Val Tannen

In this paper, we investigate how we can leverage the Spark platform for efficiently processing provenance queries on large volumes of workflow provenance data. We focus on processing provenance queries at the attribute-value level, which is the…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-10-26 Rajmohan C, Pranay Lohia, Himanshu Gupta, Siddhartha Brahma, Mauricio Hernandez, Sameep Mehta

The complexity of exploratory data analysis poses significant challenges for collaboration and effective communication of analytic workflows. Automated methods can alleviate these challenges by summarizing workflows into more interpretable…

Human-Computer Interaction · Computer Science 2024-10-16 Shaghayegh Esmaeili, Irelis D. Suarez, Ezekiel Ajayi, Eric D. Ragan

For data-centric systems, provenance tracking is particularly important when the system is open and decentralised, such as the Web of Linked Data. In this paper, a concise but expressive calculus which models data updates is presented. The…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-08-24 Gabriel Ciobanu, Ross Horne

To benefit from the abundance of data and the insights it brings, data processing pipelines are being used in many areas of research and development in both industry and academia. One approach to automating data processing pipelines is the…

Cryptography and Security · Computer Science 2023-10-18 Ludwig Stage, Dimka Karastoyanova

Data provenance describes how data came to be in its present form. It includes data sources and the transformations that have been applied to them. Data provenance has many uses, from forensics and security to aiding the reproducibility of…

Cryptography and Security · Computer Science 2017-11-16 Thomas Pasquier, Xueyuan Han, Mark Goldstein, Thomas Moyer, David Eyers, Margo Seltzer, Jean Bacon

Analytic provenance can be visually encoded to help users track their ongoing analysis trajectories, recall past interactions, and inform new analytic directions. Despite its significance, provenance is often hardwired into analytics…

Human-Computer Interaction · Computer Science 2025-05-20 Arpit Narechania, Shunan Guo, Eunyee Koh, Alex Endert, Jane Hoffswell

The transformations, analyses and interpretations of data in scientific workflows are vital for the repeatability and reliability of scientific workflows. This provenance of scientific workflows has been effectively carried out in Grid…

Databases · Computer Science 2016-11-18 Khawar Hasham, Kamran Munir, Jetendr Shamdasani, Richard McClatchey

One of the foundations of science is that researchers must publish the methodology used to achieve their results so that others can attempt to reproduce them. This has the added benefit of allowing methods to be adopted and adapted for…

Databases · Computer Science 2014-06-05 Paolo Missier, Simon Woodman, Hugo Hiden, Paul Watson

We establish a translation between a formalism for dynamic programming over hypergraphs and the computation of semiring-based provenance for Datalog programs. The benefit of this translation is a new method for computing provenance for a…

Databases · Computer Science 2021-12-03 Yann Ramusat, Silviu Maniu, Pierre Senellart

Data provenance consists in bookkeeping meta information during query evaluation, in order to enrich query results with their trust level, likelihood, evaluation cost, and more. The framework of semiring provenance abstracts from the…

Databases · Computer Science 2022-05-09 Camille Bourgaux, Pierre Bourhis, Liat Peterfreund, Michael Thomazo

Provenance in scientific workflows is essential for understanding and reproducing processes, while in business processes, it can ensure compliance and correctness and facilitates process mining. However, the provenance of process…

Cryptography and Security · Computer Science 2025-10-08 Ludwig Stage, Mirela Riveni, Raimundas Matulevičius, Dimka Karastoyanova

Successful data-driven science requires complex data engineering pipelines to clean, transform, and alter data in preparation for machine learning, and robust results can only be achieved when each step in the pipeline can be justified, and…

Databases · Computer Science 2024-04-08 Adriane Chapman, Luca Lauro, Paolo Missier, Riccardo Torlone