Automatic capture and efficient storage of e‐Science experiment provenance

Open Access

8 August 2007

journal article
research article
Published by Wiley in Concurrency and Computation: Practice and Experience

Vol. 20 (5), 419-429
https://doi.org/10.1002/cpe.1235

Abstract

For the first provenance challenge, we introduce a layered model to represent workflow provenance that allows navigation from an abstract model of the experiment to instance data collected during a specific experiment run. We outline modest extensions to a commercial workflow engine so it will automatically capture provenance at workflow runtime. We also present an approach to store this provenance data in a relational database. Finally, we demonstrate how core provenance queries in the challenge can be expressed in SQL and discuss the merits of our layered representation.

Copyright © 2007 John Wiley & Sons, Ltd.
