Automatic capture and efficient storage of e‐Science experiment provenance
Open Access
- 8 August 2007
- journal article
- research article
- Published by Wiley in Concurrency and Computation: Practice and Experience
- Vol. 20 (5), 419-429
- https://doi.org/10.1002/cpe.1235
Abstract
For the first provenance challenge, we introduce a layered model to represent workflow provenance that allows navigation from an abstract model of the experiment to instance data collected during a specific experiment run. We outline modest extensions to a commercial workflow engine so it will automatically capture provenance at workflow runtime. We also present an approach to store this provenance data in a relational database. Finally, we demonstrate how core provenance queries in the challenge can be expressed in SQL and discuss the merits of our layered representation. Copyright © 2007 John Wiley & Sons, Ltd.
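The abstract does not give the schema or the queries, so the following is only a minimal sketch of the general idea: an abstract layer (workflow steps) linked to an instance layer (individual runs and the data they read and wrote), with a lineage question answered by a recursive SQL query. Every table, column, step, and file name here is a hypothetical chosen for illustration, and SQLite stands in for whatever relational database the authors used.

```python
import sqlite3

# Hypothetical two-layer schema: 'activity' is the abstract model,
# 'activity_run' and 'io_link' hold instance data from a specific run.
SCHEMA = """
CREATE TABLE activity (
    activity_id INTEGER PRIMARY KEY,
    name        TEXT NOT NULL
);
CREATE TABLE activity_run (
    run_id      INTEGER PRIMARY KEY,
    activity_id INTEGER NOT NULL REFERENCES activity(activity_id)
);
CREATE TABLE data_item (
    item_id INTEGER PRIMARY KEY,
    name    TEXT NOT NULL
);
CREATE TABLE io_link (
    run_id  INTEGER NOT NULL REFERENCES activity_run(run_id),
    item_id INTEGER NOT NULL REFERENCES data_item(item_id),
    role    TEXT NOT NULL CHECK (role IN ('input', 'output'))
);
"""

# Walk backwards from a data item: find the run that produced it,
# then the inputs of that run, and repeat until no producer is found.
LINEAGE_SQL = """
WITH RECURSIVE lineage(item_id) AS (
    SELECT item_id FROM data_item WHERE name = ?
    UNION
    SELECT u_in.item_id
    FROM lineage AS l
    JOIN io_link AS u_out ON u_out.item_id = l.item_id AND u_out.role = 'output'
    JOIN io_link AS u_in  ON u_in.run_id = u_out.run_id AND u_in.role = 'input'
)
SELECT a.name, d.name
FROM lineage AS l
JOIN io_link AS u      ON u.item_id = l.item_id AND u.role = 'output'
JOIN activity_run AS r ON r.run_id = u.run_id
JOIN activity AS a     ON a.activity_id = r.activity_id
JOIN data_item AS d    ON d.item_id = l.item_id
ORDER BY r.run_id;
"""


def build_demo_db():
    """Create an in-memory database holding one illustrative three-step run."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO activity VALUES (?, ?)",
                     [(1, "align"), (2, "average"), (3, "convert")])
    conn.executemany("INSERT INTO activity_run VALUES (?, ?)",
                     [(1, 1), (2, 2), (3, 3)])
    conn.executemany("INSERT INTO data_item VALUES (?, ?)",
                     [(1, "raw.img"), (2, "aligned.img"),
                      (3, "atlas.img"), (4, "atlas.gif")])
    conn.executemany("INSERT INTO io_link VALUES (?, ?, ?)",
                     [(1, 1, "input"), (1, 2, "output"),
                      (2, 2, "input"), (2, 3, "output"),
                      (3, 3, "input"), (3, 4, "output")])
    return conn


def lineage(conn, item_name):
    """Return (activity name, produced item) pairs that led to item_name."""
    return conn.execute(LINEAGE_SQL, (item_name,)).fetchall()


if __name__ == "__main__":
    for activity, item in lineage(build_demo_db(), "atlas.gif"):
        print(f"{activity} produced {item}")
```

Keeping the step definitions in their own table means a query can start from the abstract model (which step?) and join down to the runs and data items of one execution, which is the kind of navigation between layers the abstract describes.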