Opportunities and Challenges in Running Scientific Workflows on the Cloud
- 1 October 2011
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 455-462
- https://doi.org/10.1109/cyberc.2011.80
Abstract
Cloud computing is gaining tremendous momentum in both academia and industry. However, its application has mostly focused on Web and business applications, while its potential to support large-scale workflows, especially data-intensive scientific workflows, is still largely overlooked. We coin the term "Cloud Workflow" to refer to the specification, execution, and provenance tracking of large-scale scientific workflows, as well as the management of data and computing resources to enable the execution of scientific workflows on the Cloud. In this paper, we analyze why there has been such a gap between the two technologies and what it means to bring Cloud and workflow together; we then present the key challenges in running Cloud workflows and discuss the research opportunities in realizing workflows on the Cloud.