Many-task computing for grids and supercomputers
Top Cited Papers
- 1 November 2008
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 21511683,p. 1-11
- https://doi.org/10.1109/mtags.2008.4777912
Abstract
Many-task computing aims to bridge the gap between two computing paradigms, high throughput computing and high performance computing. Many task computing differs from high throughput computing in the emphasis of using large number of computing resources over short periods of time to accomplish many computational tasks (i.e. including both dependent and independent tasks), where primary metrics are measured in seconds (e.g. FLOPS, tasks/sec, MB/s I/O rates), as opposed to operations (e.g. jobs) per month. Many task computing denotes high-performance computations comprising multiple distinct activities, coupled via file system operations. Tasks may be small or large, uniprocessor or multiprocessor, compute-intensive or data-intensive. The set of tasks may be static or dynamic, homogeneous or heterogeneous, loosely coupled or tightly coupled. The aggregate number of tasks, quantity of computing, and volumes of data may be extremely large. Many task computing includes loosely coupled applications that are generally communication-intensive but not naturally expressed using standard message passing interface commonly found in high performance computing, drawing attention to the many computations that are heterogeneous but not ldquohappilyrdquo parallel.Keywords
This publication has 28 references indexed in Scilit:
- All-pairs: An abstraction for data-intensive cloud computing2008 IEEE International Symposium on Parallel and Distributed Processing, 2008
- Project Kittyhawk: building a global-scale computerACM SIGOPS Operating Systems Review, 2008
- DryadPublished by Association for Computing Machinery (ACM) ,2007
- Development and validation of a modular, extensible docking program: DOCK 5Journal of Computer-Aided Molecular Design, 2006
- Distributed computing in practice: the Condor experienceConcurrency and Computation: Practice and Experience, 2005
- Interpreting the Data: Parallel Analysis with SawzallScientific Programming, 2005
- Pegasus: A Framework for Mapping Complex Scientific Workflows onto Distributed SystemsScientific Programming, 2005
- The Data Deluge: An e‐Science PerspectivePublished by Wiley ,2003
- Scripting: higher level programming for the 21st CenturyComputer, 1998
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990