XtremWeb & Condor : sharing resources between Internet connected Condor pool
- 1 January 2003
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Grid computing presents two major challenges for deploying large scale applications across wide area networks gathering volunteers PC and clusters/parallel computers as computational resources: security and fault tolerance. This paper presents a lightweight Grid solution for the deployment of multi-parameters applications on a set of clusters protected by firewalls. The system uses a hierarchical design based on Condor for managing each cluster locally and XtremWeb for enabling resource sharing among the clusters. We discuss the security and fault tolerance mechanisms used for this design and demonstrate the usefulness of the approach measuring the performances of a multi-parameters bio-chemistry application deployed on two sites: University of Wisconsin/Madison and Paris South University. This experiment shows that we can efficiently and safely harness the computational power of about 200 PC distributed on two geographic sites.Keywords
This publication has 4 references indexed in Scilit:
- A fault detection service for wide area distributed computationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Nimrod: a tool for performing parametrised simulations using distributed workstationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- High performance parametric modeling with Nimrod/G: killer application for the global grid?Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Globus: a Metacomputing Infrastructure ToolkitThe International Journal of Supercomputer Applications and High Performance Computing, 1997