Abstract
A general management framework for distributed heterogeneous supercomputing systems (DHSSs), based on an application-characterization technique, is presented. The technique uses code profiling and analytical benchmarking of supercomputers. Optimal scheduling of tasks in these systems is an NP-complete problem. The use of network caching to reduce the complexity associated with the scheduling decisions is discussed. An experimental prototype of a DHSS management system is described.