Virtual machine aware communication libraries for high performance computing
- 10 November 2007
- proceedings article
- Published by Association for Computing Machinery (ACM)
Abstract
As the size and complexity of modern computing systems keep increasing to meet the demanding requirements of High Performance Computing (HPC) applications, manageability is becoming a critical concern to achieve both high performance and high productivity computing. Meanwhile, virtual machine (VM) technologies have become popular in both industry and academia due to various features designed to ease system management and administration. While a VM-based environment can greatly help manageability on large-scale computing systems, concerns over performance have largely blocked the HPC community from embracing VM technologies. In this paper, we follow three steps to demonstrate the ability to achieve near-native performance in a VM-based environment for HPC. First, we propose Inter-VM Communication (IVC), a VM-aware communication library to support efficient shared memory communication among computing processes on the same physical host, even though they may be in different VMs. This is critical for multi-core systems, especially when individual computing processes are hosted on different VMs to achieve fine-grained control. Second, we design a VM-aware MPI library based on MVAPICH2 (a popular MPI library), called MVAPICH2-ivc, which allows HPC MPI applications to transparently benefit from IVC. Finally, we evaluate MVAPICH2-ivc on clusters featuring multi-core systems and high performance InfiniBand interconnects. Our evaluation demonstrates that MVAPICH2-ivc can improve NAS Parallel Benchmark performance by up to 11% in a VM-based environment on eight-core Intel Clovertown systems, where each compute process is in a separate VM. A detailed performance evaluation for up to 128 processes (64-node dual-socket, single-core systems) shows only a marginal performance overhead of MVAPICH2-ivc as compared with MVAPICH2 running in a native environment.
This study indicates that performance should no longer be a barrier preventing HPC environments from taking advantage of the various features available through VM technologies.
Funding Information
- Division of Computing and Communication Foundations (CCF-0702675)
- U.S. Department of Energy (DE-FC02-06ER25749, DE-FC02-06ER25755)