The design and implementation of zero copy MPI using commodity hardware with a high performance network

Abstract
This paper presents the design of an implementation of the MPI message passing interface that uses a zero-copy message transfer primitive, supported by a lower communication layer, to realize a high-performance communication library. The zero-copy message transfer primitive requires a memory area pinned down to physical memory, which is a limited resource under a paging memory system. Uncontrolled allocation of pinned-down memory by multiple simultaneous send and receive requests can cause deadlock. To avoid this deadlock, we have introduced: i) separate control of the send and receive pin-down memory areas, to ensure that at least one send and one receive may be processed concurrently, and ii) delayed queues to handle postponed message passing operations whose memory areas could not be pinned down.
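The two mechanisms named in the abstract can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions, not the paper's implementation: the names `pin_pool`, `pool_request`, and `pool_release` are hypothetical, no memory is actually pinned (a real library would call into the lower communication layer or the operating system at the marked points), and the FIFO ordering of the delayed queue is an assumption the abstract does not confirm.

```c
#include <assert.h>
#include <stddef.h>

#define QUEUE_CAP 16  /* hypothetical bound on postponed operations */

/* One pin-down budget per direction. Keeping a separate pool for sends
 * and for receives means send pressure cannot consume the memory that
 * receives need, and vice versa. */
typedef struct {
    size_t limit;               /* max bytes this direction may pin     */
    size_t pinned;              /* bytes currently accounted as pinned  */
    size_t delayed[QUEUE_CAP];  /* ring buffer of postponed sizes       */
    size_t head, count;         /* ring buffer read index and length    */
} pin_pool;

typedef enum { OP_STARTED, OP_DELAYED, OP_REJECTED } op_status;

void pool_init(pin_pool *p, size_t limit) {
    p->limit = limit;
    p->pinned = 0;
    p->head = 0;
    p->count = 0;
}

/* Request a pinned area of `bytes`. The operation starts immediately if
 * it fits and nothing is already waiting; otherwise it is postponed.
 * Requests larger than the whole budget, or arriving when the queue is
 * full, are rejected in this sketch. */
op_status pool_request(pin_pool *p, size_t bytes) {
    if (bytes > p->limit)
        return OP_REJECTED;
    if (p->count == 0 && p->pinned + bytes <= p->limit) {
        p->pinned += bytes;     /* a real library would pin here */
        return OP_STARTED;
    }
    if (p->count == QUEUE_CAP)
        return OP_REJECTED;
    p->delayed[(p->head + p->count) % QUEUE_CAP] = bytes;
    p->count++;
    return OP_DELAYED;
}

/* Release `bytes` after a transfer completes, then start postponed
 * operations in FIFO order for as long as the head of the queue fits.
 * Returns the number of postponed operations that were started. */
size_t pool_release(pin_pool *p, size_t bytes) {
    assert(bytes <= p->pinned);
    p->pinned -= bytes;         /* a real library would unpin here */
    size_t started = 0;
    while (p->count > 0 && p->pinned + p->delayed[p->head] <= p->limit) {
        p->pinned += p->delayed[p->head];
        p->head = (p->head + 1) % QUEUE_CAP;
        p->count--;
        started++;
    }
    return started;
}
```

A caller would create one pool for sends and one for receives, call `pool_request` before each transfer, and call `pool_release` on completion. Because the budgets are independent, a send that has to wait never prevents a receive from pinning its buffer, which is the property the abstract relies on to avoid deadlock.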
