Copyright Notice:

The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Publications of SPCL

S. Di Girolamo, P. Jolivet, K. D. Underwood, T. Hoefler:

 Exploiting Offload Enabled Network Interfaces

(IEEE MICRO. Vol 36, Nr. 4, IEEE, Jul. 2016)

Abstract

Network interface cards are a key component in achieving efficient parallel performance. Over time, they have gained new functionality, such as lossless transmission and remote direct memory access, that is now ubiquitous in high-performance systems. Prototypes of next-generation network cards now offer new features that facilitate device programming. In this article, the authors discuss an abstract machine model for offloading architectures. They use the Portals 4 network interface to implement the proposed abstraction model, and they present two microbenchmarks to show the effects of fully offloaded collective communications. They then propose the concept of persistent offloaded operations, which can reduce the creation and offloading overhead, and they discuss a possible extension to the current Portals 4 interface to support such operations. The results show how this work can be used to accelerate existing MPI applications.

BibTeX

@article{digirolamo2016exploiting,
  author={Salvatore Di Girolamo and P. Jolivet and K. D. Underwood and Torsten Hoefler},
  title={{Exploiting Offload Enabled Network Interfaces}},
  journal={IEEE MICRO},
  year={2016},
  month={7},
  volume={36},
  number={4},
  publisher={IEEE},
}