The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.
Publications of SPCL
|T. Hoefler and T. Schneider and A. Lumsdaine:|
|The Effect of Network Noise on Large-Scale Collective Communications|
(Parallel Processing Letters (PPL). Vol 19, Nr. 4, In , presented in , pages 573-593, World Scientific Publishing Company, ISSN: , ISBN: , Aug. 2009, )
AbstractThe effect of operating system (OS) noise on the performance of large-scale applications is a growing concern and ameliorating the influence of OS noise is a subject of active research. A related problem is that of network noise which arises from the shared use of the interconnection network by parallel processes of different allocations or other background processes. To characterize the effect of network noise on parallel applications we conducted a series of experiments with a specially crafted benchmark and simulations. Experimental results show a decrease in the communication performance of a parallel reduction operation by a factor of 2 on 246 nodes on an InfiniBand fat-tree and by several orders of magnitude on a BlueGene/P torus. Simulations show how influence of network noise grows with the system size. Although network noise is not as well-studied as OS noise, our results clearly show that it is an important factor that must be considered when running and analyzing large-scale applications.