Copyright Notice:

The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Publications of SPCL

M. Ritter, A. Geiss, J. Wehrstein, A. Calotoiu, T. Reimann, T. Hoefler, F. Wolf:

 Noise-Resilient Empirical Performance Modeling with Deep Neural Networks

(In IPDPS '21: Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium (to appear), May 2021)

Abstract

Empirical performance modeling is a proven instrument to analyze the scaling behavior of HPC applications. Using a set of smaller-scale experiments, it can provide important insights into application behavior at larger scales. Extra-P is an empirical modeling tool that applies linear regression to automatically generate human-readable performance models. As with other regression-based modeling techniques, the accuracy of the models created by Extra-P decreases as the amount of noise in the underlying data increases. This is why the performance variability observed in many contemporary systems can become a serious challenge. In this paper, we introduce a novel adaptive modeling approach that makes Extra-P more noise resilient, exploiting the ability of deep neural networks to discover the effects of numerical parameters, such as the number of processes or the problem size, on performance when dealing with noisy measurements. Using synthetic analysis and data from three different case studies, we demonstrate that our solution improves model accuracy at high noise levels by up to 25% while increasing the models' predictive power by about 15%.
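
To illustrate the regression-based modeling that the abstract refers to, the following is a minimal sketch, not Extra-P's actual implementation. It assumes a single-parameter model of the form f(x) = c0 + c1 * x^i * log2(x)^j, fits the coefficients by ordinary least squares for each candidate exponent pair, and keeps the pair with the smallest residual sum of squares. The function names (`fit_term`, `best_model`) and the candidate exponent sets are hypothetical choices made for this example. The neural-network approach introduced in the paper is not shown.

```python
import math


def fit_term(xs, ys, i, j):
    """Least-squares fit of y ~ c0 + c1 * x**i * log2(x)**j (illustrative only)."""
    # Transform the parameter values into the single regression term.
    ts = [x**i * math.log2(x) ** j for x in xs]
    n = len(ts)
    mean_t = sum(ts) / n
    mean_y = sum(ys) / n
    # Closed-form simple linear regression on the transformed term.
    var_t = sum((t - mean_t) ** 2 for t in ts)
    cov_ty = sum((t - mean_t) * (y - mean_y) for t, y in zip(ts, ys))
    c1 = cov_ty / var_t
    c0 = mean_y - c1 * mean_t
    # Residual sum of squares, used to compare candidate terms.
    rss = sum((y - (c0 + c1 * t)) ** 2 for t, y in zip(ts, ys))
    return c0, c1, rss


def best_model(xs, ys, exponents=(0.5, 1, 2), log_exponents=(0, 1)):
    """Try every (i, j) candidate and return (rss, i, j, c0, c1) of the best fit."""
    best = None
    for i in exponents:
        for j in log_exponents:
            c0, c1, rss = fit_term(xs, ys, i, j)
            if best is None or rss < best[0]:
                best = (rss, i, j, c0, c1)
    return best


# Assumed example data: noise-free runtimes following 5 + 2 * p * log2(p).
processes = [2, 4, 8, 16, 32]
runtimes = [5 + 2 * p * math.log2(p) for p in processes]
rss, i, j, c0, c1 = best_model(processes, runtimes)
print(f"f(p) = {c0:.2f} + {c1:.2f} * p^{i} * log2(p)^{j}")
```

With noise-free measurements the search recovers the generating term. When the measurements are perturbed, several candidate terms can fit almost equally well, which is the kind of ambiguity that makes regression-based model selection sensitive to noise.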

BibTeX

@inproceedings{,
  author={Marcus Ritter and Alexander Geiss and Johannes Wehrstein and Alexandru Calotoiu and Thorsten Reimann and Torsten Hoefler and Felix Wolf},
  title={{Noise-Resilient Empirical Performance Modeling with Deep Neural Networks}},
  year={2021},
  month={05},
  booktitle={IPDPS '21: Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium (to appear)},
}