Copyright Notice:

The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Publications of SPCL

Selection by year

2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007

Peer-Reviewed Conference or Journal Articles

ML4PS'19
[1] P. Grönquist, T. Ben-Nun, N. Dryden, P. Dueben, L. Lavarini, S. Li, T. Hoefler:
 Predicting Weather Uncertainty with Deep Convnets In Machine Learning and the Physical Sciences Workshop at the 33rd Conference on Neural Information Processing Systems (NeurIPS), presented in Vancouver, BC, Canada, Dec. 2019,
BAMS
[2] C. Schär, O. Fuhrer, A. Arteaga, N. Ban, C. Charpilloz, S. Di Girolamo, L. Hentgen, T. Hoefler, X. Lapillonne, D. Leutwyler, K. Osterried, D. Panosetti, S. Rüdisühli, L. Schlemmer, T. Schulthess, M. Sprenger, S. Ubbiali, H. Wernli:
 Kilometer-scale climate models: Prospects and challenges Bulletin of the American Meteorological Society. Vol 100, Nr. 12, American Meteorological Society, Dec. 2019, Early Online Release
SC19
[3] C. Renggli, D. Alistarh, M. Aghagolzadeh, T. Hoefler:
 SparCML: High-Performance Sparse Communication for Machine Learning In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344))
SC19
[4] T. De Matteis, J. de Fine Licht, J. Beránek, T. Hoefler:
 Streaming Message Interface: High-Performance DistributedMemory Programming on Reconfigurable Hardware In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344))
SC19
[5] A. Nikolaos Ziogas, T. Ben-Nun, G. Indalecio Fernández, T. Schneider, M. Luisier, T. Hoefler:
 Optimizing the Data Movement in Quantum Transport Simulations via Data-Centric Parallel Programming In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344))
SC19
[6] A. Nikolaos Ziogas, T. Ben-Nun, G. Indalecio Fernández, T. Schneider, M. Luisier, T. Hoefler:
 A Data-Centric Approach to Extreme-Scale Ab initio Dissipative Quantum Transport Simulations In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, Won ACM Gordon Bell Prize
SC19
[7] S. Di Girolamo, K. Taranov, A. Kurth, M. Schaffner, T. Schneider, J. Beránek, M. Besta, L. Benini, D. Roweth, T. Hoefler:
 Network-Accelerated Non-Contiguous Memory Transfers In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344))
SC19
[8] D. De Sensi, S. Di Girolamo, T. Hoefler:
 Mitigating Network Noise on Dragonfly Networks through Application-Aware Routing In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344))
SC19
[9] T. Ben-Nun, J. de Fine Licht, A. Nikolaos Ziogas, T. Schneider, T. Hoefler:
 Stateful Dataflow Multigraphs: A Data-Centric Model for Performance Portability on Heterogeneous Architectures In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344))
SC19
[10] M. Besta, S. Weber, L. Gianinazzi, R. Gerstenberger, A. Ivanov, Y. Oltchik, T. Hoefler:
 Slim Graph: Practical Lossy Graph Compression for Approximate Graph Processing, Storage, and Analytics In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344)) Best Paper Finalist, Best Student Paper Finalist
SC19
[11] G. Kwasniewski, M. Kabić, M. Besta, J. VandeVondele, R. Solcà, T. Hoefler:
 Red-Blue Pebbling Revisited: Near Optimal Parallel Matrix-Matrix Multiplication In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019, (acceptance rate: 22.7% (78/344)) Best Paper Finalist, SC19 Best Student Paper (1/87)
arXiv
[12] M. Besta, E. Peter, R. Gerstenberger, M. Fischer, M. Podstawski, C. Barthels, G. Alonso, T. Hoefler:
 Demystifying Graph Databases: Analysis and Taxonomy of Data Organization, System Designs, and Graph Queries CoRR. Vol abs/1910.09017, Oct. 2019,
PACT'19
[13] T. Gysi, T. Grosser, T. Hoefler:
 Absinthe: Learning an Analytical Performance Model to Fuse and Tile Stencil Codes in One Shot In Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques (PACT), presented in Seattle, WA, USA, IEEE, Sep. 2019,
ACM CSUR
[14] T. Ben-Nun, T. Hoefler:
 Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis ACM Comput. Surv.. Vol 52, Nr. 4, pages 65:1--65:43, ACM, ISSN: 0360-0300, Aug. 2019,
IEEE TPDS
[15] S. Shudler, Y. Berens, A. Calotoiu, T. Hoefler, A. Strube, F. Wolf:
 Engineering Algorithms for Scalability through Continuous Validation of Performance Expectations IEEE Transactions on Parallel and Distributed Systems (TPDS). Vol 30, Nr. 8, IEEE, Jul. 2019,
arXiv
[16] T. De Matteis, J. de Fine Licht, T. Hoefler:
 FBLAS: Streaming Linear Algebra on FPGA CoRR. Vol abs/1907.07929, Jul. 2019,
PASC'19
[17] F. Thaler, S. Moosbrugger, C. Osuna, M. Bianco, H. Vogt, A. Afanasyev, L. Mosimann, O. Fuhrer, T. Schulthess, T. Hoefler:
 Porting the COSMO Weather Model to Intel KNL presented in Zurich, Switzerland, ACM, Jun. 2019, Accepted at the ACM Platform for Advanced Scientific Computing Conference (PASC19)
DAC'19
[18] N. Gleinig, F. Ann Hubis, T. Hoefler:
 Embedding Functions Into Reversible Circuits: A Probabilistic Approach to the Number of Lines In Proceedings of the 56th Annual Design Automation Conference, presented in Las Vegas, NV, USA, ACM, ISBN: 978-1-4503-6725-7/19/06, Jun. 2019,
PLDI'19
[19] T. Gysi, T. Grosser, L. Brandner, T. Hoefler:
 A Fast Analytical Model of Fully Associative Caches In Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, presented in Phoenix, AZ, USA, pages 816--829, ACM, ISBN: 978-1-4503-6712-7, Jun. 2019,
ICS'19
[20] P. R. Eller, T. Hoefler, W. Gropp:
 Using Performance Models to Understand Scalable Krylov Solver Performance at Scale for Structured Grid Problems In Proceedings of the 2019 ACM International Conference on Supercomputing (ICS'19), presented in Phoenix, AZ, ACM, Jun. 2019,
IPDPS'19
[21] S. Di Girolamo, P. Schmid, T. Schulthess, T. Hoefler:
 SimFS: A Simulation Data Virtualizing File System Interface In Proceedings of the 33st IEEE International Parallel & Distributed Processing Symposium (IPDPS'19), presented in Rio de Janeiro, Brazil, IEEE, May 2019,
IPDPS'19
[22] T. Ben-Nun, M. Besta, S. Huber, A. Nikolaos Ziogas, D. Peter, T. Hoefler:
 A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep Learning IEEE, May 2019, Accepted at the 33rd IEEE International Parallel & Distributed Processing Symposium (IPDPS'19)
arXiv
[23] M. Besta, M. Schneider, K. Cynk, M. Konieczny, E. Henriksson, S. Di Girolamo, A. Singla, T. Hoefler:
 FatPaths: Routing in Supercomputers, Data Centers, and Clouds with Low-Diameter Networks when Shortest Paths Fall Short CoRR. Vol abs/1906.10885, May 2019,
PPoPP'19
[24] M. Kuettler, M. Planeta, J. Bierbaum, C. Weinhold, H. Haertig, A. Barak, T. Hoefler:
 Corrected Trees for Reliable Group Communication Feb. 2019, Accepted at The ACM Conference Principles and Practice of Parallel Programming 2019 (PPoPP'19) (acceptance rate: 19% (29/152))
FPGA'19
[25] M. Besta, M. Fischer, T. Ben-Nun, J. de Fine Licht, T. Hoefler:
 Substream-Centric Maximum Matchings on FPGA Feb. 2019, In Proceedings of the 27th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (acceptance rate: 23%) Best Paper Finalist (4/30)
arXiv
[26] M. Besta, D. Stanojevic, J. de Fine Licht, T. Ben-Nun, T. Hoefler:
 Graph Processing on FPGAs: Taxonomy, Survey, Challenges CoRR. Vol abs/1903.06697, Feb. 2019,
CiSE
[27] T. Schulthess, P. Bauer, O. Fuhrer, T. Hoefler, C. Schaer, N. Wedi:
 Reflecting on the goal and baseline for exascale computing: a roadmap based on weather and climate simulations Computing in Science and Engineering (CiSE). Vol 21, Nr. 1, IEEE Computer Society, ISSN: 1521-9615, Jan. 2019,

Invited Talks and Presentations

MLHPC
[28] T. Hoefler:
 HPC for ML and ML for HPC - Scalability, Communication, and Programming (Presentation) presented in Denver, CO, USA, Nov. 2019, Keynote talk at the International Machine Learning in High-Performance Computing (MLHPC'19 in conjunction with ACM/IEEE Supercomputing, SC19)
PARCO
[29] T. Hoefler:
 Data-Centric Parallel Programming (Presentation) presented in Prague, Czech Republic, Sep. 2019, Keynote talk at the The 18th International Parallel Computing conference (ParCo'19)
PPAM
[30] T. Hoefler:
 High-Performance Communication in Machine Learning (Presentation) presented in Bialystok, Poland, Sep. 2019, Keynote talk at the 13th International Conference on Parallel Processing and Applied Mathematics (PPAM'19)
ISC'19
[31] T. Hoefler, A. Nikolaos Ziogas, T. Ben-Nun, G. Indalecio Fernández, T. Schneider, M. Luisier, J. de Fine Licht:
 Data-Centric Parallel Programming (Presentation) presented in Frankfurt, Germany, Jun. 2019, invited talk at the International Conference on Supercomputing (ISC'19)
GG500
[32] T. Hoefler:
 The Green Graph500 List (June 2019) (Presentation) presented in Frankfurt, Germany, Jun. 2019, Presented at the Green Graph 500 BoF at the International Conference on Supercomputing (ISC'19)
ISC'19 ML
[33] T. Hoefler, T. Ben-Nun:
 Optimizing and Benchmarking Large-Scale Deep Learning (Presentation) presented in Frankfurt, Germany, Jun. 2019, Invited talk at the Machine Learning day at the International Conference on Supercomputing (ISC'19)
NRE'19
[34] T. Hoefler:
 Performance Reproducibility in HPC and Deep Learning (Presentation) presented in Frankfurt, Germany, Jun. 2019, Keynote talk at the Numerical Reproducibility at Exascale Workshop (NRE2019), ISC’19
AsHES
[35] T. Hoefler:
 Performance Portability with Data-Centric Parallel Programming (Presentation) presented in Rio de Janeiro, Brasil, May 2019, Keynote talk at the The Ninth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES) (delayed online)
HPCAC
[36] T. Hoefler:
 RDMA, Scalable MPI-3 RMA, and Next-Generation Post-RDMA Interconnects (Presentation) Apr. 2019, Best talk award winner at Swiss HPC Advisory Council Conference 2019
EMiT
[37] T. Hoefler:
 High-Performance Communication for Machine Learning (Presentation) presented in Huddersfield, UK, Apr. 2019, Keynote talk at the 5th Conference on Emerging Technologies – EMiT2019
SCFE'19
[38] T. Hoefler:
 Extreme-Scale Graphs (Presentation) presented in Warsaw, Poland, Mar. 2019, Invited talk at Supercomputing Frontiers Europe 2019
AHPC'19
[39] T. Hoefler:
 High-Performance Communication in Machine Learning (Presentation) presented in Grundlsee, Austria, Feb. 2019, Keynote at the Austrian HPC meeting 2019
ICL
[40] T. Hoefler:
 High-Performance Communication in Machine Learning (Presentation) presented in Knowville, TN, Feb. 2019,
RWTH Aachen
[41] T. Hoefler:
 High-Performance Communication for Machine Learning (Presentation) presented in Aachen, Germany, Jan. 2019,
RWTH Aachen
[42] T. Hoefler:
 MPI Remote Memory Access Programming and Scientific Benchmarking of Parallel Codes (Presentation) presented in Aachen, Germany, Jan. 2019,
TU Darmstadt
[43] T. Hoefler:
 An HPC Systems Guy’s View of Quantum Computing (Presentation) presented in Darstadt, Germany, Jan. 2019,

Other Publications or Technical Reports

H2RC'19
[44] J. de Fine Licht, T. Hoefler:
 hlslib: Software Engineering for Hardware Design In Fifth International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC'19), presented in Denver, CO, United States, IEEE, Nov. 2019,
MB3
[45] A. Nigay, T. Schneider, T. Hoefler:
 TinyMPI tasking prototype Feb. 2019,