Home Publications edited volumes Awards Research Teaching Miscellaneous Full CV [pdf] BLOG
Events
Past Events
|
Publications of Torsten Hoefler
Torsten Hoefler, Timo Schneider and Andrew Lumsdaine:
| | Accurately Measuring Collective Operations at Massive Scale
(In Proceedings of the 22nd IEEE International Parallel & Distributed Processing Symposium, PMEO'08 Workshop, presented in Miami, FL, ISSN: 1530-2075, ISBN: 978-1-4244-1694-3, Apr. 2008) Invited to a journal special issue on top picks from PMEO'08.
Abstract Accurate, reproducible and comparable measurement
of collective operations is a complicated task. Although
Different measurement schemes are implemented in well-known benchmarks, many of these schemes introduce different systematic errors in their measurements. We characterize these errors and select a window-based approach as the
most accurate method. However, this approach complicates
measurements significantly and introduces a clock synchronization as a new source of systematic errors. We analyze
approaches to avoid or correct those errors and develop a
scalable synchronization scheme to conduct benchmarks on
massively parallel systems. Our results are compared to the
window-based scheme implemented in the SKaMPI benchmarks and show a reduction of the synchronization overhead by a factor of 16 on 128 processes.
Documentsdownload article: download slides: | | BibTeX | @inproceedings{hoefler-pmeo08, author={Torsten Hoefler and Timo Schneider and Andrew Lumsdaine}, title={{Accurately Measuring Collective Operations at Massive Scale}}, year={2008}, month={Apr.}, booktitle={Proceedings of the 22nd IEEE International Parallel \& Distributed Processing Symposium, PMEO'08 Workshop}, location={Miami, FL}, issn={1530-2075}, isbn={978-1-4244-1694-3}, source={http://www.unixer.de/~htor/publications/}, } |
|
|