Discamus continentiam augere, luxuriam coercere
Publications of Torsten Hoefler
Copyright Notice:

The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.



2018

Peer-Reviewed Conference or Journal Articles

NIPS'18
[1] Tal Ben-Nun, Alice Shoshana Jakobovits, Torsten Hoefler:
 Neural Code Comprehension: A Learnable Representation of Code Semantics. In Advances in Neural Information Processing Systems 31, presented in Montreal, Canada, Curran Associates, Inc., Dec. 2018
NIPS'18
[2] Dan Alistarh, Torsten Hoefler, Mikael Johansson, Sarit Khirirat, Nikola Konstantinov, Cedric Renggli:
 The Convergence of Sparsified Gradient Methods. In Advances in Neural Information Processing Systems 31, presented in Montreal, Canada, Curran Associates, Inc., Dec. 2018
PACT'18
[3] M. Besta, D. Stanojevic, T. Zivic, J. Singh, M. Hoerold, T. Hoefler:
 Log(Graph): A Near-Optimal High-Performance Graph Representation. Presented in Limassol, Cyprus, ACM, Nov. 2018. Accepted at the 27th International Conference on Parallel Architectures and Compilation Techniques (PACT'18)
SC18
[4] Heng Lin, Xiaowei Zhu, Bowen Yu, Xiongchao Tang, Wei Xue, Wenguang Chen, Lufei Zhang, Torsten Hoefler, Xiaosong Ma, Xin Liu, Weimin Zheng, Jingfang Xu:
 ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC18) - Gordon Bell Award Finalist, presented in Dallas, TX, USA, ACM, Nov. 2018
CACM
[5] R. Gerstenberger, M. Besta, T. Hoefler:
 Enabling Highly-Scalable Remote Memory Access Programming with MPI-3 One Sided. In Communications of the ACM, ACM, Oct. 2018, Research Highlights
Cluster'18
[6] Y. Oyama, T. Ben-Nun, T. Hoefler, S. Matsuoka:
 Accelerating Deep Learning Frameworks with Micro-batches. Presented in Belfast, UK, IEEE, Sep. 2018. To appear in IEEE International Conference on Cluster Computing (Cluster'18)
Cluster'18
[7] Alexandru Calotoiu, Alexander Graf, Torsten Hoefler, Daniel Lorenz, Sebastian Rinke, Felix Wolf:
 Lightweight Requirements Engineering for Exascale Co-design. Presented in Belfast, UK, IEEE, Sep. 2018. To appear in IEEE International Conference on Cluster Computing (Cluster'18)
arXiv
[8] Maciej Besta, Torsten Hoefler:
 Survey and Taxonomy of Lossless Graph Compression and Space-Efficient Graph Representations. CoRR, Vol. abs/1806.01799, Jun. 2018
GMD
[9] O. Fuhrer, T. Chadha, T. Hoefler, G. Kwasniewski, X. Lapillonne, D. Leutwyler, D. Luethi, C. Osuna, C. Schaer, T. C. Schulthess, H. Vogt:
 Near-global climate simulation at 1 km resolution: establishing a performance baseline on 4888 GPUs with COSMO 5.0. Geoscientific Model Development, Vol. 11, No. 4, Copernicus Publications, May 2018
arXiv
[10] J. de Fine Licht, S. Meierhans, T. Hoefler:
 Transformations of High-Level Synthesis Codes for High-Performance Computing. CoRR, Vol. abs/1805.08288, May 2018
EuroSys'18
[11] K. Taranov, G. Alonso, T. Hoefler:
 Fast and strongly-consistent per-item resilience in key-value stores. ISBN: 978-1-4503-5584-1/18/04, Apr. 2018. EuroSys '18: Thirteenth EuroSys Conference 2018, April 23-26, 2018, Porto, Portugal (acceptance rate: 16% (43/262))
IEEE TPDS
[12] Shigang Li, Yunquan Zhang, Torsten Hoefler:
 Cache-Oblivious MPI All-to-All Communications Based on Morton Order. IEEE Transactions on Parallel and Distributed Systems (TPDS), Vol. 29, No. 3, IEEE, Mar. 2018
ASPLOS'18
[13] M. Besta, S. M. Hassan, S. Yalamanchili, R. Ausavarungnirun, O. Mutlu, T. Hoefler:
 Slim NoC: A Low-Diameter On-Chip Network Topology for High Energy Efficiency and Scalability. Mar. 2018. Accepted at the 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'18)
PPoPP'18
[14] Lukas Gianinazzi, Pavel Kalvoda, Alessandro De Palma, Maciej Besta, Torsten Hoefler:
 Communication-Avoiding Parallel Minimum Cuts and Connected Components. Feb. 2018. Accepted at The ACM Conference Principles and Practice of Parallel Programming 2018 (PPoPP'18) (acceptance rate: 20% (28/138))
PPoPP'18
[15] J. de Fine Licht, M. Blott, T. Hoefler:
 Designing scalable FPGA architectures using high-level synthesis. In Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, presented in Vienna, Austria, pages 403-404, ACM, ISBN: 978-1-4503-4982-6, Feb. 2018
arXiv
[16] T. Ben-Nun, T. Hoefler:
 Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis. CoRR, Vol. abs/1802.09941, Feb. 2018
VMCAI
[17] Cedric Baumann, Andrei Marian Dan, Yuri Meshman, Torsten Hoefler, Martin Vechev:
 Automatic Verification of RMA Programs via Abstraction Extrapolation. Springer International Publishing, Feb. 2018
ICDE'18
[18] Ingo Mueller, Andrea Arteaga, Torsten Hoefler, Gustavo Alonso:
 Reproducible Floating-Point Aggregation in RDBMSs. Feb. 2018. In Proceedings of the 2018 IEEE 34th International Conference on Data Engineering

Invited Talks and Presentations

IPAM UCLA
[19] T. Hoefler:
 Twelve ways to fool the masses when reporting performance of deep learning workloads (Presentation). Presented in Los Angeles, CA, Nov. 2018. Workshop III: HPC for Computationally and Data-Intensive Problems
IPAM UCLA
[20] T. Hoefler:
 High-Performance Communication for Machine Learning (Presentation). Presented in Los Angeles, CA, Nov. 2018. Workshop III: HPC for Computationally and Data-Intensive Problems
SC18
[21] T. Hoefler:
 High Level Programming Languages for Quantum Computation (Presentation). Presented in Dallas, TX, USA, Nov. 2018
SC18
[22] T. Hoefler:
 RDMA, Scalable MPI-3 RMA, and Next-Generation Post-RDMA Interconnects (Presentation). Presented in Dallas, TX, USA, Nov. 2018
SC18
[23] T. Hoefler:
 Will FPGAs make it this time? (Presentation). Presented in Dallas, TX, USA, Nov. 2018
SC18
[24] T. Hoefler:
  (Presentation). Presented in Dallas, TX, USA, Nov. 2018
FacSum
[25] T. Hoefler:
 An HPC System's guy's view of Quantum Computing (Presentation). Presented in Redmond, WA, Aug. 2018. Presentation at the Microsoft Faculty Summit 2018
Tsinghua
[26] T. Hoefler:
 Performance Modeling for Future Computing Technologies (Presentation). Jun. 2018. Invited talk at 60 years of CS @ Tsinghua celebration
HPCAC
[27] T. Hoefler:
 Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis (Presentation). Apr. 2018. Keynote at Swiss HPC Advisory Council Conference 2018
SOS'18
[28] T. Hoefler:
 Performance Portability - An Oxymoron? (Presentation). Presented in Kona, HI, USA, Mar. 2018. Invited talk at SOS'18 Workshop
Multicore @ Siemens
[29] T. Hoefler:
 Developing high-performance software, from modeling to programming (Presentation). Presented in Nuremberg, Germany, Feb. 2018. Invited opening presentation at the Multicore@Siemens conference
HiPINEB @ HPCA'18
[30] T. Hoefler:
 The three L's in modern high-performance networking: low latency, low cost, low processing load (Presentation). Presented in Vienna, Austria, Feb. 2018. Keynote at the HiPINEB workshop at HPCA'18

© Torsten Hoefler