跳到主要导航 跳到搜索 跳到主要内容

Retrospection on the Performance Analysis Tools for Large-Scale HPC Programs

  • Beihang University
  • CAS - Institute of Computing Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

As the performance gap between hardware and software widens, performance analysis tools are essential for understanding the behavior of large-scale High-Performance Computing (HPC) programs. These tools provide insights into the performance bottlenecks and help in optimizing the performance of the programs. In this paper, we present a comprehensive study of performance analysis tools for large-scale HPC systems including both sampling-based and instrumentation-based tools that are commonly adopted in the HPC community. We investigate the abundance and overheads of data collection as well as the analysis capabilities of HPCToolkit, TAU, and Scalasca with representative programs at scale. Our study shows that different performance analysis tools have distinct strengths and weaknesses, and the choice of a performance analysis tool depends on the specific requirements of the user. We also discuss the challenges and future directions in the field of performance analysis tools for large-scale HPC systems.

源语言英语
主期刊名Proceedings - 2024 IEEE 31st International Conference on High Performance Computing, Data, and Analytics, HiPC 2024
出版商Institute of Electrical and Electronics Engineers Inc.
34-44
页数11
ISBN(电子版)9798331509095
DOI
出版状态已出版 - 2024
活动31st Annual IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2024 - Bangalore, 印度
期限: 18 12月 202421 12月 2024

出版系列

姓名Proceedings - 2024 IEEE 31st International Conference on High Performance Computing, Data, and Analytics, HiPC 2024

会议

会议31st Annual IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2024
国家/地区印度
Bangalore
时期18/12/2421/12/24

指纹

探究 'Retrospection on the Performance Analysis Tools for Large-Scale HPC Programs' 的科研主题。它们共同构成独一无二的指纹。

引用此