跳到主要导航 跳到搜索 跳到主要内容

SimTrace: Exploiting Spatial and Temporal Sampling for Large-Scale Performance Analysis

  • Beihang University

科研成果: 期刊稿件文章同行评审

摘要

MPI tracing tools is essential to collect the communication events and performance metrics of large-scale programs for further performance analysis and optimization. However, toward the exascale era, the performance and storage overhead for tracing becomes extremely prohibitive that significantly disturbs the original execution of MPI programs, leading to distorted tracing data and thus mislead analysis results. Although process sampling can effectively reduce the tracing overhead, it can easily miss important execution information that is necessary for subsequent performance analysis. In this article, we propose SimTrace, a scalable MPI tracing tool with novel spatial and temporal sampling strategies that exploits the similarity among MPI processes to achieve both low tracing overhead as well as obtain sufficient tracing information. The experimental results demonstrate that SimTrace can significantly reduce the MPI tracing overhead compared to the state-of-the-art tracing tools, meanwhile enabling effective analysis to guide performance optimization of large-scale programs.

源语言英语
文章编号55
期刊ACM Transactions on Architecture and Code Optimization
22
2
DOI
出版状态已出版 - 30 6月 2025

指纹

探究 'SimTrace: Exploiting Spatial and Temporal Sampling for Large-Scale Performance Analysis' 的科研主题。它们共同构成独一无二的指纹。

引用此