Abstract
Performance analysis is essential for effective performance optimization in high-performance computing. However, collecting performance traces of parallel programs often incurs significant overhead, because many performance metrics, such as function timestamps and hardware counters, must be traced. This overhead prevents performance analysis tools from being applied in practice to large-scale parallel programs. To mitigate it, existing tools typically pre-execute the program to identify hotspot functions and then collect traces only for those functions, which reduces the tracing overhead. For large-scale performance analysis, however, this approach still imposes substantial and unacceptable execution overhead, because it requires an additional profiling run to obtain the list of hotspot functions. To address this drawback, we propose Hotspy, a performance analysis tool that identifies hotspot functions using graph neural networks. Hotspy performs static analysis at the LLVM intermediate representation level and predicts potential hotspot functions for instrumentation, without program pre-execution. Experimental results show that Hotspy significantly reduces the overhead of hotspot function identification while achieving notable prediction accuracy.
| Original language | English |
|---|---|
| Pages (from-to) | 260-274 |
| Number of pages | 15 |
| Journal | CCF Transactions on High Performance Computing |
| Volume | 7 |
| Issue number | 3 |
| State | Published - Jun 2025 |
Keywords
- Graph neural networks
- Hotspot functions
- Performance analysis
- Static analysis
Title
Hotspy: identifying performance hotspot with graph neural network based static analysis