TY - GEN
T1 - OVERT
T2 - 54th International Conference on Parallel Processing, ICPP 2025
AU - Lei, Kelun
AU - Yang, Hailong
AU - Zhang, Kaige
AU - Du, Shaokang
AU - Casas, Marc
AU - Xu, Yufan
AU - Luan, Zhongzhi
AU - Liu, Yi
AU - Qian, Depei
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/12/20
Y1 - 2025/12/20
N2 - Sparse Matrix-Vector Multiplication (SpMV) is a key operation in many applications, and optimizing its performance is crucial for achieving high computational efficiency. Existing efforts have optimized SpMV performance on CPUs with corresponding sparse matrix formats adopted. However, the performance of existing SpMV implementations primarily focuses on maximizing hardware's vector unit usage, neglecting the potential for exploiting idle scalar units simultaneously. To address such limitation, we propose OVERT, a new storage format of sparse matrix designed to exploit both vector and scalar execution units on modern CPUs for accelerating SpMV performance. OVERT, containing two format variants (OVERT-S and OVERT-E), outperforms existing formats by partitioning the matrix into multiple data panels, which can efficiently utilize vector and scalar units. Moreover, we propose an effective format selection model that dynamically chooses the optimal format variant from OVERT according to the characteristics of the input matrix. Experimental results on SuiteSparse show that OVERT achieves an average speedup of 3.91 × against Intel MKL on X86 CPU and an average speedup of 1.24 × against ArmPL on ARM CPU.
AB - Sparse Matrix-Vector Multiplication (SpMV) is a key operation in many applications, and optimizing its performance is crucial for achieving high computational efficiency. Existing efforts have optimized SpMV performance on CPUs with corresponding sparse matrix formats adopted. However, the performance of existing SpMV implementations primarily focuses on maximizing hardware's vector unit usage, neglecting the potential for exploiting idle scalar units simultaneously. To address such limitation, we propose OVERT, a new storage format of sparse matrix designed to exploit both vector and scalar execution units on modern CPUs for accelerating SpMV performance. OVERT, containing two format variants (OVERT-S and OVERT-E), outperforms existing formats by partitioning the matrix into multiple data panels, which can efficiently utilize vector and scalar units. Moreover, we propose an effective format selection model that dynamically chooses the optimal format variant from OVERT according to the characteristics of the input matrix. Experimental results on SuiteSparse show that OVERT achieves an average speedup of 3.91 × against Intel MKL on X86 CPU and an average speedup of 1.24 × against ArmPL on ARM CPU.
KW - SIMD
KW - SpMV
KW - Vector-Scalar Execution
UR - https://www.scopus.com/pages/publications/105026457308
U2 - 10.1145/3754598.3754632
DO - 10.1145/3754598.3754632
M3 - 会议稿件
AN - SCOPUS:105026457308
T3 - 54th International Conference on Parallel Processing, ICPP 2025 - Main Conference Proceedings
SP - 564
EP - 574
BT - 54th International Conference on Parallel Processing, ICPP 2025 - Main Conference Proceedings
PB - Association for Computing Machinery, Inc
Y2 - 8 September 2025 through 11 September 2025
ER -