跳到主要导航 跳到搜索 跳到主要内容

Performance optimization for sparse at Ax in parallel on multicore CPU

  • Yuan Tao
  • , Yangdong Deng
  • , Shuai Mu
  • , Zhenzhong Zhang
  • , Mingfa Zhu*
  • , Limin Xiao
  • , Li Ruan
  • *此作品的通讯作者
  • Beihang University
  • Jilin Normal University
  • Tsinghua University

科研成果: 期刊稿件文章同行评审

摘要

The sparse matrix operation, y ? y+AtAx, where A is a sparse matrix and x and y are dense vectors, is a widely used computing pattern in High Performance Computing (HPC) applications. The pattern poses challenge to efficient solutions because both a matrix and its transposed version are involved. An efficient sparse matrix format, Compressed Sparse Blocks (CSB), has been proposed to provide nearly the same performance for both Ax and Atx. We develop a multithreaded implementation for the CSB format and apply it to solve y?y+AtAx. Experiments show that our technique outperforms the Compressed Sparse Row (CSR) based solution in POSKI by up to 2.5 fold on over 70% of benchmarking matrices.

源语言英语
页(从-至)315-318
页数4
期刊IEICE Transactions on Information and Systems
E97-D
2
DOI
出版状态已出版 - 2014

指纹

探究 'Performance optimization for sparse at Ax in parallel on multicore CPU' 的科研主题。它们共同构成独一无二的指纹。

引用此