Skip to main navigation Skip to search Skip to main content

Performance optimization for sparse at Ax in parallel on multicore CPU

  • Yuan Tao
  • , Yangdong Deng
  • , Shuai Mu
  • , Zhenzhong Zhang
  • , Mingfa Zhu*
  • , Limin Xiao
  • , Li Ruan
  • *Corresponding author for this work
  • Beihang University
  • Jilin Normal University
  • Tsinghua University

Research output: Contribution to journalArticlepeer-review

Abstract

The sparse matrix operation, y ? y+AtAx, where A is a sparse matrix and x and y are dense vectors, is a widely used computing pattern in High Performance Computing (HPC) applications. The pattern poses challenge to efficient solutions because both a matrix and its transposed version are involved. An efficient sparse matrix format, Compressed Sparse Blocks (CSB), has been proposed to provide nearly the same performance for both Ax and Atx. We develop a multithreaded implementation for the CSB format and apply it to solve y?y+AtAx. Experiments show that our technique outperforms the Compressed Sparse Row (CSR) based solution in POSKI by up to 2.5 fold on over 70% of benchmarking matrices.

Original languageEnglish
Pages (from-to)315-318
Number of pages4
JournalIEICE Transactions on Information and Systems
VolumeE97-D
Issue number2
DOIs
StatePublished - 2014

Keywords

  • Compressed sparse block
  • Compressed sparse rows
  • Multicore platform
  • Sparse AtAx

Fingerprint

Dive into the research topics of 'Performance optimization for sparse at Ax in parallel on multicore CPU'. Together they form a unique fingerprint.

Cite this