跳到主要导航 跳到搜索 跳到主要内容

S2Engine: A Novel Systolic Architecture for Sparse Convolutional Neural Networks

科研成果: 期刊稿件文章同行评审

摘要

Convolutional neural networks (CNNs) have achieved great success in performing cognitive tasks. However, execution of CNNs requires a large amount of computing resources and generates heavy memory traffic, which imposes a severe challenge on computing system design. Through optimizing parallel executions and data reuse in convolution, systolic architecture demonstrates great advantages in accelerating CNN computations. However, regular internal data transmission path in traditional systolic architecture prevents the systolic architecture from completely leveraging the benefits introduced by neural network sparsity. Deployment of fine-grained sparsity on the existing systolic architectures is greatly hindered by the incurred computational overheads. In this work, we propose S2Engine - a novel systolic architecture that can fully exploit the sparsity in CNNs with maximized data reuse. S2Engine transmits compressed data internally and allows each processing element to dynamically select an aligned data from the compressed dataflow in convolution. Compared to the naïve systolic array, S2Engine achieves about 3.2× and about 3.0× improvements on speed and energy efficiency, respectively.

源语言英语
页(从-至)1440-1452
页数13
期刊IEEE Transactions on Computers
71
6
DOI
出版状态已出版 - 1 6月 2022

联合国可持续发展目标

此成果有助于实现下列可持续发展目标:

  1. 可持续发展目标 7 - 经济适用的清洁能源
    可持续发展目标 7 经济适用的清洁能源

指纹

探究 'S2Engine: A Novel Systolic Architecture for Sparse Convolutional Neural Networks' 的科研主题。它们共同构成独一无二的指纹。

引用此