Promoting the Harmony between Sparsity and Regularity: A Relaxed Synchronous Architecture for Convolutional Neural Networks

  • Wenyan Lu*
  • , Guihai Yan
  • , Jiajun Li
  • , Shijun Gong
  • , Shuhao Jiang
  • , Jingya Wu
  • , Xiaowei Li
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

There are two approaches to improve the performance of Convolutional Neural Networks (CNNs): 1) accelerating computation and 2) reducing the amount of computation. The acceleration approaches take the advantage of CNN computing regularity which enables abundant fine-grained parallelisms in feature maps, neurons, and synapses. Alternatively, reducing computations leverages the intrinsic sparsity of CNN neurons and synapses. The sparsity represents as the computing bubbles, i.e., zero or tiny-valued neurons and synapses. These bubbles can be removed to reduce the volume of computations. Although distinctly different from each other in principle, we find that the two types of approaches are not orthogonal to each other. Even worse, they may conflict to each other when working together. The conditional branches introduced by some bubble-removing mechanisms in the original computations destroy the regularity of deeply nested loops, thereby impairing the intrinsic parallelisms. Therefore, enabling the synergy between the two types of approaches is critical to arrive at superior performance. This paper proposed a relaxed synchronous computing architecture, FlexFlow-Pro, to fulfill this purpose. Compared with the state-of-the-art accelerators, the FlexFlow-Pro gains more than 2.5× performance on average and 2× energy efficiency.

Original languageEnglish
Article number8594573
Pages (from-to)867-881
Number of pages15
JournalIEEE Transactions on Computers
Volume68
Issue number6
DOIs
StatePublished - 1 Jun 2019
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Convolutional neural networks
  • accelerator
  • architecture
  • parallelism
  • sparsity

Fingerprint

Dive into the research topics of 'Promoting the Harmony between Sparsity and Regularity: A Relaxed Synchronous Architecture for Convolutional Neural Networks'. Together they form a unique fingerprint.

Cite this