Skip to main navigation Skip to search Skip to main content

Energy-Efficient CNNs Accelerator Implementation on FPGA with Optimized Storage and Dataflow

  • Yonghua Zhang
  • , Hongxu Jiang
  • , Xiaobin Li
  • , Rui Miao
  • , Jinyan Nie
  • , Yu Du
  • Beihang University
  • Beijing Union University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Convolutional neural networks (CNNs) has been widely used in computer vision and speech processing, and has achieved great success. However, the deployment of large-scale CNN model is limited by computing and memory in the smart embedded system. Through the current high parallel computing paradigm of CNNs accelerator, the computing requirements can be effectively met to achieve high throughput. However, because the communication cost may be higher than the computing cost for small smart platform, the energy consumption is still very high. In order to solve this problem, a new CNNs accelerator based on storage and data flow is proposed, which realizes energy-saving CNNs inference acceleration by minimizing data access and maximizing data reuse. This paper implements the accelerator on the Zynq UltraScale+ MPSoC ZCU102 evaluation board, and evaluates the throughput and energy efficiency of the accelerator for typical vgg16 and tiny Yolo benchmark networks. Compared with other accelerators, the our accelerator improves the system energy efficiency by 6.3×, the system throughput by 41×, and the throughput of a single DSP by 7.63×.

Original languageEnglish
Title of host publication19th IEEE International Symposium on Parallel and Distributed Processing with Applications, 11th IEEE International Conference on Big Data and Cloud Computing, 14th IEEE International Conference on Social Computing and Networking and 11th IEEE International Conference on Sustainable Computing and Communications, ISPA/BDCloud/SocialCom/SustainCom 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1209-1214
Number of pages6
ISBN (Electronic)9781665435741
DOIs
StatePublished - 2021
Event19th IEEE International Symposium on Parallel and Distributed Processing with Applications, 11th IEEE International Conference on Big Data and Cloud Computing, 14th IEEE International Conference on Social Computing and Networking and 11th IEEE International Conference on Sustainable Computing and Communications, ISPA/BDCloud/SocialCom/SustainCom 2021 - New York, United States
Duration: 30 Sep 20213 Oct 2021

Publication series

Name19th IEEE International Symposium on Parallel and Distributed Processing with Applications, 11th IEEE International Conference on Big Data and Cloud Computing, 14th IEEE International Conference on Social Computing and Networking and 11th IEEE International Conference on Sustainable Computing and Communications, ISPA/BDCloud/SocialCom/SustainCom 2021

Conference

Conference19th IEEE International Symposium on Parallel and Distributed Processing with Applications, 11th IEEE International Conference on Big Data and Cloud Computing, 14th IEEE International Conference on Social Computing and Networking and 11th IEEE International Conference on Sustainable Computing and Communications, ISPA/BDCloud/SocialCom/SustainCom 2021
Country/TerritoryUnited States
CityNew York
Period30/09/213/10/21

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • CNNs Accelerator
  • Dataflow
  • Energy-Efficient
  • FPGA
  • Storage

Fingerprint

Dive into the research topics of 'Energy-Efficient CNNs Accelerator Implementation on FPGA with Optimized Storage and Dataflow'. Together they form a unique fingerprint.

Cite this