Abstract
Convolutional neural networks (CNNs) has been widely used in computer vision and speech processing, and has achieved great success. However, the deployment of large-scale CNN model is limited by computing and memory in the smart embedded system. Through the current high parallel computing paradigm of CNNs accelerator, the computing requirements can be effectively met to achieve high throughput. However, because the communication cost may be higher than the computing cost for small smart platform, the energy consumption is still very high. In order to solve this problem, a new CNNs accelerator based on storage and data flow is proposed, which realizes energy-saving CNNs inference acceleration by minimizing data access and maximizing data reuse. This paper implements the accelerator on the Zynq UltraScale+ MPSoC ZCU102 evaluation board, and evaluates the throughput and energy efficiency of the accelerator for typical vgg16 and tiny Yolo benchmark networks. Compared with other accelerators, the our accelerator improves the system energy efficiency by 6.3×, the system throughput by 41×, and the throughput of a single DSP by 7.63×.
| Original language | English |
|---|---|
| Title of host publication | 19th IEEE International Symposium on Parallel and Distributed Processing with Applications, 11th IEEE International Conference on Big Data and Cloud Computing, 14th IEEE International Conference on Social Computing and Networking and 11th IEEE International Conference on Sustainable Computing and Communications, ISPA/BDCloud/SocialCom/SustainCom 2021 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 1209-1214 |
| Number of pages | 6 |
| ISBN (Electronic) | 9781665435741 |
| DOIs | |
| State | Published - 2021 |
| Event | 19th IEEE International Symposium on Parallel and Distributed Processing with Applications, 11th IEEE International Conference on Big Data and Cloud Computing, 14th IEEE International Conference on Social Computing and Networking and 11th IEEE International Conference on Sustainable Computing and Communications, ISPA/BDCloud/SocialCom/SustainCom 2021 - New York, United States Duration: 30 Sep 2021 → 3 Oct 2021 |
Publication series
| Name | 19th IEEE International Symposium on Parallel and Distributed Processing with Applications, 11th IEEE International Conference on Big Data and Cloud Computing, 14th IEEE International Conference on Social Computing and Networking and 11th IEEE International Conference on Sustainable Computing and Communications, ISPA/BDCloud/SocialCom/SustainCom 2021 |
|---|
Conference
| Conference | 19th IEEE International Symposium on Parallel and Distributed Processing with Applications, 11th IEEE International Conference on Big Data and Cloud Computing, 14th IEEE International Conference on Social Computing and Networking and 11th IEEE International Conference on Sustainable Computing and Communications, ISPA/BDCloud/SocialCom/SustainCom 2021 |
|---|---|
| Country/Territory | United States |
| City | New York |
| Period | 30/09/21 → 3/10/21 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- CNNs Accelerator
- Dataflow
- Energy-Efficient
- FPGA
- Storage
Fingerprint
Dive into the research topics of 'Energy-Efficient CNNs Accelerator Implementation on FPGA with Optimized Storage and Dataflow'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver