Abstract
Convolutional Neural Networks (CNNs) have achieved remarkable performance at a huge computational cost. By improving the model sparsity, it can effectively reduce the complexity. However, with deepening of sparsity, the problems of unbalanced workloads, computing fragmentation and mapping access conflict caused by irregular sparsity have become more and more remarkable. These problems pose great challenges for efficient computation of sparse CNN s. In order to make full use of two side of sparsity introduced by activations and weights, and overcome the above problems, this paper proposes an efficient sparse CNN s accelerator on FPGA to achieve the inference acceleration. We designed and implemented the accelerator on the Zynq UltraScale+ MPSoC ZCU102 evaluation board. By running AlexNet, VGG16 and ResNet50 networks on the accelerator to evaluated the peeformance. Experimental results show that the method proposed in this paper can achieve more than 97% reduction in collision rate and 2.35x improvement in computing performance and 9.37x improvement in energy efficiency.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2022 IEEE International Conference on Cluster Computing, CLUSTER 2022 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 504-505 |
| Number of pages | 2 |
| ISBN (Electronic) | 9781665498562 |
| DOIs | |
| State | Published - 2022 |
| Event | 2022 IEEE International Conference on Cluster Computing, CLUSTER 2022 - Heidelberg, Germany Duration: 6 Sep 2022 → 9 Sep 2022 |
Publication series
| Name | Proceedings - IEEE International Conference on Cluster Computing, ICCC |
|---|---|
| Volume | 2022-September |
| ISSN (Print) | 1552-5244 |
Conference
| Conference | 2022 IEEE International Conference on Cluster Computing, CLUSTER 2022 |
|---|---|
| Country/Territory | Germany |
| City | Heidelberg |
| Period | 6/09/22 → 9/09/22 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- FPGA
- Hash Mapping Conflict
- Inference Acceleration
- Load Balancing
- Sparse CNN s
Fingerprint
Dive into the research topics of 'An Efficient Sparse CNNs Accelerator on FPGA'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver