跳到主要导航 跳到搜索 跳到主要内容

WinoGen: A Highly Configurable Winograd Convolution IP Generator for Efficient CNN Acceleration on FPGA

  • Mingjun Li*
  • , Pengjia Li
  • , Shuo Yin
  • , Shixin Chen
  • , Beichen Li
  • , Chong Tong
  • , Jianlei Yang
  • , Tinghuan Chen
  • , Bei Yu
  • *此作品的通讯作者
  • Chinese University of Hong Kong
  • The Chinese University of Hong Kong, Shenzhen

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The convolution neural network (CNN) has been widely adopted in computer vision tasks. In the FPGA-based CNN accelerator design, Winograd convolution can effectively improve computation performance and save hardware resources. However, building efficient and highly compatible IP for arbitrary Winograd convolution on FPGA remains underexplored. To address this issue, we propose a novel and efficient reformulation of Winograd convolution, named Structured Direct Winograd Convolution (SDW). We further develop WinoGen, a Chisel-based highly configurable Winograd convolution IP generator. Given arbitrary input/output tile size and kernel size, it can generate optimized high-performance IP automatically. Meanwhile, our generated IP can be compatible with multiple kernel sizes and tile sizes. Experimental results show that the IP generated by WinoGen achieves DSP efficiency up to 3.80 GOPS/DSP and energy efficiency up to 652.77 GOPS/W while showing 2.45× and 3.10× improvements when processing a same CNN model compared with state-of-the-arts.

源语言英语
主期刊名Proceedings of the 61st ACM/IEEE Design Automation Conference, DAC 2024
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798400706011
DOI
出版状态已出版 - 7 11月 2024
活动61st ACM/IEEE Design Automation Conference, DAC 2024 - San Francisco, 美国
期限: 23 6月 202427 6月 2024

出版系列

姓名Proceedings - Design Automation Conference
ISSN(印刷版)0738-100X

会议

会议61st ACM/IEEE Design Automation Conference, DAC 2024
国家/地区美国
San Francisco
时期23/06/2427/06/24

联合国可持续发展目标

此成果有助于实现下列可持续发展目标:

  1. 可持续发展目标 7 - 经济适用的清洁能源
    可持续发展目标 7 经济适用的清洁能源

指纹

探究 'WinoGen: A Highly Configurable Winograd Convolution IP Generator for Efficient CNN Acceleration on FPGA' 的科研主题。它们共同构成独一无二的指纹。

引用此