跳到主要导航 跳到搜索 跳到主要内容

T1000: Mitigating the memory footprint of convolution neural networks with decomposition and re-fusion

  • Beihang University
  • Sun Yat-Sen University

科研成果: 期刊稿件文章同行评审

摘要

In recent years, convolution neural networks have significantly advanced the frontier of computer vision and other intelligent applications due to its promising accuracy. However, the improved accuracy comes with the formidable computation complexity with deeper convolution layers, which prevents its adoption on resource constrained system such as embedded and mobile. Although research efforts have been devoted to reduce the computation complexity of convolution neural networks through tensor decomposition, the volume of intermediate data generated by the tensor decomposition grows dramatically, which consumes more memory resource and has not been addressed by existing work. In this work, we propose T1000 to re-fuse the convolutions across tensors after applying the canonical polyadic decomposition to conventional convolution layers so that we can receive the benefit of reduced computation complexity, in the meanwhile mitigate the memory occupancy of the intermediate data. We demonstrate the effectiveness of our approach by applying canonical polyadic decomposition and re-fusion to the convolution layers of two well-known convolution neural networks, AlexNet and VGG-19 implemented with Caffe. Compared to the default canonical polyadic decomposition, our approach reduces the memory occupancy of the intermediate data by 84.6% and 77.4% for AlexNet and VGG-19 respectively. In addition, our approach improves the performance of AlexNet and VGG-19 by 1.77× and 1.4× respectively.

源语言英语
页(从-至)1-10
页数10
期刊Future Generation Computer Systems
84
DOI
出版状态已出版 - 7月 2018

指纹

探究 'T1000: Mitigating the memory footprint of convolution neural networks with decomposition and re-fusion' 的科研主题。它们共同构成独一无二的指纹。

引用此