Efficient structured pruning based on deep feature stabilization

Research output: Contribution to journalArticlepeer-review

Abstract

The application of convolutional neural networks (CNNs) in computer vision highly depends on the consumption of computation and memory resources, which affects its development on resource-limited devices. Accordingly, CNN compression has attracted increasing attention. In this paper, we propose an efficient end-to-end pruning method based on feature stabilization (EPFS), which is feasible to be implemented for structured pruning such as filter pruning and block pruning. For block pruning, we introduce a mask to scale the output of structures and the ℓ1-regularization term to sparsify the mask. For filter pruning, a novel ℓ2-regularization term is proposed to constraint the mask along with the ℓ1-regularization. Besides, we introduce the Center Loss to stabilize the deep feature and fast iterative shrinkage-thresholding algorithm (FISTA) to accelerate the convergence of mask. Extensive experiments demonstrate the superiority of our EPFS. On CIFAR-10, EPFS saves 47.5 % FLOPs on VGGNet with 1.17 % Top-1 accuracy increase. Furthermore, on ImageNet ILSVRC2012, EPFS reduces 55.2 % FLOPs on ResNet-18 with o.nly 1.63 % Top-1 accuracy decrease, which promotes the state-of-the-arts.

Original languageEnglish
Pages (from-to)7409-7420
Number of pages12
JournalNeural Computing and Applications
Volume33
Issue number13
DOIs
StatePublished - Jul 2021

Keywords

  • CNN compression
  • CNN pruning
  • Convolutional neural network
  • Image classification

Fingerprint

Dive into the research topics of 'Efficient structured pruning based on deep feature stabilization'. Together they form a unique fingerprint.

Cite this