Abstract
The application of convolutional neural networks (CNNs) in computer vision highly depends on the consumption of computation and memory resources, which affects its development on resource-limited devices. Accordingly, CNN compression has attracted increasing attention. In this paper, we propose an efficient end-to-end pruning method based on feature stabilization (EPFS), which is feasible to be implemented for structured pruning such as filter pruning and block pruning. For block pruning, we introduce a mask to scale the output of structures and the ℓ1-regularization term to sparsify the mask. For filter pruning, a novel ℓ2-regularization term is proposed to constraint the mask along with the ℓ1-regularization. Besides, we introduce the Center Loss to stabilize the deep feature and fast iterative shrinkage-thresholding algorithm (FISTA) to accelerate the convergence of mask. Extensive experiments demonstrate the superiority of our EPFS. On CIFAR-10, EPFS saves 47.5 % FLOPs on VGGNet with 1.17 % Top-1 accuracy increase. Furthermore, on ImageNet ILSVRC2012, EPFS reduces 55.2 % FLOPs on ResNet-18 with o.nly 1.63 % Top-1 accuracy decrease, which promotes the state-of-the-arts.
| Original language | English |
|---|---|
| Pages (from-to) | 7409-7420 |
| Number of pages | 12 |
| Journal | Neural Computing and Applications |
| Volume | 33 |
| Issue number | 13 |
| DOIs | |
| State | Published - Jul 2021 |
Keywords
- CNN compression
- CNN pruning
- Convolutional neural network
- Image classification
Fingerprint
Dive into the research topics of 'Efficient structured pruning based on deep feature stabilization'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver