Abstract
Recently, extensive approaches have been proposed for reducing the encoding complexity of high efficiency video coding, by predicting the coding tree unit partition using deep neural networks. However, these approaches cannot work in real time due to the complexity of the network architectures. In this paper, we propose a network pruning approach to accelerate a state-of-the-art deep neural network model, for real-time coding tree unit partition. Specifically, we first investigate the computational complexity throughout the network, and find that most calculations can be simplified by pruning the weight parameters. Considering that the number of weight parameters drastically differs by network layer and partition level, we design an adaptive pruning scheme by applying a well-suitable retention ratio of weight parameters to each layer at a level. The retention ratio indicates the ratio of weight parameters after and before pruning. By varying the retention ratios, we can obtain several accelerated network models with different levels of complexity. We further propose a complexity control algorithm by applying different accelerated models to different coding tree units, to ensure that the actual encoding complexity is close to a given target. To guarantee the rate-distortion performance, we model the complexity control algorithm as a convex optimization problem, and we can obtain a closed-form solution. Experimental results show that our approach can accelerate the original deep neural network model by 17-20 times, with little expense on the Bjontegaard delta bit-rate. For complexity control, we achieve high control accuracy with a control error of less than 2% for most video sequences.
| Original language | English |
|---|---|
| Article number | 9126122 |
| Pages (from-to) | 7482-7496 |
| Number of pages | 15 |
| Journal | IEEE Transactions on Image Processing |
| Volume | 29 |
| DOIs | |
| State | Published - 2020 |
Keywords
- High efficiency video coding
- coding tree unit partition
- complexity control
Fingerprint
Dive into the research topics of 'Accelerate CTU Partition to Real Time for HEVC Encoding with Complexity Control'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver