跳到主要导航 跳到搜索 跳到主要内容

Reducing complexity of HEVC: A deep learning approach

科研成果: 期刊稿件文章同行评审

摘要

High efficiency video coding (HEVC) significantly reduces bit rates over the preceding H.264 standard but at the expense of extremely high encoding complexity. In HEVC, the quad-tree partition of the coding unit (CU) consumes a large proportion of the HEVC encoding complexity, due to the brute-force search for rate-distortion optimization (RDO). Therefore, this paper proposes a deep learning approach to predict the CU partition for reducing the HEVC complexity at both intra-and inter-modes, which is based on convolutional neural network (CNN) and long- A nd short-term memory (LSTM) network. First, we establish a large-scale database including substantial CU partition data for the HEVC intra- A nd inter-modes. This enables deep learning on the CU partition. Second, we represent the CU partition of an entire coding tree unit in the form of a hierarchical CU partition map (HCPM). Then, we propose an early terminated hierarchical CNN (ETH-CNN) for learning to predict the HCPM. Consequently, the encoding complexity of intra-mode HEVC can be drastically reduced by replacing the brute-force search with ETH-CNN to decide the CU partition. Third, an ETH-LSTM is proposed to learn the temporal correlation of the CU partition. Then, we combine the ETH-LSTM and the ETH-CNN to predict the CU partition for reducing the HEVC complexity at inter-mode. Finally, experimental results show that our approach outperforms the other state-of-the-art approaches in reducing the HEVC complexity at both intra- A nd inter-modes.

源语言英语
页(从-至)5044-5059
页数16
期刊IEEE Transactions on Image Processing
27
10
DOI
出版状态已出版 - 10月 2018

指纹

探究 'Reducing complexity of HEVC: A deep learning approach' 的科研主题。它们共同构成独一无二的指纹。

引用此