TY - JOUR
T1 - Reducing complexity of HEVC
T2 - A deep learning approach
AU - Xu, Mai
AU - Li, Tianyi
AU - Wang, Zulin
AU - Deng, Xin
AU - Yang, Ren
AU - Guan, Zhenyu
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/10
Y1 - 2018/10
N2 - High efficiency video coding (HEVC) significantly reduces bit rates over the preceding H.264 standard but at the expense of extremely high encoding complexity. In HEVC, the quad-tree partition of the coding unit (CU) consumes a large proportion of the HEVC encoding complexity, due to the brute-force search for rate-distortion optimization (RDO). Therefore, this paper proposes a deep learning approach to predict the CU partition for reducing the HEVC complexity at both intra-and inter-modes, which is based on convolutional neural network (CNN) and long- A nd short-term memory (LSTM) network. First, we establish a large-scale database including substantial CU partition data for the HEVC intra- A nd inter-modes. This enables deep learning on the CU partition. Second, we represent the CU partition of an entire coding tree unit in the form of a hierarchical CU partition map (HCPM). Then, we propose an early terminated hierarchical CNN (ETH-CNN) for learning to predict the HCPM. Consequently, the encoding complexity of intra-mode HEVC can be drastically reduced by replacing the brute-force search with ETH-CNN to decide the CU partition. Third, an ETH-LSTM is proposed to learn the temporal correlation of the CU partition. Then, we combine the ETH-LSTM and the ETH-CNN to predict the CU partition for reducing the HEVC complexity at inter-mode. Finally, experimental results show that our approach outperforms the other state-of-the-art approaches in reducing the HEVC complexity at both intra- A nd inter-modes.
AB - High efficiency video coding (HEVC) significantly reduces bit rates over the preceding H.264 standard but at the expense of extremely high encoding complexity. In HEVC, the quad-tree partition of the coding unit (CU) consumes a large proportion of the HEVC encoding complexity, due to the brute-force search for rate-distortion optimization (RDO). Therefore, this paper proposes a deep learning approach to predict the CU partition for reducing the HEVC complexity at both intra-and inter-modes, which is based on convolutional neural network (CNN) and long- A nd short-term memory (LSTM) network. First, we establish a large-scale database including substantial CU partition data for the HEVC intra- A nd inter-modes. This enables deep learning on the CU partition. Second, we represent the CU partition of an entire coding tree unit in the form of a hierarchical CU partition map (HCPM). Then, we propose an early terminated hierarchical CNN (ETH-CNN) for learning to predict the HCPM. Consequently, the encoding complexity of intra-mode HEVC can be drastically reduced by replacing the brute-force search with ETH-CNN to decide the CU partition. Third, an ETH-LSTM is proposed to learn the temporal correlation of the CU partition. Then, we combine the ETH-LSTM and the ETH-CNN to predict the CU partition for reducing the HEVC complexity at inter-mode. Finally, experimental results show that our approach outperforms the other state-of-the-art approaches in reducing the HEVC complexity at both intra- A nd inter-modes.
KW - High efficiency video coding
KW - complexity reduction
KW - convolutional neural network
KW - deep learning
KW - long- A nd short-term memory network
UR - https://www.scopus.com/pages/publications/85048548131
U2 - 10.1109/TIP.2018.2847035
DO - 10.1109/TIP.2018.2847035
M3 - 文章
AN - SCOPUS:85048548131
SN - 1057-7149
VL - 27
SP - 5044
EP - 5059
JO - IEEE Transactions on Image Processing
JF - IEEE Transactions on Image Processing
IS - 10
ER -