TY - GEN
T1 - A Lightweight Network Model for Video Frame Interpolation Using Spatial Pyramids
AU - Zhuang, Jiankai
AU - Qin, Zengchang
AU - Chen, Jialu
AU - Wan, Tao
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/10
Y1 - 2020/10
N2 - In recent years, deep learning based video frame interpolation methods have shown impressive results in handling occlusion, blur and large motion. However, they are usually very heavy in terms of model size, and they hardly to be employed in i.e. mobile phones or other portable devices with limited computing power. To address the problem, we propose light-weighted Spatial Pyramid Frame Interpolation Network (SPFIN), a hierarchical network in a coarse-to-fine approach to reconstruct frames. At each pyramid level, we apply two light sub-networks to model optical flow and visibility mask instead of commonly used U-Net architecture. The flow and mask are up-sampled and optimized progressively. Finally, the intermediate frame is formed by linearly blending warped frames and masks. Experimental results on two benchmark problems show that our model has the smallest size, but better or comparable performance comparing to existing state-of-the art models.
AB - In recent years, deep learning based video frame interpolation methods have shown impressive results in handling occlusion, blur and large motion. However, they are usually very heavy in terms of model size, and they hardly to be employed in i.e. mobile phones or other portable devices with limited computing power. To address the problem, we propose light-weighted Spatial Pyramid Frame Interpolation Network (SPFIN), a hierarchical network in a coarse-to-fine approach to reconstruct frames. At each pyramid level, we apply two light sub-networks to model optical flow and visibility mask instead of commonly used U-Net architecture. The flow and mask are up-sampled and optimized progressively. Finally, the intermediate frame is formed by linearly blending warped frames and masks. Experimental results on two benchmark problems show that our model has the smallest size, but better or comparable performance comparing to existing state-of-the art models.
KW - Deep learning
KW - Frame interpolation
KW - Optical flow
KW - Pyramid network
UR - https://www.scopus.com/pages/publications/85098638634
U2 - 10.1109/ICIP40778.2020.9191039
DO - 10.1109/ICIP40778.2020.9191039
M3 - 会议稿件
AN - SCOPUS:85098638634
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 543
EP - 547
BT - 2020 IEEE International Conference on Image Processing, ICIP 2020 - Proceedings
PB - IEEE Computer Society
T2 - 2020 IEEE International Conference on Image Processing, ICIP 2020
Y2 - 25 September 2020 through 28 September 2020
ER -