跳到主要导航 跳到搜索 跳到主要内容

Learning to Segment Video Object with Accurate Boundaries

  • Jingchun Cheng
  • , Yuhui Yuan
  • , Yali Li*
  • , Jingdong Wang
  • , Shengjin Wang
  • *此作品的通讯作者
  • Tsinghua University
  • Microsoft USA

科研成果: 期刊稿件文章同行评审

摘要

Video object segmentation has attracted considerable research interest these years. Top-performing video object segmentation methods mainly rely on fully convolutional neural networks which are specifically trained for predicting high-performance masks, resulting in a lack of preciseness in boundary details. This paper tackles the problem of predicting both mask-accurate and boundary-precise segmentation masks in videos. To solve this problem, we propose a simple and efficient network structure: the Mask-boundAry-Consistent Network (MAC-Net). The MAC-Net is an end-to-end fully convolutional network, where both mask and boundaries are jointly optimized during training, enabling it to predict masks along with accurate boundaries. An inner-net boundary-computing module is incorporated in the MAC-Net for producing spontaneously mask-consistent boundaries. We analyze the influence of parameter settings, network constructions of the MAC-Net, and compare with state-of-the-art algorithms on three widely-adopted datasets. Experimental results show that the MAC-Net achieves state-of-the-art performance, demonstrating the effectiveness of its mask-boundary-consistent network structure. We also propose that the boundary module in MAC-Net has high compatibility, and can be easily adapted to other segmentation-related techniques.

源语言英语
页(从-至)3112-3123
页数12
期刊IEEE Transactions on Multimedia
23
DOI
出版状态已出版 - 2021

指纹

探究 'Learning to Segment Video Object with Accurate Boundaries' 的科研主题。它们共同构成独一无二的指纹。

引用此