跳到主要导航 跳到搜索 跳到主要内容

PiClick: Picking the desired mask from multiple candidates in click-based interactive segmentation

  • Cilin Yan
  • , Haochen Wang
  • , Jie Liu
  • , Xiaolong Jiang
  • , Yao Hu
  • , Xu Tang
  • , Guoliang Kang*
  • , Efstratios Gavves
  • *此作品的通讯作者
  • Beihang University
  • University of Amsterdam
  • Xiaohongshu

科研成果: 期刊稿件文章同行评审

摘要

Click-based interactive segmentation aims to generate target masks via human clicking, which facilitates efficient pixel-level annotation and image editing. In such a task, target ambiguity remains a problem hindering the accuracy and efficiency of segmentation. That is, in scenes with rich context, one click may correspond to multiple potential targets, while most previous interactive segmentors only generate a single mask and fail to deal with target ambiguity. In this paper, we propose a novel interactive segmentation network named PiClick, to yield all potentially reasonable masks and suggest the most plausible one for the user. Specifically, PiClick utilizes a Transformer-based architecture to generate all potential target masks by mutually interactive mask queries. Moreover, a Target Reasoning module is designed in PiClick to automatically suggest the user-desired mask from all candidates, relieving target ambiguity and extra-human efforts. Extensive experiments on 9 interactive segmentation datasets demonstrate PiClick performs favorably against previous state-of-the-arts considering the segmentation results. Moreover, we show that PiClick effectively reduces human efforts in annotating and picking the desired masks. To ease the usage and inspire future research, we release the source code of PiClick together with a plug-and-play annotation tool at https://github.com/cilinyan/PiClick.

源语言英语
文章编号128083
期刊Neurocomputing
599
DOI
出版状态已出版 - 28 9月 2024

指纹

探究 'PiClick: Picking the desired mask from multiple candidates in click-based interactive segmentation' 的科研主题。它们共同构成独一无二的指纹。

引用此