Skip to main navigation Skip to search Skip to main content

DropQueries: A Simple Way to Discover Comprehensive Segment Representations

  • Haojie Ding
  • , Bin Wang
  • , Guoliang Kang
  • , Weijia Li
  • , Conghui He
  • , Yao Zhao
  • , Yunchao Wei*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Inspired by the recent progress in object detection (i.e., DETR), the set prediction mechanism significantly advances the research of semantic segmentation and achieves state-of-the-art performance on popular segmentation benchmarks. The generic pipeline of such a mechanism often firstly takes learnable query features to predict classes and segment masks separately and then blends these class-aware segment masks into the final segmentation mask. One key factor behind the successful training of this pipeline is to apply the bipartite matching strategy between the set of predictions and ground-truth segments. However, we find that the bipartite matching-based assignment often tends to segment one target class with only a few learnable queries, making many other pre-defined queries useless. In this article, we propose a simple way, named DropQueries (DQ), to facilitate the set prediction based segmentation architectures. At each iteration of training, our DQ randomly and independently drops each learnable query with a certain probability before bipartite matching. In this way, more queries are encouraged to participate in the segmentation process to discover comprehensive segment representations. We conduct extensive experiments using MaskFormer and Mask2Former as two basic yet powerful segmentation architectures. Without bells and whistles, our DQ strategy can bring consistent improvements over strong baselines on popular semantic segmentation benchmarks, including ADE 20 K, Cityscapes, COCO Stuff 10 K and VSPW.

Original languageEnglish
Pages (from-to)3481-3490
Number of pages10
JournalIEEE Transactions on Multimedia
Volume26
DOIs
StatePublished - 2024

Keywords

  • Semantic segmentation

Fingerprint

Dive into the research topics of 'DropQueries: A Simple Way to Discover Comprehensive Segment Representations'. Together they form a unique fingerprint.

Cite this