跳到主要导航 跳到搜索 跳到主要内容

CUS3D: Clip-Based Unsupervised 3D Segmentation via Object-Level Denoise

  • Fuyang Yu
  • , Runze Tian
  • , Zhen Wang
  • , Xiaochuan Wang
  • , Xiaohui Liang*
  • *此作品的通讯作者
  • Beihang University
  • Institute of Science Tokyo
  • Beijing Technology and Business University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

To ease the difficulty of acquiring annotation labels in 3D data, a common method is using unsupervised and open-vocabulary semantic segmentation, which leverage 2D CLIP semantic knowledge. In this paper, unlike previous research that ignores the "noise"raised during feature projection from 2D to 3D, we propose a novel distillation learning framework named CUS3D. In our approach, an object-level denosing projection module is designed to screen out the "noise"and ensure more accurate 3D feature. Based on the obtained features, a multimodal distillation learning module is designed to align the 3D feature with CLIP semantic feature space with object-centered constrains to achieve advanced unsupervised semantic segmentation. We conduct comprehensive experiments in both unsupervised and open-vocabulary segmentation, and the results consistently showcase the superiority of our model in achieving advanced unsupervised segmentation results and its effectiveness in open-vocabulary segmentation.

源语言英语
主期刊名2024 IEEE International Conference on Multimedia and Expo, ICME 2024
出版商IEEE Computer Society
ISBN(电子版)9798350390155
DOI
出版状态已出版 - 2024
活动2024 IEEE International Conference on Multimedia and Expo, ICME 2024 - Niagra Falls, 加拿大
期限: 15 7月 202419 7月 2024

出版系列

姓名Proceedings - IEEE International Conference on Multimedia and Expo
ISSN(印刷版)1945-7871
ISSN(电子版)1945-788X

会议

会议2024 IEEE International Conference on Multimedia and Expo, ICME 2024
国家/地区加拿大
Niagra Falls
时期15/07/2419/07/24

指纹

探究 'CUS3D: Clip-Based Unsupervised 3D Segmentation via Object-Level Denoise' 的科研主题。它们共同构成独一无二的指纹。

引用此