Skip to main navigation Skip to search Skip to main content

Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor

  • Lei Liu*
  • , Zhihao Hu
  • , Zhenghao Chen
  • *Corresponding author for this work
  • Beihang University
  • University of Sydney

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Point cloud compression has garnered significant interest in computer vision. However, existing algorithms primarily cater to human vision, while most point cloud data is utilized for machine vision tasks. To address this, we propose a point cloud compression framework that simultaneously handles both human and machine vision tasks. Our framework learns a scalable bit-stream, using only subsets for different machine vision tasks to save bit-rate, while employing the entire bit-stream for human vision tasks. Building on mainstream octree-based frameworks like VoxelContext-Net, OctAttention, and G-PCC, we introduce a new octree depth-level predictor. This predictor adaptively determines the optimal depth level for each octree constructed from a point cloud, controlling the bit-rate for machine vision tasks. For simpler tasks (e.g., classification) or objects/scenarios, we use fewer depth levels with fewer bits, saving bit-rate. Conversely, for more complex tasks (e.g., segmentation) or objects/scenarios, we use deeper depth levels with more bits to enhance performance. Experimental results on various datasets (e.g., ModelNet10, ModelNet40, ShapeNet, ScanNet, and KITTI) show that our point cloud compression approach improves performance for machine vision tasks without compromising human vision quality.

Original languageEnglish
Title of host publicationGeneralizing from Limited Resources in the Open World - 2nd International Workshop, GLOW 2024, Held in Conjunction with IJCAI 2024, Proceedings
EditorsJinyang Guo, Yuqing Ma, Yifu Ding, Xingyu Zheng, Changyi He, Xianglong Liu, Ruihao Gong, Yantao Lu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages3-17
Number of pages15
ISBN (Print)9789819761241
DOIs
StatePublished - 2024
Event2nd International Workshop on Generalizing from Limited Resources in the Open World, GLOW 2024, Held in Conjunction with International Joint Conference on Artificial Intelligence, IJCAI 2024 - Jeju Island, Korea, Republic of
Duration: 3 Aug 20243 Aug 2024

Publication series

NameCommunications in Computer and Information Science
Volume2160 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference2nd International Workshop on Generalizing from Limited Resources in the Open World, GLOW 2024, Held in Conjunction with International Joint Conference on Artificial Intelligence, IJCAI 2024
Country/TerritoryKorea, Republic of
CityJeju Island
Period3/08/243/08/24

Keywords

  • Point Cloud Compression
  • Scalable Coding for Machine

Fingerprint

Dive into the research topics of 'Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor'. Together they form a unique fingerprint.

Cite this