跳到主要导航 跳到搜索 跳到主要内容

UGV Swarm Multi-View Fusion Under Occlusion: A Graph-Based Calibration-Free Framework

  • Jiaqi Jing
  • , Weilong Song
  • , Hangcheng Zhang
  • , Yong Liu
  • , Fuyong Feng
  • , Dezhi Zheng
  • , Shangchun Fan*
  • *此作品的通讯作者
  • Beihang University
  • China North Artificial Intelligence & Innovation Research Institute
  • Collective Intelligence & Collaboration Laboratory
  • Beijing Institute of Technology

科研成果: 期刊稿件文章同行评审

摘要

Highlights: What are the main findings? A calibration-free, end-to-end graph-based framework is proposed for joint camera and subject registration in UGV swarms, operating robustly under severe occlusion and low inter-view co-visibility. A graph-based pose propagation module (GPPM) is proposed that enables global alignment via BFS-guided pose propagation along local co-visibility links, eliminating the need for full co-visibility with a root node or pre-calibrated extrinsics. What are the implications of the main findings? Despite being a purely vision-based solution, the method enables infrastructure-free perception in GPS-denied or complex environments by decoupling multi-view fusion from explicit agent pose estimation, therefore supporting dynamic deployment of UGV swarms with a central node for robust collaborative perception tasks. The method enables robust BEV scene representation and collective situational awareness for UGV swarms, even when inter-camera overlap is minimal and occlusions are severe. In unmanned ground vehicle (UGV) swarm systems, comprehensive environmental awareness is critical for coordinated operations. Yet they are frequently deployed in occlusion-rich, constrained environments where multi-agent visual fusion is essential. However, existing methods are critically limited by offline-calibrated extrinsic parameters, hindering flexible deployment, and by a strong co-visibility assumption, which fails under severe occlusion. To overcome these constraints, we introduce an end-to-end, calibration-free framework for the joint registration of cameras and subjects. Our approach begins with a single-view module that estimates subjects’ poses and appearance features. Subsequently, a novel graph-based pose propagation module (GPPM) treats UGVs’ cameras as nodes in a graph, connecting them with edges when they share co-visible subjects identified via appearance matching. Breadth-first search (BFS) then finds the shortest registration path from any camera to a designated root camera, enabling pose propagation via local co-visibility links and global alignment of all subjects into a unified bird’s-eye-view (BEV) space. This strategy relaxes the stringent requirement of full co-visibility with the root node. A multi-task loss function is proposed to jointly optimize pose estimation and feature matching. Trained and evaluated on a synthetic dataset with occlusions (CSRD-O) collected by a UGV swarm system, our framework achieves mean camera pose errors of 1.57 m/8.70° and mean subject pose errors of 1.40 m/9.14°. Furthermore, we demonstrate a scene monitoring task using a UGV swarm system. Experiments show that the proposed method generates robust BEV estimates even under severe occlusion and low inter-view overlap. This work presents a purely visual, self-calibrating multi-view fusion perception scheme, demonstrating its potential to support cooperative perception, task-oriented monitoring, and collective situational awareness in UGV swarm systems.

源语言英语
文章编号214
期刊Drones
10
3
DOI
出版状态已出版 - 3月 2026

指纹

探究 'UGV Swarm Multi-View Fusion Under Occlusion: A Graph-Based Calibration-Free Framework' 的科研主题。它们共同构成独一无二的指纹。

引用此