跳到主要导航 跳到搜索 跳到主要内容

Accurate semantic segmentation of very high-resolution remote sensing images considering feature state sequences: From benchmark datasets to urban applications

  • Zijie Wang
  • , Jizheng Yi*
  • , Aibin Chen
  • , Lijiang Chen
  • , Hui Lin
  • , Kai Xu
  • *此作品的通讯作者
  • Central South University of Forestry & Technology
  • Key Laboratory of Forestry Remote Sensing Based Big Data and Ecological Security for Hunan Province
  • Key Laboratory on Forest Resources Management and Monitoring in Southern China of National Forestry and Grassland Administration
  • National University of Defense Technology

科研成果: 期刊稿件文章同行评审

摘要

Very High-Resolution (VHR) urban remote sensing images segmentation is widely used in ecological environmental protection, urban dynamic monitoring, fine urban management and other related fields. However, the large-scale variation and discrete distribution of objects in VHR images presents a significant challenge to accurate segmentation. The existing studies have primarily concentrated on the internal correlations within a single features, while overlooking the inherent sequential relationships across different feature state. In this paper, a novel Urban Spatial Segmentation Framework (UrbanSSF) is proposed, which fully considers the connections between feature states at different phases. Specifically, the Feature State Interaction (FSI) Mamba with powerful sequence modeling capabilities is designed based on state space modules. It effectively facilitates interactions between the information across different features. Given the disparate semantic information and spatial details of features at different scales, a Global Semantic Enhancer (GSE) module and a Spatial Interactive Attention (SIA) mechanism are designed. The GSE module operates on the high-level features, while the SIA mechanism processes the middle and low-level features. To address the computational challenges of large-scale dense feature fusion, a Channel Space Reconstruction (CSR) algorithm is proposed. This algorithm effectively reduces the computational burden while ensuring efficient processing and maintaining accuracy. In addition, the lightweight UrbanSSF-T, the efficient UrbanSSF-S and the accurate UrbanSSF-L are designed to meet different application requirements in urban scenarios. Comprehensive experiments on the UAVid, ISPRS Vaihingen and Potsdam datasets validate the superior performance of UrbanSSF series. Especially, the UrbanSSF-L achieves a mean intersection over union of 71.0% on the UAVid dataset. Code is available at https://github.com/KotlinWang/UrbanSSF.

源语言英语
页(从-至)824-840
页数17
期刊ISPRS Journal of Photogrammetry and Remote Sensing
220
DOI
出版状态已出版 - 2月 2025

指纹

探究 'Accurate semantic segmentation of very high-resolution remote sensing images considering feature state sequences: From benchmark datasets to urban applications' 的科研主题。它们共同构成独一无二的指纹。

引用此