TY - JOUR
T1 - Semantic-aware style transfer for unsupervised aircraft pose estimation
AU - Luo, Qifeng
AU - Zhou, Danya
AU - Wei, Zhenzhong
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/8
Y1 - 2025/8
N2 - The aircraft pose estimation based on deep learning has become prevalent in aviation, but its performance heavily relies on expensive and scarce annotated real aircraft data. Although researchers adopt style transfer methods to generate real-style images to reduce dependency on real annotations, existing methods mainly focus on global style while neglecting local style variations. Crucially, aircraft exhibit distinct local styles that correlate strongly with their semantics. These limitations lead to suboptimal stylization effects, ultimately degrading pose estimation accuracy. To address these challenges, we propose a semantic-aware style transfer method that employs adaptive multi-head attention transfer modules to establish correspondences between local semantics and styles. Meanwhile, we introduce a contextual style loss and a contrastive consistency content loss to guide the network in learning semantic-style relationships, achieving highly realistic stylized effects. Based on this method, we develop an unsupervised aircraft pose estimation framework that relies solely on rendering data and unlabeled real images. The framework employs stylized images to train pose estimator and incorporates a pseudo-supervised segmentation loss to enhance cross-domain feature consistency. This loss utilizes pseudo-labels generated via our unsupervised segmentation approach, which exploits intra-image feature self-similarity and inter-image feature mutual similarity to produce accurate masks. Extensive experiments demonstrate that our style transfer method outperforms existing approaches in both qualitative and quantitative evaluations. Particularly, our unsupervised pose estimation method achieves a 3.39% higher proportion of predictions with errors under 10∘ compared to the best supervised baseline. Our method significantly reduces the reliance on manual annotations, thus facilitating the practical deployment of deep learning-based pose estimation in real aircraft scenes.
AB - The aircraft pose estimation based on deep learning has become prevalent in aviation, but its performance heavily relies on expensive and scarce annotated real aircraft data. Although researchers adopt style transfer methods to generate real-style images to reduce dependency on real annotations, existing methods mainly focus on global style while neglecting local style variations. Crucially, aircraft exhibit distinct local styles that correlate strongly with their semantics. These limitations lead to suboptimal stylization effects, ultimately degrading pose estimation accuracy. To address these challenges, we propose a semantic-aware style transfer method that employs adaptive multi-head attention transfer modules to establish correspondences between local semantics and styles. Meanwhile, we introduce a contextual style loss and a contrastive consistency content loss to guide the network in learning semantic-style relationships, achieving highly realistic stylized effects. Based on this method, we develop an unsupervised aircraft pose estimation framework that relies solely on rendering data and unlabeled real images. The framework employs stylized images to train pose estimator and incorporates a pseudo-supervised segmentation loss to enhance cross-domain feature consistency. This loss utilizes pseudo-labels generated via our unsupervised segmentation approach, which exploits intra-image feature self-similarity and inter-image feature mutual similarity to produce accurate masks. Extensive experiments demonstrate that our style transfer method outperforms existing approaches in both qualitative and quantitative evaluations. Particularly, our unsupervised pose estimation method achieves a 3.39% higher proportion of predictions with errors under 10∘ compared to the best supervised baseline. Our method significantly reduces the reliance on manual annotations, thus facilitating the practical deployment of deep learning-based pose estimation in real aircraft scenes.
KW - Aircraft
KW - Segmentation
KW - Semantic-aware
KW - Style transfer
KW - Unsupervised pose estimation
UR - https://www.scopus.com/pages/publications/105012258664
U2 - 10.1007/s44443-025-00171-7
DO - 10.1007/s44443-025-00171-7
M3 - 文章
AN - SCOPUS:105012258664
SN - 1319-1578
VL - 37
JO - Journal of King Saud University - Computer and Information Sciences
JF - Journal of King Saud University - Computer and Information Sciences
IS - 6
M1 - 146
ER -