TY - JOUR
T1 - Semi-supervised domain generalization for fault diagnosis using adaptive pseudo-label selection and distributionally robust optimization
AU - Qi, Zhikuan
AU - Luo, Zhi
AU - Miao, Yonghao
AU - Zhou, Shaoping
N1 - Publisher Copyright:
© 2026 Elsevier Ltd.
PY - 2026/7/1
Y1 - 2026/7/1
N2 - Deep learning's potential for intelligent fault diagnosis (IFD) is constrained by inadequate model generalization to unseen working conditions, which motivates research in domain generalization-based fault diagnosis (DGFD). However, most DGFD methods rely on multiple labeled source domains during the training phase, conflicting with the prevalent scarcity of labeled industrial data. To bridge this gap, this paper introduces a novel semi-supervised domain generalization-based fault diagnosis (SemiDGFD) method using adaptive pseudo-label selection and distributionally robust optimization (Ada-DRO). The Ada-DRO only requires one labeled source domain and multiple unlabeled source domains. Furthermore, this method combines the primary branch model with auxiliary branch models. Firstly, it employs multiple auxiliary branch models to assign pseudo-labels to unlabeled data through distribution alignment. Subsequently, an adaptive threshold selects high-confidence pseudo-labels to mitigate noise. Finally, an uncertainty set is constructed using mask-augmented labeled source data and selected pseudo-labeled data. Wasserstein distance constrains the set scope, and distributionally robust optimization (DRO) is performed over this set to enhance the primary branch model's cross-domain generalization performance. Extensive experiments demonstrate the proposed method's superior accuracy over state-of-the-art SemiDGFD methods.
AB - Deep learning's potential for intelligent fault diagnosis (IFD) is constrained by inadequate model generalization to unseen working conditions, which motivates research in domain generalization-based fault diagnosis (DGFD). However, most DGFD methods rely on multiple labeled source domains during the training phase, conflicting with the prevalent scarcity of labeled industrial data. To bridge this gap, this paper introduces a novel semi-supervised domain generalization-based fault diagnosis (SemiDGFD) method using adaptive pseudo-label selection and distributionally robust optimization (Ada-DRO). The Ada-DRO only requires one labeled source domain and multiple unlabeled source domains. Furthermore, this method combines the primary branch model with auxiliary branch models. Firstly, it employs multiple auxiliary branch models to assign pseudo-labels to unlabeled data through distribution alignment. Subsequently, an adaptive threshold selects high-confidence pseudo-labels to mitigate noise. Finally, an uncertainty set is constructed using mask-augmented labeled source data and selected pseudo-labeled data. Wasserstein distance constrains the set scope, and distributionally robust optimization (DRO) is performed over this set to enhance the primary branch model's cross-domain generalization performance. Extensive experiments demonstrate the proposed method's superior accuracy over state-of-the-art SemiDGFD methods.
KW - Data augmentation
KW - Distributionally robust optimization
KW - Intelligent fault diagnosis
KW - Pseudo-label
KW - Semi-supervised domain generalization
UR - https://www.scopus.com/pages/publications/105034380686
U2 - 10.1016/j.engappai.2026.114607
DO - 10.1016/j.engappai.2026.114607
M3 - 文章
AN - SCOPUS:105034380686
SN - 0952-1976
VL - 175
JO - Engineering Applications of Artificial Intelligence
JF - Engineering Applications of Artificial Intelligence
M1 - 114607
ER -