TY - JOUR
T1 - Collaborative Multitask Learning Framework Based on Constrained Bi-Level Gradient Optimization for Mechanical Fault Diagnosis
AU - Su, Xuanyuan
AU - Ma, Yongzhe
AU - Ragulskis, Minvydas
AU - Suo, Mingliang
AU - Lu, Chen
AU - Wang, Xinwei
AU - Song, Dengwei
AU - Tao, Laifa
N1 - Publisher Copyright:
© 1963-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - In recent years, multitask learning (MTL) has attracted the increasing focus in the field of mechanical fault diagnosis. Relevant research shows that the MTL-based fault diagnosis frameworks generally achieve more satisfactory performance than the single-task ones. The key to promoting the above MTL frameworks is how to assign the task weights to balance the model training among multiple tasks. However, this issue has not received sufficient attention. In these frameworks, the task weights are generally statically assigned by manual predefinition or simple grid search, which is thus unable to provide the real-time guidance for the model training. In this regard, this article aims to integrate the multitask model parameters update and task weights assignment into a collaborative training procedure via a gradient-based approach. To this end, we propose a constrained bi-level gradient optimization (C-BLGO) algorithm. For each training epoch, C-BLGO can adaptively optimize the real-time task weights through the task-level and layer-level gradients calculated from the dynamically updated multitask model. In addition, C-BLGO can integrate prior task priorities to modify real-time task weights within a constrained range, so as to make the task weights assignment more controllable and targeted. Subsequently, these assigned task weights further provide the real-time guidance for the update of the entire multitask model. The proposed framework makes the training dynamics well-controlled and thus brings satisfactory performance gains. The experimental results on three public datasets and an electromechanical measurement system (EMS), demonstrate that our work can better improve the training stability and fault diagnosis accuracy compared to other mainstream MTL methods. Thanks to the model-independent characteristics, C-BLGO is scalable to various model architectures and thus is adapted to different fault diagnosis scenarios.
AB - In recent years, multitask learning (MTL) has attracted the increasing focus in the field of mechanical fault diagnosis. Relevant research shows that the MTL-based fault diagnosis frameworks generally achieve more satisfactory performance than the single-task ones. The key to promoting the above MTL frameworks is how to assign the task weights to balance the model training among multiple tasks. However, this issue has not received sufficient attention. In these frameworks, the task weights are generally statically assigned by manual predefinition or simple grid search, which is thus unable to provide the real-time guidance for the model training. In this regard, this article aims to integrate the multitask model parameters update and task weights assignment into a collaborative training procedure via a gradient-based approach. To this end, we propose a constrained bi-level gradient optimization (C-BLGO) algorithm. For each training epoch, C-BLGO can adaptively optimize the real-time task weights through the task-level and layer-level gradients calculated from the dynamically updated multitask model. In addition, C-BLGO can integrate prior task priorities to modify real-time task weights within a constrained range, so as to make the task weights assignment more controllable and targeted. Subsequently, these assigned task weights further provide the real-time guidance for the update of the entire multitask model. The proposed framework makes the training dynamics well-controlled and thus brings satisfactory performance gains. The experimental results on three public datasets and an electromechanical measurement system (EMS), demonstrate that our work can better improve the training stability and fault diagnosis accuracy compared to other mainstream MTL methods. Thanks to the model-independent characteristics, C-BLGO is scalable to various model architectures and thus is adapted to different fault diagnosis scenarios.
KW - Fault diagnosis
KW - gradient descent
KW - mechanical systems
KW - multiobjective optimization
KW - multitask learning (MTL)
UR - https://www.scopus.com/pages/publications/105014511743
U2 - 10.1109/TIM.2025.3604133
DO - 10.1109/TIM.2025.3604133
M3 - 文章
AN - SCOPUS:105014511743
SN - 0018-9456
VL - 74
JO - IEEE Transactions on Instrumentation and Measurement
JF - IEEE Transactions on Instrumentation and Measurement
M1 - 3558318
ER -