TY - GEN
T1 - Multimodal Feature Fusion for Lithium Battery Remaining Capacity Prediction
AU - Gao, Yudong
AU - Zhao, Nuo
AU - Jiao, Jian
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Accurate prediction of the remaining capacity of lithium batteries is a core challenge in battery health management systems. To address the limitations of existing approaches, which are often sensitive to noise in capacity sequences and insufficient in modeling complex temporal dependencies, this paper proposes a novel multimodal feature fusion method that integrates timedomain, frequency-domain, and state transition representations for remaining capacity prediction. First, Singular Spectrum Analysis is employed to capture the trend components of the original capacity signals. The Transformer encoder is then used to extract time-domain features from the raw capacity sequences. Subsequently, the original signal is transformed into Continuous Wavelet Transform and Markov Transition Field representations to capture frequency characteristics and state transition patterns of the time series, respectively. Convolutional Neural Networks are applied to the CWT spectrogram to extract deep spectral features and to the MTF image to learn temporal correlation features. Finally, the features from different modalities are fused and passed through fully connected layers for final prediction of battery remaining capacity. Experimental results demonstrate that the proposed multimodal fusion model outperforms traditional models, validating its accuracy and effectiveness for battery health management.
AB - Accurate prediction of the remaining capacity of lithium batteries is a core challenge in battery health management systems. To address the limitations of existing approaches, which are often sensitive to noise in capacity sequences and insufficient in modeling complex temporal dependencies, this paper proposes a novel multimodal feature fusion method that integrates timedomain, frequency-domain, and state transition representations for remaining capacity prediction. First, Singular Spectrum Analysis is employed to capture the trend components of the original capacity signals. The Transformer encoder is then used to extract time-domain features from the raw capacity sequences. Subsequently, the original signal is transformed into Continuous Wavelet Transform and Markov Transition Field representations to capture frequency characteristics and state transition patterns of the time series, respectively. Convolutional Neural Networks are applied to the CWT spectrogram to extract deep spectral features and to the MTF image to learn temporal correlation features. Finally, the features from different modalities are fused and passed through fully connected layers for final prediction of battery remaining capacity. Experimental results demonstrate that the proposed multimodal fusion model outperforms traditional models, validating its accuracy and effectiveness for battery health management.
KW - lithium battery
KW - Markov transition field
KW - multimodal learning
KW - remaining capacity prediction
KW - Transformer
UR - https://www.scopus.com/pages/publications/105030096422
U2 - 10.1109/ICRMS65480.2025.00075
DO - 10.1109/ICRMS65480.2025.00075
M3 - 会议稿件
AN - SCOPUS:105030096422
T3 - Proceedings - 2025 16th International Conference on Reliability, Maintainability and Safety, ICRMS 2025
SP - 400
EP - 405
BT - Proceedings - 2025 16th International Conference on Reliability, Maintainability and Safety, ICRMS 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 16th International Conference on Reliability, Maintainability and Safety, ICRMS 2025
Y2 - 27 July 2025 through 30 July 2025
ER -