TY - GEN
T1 - Train Delay Prediction based on a Multimodal Deep-learning Method
AU - Chen, Shuai
AU - Wu, Xingtang
AU - Zhou, Min
AU - Yang, Bo
AU - Lü, Jinhu
AU - Dong, Hairong
N1 - Publisher Copyright:
© 2021 IEEE
PY - 2021
Y1 - 2021
N2 - High-speed rail (HSR) trains may suffer delays under emergencies, which will disturb the normal operation of trains and reduce passengers' travel satisfaction. Accurately predicting the delay time can help the dispatchers make advanced rescheduling strategies and thus improve train operation efficiency. Therefore, to quickly and effectively predict the delay time of trains for the HSR network under emergencies, a delay prediction model based on a multi-modal deep learning framework is proposed. The model considers the temporal and spatial characteristics and time-series characteristics of train operation, and then the Transformer and the three-dimensional convolutional neural network (3D CNN) are adopted to independently extract and merge these features. Specifically, the Transformer with the attention mechanism solves the interference mechanism among trains and stations on the delay of the train of interest. At the same time, 3D CNN is used to extract the temporal and spatial data characteristics of trains. Finally, the real operation data of the Beijing Railway Administration HSR network is adopted to verify the effectiveness of the proposed model. Results show that compared with the results obtained by other benchmark models, the predicted delays by the proposed model can lower the average absolute error (MAE), and root mean square error (RMSE) by 10% and 12%, specifically.
AB - High-speed rail (HSR) trains may suffer delays under emergencies, which will disturb the normal operation of trains and reduce passengers' travel satisfaction. Accurately predicting the delay time can help the dispatchers make advanced rescheduling strategies and thus improve train operation efficiency. Therefore, to quickly and effectively predict the delay time of trains for the HSR network under emergencies, a delay prediction model based on a multi-modal deep learning framework is proposed. The model considers the temporal and spatial characteristics and time-series characteristics of train operation, and then the Transformer and the three-dimensional convolutional neural network (3D CNN) are adopted to independently extract and merge these features. Specifically, the Transformer with the attention mechanism solves the interference mechanism among trains and stations on the delay of the train of interest. At the same time, 3D CNN is used to extract the temporal and spatial data characteristics of trains. Finally, the real operation data of the Beijing Railway Administration HSR network is adopted to verify the effectiveness of the proposed model. Results show that compared with the results obtained by other benchmark models, the predicted delays by the proposed model can lower the average absolute error (MAE), and root mean square error (RMSE) by 10% and 12%, specifically.
KW - 3D CNN
KW - deep learning
KW - delay prediction
KW - high-speed rail network
KW - Transformer
UR - https://www.scopus.com/pages/publications/85128092038
U2 - 10.1109/CAC53003.2021.9728179
DO - 10.1109/CAC53003.2021.9728179
M3 - 会议稿件
AN - SCOPUS:85128092038
T3 - Proceeding - 2021 China Automation Congress, CAC 2021
SP - 3241
EP - 3246
BT - Proceeding - 2021 China Automation Congress, CAC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 China Automation Congress, CAC 2021
Y2 - 22 October 2021 through 24 October 2021
ER -