TY - JOUR
T1 - Pedestrian behavior prediction model with a convolutional LSTM encoder–decoder
AU - Chen, Kai
AU - Song, Xiao
AU - Han, Daolin
AU - Sun, Jinghan
AU - Cui, Yong
AU - Ren, Xiaoxiang
N1 - Publisher Copyright:
© 2020
PY - 2020/12/15
Y1 - 2020/12/15
N2 - Pedestrian behavior modeling is a challenging problem especially in crowded transportation scenarios. Some recent studies have addressed this problem using deep neural network, but the accuracy of trajectory prediction is still not high because the internal structure of the typical deep neural network with long short-term memory (LSTM) is a one-dimensional vector, which destroys the spatial information around a pedestrian. Therefore, these models cannot fully learn spatial sensing behavior of pedestrians. To solve this, we recommend using multi-channel tensors to represent the environmental information of pedestrians. Meanwhile, the spatiotemporal interactions among the pedestrians are represented by convolution operations of these tensors. Then, an end-to-end fully convolutional LSTM encoder–decoder is designed, trained and tested. Finally, our approach is compared with existing LSTM-based methods using five crowded video sequences with public datasets. The results show that our method reduces the displacement offset error and provides more realistic trajectory prediction in manifold cases.
AB - Pedestrian behavior modeling is a challenging problem especially in crowded transportation scenarios. Some recent studies have addressed this problem using deep neural network, but the accuracy of trajectory prediction is still not high because the internal structure of the typical deep neural network with long short-term memory (LSTM) is a one-dimensional vector, which destroys the spatial information around a pedestrian. Therefore, these models cannot fully learn spatial sensing behavior of pedestrians. To solve this, we recommend using multi-channel tensors to represent the environmental information of pedestrians. Meanwhile, the spatiotemporal interactions among the pedestrians are represented by convolution operations of these tensors. Then, an end-to-end fully convolutional LSTM encoder–decoder is designed, trained and tested. Finally, our approach is compared with existing LSTM-based methods using five crowded video sequences with public datasets. The results show that our method reduces the displacement offset error and provides more realistic trajectory prediction in manifold cases.
KW - Convolution
KW - Encoder–decoder
KW - Long short-term memory
KW - Pedestrian behavior model
KW - Trajectory prediction
UR - https://www.scopus.com/pages/publications/85090151801
U2 - 10.1016/j.physa.2020.125132
DO - 10.1016/j.physa.2020.125132
M3 - 文章
AN - SCOPUS:85090151801
SN - 0378-4371
VL - 560
JO - Physica A: Statistical Mechanics and its Applications
JF - Physica A: Statistical Mechanics and its Applications
M1 - 125132
ER -