TY - JOUR
T1 - Bi-Layered Synchronized Optimization Control With Prescribed Performance for Vehicle Platoon
AU - Zhang, Yuxiang
AU - Liang, Xiaoling
AU - Li, Dongyu
AU - Sam Ge, Shuzhi
AU - Heng Lee, Tong
N1 - Publisher Copyright:
© 2000-2011 IEEE.
PY - 2024
Y1 - 2024
N2 - This paper studies synchronized optimization control for the cooperatively connected autonomous vehicle platoon formulation applicable in various driving scenarios and accommodates multiple vehicles dynamically entering or exiting the platoon. More specifically, the proposed algorithm consists of bi-layered synchronized optimization that enables the ultimate optimized control to attain the synchronized convergence property, and also importantly, ensures the satisfaction of safety performance requirements. The first layer of the proposed approach involves formulating the platoon dynamics and ensuring that the platoon operates within safe boundaries while optimizing its overall performance. To achieve this, the prescribed performance control is utilized to ensure that the state-variables remain within a predefined region throughout the synchronized optimization process. In the second layer, the control optimization takes into account the vehicle dynamics and actuators of either heterogeneous or homogeneous individual vehicles, improving performance and coordination within the platoon. In each optimization layer, the optimized backstepping is utilized, and the norm-normalized sign function is appropriately incorporated with the decomposition design to establish the learning framework with the outcome that attains the synchronized properties simultaneously. The adaptive dynamic programming and gradient-constrained method are utilized in the learning design to iteratively optimize system control while keeping the learning parts within the admissible policy region. Importantly, it is rigorously shown that this particular development and methodology attains the noteworthy time-synchronized stability property and outcome that all vehicle agents arrive at the desired relative position at the same time with synchronized convergence. Additionally, it is also shown that the methodology of our specific algorithmic strategy significantly also attains the desired outcomes of 'string stability' (jointly with the above-mentioned desired outcomes of 'time-synchronized stability'). To evaluate its effectiveness, comparative studies with different methods are carried out to showcase the significantly better desired outcomes attained with this methodology of synchronized optimization. Further evaluations in scenarios involving dynamic entry and exit of multiple vehicles demonstrate the corresponding capability and effectiveness in achieving the desired objectives.
AB - This paper studies synchronized optimization control for the cooperatively connected autonomous vehicle platoon formulation applicable in various driving scenarios and accommodates multiple vehicles dynamically entering or exiting the platoon. More specifically, the proposed algorithm consists of bi-layered synchronized optimization that enables the ultimate optimized control to attain the synchronized convergence property, and also importantly, ensures the satisfaction of safety performance requirements. The first layer of the proposed approach involves formulating the platoon dynamics and ensuring that the platoon operates within safe boundaries while optimizing its overall performance. To achieve this, the prescribed performance control is utilized to ensure that the state-variables remain within a predefined region throughout the synchronized optimization process. In the second layer, the control optimization takes into account the vehicle dynamics and actuators of either heterogeneous or homogeneous individual vehicles, improving performance and coordination within the platoon. In each optimization layer, the optimized backstepping is utilized, and the norm-normalized sign function is appropriately incorporated with the decomposition design to establish the learning framework with the outcome that attains the synchronized properties simultaneously. The adaptive dynamic programming and gradient-constrained method are utilized in the learning design to iteratively optimize system control while keeping the learning parts within the admissible policy region. Importantly, it is rigorously shown that this particular development and methodology attains the noteworthy time-synchronized stability property and outcome that all vehicle agents arrive at the desired relative position at the same time with synchronized convergence. Additionally, it is also shown that the methodology of our specific algorithmic strategy significantly also attains the desired outcomes of 'string stability' (jointly with the above-mentioned desired outcomes of 'time-synchronized stability'). To evaluate its effectiveness, comparative studies with different methods are carried out to showcase the significantly better desired outcomes attained with this methodology of synchronized optimization. Further evaluations in scenarios involving dynamic entry and exit of multiple vehicles demonstrate the corresponding capability and effectiveness in achieving the desired objectives.
KW - Reinforcement learning
KW - adaptive dynamic programming
KW - autonomous vehicles
KW - time-synchronized convergence
UR - https://www.scopus.com/pages/publications/85208742767
U2 - 10.1109/TITS.2024.3432149
DO - 10.1109/TITS.2024.3432149
M3 - 文章
AN - SCOPUS:85208742767
SN - 1524-9050
VL - 25
SP - 16473
EP - 16489
JO - IEEE Transactions on Intelligent Transportation Systems
JF - IEEE Transactions on Intelligent Transportation Systems
IS - 11
ER -