TY - JOUR
T1 - Learning Human Behavior in Shared Control
T2 - Adaptive Inverse Differential Game Approach
AU - Wu, Huai Ning
AU - Wang, Mi
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2024/6/1
Y1 - 2024/6/1
N2 - To enhance the collaborative intelligence of a machine, it is important for the machine to understand what behavior a human may adopt to interact with the machine when performing a task in shared control. In this study, an online behavior learning method is proposed for continuous-time linear human-in-the-loop shared control systems by using the system state data only. A two-player nonzero-sum linear quadratic dynamic game paradigm is used for modeling the control interaction between a human operator and an automation that actively compensates for human control action. In this game model, the cost function representing the human behavior is assumed to have an unknown weighting matrix. Here, we want to learn the human behavior or retrieve the weighting matrix by using the system state data only. Accordingly, a new adaptive inverse differential game (IDG) method, which integrates concurrent learning (CL) and linear matrix inequality (LMI) optimization, is proposed. First, a CL-based adaptive law and an interactive controller of the automation are developed to estimate the feedback gain matrix of the human online, and second, an LMI optimization problem is solved to determine the weighting matrix of the human cost function. Finally, simulation results on a cooperative shared control driver assistance system are provided to elucidate the feasibility of the developed method.
AB - To enhance the collaborative intelligence of a machine, it is important for the machine to understand what behavior a human may adopt to interact with the machine when performing a task in shared control. In this study, an online behavior learning method is proposed for continuous-time linear human-in-the-loop shared control systems by using the system state data only. A two-player nonzero-sum linear quadratic dynamic game paradigm is used for modeling the control interaction between a human operator and an automation that actively compensates for human control action. In this game model, the cost function representing the human behavior is assumed to have an unknown weighting matrix. Here, we want to learn the human behavior or retrieve the weighting matrix by using the system state data only. Accordingly, a new adaptive inverse differential game (IDG) method, which integrates concurrent learning (CL) and linear matrix inequality (LMI) optimization, is proposed. First, a CL-based adaptive law and an interactive controller of the automation are developed to estimate the feedback gain matrix of the human online, and second, an LMI optimization problem is solved to determine the weighting matrix of the human cost function. Finally, simulation results on a cooperative shared control driver assistance system are provided to elucidate the feasibility of the developed method.
KW - Adaptive estimation
KW - concurrent learning (CL)
KW - human behavior learning
KW - inverse differential game (IDG)
KW - shared control
UR - https://www.scopus.com/pages/publications/85149425034
U2 - 10.1109/TCYB.2023.3244559
DO - 10.1109/TCYB.2023.3244559
M3 - 文章
C2 - 37027753
AN - SCOPUS:85149425034
SN - 2168-2267
VL - 54
SP - 3705
EP - 3715
JO - IEEE Transactions on Cybernetics
JF - IEEE Transactions on Cybernetics
IS - 6
ER -