Skip to main navigation Skip to search Skip to main content

Time-and-Angle-Constrained Cooperative Guidance Based on Reinforcement Learning

  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Time-and-angle-constrained cooperative guidance based on reinforcement learning is developed to address the problem of cooperative guidance. Firstly, time-constrained guidance law and angle-constrained guidance law are designed separately, and the calculation methods for cooperative time and angle are designed. Time-and-angle-constrained cooperative guidance law is obtained by a weight coefficient. Secondly, reinforcement learning is used to optimize the coefficient. A simple observation variable set is created. Deep deterministic policy gradient (DDPG) algorithm is applied to agents, as well as network structure and reward are designed. Thirdly, agents are trained and the coefficient of cooperative guidance law can be output by trained agents. Comparative simulation is conducted to verify the performance of the guidance law.

Original languageEnglish
Title of host publicationProceedings of 2023 7th Chinese Conference on Swarm Intelligence and Cooperative Control - Swarm Guidance Technologies
EditorsGuo-Ping Jiang, Mengyi Wang, Zhang Ren
PublisherSpringer Science and Business Media Deutschland GmbH
Pages317-329
Number of pages13
ISBN (Print)9789819733392
DOIs
StatePublished - 2024
Event7th Chinese Conference on Swarm Intelligence and Cooperative Control, CCSICC 2023 - Nanjing, China
Duration: 24 Nov 202327 Nov 2023

Publication series

NameLecture Notes in Electrical Engineering
Volume1204 LNEE
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference7th Chinese Conference on Swarm Intelligence and Cooperative Control, CCSICC 2023
Country/TerritoryChina
CityNanjing
Period24/11/2327/11/23

Keywords

  • angle constraint
  • cooperative guidance
  • reinforcement learning
  • time constraint

Fingerprint

Dive into the research topics of 'Time-and-Angle-Constrained Cooperative Guidance Based on Reinforcement Learning'. Together they form a unique fingerprint.

Cite this