Skip to main navigation Skip to search Skip to main content

An Improved Deep Reinforcement Learning-Based Method for Optimal Kill Chain Combination Selection

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper addresses key challenges in kill chain optimization, including complex state structures, diverse resource constraints, and inefficient strategy generation. To solve these issues, an improved Advantage Actor-Critic algorithm based on deep reinforcement learning is proposed for selecting optimal kill chain combinations. A simulation environment that incorporates resource and time constraints is developed to model distributed combat scenarios. A feature extraction network integrating a gated recurrent unit with a multi-head attention mechanism is designed to enhance the model's ability to capture state-time dependencies and identify key features. The model is trained using the Advantage ActorCritic algorithm to improve policy exploration efficiency and convergence. Experimental results from a combat scenario show that the proposed method outperforms baseline algorithms, including the basic Advantage Actor-Critic algorithm, Proximal Policy Optimization, as well as genetic algorithms in Heuristic algorithms. The results confirm the method's adaptability and practicality, providing a feasible solution for selecting kill chains in complex combat systems.

Original languageEnglish
Title of host publicationProceedings - 2025 16th International Conference on Reliability, Maintainability and Safety, ICRMS 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages316-321
Number of pages6
ISBN (Electronic)9798331535131
DOIs
StatePublished - 2025
Event16th International Conference on Reliability, Maintainability and Safety, ICRMS 2025 - Shanghai, China
Duration: 27 Jul 202530 Jul 2025

Publication series

NameProceedings - 2025 16th International Conference on Reliability, Maintainability and Safety, ICRMS 2025

Conference

Conference16th International Conference on Reliability, Maintainability and Safety, ICRMS 2025
Country/TerritoryChina
CityShanghai
Period27/07/2530/07/25

Keywords

  • Combinatorial Optimization
  • Deep Reinforcement Learning
  • Improved A2C Algorithm
  • Optimal Kill Chain Combination Selection
  • Resource Constraints

Fingerprint

Dive into the research topics of 'An Improved Deep Reinforcement Learning-Based Method for Optimal Kill Chain Combination Selection'. Together they form a unique fingerprint.

Cite this