发行时间:2024.05.21
总页数:21
编辑:
harlan
摘要:
www.nature.com/scientificreports
OPEN
A hierarchical reinforcement learning method for missile evasion and guidance
Mengda Yan1,2*, Rennong Yang1,2, Ying Zhang1,2, Longfei Yue1 & Dongyuan Hu1
This paper proposes an algorithm for missile manoeuvring based on a hierarchical proximal policy optimization (PPO) reinforcement learning algorithm, which enables a missile to guide to a target and evade an interceptor at the same time. Based on the idea of task hierarchy, the agent has a two-layer str...