Dynamic and adaptive learning for autonomous decision-making in beyond visual range air combat

Wenfei Wang, Le Ru*, Maolong Lv, Li Mo

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

The environment of beyond-visual-range (BVR) air combat is complex and dynamic, making traditional decision-making methods insufficient for modern combat scenarios. This paper first analyzes the confrontation process in BVR air combat and develops a corresponding decision-making model for air combat. To address the challenge of coupling maneuver and missile launch decisions, we propose a hybrid bifurcation action space design method, allowing for more precise control and improved learning. Additionally, this paper introduces Progressive Opponent Reinforcement Learning (PORL), which incorporates progressively challenging opponents to simulate real-world adversary strategies. Based on the Soft Actor-Critic (SAC) algorithm, this method strengthens the exploration and utilization of learning balance through maximum entropy, and dynamically adjusts the opponent's tactics according to the agent's performance, thus improving the agent's learning efficiency and adaptability in the rapidly changing confrontation environment. Furthermore, a dynamic opponent sampling mechanism is designed to select adversaries with varying difficulty levels based on the agent's current performance, ensuring a balanced training process. Simulation results demonstrate that the proposed decision-making framework significantly improves the autonomous decision-making capabilities and countermeasure effectiveness of agents in BVR air combat.

源语言英语
文章编号110327
期刊Aerospace Science and Technology
163
DOI
出版状态已出版 - 8月 2025
已对外发布

指纹

探究 'Dynamic and adaptive learning for autonomous decision-making in beyond visual range air combat' 的科研主题。它们共同构成独一无二的指纹。

引用此