Scalable Cooperative Decision-Making in Multi-UAV Confrontations: An Attention-Based Multi-Agent Actor-Critic Approach

Can Chen, Tao Song, Li Mo*, Maolong Lv, Yinan Yu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

With the increasing use of unmanned aerial vehicles (UAVs) in military operations, autonomous cooperative decision-making for multiple UAVs in aerial confrontations has become a critical research challenge. This paper presents an attention-based multi-agent actor-critic (AMAAC) algorithm for UAV aerial confrontation decision-making. The algorithm combines multi-head attention and self-play within the centralized training-distributed execution (CTDE) framework, extending the actor-critic approach based on the missile hit probability prediction model (MHPAC) to multi-UAV scenarios. A fighter observation encoder (OFE) and a centralized critic network based on the attention mechanism are introduced to adapt to varying number of UAVs (different scales) and enhance training performance. Additionally, self-play-based extended training is used to generalize offensive and defensive strategies from small-scale aerial confrontations to larger scenarios. Experimental results demonstrate that the AMAAC algorithm achieves superior training effectiveness, and the strategies it produces perform well across various confrontation scales, even beyond the training scenario's scale. Compared to other decision-making algorithms, such as Multi-agent Proximal Policy Optimization (MAPPO), Multi-agent Hierarchical Policy Gradient (MAHPG), and the State-Event-Condition-Action (SECA) algorithm, the AMAAC-trained strategies yield higher win ratios and kill-death ratios in different scenarios, validating the algorithm's effectiveness and scalability.

Original languageEnglish
JournalIEEE Transactions on Aerospace and Electronic Systems
DOIs
Publication statusAccepted/In press - 2025
Externally publishedYes

Keywords

  • Aerial confrontation
  • attention mechanism
  • reinforcement learning
  • scalability
  • unmanned aerial vehicles

Fingerprint

Dive into the research topics of 'Scalable Cooperative Decision-Making in Multi-UAV Confrontations: An Attention-Based Multi-Agent Actor-Critic Approach'. Together they form a unique fingerprint.

Cite this