Multi-agent Recurrent Actor-Critic for Cooperative Decision-Making in Within Visual Range Air Combat

Can Chen; Dengyu Yin; Li Mo; Maolong Lv; Dan Lin

doi:10.1007/978-981-96-2240-5_39

Can Chen, Dengyu Yin, Li Mo^*, Maolong Lv, Dan Lin

^*Corresponding author for this work

School of Aerospace Science and Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Peer-reviewed

Abstract

In recent years, the significance of cooperative decision-making in autonomous air combat scenarios has gained widespread recognition. Consequently, this paper introduces an innovative algorithm named Multi-Agent Recurrent Actor-Critic (MARAC), explicitly designed to enhance cooperative decision-making in autonomous within visual range (WVR) air combat. By leveraging the Centralized-Training-Distributed-Execution (CTDE) framework and utilizing recurrent neural networks, the MARAC algorithm improves the efficacy of communication-independent cooperative air combat strategies, resulting in more effective outcomes. Furthermore, the incorporation of curriculum learning (CL) and self-play (SP) techniques is proposed to boost the algorithm’s learning efficiency. Experimental results demonstrate that the MARAC algorithm significantly enhances the performance of cooperative decision-making by effectively addressing challenges associated with partial observations and complex confrontation dynamics.

Original language	English
Title of host publication	Advances in Guidance, Navigation and Control - Proceedings of 2024 International Conference on Guidance, Navigation and Control Volume 11
Editors	Liang Yan, Haibin Duan, Yimin Deng
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	392-401
Number of pages	10
ISBN (Print)	9789819622399
DOI	http://doi.org/10.1007/978-981-96-2240-5_39
Publication status	Published - 2025
Event	International Conference on Guidance, Navigation and Control, ICGNC 2024 - Changsha, China. Duration: 9 Aug 2024 → 11 Aug 2024

Publication series

Name	Lecture Notes in Electrical Engineering
Volume	1347 LNEE
ISSN (Print)	1876-1100
ISSN (Electronic)	1876-1119

Conference

Conference	International Conference on Guidance, Navigation and Control, ICGNC 2024
Country/Region	China
City	Changsha
Period	9/08/24 → 11/08/24

Access to document

10.1007/978-981-96-2240-5_39

Other files and links

Link to publication in Scopus

Cite this

Chen, C., Yin, D., Mo, L., Lv, M., & Lin, D. (2025). Multi-agent Recurrent Actor-Critic for Cooperative Decision-Making in Within Visual Range Air Combat. In L. Yan, H. Duan, & Y. Deng (Eds.), Advances in Guidance, Navigation and Control - Proceedings of 2024 International Conference on Guidance, Navigation and Control Volume 11 (pp. 392-401). (Lecture Notes in Electrical Engineering; Vol. 1347 LNEE). Springer Science and Business Media Deutschland GmbH. http://doi.org/10.1007/978-981-96-2240-5_39

Chen, Can ; Yin, Dengyu ; Mo, Li et al. / Multi-agent Recurrent Actor-Critic for Cooperative Decision-Making in Within Visual Range Air Combat. Advances in Guidance, Navigation and Control - Proceedings of 2024 International Conference on Guidance, Navigation and Control Volume 11. editor / Liang Yan ; Haibin Duan ; Yimin Deng. Springer Science and Business Media Deutschland GmbH, 2025. pp. 392-401 (Lecture Notes in Electrical Engineering).

@inproceedings{a9ddc0dfb7c84a1cbe71fb45748c57f8,

title = "Multi-agent Recurrent Actor-Critic for Cooperative Decision-Making in Within Visual Range Air Combat",

abstract = "In recent years, the significance of cooperative decision-making in autonomous air combat scenarios has gained widespread recognition. Consequently, this paper introduces an innovative algorithm named Multi-Agent Recurrent Actor-Critic (MARAC), explicitly designed to enhance cooperative decision-making in autonomous within visual range (WVR) air combat. By leveraging the Centralized-Training-Distributed-Execution (CTDE) framework and utilizing recurrent neural networks, the MARAC algorithm improves the efficacy of communication-independent cooperative air combat strategies, resulting in more effective outcomes. Furthermore, the incorporation of curriculum learning (CL) and self-play (SP) techniques is proposed to boost the algorithm{\textquoteright}s learning efficiency. Experimental results demonstrate that the MARAC algorithm significantly enhances the performance of cooperative decision-making by effectively addressing challenges associated with partial observations and complex confrontation dynamics.",

keywords = "Autonomous Air Combat, Cooperative Decision-Making, Multi-Agent Reinforcement Learning, Recurrent Neural Network",

author = "Can Chen and Dengyu Yin and Li Mo and Maolong Lv and Dan Lin",

note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.; International Conference on Guidance, Navigation and Control, ICGNC 2024 ; Conference date: 09-08-2024 Through 11-08-2024",

year = "2025",

doi = "10.1007/978-981-96-2240-5\_39",

language = "English",

isbn = "9789819622399",

series = "Lecture Notes in Electrical Engineering",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "392--401",

editor = "Liang Yan and Haibin Duan and Yimin Deng",

booktitle = "Advances in Guidance, Navigation and Control - Proceedings of 2024 International Conference on Guidance, Navigation and Control Volume 11",

address = "Germany",

}

Chen, C, Yin, D, Mo, L, Lv, M & Lin, D 2025, Multi-agent Recurrent Actor-Critic for Cooperative Decision-Making in Within Visual Range Air Combat. In L Yan, H Duan & Y Deng (eds), Advances in Guidance, Navigation and Control - Proceedings of 2024 International Conference on Guidance, Navigation and Control Volume 11. Lecture Notes in Electrical Engineering, vol. 1347 LNEE, Springer Science and Business Media Deutschland GmbH, pp. 392-401, International Conference on Guidance, Navigation and Control, ICGNC 2024, Changsha, China, 9/08/24. http://doi.org/10.1007/978-981-96-2240-5_39

Multi-agent Recurrent Actor-Critic for Cooperative Decision-Making in Within Visual Range Air Combat. / Chen, Can; Yin, Dengyu; Mo, Li et al.
Advances in Guidance, Navigation and Control - Proceedings of 2024 International Conference on Guidance, Navigation and Control Volume 11. editor / Liang Yan; Haibin Duan; Yimin Deng. Springer Science and Business Media Deutschland GmbH, 2025. pp. 392-401 (Lecture Notes in Electrical Engineering; Vol. 1347 LNEE).

TY - GEN

T1 - Multi-agent Recurrent Actor-Critic for Cooperative Decision-Making in Within Visual Range Air Combat

AU - Chen, Can

AU - Yin, Dengyu

AU - Mo, Li

AU - Lv, Maolong

AU - Lin, Dan

N1 - Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

PY - 2025

Y1 - 2025

N2 - In recent years, the significance of cooperative decision-making in autonomous air combat scenarios has gained widespread recognition. Consequently, this paper introduces an innovative algorithm named Multi-Agent Recurrent Actor-Critic (MARAC), explicitly designed to enhance cooperative decision-making in autonomous within visual range (WVR) air combat. By leveraging the Centralized-Training-Distributed-Execution (CTDE) framework and utilizing recurrent neural networks, the MARAC algorithm improves the efficacy of communication-independent cooperative air combat strategies, resulting in more effective outcomes. Furthermore, the incorporation of curriculum learning (CL) and self-play (SP) techniques is proposed to boost the algorithm’s learning efficiency. Experimental results demonstrate that the MARAC algorithm significantly enhances the performance of cooperative decision-making by effectively addressing challenges associated with partial observations and complex confrontation dynamics.

AB - In recent years, the significance of cooperative decision-making in autonomous air combat scenarios has gained widespread recognition. Consequently, this paper introduces an innovative algorithm named Multi-Agent Recurrent Actor-Critic (MARAC), explicitly designed to enhance cooperative decision-making in autonomous within visual range (WVR) air combat. By leveraging the Centralized-Training-Distributed-Execution (CTDE) framework and utilizing recurrent neural networks, the MARAC algorithm improves the efficacy of communication-independent cooperative air combat strategies, resulting in more effective outcomes. Furthermore, the incorporation of curriculum learning (CL) and self-play (SP) techniques is proposed to boost the algorithm’s learning efficiency. Experimental results demonstrate that the MARAC algorithm significantly enhances the performance of cooperative decision-making by effectively addressing challenges associated with partial observations and complex confrontation dynamics.

KW - Autonomous Air Combat

KW - Cooperative Decision-Making

KW - Multi-Agent Reinforcement Learning

KW - Recurrent Neural Network

UR - http://www.scopus.com/pages/publications/105000772841

U2 - 10.1007/978-981-96-2240-5_39

DO - 10.1007/978-981-96-2240-5_39

M3 - Conference contribution

AN - SCOPUS:105000772841

SN - 9789819622399

T3 - Lecture Notes in Electrical Engineering

SP - 392

EP - 401

BT - Advances in Guidance, Navigation and Control - Proceedings of 2024 International Conference on Guidance, Navigation and Control Volume 11

A2 - Yan, Liang

A2 - Duan, Haibin

A2 - Deng, Yimin

PB - Springer Science and Business Media Deutschland GmbH

T2 - International Conference on Guidance, Navigation and Control, ICGNC 2024

Y2 - 9 August 2024 through 11 August 2024

ER -

Chen C, Yin D, Mo L, Lv M, Lin D. Multi-agent Recurrent Actor-Critic for Cooperative Decision-Making in Within Visual Range Air Combat. In Yan L, Duan H, Deng Y, editors, Advances in Guidance, Navigation and Control - Proceedings of 2024 International Conference on Guidance, Navigation and Control Volume 11. Springer Science and Business Media Deutschland GmbH. 2025. p. 392-401. (Lecture Notes in Electrical Engineering). doi: 10.1007/978-981-96-2240-5_39
