TY - JOUR
T1 - Multi-Agent Global Prioritized Experience Learning for UAV Cooperative Jamming in Secure Communication
AU - Wang, Saier
AU - Zhang, Yan
AU - Chen, Mingyu
AU - Zhang, Wancheng
AU - He, Zunwen
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - In uncrewed aerial vehicle (UAV) communication networks, line-of-sight (LoS) propagation links leave transmitted information vulnerable to wiretapping by ground eavesdroppers (GEs). This paper focuses on maximizing the average secrecy rate when multiple UAV jammers help multiple UAV transmitters defend against GEs. We propose a multi-agent global prioritized experience learning (MAGPEL) algorithm that jointly optimizes the sub-channel allocation, locations, and power levels of the UAV transmitters together with the locations and power levels of the UAV jammers. Each UAV acts as an agent and is trained with global information comprising the states and actions of all UAVs. In addition, the temporal-difference error (TD-error) measures the significance of each experience and determines its sampling probability, so that more significant experiences are sampled with higher probability during training. Simulation results show that the proposed algorithm achieves better convergence and a higher secrecy rate than other state-of-the-art methods.
AB - In uncrewed aerial vehicle (UAV) communication networks, line-of-sight (LoS) propagation links leave transmitted information vulnerable to wiretapping by ground eavesdroppers (GEs). This paper focuses on maximizing the average secrecy rate when multiple UAV jammers help multiple UAV transmitters defend against GEs. We propose a multi-agent global prioritized experience learning (MAGPEL) algorithm that jointly optimizes the sub-channel allocation, locations, and power levels of the UAV transmitters together with the locations and power levels of the UAV jammers. Each UAV acts as an agent and is trained with global information comprising the states and actions of all UAVs. In addition, the temporal-difference error (TD-error) measures the significance of each experience and determines its sampling probability, so that more significant experiences are sampled with higher probability during training. Simulation results show that the proposed algorithm achieves better convergence and a higher secrecy rate than other state-of-the-art methods.
KW - Location deployment
KW - multi-agent deep reinforcement learning
KW - physical layer security
KW - resource allocation
KW - uncrewed aerial vehicle
UR - http://www.scopus.com/pages/publications/105011856378
U2 - 10.1109/TSIPN.2025.3592341
DO - 10.1109/TSIPN.2025.3592341
M3 - Article
AN - SCOPUS:105011856378
SN - 2373-776X
VL - 11
SP - 916
EP - 927
JO - IEEE Transactions on Signal and Information Processing over Networks
JF - IEEE Transactions on Signal and Information Processing over Networks
ER -