Multiagent RL-Based Joint Trajectory Scheduling and Resource Allocation in NOMA-Assisted UAV Swarm Network

Resource Type: Periodical
Authors: Dai, X.; Lu, Z.; Chen, X.; Xu, X.; Tang, F.
Source: IEEE Internet of Things Journal (IEEE Internet Things J.), 11(8):14153-14167, Apr. 2024
Subject: Computing and Processing
Communication, Networking and Broadcast Technologies
Autonomous aerial vehicles
NOMA
Interference
Trajectory
Downlink
Clustering algorithms
Resource management
Clustering
interference
nonorthogonal multiple access (NOMA)
reinforcement learning (RL)
unmanned aerial vehicle (UAV) swarm network
Language: English
ISSN: 2327-4662
2372-2541


Abstract

In this article, we propose a downlink communication scheme for large-scale, high-interference unmanned aerial vehicle (UAV) swarm networks based on nonorthogonal multiple access (NOMA), clustering, and reinforcement learning (RL). Because a large number of UAVs increases the complexity of downlink communication, we first introduce a load-balancing fuzzy C-Means (LB-FCMs) algorithm for UAV clustering. Downlink communication consists of three stages: 1) UAV clustering; 2) data aggregation; and 3) data offloading. We have two goals: 1) maximize the data aggregation rate of the network while ensuring fair spectrum access among UAVs for UAV-to-UAV (U2U) communications during data aggregation and 2) maximize the data offloading rate of the network while ensuring ground station priority for UAV-to-ground (U2G) communications during data offloading. To address these two problems, we first introduce uplink NOMA and downlink NOMA, respectively, to eliminate part of the intrasystem interference. We then propose a multiagent RL framework for optimizing channel, transmit power, and trajectory scheduling (MARL-CPT). MARL-CPT consists of two algorithms, one for the optimization problem of each stage. Simulation results show that the proposed method outperforms random decision-making and polling-based single-agent RL methods in terms of final score, fairness, and priority. For trajectory scheduling during data offloading, our method finds the optimal hover position in less than half the time required by single-agent RL methods.
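
As a rough illustration of how downlink NOMA can remove part of the interference between users that share a channel, the sketch below computes achievable rates for the textbook two-user power-domain case with successive interference cancellation (SIC). It is not the system model of the article: the function name `noma_downlink_rates`, its parameters, the two-user restriction, and the perfect-SIC assumption are all hypothetical choices made for illustration.

```python
import math


def noma_downlink_rates(p_total, alpha_far, gain_near, gain_far, noise):
    """Return (rate_near, rate_far) in bit/s/Hz for two-user downlink NOMA.

    Illustrative assumptions (not taken from the article):
      - one transmitter serves two receivers on the same channel;
      - gain_near >= gain_far (the "near" user has the stronger channel);
      - the far user gets the power fraction alpha_far, the near user the rest;
      - SIC at the near user is perfect.
    """
    if not 0.0 < alpha_far < 1.0:
        raise ValueError("alpha_far must lie strictly between 0 and 1")
    if gain_near < gain_far:
        raise ValueError("gain_near must be at least gain_far")

    p_far = alpha_far * p_total
    p_near = p_total - p_far

    # The far user decodes its own signal and treats the near user's
    # superimposed signal as additional noise.
    rate_far = math.log2(1.0 + p_far * gain_far / (p_near * gain_far + noise))

    # The near user first decodes and subtracts the far user's signal (SIC),
    # so its own signal is then limited by noise only.
    rate_near = math.log2(1.0 + p_near * gain_near / noise)

    return rate_near, rate_far


if __name__ == "__main__":
    near, far = noma_downlink_rates(
        p_total=1.0, alpha_far=0.8, gain_near=1.0, gain_far=0.1, noise=0.01
    )
    print(f"near-user rate: {near:.3f} bit/s/Hz")
    print(f"far-user rate:  {far:.3f} bit/s/Hz")
```

In this simplified model only the near user is freed from the other user's signal, which matches the abstract's wording that NOMA eliminates part, not all, of the intrasystem interference. The remaining channel, power, and trajectory decisions are what the article assigns to the RL framework.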
