Unmanned aerial vehicles (UAVs) equipped with computational servers allow user equipment (UE) to offload complex computational tasks, addressing the limitations of edge computing in remote or resource-constrained environments. Value decomposition algorithms for UAV trajectory planning have attracted considerable research attention. However, existing value decomposition algorithms often struggle to associate local observations with the global state of the UAV cluster, which limits their task-solving capability, lowers task completion rates, and prolongs convergence times. To address these challenges, this paper introduces a multi-agent deep learning framework that formulates multi-UAV trajectory optimization as a decentralized partially observable Markov decision process (Dec-POMDP). The framework integrates the QTRAN algorithm with a large language model (LLM) for efficient region decomposition and employs graph convolutional networks (GCNs) combined with self-attention mechanisms to model relationships among sub-regions. Simulation results show that the proposed method outperforms existing deep reinforcement learning methods, improving both convergence speed and task completion rate by more than 10%. Overall, the framework advances UAV trajectory optimization and enhances the performance of multi-agent systems in UAV-assisted edge computing environments.
Keywords: LLM; UAV; multi-agent deep learning; trajectory planning.
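To make the GCN-with-self-attention component concrete, the sketch below shows one way such a module could be structured in PyTorch: a single graph-convolution propagation step over sub-region node features followed by multi-head self-attention across sub-regions. This is a minimal illustration under assumed inputs (a normalized adjacency matrix and per-sub-region feature vectors); the class name, dimensions, and layer choices are hypothetical and are not taken from the paper's implementation.

```python
# Illustrative sketch only (not the paper's code): one GCN propagation step
# followed by self-attention over sub-region node embeddings. Assumes a
# normalized adjacency matrix a_hat and node features x are available.
import torch
import torch.nn as nn
import torch.nn.functional as F

class GCNSelfAttention(nn.Module):
    def __init__(self, in_dim, hidden_dim, num_heads=4):
        super().__init__()
        self.gcn_weight = nn.Linear(in_dim, hidden_dim, bias=False)  # GCN feature projection
        self.attn = nn.MultiheadAttention(hidden_dim, num_heads, batch_first=True)

    def forward(self, x, a_hat):
        # x: (batch, num_subregions, in_dim) sub-region features
        # a_hat: (num_subregions, num_subregions) normalized adjacency, e.g. D^-1/2 (A + I) D^-1/2
        h = F.relu(a_hat @ self.gcn_weight(x))   # one graph-convolution step
        out, _ = self.attn(h, h, h)              # self-attention across sub-regions
        return out

# Toy usage: 6 sub-regions, each with an 8-dimensional observation feature
x = torch.randn(1, 6, 8)
a = torch.eye(6)                                 # placeholder adjacency matrix
model = GCNSelfAttention(in_dim=8, hidden_dim=16)
print(model(x, a).shape)                         # torch.Size([1, 6, 16])
```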