UPM Institutional Repository

Efficient multi-agent deep reinforcement learning algorithm for multi UAV collision avoidance


Citation

Rezaee, Mohammad Reza and Abdul Hamid, Nor Asilah Wati and Hussin, Masnida and Ahmad Zukarnain, Zuriati (2026) Efficient multi-agent deep reinforcement learning algorithm for multi UAV collision avoidance. Applied Soft Computing, 197. art. no. 115145. pp. 1-12. ISSN 1568-4946

Abstract

The rapid expansion of unmanned aerial vehicles (UAVs) across industries has led to increased airspace congestion. The increasing use of drones across many fields and locations has caused serious problems, especially in avoiding collisions. In the rapidly developing field of drone technology, ensuring UAV flight safety and reducing the risk of UAV collisions have therefore become urgent concerns. There are many artificial intelligence (AI) algorithms designed to solve this problem, but most work only in situations with a single agent. Multi-agent reinforcement learning is a promising way to solve these problems. It enables drones to operate with greater intelligence and flexibility, even in challenging situations, alongside other agents. This work presents a Multi-Agent Deep Reinforcement Learning algorithm based on efficient graph attention network for collision avoidance in a dense, complex multi UAV environment. We propose both curriculum learning and transfer learning by adding more agents over time and subsequently employing learning models. This makes the system more scalable and more coordinated. The training process is significantly enhanced by the suggested method, which outperforms the current baselines in continuous settings. Our findings indicate that the proposed approach achieves 17% higher cumulative reward, up to 10% fewer loss-of-separation time steps, and about 44% fewer active interaction edges than the benchmark. Furthermore, the proposed method reduces action-selection bias, improving decision-making stability in dense multi-UAV settings.


Download File

[img] Text
125195.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (3MB)

Additional Metadata

Item Type: Article
Subject: Software
Divisions: Faculty of Computer Science and Information Technology
Institute for Mathematical Research
DOI Number: https://doi.org/10.1016/j.asoc.2026.115145
Publisher: Elsevier Ltd
Keywords: Collision avoidance; Deep reinforcement learning; Graph attention network; Multi-agent learning; Unmanned aerial vehicle
Sustainable Development Goals (SDGs): SDG 9: Industry, Innovation and Infrastructure, SDG 11: Sustainable Cities and Communities, SDG 7: Affordable and Clean Energy
Depositing User: Ms. Siti Radziah Mohamed@mahmod
Date Deposited: 05 May 2026 00:43
Last Modified: 05 May 2026 00:43
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1016/j.asoc.2026.115145
URI: http://psasir.upm.edu.my/id/eprint/125195
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item