TY - GEN
T1 - Multi-Head PPO for Joint Resource Allocation in FSO-RF Space-Air-Ground Networks
AU - Ali, Zain
AU - Ammuri, Rula
AU - Hasna, Mazen Omar
AU - Qaraqe, Khalid
N1 - Publisher Copyright: © 2025 IEEE.
PY - 2025/12/5
Y1 - 2025/12/5
N2 - Non-terrestrial networks extend connectivity to remote and disaster-affected areas and provide high-capacity backhaul, making them attractive for resilient wide-area communications. This paper studies joint resource allocation in a space-air-ground system where a free-space optical (FSO) satellite backhaul serves multiple multi-antenna unmanned aerial vehicle (UAV) relays that in turn serve ground users over RF links. The resulting optimization mixes continuous power-splitting decisions with discrete beam-selection choices from a discrete Fourier transform codebook, producing a large nonconvex problem. We propose a hybrid Proximal Policy Optimization (PPO) actor-critic with Dirichlet heads for continuous power fractions and categorical heads for beam selection, implemented as a shared-backbone, multi-head policy. Simulation results show reliable convergence, consistent gains over a random-beam baseline, and strong performance across varying power regimes.
AB - Non-terrestrial networks extend connectivity to remote and disaster-affected areas and provide high-capacity backhaul, making them attractive for resilient wide-area communications. This paper studies joint resource allocation in a space-air-ground system where a free-space optical (FSO) satellite backhaul serves multiple multi-antenna unmanned aerial vehicle (UAV) relays that in turn serve ground users over RF links. The resulting optimization mixes continuous power-splitting decisions with discrete beam-selection choices from a discrete Fourier transform codebook, producing a large nonconvex problem. We propose a hybrid Proximal Policy Optimization (PPO) actor-critic with Dirichlet heads for continuous power fractions and categorical heads for beam selection, implemented as a shared-backbone, multi-head policy. Simulation results show reliable convergence, consistent gains over a random-beam baseline, and strong performance across varying power regimes.
KW - Non-Terrestrial Networks
KW - Proximal Policy Optimization
KW - Space-Air-Ground Systems
UR - https://www.scopus.com/pages/publications/105032155777
U2 - 10.1109/CommNet68224.2025.11288878
DO - 10.1109/CommNet68224.2025.11288878
M3 - Conference contribution
AN - SCOPUS:105032155777
T3 - 8th International Conference on Advanced Communication Technologies and Networking, CommNet 2025 - Proceedings
BT - 8th International Conference on Advanced Communication Technologies and Networking, CommNet 2025 - Proceedings
A2 - El Bouanani, Faissal
A2 - Ayoub, Fouad
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 8th International Conference on Advanced Communication Technologies and Networking, CommNet 2025
Y2 - 3 December 2025 through 5 December 2025
ER -