Skip to main navigation Skip to search Skip to main content

Model-Free Deep Reinforcement Learning Control for Grid-Connected Packed U-Cell Multilevel Inverters

  • Alamera Nouran Alquennah*
  • , Tassneem Zamzam
  • , Ahmed Kouzou
  • , Azadeh Kermansaravi
  • , Mohamed Trabelsi
  • , Sertac Bayhan
  • , Haitham Abu-Rub
  • , Ali Ghrayeb
  • , Hani Vahedi
  • , Sunil Khatri
  • *Corresponding author for this work
  • Texas A&M University
  • Texas A&M University at Qatar
  • Delft University of Technology
  • Kuwait College of Science and Technology
  • Abdullah Al Salem University

Research output: Contribution to journalArticlepeer-review

Abstract

This paper proposes an innovative model-free deep reinforcement learning-based controller (RL-C) for a grid-connected 5-level packed-U-cell (PUC5) multilevel inverter (MLI). The controller is designed to deliver a high-quality grid current while maintaining the PUC5 floating capacitor voltage at its reference level. In addition, the proposed controller supports both active and reactive power exchanges, adapts to variations in voltage and current references, and remains robust under grid voltage variations. The RL agent learns optimal switching actions through direct interaction with the PUC5 system, eliminating the need for data collection or reliance on existing control models. An Actor-Critic architecture is adopted, and the Proximal Policy Optimization (PPO) algorithm is applied for training (offline) using MATLAB/Simulink, where the RL-C is evaluated under diverse PUC5 configurations and operating conditions in the testing phase. The trained agent has been implemented on an Opal-RT real-time system and validated experimentally using a laboratory-made PUC5 prototype. The performance of the proposed RL-C approach is compared to both traditional approaches including finite control set model predictive control, sliding mode control, and PI control, and other state-of-the-art RL algorithms, demonstrating superior generalization and training efficiency. Moreover, a sensitivity analysis quantifying the impact of reward design, state space, network size, and key hyperparameters on convergence and performance is carried out.

Original languageEnglish
Pages (from-to)1360-1376
Number of pages17
JournalIEEE Open Journal of Power Electronics
Volume7
DOIs
Publication statusPublished - 16 Apr 2026

Keywords

  • AI Controllers
  • Antennas
  • Capacitors
  • Circuits
  • Feedback
  • Feeds
  • Filtering
  • Filters
  • Frequency modulation
  • Packed-U-Cell Inverter
  • Radio broadcasting
  • Reinforcement Learning
  • Voltage multipliers

Fingerprint

Dive into the research topics of 'Model-Free Deep Reinforcement Learning Control for Grid-Connected Packed U-Cell Multilevel Inverters'. Together they form a unique fingerprint.

Cite this