Abstract
This paper proposes an innovative model-free deep reinforcement learning-based controller (RL-C) for a grid-connected 5-level packed-U-cell (PUC5) multilevel inverter (MLI). The controller is designed to deliver a high-quality grid current while maintaining the PUC5 floating capacitor voltage at its reference level. In addition, the proposed controller supports both active and reactive power exchanges, adapts to variations in voltage and current references, and remains robust under grid voltage variations. The RL agent learns optimal switching actions through direct interaction with the PUC5 system, eliminating the need for data collection or reliance on existing control models. An Actor-Critic architecture is adopted, and the Proximal Policy Optimization (PPO) algorithm is applied for training (offline) using MATLAB/Simulink, where the RL-C is evaluated under diverse PUC5 configurations and operating conditions in the testing phase. The trained agent has been implemented on an Opal-RT real-time system and validated experimentally using a laboratory-made PUC5 prototype. The performance of the proposed RL-C approach is compared to both traditional approaches including finite control set model predictive control, sliding mode control, and PI control, and other state-of-the-art RL algorithms, demonstrating superior generalization and training efficiency. Moreover, a sensitivity analysis quantifying the impact of reward design, state space, network size, and key hyperparameters on convergence and performance is carried out.
| Original language | English |
|---|---|
| Pages (from-to) | 1360-1376 |
| Number of pages | 17 |
| Journal | IEEE Open Journal of Power Electronics |
| Volume | 7 |
| DOIs | |
| Publication status | Published - 16 Apr 2026 |
Keywords
- AI Controllers
- Antennas
- Capacitors
- Circuits
- Feedback
- Feeds
- Filtering
- Filters
- Frequency modulation
- Packed-U-Cell Inverter
- Radio broadcasting
- Reinforcement Learning
- Voltage multipliers
Fingerprint
Dive into the research topics of 'Model-Free Deep Reinforcement Learning Control for Grid-Connected Packed U-Cell Multilevel Inverters'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver