Skip to main navigation Skip to search Skip to main content

Lyapunov-Based Reward Function Design for Reinforcement Learning Control of Grid-Connected Multilevel Inverters: A PUC5 Case Study

  • Texas A&M University
  • Kuwait College of Science and Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper proposes a Lyapunov-based reward function design for Reinforcement Learning Control (RL-C) of a grid-connected 5-level Packed U Cell (PUC5) inverter. Unlike existing RL approaches that rely solely on tracking-error-based rewards, the proposed method integrates the Lyapunov stability condition directly into the reward formulation, ensuring stable learning and robust closed-loop performance. The control objectives are to regulate the flying capacitor voltage and inject a low total harmonic distortion (THD) grid current. The Proximal Policy Optimization (PPO) algorithm is employed and trained in a MATLAB/Simulink environment under randomized operating conditions to enhance generalization. Simulation results demonstrate that the Lyapunov-based RL-C achieves stable operation with THD as low as 1.9 % and capacitor voltage error below 1 V across different current levels. Moreover, the trained agent exhibits strong adaptability to parameter variations and untrained operating points, confirming the proposed framework's robustness and suitability for real-time power electronics applications.

Original languageEnglish
Title of host publication2025 IEEE 4th Industrial Electronics Society Annual On-Line Conference, ONCON 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331589646
DOIs
Publication statusPublished - 2025
Event2025 IEEE 4th Industrial Electronics Society Annual On-Line Conference, ONCON 2025 - Kharagpur, India
Duration: 11 Dec 202513 Dec 2025

Publication series

Name2025 IEEE 4th Industrial Electronics Society Annual On-Line Conference, ONCON 2025

Conference

Conference2025 IEEE 4th Industrial Electronics Society Annual On-Line Conference, ONCON 2025
Country/TerritoryIndia
CityKharagpur
Period11/12/2513/12/25

Keywords

  • Lyapunov stability
  • Multilevel inverter
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Lyapunov-Based Reward Function Design for Reinforcement Learning Control of Grid-Connected Multilevel Inverters: A PUC5 Case Study'. Together they form a unique fingerprint.

Cite this