TY - JOUR
T1 - Data-Driven Optimal Output Feedback Control of Unknown System Model via Adaptive Dynamic Programming
AU - Ma, Yong Sheng
AU - Sun, Jian
AU - Xu, Yong
AU - Cui, Shi Sheng
N1 - Publisher Copyright:
© 2004-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - This paper investigates the linear quadratic optimal output feedback control problem for an unknown linear continuous-time system. Combined with adaptive dynamic programming and optimal control theory, an online data-driven iteration learning algorithm is developed to learn an optimal controller from system data. The main advantage of the proposed algorithm is that it does not require an initial stabilizing control policy, a full-rank condition, or historical data storage to guarantee algorithm convergence. This is fundamentally different from the existing results based on the least-squares method, which requires these conditions. Moreover, the developed algorithm uses only the input and output data of the system, which solves the problem of unmeasurable system states. The simulation results demonstrate the efficacy of the proposed algorithm, and its superiority is demonstrated by comparison with the existing algorithms.
AB - This paper investigates the linear quadratic optimal output feedback control problem for an unknown linear continuous-time system. Combined with adaptive dynamic programming and optimal control theory, an online data-driven iteration learning algorithm is developed to learn an optimal controller from system data. The main advantage of the proposed algorithm is that it does not require an initial stabilizing control policy, a full-rank condition, or historical data storage to guarantee algorithm convergence. This is fundamentally different from the existing results based on the least-squares method, which requires these conditions. Moreover, the developed algorithm uses only the input and output data of the system, which solves the problem of unmeasurable system states. The simulation results demonstrate the efficacy of the proposed algorithm, and its superiority is demonstrated by comparison with the existing algorithms.
KW - Adaptive dynamic programming
KW - data-driven iteration learning algorithm
KW - optimal output feedback control
UR - http://www.scopus.com/pages/publications/105012101365
U2 - 10.1109/TASE.2025.3593481
DO - 10.1109/TASE.2025.3593481
M3 - Article
AN - SCOPUS:105012101365
SN - 1545-5955
VL - 22
SP - 19187
EP - 19196
JO - IEEE Transactions on Automation Science and Engineering
JF - IEEE Transactions on Automation Science and Engineering
ER -