Data-Driven Optimal Output Feedback Control of Unknown System Model via Adaptive Dynamic Programming

Yong Sheng Ma; Jian Sun; Yong Xu; Shi Sheng Cui

doi:10.1109/TASE.2025.3593481

Data-Driven Optimal Output Feedback Control of Unknown System Model via Adaptive Dynamic Programming

Yong Sheng Ma, Jian Sun, Yong Xu^*, Shi Sheng Cui

^*此作品的通讯作者

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

This paper investigates the linear quadratic optimal output feedback control problem for an unknown linear continuous-time system. Combined with adaptive dynamic programming and optimal control theory, an online data-driven iteration learning algorithm is developed to learn an optimal controller from system data. The main advantage of the proposed algorithm is that it does not require an initial stabilizing control policy, a full-rank condition, or historical data storage to guarantee algorithm convergence. This is fundamentally different from the existing results based on the least-squares method, which requires these conditions. Moreover, the developed algorithm uses only the input and output data of the system, which solves the problem of unmeasurable system states. The simulation results demonstrate the efficacy of the proposed algorithm, and its superiority is demonstrated by comparison with the existing algorithms.

源语言	英语
页（从-至）	19187-19196
页数	10
期刊	IEEE Transactions on Automation Science and Engineering
卷	22
DOI	http://doi.org/10.1109/TASE.2025.3593481
出版状态	已出版 - 2025
已对外发布	是

访问文件

10.1109/TASE.2025.3593481

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{03a51670db2845ebb146a85061c61d87,

title = "Data-Driven Optimal Output Feedback Control of Unknown System Model via Adaptive Dynamic Programming",

abstract = "This paper investigates the linear quadratic optimal output feedback control problem for an unknown linear continuous-time system. Combined with adaptive dynamic programming and optimal control theory, an online data-driven iteration learning algorithm is developed to learn an optimal controller from system data. The main advantage of the proposed algorithm is that it does not require an initial stabilizing control policy, a full-rank condition, or historical data storage to guarantee algorithm convergence. This is fundamentally different from the existing results based on the least-squares method, which requires these conditions. Moreover, the developed algorithm uses only the input and output data of the system, which solves the problem of unmeasurable system states. The simulation results demonstrate the efficacy of the proposed algorithm, and its superiority is demonstrated by comparison with the existing algorithms.",

keywords = "Adaptive dynamic programming, data-driven iteration learning algorithm, optimal output feedback control",

author = "Ma, \{Yong Sheng\} and Jian Sun and Yong Xu and Cui, \{Shi Sheng\}",

note = "Publisher Copyright: {\textcopyright} 2004-2012 IEEE.",

year = "2025",

doi = "10.1109/TASE.2025.3593481",

language = "English",

volume = "22",

pages = "19187--19196",

journal = "IEEE Transactions on Automation Science and Engineering",

issn = "1545-5955",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Data-Driven Optimal Output Feedback Control of Unknown System Model via Adaptive Dynamic Programming

AU - Ma, Yong Sheng

AU - Sun, Jian

AU - Xu, Yong

AU - Cui, Shi Sheng

PY - 2025

Y1 - 2025

N2 - This paper investigates the linear quadratic optimal output feedback control problem for an unknown linear continuous-time system. Combined with adaptive dynamic programming and optimal control theory, an online data-driven iteration learning algorithm is developed to learn an optimal controller from system data. The main advantage of the proposed algorithm is that it does not require an initial stabilizing control policy, a full-rank condition, or historical data storage to guarantee algorithm convergence. This is fundamentally different from the existing results based on the least-squares method, which requires these conditions. Moreover, the developed algorithm uses only the input and output data of the system, which solves the problem of unmeasurable system states. The simulation results demonstrate the efficacy of the proposed algorithm, and its superiority is demonstrated by comparison with the existing algorithms.

AB - This paper investigates the linear quadratic optimal output feedback control problem for an unknown linear continuous-time system. Combined with adaptive dynamic programming and optimal control theory, an online data-driven iteration learning algorithm is developed to learn an optimal controller from system data. The main advantage of the proposed algorithm is that it does not require an initial stabilizing control policy, a full-rank condition, or historical data storage to guarantee algorithm convergence. This is fundamentally different from the existing results based on the least-squares method, which requires these conditions. Moreover, the developed algorithm uses only the input and output data of the system, which solves the problem of unmeasurable system states. The simulation results demonstrate the efficacy of the proposed algorithm, and its superiority is demonstrated by comparison with the existing algorithms.

KW - Adaptive dynamic programming

KW - data-driven iteration learning algorithm

KW - optimal output feedback control

UR - http://www.scopus.com/pages/publications/105012101365

U2 - 10.1109/TASE.2025.3593481

DO - 10.1109/TASE.2025.3593481

M3 - Article

AN - SCOPUS:105012101365

SN - 1545-5955

VL - 22

SP - 19187

EP - 19196

JO - IEEE Transactions on Automation Science and Engineering

JF - IEEE Transactions on Automation Science and Engineering

ER -

Data-Driven Optimal Output Feedback Control of Unknown System Model via Adaptive Dynamic Programming

摘要

访问文件

其它文件与链接

指纹

引用此