Discrete-Time Optimal Control via Local Policy Iteration Adaptive Dynamic Programming

Qinglai Wei; Derong Liu; Qiao Lin; Ruizhuo Song

doi:10.1109/TCYB.2016.2586082

Research output: Journal Publication › Article › peer-review

102 Citations (Scopus)

Abstract

In this paper, a discrete-time optimal control scheme is developed via a novel local policy iteration adaptive dynamic programming algorithm. In the discrete-time local policy iteration algorithm, the iterative value function and iterative control law can be updated in a subset of the state space, where the computational burden is relaxed compared with the traditional policy iteration algorithm. Convergence properties of the local policy iteration algorithm are presented to show that the iterative value function is monotonically nonincreasing and converges to the optimum under some mild conditions. The admissibility of the iterative control law is proven, which shows that the control system can be stabilized under any of the iterative control laws, even if the iterative control law is updated in a subset of the state space. Finally, two simulation examples are given to illustrate the performance of the developed method.
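The idea of updating the control law only in a subset of the state space can be illustrated with a small sketch. The following is not the authors' algorithm or code: it is a minimal toy under stated assumptions. The system (integer states 0..9, controls that move 1 or 2 steps toward the origin), the utility x^2 + u^2, the alternating choice of subsets, and all function names are hypothetical choices made for illustration. As a further simplification, the value function of each new control law is re-evaluated exactly on the whole toy state space, whereas the paper's point is that updates can be confined to the subset.

```python
# Toy deterministic system (an assumption for illustration, not from the paper):
# states 0..N-1, controls move 1 or 2 steps toward the origin.
N = 10
CONTROLS = (1, 2)

def step(x, u):
    """System function F(x, u): move u steps toward 0, stopping at 0."""
    return max(x - u, 0)

def utility(x, u):
    """Utility U(x, u): zero at the origin, positive elsewhere."""
    return 0 if x == 0 else x * x + u * u

def evaluate(policy):
    """Policy evaluation: solve V(x) = U(x, pi(x)) + V(F(x, pi(x))).

    Every control strictly decreases x, so V can be filled in increasing x.
    """
    value = [0] * N
    for x in range(1, N):
        u = policy[x]
        value[x] = utility(x, u) + value[step(x, u)]
    return value

def improve_locally(policy, value, subset):
    """Policy improvement restricted to `subset`; other states keep their control."""
    new_policy = list(policy)
    for x in subset:
        new_policy[x] = min(CONTROLS, key=lambda u: utility(x, u) + value[step(x, u)])
    return new_policy

def local_policy_iteration(sweeps=6):
    """Alternate local improvement and evaluation, recording each value function."""
    policy = [1] * N                  # initial admissible control law: always 1 step
    value = evaluate(policy)
    history = [value]
    lower, upper = range(1, N // 2), range(N // 2, N)
    for i in range(sweeps):
        subset = lower if i % 2 == 0 else upper   # alternate the local region
        policy = improve_locally(policy, value, subset)
        value = evaluate(policy)
        history.append(value)
    return policy, value, history

if __name__ == "__main__":
    policy, value, history = local_policy_iteration()
    print("control law:", policy)
    print("value function:", value)
```

In this toy, each entry of `history` is pointwise no larger than the previous one, which mirrors the monotonically nonincreasing property stated in the abstract. Every control law stays admissible here because both controls drive the state to the origin.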

Original language	English
Article number	7515142
Pages (from-to)	3367-3379
Number of pages	13
Journal	IEEE Transactions on Cybernetics
Volume	47
Issue number	10
DOIs	https://doi.org/10.1109/TCYB.2016.2586082
Publication status	Published - Oct 2017
Externally published	Yes

Keywords

Adaptive critic designs
adaptive dynamic programming (ADP)
approximate dynamic programming
local policy iteration
neuro-dynamic programming
nonlinear systems
optimal control

ASJC Scopus subject areas

Software
Control and Systems Engineering
Information Systems
Human-Computer Interaction
Computer Science Applications
Electrical and Electronic Engineering

Access to Document

10.1109/TCYB.2016.2586082

http://ieeexplore.ieee.org/document/7515142/

Cite this

@article{8b0631649bdb4c1fb6a9d345cbf2160b,

title = "Discrete-Time Optimal Control via Local Policy Iteration Adaptive Dynamic Programming",

abstract = "In this paper, a discrete-time optimal control scheme is developed via a novel local policy iteration adaptive dynamic programming algorithm. In the discrete-time local policy iteration algorithm, the iterative value function and iterative control law can be updated in a subset of the state space, where the computational burden is relaxed compared with the traditional policy iteration algorithm. Convergence properties of the local policy iteration algorithm are presented to show that the iterative value function is monotonically nonincreasing and converges to the optimum under some mild conditions. The admissibility of the iterative control law is proven, which shows that the control system can be stabilized under any of the iterative control laws, even if the iterative control law is updated in a subset of the state space. Finally, two simulation examples are given to illustrate the performance of the developed method.",

keywords = "Adaptive critic designs, adaptive dynamic programming (ADP), approximate dynamic programming, local policy iteration, neuro-dynamic programming, nonlinear systems, optimal control",

author = "Qinglai Wei and Derong Liu and Qiao Lin and Ruizhuo Song",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2017",

month = oct,

doi = "10.1109/TCYB.2016.2586082",

language = "English",

volume = "47",

pages = "3367--3379",

journal = "IEEE Transactions on Cybernetics",

issn = "2168-2267",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "10",

}

TY - JOUR

T1 - Discrete-Time Optimal Control via Local Policy Iteration Adaptive Dynamic Programming

AU - Wei, Qinglai

AU - Liu, Derong

AU - Lin, Qiao

AU - Song, Ruizhuo

PY - 2017/10

Y1 - 2017/10

N2 - In this paper, a discrete-time optimal control scheme is developed via a novel local policy iteration adaptive dynamic programming algorithm. In the discrete-time local policy iteration algorithm, the iterative value function and iterative control law can be updated in a subset of the state space, where the computational burden is relaxed compared with the traditional policy iteration algorithm. Convergence properties of the local policy iteration algorithm are presented to show that the iterative value function is monotonically nonincreasing and converges to the optimum under some mild conditions. The admissibility of the iterative control law is proven, which shows that the control system can be stabilized under any of the iterative control laws, even if the iterative control law is updated in a subset of the state space. Finally, two simulation examples are given to illustrate the performance of the developed method.

AB - In this paper, a discrete-time optimal control scheme is developed via a novel local policy iteration adaptive dynamic programming algorithm. In the discrete-time local policy iteration algorithm, the iterative value function and iterative control law can be updated in a subset of the state space, where the computational burden is relaxed compared with the traditional policy iteration algorithm. Convergence properties of the local policy iteration algorithm are presented to show that the iterative value function is monotonically nonincreasing and converges to the optimum under some mild conditions. The admissibility of the iterative control law is proven, which shows that the control system can be stabilized under any of the iterative control laws, even if the iterative control law is updated in a subset of the state space. Finally, two simulation examples are given to illustrate the performance of the developed method.

KW - Adaptive critic designs

KW - adaptive dynamic programming (ADP)

KW - approximate dynamic programming

KW - local policy iteration

KW - neuro-dynamic programming

KW - nonlinear systems

KW - optimal control

UR - http://www.scopus.com/inward/record.url?scp=84978818227&partnerID=8YFLogxK

U2 - 10.1109/TCYB.2016.2586082

DO - 10.1109/TCYB.2016.2586082

M3 - Article

C2 - 27448382

SN - 2168-2267

VL - 47

SP - 3367

EP - 3379

JO - IEEE Transactions on Cybernetics

JF - IEEE Transactions on Cybernetics

IS - 10

M1 - 7515142

ER -
