A novel optimal tracking control scheme for a class of discrete-time nonlinear systems using generalised policy iteration adaptive dynamic programming algorithm

Qiao Lin; Qinglai Wei; Derong Liu

doi:10.1080/00207721.2016.1188177

A novel optimal tracking control scheme for a class of discrete-time nonlinear systems using generalised policy iteration adaptive dynamic programming algorithm

Qiao Lin, Qinglai Wei, Derong Liu

Research output: Journal Publication › Article › peer-review

37 Citations (Scopus)

Abstract

In this paper, a novel iterative adaptive dynamic programming (ADP) algorithm, called generalised policy iteration ADP algorithm, is developed to solve optimal tracking control problems for discrete-time nonlinear systems. The idea is to use two iteration procedures, including an i-iteration and a j-iteration, to obtain the iterative tracking control laws and the iterative value functions. By system transformation, we first convert the optimal tracking control problem into an optimal regulation problem. Then the generalised policy iteration ADP algorithm, which is a general idea of interacting policy and value iteration algorithms, is introduced to deal with the optimal regulation problem. The convergence and optimality properties of the generalised policy iteration algorithm are analysed. Three neural networks are used to implement the developed algorithm. Finally, simulation examples are given to illustrate the performance of the present algorithm.

Original language	English
Pages (from-to)	525-534
Number of pages	10
Journal	International Journal of Systems Science
Volume	48
Issue number	3
DOIs	https://doi.org/10.1080/00207721.2016.1188177
Publication status	Published - 17 Feb 2017
Externally published	Yes

Keywords

Adaptive dynamic programming
affine nonlinear systems
discrete-time
generalised policy iteration
neural network
tracking control

ASJC Scopus subject areas

Control and Systems Engineering
Theoretical Computer Science
Computer Science Applications

Access to Document

10.1080/00207721.2016.1188177

https://www.tandfonline.com/doi/full/10.1080/00207721.2016.1188177

Cite this

@article{f6fc2aa3b2aa4fe8921cc79cad55a75b,

title = "A novel optimal tracking control scheme for a class of discrete-time nonlinear systems using generalised policy iteration adaptive dynamic programming algorithm",

abstract = "In this paper, a novel iterative adaptive dynamic programming (ADP) algorithm, called generalised policy iteration ADP algorithm, is developed to solve optimal tracking control problems for discrete-time nonlinear systems. The idea is to use two iteration procedures, including an i-iteration and a j-iteration, to obtain the iterative tracking control laws and the iterative value functions. By system transformation, we first convert the optimal tracking control problem into an optimal regulation problem. Then the generalised policy iteration ADP algorithm, which is a general idea of interacting policy and value iteration algorithms, is introduced to deal with the optimal regulation problem. The convergence and optimality properties of the generalised policy iteration algorithm are analysed. Three neural networks are used to implement the developed algorithm. Finally, simulation examples are given to illustrate the performance of the present algorithm.",

keywords = "Adaptive dynamic programming, affine nonlinear systems, discrete-time, generalised policy iteration, neural network, tracking control",

author = "Qiao Lin and Qinglai Wei and Derong Liu",

note = "Publisher Copyright: {\textcopyright} 2016 Informa UK Limited, trading as Taylor & Francis Group.",

year = "2017",

month = feb,

day = "17",

doi = "10.1080/00207721.2016.1188177",

language = "English",

volume = "48",

pages = "525--534",

journal = "International Journal of Systems Science",

issn = "0020-7721",

publisher = "Taylor and Francis Ltd.",

number = "3",

}

TY - JOUR

T1 - A novel optimal tracking control scheme for a class of discrete-time nonlinear systems using generalised policy iteration adaptive dynamic programming algorithm

AU - Lin, Qiao

AU - Wei, Qinglai

AU - Liu, Derong

PY - 2017/2/17

Y1 - 2017/2/17

N2 - In this paper, a novel iterative adaptive dynamic programming (ADP) algorithm, called generalised policy iteration ADP algorithm, is developed to solve optimal tracking control problems for discrete-time nonlinear systems. The idea is to use two iteration procedures, including an i-iteration and a j-iteration, to obtain the iterative tracking control laws and the iterative value functions. By system transformation, we first convert the optimal tracking control problem into an optimal regulation problem. Then the generalised policy iteration ADP algorithm, which is a general idea of interacting policy and value iteration algorithms, is introduced to deal with the optimal regulation problem. The convergence and optimality properties of the generalised policy iteration algorithm are analysed. Three neural networks are used to implement the developed algorithm. Finally, simulation examples are given to illustrate the performance of the present algorithm.

AB - In this paper, a novel iterative adaptive dynamic programming (ADP) algorithm, called generalised policy iteration ADP algorithm, is developed to solve optimal tracking control problems for discrete-time nonlinear systems. The idea is to use two iteration procedures, including an i-iteration and a j-iteration, to obtain the iterative tracking control laws and the iterative value functions. By system transformation, we first convert the optimal tracking control problem into an optimal regulation problem. Then the generalised policy iteration ADP algorithm, which is a general idea of interacting policy and value iteration algorithms, is introduced to deal with the optimal regulation problem. The convergence and optimality properties of the generalised policy iteration algorithm are analysed. Three neural networks are used to implement the developed algorithm. Finally, simulation examples are given to illustrate the performance of the present algorithm.

KW - Adaptive dynamic programming

KW - affine nonlinear systems

KW - discrete-time

KW - generalised policy iteration

KW - neural network

KW - tracking control

UR - http://www.scopus.com/inward/record.url?scp=84969764901&partnerID=8YFLogxK

U2 - 10.1080/00207721.2016.1188177

DO - 10.1080/00207721.2016.1188177

M3 - Article

SN - 0020-7721

VL - 48

SP - 525

EP - 534

JO - International Journal of Systems Science

JF - International Journal of Systems Science

IS - 3

ER -

A novel optimal tracking control scheme for a class of discrete-time nonlinear systems using generalised policy iteration adaptive dynamic programming algorithm

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this