Abstract
In this paper, a discrete-time optimal control scheme is developed via a novel local policy iteration adaptive dynamic programming algorithm. In the discrete-time local policy iteration algorithm, the iterative value function and iterative control law can be updated in a subset of the state space, where the computational burden is relaxed compared with the traditional policy iteration algorithm. Convergence properties of the local policy iteration algorithm are presented to show that the iterative value function is monotonically nonincreasing and converges to the optimum under some mild conditions. The admissibility of the iterative control law is proven, which shows that the control system can be stabilized under any of the iterative control laws, even if the iterative control law is updated in a subset of the state space. Finally, two simulation examples are given to illustrate the performance of the developed method.
Original language | English |
---|---|
Article number | 7515142 |
Pages (from-to) | 3367-3379 |
Number of pages | 13 |
Journal | IEEE Transactions on Cybernetics |
Volume | 47 |
Issue number | 10 |
DOIs | |
Publication status | Published - Oct 2017 |
Externally published | Yes |
Keywords
- Adaptive critic designs
- adaptive dynamic programming (ADP)
- approximate dynamic programming
- local policy iteration
- neuro-dynamic programming
- nonlinear systems
- optimal control
ASJC Scopus subject areas
- Software
- Control and Systems Engineering
- Information Systems
- Human-Computer Interaction
- Computer Science Applications
- Electrical and Electronic Engineering