The design of optimal control laws for nonlinear systems is tackled without knowledge of the underlying plant and of a functional description of the cost function. The proposed data-driven method is based only on real-time measurements of the state of the plant and of the (instantaneous) value of the reward signal and relies on a combination of ideas borrowed from the theories of optimal and adaptive control problems. As a result, the architecture implements a policy iteration strategy in which, hinging on the use of neural networks, the policy evaluation step and the computation of the relevant information instrumental for the policy improvement step are performed in a purely continuous-time fashion. Furthermore, the desirable features of the design method, including convergence rate and robustness properties, are discussed. Finally, the theory is validated via two benchmark numerical simulations.

Possieri, C., Sassano, M. (2023). Data-driven policy iteration for nonlinear optimal control problems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 34(10), 7365-7376 [10.1109/TNNLS.2022.3142501].

Data-driven policy iteration for nonlinear optimal control problems

Possieri C.;Sassano M.
2023-01-01

Abstract

The design of optimal control laws for nonlinear systems is tackled without knowledge of the underlying plant and of a functional description of the cost function. The proposed data-driven method is based only on real-time measurements of the state of the plant and of the (instantaneous) value of the reward signal and relies on a combination of ideas borrowed from the theories of optimal and adaptive control problems. As a result, the architecture implements a policy iteration strategy in which, hinging on the use of neural networks, the policy evaluation step and the computation of the relevant information instrumental for the policy improvement step are performed in a purely continuous-time fashion. Furthermore, the desirable features of the design method, including convergence rate and robustness properties, are discussed. Finally, the theory is validated via two benchmark numerical simulations.
2023
Pubblicato
Rilevanza internazionale
Articolo
Esperti anonimi
Settore ING-INF/04 - AUTOMATICA
Settore IINF-04/A - Automatica
English
Closed loop systems; Costs; Data-driven methods; Learning systems; Neural networks; Nonlinear dynamical systems; Nonlinear systems; Optimal control; Policy iteration; Real-time systems
Possieri, C., Sassano, M. (2023). Data-driven policy iteration for nonlinear optimal control problems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 34(10), 7365-7376 [10.1109/TNNLS.2022.3142501].
Possieri, C; Sassano, M
Articolo su rivista
File in questo prodotto:
File Dimensione Formato  
Data-Driven_Policy_Iteration_for_Nonlinear_Optimal_Control_Problems.pdf

solo utenti autorizzati

Tipologia: Versione Editoriale (PDF)
Licenza: Copyright dell'editore
Dimensione 1.1 MB
Formato Adobe PDF
1.1 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/294504
Citazioni
  • ???jsp.display-item.citation.pmc??? 0
  • Scopus 7
  • ???jsp.display-item.citation.isi??? 6
social impact