Data-Driven Policy Iteration for Nonlinear Optimal Control Problems

The design of optimal control laws for nonlinear systems is tackled without knowledge of the underlying plant and of a functional description of the cost function. The proposed data-driven method is based only on real-time measurements of the state of the plant and of the (instantaneous) value of the reward signal and relies on a combination of ideas borrowed from the theories of optimal and adaptive control problems. As a result, the architecture implements a policy iteration strategy in which, hinging on the use of neural networks, the policy evaluation step and the computation of the relevant information instrumental for the policy improvement step are performed in a purely continuous-time fashion. Furthermore, the desirable features of the design method, including convergence rate and robustness properties, are discussed. Finally, the theory is validated via two benchmark numerical simulations.
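For readers unfamiliar with the policy iteration strategy mentioned in the abstract, the alternation between policy evaluation and policy improvement can be illustrated on a textbook case. The sketch below is not the authors' data-driven, neural-network-based method: it assumes a known scalar linear plant x' = a*x + b*u with quadratic cost q*x^2 + r*u^2, and all names (`evaluate_policy`, `improve_policy`, `policy_iteration`, `riccati_solution`) and parameters are hypothetical, chosen only for illustration.

```python
import math


def evaluate_policy(k, a, b, q, r):
    """Policy evaluation: return p such that V(x) = p * x**2 is the cost of u = -k * x.

    Assumes a known scalar model x' = a*x + b*u (an illustration-only assumption).
    """
    closed_loop = a - b * k
    if closed_loop >= 0:
        raise ValueError("gain k must stabilize the closed loop (a - b*k < 0)")
    # Scalar Lyapunov equation: 2 * (a - b*k) * p + q + r * k**2 = 0
    return (q + r * k**2) / (-2.0 * closed_loop)


def improve_policy(p, b, r):
    """Policy improvement: minimizing the Hamiltonian over u gives k = b * p / r."""
    return b * p / r


def policy_iteration(a, b, q, r, k0, iterations=20):
    """Alternate evaluation and improvement, starting from a stabilizing gain k0."""
    k = k0
    p = None
    for _ in range(iterations):
        p = evaluate_policy(k, a, b, q, r)
        k = improve_policy(p, b, r)
    return p, k


def riccati_solution(a, b, q, r):
    """Positive root of the scalar Riccati equation 2*a*p - (b**2 / r) * p**2 + q = 0."""
    return r * (a + math.sqrt(a**2 + b**2 * q / r)) / b**2
```

Starting from any stabilizing gain, for example `policy_iteration(1.0, 1.0, 1.0, 1.0, k0=3.0)`, the iterates should approach the value returned by `riccati_solution(1.0, 1.0, 1.0, 1.0)`. The paper addresses the harder setting in which neither the plant model nor the cost function is available and both steps run in continuous time from measured data.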

Possieri, C., Sassano, M. (2023). Data-Driven Policy Iteration for Nonlinear Optimal Control Problems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 34(10), 7365-7376 [10.1109/TNNLS.2022.3142501].

Possieri C.; Sassano M.

2023-01-01

Publication date: 2023
Publication status: Published
Article DOI: https://dx.doi.org/10.1109/TNNLS.2022.3142501
Relevance: International
Type: Article
Referee: Anonymous experts
Disciplinary sector of the article (valid until 24/06/2024): Settore ING-INF/04 - AUTOMATICA
Content language: English
Keywords: Closed loop systems; Costs; Data-driven methods; Learning systems; Neural networks; Nonlinear dynamical systems; Nonlinear systems; Optimal control; Policy iteration; Real-time systems
All authors: Possieri, C; Sassano, M
Typology: Journal article
Appears in typologies: 01 - Journal article

Use this identifier to cite or link to this document: https://hdl.handle.net/2108/294504
