We propose an efficient technique for performing data-driven optimal control of discrete-time systems. In particular, we show that log-sum-exp ($lse$) neural networks, which are smooth and convex universal approximators of convex functions, can be efficiently used to approximate Q-factors arising from finite-horizon optimal control problems with continuous state space. The key advantage of these networks over classical approximation techniques is that they are convex and hence readily amenable to efficient optimization.
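
The convexity property that the abstract relies on can be illustrated with a minimal sketch. It assumes the common form of a log-sum-exp network, $f_T(z) = T \log \sum_k \exp((a_k^\top z + b_k)/T)$, evaluated on a stacked state-input vector $z = (x, u)$. The function names, the random (untrained) parameters `A` and `b`, the temperature `T`, and the ternary search over a scalar input are all illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def lse_network(z, A, b, T=1.0):
    """Evaluate f_T(z) = T * log(sum_k exp((A[k] @ z + b[k]) / T)) row-wise."""
    z = np.atleast_2d(z)
    s = (z @ A.T + b) / T                      # shape (n_points, K)
    s_max = s.max(axis=1, keepdims=True)       # shift for numerical stability
    return T * (s_max[:, 0] + np.log(np.exp(s - s_max).sum(axis=1)))


def minimize_over_input(x, A, b, T=1.0, lo=-5.0, hi=5.0, iters=100):
    """Minimize the (hypothetical) Q-factor over a scalar input u on [lo, hi].

    Ternary search is valid here only because the map u -> f_T((x, u)) is convex.
    """
    def q(u):
        return lse_network(np.array([x, u]), A, b, T)[0]

    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if q(m1) <= q(m2):
            hi = m2
        else:
            lo = m1
    u_star = 0.5 * (lo + hi)
    return u_star, q(u_star)


# Assumed, untrained parameters: K = 5 affine pieces on z = (x, u) in R^2.
rng = np.random.default_rng(0)
A = rng.normal(size=(5, 2))
b = rng.normal(size=5)

# Midpoint convexity check on random pairs of points.
z1 = rng.normal(size=(1000, 2))
z2 = rng.normal(size=(1000, 2))
mid = lse_network(0.5 * (z1 + z2), A, b)
avg = 0.5 * (lse_network(z1, A, b) + lse_network(z2, A, b))
convex_ok = bool(np.all(mid <= avg + 1e-9))

# Greedy input for a fixed scalar state x = 0.3.
u_star, q_star = minimize_over_input(0.3, A, b)
print(convex_ok, u_star, q_star)
```

Because the approximator is convex in the input, the minimization over `u` needed to extract a control action has no spurious local minima, which is the sense in which such networks are amenable to efficient optimization. In a data-driven setting, `A` and `b` would be fitted to sampled costs rather than drawn at random.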

Calafiore, G., Possieri, C. (2020). Efficient model-free Q-factor approximation in value space via log-sum-exp neural networks. In European Control Conference 2020. New York: IEEE [10.23919/ECC51009.2020.9143765].

Efficient model-free Q-factor approximation in value space via log-sum-exp neural networks

Corrado Possieri
2020-01-01

European Control Conference (ECC2020)
Saint Petersburg, Russia
2020
National relevance
2020
Sector ING-INF/04 - Automatic Control
Sector IINF-04/A - Automatic Control
English
Optimal control
Q-factors
Optimization
Neural networks
Conference contribution
Calafiore, G; Possieri, C
Files in this record:
Calafiore-Efficient.pdf

authorized users only

Type: Publisher's version (PDF)
License: Publisher's copyright
Size: 272.44 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2108/294351
Citations
  • PMC: N/A
  • Scopus: N/A
  • ISI: 2