The application of data driven machine learning and advanced statistical tools to complex physics experiments, such as Magnetic Confinement Nuclear Fusion, can be problematic, due the varying conditions of the systems to be studied. In particular, new experiments have to be planned in unexplored regions of the operational space. As a consequence, care must be taken because the input quantities used to train and test the performance of the analysis tools are not necessarily sampled by the same probability distribution as in the final applications. The regressors and dependent variables cannot therefore be assumed to verify the i.i.d. (independent and identical distribution) hypothesis and learning has therefore to take place under non stationary conditions. In the present paper, a new data driven methodology is proposed to guide planning of experiments, to explore the operational space and to optimise performance. The approach is based on the falsification of existing models. The deployment of Symbolic Regression via Genetic Programming to the available data is used to identify a set of candidate models, using the method of the Pareto Frontier. The confidence intervals for the predictions of such models are then used to find the best region of the parameter space for their falsification, where the next set of experiments can be most profitably carried out. Extensive numerical tests and applications to the scaling laws in Tokamaks prove the viability of the proposed methodology.

Murari, A., Lungaroni, M., Peluso, E., Craciunescu, T., Gelfusa, M. (2019). A Model Falsification Approach to Learning in Non-Stationary Environments for Experimental Design. SCIENTIFIC REPORTS, 9(1), 17880 [10.1038/s41598-019-54145-7].

A Model Falsification Approach to Learning in Non-Stationary Environments for Experimental Design

Lungaroni M.;Peluso E.;Gelfusa M.
2019-01-01

Abstract

The application of data driven machine learning and advanced statistical tools to complex physics experiments, such as Magnetic Confinement Nuclear Fusion, can be problematic, due the varying conditions of the systems to be studied. In particular, new experiments have to be planned in unexplored regions of the operational space. As a consequence, care must be taken because the input quantities used to train and test the performance of the analysis tools are not necessarily sampled by the same probability distribution as in the final applications. The regressors and dependent variables cannot therefore be assumed to verify the i.i.d. (independent and identical distribution) hypothesis and learning has therefore to take place under non stationary conditions. In the present paper, a new data driven methodology is proposed to guide planning of experiments, to explore the operational space and to optimise performance. The approach is based on the falsification of existing models. The deployment of Symbolic Regression via Genetic Programming to the available data is used to identify a set of candidate models, using the method of the Pareto Frontier. The confidence intervals for the predictions of such models are then used to find the best region of the parameter space for their falsification, where the next set of experiments can be most profitably carried out. Extensive numerical tests and applications to the scaling laws in Tokamaks prove the viability of the proposed methodology.
2019
Pubblicato
Rilevanza internazionale
Articolo
Esperti anonimi
Settore ING-IND/18 - FISICA DEI REATTORI NUCLEARI
English
Murari, A., Lungaroni, M., Peluso, E., Craciunescu, T., Gelfusa, M. (2019). A Model Falsification Approach to Learning in Non-Stationary Environments for Experimental Design. SCIENTIFIC REPORTS, 9(1), 17880 [10.1038/s41598-019-54145-7].
Murari, A; Lungaroni, M; Peluso, E; Craciunescu, T; Gelfusa, M
Articolo su rivista
File in questo prodotto:
File Dimensione Formato  
A-Model-Falsification-Approach-to-Learning-in-NonStationary-Environments-for-Experimental-Design2019Scientific-ReportsOpen-Access.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Non specificato
Dimensione 1.25 MB
Formato Adobe PDF
1.25 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/240166
Citazioni
  • ???jsp.display-item.citation.pmc??? 1
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 5
social impact