Many processes in plasma physics are inherently complex and highly nonlinear. Typically their behaviour is difficult to interpret with theoretical models based on first principles. To perform high-quality inferences, these processes have to be modelled starting directly from the experimental data. In this contribution we study and analyse the capabilities of Symbolic Regression (SR) via Genetic Programming as a tool for advanced data mining in Nuclear Fusion to derive Empirical Models. Whereas traditional linear and non-linear regression techniques simply try to find the best parameters of a predefined model by fitting the available data, Symbolic Regression via Genetic Programming searches for the Best Unconstrained Empirical Model Structure. This implies deriving the significant variables, the functional form of the model and its parameters. A set of synthetic problems is used to assess some important capabilities of SR tools: over-fitting avoidance, extrapolation properties, identification of model constants, scalability to higher-dimensional problems and capacity to handle noisy data. As an example of application to Nuclear Fusion research, the method has been applied to the ITPA database of the energy confinement time of Tokamak plasmas in H-mode.
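To make the contrast with fixed-structure regression concrete, the following is a minimal sketch of symbolic regression by an evolutionary search over expression trees. It is not the authors' implementation: the primitive set, the parsimony penalty, the population settings and all function names (`random_tree`, `mutate`, `evolve`, ...) are assumptions chosen for illustration, and the search is a simplified mutation-only variant with truncation selection, whereas a full Genetic Programming run would normally also use crossover.

```python
import random

# Assumed primitive set: three binary operators, one variable "x", float constants.
OPS = {"+": lambda a, b: a + b, "-": lambda a, b: a - b, "*": lambda a, b: a * b}

def random_tree(rng, depth):
    """Grow a random tree: 'x', a float constant, or a tuple (op, left, right)."""
    if depth == 0 or rng.random() < 0.3:
        return "x" if rng.random() < 0.5 else round(rng.uniform(-2.0, 2.0), 1)
    op = rng.choice(sorted(OPS))
    return (op, random_tree(rng, depth - 1), random_tree(rng, depth - 1))

def evaluate(tree, x):
    """Evaluate an expression tree at the point x."""
    if isinstance(tree, tuple):
        op, left, right = tree
        return OPS[op](evaluate(left, x), evaluate(right, x))
    return x if tree == "x" else tree

def size(tree):
    """Number of nodes, used as a simple complexity measure."""
    if isinstance(tree, tuple):
        return 1 + size(tree[1]) + size(tree[2])
    return 1

def fitness(tree, data, parsimony=0.01):
    """Mean squared error plus a complexity penalty that discourages over-fitting."""
    mse = sum((evaluate(tree, x) - y) * (evaluate(tree, x) - y) for x, y in data) / len(data)
    return mse + parsimony * size(tree)

def mutate(rng, tree, depth=2):
    """Replace a randomly chosen subtree with a freshly grown one."""
    if not isinstance(tree, tuple) or rng.random() < 0.3:
        return random_tree(rng, depth)
    op, left, right = tree
    if rng.random() < 0.5:
        return (op, mutate(rng, left, depth), right)
    return (op, left, mutate(rng, right, depth))

def evolve(data, rng, pop_size=60, generations=30, max_size=31):
    """Keep the best quarter of each generation and refill it with mutated copies."""
    population = [random_tree(rng, 3) for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        population.sort(key=lambda t: fitness(t, data))
        history.append(fitness(population[0], data))
        elite = population[: pop_size // 4]
        children = []
        while len(elite) + len(children) < pop_size:
            child = mutate(rng, rng.choice(elite))
            if size(child) <= max_size:  # bound tree growth
                children.append(child)
        population = elite + children
    best = min(population, key=lambda t: fitness(t, data))
    history.append(fitness(best, data))
    return best, history

# Synthetic target y = x^2 + x, sampled on a grid in [-2, 2].
DATA = [(i / 5.0, (i / 5.0) ** 2 + i / 5.0) for i in range(-10, 11)]
best, history = evolve(DATA, random.Random(0))
print("best model:", best)
print("penalised error:", history[-1])
```

Because the best individuals are carried over unchanged, the penalised error recorded in `history` can never increase from one generation to the next. Both the structure of the model and its constants are free here; a conventional regression would fix the structure in advance and fit only the constants.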
Peluso, E., Murari, A., Lupelli, I., Gelfusa, M., Gaudio, P. (2014). Symbolic regression via genetic programming to derive empirical models and scaling laws as monomial or polynomial expansions. In 41st EPS Conference on Plasma Physics, EPS 2014. European Physical Society (EPS).
Symbolic regression via genetic programming to derive empirical models and scaling laws as monomial or polynomial expansions
Peluso, Emmanuele; Lupelli, Ivan; Gelfusa, Michela; Gaudio, Pasqualino
2014-01-01