Dimensional analysis is a well-known approach to model building in engineering because it helps identify more parsimonious and meaningful equations for describing complex phenomena. Unfortunately, it is not always exploited to the full: it is typically applied before any form of statistical evaluation, which often results in poor choices of the dimensionless quantities and consequently in suboptimal models. A completely general, data-driven technique is proposed that integrates dimensional and statistical analysis with the help of neural computing and of symbolic regression supported by genetic programming. The methodology exploits the potential of various machine-learning techniques and allows mathematical models in terms of dimensionless quantities to be extracted directly from the available dimensional databases. A battery of numerical tests and examples from fluid dynamics and thermonuclear fusion illustrate the advantages of the approach for statistical inference and for interpreting the large amounts of data produced by modern physics experiments and engineering studies.

Murari, A., Spolladore, L., Rossi, R., Gelfusa, M. (2023). Combining dimensional and statistical analysis for efficient data driven modelling of complex systems. INFORMATION SCIENCES, 644 [10.1016/j.ins.2023.119243].

Combining dimensional and statistical analysis for efficient data driven modelling of complex systems

R. Rossi; M. Gelfusa
2023-01-01

2023
Published
International relevance
Article
Anonymous expert reviewers
Sector PHYS-03/A - Experimental physics of matter and applications
Sector PHYS-04/A - Theoretical physics of matter, models, mathematical methods and applications
Sector IIND-07/C - Nuclear reactor physics
English
Data driven science; Dimensional analysis; Genetic programming; Statistical analysis; Symbolic regression; Thermonuclear fusion
Murari, A; Spolladore, L; Rossi, R; Gelfusa, M
Journal article

Use this identifier to cite or link to this document: https://hdl.handle.net/2108/447027