Modern parallel/distributed simulations can produce large amounts of data. The historical approach of performing analyses at the end of the simulation is unlikely to cope with modern, extremely large-scale analytics jobs. Indeed, the I/O subsystem can quickly become the global bottleneck. Similarly, processing on-the-fly the data produced by simulations can significantly impair the performance in terms of computational capacity and network load. We present a methodology and reference architecture for constructing an autonomic control system to determine at runtime the best placement for data processing (on simulation nodes or a set of external nodes). This allows for a good tradeoff between the load on the simulation's critical path and the data communication system. Our preliminary experimentation shows that autonomic orchestration is crucial to improve the global performance of a data analysis system, especially when the simulation node's rate of data production varies during simulation.

Du, X., Meng, Z., Siguenza-Torres, A., Knoll, A., Pimpini, A., Piccione, A., et al. (2023). Autonomic orchestration of in-situ and in-transit data analytics for simulation studies. In 2023 Winter Simulation Conference (WSC) (pp.781-792). IEEE [10.1109/wsc60868.2023.10408191].

Autonomic orchestration of in-situ and in-transit data analytics for simulation studies

Pellegrini, Alessandro
2023-01-01

Abstract

Modern parallel/distributed simulations can produce large amounts of data. The historical approach of performing analyses at the end of the simulation is unlikely to cope with modern, extremely large-scale analytics jobs. Indeed, the I/O subsystem can quickly become the global bottleneck. Similarly, processing on-the-fly the data produced by simulations can significantly impair the performance in terms of computational capacity and network load. We present a methodology and reference architecture for constructing an autonomic control system to determine at runtime the best placement for data processing (on simulation nodes or a set of external nodes). This allows for a good tradeoff between the load on the simulation's critical path and the data communication system. Our preliminary experimentation shows that autonomic orchestration is crucial to improve the global performance of a data analysis system, especially when the simulation node's rate of data production varies during simulation.
Winter Simulation Conference
San Antonio, TX, USA
2023
Rilevanza internazionale
2023
Settore ING-INF/05
English
Intervento a convegno
Du, X., Meng, Z., Siguenza-Torres, A., Knoll, A., Pimpini, A., Piccione, A., et al. (2023). Autonomic orchestration of in-situ and in-transit data analytics for simulation studies. In 2023 Winter Simulation Conference (WSC) (pp.781-792). IEEE [10.1109/wsc60868.2023.10408191].
Du, X; Meng, Z; Siguenza-Torres, A; Knoll, A; Pimpini, A; Piccione, A; Bortoli, S; Pellegrini, A
File in questo prodotto:
File Dimensione Formato  
DuX23.pdf

solo utenti autorizzati

Tipologia: Documento in Post-print
Licenza: Copyright dell'editore
Dimensione 3.8 MB
Formato Adobe PDF
3.8 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/363226
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact