Checkpointing techniques for speculative parallel simulation of discrete event models have been widely studied in the literature. However, there has been a very marginal attempt to exploit operating system page-protection services, which have instead been largely exploited in the context of checkpointing for fault tolerance. In this article, we discuss how these services can effectively manage simulation models with large states and write-intensive events in zones of the state layout. In particular, we present a solution where the correlation of write operations on buddy pages in the state layout can be exploited to achieve effective incremental checkpointing support, which allows scaling down the costs of operating system services. Our solution does not require any instrumentation of the simulation application code and is usable on any Posix-compliant operating system. We also discuss its integration within the USE (Ultimate-Share-Everything) open-source speculative simulation package and report some experimental data for its assessment.

Marotta, R., Montesano, F., Pellegrini, A., Quaglia, F. (2023). Incremental Checkpointing of Large State Simulation Models with Write-Intensive Events via Memory Update Correlation on Buddy Pages. In 2023 IEEE/ACM 27th International Symposium on Distributed Simulation and Real Time Applications (DS-RT) (pp.40-47). IEEE [10.1109/ds-rt58998.2023.00014].

Incremental Checkpointing of Large State Simulation Models with Write-Intensive Events via Memory Update Correlation on Buddy Pages

Marotta, Romolo;Montesano, Federica;Pellegrini, Alessandro;Quaglia, Francesco
2023-01-01

Abstract

Checkpointing techniques for speculative parallel simulation of discrete event models have been widely studied in the literature. However, there has been a very marginal attempt to exploit operating system page-protection services, which have instead been largely exploited in the context of checkpointing for fault tolerance. In this article, we discuss how these services can effectively manage simulation models with large states and write-intensive events in zones of the state layout. In particular, we present a solution where the correlation of write operations on buddy pages in the state layout can be exploited to achieve effective incremental checkpointing support, which allows scaling down the costs of operating system services. Our solution does not require any instrumentation of the simulation application code and is usable on any Posix-compliant operating system. We also discuss its integration within the USE (Ultimate-Share-Everything) open-source speculative simulation package and report some experimental data for its assessment.
27th International Symposium on Distributed Simulation and Real Time Applications (DS-RT)
Singapore
2023
27
IEEE; ACM
Rilevanza internazionale
2023
Settore ING-INF/05
English
Intervento a convegno
Marotta, R., Montesano, F., Pellegrini, A., Quaglia, F. (2023). Incremental Checkpointing of Large State Simulation Models with Write-Intensive Events via Memory Update Correlation on Buddy Pages. In 2023 IEEE/ACM 27th International Symposium on Distributed Simulation and Real Time Applications (DS-RT) (pp.40-47). IEEE [10.1109/ds-rt58998.2023.00014].
Marotta, R; Montesano, F; Pellegrini, A; Quaglia, F
File in questo prodotto:
File Dimensione Formato  
Mar23.pdf

solo utenti autorizzati

Tipologia: Documento in Post-print
Licenza: Copyright dell'editore
Dimensione 348.7 kB
Formato Adobe PDF
348.7 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/363225
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact