Sampling policies for near-optimal device choice in parallel simulations on CPU/GPU platforms


Heterogeneous hardware platforms composed of CPUs, GPUs, and other accelerators offer the opportunity to choose the best-suited device for executing a given scientific simulation in order to minimize execution time and energy consumption. To this end, the recently proposed "Follow the Leader" approach dynamically selects a suitable device based on runtime performance measurements during speculative discrete-event simulations. A currently active "leader" device is periodically challenged by a "follower" device in order to negotiate the new leader. The optimality of the device choices and the associated overhead depend critically on the challenge frequency and timing. Here, we explore policies to schedule challenges with the goal of attaining Pareto-optimal combinations of execution time and energy consumption. Several heuristics are first evaluated in an abstract fashion using a "meta-simulation" that mimics the progress and energy consumption of an idealized co-execution. In this setting, we optimize the heuristics' tuning parameters to assess their relative merits in near-optimal configurations when compared to challenge timings based on perfect knowledge. We find that under challenging stochastic workloads based on a class of mean-reverting random walks, the best heuristics can closely approximate the execution time and energy consumption achievable under an optimal device choice. Empirical support for this observation is given by measurements of a CPU/GPU co-execution of the Time Warp algorithm on physical hardware.
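The idea of a meta-simulation can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the heuristics, the cost model, or any parameter values, so everything here is an assumption made for illustration. The sketch assumes two devices whose processing rates follow a discrete mean-reverting random walk, a fixed-period challenge policy (hypothetical; only one of many possible schedules), a challenge cost modeled as both devices drawing power for one step, and a time-optimal oracle with perfect knowledge as the baseline. The function names, power figures, and walk parameters are all hypothetical.

```python
import random


def mean_reverting_walk(n, mean, theta, sigma, rng, floor=0.05):
    """Discrete mean-reverting random walk, clamped to stay positive."""
    x, out = mean, []
    for _ in range(n):
        # Pull the value back toward the mean, then add Gaussian noise.
        x += theta * (mean - x) + rng.gauss(0.0, sigma)
        x = max(x, floor)
        out.append(x)
    return out


def simulate(rates, power, period):
    """Process one unit of work per step on the current leader.

    Every `period` steps a challenge takes place: both devices run the
    step (assumed cost model), and the faster one becomes the leader.
    period=None means the initial leader (device 0) is never challenged.
    Returns (total_time, total_energy).
    """
    leader, time, energy = 0, 0.0, 0.0
    for t in range(len(rates[0])):
        if period is not None and t % period == 0:
            fastest = 0 if rates[0][t] >= rates[1][t] else 1
            dt = 1.0 / rates[fastest][t]
            energy += (power[0] + power[1]) * dt  # both devices active
            leader = fastest
        else:
            dt = 1.0 / rates[leader][t]
            energy += power[leader] * dt
        time += dt
    return time, energy


def oracle(rates, power):
    """Perfect knowledge: always run on the faster device, at no challenge cost."""
    time = energy = 0.0
    for r0, r1 in zip(*rates):
        best = 0 if r0 >= r1 else 1
        dt = 1.0 / (r0, r1)[best]
        time += dt
        energy += power[best] * dt
    return time, energy


rng = random.Random(42)
# Hypothetical workload: work units per time unit for device 0 and device 1.
rates = [mean_reverting_walk(1000, 1.0, 0.05, 0.1, rng),
         mean_reverting_walk(1000, 1.0, 0.05, 0.1, rng)]
power = (100.0, 250.0)  # hypothetical power draw of each device

print("oracle       ", oracle(rates, power))
for period in (1, 10, 100, None):
    print(f"period={period!s:<6}", simulate(rates, power, period))
```

Under these assumptions, challenging at every step matches the oracle's execution time but pays for two active devices throughout, while never challenging avoids that overhead but cannot react when the other device becomes faster. Sweeping the period traces out one possible time/energy trade-off of the kind the abstract describes.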

Andelfinger, P., Pellegrini, A., Marotta, R. (2024). Sampling policies for near-optimal device choice in parallel simulations on CPU/GPU platforms. In 2024 28th International Symposium on Distributed Simulation and Real Time Applications (DS-RT) (pp. 101-109). New York: IEEE [10.1109/DS-RT62209.2024.00023].


Andelfinger, Philipp; Pellegrini, Alessandro; Marotta, Romolo

2024-01-01



Conference name: International Symposium on Distributed Simulation and Real Time Applications
Conference location: Urbino (Italy)
Conference year: 2024
Conference number: 28
Conference relevance: International
Publication date: 2024
DOI: https://dx.doi.org/10.1109/DS-RT62209.2024.00023
Disciplinary sector (valid from 09/05/2024): IINF-05/A - Sistemi di elaborazione delle informazioni (Information processing systems)
Content language: English
Keywords: GPU-based simulation; Time Warp; Heterogeneous hardware; Discrete-event simulation
Type: Conference contribution
All authors: Andelfinger, P; Pellegrini, A; Marotta, R
Appears in types: 02 - Conference contribution

Files in this item:

File	Size	Format
And24.pdf (authorized users only; type: post-print document; license: publisher's copyright)	376.42 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2108/453448
