Processing data in a timely manner, data stream processing (DSP) applications are receiving an increasing interest for building new pervasive services. Due to the unpredictability of data sources, these applications often operate in dynamic environments; therefore, they require the ability to elastically scale in response to workload variations. In this paper, we deal with a key problem for the effective runtime management of a DSP application in geo-distributed environments: We investigate the placement and replication decisions while considering the application and resource heterogeneity and the migration overhead, so to select the optimal adaptation strategy that can minimize migration costs while satisfying the application quality of service (QoS) requirements. We present elastic DSP replication and placement (EDRP), a unified framework for the QoS-aware initial deployment and runtime elasticity management of DSP applications. In EDRP, the deployment and runtime decisions are driven by the solution of a suitable integer linear programming problem, whose objective function captures the relative importance between QoS goals and reconfiguration costs. We also present the implementation of EDRP and the related mechanisms on Apache Storm. We conduct a thorough experimental evaluation, both numerical and prototype-based, that shows the benefits achieved by EDRP on the application performance.
Cardellini, V., Lo Presti, F., Nardelli, M., & Russo Russo, G. (2018). Optimal operator deployment and replication for elastic distributed data stream processing. CONCURRENCY AND COMPUTATION, 30(9).
|Tipologia:||Articolo su rivista|
|Citazione:||Cardellini, V., Lo Presti, F., Nardelli, M., & Russo Russo, G. (2018). Optimal operator deployment and replication for elastic distributed data stream processing. CONCURRENCY AND COMPUTATION, 30(9).|
|IF:||Con Impact Factor ISI|
|Settore Scientifico Disciplinare:||Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni|
|Revisione (peer review):||Esperti anonimi|
|Digital Object Identifier (DOI):||http://dx.doi.org/10.1002/cpe.4334|
|Stato di pubblicazione:||Pubblicato|
|Data di pubblicazione:||mag-2018|
|Titolo:||Optimal operator deployment and replication for elastic distributed data stream processing|
|Autori:||Cardellini, V; Lo Presti, F; Nardelli, M; Russo Russo, G|
|Appare nelle tipologie:||01 - Articolo su rivista|