Service meshes are becoming pivotal software frameworks for managing communication among microservices in distributed applications. Each microservice in a service mesh is paired with an L7 sidecar proxy, which intercepts incoming and outgoing requests to provide enhanced observability, traffic management, and security. These sidecar proxies use application-level load balancing policies to route outgoing requests to available replicas of destination microservices. A widely adopted policy is the Least Outstanding Request (LOR), which directs requests to the replica with the fewest outstanding requests. While LOR effectively reduces latency in applications with a small number of replicas, our comprehensive investigation- combining analytical, simulation, and experimental methods- uncovers a novel and critical issue for large-scale microservice applications: the performance of LOR significantly degrades as the number of microservice replicas increases, eventually converging to the performance of a random load balancing policy. To recover LOR performance at scale, we propose an opensource solution named Proxy-Service, tailored for microservice applications where load balancing incurs significantly lower resource demands than microservice execution. The core idea is to consolidate load balancing decisions per microservice into one or a few reverse proxies, transparently injected into the application.
Detti, A., Funari, L. (2025). Critical limitations of the Least Outstanding Request load balancing policy in service meshes for large-scale microservice applications. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 1-12 [10.1109/TNSM.2025.3593870].
Critical limitations of the Least Outstanding Request load balancing policy in service meshes for large-scale microservice applications
Andrea Detti
;Ludovico Funari
2025-01-01
Abstract
Service meshes are becoming pivotal software frameworks for managing communication among microservices in distributed applications. Each microservice in a service mesh is paired with an L7 sidecar proxy, which intercepts incoming and outgoing requests to provide enhanced observability, traffic management, and security. These sidecar proxies use application-level load balancing policies to route outgoing requests to available replicas of destination microservices. A widely adopted policy is the Least Outstanding Request (LOR), which directs requests to the replica with the fewest outstanding requests. While LOR effectively reduces latency in applications with a small number of replicas, our comprehensive investigation- combining analytical, simulation, and experimental methods- uncovers a novel and critical issue for large-scale microservice applications: the performance of LOR significantly degrades as the number of microservice replicas increases, eventually converging to the performance of a random load balancing policy. To recover LOR performance at scale, we propose an opensource solution named Proxy-Service, tailored for microservice applications where load balancing incurs significantly lower resource demands than microservice execution. The core idea is to consolidate load balancing decisions per microservice into one or a few reverse proxies, transparently injected into the application.| File | Dimensione | Formato | |
|---|---|---|---|
|
Critical_Limitations_of_the_Least_Outstanding_Request_Load_Balancing_Policy_in_Service_Meshes_for_Large-Scale_Microservice_Applications.pdf
solo utenti autorizzati
Tipologia:
Versione Editoriale (PDF)
Licenza:
Copyright dell'editore
Dimensione
1.55 MB
Formato
Adobe PDF
|
1.55 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


