Load balancing policies with server-side cancellation of replicas

10/26/2020
by   Rooji Jinan, et al.
0

Popular dispatching policies such as the join shortest queue (JSQ), join smallest work (JSW) and their power of two variants are used in load balancing systems where the instantaneous queue length or workload information at all queues or a subset of them can be queried. In situations where the dispatcher has an associated memory, one can minimize this query overhead by maintaining a list of idle servers to which jobs can be dispatched. Recent alternative approaches that do not require querying such information include the cancel on start and cancel on complete based replication policies. The downside of such policies however is that the servers must communicate the start or completion of each service to the dispatcher and must allow cancellation of redundant copies. In this work, we consider a load balancing environment where the dispatcher cannot query load information, does not have a memory, and cannot cancel any replica that it may have created. In such a rigid environment, we allow the dispatcher to possibly append a server side cancellation criteria to each job or its replica. A job or a replica is served only if it satisfies the predefined criteria at the time of service. We focus on a criteria that is based on the waiting time experienced by a job or its replica and analyze several variants of this policy based on the assumption of asymptotic independence of queues. The proposed policies are novel and perform remarkably well in spite of the rigid operating constraints.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/16/2020

Improved Load Balancing in Large Scale Systems using Attained Service Time Reporting

Our interest lies in load balancing jobs in large scale systems consisti...
research
08/20/2020

Optimal Load Balancing in Bipartite Graphs

Applications in cloud platforms motivate the study of efficient load bal...
research
02/17/2020

Performance Analysis of Load Balancing Policies with Memory

Joining the shortest or least loaded queue among d randomly selected que...
research
04/02/2020

Heavy Traffic Analysis of the Mean Response Time for Load Balancing Policies in the Mean Field Regime

Mean field models are a popular tool used to analyse load balancing poli...
research
06/24/2020

Scalable Load Balancing in the Presence of Heterogeneous Servers

Heterogeneity is becoming increasingly ubiquitous in modern large-scale ...
research
05/27/2020

Threshold-based rerouting and replication for resolving job-server affinity relations

We consider a system with several job types and two parallel server pool...
research
12/10/2021

A General "Power-of-d" Dispatching Framework for Heterogeneous Systems

Intelligent dispatching is crucial to obtaining low response times in la...

Please sign up or login with your details

Forgot password? Click here to reset