DPA Load Balancer: Load balancing for Data Parallel Actor-based systems

08/02/2023
by   Ziheng Wang, et al.
0

In this project we explore ways to dynamically load balance actors in a streaming framework. This is used to address input data skew that might lead to stragglers. We continuously monitor actors' input queue lengths for load, and redistribute inputs among reducers using consistent hashing if we detect stragglers. To ensure consistent processing post-redistribution, we adopt an approach that uses input forwarding combined with a state merge step at the end of the processing. We show that this approach can greatly alleviate stragglers for skewed data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/14/2018

Distributed dynamic load balancing for task parallel programming

In this paper, we derive and investigate approaches to dynamically load ...
research
11/01/2022

Benchmarking Hashing Algorithms for Load Balancing in a Distributed Database Environment

Modern high load applications store data using multiple database instanc...
research
05/24/2021

DynaHash: Efficient Data Rebalancing in Apache AsterixDB (Extended Version)

Parallel shared-nothing data management systems have been widely used to...
research
08/23/2019

Revisiting Consistent Hashing with Bounded Loads

Dynamic load balancing lies at the heart of distributed caching. Here, t...
research
10/18/2019

DLB: Deep Learning Based Load Balancing

Load balancing mechanisms have been widely adopted by distributed platfo...
research
08/03/2020

Distributed Dispatching in the Parallel Server Model

With the rapid increase in the size and volume of cloud services and dat...
research
02/20/2019

Load-Balancing for Parallel Delaunay Triangulations

Computing the Delaunay triangulation (DT) of a given point set in R^D is...

Please sign up or login with your details

Forgot password? Click here to reset