Coded Distributed Computing over Packet Erasure Channels

01/11/2019
by   Dong-Jun Han, et al.
0

Coded computation is a framework which provides redundancy in distributed computing systems to speed up largescale tasks. Although most existing works assume an error-free scenarios in a master-worker setup, the link failures are common in current wired/wireless networks. In this paper, we consider the straggler problem in coded distributed computing with link failures, by modeling the links between the master node and worker nodes as packet erasure channels. When the master fails to detect the received signal, retransmission is required for each worker which increases the overall run-time to finish the task. We first investigate the expected overall run-time in this setting using an (n; k) maximum distance separable (MDS) code. We obtain the lower and upper bounds on the latency in closed-forms and give guidelines to design MDS code depending on the packet erasure probability. Finally, we consider a setup where the number of retransmissions is limited due to the bandwidth constraint. By formulating practical optimization problems related to latency, transmission bandwidth and probability of successful computation, we obtain achievable performance curves as a function of packet erasure probability.

READ FULL TEXT

page 3

page 4

page 5

page 6

page 7

page 8

page 10

page 11

research
10/08/2019

Timely Distributed Computation with Stragglers

We consider a status update system in which the update packets need to b...
research
10/21/2020

Coded Computing for Master-Aided Distributed Computing Systems

We consider a MapReduce-type task running in a distributed computing mod...
research
05/03/2022

Private Matrix Multiplication From MDS-Coded Storage With Colluding Servers

In this paper, we study the two problems of Private and Secure Matrix Mu...
research
05/24/2018

Coded FFT and Its Communication Overhead

We propose a coded computing strategy and examine communication costs of...
research
07/01/2020

Distributed Linearly Separable Computation

This paper formulates a distributed computation problem, where a master ...
research
03/15/2018

Achieving Spatial Scalability for Coded Caching over Wireless Networks

The coded caching scheme proposed by Maddah-Ali and Niesen considers the...
research
02/27/2019

S-PRAC: Fast Partial Packet Recovery with Network Coding in Very Noisy Wireless Channels

Well-known error detection and correction solutions in wireless communic...

Please sign up or login with your details

Forgot password? Click here to reset