On Optimal Trees for Irregular Gather and Scatter Collectives

11/23/2017
by   Jesper Larsson Träff, et al.
0

This paper studies the complexity of finding cost-optimal communication trees for rooted, irregular gather and scatter collective communication operations in fully connected, one-ported communication networks under a linear, but not necessarily homogeneous transmission cost model. In the irregular gather and scatter problems, different processors may specify data blocks of possibly different sizes. Processors are numbered consecutively, and data blocks shall be collected or distributed from some (given) root processor. Data blocks from and to processors can be combined into larger segments consisting of multiple blocks; but individual data blocks cannot be split. We distinguish between ordered and unordered problems and algorithms. In an ordered, irregular gather tree algorithm all non-leaf processors receive segments of data blocks for consecutively numbered ranges of processors. An unordered tree algorithm permits received segments to consist of blocks for processors in any order. We show that the ordered problems can be solved in polynomial time, and give simple dynamic programming algorithms to construct optimal communication trees for both gather and scatter problems. In contrast, we show that the unordered problems are NP-complete. We have implemented the dynamic programming algorithms to experimentally evaluate the quality of a recent, simple, distributed algorithm which constructs close to optimal trees. Our experiments show that it is indeed very close to the optimum for a selection of data block distributions, and likely sufficient for all practical purposes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/20/2022

(Poly)Logarithmic Time Construction of Round-optimal n-Block Broadcast Schedules for Broadcast and irregular Allgather in MPI

We give a fast(er), communication-free, parallel construction of optimal...
research
09/26/2021

A Doubly-pipelined, Dual-root Reduction-to-all Algorithm and Implementation

We discuss a simple, binary tree-based algorithm for the collective allr...
research
07/13/2022

Four-splitting based coarse-grained multicomputer parallel algorithm for the optimal binary search tree problem

This paper presents a parallel solution based on the coarse-grained mult...
research
08/27/2020

k-ported vs. k-lane Broadcast, Scatter, and Alltoall Algorithms

In k-ported message-passing systems, a processor can simultaneously rece...
research
12/15/2017

New Algorithms for Unordered Tree Inclusion

The tree inclusion problem is, given two node-labeled trees P and T (the...
research
01/12/2019

On Huang and Wong's Algorithm for Generalized Binary Split Trees

Huang and Wong [5] proposed a polynomial-time dynamic-programming algori...
research
10/24/2019

Cutting an alignment with Ockham's razor

In this article, we investigate different parsimony-based approaches tow...

Please sign up or login with your details

Forgot password? Click here to reset