FatPaths: Routing in Supercomputers, Data Centers, and Clouds with Low-Diameter Networks when Shortest Paths Fall Short

06/26/2019
by Maciej Besta, et al.

We introduce FatPaths: a simple, generic, and robust routing architecture for Ethernet stacks. FatPaths enables state-of-the-art low-diameter topologies such as Slim Fly to achieve unprecedented performance, targeting HPC supercomputers as well as the data centers and clusters used in cloud computing. FatPaths exposes and exploits the rich ("fat") diversity of both minimal and non-minimal paths for high-performance multi-pathing. Moreover, FatPaths features a redesigned "purified" transport layer, based on recent advances in data center networking, that removes virtually all TCP performance issues (e.g., the slow start). FatPaths also uses flowlet switching, a technique used to prevent packet reordering in TCP networks, to enable very simple and effective load balancing. Our design enables recent low-diameter topologies to outperform powerful Clos designs, achieving 15% higher net throughput at 2x lower latency for comparable cost. FatPaths will significantly accelerate Ethernet clusters that form more than 50% of the Top500 list, and it may become a standard routing scheme for modern topologies.
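The flowlet switching mentioned above can be illustrated with a minimal sketch. This is not the FatPaths implementation. The class name `FlowletBalancer`, the `gap` threshold, the path labels, and the round-robin path choice are all assumptions made for illustration; real designs typically choose the next path randomly or by load. The idea shown is only the general one: packets of a flow that arrive close together stay on one path, which avoids reordering, and a new path may be chosen once the flow has been idle for longer than the gap.

```python
from dataclasses import dataclass, field


@dataclass
class FlowletBalancer:
    """Toy flowlet-based load balancer (illustrative, hypothetical names)."""

    paths: list          # candidate paths, e.g. minimal and non-minimal ones
    gap: float           # idle time (seconds) that ends a flowlet; assumed value
    _next: int = 0       # round-robin cursor (an assumption, chosen for determinism)
    _state: dict = field(default_factory=dict)  # flow_id -> (last_seen, path)

    def route(self, flow_id, now):
        """Return the path for a packet of `flow_id` arriving at time `now`."""
        entry = self._state.get(flow_id)
        if entry is None or now - entry[0] > self.gap:
            # First packet of the flow, or the flow was idle longer than `gap`:
            # a new flowlet starts, so switching paths cannot reorder packets.
            path = self.paths[self._next % len(self.paths)]
            self._next += 1
        else:
            # Same flowlet: keep the current path to preserve packet order.
            path = entry[1]
        self._state[flow_id] = (now, path)
        return path


lb = FlowletBalancer(paths=["minimal", "non_minimal_a", "non_minimal_b"], gap=0.5)
burst1 = [lb.route("flow-1", t) for t in (0.0, 0.1, 0.2)]  # one flowlet
burst2 = [lb.route("flow-1", t) for t in (1.0, 1.1)]       # after an idle gap
```

In this sketch the first burst stays on a single path, and the second burst, which follows an idle period longer than `gap`, is moved to the next path as a whole.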

