Dependability Evaluation of Middleware Technology for Large-scale Distributed Caching

08/16/2020
by Domenico Cotroneo, et al.

Distributed caching systems (e.g., Memcached) are widely used by service providers to serve requests from millions of concurrent clients. Given their large scale, modern distributed systems rely on a middleware layer to manage caching nodes, to make applications easier to develop, and to apply load balancing and replication strategies. In this work, we performed a dependability evaluation of three popular middleware platforms, namely Twemproxy by Twitter, Mcrouter by Facebook, and Dynomite by Netflix, to assess their availability and performance under faults, including failures of Memcached nodes and congestion caused by unbalanced workloads and network link bandwidth bottlenecks. We point out the different availability and performance trade-offs achieved by the three platforms, and identify scenarios in which a few faulty components cause cascading failures of the whole distributed system.
