A Framework for Energy-aware Evaluation of Distributed Data Processing Platforms in Edge-Cloud Environment

01/06/2022
by   Faheem Ullah, et al.
0

Distributed data processing platforms (e.g., Hadoop, Spark, and Flink) are widely used to distribute the storage and processing of data among computing nodes of a cloud. The centralization of cloud resources has given birth to edge computing, which enables the processing of data closer to the data source instead of sending it to the cloud. However, due to resource constraints such as energy limitations, edge computing cannot be used for deploying all kinds of applications. Therefore, tasks are offloaded from an edge device to the more resourceful cloud. Previous research has evaluated the energy consumption of the distributed data processing platforms in the isolated cloud and edge environments. However, there is a paucity of research on evaluating the energy consumption of these platforms in an integrated edge-cloud environment, where tasks are offloaded from a resource-constraint device to a resource-rich device. Therefore, in this paper, we first present a framework for the energy-aware evaluation of the distributed data processing platforms. We then leverage the proposed framework to evaluate the energy consumption of the three most widely used platforms (i.e., Hadoop, Spark, and Flink) in an integrated edge-cloud environment consisting of Raspberry Pi, edge node, edge server node, private cloud, and public cloud. Our evaluation reveals that (i) Flink is most energy-efficient followed by Spark and Hadoop is found least energy-efficient (ii) offloading tasks from resource-constraint to resource-rich devices reduces energy consumption by 55.2 and server are found key factors impacting the energy consumption.

READ FULL TEXT

page 1

page 6

page 9

research
10/30/2017

Device-centric Energy Optimization for Edge Cloud Offloading

A wireless system is considered, where, computationally complex algorith...
research
09/01/2023

Co-Tuning of Cloud Infrastructure and Distributed Data Processing Platforms

Distributed Data Processing Platforms (e.g., Hadoop, Spark, and Flink) a...
research
11/19/2021

Edge Computing vs Centralized Cloud: Impact of Communication Latency on the Energy Consumption of LTE Terminal Nodes

Edge computing brings several advantages, such as reduced latency, incre...
research
09/14/2021

Complexity-aware Adaptive Training and Inference for Edge-Cloud Distributed AI Systems

The ubiquitous use of IoT and machine learning applications is creating ...
research
02/24/2019

Image Classification on IoT Edge Devices: Profiling and Modeling

With the advent of powerful, low-cost IoT systems, processing data close...
research
02/21/2023

Dynamic Resource Partitioning for Multi-Tenant Systolic Array Based DNN Accelerator

Deep neural networks (DNN) have become significant applications in both ...
research
10/31/2020

An edge-based architecture to support the execution of ambience intelligence tasks using the IoP paradigm

In an IoP environment, edge computing has been proposed to address the p...

Please sign up or login with your details

Forgot password? Click here to reset