Data Placement for Multi-Tenant Data Federation on the Cloud

by Ji Liu, et al.

Privacy concerns among users, together with the enforcement of data security and privacy laws, make it increasingly difficult to share data among organizations. Data federation opens new opportunities for data-related cooperation among organizations by providing abstract data interfaces. With the development of cloud computing, organizations store data on the cloud to achieve elasticity and scalability for data processing. Existing data placement approaches generally consider only one objective, either execution time or monetary cost, and do not consider data partitioning under hard constraints. In this paper, we propose an approach that enables data processing on the cloud with data from different organizations. The approach consists of a data federation platform named FedCube and a Lyapunov-based data placement algorithm. FedCube enables data processing on the cloud. The data placement algorithm creates a plan to partition and store data on the cloud so as to achieve multiple objectives while satisfying the constraints, based on a multi-objective cost model. The cost model is composed of two objectives: reducing monetary cost and reducing execution time. An experimental evaluation shows that the proposed algorithm significantly reduces the total cost (by up to 69.8%) compared with existing approaches.
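
To make the idea of a two-objective cost model concrete, the following is a minimal sketch, not the paper's algorithm or cost model. It assumes a simple normalized weighted sum of execution time and monetary cost, and it treats a hard constraint as a set of storage tiers a dataset is allowed to use. All names (StorageTier, total_cost, place), the weights, and the reference values are hypothetical and chosen only for illustration; the Lyapunov-based optimization itself is not reproduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StorageTier:
    """Hypothetical storage option; the fields are illustrative assumptions."""
    name: str
    price_per_gb: float    # monetary cost of storing one GB
    seconds_per_gb: float  # time needed to read one GB during execution


def total_cost(size_gb, tier, w_time, w_money, ref_time=1.0, ref_money=1.0):
    """Weighted sum of normalized execution time and monetary cost (assumed form)."""
    time_cost = size_gb * tier.seconds_per_gb / ref_time
    money_cost = size_gb * tier.price_per_gb / ref_money
    return w_time * time_cost + w_money * money_cost


def place(size_gb, tiers, allowed, w_time=0.5, w_money=0.5):
    """Pick the cheapest tier among those permitted by the hard constraint."""
    candidates = [t for t in tiers if t.name in allowed]
    if not candidates:
        raise ValueError("no storage tier satisfies the hard constraint")
    return min(candidates, key=lambda t: total_cost(size_gb, t, w_time, w_money))
```

With a fast, expensive tier and a slow, cheap one, a large time weight favors the fast tier and a large money weight favors the cheap one, while restricting the allowed set forces the choice regardless of the weights.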

