Putting Data Science Pipelines on the Edge

03/14/2021
by   Ali Akoglu, et al.
0

This paper proposes a composable "Just in Time Architecture" for Data Science (DS) Pipelines named JITA-4DS and associated resource management techniques for configuring disaggregated data centers (DCs). DCs under our approach are composable based on vertical integration of the application, middleware/operating system, and hardware layers customized dynamically to meet application Service Level Objectives (SLO - application-aware management). Thereby, pipelines utilize a set of flexible building blocks that can be dynamically and automatically assembled and re-assembled to meet the dynamic changes in the workload's SLOs. To assess disaggregated DC's, we study how to model and validate their performance in large-scale settings.

READ FULL TEXT
research
08/05/2021

JITA4DS: Disaggregated execution of Data Science Pipelines between the Edge and the Data Centre

This paper targets the execution of data science (DS) pipelines supporte...
research
08/20/2022

Graph analytics workflows enactment on just in time data centres, Position Paper

This paper discusses our vision of multirole-capable decision-making sys...
research
11/25/2021

Federated Data Science to Break Down Silos [Vision]

Similar to Open Data initiatives, data science as a community has launch...
research
04/04/2017

Tackling Diversity and Heterogeneity by Vertical Memory Management

Existing memory management mechanisms used in commodity computing machin...
research
05/19/2015

Towards Data-Driven Autonomics in Data Centers

Continued reliance on human operators for managing data centers is a maj...
research
03/06/2023

Data management and execution systems for the Rubin Observatory Science Pipelines

We present the Rubin Observatory system for data storage/retrieval and p...

Please sign up or login with your details

Forgot password? Click here to reset