Adaptive Learning of Aggregate Analytics under Dynamic Workloads

08/13/2019
by   Fotis Savva, et al.
0

Large organizations have seamlessly incorporated data-driven decision making in their operations. However, as data volumes increase, expensive big data infrastructures are called to rescue. In this setting, analytics tasks become very costly in terms of query response time, resource consumption, and money in cloud deployments, especially when base data are stored across geographically distributed data centers. Therefore, we introduce an adaptive Machine Learning mechanism which is light-weight, stored client-side, can estimate the answers of a variety of aggregate queries and can avoid the big data backend. The estimations are performed in milliseconds are inexpensive and accurate as the mechanism learns from past analytical-query patterns. However, as analytic queries are ad-hoc and analysts' interests change over time we develop solutions that can swiftly and accurately detect such changes and adapt to new query patterns. The capabilities of our approach are demonstrated using extensive evaluation with real and synthetic datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/14/2020

ML-AQP: Query-Driven Approximate Query Processing based on Machine Learning

As more and more organizations rely on data-driven decision making, larg...
research
12/18/2022

GAN-based Tabular Data Generator for Constructing Synopsis in Approximate Query Processing: Challenges and Solutions

In data-driven systems, data exploration is imperative for making real-t...
research
02/21/2019

An IDEA: An Ingestion Framework for Data Enrichment in AsterixDB

Big Data today is being generated at an unprecedented rate from various ...
research
02/04/2020

Providing Insights for Queries affected by Failures and Stragglers

Interactive time responses are a crucial requirement for users analyzing...
research
10/12/2020

PolyFrame: A Retargetable Query-based Approach to Scaling DataFrames (Extended Version)

In the last few years, the field of data science has been growing rapidl...
research
12/29/2018

Explaining Aggregates for Exploratory Analytics

Analysts wishing to explore multivariate data spaces, typically pose que...
research
05/23/2020

Implementation of Self-Organizing Network (SON) on Cellular Technology base on Big Data Analytic

The development of cellular technology will be directly proportional to ...

Please sign up or login with your details

Forgot password? Click here to reset