Online Aggregation based Approximate Query Processing: A Literature Survey

04/14/2022
by   Pritom Saha Akash, et al.
0

In the current world, OLAP (Online Analytical Processing) is used intensively by modern organizations to perform ad hoc analysis of data, providing insight for better decision making. Thus, the performance for OLAP is crucial; however, it is costly to support OLAP for a large data-set. An approximate query process (AQP) was proposed to efficiently compute approximate values as close as to the exact answer. Existing AQP techniques can be categorized into two parts, online aggregation, and offline synopsis generation, each having its limitations and challenges. Online aggregation-based AQP progressively generates approximate results with some error estimates (i.e., confidence interval) until the processing of all data is done. In Offline synopsis generation-based AQP, synopses are generated offline using a-priori knowledge such as query workload or data statistics. Later, OLAP queries are answered using these synopses. This paper focuses on surveying only the online aggregation-based AQP. For this purpose, firstly, we discuss the research challenges in online aggregation-based AQP and summarize existing approaches to address these challenges. In addition, we also discuss the advantages and limitations of existing online aggregation mechanisms. Lastly, we discuss some research challenges and opportunities for further advancing online aggregation research. Our goal is for people to understand the current progress in the online aggregation-based AQP area and find new insights into it.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2018

Model-based Approximate Query Processing

Interactive visualizations are arguably the most important tool to explo...
research
06/14/2019

DeepSPACE: Approximate Geospatial Query Processing with Deep Learning

The amount of the available geospatial data grows at an ever faster pace...
research
02/01/2019

Incremental Techniques for Large-Scale Dynamic Query Processing

Many applications from various disciplines are now required to analyze f...
research
03/07/2023

A Step Toward Deep Online Aggregation (Extended Version)

For exploratory data analysis, it is often desirable to know what answer...
research
12/21/2022

Statistical Challenges in Online Controlled Experiments: A Review of A/B Testing Methodology

The rise of internet-based services and products in the late 1990's brou...
research
07/13/2018

Probabilistic Re-aggregation Algorithm [First Draft]

Spatial data about individuals or businesses is often aggregated over po...
research
08/27/2018

NNCubes: Learned Structures for Visual Data Exploration

Visual exploration of large multidimensional datasets has seen tremendou...

Please sign up or login with your details

Forgot password? Click here to reset