Beyond Roll-Up's and Drill-Down's: An Intentional Analytics Model to Reinvent OLAP (long-version)

12/19/2018
by Panos Vassiliadis, et al.

This paper structures a novel vision for OLAP by fundamentally redefining several of the pillars on which OLAP has been based for the last 20 years. We redefine OLAP queries in order to move to a higher degree of abstraction than roll-ups and drill-downs, and we propose a set of novel intentional OLAP operators, namely describe, assess, explain, predict, and suggest, which express the user's need for results. We fundamentally redefine what a query answer is and escape the constraint that the answer is a set of tuples; instead, we complement the set of tuples with models (typically, but not exclusively, results of data mining algorithms over the involved data) that concisely represent the internal structure or correlations of the data. Because the involved models are diverse in nature, we introduce (for the first time, to the best of our knowledge) a unifying framework for them, built on extending each data cell of a cube with information about the models that pertain to it; in practice, this converts the small parts that build up the models into data that annotate each cell. We exploit this data-to-model mapping to provide highlights of the data, by isolating data and models that maximize the delivery of new information to the user. We introduce a new interestingness measure for assessing the surprise that a new query result brings to the user, with respect to the information contained in the previous results the user has seen. The individual parts of our proposal are integrated in a new data model for OLAP, which we call the Intentional Analytics Model. We complement our contribution with a list of significant open problems for the community to address.
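The idea of annotating each cube cell with the parts of a model that pertain to it can be illustrated with a minimal sketch. It is not the paper's formalism: the `Cell` class, the `annotate_with_zscore` and `highlights` functions, the `"outlier"` annotation key, and the z-score threshold are all hypothetical names and choices made for illustration, and a simple z-score outlier test stands in for whatever mining model an actual system would use.

```python
from dataclasses import dataclass, field
from statistics import mean, pstdev


@dataclass
class Cell:
    """One cube cell: its coordinates, its measure, and model annotations."""
    coords: tuple                     # e.g. (month, product); hypothetical layout
    measure: float
    annotations: dict = field(default_factory=dict)  # model name -> component


def annotate_with_zscore(cells, threshold=1.5):
    """Attach a per-cell component of a simple outlier model (assumed stand-in)."""
    values = [c.measure for c in cells]
    mu, sigma = mean(values), pstdev(values)
    for c in cells:
        # Guard against a zero spread, where every z-score is defined as 0.
        z = 0.0 if sigma == 0 else (c.measure - mu) / sigma
        c.annotations["outlier"] = {"z": z, "is_outlier": abs(z) > threshold}
    return cells


def highlights(cells):
    """Return the cells whose model annotation marks them as noteworthy."""
    return [c for c in cells if c.annotations.get("outlier", {}).get("is_outlier")]


cells = annotate_with_zscore([
    Cell(("Jan", "A"), 10.0),
    Cell(("Feb", "A"), 11.0),
    Cell(("Mar", "A"), 9.0),
    Cell(("Apr", "A"), 10.0),
    Cell(("May", "A"), 50.0),
])
print([c.coords for c in highlights(cells)])
```

In this sketch the model is stored as ordinary data next to the measure, so selecting highlights reduces to filtering cells on their annotations.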

research
12/31/2017

Users Constraints in Itemset Mining

Discovering significant itemsets is one of the fundamental problems in d...
research
05/27/2020

Should Answer Immediately or Wait for Further Information? A Novel Wait-or-Answer Task and Its Predictive Approach

Different people have different habits of describing their intents in co...
research
02/18/2019

Comparing Apples and Oranges: Measuring Differences between Exploratory Data Mining Results

Deciding whether the results of two different mining algorithms provide ...
research
02/18/2019

Comparing Apples and Oranges: Measuring Differences between Data Mining Results

Deciding whether the results of two different mining algorithms provide ...
research
04/25/2023

Bridging graph data models: RDF, RDF-star, and property graphs as directed acyclic graphs

Graph database users today face a choice between two technology stacks: ...
research
03/17/2022

A Cube Algebra with Comparative Operations: Containment, Overlap, Distance and Usability

In this paper, we provide a comprehensive rigorous modeling for multidim...
research
01/25/2018

Structuring Spreadsheets with the "Lish" Data Model

A spreadsheet is remarkably flexible in representing various forms of st...
