Dependency Weighted Aggregation on Factorized Databases

01/11/2019
by Florent Capelli, et al.

We study a new class of aggregation problems, called dependency weighted aggregation. The underlying idea is to aggregate the answer tuples of a query while accounting for dependencies between them, where two tuples are considered dependent when they have the same value on some attribute. Our main problem is computing the dependency weighted count of a conjunctive query. This aggregate can be seen as a form of weighted counting, where the weights of the answer tuples are computed by solving a linear program. The linear program ensures that dependent tuples are not overrepresented in the final weighted count. The dependency weighted count can be used to compute the s-measure, a measure used in data mining to estimate the frequency of a pattern in a graph database. Computing the dependency weighted count of a conjunctive query is NP-hard in general. In this paper, we show that the problem is nevertheless tractable for a large class of structurally restricted conjunctive queries, such as acyclic queries or queries of bounded hypertree width. Our algorithm works on a factorized representation of the answer set, which avoids enumerating the answers exhaustively. Our technique produces a succinct representation of the weighting of the answers. It can also be used to solve other dependency weighted aggregation tasks, such as computing the (dependency) weighted average of the value of an attribute in the answer set.
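
To make the linear program concrete, here is one plausible formulation, written as a sketch rather than the paper's exact definition. The abstract does not spell the program out, so the symbols are assumptions: Q(D) is the answer set of the query Q on the database D, w_t is the weight of the answer tuple t, and t(x) is the value of t on the attribute x.

```latex
% Assumed formulation: one constraint per (attribute, value) pair, so that
% the tuples sharing a value on an attribute have total weight at most 1.
\begin{aligned}
\text{maximize}\quad   & \sum_{t \in Q(D)} w_t \\
\text{subject to}\quad & \sum_{\substack{t \in Q(D) \\ t(x) = d}} w_t \le 1
                         \quad \text{for every attribute } x \text{ and value } d, \\
                       & w_t \ge 0 \quad \text{for every } t \in Q(D).
\end{aligned}
```

Under this reading, take two answers (a, b) and (a, c) over the attributes x and y. They agree on x, so their weights must sum to at most 1. The optimum is therefore 1, whereas the plain count is 2.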
