Understanding Queries by Conditional Instances

by   Amir Gilad, et al.

A powerful way to understand a complex query is by observing how it operates on data instances. However, specific database instances are not ideal for such observations: they often include large amounts of superfluous details that are not only irrelevant to understanding the query but also cause cognitive overload; and one specific database may not be enough. Given a relational query, is it possible to provide a simple and generic "representative" instance that (1) illustrates how the query can be satisfied, (2) summarizes all specific instances that would satisfy the query in the same way by abstracting away unnecessary details? Furthermore, is it possible to find a collection of such representative instances that together completely characterize all possible ways in which the query can be satisfied? This paper takes initial steps towards answering these questions. We design what these representative instances look like, define what they stand for, and formalize what it means for them to satisfy a query in "all possible ways." We argue that this problem is undecidable for general domain relational calculus queries, and develop practical algorithms for computing a minimum collection of such instances subject to other constraints. We evaluate the efficiency of our approach experimentally, and show its effectiveness in helping users debug relational queries through a user study.


page 1

page 2

page 3

page 4


Paraconsistent logic and query answering in inconsistent databases

This paper concerns the paraconsistent logic LPQ^⊃,𝖥 and an application ...

A fine-grained framework for database repairs

We introduce a general abstract framework for database repairing that di...

Efficient Evaluation of Arbitrary Relational Calculus Queries

The relational calculus (RC) is a concise, declarative query language. H...

Explaining Wrong Queries Using Small Examples

For testing the correctness of SQL queries, e.g., evaluating student sub...

A Tutorial on Visual Representations of Relational Queries

Query formulation is increasingly performed by systems that need to gues...

PUG: A Framework and Practical Implementation for Why & Why-Not Provenance (extended version)

Explaining why an answer is (or is not) returned by a query is important...

A Non-Deterministic Multiset Query Language

We develop a multiset query and update language executable in a term rew...

Please sign up or login with your details

Forgot password? Click here to reset