Progressive Evaluation of Queries over Tagged Data

by Dhrubajyoti Ghosh, et al.

Modern information systems often collect raw data in the form of text, images, video, and sensor readings. Such data must be interpreted, or enriched, before it can be analyzed. Enrichment is typically performed by automated machine learning and/or signal processing techniques that associate appropriate but uncertain tags with the data. Traditionally, with the notable exception of a few systems, enrichment has been treated as a separate pre-processing step performed independently of, and prior to, data analysis. This approach is becoming increasingly infeasible: modern data capture technologies enable the creation of very large data collections for which deriving all tags in advance is computationally difficult or impossible and ultimately not beneficial. Hence, approaches that perform tagging at query or analysis time, on only the data of interest, need to be considered. This paper explores the problem of joint tagging and query processing. In particular, it considers a scenario in which tagging can be performed using several techniques that differ in cost and accuracy, and it develops a progressive approach to answering Select-Project-Join (SPJ) queries (with a restricted version of the join predicates) that enriches the right data to the right degree so as to maximize the quality of the query results. The experimental results show that the proposed approach performs significantly better than baseline approaches.
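To make the scenario concrete, the following is a minimal sketch of progressive, query-time tagging for a single selection predicate. It is an illustration only and not the algorithm developed in the paper: the greedy benefit-per-cost score, the belief update rule, and every name in it (`Tagger`, `Item`, `uncertainty`, `progressive_select`, `run_tagger`) are assumptions introduced here.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Tagger:
    """A hypothetical tagging technique with a cost and a quality weight."""
    name: str
    cost: float     # assumed cost units per tagged object
    quality: float  # assumed weight in (0, 1] given to this tagger's output


@dataclass
class Item:
    """An object whose tag (does it satisfy the predicate?) is uncertain."""
    key: str
    prob: float = 0.5                       # current belief that it satisfies the predicate
    applied: set = field(default_factory=set)  # names of taggers already run on it


def uncertainty(p):
    """1.0 when p == 0.5 (most uncertain), 0.0 when p is 0 or 1."""
    return 1.0 - abs(2.0 * p - 1.0)


def progressive_select(items, taggers, run_tagger, budget, threshold=0.5):
    """Greedy sketch: repeatedly run the (item, tagger) pair with the best
    estimated benefit per unit cost, and yield the answer set after each step."""
    spent = 0.0
    while True:
        candidates = [
            (uncertainty(it.prob) * t.quality / t.cost, it, t)
            for it in items
            for t in taggers
            if t.name not in it.applied and spent + t.cost <= budget
        ]
        if not candidates:
            break
        _, it, t = max(candidates, key=lambda c: c[0])
        output = run_tagger(t, it)  # tagger's probability that the tag holds
        # Assumed update rule: blend the old belief with the new output.
        it.prob = (1.0 - t.quality) * it.prob + t.quality * output
        it.applied.add(t.name)
        spent += t.cost
        yield spent, [i.key for i in items if i.prob > threshold]
```

Each yielded pair gives the cost spent so far and the current answer set, so a caller can stop at any point and keep the best answer produced within the budget. A real system would replace `run_tagger` with calls to actual tagging functions and would need a principled quality metric in place of the simple score assumed here.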



