Efficient Contour Computation of Group-based Skyline

by   Wenhui Yu, et al.

Skyline, aiming at finding a Pareto optimal subset of points in a multi-dimensional dataset, has gained great interest due to its extensive use for multi-criteria analysis and decision making. The skyline consists of all points that are not dominated by any other points. It is a candidate set of the optimal solution, which depends on a specific evaluation criterion for optimum. However, conventional skyline queries, which return individual points, are inadequate in group querying case since optimal combinations are required. To address this gap, we study the skyline computation in the group level and propose efficient methods to find the Group-based skyline (G-skyline). For computing the front l skyline layers, we lay out an efficient approach that does the search concurrently on each dimension and investigates each point in the subspace. After that, we present a novel structure to construct the G-skyline with a queue of combinations of the first-layer points. We further demonstrate that the G-skyline is a complete candidate set of top-l solutions, which is the main superiority over previous group-based skyline definitions. However, as G-skyline is complete, it contains a large number of groups which can make it impractical. To represent the "contour" of the G-skyline, we define the Representative G-skyline (RG-skyline). Then, we propose a Group-based clustering (G-clustering) algorithm to find out RG-skyline groups. Experimental results show that our algorithms are several orders of magnitude faster than the previous work.


page 4

page 5

page 7

page 8

page 9

page 10

page 11

page 15


Fair clustering via equitable group representations

What does it mean for a clustering to be fair? One popular approach seek...

SkyLens: Visual Analysis of Skyline on Multi-dimensional Data

Skyline queries have wide-ranging applications in fields that involve mu...

A shortest-path based clustering algorithm for joint human-machine analysis of complex datasets

Clustering is a technique for the analysis of datasets obtained by empir...

Group Decision Support for agriculture planning by a combination of Mathematical Model and Collaborative Tool

Decision making in the Agriculture domain can be a complex task. The lan...

Neutrosophic soft sets with applications in decision making

We firstly present definitions and properties in study of Maji maji-2013...

SkyCell: A Space-Pruning Based Parallel Skyline Algorithm

Skyline computation is an essential database operation that has many app...

Crowdsourcing Pareto-Optimal Object Finding by Pairwise Comparisons

This is the first study on crowdsourcing Pareto-optimal object finding, ...

Please sign up or login with your details

Forgot password? Click here to reset