This paper introduces a new data analysis method for big data using a ne...
In plenty of data analysis tasks, a basic and time-consuming process is ...
The topology-aware Massively Parallel Computation (MPC) model is propose...
We consider the problem of approximate sorting in I/O model. The goal of...
Multi-criteria decision-making often requires finding a small representa...
In this paper we propose the PCP-like theorem for sub-linear time
inappr...
Nearest Neighbor Search (NNS) over generalized weighted distance is
fund...
Nearest neighbor search is fundamental to a wide range of applications. ...
In this paper we propose the PCP-like theorem for sub-linear time
inappr...
The existing algorithms for processing skyline queries cannot adapt to b...
In this paper we propose an algorithm for the approximate k-Nearest-Neig...
Nowadays, there are ubiquitousness of GPS sensors in various devices
col...
Data inconsistency evaluating and repairing are major concerns in data
q...
The problem of hyperparameter optimization exists widely in the real lif...
In many fields, a mass of algorithms with completely different
hyperpara...
Due to the limitation on computational power of existing computers, the
...
Time series prediction with missing values is an important problem of ti...
In this paper we examined an algorithm for the All-k-Nearest-Neighbor pr...
In this paper we examined an algorithm for the All-k-Nearest-Neighbor pr...
In many applications, it is necessary to retrieve pairs of vertices with...
This paper proposes a new algorithm for reducing Approximate Nearest Nei...
For data pricing, data quality is a factor that must be considered. To k...
Current conditional functional dependencies (CFDs) discovery algorithms
...
As the fundamental phrase of collecting and analyzing data, data integra...
Recently, in the area of big data, some popular applications such as web...
Data quality plays a key role in big data management today. With the
exp...
Nowadays, sampling-based Approximate Query Processing (AQP) is widely
re...
The Influence Maximization (IM) problem aims at finding k seed vertices ...
Data quality issues have attracted widespread attention due to the negat...
Currently data explosion poses great challenges to approximate aggregati...
Wireless communication systems, such as wireless sensor networks and RFI...
Missing and incorrect values often cause serious consequences. To deal w...
Mining dense subgraphs on multi-layer graphs is an interesting problem, ...