SmartMTD: A Graph-Based Approach for Effective Multi-Truth Discovery

by   Xiu Susie Fang, et al.

The Big Data era features a huge amount of data that are contributed by numerous sources and used by many critical data-driven applications. Due to the varying reliability of sources, it is common to see conflicts among the multi-source data, making it difficult to determine which data sources to trust. Recently, truth discovery has emerged as a means of addressing this challenging issue by determining data veracity jointly with estimating the reliability of data sources. A fundamental issue with current truth discovery methods is that they generally assume only one true value for each object, while in reality, objects may have multiple true values. In this paper, we propose a graph-based approach, called SmartMTD, to unravel the truth discovery problem beyond the single-truth assumption, or the multi-truth discovery problem. SmartMTD models and quantifies two types of source relations to estimate source reliability precisely and to detect malicious agreement among sources for effective multi-truth discovery. In particular, two graphs are constructed based on the modeled source relations. They are further used to derive the two aspects of source reliability (i.e., positive precision and negative precision) via random walk computation. Empirical studies on two large real-world datasets demonstrate the effectiveness of our approach.


page 1

page 2

page 3

page 4


From Appearance to Essence: Comparing Truth Discovery Methods without Using Ground Truth

Truth discovery has been widely studied in recent years as a fundamental...

Truth Discovery with Memory Network

Truth discovery is to resolve conflicts and find the truth from multiple...

Empirical Bayes approach to Truth Discovery problems

When aggregating information from conflicting sources, one's goal is to ...

Combining Restricted Boltzmann Machines with Neural Networks for Latent Truth Discovery

Latent truth discovery, LTD for short, refers to the problem of aggregat...

MaskLink: Efficient Link Discovery for Spatial Relations via Masking Areas

In this paper, we study the problem of spatial link discovery (LD), focu...

Exploiting Source-Object Network to Resolve Object Conflicts in Linked Data

Considerable effort has been made to increase the scale of Linked Data. ...

A Reliability Theory of Truth

Our approach is basically a coherence approach, but we avoid the well-kn...

Please sign up or login with your details

Forgot password? Click here to reset