Topological and Semantic Graph-based Author Disambiguation on DBLP Data in Neo4j

01/25/2019
by   Valentina Franzoni, et al.
0

In this work, we introduce a novel method for entity resolution author disambiguation in bibliographic networks. Such a method is based on a 2-steps network traversal using topological similarity measures for rating candidate nodes. Topological similarity is widely used in the Link Prediction application domain to assess the likelihood of an unknown link. A similarity function can be a good approximation for equality, therefore can be used to disambiguate, basing on the hypothesis that authors with many common co-authors are similar. Our method has experimented on a graph-based representation of the public DBLP Computer Science database. The results obtained are extremely encouraging regarding Precision, Accuracy, and Specificity. Further good aspects are the locality of the method for disambiguation assessment which avoids the need to know the global network, and the exploitation of only a few data, e.g. author name and paper title (i.e., co-authorship data).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/11/2023

A parameterised model for link prediction using node centrality and similarity measure based on graph embedding

Link prediction is a key aspect of graph machine learning, with applicat...
research
03/25/2021

A comparative analysis of local network similarity measurements: application to author citation networks

Understanding the evolution of paper and author citations is of paramoun...
research
02/20/2021

Persistence Homology for Link Prediction: An Interactive View

Link prediction is an important learning task for graph-structured data....
research
12/23/2019

Siamese Networks for Large-Scale Author Identification

Authorship attribution is the process of identifying the author of a tex...
research
09/15/2020

Does Link Prediction Help Detect Feature Interactions in Software Product Lines (SPLs)?

An ongoing challenge for the requirements engineering of software produc...
research
12/13/2018

Minuet: A method to solve Sudoku puzzles by hand

This paper presents a systematic method to solve difficult 9 x 9 Sudoku ...
research
04/14/2020

Author Name Disambiguation in Bibliographic Databases: A Survey

Entity resolution is a challenging and hot research area in the field of...

Please sign up or login with your details

Forgot password? Click here to reset