Homonym Detection in Curated Bibliographies: Learning from dblp's Experience (full version)

06/15/2018
by   Marcel R. Ackermann, et al.
0

Identifying (and fixing) homonymous and synonymous author profiles is one of the major tasks of curating personalized bibliographic metadata repositories like the dblp computer science bibliography. In this paper, we present and evaluate a machine learning approach to identify homonymous author bibliographies using a simple multilayer perceptron setup. We train our model on a novel gold-standard data set derived from the past years of active, manual curation at the dblp computer science bibliography.

READ FULL TEXT
research
04/28/2022

D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research

DBLP is the largest open-access repository of scientific articles on com...
research
10/09/2018

Is your Statement Purposeless? Predicting Computer Science Graduation Admission Acceptance based on Statement Of Purpose

We present a quantitative, data-driven machine learning approach to miti...
research
03/27/2020

Ontology Extraction and Usage in the Scholarly Knowledge Domain

Ontologies of research areas have been proven to be useful in many appli...
research
06/09/2018

A hybrid econometric-machine learning approach for relative importance analysis: Food inflation

A measure of relative importance of variables is often desired by resear...
research
03/07/2019

Hybrid Continued Fractions and n-adic algorithms, with applications to cryptography and "unimaginable' numbers

This paper continues the author's previous studies on continued fraction...
research
11/01/1991

Theory and practice

The author argues to Silicon Valley that the most important and powerful...
research
05/29/2019

Definitively Identifying an Inherent Limitation to Actual Cognition

A century ago, discoveries of a serious kind of logical error made separ...

Please sign up or login with your details

Forgot password? Click here to reset