Document similarity measures can support semi-automated identification of unreported links between trial registrations and published trial articles

09/07/2017
by Adam G. Dunn, et al.

Objectives: Trial registries can be used to measure reporting biases and support systematic reviews but 45 [...] the article reporting on the trial. We evaluated the use of document similarity methods to identify unreported links between ClinicalTrials.gov and PubMed. Study Design and Setting: We extracted terms and concepts from a dataset of 72,469 ClinicalTrials.gov registrations and 276,307 PubMed articles, and tested methods for ranking articles across 16,005 reported links and 90 manually identified unreported links. Performance was measured by the median rank of matching articles, and by the proportion of unreported links that could be found by screening ranked candidate articles in order. Results: The best-performing concept-based representation produced a median rank of 3 (IQR 1-21) for reported links and 3 (IQR 1-19) for the manually identified unreported links, and term-based representations produced a median rank of 2 (IQR 1-20) for reported links and 2 (IQR 1-12) for unreported links. The matching article was ranked first for 40 [...] registration identified 86 [...] the growth in the corpus of reported links between ClinicalTrials.gov and PubMed, we found that document similarity methods can assist in the identification of unreported links between trial registrations and corresponding articles.
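
The evaluation described above (rank candidate articles by similarity to a registration, then report the median rank of the matching article and the proportion of links found by screening the top candidates) can be illustrated with a minimal sketch. It assumes a plain TF-IDF weighting with cosine similarity as a stand-in for a term-based representation; the abstract does not specify the weighting or similarity function, so this is not the authors' implementation. The function names, the toy texts, and the screening cut-off `k` are all hypothetical.

```python
import math
from collections import Counter
from statistics import median


def tokenize(text):
    # Lowercase, split on whitespace, keep purely alphabetic tokens (assumption).
    return [t for t in text.lower().split() if t.isalpha()]


def tfidf_vectors(docs):
    # Build sparse TF-IDF vectors (dicts) over one shared corpus.
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    df = Counter(term for toks in tokenized for term in set(toks))
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    return [{t: tf * idf[t] for t, tf in Counter(toks).items()}
            for toks in tokenized]


def cosine(u, v):
    # Cosine similarity between two sparse vectors.
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0


def rank_of_match(reg_vec, article_vecs, match_index):
    # 1-based rank of the known matching article; ties are resolved
    # optimistically (only strictly higher scores push the rank down).
    scores = [cosine(reg_vec, a) for a in article_vecs]
    target = scores[match_index]
    return 1 + sum(1 for s in scores if s > target)


def evaluate(registrations, articles, links, k):
    # links maps registration index -> index of its matching article.
    # Returns (median rank, proportion of links found within the top k).
    vecs = tfidf_vectors(registrations + articles)
    reg_vecs = vecs[:len(registrations)]
    art_vecs = vecs[len(registrations):]
    ranks = [rank_of_match(reg_vecs[r], art_vecs, a) for r, a in links.items()]
    found = sum(1 for r in ranks if r <= k) / len(ranks)
    return median(ranks), found


# Toy data, invented for illustration only.
registrations = [
    "metformin versus placebo in adults with type 2 diabetes",
    "statin therapy for prevention of stroke in elderly patients",
]
articles = [
    "a randomized trial of metformin in adults with type 2 diabetes",
    "effect of statin therapy on stroke prevention in elderly patients",
    "yoga for chronic low back pain in adults",
]
links = {0: 0, 1: 1}

median_rank, found_at_k = evaluate(registrations, articles, links, k=1)
print(median_rank, found_at_k)
```

A concept-based variant would follow the same structure, with the tokens replaced by concept identifiers extracted from each document.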

