WikiCheck: An end-to-end open source Automatic Fact-Checking API based on Wikipedia

09/02/2021
by   Mykola Trokhymovych, et al.
0

With the growth of fake news and disinformation, the NLP community has been working to assist humans in fact-checking. However, most academic research has focused on model accuracy without paying attention to resource efficiency, which is crucial in real-life scenarios. In this work, we review the State-of-the-Art datasets and solutions for Automatic Fact-checking and test their applicability in production environments. We discover overfitting issues in those models, and we propose a data filtering method that improves the model's performance and generalization. Then, we design an unsupervised fine-tuning of the Masked Language models to improve its accuracy working with Wikipedia. We also propose a novel query enhancing method to improve evidence discovery using the Wikipedia Search API. Finally, we present a new fact-checking system, the WikiCheck API that automatically performs a facts validation process based on the Wikipedia knowledge base. It is comparable to SOTA solutions in terms of accuracy and can be used on low-memory CPU instances.

READ FULL TEXT
research
06/20/2018

The Rise of Guardians: Fact-checking URL Recommendation to Combat Fake News

A large body of research work and efforts have been focused on detecting...
research
08/28/2023

Helping Fact-Checkers Identify Fake News Stories Shared through Images on WhatsApp

WhatsApp has introduced a novel avenue for smartphone users to engage wi...
research
05/12/2023

aedFaCT: Scientific Fact-Checking Made Easier via Semi-Automatic Discovery of Relevant Expert Opinions

In this highly digitised world, fake news is a challenging problem that ...
research
05/24/2023

Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models

Fact-checking is an essential task in NLP that is commonly utilized for ...
research
04/14/2022

Automatic Fake News Detection: Are current models "fact-checking" or "gut-checking"?

Automatic fake news detection models are ostensibly based on logic, wher...
research
05/30/2019

Assessing The Factual Accuracy of Generated Text

We propose a model-based metric to estimate the factual accuracy of gene...
research
09/03/2018

Belittling the Source: Trustworthiness Indicators to Obfuscate Fake News on the Web

With the growth of the internet, the number of fake-news online has been...

Please sign up or login with your details

Forgot password? Click here to reset