FedQAS: Privacy-aware machine reading comprehension with federated learning

02/09/2022
by   Addi Ait-Mlouk, et al.
0

Machine reading comprehension (MRC) of text data is one important task in Natural Language Understanding. It is a complex NLP problem with a lot of ongoing research fueled by the release of the Stanford Question Answering Dataset (SQuAD) and Conversational Question Answering (CoQA). It is considered to be an effort to teach computers how to "understand" a text, and then to be able to answer questions about it using deep learning. However, until now large-scale training on private text data and knowledge sharing has been missing for this NLP task. Hence, we present FedQAS, a privacy-preserving machine reading system capable of leveraging large-scale private data without the need to pool those datasets in a central location. The proposed approach combines transformer models and federated learning technologies. The system is developed using the FEDn framework and deployed as a proof-of-concept alliance initiative. FedQAS is flexible, language-agnostic, and allows intuitive participation and execution of local model training. In addition, we present the architecture and implementation of the system, as well as provide a reference evaluation based on the SQUAD dataset, to showcase how it overcomes data privacy issues and enables knowledge sharing between alliance members in a Federated learning setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/28/2018

Medical Exam Question Answering with Large-scale Reading Comprehension

Reading and understanding text is one important component in computer ai...
research
03/22/2022

VLSP 2021 Shared Task: Vietnamese Machine Reading Comprehension

One of the emerging research trends in natural language understanding is...
research
04/04/2023

FedBot: Enhancing Privacy in Chatbots with Federated Learning

Chatbots are mainly data-driven and usually based on utterances that mig...
research
01/27/2021

VisualMRC: Machine Reading Comprehension on Document Images

Recent studies on machine reading comprehension have focused on text-lev...
research
09/06/2018

Dual Ask-Answer Network for Machine Reading Comprehension

There are three modalities in the reading comprehension setting: questio...
research
06/01/2020

Conversational Machine Comprehension: a Literature Review

Conversational Machine Comprehension (CMC) is a research track in conver...
research
06/28/2022

The NLP Sandbox: an efficient model-to-data system to enable federated and unbiased evaluation of clinical NLP models

Objective The evaluation of natural language processing (NLP) models for...

Please sign up or login with your details

Forgot password? Click here to reset