Parameter-Efficient Neural Question Answering Models via Graph-Enriched Document Representations

06/01/2021
by Louis Castricato, et al.

As the computational footprint of modern NLP systems grows, it becomes increasingly important to arrive at more efficient models. We show that by employing a graph convolutional document representation, we can arrive at a question answering system that performs comparably to, and in some cases exceeds, the SOTA solutions, while using less than 5% of their resources in terms of trainable parameters. As it currently stands, a major issue in applying GCNs to NLP is document representation. In this paper, we show that a GCN-enriched document representation greatly improves the results seen on HotpotQA, even when using a trivial topology. Our model (gQA) performs admirably when compared to the current SOTA and requires little to no preprocessing. Shao et al. (2020) suggest that graph networks are not necessary for good performance in multi-hop QA. In this paper, we suggest that large language models are not necessary for good performance, by showing that a naïve implementation of a GCN performs comparably to SOTA models based on pretrained language models.
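
For readers unfamiliar with graph convolutions, the following is a minimal sketch of a single GCN layer applied to sentence-level document features. It is not the gQA implementation: the chain topology, the feature dimensions, the random inputs, and the helper names (`chain_adjacency`, `normalized_adjacency`, `gcn_layer`) are all assumptions made for illustration, and the propagation rule is the standard symmetric-normalized form from Kipf and Welling rather than anything confirmed by the abstract.

```python
import numpy as np


def chain_adjacency(n):
    """Hypothetical 'trivial' topology: each sentence node links to its neighbor."""
    adj = np.zeros((n, n))
    idx = np.arange(n - 1)
    adj[idx, idx + 1] = 1.0
    adj[idx + 1, idx] = 1.0
    return adj


def normalized_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2}, with self-loops added before normalizing."""
    a_hat = adj + np.eye(adj.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def gcn_layer(adj_norm, features, weight):
    """One graph convolution: ReLU(A_norm @ H @ W)."""
    return np.maximum(adj_norm @ features @ weight, 0.0)


# Illustrative sizes only; real sentence features would come from an encoder.
rng = np.random.default_rng(0)
num_sentences, in_dim, out_dim = 5, 8, 4
features = rng.normal(size=(num_sentences, in_dim))
weight = rng.normal(size=(in_dim, out_dim))

adj_norm = normalized_adjacency(chain_adjacency(num_sentences))
enriched = gcn_layer(adj_norm, features, weight)
print(enriched.shape)  # (5, 4)
```

The only trainable parameters in this layer are the entries of `weight` (here 8 × 4 = 32), which gives a sense of why a graph-convolutional component can stay small compared with a pretrained language model.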


