Exploring Neural Net Augmentation to BERT for Question Answering on SQUAD 2.0

08/04/2019
by Suhas Gupta, et al.

Enhancing the ability of machines to answer questions has been a topic of considerable focus in recent NLP research. Language models such as Embeddings from Language Models (ELMo) [1] and Bidirectional Encoder Representations from Transformers (BERT) [2] have proven very successful as general-purpose language models that can be optimized for a large number of downstream language tasks. In this work, we augment the pre-trained BERT language model with different output neural net architectures and compare their performance on the question answering task posed by the Stanford Question Answering Dataset 2.0 (SQUAD 2.0) [3]. We also fine-tune the pre-trained BERT model parameters to demonstrate its effectiveness in adapting to specialized language tasks. Our best output network is the contextualized CNN, which achieves F1 scores of 75.32 and 64.85 on the unanswerable and answerable question answering tasks, respectively.

research
11/14/2020

Utilizing Bidirectional Encoder Representations from Transformers for Answer Selection

Pre-training a transformer-based model for the language modeling task in...
research
11/14/2022

ALBERT with Knowledge Graph Encoder Utilizing Semantic Similarity for Commonsense Question Answering

Recently, pre-trained language representation models such as bidirection...
research
10/14/2019

Whatcha lookin' at? DeepLIFTing BERT's Attention in Question Answering

There has been great success recently in tackling challenging NLP tasks ...
research
10/16/2019

Unsupervised Question Answering for Fact-Checking

Recent Deep Learning (DL) models have succeeded in achieving human-level...
research
03/30/2020

NukeBERT: A Pre-trained language model for Low Resource Nuclear Domain

Significant advances have been made in recent years on Natural Language ...
research
09/17/2023

Performance of the Pre-Trained Large Language Model GPT-4 on Automated Short Answer Grading

Automated Short Answer Grading (ASAG) has been an active area of machine...
research
05/09/2023

Large Language Model Programs

In recent years, large pre-trained language models (LLMs) have demonstra...
