Generative Context Pair Selection for Multi-hop Question Answering

04/18/2021
by   Dheeru Dua, et al.
5

Compositional reasoning tasks like multi-hop question answering, require making latent decisions to get the final answer, given a question. However, crowdsourced datasets often capture only a slice of the underlying task distribution, which can induce unanticipated biases in models performing compositional reasoning. Furthermore, discriminatively trained models exploit such biases to get a better held-out performance, without learning the right way to reason, as they do not necessitate paying attention to the question representation (conditioning variable) in its entirety, to estimate the answer likelihood. In this work, we propose a generative context selection model for multi-hop question answering that reasons about how the given question could have been generated given a context pair. While being comparable to the state-of-the-art answering performance, our proposed generative passage selection model has a better performance (4.9 adversarial held-out set which tests robustness of model's multi-hop reasoning capabilities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/17/2019

Avoiding Reasoning Shortcuts: Adversarial Evaluation, Training, and Model Development for Multi-Hop QA

Multi-hop question answering requires a model to connect multiple pieces...
research
01/15/2018

An Interpretable Reasoning Network for Multi-Relation Question Answering

Multi-relation Question Answering is a challenging task, due to the requ...
research
08/28/2023

Bayesian artificial brain with ChatGPT

This paper aims to investigate the mathematical problem-solving capabili...
research
10/27/2021

SQALER: Scaling Question Answering by Decoupling Multi-Hop and Logical Reasoning

State-of-the-art approaches to reasoning and question answering over kno...
research
04/14/2020

A Simple Yet Strong Pipeline for HotpotQA

State-of-the-art models for multi-hop question answering typically augme...
research
04/18/2022

StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts

Inferring spatial relations in natural language is a crucial ability an ...
research
04/28/2023

Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks

With the wide application of Large Language Models (LLMs) such as ChatGP...

Please sign up or login with your details

Forgot password? Click here to reset