Rationales for Sequential Predictions

09/14/2021
by   Keyon Vafa, et al.
14

Sequence models are a critical component of modern NLP systems, but their predictions are difficult to explain. We consider model explanations though rationales, subsets of context that can explain individual model predictions. We find sequential rationales by solving a combinatorial optimization: the best rationale is the smallest subset of input tokens that would predict the same output as the full sequence. Enumerating all subsets is intractable, so we propose an efficient greedy algorithm to approximate this objective. The algorithm, which is called greedy rationalization, applies to any model. For this approach to be effective, the model should form compatible conditional distributions when making predictions on incomplete subsets of the context. This condition can be enforced with a short fine-tuning step. We study greedy rationalization on language modeling and machine translation. Compared to existing baselines, greedy rationalization is best at optimizing the combinatorial objective and provides the most faithful rationales. On a new dataset of annotated sequential rationales, greedy rationales are most similar to human rationales.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/09/2023

Greedy Heuristics and Linear Relaxations for the Random Hitting Set Problem

Consider the Hitting Set problem where, for a given universe 𝒳 = { 1, .....
research
10/31/2022

Effective Cross-Task Transfer Learning for Explainable Natural Language Inference with T5

We compare sequential fine-tuning with a model for multi-task learning i...
research
05/18/2019

On greedy heuristics for computing D-efficient saturated subsets

Let F be a set consisting of n real vectors of dimension m ≤ n. For any ...
research
05/17/2020

Robust subset selection

The best subset selection (or "best subsets") estimator is a classic too...
research
04/28/2022

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers

Randomly masking and predicting word tokens has been a successful approa...
research
03/22/2018

Attention Solves Your TSP

We propose a framework for solving combinatorial optimization problems o...
research
07/04/2012

Efficient Test Selection in Active Diagnosis via Entropy Approximation

We consider the problem of diagnosing faults in a system represented by ...

Please sign up or login with your details

Forgot password? Click here to reset