TLDR: Extreme Summarization of Scientific Documents

04/30/2020
by   Isabel Cachola, et al.
0

We introduce TLDR generation for scientific papers, a new automatic summarization task with high source compression requiring expert background knowledge and complex language understanding. To facilitate research on this task, we introduce SciTLDR, a dataset of 3.9K TLDRs. Furthermore, we introduce a novel annotation protocol for scalably curating additional gold summaries by rewriting peer review comments. We use this protocol to augment our test set, yielding multiple gold TLDRs for evaluation, which is unlike most recent summarization datasets that assume only one valid gold summary. We present a training strategy for adapting pretrained language models that exploits similarities between TLDR generation and the related tasks of extreme summarization and title generation, which outperforms strong extractive and abstractive summarization baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/04/2020

Exploring Content Selection in Summarization of Novel Chapters

We present a new summarization task, generating summaries of novel chapt...
research
09/22/2021

MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News Summarization

One of the most challenging aspects of current single-document news summ...
research
09/01/2020

SuperPAL: Supervised Proposition ALignment for Multi-Document Summarization and Derivative Sub-Tasks

Multi-document summarization (MDS) is a challenging task, often decompos...
research
06/01/2019

Efficient Adaptation of Pretrained Transformers for Abstractive Summarization

Large-scale learning of transformer language models has yielded improvem...
research
05/12/2022

CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation

Scientific extreme summarization (TLDR) aims to form ultra-short summari...
research
08/15/2017

Gold Standard Online Debates Summaries and First Experiments Towards Automatic Summarization of Online Debate Data

Usage of online textual media is steadily increasing. Daily, more and mo...
research
11/06/2020

What's New? Summarizing Contributions in Scientific Literature

With thousands of academic articles shared on a daily basis, it has beco...

Please sign up or login with your details

Forgot password? Click here to reset