ScisummNet: A Large Annotated Dataset and Content-Impact Models for Scientific Paper Summarization with Citation Networks

09/04/2019
by   Michihiro Yasunaga, et al.
0

Scientific article summarization is challenging: large, annotated corpora are not available, and the summary should ideally include the article's impacts on research community. This paper provides novel solutions to these two challenges. We 1) develop and release the first large-scale manually-annotated corpus for scientific papers (on computational linguistics) by enabling faster annotation, and 2) propose summarization methods that integrate the authors' original highlights (abstract) and the article's actual impacts on the community (citations), to create comprehensive, hybrid summaries. We conduct experiments to demonstrate the efficacy of our corpus in training data-driven models for scientific paper summarization and the advantage of our hybrid summaries over abstracts and traditional citation-based summaries. Our large annotated corpus and hybrid methods provide a new framework for scientific paper summarization research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/04/2019

ScisummNet: A Large Annotated Corpus and Content-Impact Models for Scientific Paper Summarization with Citation Networks

Scientific article summarization is challenging: large, annotated corpor...
research
04/21/2017

Scientific Article Summarization Using Citation-Context and Article's Discourse Structure

We propose a summarization approach for scientific articles which takes ...
research
01/04/2023

A comprehensive review of automatic text summarization techniques: method, data, evaluation and coding

We provide a literature review about Automatic Text Summarization (ATS) ...
research
01/31/2020

Approximate Summaries for Why and Why-not Provenance (Extended Version)

Why and why-not provenance have been studied extensively in recent years...
research
11/13/2019

Towards Supervised Extractive Text Summarization via RNN-based Sequence Classification

This article briefly explains our submitted approach to the DocEng'19 co...
research
05/12/2022

CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation

Scientific extreme summarization (TLDR) aims to form ultra-short summari...
research
06/20/2023

QuOTeS: Query-Oriented Technical Summarization

Abstract. When writing an academic paper, researchers often spend consid...

Please sign up or login with your details

Forgot password? Click here to reset