On Improving Informativity and Grammaticality for Multi-Sentence Compression

05/07/2016
by   Elaheh ShafieiBavani, et al.

Multi-Sentence Compression (MSC) is of great value to many real-world applications, such as guided microblog summarization, opinion summarization and newswire summarization. Recently, word graph-based approaches have been proposed and have become popular in MSC. Their key assumption is that redundancy among a set of related sentences provides a reliable way to generate informative and grammatical sentences. In this paper, we propose an effective approach that enhances word graph-based MSC and tackles an issue confronting most state-of-the-art MSC approaches: improving informativity and grammaticality at the same time. Our approach consists of three main components: (1) a merging method based on Multiword Expressions (MWE); (2) a mapping strategy based on synonymy between words; (3) a re-ranking step that uses a POS-based language model (POS-LM) to identify the best compression candidates. We demonstrate the effectiveness of this novel approach using a dataset of clusters of English newswire sentences. The observed improvements in informativity and grammaticality of the generated compressions show that our approach is superior to state-of-the-art MSC methods.
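
To make the word-graph assumption concrete, here is a minimal Python sketch of the generic idea the abstract builds on: related sentences are merged into one graph, and transitions shared by several sentences become cheap to traverse. It is not the authors' method. It omits the MWE merging, the synonym mapping and the POS-LM re-ranking, and the function names, the `1 / count` edge weight and the merging of tokens by lowercase form alone are assumptions made for illustration.

```python
import heapq
from collections import defaultdict

START, END = "<s>", "</s>"  # hypothetical sentence-boundary markers


def build_word_graph(sentences):
    """Merge identical lowercase tokens into shared nodes.

    Returns a dict of dicts: graph[a][b] is the cost of the edge a -> b,
    set to 1 / (number of sentences containing that transition), so
    redundant transitions are cheaper. Merging by surface form only is a
    simplification assumed here.
    """
    edge_counts = defaultdict(int)
    for sentence in sentences:
        tokens = [START] + sentence.lower().split() + [END]
        for a, b in zip(tokens, tokens[1:]):
            edge_counts[(a, b)] += 1

    graph = defaultdict(dict)
    for (a, b), count in edge_counts.items():
        graph[a][b] = 1.0 / count
    return graph


def shortest_compression(graph):
    """Return the tokens on the cheapest START -> END path (Dijkstra)."""
    queue = [(0.0, [START])]
    settled = set()
    while queue:
        cost, path = heapq.heappop(queue)
        node = path[-1]
        if node == END:
            return path[1:-1]  # drop the boundary markers
        if node in settled:
            continue
        settled.add(node)
        for nxt, weight in graph[node].items():
            heapq.heappush(queue, (cost + weight, path + [nxt]))
    return []


cluster = [
    "obama visited paris on monday",
    "president obama visited paris",
    "obama visited paris to meet officials",
]
word_graph = build_word_graph(cluster)
print(" ".join(shortest_compression(word_graph)))
```

The cheapest path favours transitions that recur across the cluster, which is the redundancy assumption in its simplest form. A path chosen this way can be short but uninformative or ungrammatical, which is the gap the three components listed in the abstract are meant to address.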


research
03/29/2021

Centrality Meets Centroid: A Graph-based Approach for Unsupervised Document Summarization

Unsupervised document summarization has re-acquired lots of attention in...
research
04/25/2020

Combining Word Embeddings and N-grams for Unsupervised Document Summarization

Graph-based extractive document summarization relies on the quality of t...
research
04/09/2020

A Multilingual Study of Multi-Sentence Compression using Word Vertex-Labeled Graphs and Integer Linear Programming

Multi-Sentence Compression (MSC) aims to generate a short sentence with ...
research
05/14/2018

Unsupervised Abstractive Meeting Summarization with Multi-Sentence Compression and Budgeted Submodular Maximization

We introduce a novel graph-based framework for abstractive meeting speec...
research
02/27/2019

An Editorial Network for Enhanced Document Summarization

We suggest a new idea of Editorial Network - a mixed extractive-abstract...
research
05/16/2018

A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss

We propose a unified model combining the strength of extractive and abst...
research
04/20/2018

Graph-based Hypothesis Generation for Parallax-tolerant Image Stitching

The seam-driven approach has been proven fairly effective for parallax-t...
