SupMMD: A Sentence Importance Model for Extractive Summarization using Maximum Mean Discrepancy

10/06/2020
by   Umanga Bista, et al.
0

Most work on multi-document summarization has focused on generic summarization of information present in each individual document set. However, the under-explored setting of update summarization, where the goal is to identify the new information present in each set, is of equal practical interest (e.g., presenting readers with updates on an evolving news topic). In this work, we present SupMMD, a novel technique for generic and update summarization based on the maximum mean discrepancy from kernel two-sample testing. SupMMD combines both supervised learning for salience and unsupervised learning for coverage and diversity. Further, we adapt multiple kernel learning to make use of similarity across multiple information sources (e.g., text features and knowledge based concepts). We show the efficacy of SupMMD in both generic and update summarization tasks by meeting or exceeding the current state-of-the-art on the DUC-2004 and TAC-2009 datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/25/2020

Combining Word Embeddings and N-grams for Unsupervised Document Summarization

Graph-based extractive document summarization relies on the quality of t...
research
02/09/2020

Attend to the beginning: A study on using bidirectional attention for extractive summarization

Forum discussion data differ in both structure and properties from gener...
research
02/10/2023

PDSum: Prototype-driven Continuous Summarization of Evolving Multi-document Sets Stream

Summarizing text-rich documents has been long studied in the literature,...
research
04/18/2017

Extractive Summarization: Limits, Compression, Generalized Model and Heuristics

Due to its promise to alleviate information overload, text summarization...
research
03/26/2020

Belief Propagation for Maximum Coverage on Weighted Bipartite Graph and Application to Text Summarization

We study text summarization from the viewpoint of maximum coverage probl...
research
10/18/2020

Covapixels

We propose and discuss the summarization of superpixel-type image tiles/...
research
03/19/2021

Extractive Summarization of Call Transcripts

Text summarization is the process of extracting the most important infor...

Please sign up or login with your details

Forgot password? Click here to reset