MaskEval: Weighted MLM-Based Evaluation for Text Summarization and Simplification

05/24/2022
by Yu Lu Liu, et al.

In text summarization and simplification, system outputs must be evaluated along multiple dimensions such as relevance, factual consistency, fluency, and grammaticality, and a wide range of possible outputs could be of high quality. These properties make the development of an adaptable, reference-less evaluation metric both necessary and challenging. We introduce MaskEval, a reference-less metric for text summarization and simplification that operates by performing masked language modeling (MLM) on the concatenation of the candidate and the source texts. It features an attention-like weighting mechanism to modulate the relative importance of each MLM step, which crucially allows MaskEval to be adapted to evaluate different quality dimensions. We demonstrate its effectiveness on English summarization and on multilingual text simplification in terms of correlations with human judgments.
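
The weighted-MLM idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how each MLM step is scored or how the weights are produced, so the exact-match scoring, the softmax normalization of the weights, the `[SEP]` and `[MASK]` tokens, and every function name below (`weighted_mlm_score`, `neighbor_copy_predictor`) are assumptions made for illustration. The `predict` argument stands in for a real masked language model.

```python
import math
from typing import Callable, List, Optional, Sequence

MASK = "[MASK]"  # assumed mask token
SEP = "[SEP]"    # assumed separator between candidate and source

Predictor = Callable[[List[str], int], Optional[str]]


def softmax(logits: Sequence[float]) -> List[float]:
    """Normalize raw weight logits so they are positive and sum to 1."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def weighted_mlm_score(
    candidate: Sequence[str],
    source: Sequence[str],
    predict: Predictor,
    weight_logits: Sequence[float],
) -> float:
    """Weighted average of per-step MLM success over candidate + source.

    One MLM step = mask one token of the concatenation and ask `predict`
    to recover it. Exact match is an assumed scoring rule.
    """
    tokens = list(candidate) + [SEP] + list(source)
    positions = [i for i, tok in enumerate(tokens) if tok != SEP]
    if len(weight_logits) != len(positions):
        raise ValueError("need one weight logit per maskable token")

    weights = softmax(weight_logits)
    score = 0.0
    for weight, pos in zip(weights, positions):
        masked = tokens[:pos] + [MASK] + tokens[pos + 1:]
        hit = 1.0 if predict(masked, pos) == tokens[pos] else 0.0
        score += weight * hit
    return score


def neighbor_copy_predictor(masked: List[str], pos: int) -> Optional[str]:
    """Toy stand-in for an MLM: copy the token that follows the same
    left neighbor elsewhere in the sequence, or give up with None."""
    left = masked[pos - 1] if pos > 0 else None
    for i in range(1, len(masked)):
        if i != pos and masked[i - 1] == left and masked[i] not in (MASK, SEP):
            return masked[i]
    return None


if __name__ == "__main__":
    candidate = ["cats", "sleep"]
    source = ["cats", "sleep", "all", "day"]
    # Uniform logits: every MLM step counts equally.
    uniform = [0.0] * (len(candidate) + len(source))
    print(weighted_mlm_score(candidate, source, neighbor_copy_predictor, uniform))
```

Changing `weight_logits` shifts which MLM steps dominate the score, which is the property the abstract credits for adapting the metric to different quality dimensions. In practice the weights would be learned rather than set by hand.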
