A Cross-Domain Transferable Neural Coherence Model

05/28/2019
by   Peng Xu, et al.
0

Coherence is an important aspect of text quality and is crucial for ensuring its readability. One important limitation of existing coherence models is that training on one domain does not easily generalize to unseen categories of text. Previous work advocates for generative models for cross-domain generalization, because for discriminative models, the space of incoherent sentence orderings to discriminate against during training is prohibitively large. In this work, we propose a local discriminative neural model with a much smaller negative sampling space that can efficiently learn against incorrect orderings. The proposed coherence model is simple in structure, yet it significantly outperforms previous state-of-art methods on a standard benchmark dataset on the Wall Street Journal corpus, as well as in multiple new challenging settings of transfer to unseen categories of discourse on Wikipedia articles.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/05/2016

Neural Net Models for Open-Domain Discourse Coherence

Discourse coherence is strongly associated with text quality, making it ...
research
06/06/2022

Discriminative Models Can Still Outperform Generative Models in Aspect Based Sentiment Analysis

Aspect-based Sentiment Analysis (ABSA) helps to explain customers' opini...
research
04/27/2023

Cross-Domain Evaluation of POS Taggers: From Wall Street Journal to Fandom Wiki

The Wall Street Journal section of the Penn Treebank has been the de-fac...
research
11/14/2018

Modeling Coherence for Discourse Neural Machine Translation

Discourse coherence plays an important role in the translation of one te...
research
09/05/2021

Transformer Models for Text Coherence Assessment

Coherence is an important aspect of text quality and is crucial for ensu...
research
03/18/2021

Evaluating Document Coherence Modelling

While pretrained language models ("LM") have driven impressive gains ove...
research
12/20/2022

CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Data Limitation With Contrastive Learning

Machine-Generated Text (MGT) detection, a task that discriminates MGT fr...

Please sign up or login with your details

Forgot password? Click here to reset