Combining Self-Training and Self-Supervised Learning for Unsupervised Disfluency Detection

10/29/2020
by   Shaolei Wang, et al.
0

Most existing approaches to disfluency detection heavily rely on human-annotated corpora, which is expensive to obtain in practice. There have been several proposals to alleviate this issue with, for instance, self-supervised learning techniques, but they still require human-annotated corpora. In this work, we explore the unsupervised learning paradigm which can potentially work with unlabeled text corpora that are cheaper and easier to obtain. Our model builds upon the recent work on Noisy Student Training, a semi-supervised learning approach that extends the idea of self-training. Experimental results on the commonly used English Switchboard test set show that our approach achieves competitive performance compared to the previous state-of-the-art supervised systems using contextualized word embeddings (e.g. BERT and ELECTRA).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/15/2019

Multi-Task Self-Supervised Learning for Disfluency Detection

Most existing approaches to disfluency detection heavily rely on human-a...
research
12/05/2019

Self-Supervised Contextual Language Representation of Radiology Reports to Improve the Identification of Communication Urgency

Machine learning methods have recently achieved high-performance in biom...
research
03/11/2021

Self-supervised Text-to-SQL Learning with Header Alignment Training

Since we can leverage a large amount of unlabeled data without any human...
research
10/19/2017

Unsupervised Context-Sensitive Spelling Correction of English and Dutch Clinical Free-Text with Word and Character N-Gram Embeddings

We present an unsupervised context-sensitive spelling correction method ...
research
05/26/2023

Unsupervised Embedding Quality Evaluation

Unsupervised learning has recently significantly gained in popularity, e...
research
08/28/2019

Self-supervised blur detection from synthetically blurred scenes

Blur detection aims at segmenting the blurred areas of a given image. Re...
research
01/13/2016

Predicting the Effectiveness of Self-Training: Application to Sentiment Classification

The goal of this paper is to investigate the connection between the perf...

Please sign up or login with your details

Forgot password? Click here to reset