Semi-supervised Relation Extraction via Data Augmentation and Consistency-training

06/16/2023
by   Komal K. Teru, et al.
0

Due to the semantic complexity of the Relation extraction (RE) task, obtaining high-quality human labelled data is an expensive and noisy process. To improve the sample efficiency of the models, semi-supervised learning (SSL) methods aim to leverage unlabelled data in addition to learning from limited labelled data points. Recently, strong data augmentation combined with consistency-based semi-supervised learning methods have advanced the state of the art in several SSL tasks. However, adapting these methods to the RE task has been challenging due to the difficulty of data augmentation for RE. In this work, we leverage the recent advances in controlled text generation to perform high quality data augmentation for the RE task. We further introduce small but significant changes to model architecture that allows for generation of more training data by interpolating different data points in their latent space. These data augmentations along with consistency training result in very competitive results for semi-supervised relation extraction on four benchmark datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/18/2021

On Data-Augmentation and Consistency-Based Semi-Supervised Learning

Recently proposed consistency-based Semi-Supervised Learning (SSL) metho...
research
03/05/2017

Using Graphs of Classifiers to Impose Declarative Constraints on Semi-supervised Learning

We propose a general approach to modeling semi-supervised learning (SSL)...
research
04/24/2018

Semi-Supervised Learning with Declaratively Specified Entropy Constraints

We propose a technique for declaratively specifying strategies for semi-...
research
09/09/2019

Nearly-Unsupervised Hashcode Representations for Relation Extraction

Recently, kernelized locality sensitive hashcodes have been successfully...
research
08/21/2017

Scientific Information Extraction with Semi-supervised Neural Tagging

This paper addresses the problem of extracting keyphrases from scientifi...
research
10/05/2021

Deep Subspace analysing for Semi-Supervised multi-label classification of Diabetic Foot Ulcer

Diabetes is a global raising pandemic. Diabetes patients are at risk of ...

Please sign up or login with your details

Forgot password? Click here to reset