AutoRC: Improving BERT Based Relation Classification Models via Architecture Search

09/22/2020
by   Wei Zhu, et al.
0

Although BERT based relation classification (RC) models have achieved significant improvements over the traditional deep learning models, it seems that no consensus can be reached on what is the optimal architecture. Firstly, there are multiple alternatives for entity span identification. Second, there are a collection of pooling operations to aggregate the representations of entities and contexts into fixed length vectors. Third, it is difficult to manually decide which feature vectors, including their interactions, are beneficial for classifying the relation types. In this work, we design a comprehensive search space for BERT based RC models and employ neural architecture search (NAS) method to automatically discover the design choices mentioned above. Experiments on seven benchmark RC tasks show that our method is efficient and effective in finding better architectures than the baseline BERT based RC model. Ablation study demonstrates the necessity of our search space design and the effectiveness of our search method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/27/2020

Task-Aware Neural Architecture Search

The design of handcrafted neural networks requires a lot of time and res...
research
06/01/2023

Training-free Neural Architecture Search for RNNs and Transformers

Neural architecture search (NAS) has allowed for the automatic creation ...
research
06/05/2020

AutoHAS: Differentiable Hyper-parameter and Architecture Search

Neural Architecture Search (NAS) has achieved significant progress in pu...
research
04/03/2020

Neural Architecture Generator Optimization

Neural Architecture Search (NAS) was first proposed to achieve state-of-...
research
08/15/2020

Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition

Transformer-based models have achieved stateof-the-art results in many t...
research
05/05/2020

Adaptive Interaction Modeling via Graph Operations Search

Interaction modeling is important for video action analysis. Recently, s...
research
10/20/2020

Optimal Subarchitecture Extraction For BERT

We extract an optimal subset of architectural parameters for the BERT ar...

Please sign up or login with your details

Forgot password? Click here to reset