Transfer Learning for Information Extraction with Limited Data

03/06/2020
by   Minh-Tien Nguyen, et al.
0

This paper presents a practical approach to fine-grained information extraction. Through plenty of experiences of authors in practically applying information extraction to business process automation, there can be found a couple of fundamental technical challenges: (i) the availability of labeled data is usually limited and (ii) highly detailed classification is required. The main idea of our proposal is to leverage the concept of transfer learning, which is to reuse the pre-trained model of deep neural networks, with a combination of common statistical classifiers to determine the class of each extracted term. To do that, we first exploit BERT to deal with the limitation of training data in real scenarios, then stack BERT with Convolutional Neural Networks to learn hidden representation for classification. To validate our approach, we applied our model to an actual case of document processing, which is a process of competitive bids for government projects in Japan. We used 100 documents for training and testing and confirmed that the model enables to extract fine-grained named entities with a detailed level of information preciseness specialized in the targeted business process, such as a department name of application receivers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/12/2019

Pay Attention to Convolution Filters: Towards Fast and Accurate Fine-Grained Transfer Learning

We propose an efficient transfer learning method for adapting ImageNet p...
research
06/08/2021

Speech BERT Embedding For Improving Prosody in Neural TTS

This paper presents a speech BERT model to extract embedded prosody info...
research
01/19/2022

A Review of Deep Transfer Learning and Recent Advancements

A successful deep learning model is dependent on extensive training data...
research
12/16/2021

An Empirical Study on Transfer Learning for Privilege Review

Protecting privileged communications and data from inadvertent disclosur...
research
03/30/2021

An In-depth Analysis of Passage-Level Label Transfer for Contextual Document Ranking

Recently introduced pre-trained contextualized autoregressive models lik...
research
02/25/2021

PharmKE: Knowledge Extraction Platform for Pharmaceutical Texts using Transfer Learning

The challenge of recognizing named entities in a given text has been a v...
research
02/28/2018

Autonomous Reconfiguration Procedures for EJB-based Enterprise Applications

Enterprise Applications (EA) are complex software systems for supporting...

Please sign up or login with your details

Forgot password? Click here to reset