Large-Scale Domain Adaptation via Teacher-Student Learning

08/17/2017
by   Jinyu Li, et al.
0

High accuracy speech recognition requires a large amount of transcribed data for supervised training. In the absence of such data, domain adaptation of a well-trained acoustic model can be performed, but even here, high accuracy usually requires significant labeled data from the target domain. In this work, we propose an approach to domain adaptation that does not require transcriptions but instead uses a corpus of unlabeled parallel data, consisting of pairs of samples from the source domain of the well-trained model and the desired target domain. To perform adaptation, we employ teacher/student (T/S) learning, in which the posterior probabilities generated by the source-domain model can be used in lieu of labels to train the target-domain model. We evaluate the proposed approach in two scenarios, adapting a clean acoustic model to noisy speech and adapting an adults speech acoustic model to children speech. Significant improvements in accuracy are obtained, with reductions in word error rate of up to 44 for transcribed data in the target domain. Moreover, we show that increasing the amount of unlabeled data results in additional model robustness, which is particularly beneficial when using simulated training data in the target-domain.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/19/2019

Butterfly: Robust One-step Approach towards Wildly-unsupervised Domain Adaptation

Unsupervised domain adaptation (UDA) trains with clean labeled data in s...
research
03/31/2021

PhyAug: Physics-Directed Data Augmentation for Deep Sensing Model Transfer in Cyber-Physical Systems

Run-time domain shifts from training-phase domains are common in sensing...
research
04/25/2020

L-Vector: Neural Label Embedding for Domain Adaptation

We propose a novel neural label embedding (NLE) scheme for the domain ad...
research
08/02/2021

Domain Adaptation for Autoencoder-Based End-to-End Communication Over Wireless Channels

The problem of domain adaptation conventionally considers the setting wh...
research
03/17/2020

Teacher-Student Domain Adaptation for Biosensor Models

We present an approach to domain adaptation, addressing the case where d...
research
04/02/2019

Lessons from Building Acoustic Models with a Million Hours of Speech

This is a report of our lessons learned building acoustic models from 1 ...
research
03/16/2023

Focus on Your Target: A Dual Teacher-Student Framework for Domain-adaptive Semantic Segmentation

We study unsupervised domain adaptation (UDA) for semantic segmentation....

Please sign up or login with your details

Forgot password? Click here to reset