Variational Information Bottleneck for Effective Low-resource Audio Classification

07/10/2021
by Shijing Si, et al.

Large-scale deep neural networks (DNNs) such as convolutional neural networks (CNNs) have achieved impressive performance in audio classification owing to their powerful capacity and strong generalization ability. However, a DNN trained on a low-resource task is prone to overfitting the small dataset and learning too much redundant information. To address this issue, we propose to use the variational information bottleneck (VIB) to mitigate overfitting and suppress irrelevant information. In this work, we conduct experiments on a 4-layer CNN. However, the VIB framework is ready-to-use and can easily be applied to many other state-of-the-art network architectures. Evaluation on a few audio datasets shows that our approach significantly outperforms baseline methods, improving classification accuracy by more than 5.0% in some low-resource settings.
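
To make the VIB idea concrete, here is a minimal NumPy sketch of a VIB-style training loss. It assumes the commonly used formulation (a Gaussian encoder, a standard normal prior, and a cross-entropy term plus a KL penalty weighted by beta), which may differ from the exact objective used in the paper. The function names, the linear classifier head (`W`, `b`), and the value of `beta` are hypothetical and chosen only for illustration; in practice `mu` and `log_var` would be produced by the CNN.

```python
import numpy as np

def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ), one value per example."""
    return 0.5 * np.sum(np.exp(log_var) + mu ** 2 - 1.0 - log_var, axis=1)

def sample_latent(mu, log_var, rng):
    """Reparameterization trick: z = mu + sigma * eps, with eps ~ N(0, I)."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def cross_entropy(logits, labels):
    """Per-example cross-entropy computed from a numerically stable log-softmax."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels]

def vib_loss(mu, log_var, W, b, labels, beta, rng):
    """Mean of (cross-entropy + beta * KL) over the batch.

    mu, log_var: encoder outputs of shape (batch, latent_dim) (assumed inputs).
    W, b: hypothetical linear classifier applied to the sampled latent code.
    beta: weight of the compression (KL) term, an assumed hyperparameter.
    """
    z = sample_latent(mu, log_var, rng)
    logits = z @ W + b
    ce = cross_entropy(logits, labels)
    kl = kl_to_standard_normal(mu, log_var)
    return float(np.mean(ce + beta * kl))
```

A larger `beta` pushes the latent code toward the prior and discards more input information, while `beta = 0` reduces the loss to ordinary cross-entropy on a noisy latent code.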


