Efficient dynamic filter for robust and low computational feature extraction

05/03/2022
by   Donghyeon Kim, et al.
0

Unseen noise signal which is not considered in a model training process is difficult to anticipate and would lead to performance degradation. Various methods have been investigated to mitigate unseen noise. In our previous work, an Instance-level Dynamic Filter (IDF) and a Pixel Dynamic Filter (PDF) were proposed to extract noise-robust features. However, the performance of the dynamic filter might be degraded since simple feature pooling is used to reduce the computational resource in the IDF part. In this paper, we propose an efficient dynamic filter to enhance the performance of the dynamic filter. Instead of utilizing the simple feature mean, we separate Time-Frequency (T-F) features as non-overlapping chunks, and separable convolutions are carried out for each feature direction (inter chunks and intra chunks). Additionally, we propose Dynamic Attention Pooling that maps high dimensional features as low dimensional feature embeddings. These methods are applied to the IDF for keyword spotting and speaker verification tasks. We confirm that our proposed method performs better in unseen environments (unseen noise and unseen speakers) than state-of-the-art models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/20/2022

Discriminatory and orthogonal feature learning for noise robust keyword spotting

Keyword Spotting (KWS) is an essential component in a smart device for a...
research
01/13/2023

DINF: Dynamic Instance Noise Filter for Occluded Pedestrian Detection

Occlusion issue is the biggest challenge in pedestrian detection. RCNN-b...
research
03/20/2023

Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification

The time-delay neural network (TDNN) is one of the state-of-the-art mode...
research
11/05/2019

ROI Pooled Correlation Filters for Visual Tracking

The ROI (region-of-interest) based pooling method performs pooling opera...
research
08/17/2021

Adaptive Convolutions with Per-pixel Dynamic Filter Atom

Applying feature dependent network weights have been proved to be effect...
research
10/11/2021

Multi-query multi-head attention pooling and Inter-topK penalty for speaker verification

This paper describes the multi-query multi-head attention (MQMHA) poolin...
research
09/14/2022

I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization

Noise robustness in keyword spotting remains a challenge as many models ...

Please sign up or login with your details

Forgot password? Click here to reset