THLNet: two-stage heterogeneous lightweight network for monaural speech enhancement

01/19/2023
by Feng Dang, et al.

In this paper, we propose a two-stage heterogeneous lightweight network for monaural speech enhancement. Specifically, we design a novel two-stage framework consisting of a coarse-grained full-band mask estimation stage and a fine-grained low-frequency refinement stage. Instead of a hand-designed real-valued filter, we use a novel learnable complex-valued rectangular bandwidth (LCRB) filter bank to extract compact features. Furthermore, considering the respective characteristics of the two stages, we use a heterogeneous structure: a U-shaped subnetwork as the backbone of CoarseNet and a single-scale subnetwork as the backbone of FineNet. We conduct experiments on the VoiceBank + DEMAND and DNS datasets to evaluate the proposed approach. The experimental results show that the proposed method outperforms the current state-of-the-art methods while maintaining a relatively small model size and low computational complexity.
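The two-stage idea can be pictured with a small toy sketch. Everything in it is an illustrative assumption rather than the method from the paper: the band layout is fixed and uniform (the LCRB filter bank is learnable and complex-valued), the coarse gains and the low-frequency residual are supplied by hand instead of being predicted by CoarseNet and FineNet, and all function names are hypothetical.

```python
# Toy sketch only: the uniform band layout, hand-set gains, and function names
# are illustrative assumptions, not the LCRB filter bank or the paper's networks.

def rectangular_bands(num_bins, num_bands):
    """Split num_bins frequency bins into contiguous, non-overlapping bands."""
    edges = [i * num_bins // num_bands for i in range(num_bands + 1)]
    return [(edges[i], edges[i + 1]) for i in range(num_bands)]

def compress(spectrum, bands):
    """Average the complex bins inside each band to get compact features."""
    return [sum(spectrum[lo:hi]) / (hi - lo) for lo, hi in bands]

def expand(band_values, bands, num_bins):
    """Copy each band value back to every bin that the band covers."""
    out = [0.0] * num_bins
    for value, (lo, hi) in zip(band_values, bands):
        for k in range(lo, hi):
            out[k] = value
    return out

def two_stage_enhance(noisy, bands, coarse_gains, fine_residual, low_cutoff):
    """Stage 1 applies a band-wise mask over the full band;
    stage 2 adds a correction to the bins below low_cutoff only."""
    mask = expand(coarse_gains, bands, len(noisy))
    coarse = [m * x for m, x in zip(mask, noisy)]
    refined = list(coarse)
    for k in range(low_cutoff):
        refined[k] = coarse[k] + fine_residual[k]
    return coarse, refined

# Usage: 8 complex bins grouped into 4 bands; refine the lowest 2 bins.
noisy = [complex(1, 1)] * 8
bands = rectangular_bands(8, 4)
features = compress(noisy, bands)  # stand-in for the compact network input
coarse, refined = two_stage_enhance(
    noisy, bands, [0.5, 1.0, 1.0, 0.0], [0.1] * 8, low_cutoff=2
)
```

In this sketch the band-level mask is cheap because it has one gain per band rather than one per bin, and the second stage touches only the low-frequency bins, which loosely mirrors the coarse-then-fine split described in the abstract.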

research
06/27/2022

A two-stage full-band speech enhancement model with effective spectral compression mapping

The direct expansion of deep neural network (DNN) based wide-band speech...
research
03/01/2022

DMF-Net: A decoupling-style multi-band fusion model for real-time full-band speech enhancement

Full-band speech enhancement based on deep neural networks is still chal...
research
04/27/2021

DPT-FSNet: Dual-path Transformer Based Full-band and Sub-band Fusion Network for Speech Enhancement

Recently, dual-path networks have achieved promising performance due to ...
research
06/01/2023

Harmonic enhancement using learnable comb filter for light-weight full-band speech enhancement model

With fewer feature dimensions, filter banks are often used in light-weig...
research
09/24/2022

Speech Enhancement with Perceptually-motivated Optimization and Dual Transformations

To address the monaural speech enhancement problem, numerous research st...
research
09/19/2018

New insights on the optimality of parameterized Wiener filters for speech enhancement applications

This work presents a unified framework for defining a family of noise re...
research
12/14/2020

Group Communication with Context Codec for Ultra-Lightweight Source Separation

Ultra-lightweight model design is an important topic for the deployment ...
