DBNet: A Dual-branch Network Architecture Processing on Spectrum and Waveform for Single-channel Speech Enhancement

05/06/2021
by   Kanghao Zhang, et al.
0

In real acoustic environment, speech enhancement is an arduous task to improve the quality and intelligibility of speech interfered by background noise and reverberation. Over the past years, deep learning has shown great potential on speech enhancement. In this paper, we propose a novel real-time framework called DBNet which is a dual-branch structure with alternate interconnection. Each branch incorporates an encoder-decoder architecture with skip connections. The two branches are responsible for spectrum and waveform modeling, respectively. A bridge layer is adopted to exchange information between the two branches. Systematic evaluation and comparison show that the proposed system substantially outperforms related algorithms under very challenging environments. And in INTERSPEECH 2021 Deep Noise Suppression (DNS) challenge, the proposed system ranks the top 8 in real-time track 1 in terms of the Mean Opinion Score (MOS) of the ITU-T P.835 framework.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/05/2019

Speech Enhancement via Deep Spectrum Image Translation Network

Quality and intelligibility of speech signals are degraded under additiv...
research
09/19/2023

PDPCRN: Parallel Dual-Path CRN with Bi-directional Inter-Branch Interactions for Multi-Channel Speech Enhancement

Multi-channel speech enhancement seeks to utilize spatial information to...
research
06/23/2020

Real Time Speech Enhancement in the Waveform Domain

We present a causal speech enhancement model working on the raw waveform...
research
10/12/2021

Foster Strengths and Circumvent Weaknesses: a Speech Enhancement Framework with Two-branch Collaborative Learning

Recent single-channel speech enhancement methods usually convert wavefor...
research
06/27/2022

ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech Enhancement

We present ClearBuds, the first hardware and software system that utiliz...
research
05/15/2020

Dual-Signal Transformation LSTM Network for Real-Time Noise Suppression

This paper introduces a dual-signal transformation LSTM network (DTLN) f...
research
04/05/2021

Real-time Streaming Wave-U-Net with Temporal Convolutions for Multichannel Speech Enhancement

In this paper, we describe the work that we have done to participate in ...

Please sign up or login with your details

Forgot password? Click here to reset