Optimization of a Real-Time Wavelet-Based Algorithm for Improving Speech Intelligibility

02/05/2022
by   Tianqu Kang, et al.
0

The optimization of a wavelet-based algorithm to improve speech intelligibility is reported. The discrete-time speech signal is split into frequency sub-bands via a multi-level discrete wavelet transform. Various gains are applied to the sub-band signals before they are recombined to form a modified version of the speech. The sub-band gains are adjusted while keeping the overall signal energy unchanged, and the speech intelligibility under various background interference and simulated hearing loss conditions is enhanced and evaluated objectively and quantitatively using Google Speech-to-Text transcription. For English and Chinese noise-free speech, overall intelligibility is improved, and the transcription accuracy can be increased by as much as 80 percentage points by reallocating the spectral energy toward the mid-frequency sub-bands, effectively increasing the consonant-vowel intensity ratio. This is reasonable since the consonants are relatively weak and of short duration, which are therefore the most likely to become indistinguishable in the presence of background noise or high-frequency hearing impairment. For speech already corrupted by noise, improving intelligibility is challenging but still realizable. The proposed algorithm is implementable for real-time signal processing and comparatively simpler than previous algorithms. Potential applications include speech enhancement, hearing aids, machine listening, and a better understanding of speech intelligibility.

READ FULL TEXT

page 6

page 8

page 9

research
03/01/2022

DMF-Net: A decoupling-style multi-band fusion model for real-time full-band speech enhancement

Full-band speech enhancement based on deep neural networks is still chal...
research
10/29/2020

FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement

This paper proposes a full-band and sub-band fusion model, named as Full...
research
11/16/2021

S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement

In speech enhancement, complex neural network has shown promising perfor...
research
06/27/2022

A two-stage full-band speech enhancement model with effective spectral compression mapping

The direct expansion of deep neural network (DNN) based wide-band speech...
research
02/20/2023

Real-Time Speech Enhancement Using Spectral Subtraction with Minimum Statistics and Spectral Floor

An initial real-time speech enhancement method is presented to reduce th...
research
04/20/2020

Multi-frequency-band tests for white noise under heteroskedasticity

This paper proposes a new family of multi-frequency-band (MFB) tests for...
research
09/24/2017

A Hybrid DSP/Deep Learning Approach to Real-Time Full-Band Speech Enhancement

Despite noise suppression being a mature area in signal processing, it r...

Please sign up or login with your details

Forgot password? Click here to reset