N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System

11/16/2019
by   Shuo Liu, et al.
0

N-HANS is a Python toolkit for in-the-wild audio enhancement, including speech, music, and general audio denoising, separation, and selective noise or source suppression. The functionalities are realised based on two neural network models sharing the same architecture, but trained separately. The models are comprised of stacks of residual blocks, each conditioned on additional speech or environmental noise recordings for adapting to different unseen speakers or environments in real life. In addition to a Python API, a command line interface is provided to researchers and developers, both of which are documented at https://github.com/N-HANS/N-HANS. Experimental results indicate that N-HANS achieves outstanding performance, and ensure its reliable usage in real-life audio and speech-related tasks, reaching very high audio and speech quality.

READ FULL TEXT
research
12/12/2017

auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks

auDeep is a Python toolkit for deep unsupervised representation learning...
research
08/24/2023

Exploiting Time-Frequency Conformers for Music Audio Enhancement

With the proliferation of video platforms on the internet, recording mus...
research
01/13/2020

Two Channel Audio Zooming System For Smartphone

In this paper, two microphone based systems for audio zooming is propose...
research
12/26/2021

Bilingual Speech Recognition by Estimating Speaker Geometry from Video Data

Speech recognition is very challenging in student learning environments ...
research
10/17/2022

TorchDIVA: An Extensible Computational Model of Speech Production built on an Open-Source Machine Learning Library

The DIVA model is a computational model of speech motor control that com...
research
01/04/2023

Audio-Visual Efficient Conformer for Robust Speech Recognition

End-to-end Automatic Speech Recognition (ASR) systems based on neural ne...
research
11/02/2022

SpectroMap: Peak detection algorithm for audio fingerprinting

We present SpectroMap, an open source GitHub repository for audio finger...

Please sign up or login with your details

Forgot password? Click here to reset