Previous methods for predicting room acoustic parameters and speech qual...
Deep learning models have become widely adopted in various domains, but ...
The recent ubiquitous adoption of remote conferencing has been accompani...
Since the mental states of the speaker modulate speech, stress introduce...
Estimating the quality of remote speech communication is a complex task
...
Voice Activity Detection (VAD) is a fundamental module in many audio
app...
Automatic speech quality assessment is essential for audio researchers,
...
Methods for extracting audio and speech features have been studied since...
The acoustic environment can degrade speech quality during communication...
As a neurophysiological response to threat or adverse conditions, stress...
This paper presents AC-VC (Almost Causal Voice Conversion), a phonetic
p...
The digital signal processing-based representations like the Mel-Frequen...
Recent developments in speech emotion recognition (SER) often leverage d...
Speech audio quality is subject to degradation caused by an acoustic
env...
Acoustic environment characterization opens doors for sound reproduction...
Music source separation represents the task of extracting all the instru...
This paper introduces FastVC, an end-to-end model for fast Voice Convers...
Advances of deep learning for Artificial Neural Networks(ANNs) have led ...
Recent advances in Voice Activity Detection (VAD) are driven by artifici...
A growing number of studies in the field of speech processing employ fea...
In particularly noisy environments, transient loud intrusions can comple...
Most current very low bit rate (VLBR) speech coding systems use hidden M...
Using phonological speech vocoding, we propose a platform for exploring
...
The speech signal conveys information on different time scales from shor...