b'Milos Cernak'

research

∙ 09/21/2023

Multi-Channel MOSRA: Mean Opinion Score and Room Acoustics Estimation Using Simulated Data and a Teacher Model

Previous methods for predicting room acoustic parameters and speech qual...

0 Jozef Coldenhoff, et al. ∙

research

∙ 09/21/2023

Cluster-based pruning techniques for audio data

Deep learning models have become widely adopted in various domains, but ...

0 Boris Bergsma, et al. ∙

research

∙ 09/05/2023

In-Ear-Voice: Towards Milli-Watt Audio Enhancement With Bone-Conduction Microphones for In-Ear Sensing Platforms

The recent ubiquitous adoption of remote conferencing has been accompani...

0 Philipp Schilk, et al. ∙

research

∙ 06/09/2023

Speaker Embeddings as Individuality Proxy for Voice Stress Detection

Since the mental states of the speaker modulate speech, stress introduce...

0 Zihan Wu, et al. ∙

research

∙ 03/01/2023

Personalized Task Load Prediction in Speech Communication

Estimating the quality of remote speech communication is a complex task ...

0 Robert P. Spang, et al. ∙

research

∙ 12/06/2022

BC-VAD: A Robust Bone Conduction Voice Activity Detection

Voice Activity Detection (VAD) is a fundamental module in many audio app...

0 Niccolo' Polvani, et al. ∙

research

∙ 11/12/2022

Efficient Speech Quality Assessment using Self-supervised Framewise Embeddings

Automatic speech quality assessment is essential for audio researchers, ...

0 Karl El Hajal, et al. ∙

research

∙ 06/24/2022

BYOL-S: Learning Self-supervised Speech Representations by Bootstrapping

Methods for extracting audio and speech features have been studied since...

0 Gasser Elbanna, et al. ∙

research

∙ 04/04/2022

MOSRA: Joint Mean Opinion Score and Room Acoustics Speech Quality Assessment

The acoustic environment can degrade speech quality during communication...

0 Karl El Hajal, et al. ∙

research

∙ 03/30/2022

Hybrid Handcrafted and Learnable Audio Representation for Analysis of Speech Under Cognitive and Physical Load

As a neurophysiological response to threat or adverse conditions, stress...

0 Gasser Elbanna, et al. ∙

research

∙ 11/12/2021

AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion

This paper presents AC-VC (Almost Causal Voice Conversion), a phonetic p...

0 Damien Ronssin, et al. ∙

research

∙ 10/07/2021

Power efficient analog features for audio recognition

The digital signal processing-based representations like the Mel-Frequen...

0 Boris Bergsma, et al. ∙

research

∙ 10/07/2021

SERAB: A multi-lingual benchmark for speech emotion recognition

Recent developments in speech emotion recognition (SER) often leverage d...

0 Neil Scheidwasser-Clow, et al. ∙

research

∙ 09/29/2021

A Universal Deep Room Acoustics Estimator

Speech audio quality is subject to degradation caused by an acoustic env...

0 Paula Sánchez López, et al. ∙

research

∙ 10/21/2020

Joint Blind Room Acoustic Characterization From Speech And Music Signals Using Convolutional Recurrent Neural Networks

Acoustic environment characterization opens doors for sound reproduction...

0 Paul Callens, et al. ∙

research

∙ 10/19/2020

Fast accuracy estimation of deep learning based multi-class musical source separation

Music source separation represents the task of extracting all the instru...

0 Alexandru Mocanu, et al. ∙

research

∙ 10/08/2020

FastVC: Fast Voice Conversion with non-parallel data

This paper introduces FastVC, an end-to-end model for fast Voice Convers...

0 Oriol Barbany Mayor, et al. ∙

research

∙ 10/28/2019

A Bin Encoding Training of a Spiking Neural Network-based Voice Activity Detection

Advances of deep learning for Artificial Neural Networks(ANNs) have led ...

0 Giorgia Dellaferrera, et al. ∙

research

∙ 10/22/2019

Spiking neural networks trained with backpropagation for low power neuromorphic implementation of voice activity detection

Recent advances in Voice Activity Detection (VAD) are driven by artifici...

0 Flavio Martinelli, et al. ∙

research

∙ 10/22/2019

Speech-VGG: A deep feature extractor for speech processing

A growing number of studies in the field of speech processing employ fea...

0 Pierre Beckmann, et al. ∙

research

∙ 10/20/2019

Deep speech inpainting of time-frequency masks

In particularly noisy environments, transient loud intrusions can comple...

0 Mikolaj Kegler, et al. ∙

research

∙ 04/15/2016

Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding

Most current very low bit rate (VLBR) speech coding systems use hidden M...

0 Milos Cernak, et al. ∙

research

∙ 01/22/2016

Speech vocoding for laboratory phonology

Using phonological speech vocoding, we propose a platform for exploring ...

0 Milos Cernak, et al. ∙

research

∙ 01/21/2016

On Structured Sparsity of Phonological Posteriors for Linguistic Parsing

The speech signal conveys information on different time scales from shor...

0 Milos Cernak, et al. ∙

Milos Cernak

Featured Co-authors

Sign in with Google

Consider DeepAI Pro