Soo-Whan Chung

research

∙ 06/02/2023

HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders

This paper introduces an end-to-end neural speech restoration model, HD-...

0 Doyeon Kim, et al. ∙

research

∙ 02/27/2023

MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition

Multi-lingual speech recognition aims to distinguish linguistic expressi...

0 Yoohwan Kwon, et al. ∙

research

∙ 02/27/2023

Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech

The goal of this work is zero-shot text-to-speech synthesis, with speaki...

0 Jiyoung Lee, et al. ∙

research

∙ 10/31/2022

Diffusion-based Generative Speech Source Separation

We propose DiffSep, a new single channel source separation method based ...

0 Robin Scheibler, et al. ∙

research

∙ 06/30/2022

Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting

In this paper, we propose a novel end-to-end user-defined keyword spotti...

0 Hyeon-Kyeong Shin, et al. ∙

research

∙ 04/21/2022

Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion

Deep learning has brought impressive progress in the study of both autom...

0 Hye-Jin Shim, et al. ∙

research

∙ 02/24/2022

Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement

Modern neural speech enhancement models usually include various forms of...

0 Doyeon Kim, et al. ∙

research

∙ 01/25/2022

SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge Evaluation Plan

ASV (automatic speaker verification) systems are intrinsically required ...

0 Jee-weon Jung, et al. ∙

research

∙ 08/17/2021

Look Who's Talking: Active Speaker Detection in the Wild

In this work, we present a novel audio-visual dataset for active speaker...

0 You Jin Kim, et al. ∙

research

∙ 03/25/2021

Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation

In this paper, we address the problem of separating individual speech si...

0 Jiyoung Lee, et al. ∙

research

∙ 08/04/2020

MIRNet: Learning multiple identities representations in overlapped speech

Many approaches can derive information about a single speaker's identity...

0 Hyewon Han, et al. ∙

research

∙ 08/04/2020

Intra-class variation reduction of speaker representation in disentanglement framework

In this paper, we propose an effective training strategy to extract rob...

0 Yoohwan Kwon, et al. ∙

research

∙ 05/18/2020

End-to-End Lip Synchronisation

The goal of this work is to synchronise audio and video of a talking fac...

0 You Jin Kim, et al. ∙

research

∙ 05/14/2020

FaceFilter: Audio-visual speech separation using still images

The objective of this paper is to separate a target speaker's speech fro...

9 Soo-Whan Chung, et al. ∙

research

∙ 04/29/2020

Seeing voices and hearing voices: learning discriminative embeddings using cross-modal self-supervision

The goal of this work is to train discriminative cross-modal embeddings ...

11 Soo-Whan Chung, et al. ∙

research

∙ 01/15/2019

Orthonormal Embedding-based Deep Clustering for Single-channel Speech Separation

Deep clustering is a deep neural network-based speech separation algorit...

0 Soyeon Choe, et al. ∙

research

∙ 09/21/2018

Perfect match: Improved cross-modal embeddings for audio-visual synchronisation

This paper proposes a new strategy for learning powerful cross-modal emb...

0 Soo-Whan Chung, et al. ∙
