Sefik Emre Eskimez

research

∙ 11/09/2022

Speech separation with large-scale self-supervised learning

Self-supervised learning (SSL) methods such as WavLM have shown promisin...

0 Zhuo Chen, et al. ∙

research

∙ 11/05/2022

Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation

Personalized speech enhancement (PSE) models achieve promising results c...

0 Hassan Taherian, et al. ∙

research

∙ 11/04/2022

Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation with E3Net

Personalized speech enhancement (PSE), a process of estimating a clean t...

0 Sefik Emre Eskimez, et al. ∙

research

∙ 04/07/2022

Leveraging Real Conversational Data for Multi-Channel Continuous Speech Separation

Existing multi-channel continuous speech separation (CSS) models are hea...

0 Xiaofei Wang, et al. ∙

research

∙ 04/02/2022

Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation

This paper investigates how to improve the runtime speed of personalized...

0 Manthan Thakker, et al. ∙

research

∙ 02/27/2022

ICASSP 2022 Deep Noise Suppression Challenge

The Deep Noise Suppression (DNS) challenge is designed to foster innovat...

0 Harishchandra Dubey, et al. ∙

research

∙ 12/10/2021

Sequence-level self-learning with multiple hypotheses

In this work, we develop new self-learning techniques with an attention-...

0 Kenichi Kumatani, et al. ∙

research

∙ 10/27/2021

Separating Long-Form Speech with Group-Wise Permutation Invariant Training

Multi-talker conversational speech processing has drawn many interests f...

0 Wangyou Zhang, et al. ∙

research

∙ 10/20/2021

One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement

With the recent surge of video conferencing tools usage, providing high-...

0 Hassan Taherian, et al. ∙

research

∙ 10/18/2021

Personalized Speech Enhancement: New Models and Comprehensive Evaluation

Personalized speech enhancement (PSE) models utilize additional cues, su...

0 Sefik Emre Eskimez, et al. ∙

research

∙ 10/13/2021

All-neural beamformer for continuous speech separation

Continuous speech separation (CSS) aims to separate overlapping voices f...

0 Zhuohuang Zhang, et al. ∙

research

∙ 06/14/2021

Dynamic Gradient Aggregation for Federated Domain Adaptation

In this paper, a new learning algorithm for Federated Learning (FL) is i...

0 Dimitrios Dimitriadis, et al. ∙

research

∙ 08/08/2020

Speech Driven Talking Face Generation from a Single Image and an Emotion Condition

Visual emotion expression plays an important role in audiovisual speech ...

0 Sefik Emre Eskimez, et al. ∙

research

∙ 08/06/2020

Federated Transfer Learning with Dynamic Gradient Aggregation

In this paper, a Federated Learning (FL) simulation platform is introduc...

23 Dimitrios Dimitriadis, et al. ∙

research

∙ 04/09/2020

Improving Readability for Automatic Speech Recognition Transcription

Modern Automatic Speech Recognition (ASR) systems can achieve high perfo...

0 Junwei Liao, et al. ∙

research

∙ 03/26/2018

Generating Talking Face Landmarks from Speech

The presence of a corresponding talking face has been shown to significa...

0 Sefik Emre Eskimez, et al. ∙

Sefik Emre Eskimez

Featured Co-authors

Sign in with Google

Consider DeepAI Pro