Sachin Kajarekar

research

∙ 02/08/2022

CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations

Deriving multimodal representations of audio and lexical inputs is a cen...

Vin Sachidananda, et al.

research

∙ 10/09/2021

Streaming on-device detection of device directed speech from voice and touch-based invocation

When interacting with smart devices such as mobile phones or wearables, ...

Ognjen Rudovic, et al.

research

∙ 06/18/2021

Analysis and Tuning of a Voice Assistant System for Dysfluent Speech

Dysfluencies and variations in speech pronunciation can severely degrade...

Vikramjit Mitra, et al.

research

∙ 02/24/2021

SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter

The ability to automatically detect stuttering events in speech could he...

Colin Lea, et al.

research

∙ 10/20/2020

Knowledge Transfer for Efficient On-device False Trigger Mitigation

In this paper, we address the task of determining whether a given uttera...

Pranay Dighe, et al.

research

∙ 08/03/2020

Audiovisual Speech Synthesis using Tacotron2

Audiovisual speech synthesis is the problem of synthesizing a talking fa...

Ahmed Hussen Abdelaziz, et al.

research

∙ 04/25/2020

Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement

We present an introspection of an audiovisual speech enhancement model. ...

Zakaria Aldeneh, et al.

research

∙ 01/31/2020

Detecting Emotion Primitives from Speech and their use in discerning Categorical Emotions

Emotion plays an essential role in human-to-human communication, enablin...

Vasudha Kowtha, et al.

research

∙ 01/26/2020

Multi-task Learning for Speaker Verification and Voice Trigger Detection

Automatic speech transcription and speaker recognition are usually treat...

Siddharth Sigtia, et al.
