Large pre-trained vision-language models (VLMs) reduce the time for deve...
In this paper, we introduce UnFuSeD, a novel approach to leverage self-s...
We present a new Self-Supervised Learning (SSL) approach to pre-train en...
We present Multiscale Audio Spectrogram Transformer (MAST) for audio cla...
Self-supervised learning (SSL) to learn high-level speech representation...
Inspired by the recent progress in self-supervised learning for computer...
We introduce DECAR, a self-supervised pre-training approach for learning...
India is home to multiple languages, and training automatic speech recog...