Layne Berry | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Shinji Watanabe
239 publications
Hung-Yi Lee
187 publications
Yu Tsao
127 publications
Eunsol Choi
48 publications
Abdelrahman Mohamed
41 publications
David Harwath
35 publications
Shang-Wen Li
34 publications
Yi-Ting Chen
26 publications
Haibin Wu
23 publications
Po-Yao Huang
20 publications
Kyle Mahowald
18 publications

research

∙ 09/19/2023

AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models

Audio-visual representation learning aims to develop systems with human-...

0 Yuan Tseng, et al. ∙

research

∙ 11/02/2022

M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval

This work investigates the use of large-scale, pre-trained models (CLIP ...

0 Layne Berry, et al. ∙

research

∙ 11/01/2022

Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality

Recent visuolinguistic pre-trained models show promising progress on var...

0 Anuj Diwan, et al. ∙

research

∙ 10/03/2022

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model

Data-driven speech processing models usually perform well with a large a...

0 Yi-Jen Shih, et al. ∙

Success!

An error occurred