Vedanuj Goswami

research

∙ 07/18/2023

Llama 2: Open Foundation and Fine-Tuned Chat Models

In this work, we develop and release Llama 2, a collection of pretrained...

0 Hugo Touvron, et al. ∙

research

∙ 07/17/2023

Multilingual Speech-to-Speech Translation into Multiple Target Languages

Speech-to-speech translation (S2ST) enables spoken communication between...

0 Hongyu Gong, et al. ∙

research

∙ 05/23/2023

Revisiting Machine Translation for Cross-lingual Classification

Machine Translation (MT) has been widely used for cross-lingual classifi...

0 Mikel Artetxe, et al. ∙

research

∙ 05/03/2023

Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity

Mixture-of-experts (MoE) models that employ sparse activation have demon...

0 Haoran Xu, et al. ∙

research

∙ 03/01/2023

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

We introduce MuAViC, a multilingual audio-visual corpus for robust speec...

0 Mohamed Anwar, et al. ∙

research

∙ 02/10/2023

Language-Aware Multilingual Machine Translation with Self-Supervised Learning

Multilingual machine translation (MMT) benefits from cross-lingual trans...

0 Haoran Xu, et al. ∙

research

∙ 12/14/2022

Causes and Cures for Interference in Multilingual Translation

Multilingual machine translation models can benefit from synergy between...

0 Uri Shaham, et al. ∙

research

∙ 07/11/2022

No Language Left Behind: Scaling Human-Centered Machine Translation

Driven by the goal of eradicating language barriers on a global scale, m...

9 NLLB team, et al. ∙

research

∙ 12/08/2021

FLAVA: A Foundational Language And Vision Alignment Model

State-of-the-art vision and vision-and-language models rely on large-sca...

2 Amanpreet Singh, et al. ∙

research

∙ 10/15/2021

Tricks for Training Sparse Translation Models

Multi-task learning with an unbalanced data distribution skews model lea...

8 Dheeru Dua, et al. ∙

research

∙ 06/04/2021

Human-Adversarial Visual Question Answering

Performance on the most commonly used Visual Question Answering dataset ...

5 Sasha Sheng, et al. ∙

research

∙ 11/19/2020

Creative Sketch Generation

Sketching or doodling is a popular creative activity that people engage ...

11 Songwei Ge, et al. ∙

research

∙ 05/10/2020

The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes

This work proposes a new challenge set for multimodal classification, fo...

7 Douwe Kiela, et al. ∙

research

∙ 04/24/2020

Revisiting Modulated Convolutions for Visual Counting and Beyond

This paper targets at visual counting, where the setup is to estimate th...

1 Duy-Kien Nguyen, et al. ∙

research

∙ 04/19/2020

Are we pretraining it right? Digging deeper into visio-linguistic pretraining

Numerous recent works have proposed pretraining generic visio-linguistic...

1 Amanpreet Singh, et al. ∙

research

∙ 12/05/2019

12-in-1: Multi-Task Vision and Language Representation Learning

Much of vision-and-language research focuses on a small but diverse set ...

22 Jiasen Lu, et al. ∙

research

∙ 07/19/2019

Only Time Can Tell: Discovering Temporal Data for Temporal Modeling

Understanding temporal information and how the visual world changes over...

1 Laura Sevilla-Lara, et al. ∙

Vedanuj Goswami

Featured Co-authors

Sign in with Google

Consider DeepAI Pro