Thomas Hummel | DeepAI


Featured Co-authors

Zhiyuan Liu
209 publications
Stefan Wermter
125 publications
Zeynep Akata
103 publications
Yuan Yao
99 publications
Cornelius Weber
43 publications
Matthias Kerzel
27 publications
A. Sophia Koepke
16 publications
Tobias Hinz
12 publications
Yanbei Chen
12 publications
Stephan Alaniz
10 publications
Stefan Heinrich
9 publications

research

09/07/2023

Text-to-feature diffusion for audio-visual few-shot learning

Training deep learning models for video classification from audio-visual...

Otniel-Bogdan Mercea, et al.

research

09/06/2022

Semantic Image Synthesis with Semantically Coupled VQ-Model

Semantic image synthesis enables control over unconditional image genera...

Stephan Alaniz, et al.

research

07/20/2022

Temporal and cross-modal attention for audio-visual zero-shot learning

Audio-visual generalised zero-shot learning for video classification req...

Otniel-Bogdan Mercea, et al.

research

05/04/2021

Where and When: Space-Time Attention for Audio-Visual Explanations

Explaining the decision of a multi-modal decision-maker requires to dete...

Yanbei Chen, et al.

research

06/24/2020

Crossmodal Language Grounding in an Embodied Neurocognitive Model

Human infants are able to acquire natural language seemingly easily at a...

Stefan Heinrich, et al.
