Training deep learning models for video classification from audio-visual...
Semantic image synthesis enables control over unconditional image genera...
Audio-visual generalised zero-shot learning for video classification req...
Explaining the decision of a multi-modal decision-maker requires to dete...
Human infants are able to acquire natural language seemingly easily at a...