Video-language pre-training (VLP) has become increasingly important due ...
Predicting the geographic location (geo-localization) from a single grou...
Multimodal learning is an emerging yet challenging research area. In thi...
Today's Internet is awash in memes as they are humorous, satirical, or i...