Motivated by the superior performance of image diffusion models, more an...
This paper aims to tackle a novel task: Temporal Sentence Grounding in
...
The ability to model intra-modal and inter-modal interactions is fundame...
Image retrieval plays an important role in the Internet world. Usually, ...