Weakly-supervised vision-language (V-L) pre-training (W-VLP) aims at lea...
Text recognition is a major computer vision task with a large set of assoc...
Learning to hash is an efficient paradigm for exact and approximate near...
Using class labels to represent class similarity is a typical approach t...