Contrastive learning methods train visual encoders by comparing views fr...
Although the pre-trained Vision Transformers (ViTs) achieved great succe...
This paper presents the development of a multi-sensor extended reality
p...
This paper proposes a novel pretext task to address the self-supervised ...
This paper addresses the problem of self-supervised video representation...
We address the problem of video representation learning without
human-an...