We introduce Classification with Alternating Normalization (CAN), a non-...
Recent advances in using retrieval components over external knowledge so...
Visual engagement in social media platforms comprises interactions with ...
An image is worth a thousand words, conveying information that goes beyo...
Many vision-related tasks benefit from reasoning over multiple modalitie...
Traditional action recognition models are constructed around the paradig...
Robotic surgery has been proven to offer clear advantages during surgica...
Registering images from different modalities is an active area of resear...
We present a self-supervised approach to training convolutional neural n...
Clinical examinations that involve endoscopic exploration of the nasal c...
Despite the growing discriminative capabilities of modern deep learning ...
Limited labeled data are available for the research of estimating facial...
The ability to identify and temporally segment fine-grained human action...
Functional endoscopic sinus surgery (FESS) is a surgical procedure used ...
The dominant paradigm for video-based action segmentation is composed of...
Joint segmentation and classification of fine-grained actions is importa...