Understanding videos is challenging in computer vision. In particular, t...
The visual and audio modalities are highly correlated yet they contain
d...
We present a self-supervised task on point clouds, in order to learn
mea...
Video action detectors are usually trained using video datasets with ful...
The 3rd annual installment of the ActivityNet Large- Scale Activity
Reco...
Despite the recent progress in video understanding and the continuous ra...
The ActivityNet Large Scale Activity Recognition Challenge 2017 Summary:...
Traditional approaches for action detection use trimmed data to learn
so...