High Definition (HD) maps are maps with precise definitions of road lane...
Predicting the future behavior of moving agents is essential for real wo...
Behavior prediction in dynamic, multi-agent systems is an important prob...
Detecting pedestrians and predicting future trajectories for them are cr...
The task of referring relationships is to localize subject and object en...
Recent work on 3D object detection advocates point cloud voxelization in...
The labeling cost of a large number of bounding boxes is one of the main c...
We address the problem of language-based temporal localization in untrim...
Temporal action proposal generation is an important task, akin to object...
Video-based person reID is an important task, which has received much at...
Video Question Answering (QA) is an important task in understanding vide...
Given a natural language query, a phrase grounding system aims to locali...
Fine-grained image labels are desirable for many computer vision applica...
In this work, we address the problem of spatio-temporal action detection...
Action anticipation aims to detect an action before it happens. Many rea...
This paper focuses on temporal localization of actions in untrimmed vide...
Temporal action detection in long videos is an important problem. State-...
Temporal Action Proposal (TAP) generation is an important problem, as fa...
Action classification in still images has been a popular research topic ...
Action classification in still images is an important task in computer v...