Animating an object in 3D often requires an articulated structure, e.g. ...
Building animatable and editable models of clothed humans from raw 3D sc...
Obtaining 3D object representations is important for creating photo-real...
We introduce Housekeep, a benchmark to evaluate commonsense reasoning in...
Video accessibility is crucial for blind and low vision users for equita...
Recent Visual Question Answering (VQA) models have shown impressive perf...
Textual cues are essential for everyday tasks like buying groceries and ...
This work is part of the ICLR Reproducibility Challenge 2019; we try to re...