Task planning is an important component of traditional robotics systems
...
Embodied agents need to be able to interact in natural language understa...
Robots operating in human spaces must be able to engage in natural langu...
Recent work has shown that pre-trained language models such as BERT impr...
Visual referring expression recognition is a challenging task that requi...
User generated text on social media often suffers from a lot of undesire...
In this paper, we study abstractive summarization for open-domain videos...
Recent work has shown that visual context improves cross-lingual sense
d...
In this paper we propose a model to learn multimodal multilingual
repres...
A large amount of recent research has focused on tasks that combine lang...
We introduce a new task, visual sense disambiguation for verbs: given an...
We propose an approach for helping agents compose email replies to custo...