We study the problem of synthesizing immersive 3D indoor scenes from one...
We study the automatic generation of navigation instructions from 360-de...
People navigating in unfamiliar buildings take advantage of myriad visua...
PanGEA, the Panoramic Graph Environment Annotation toolkit, is a lightwe...
Vision-and-Language Navigation wayfinding agents can be enhanced by expl...
We present Where Are You? (WAY), a dataset of 6k dialogs in which two h...
We study the challenging problem of releasing a robot in a previously un...
We introduce Room-Across-Room (RxR), a new Vision-and-Language Navigatio...
Textual cues are essential for everyday tasks like buying groceries and ...
Following a navigation instruction such as 'Walk down the stairs and sto...
A visually-grounded navigation instruction can be interpreted as a seque...
One of the long-term challenges of robotics is to enable humans to commu...
We introduce the task of scene-aware dialog. Given a follow-up question ...
Image captioning models have achieved impressive results on datasets con...
In recent years, the natural language processing community has moved awa...
Skillful mobile operation in three-dimensional environments is a primary...
Image captioning is the process of generating a natural language descrip...
Image captioning models are becoming increasingly successful at describi...
A robot that can carry out a natural-language instruction has been a dre...
This paper presents a state-of-the-art model for visual question answeri...
Top-down visual attention mechanisms have been used extensively in image...
Existing image captioning models do not generalize well to out-of-domain...
There is considerable interest in the task of automatically generating i...