Blindfold Baselines for Embodied QA

11/12/2018
by   Ankesh Anand, et al.
6

We explore blindfold (question-only) baselines for Embodied Question Answering. The EmbodiedQA task requires an agent to answer a question by intelligently navigating in a simulated environment, gathering necessary visual information only through first-person vision before finally answering. Consequently, a blindfold baseline which ignores the environment and visual information is a degenerate solution, yet we show through our experiments on the EQAv1 dataset that a simple question-only baseline achieves state-of-the-art results on the EmbodiedQA task in all cases except when the agent is spawned extremely close to the object.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/28/2020

A Dataset and Baselines for Visual Question Answering on Art

Answering questions related to art pieces (paintings) is a difficult tas...
research
04/08/2019

Revisiting EmbodiedQA: A Simple Baseline and Beyond

In Embodied Question Answering (EmbodiedQA), an agent interacts with an ...
research
11/30/2017

Embodied Question Answering

We present a new AI task -- Embodied Question Answering (EmbodiedQA) -- ...
research
10/06/2022

Embodied Referring Expression for Manipulation Question Answering in Interactive Environment

Embodied agents are expected to perform more complicated tasks in an int...
research
10/16/2021

Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation

An embodied task such as embodied question answering (EmbodiedQA), requi...
research
10/31/2019

TAB-VCR: Tags and Attributes based VCR Baselines

Reasoning is an important ability that we learn from a very early age. Y...
research
05/03/2022

Episodic Memory Question Answering

Egocentric augmented reality devices such as wearable glasses passively ...

Please sign up or login with your details

Forgot password? Click here to reset