Towards Embodied Scene Description

04/30/2020
by Sinan Tan, et al.

Embodiment is an important characteristic of all intelligent agents (creatures and robots), yet existing scene description tasks mainly analyze images passively, so the semantic understanding of the scene is separated from the interaction between the agent and its environment. In this work, we propose Embodied Scene Description, which exploits the agent's embodiment to find an optimal viewpoint in its environment for scene description tasks. A learning framework that combines imitation learning and reinforcement learning is established to teach the intelligent agent to generate the corresponding sensorimotor activities. The proposed framework is tested on both the AI2Thor dataset and a real-world robotic platform, demonstrating the effectiveness and extendability of the developed method.

research
04/29/2019

Argus: Smartphone-enabled Human Cooperation via Multi-Agent Reinforcement Learning for Disaster Situational Awareness

Argus exploits a Multi-Agent Reinforcement Learning (MARL) framework to ...
research
07/16/2022

Scene Graph for Embodied Exploration in Cluttered Scenario

The ability to handle objects in cluttered environment has been long ant...
research
08/14/2019

3-D Scene Graph: A Sparse and Semantic Representation of Physical Environments for Intelligent Agents

Intelligent agents gather information and perceive semantics within the ...
research
02/26/2018

Reinforcement and Imitation Learning for Diverse Visuomotor Skills

We propose a model-free deep reinforcement learning method that leverage...
research
08/17/2021

Indoor Semantic Scene Understanding using Multi-modality Fusion

Seamless Human-Robot Interaction is the ultimate goal of developing serv...
research
06/03/2011

Accelerating Reinforcement Learning through Implicit Imitation

Imitation can be viewed as a means of enhancing learning in multiagent e...
research
03/01/2021

Panoramic Panoptic Segmentation: Towards Complete Surrounding Understanding via Unsupervised Contrastive Learning

In this work, we introduce panoramic panoptic segmentation as the most h...
