AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection

09/21/2017
by Thanh-Toan Do, et al.

We propose AffordanceNet, a new deep learning approach to simultaneously detect multiple objects and their affordances from RGB images. AffordanceNet has two branches: an object detection branch that localizes and classifies each object, and an affordance detection branch that assigns each pixel in the object to its most probable affordance label. The proposed framework employs three key components to handle the multiclass problem in the affordance mask effectively: a sequence of deconvolutional layers, a robust resizing strategy, and a multi-task loss function. Experimental results on public datasets show that AffordanceNet outperforms recent state-of-the-art methods by a fair margin, while its end-to-end architecture allows inference at 150 ms per image. This makes AffordanceNet well suited for real-time robotic applications. Furthermore, we demonstrate the effectiveness of AffordanceNet in different testing environments and in real robotic applications. The source code is available at https://github.com/nqanh/affordance-net.
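
As a rough illustration of what "assign each pixel in the object to its most probable affordance label" can mean in practice, the sketch below applies a per-pixel softmax over label scores and keeps the highest-probability label. It is a minimal sketch and not the authors' implementation: the function names `softmax` and `affordance_mask`, the nested-list layout `logits[k][y][x]`, and the three example labels are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Convert a list of raw scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def affordance_mask(logits):
    """Pick the most probable label index for every pixel.

    `logits[k][y][x]` is the (hypothetical) raw score of affordance
    label k at pixel (y, x). Returns a 2D list `mask[y][x]` of label
    indices. Ties resolve to the lowest label index.
    """
    num_labels = len(logits)
    height = len(logits[0])
    width = len(logits[0][0])
    mask = []
    for y in range(height):
        row = []
        for x in range(width):
            probs = softmax([logits[k][y][x] for k in range(num_labels)])
            best = max(range(num_labels), key=lambda k: probs[k])
            row.append(best)
        mask.append(row)
    return mask


# Hypothetical labels: 0 = background, 1 = grasp, 2 = cut.
example_logits = [
    [[0.0, 0.0], [0.0, 0.0]],  # background scores
    [[5.0, 0.0], [0.0, 0.0]],  # "grasp" scores
    [[0.0, 0.0], [0.0, 5.0]],  # "cut" scores
]
example_mask = affordance_mask(example_logits)
```

Here `example_mask` labels the top-left pixel as 1 and the bottom-right pixel as 2, while the two pixels with uniform scores fall back to label 0. Because softmax preserves the ordering of scores, taking the argmax of the raw scores would give the same mask; the softmax step is kept only to make the "most probable" reading explicit.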


