Semi-Perspective Decoupled Heatmaps for 3D Robot Pose Estimation from Depth Maps

by Alessandro Simoni, et al.

Knowing the exact 3D location of workers and robots in a collaborative environment enables several practical applications, such as detecting unsafe situations or studying mutual interactions for statistical and social purposes. In this paper, we propose a non-invasive and light-invariant framework, based on depth devices and deep neural networks, to estimate the 3D pose of robots from an external camera. The method can be applied to any robot without requiring hardware access to its internal states. We introduce a novel representation of the predicted pose, namely Semi-Perspective Decoupled Heatmaps (SPDH), which accurately computes 3D joint locations in world coordinates by adapting efficient deep networks designed for 2D human pose estimation. The proposed approach, which takes as input a depth representation based on XYZ coordinates, can be trained on synthetic depth data and applied to real-world settings without the need for domain adaptation techniques. To this end, we present the SimBa dataset, which contains both synthetic and real depth images, and use it for the experimental evaluation. Results show that the proposed approach, which combines a specific depth map representation with SPDH, outperforms the current state of the art.
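
The abstract does not specify how the XYZ-based depth representation is built. As a rough illustration only, the sketch below back-projects a depth map into a per-pixel XYZ image using a standard pinhole camera model. The function name `depth_to_xyz` and the intrinsics `fx`, `fy`, `cx`, `cy` are assumptions made for this example, not names taken from the paper, and the output is in camera coordinates rather than world coordinates.

```python
import numpy as np


def depth_to_xyz(depth, fx, fy, cx, cy):
    """Back-project a depth map of shape (H, W) into an (H, W, 3) XYZ image.

    Assumes a pinhole camera with focal lengths (fx, fy) and principal
    point (cx, cy), all in pixels. Depth and output share the same unit.
    This is an illustrative sketch, not the authors' implementation.
    """
    h, w = depth.shape
    # u indexes columns (image x), v indexes rows (image y).
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    z = depth.astype(np.float64)
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return np.stack([x, y, z], axis=-1)


# Hypothetical usage: a flat surface 2 units from the camera.
depth = np.full((4, 6), 2.0)
xyz = depth_to_xyz(depth, fx=100.0, fy=100.0, cx=3.0, cy=2.0)
```

Under these assumptions, the pixel at the principal point maps to a point on the optical axis, and every pixel keeps its original depth in the Z channel, so the resulting three-channel image can be fed to a network in place of a single-channel depth map.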




