Improved Object Pose Estimation via Deep Pre-touch Sensing

by   Patrick Lancaster, et al.

For certain manipulation tasks, object pose estimation from head-mounted cameras may not be sufficiently accurate. This is at least in part due to our inability to perfectly calibrate the coordinate frames of today's high degree of freedom robot arms that link the head to the end-effectors. We present a novel framework combining pre-touch sensing and deep learning to more accurately estimate pose in an efficient manner. The use of pre-touch sensing allows our method to localize the object directly with respect to the robot's end effector, thereby avoiding error caused by miscalibration of the arms. Instead of requiring the robot to scan the entire object with its pre-touch sensor, we use a deep neural network to detect object regions that contain distinctive geometric features. By focusing pre-touch sensing on these regions, the robot can more efficiently gather the information necessary to adjust its original pose estimate. Our region detection network was trained using a new dataset containing objects of widely varying geometries and has been labeled in a scalable fashion that is free from human bias. This dataset is applicable to any task that involves a pre-touch sensor gathering geometric information, and has been made publicly available. We evaluate our framework by having the robot re-estimate the pose of a number of objects of varying geometries. Compared to two simpler region proposal methods, we find that our deep neural network performs significantly better. In addition, we find that after a sequence of scans, objects can typically be localized to within 0.5 cm of their true position. We also observe that the original pose estimate can often be significantly improved after collecting a single quick scan.


page 1

page 2

page 3

page 4

page 7


Domain Randomization for Active Pose Estimation

Accurate state estimation is a fundamental component of robotic control....

Multi-View Matching Network for 6D Pose Estimation

Applications that interact with the real world such as augmented reality...

UET-Headpose: A sensor-based top-view head pose dataset

Head pose estimation is a challenging task that aims to solve problems r...

EfficientPose – An efficient, accurate and scalable end-to-end 6D multi object pose estimation approach

In this paper we introduce EfficientPose, a new approach for 6D object p...

Hybrid Bayesian Eigenobjects: Combining Linear Subspace and Deep Network Methods for 3D Robot Vision

We introduce Hybrid Bayesian Eigenobjects (HBEOs), a novel representatio...

Virtual Training for a Real Application: Accurate Object-Robot Relative Localization without Calibration

Localizing an object accurately with respect to a robot is a key step fo...

Collaborative Self Organizing Map with DeepNNs for Fake Task Prevention in Mobile Crowdsensing

Mobile Crowdsensing (MCS) is a sensing paradigm that has transformed the...

Please sign up or login with your details

Forgot password? Click here to reset