A Correct-and-Certify Approach to Self-Supervise Object Pose Estimators via Ensemble Self-Training

by Jingnan Shi, et al.

Real-world robotics applications demand object pose estimation methods that work reliably across a variety of scenarios. Modern learning-based approaches require large labeled datasets and tend to perform poorly outside the training domain. Our first contribution is to develop a robust corrector module that corrects pose estimates using depth information, thus enabling existing methods to better generalize to new test domains; the corrector operates on semantic keypoints (but is also applicable to other pose estimators) and is fully differentiable. Our second contribution is an ensemble self-training approach that simultaneously trains multiple pose estimators in a self-supervised manner. Our ensemble self-training architecture uses the robust corrector to refine the output of each pose estimator; then, it evaluates the quality of the outputs using observable correctness certificates; finally, it uses the observably correct outputs for further training, without requiring external supervision. As an additional contribution, we propose small improvements to a regression-based keypoint detection architecture, to enhance its robustness to outliers; these improvements include a robust pooling scheme and a robust centroid computation. Experiments on the YCBV and TLESS datasets show the proposed ensemble self-training outperforms fully supervised baselines while not requiring 3D annotations on real data.
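The certify-then-select step described above can be illustrated with a minimal sketch. The abstract does not specify the form of the certificate, so everything below is an assumption made for illustration: the function names (`transform`, `certificate`, `select_certified`), the threshold `eps`, and the check itself, which accepts an estimate only if every observed depth point lies within `eps` of the posed object model. It is not the authors' implementation.

```python
import numpy as np

def transform(points, R, t):
    """Apply a rigid transform (R, t) to an (N, 3) array of points."""
    return points @ R.T + t

def certificate(observed, model, R, t, eps):
    """Hypothetical observable-correctness check (an assumption, not the
    paper's definition): every observed depth point must lie within eps
    of the model point cloud posed by the estimate (R, t)."""
    posed = transform(model, R, t)
    # Pairwise distances between observed (M, 3) and posed (N, 3) points.
    d = np.linalg.norm(observed[:, None, :] - posed[None, :, :], axis=2)
    # Worst-case distance from an observed point to its nearest model point.
    return bool(d.min(axis=1).max() < eps)

def select_certified(samples, model, eps=0.01):
    """Keep only the (observed, R, t) triples whose estimate passes the check.
    In a self-training loop these would serve as pseudo-labels."""
    return [s for s in samples if certificate(s[0], model, s[1], s[2], eps)]

# Toy usage: one accurate estimate and one badly translated estimate.
rng = np.random.default_rng(0)
model = rng.uniform(-0.1, 0.1, size=(50, 3))
R_true, t_true = np.eye(3), np.array([0.0, 0.0, 1.0])
observed = transform(model[:20], R_true, t_true)  # partial view of the object
samples = [(observed, R_true, t_true), (observed, R_true, t_true + 0.5)]
kept = select_certified(samples, model)
```

The check depends only on the depth observation and the object model, with no ground-truth pose, which is what makes this kind of certificate usable without external supervision.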




Correct and Certify: A New Approach to Self-Supervised 3D-Object Perception

We consider an object pose estimation and model fitting problem, where -...

OSSID: Online Self-Supervised Instance Detection by (and for) Pose Estimation

Real-time object pose estimation is necessary for many robot manipulatio...

Occlusion-Aware Self-Supervised Monocular 6D Object Pose Estimation

6D object pose estimation is a fundamental yet challenging problem in co...

Self-Supervised 3D Keypoint Learning for Ego-motion Estimation

Generating reliable illumination and viewpoint invariant keypoints is cr...

PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision

Existing self-supervised 3D human pose estimation schemes have largely r...

TexPose: Neural Texture Learning for Self-Supervised 6D Object Pose Estimation

In this paper, we introduce neural texture learning for 6D object pose e...

RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy

Unlike in natural images, in endoscopy there is no clear notion of an up...
