A Mixed Classification-Regression Framework for 3D Pose Estimation from 2D Images

05/08/2018
by Siddharth Mahendran, et al.

3D pose estimation from a single 2D image is an important and challenging task in computer vision with applications in autonomous driving, robot manipulation and augmented reality. Since 3D pose is a continuous quantity, a natural formulation for this task is to solve a pose regression problem. However, since pose regression methods return a single estimate of the pose, they have difficulties handling multimodal pose distributions (e.g. in the case of symmetric objects). An alternative formulation, which can capture multimodal pose distributions, is to discretize the pose space into bins and solve a pose classification problem. However, pose classification methods can give large pose estimation errors depending on the coarseness of the discretization. In this paper, we propose a mixed classification-regression framework that uses a classification network to produce a discrete multimodal pose estimate and a regression network to produce a continuous refinement of the discrete estimate. The proposed framework can accommodate different architectures and loss functions, leading to multiple classification-regression models, some of which achieve state-of-the-art performance on the challenging Pascal3D+ dataset.
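The idea of a discrete estimate followed by a continuous refinement can be illustrated with a minimal sketch. This is not the authors' architecture. It assumes a toy one-dimensional pose (a single angle in degrees), uniform bins, and hand-written arrays standing in for network outputs. The names `bin_centers`, `combine`, and `angular_error` are hypothetical helpers introduced only for this illustration.

```python
import numpy as np

def bin_centers(num_bins):
    """Centers of uniform bins covering [0, 360) degrees (assumed discretization)."""
    width = 360.0 / num_bins
    return (np.arange(num_bins) + 0.5) * width

def combine(class_logits, residuals, centers):
    """Pick the highest-scoring bin, then add that bin's regressed offset."""
    k = int(np.argmax(class_logits))
    return (centers[k] + residuals[k]) % 360.0

def angular_error(a, b):
    """Smallest absolute difference between two angles, in degrees."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)

# Stand-ins for network outputs (assumed values, not real predictions).
centers = bin_centers(8)                 # 45-degree bins: 22.5, 67.5, 112.5, ...
true_angle = 100.0
logits = np.array([0.1, 0.2, 3.0, 0.1, 0.0, 0.0, 2.5, 0.1])  # two competing modes
residuals = np.zeros(8)
residuals[2] = true_angle - centers[2]   # pretend the regressor is exact for bin 2

coarse = centers[int(np.argmax(logits))]         # classification-only estimate
refined = combine(logits, residuals, centers)    # classification + regression

print("classification only error:", angular_error(coarse, true_angle))
print("mixed estimate error:", angular_error(refined, true_angle))
```

In this sketch the classification-only estimate is limited by the bin width (its error can be up to half a bin, 22.5 degrees here), while the logits can still carry more than one mode. Adding a per-bin residual removes the discretization error without giving up the multimodal scores.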


research
01/15/2022

A Critical Analysis of Image-based Camera Pose Estimation Techniques

Camera, and associated with its objects within the field of view, locali...
research
11/27/2019

Multi-View Matching Network for 6D Pose Estimation

Applications that interact with the real world such as augmented reality...
research
01/24/2018

The challenge of simultaneous object detection and pose estimation: a comparative study

Detecting objects and estimating their pose remains as one of the major ...
research
12/04/2022

Review on 6D Object Pose Estimation with the focus on Indoor Scene Understanding

6D object pose estimation problem has been extensively studied in the fi...
research
09/26/2020

I Like to Move It: 6D Pose Estimation as an Action Decision Process

Object pose estimation is an integral part of robot vision and augmented...
research
12/20/2020

Deep Bingham Networks: Dealing with Uncertainty and Ambiguity in Pose Estimation

In this work, we introduce Deep Bingham Networks (DBN), a generic framew...
research
09/23/2019

Large Scale Joint Semantic Re-Localisation and Scene Understanding via Globally Unique Instance Coordinate Regression

In this work we present a novel approach to joint semantic localisation ...
