SeDAR - Semantic Detection and Ranging: Humans can localise without LiDAR, can robots?

09/05/2017
by   Oscar Mendez, et al.
0

How does a person work out their location using a floorplan? It is probably safe to say that we do not explicitly measure depths to every visible surface and try to match them against different pose estimates in the floorplan. And yet, this is exactly how most robotic scan-matching algorithms operate. Similarly, we do not extrude the 2D geometry present in the floorplan into 3D and try to align it to the real-world. And yet, this is how most vision-based approaches localise. Humans do the exact opposite. Instead of depth, we use high level semantic cues. Instead of extruding the floorplan up into the third dimension, we collapse the 3D world into a 2D representation. Evidence of this is that many of the floorplans we use in everyday life are not accurate, opting instead for high levels of discriminative landmarks. In this work, we use this insight to present a global localisation approach that relies solely on the semantic labels present in the floorplan and extracted from RGB images. While our approach is able to use range measurements if available, we demonstrate that they are unnecessary as we can achieve results comparable to state-of-the-art without them.

READ FULL TEXT

page 1

page 5

page 6

page 8

page 9

page 11

research
12/14/2015

Understanding Human-Centric Images: From Geometry to Fashion

Understanding humans from photographs has always been a fundamental goal...
research
12/04/2018

SurfConv: Bridging 3D and 2D Convolution for RGBD Images

We tackle the problem of using 3D information in convolutional neural ne...
research
07/30/2021

SemSegMap- 3D Segment-Based Semantic Localization

Localization is an essential task for mobile autonomous robotic systems ...
research
02/17/2016

PlaNet - Photo Geolocation with Convolutional Neural Networks

Is it possible to build a system to determine the location where a photo...
research
03/04/2020

Semantic sensor fusion: from camera to sparse lidar information

To navigate through urban roads, an automated vehicle must be able to pe...
research
07/24/2018

CReaM: Condensed Real-time Models for Depth Prediction using Convolutional Neural Networks

Since the resurgence of CNNs the robotic vision community has developed ...
research
09/19/2023

Vision-based Situational Graphs Generating Optimizable 3D Scene Representations

3D scene graphs offer a more efficient representation of the environment...

Please sign up or login with your details

Forgot password? Click here to reset