Generalized Object Search

01/24/2023
by   Kaiyu Zheng, et al.
0

Future collaborative robots must be capable of finding objects. As such a fundamental skill, we expect object search to eventually become an off-the-shelf capability for any robot, similar to e.g., object detection, SLAM, and motion planning. However, existing approaches either make unrealistic compromises (e.g., reduce the problem from 3D to 2D), resort to ad-hoc, greedy search strategies, or attempt to learn end-to-end policies in simulation that are yet to generalize across real robots and environments. This thesis argues that through using Partially Observable Markov Decision Processes (POMDPs) to model object search while exploiting structures in the human world (e.g., octrees, correlations) and in human-robot interaction (e.g., spatial language), a practical and effective system for generalized object search can be achieved. In support of this argument, I develop methods and systems for (multi-)object search in 3D environments under uncertainty due to limited field of view, occlusion, noisy, unreliable detectors, spatial correlations between objects, and possibly ambiguous spatial language (e.g., "The red car is behind Chase Bank"). Besides evaluation in simulators such as PyGame, AirSim, and AI2-THOR, I design and implement a robot-independent, environment-agnostic system for generalized object search in 3D and deploy it on the Boston Dynamics Spot robot, the Kinova MOVO robot, and the Universal Robots UR5e robotic arm, to perform object search in different environments. The system enables, for example, a Spot robot to find a toy cat hidden underneath a couch in a kitchen area in under one minute. This thesis also broadly surveys the object search literature, proposing taxonomies in object search problem settings, methods and systems.

READ FULL TEXT

page 24

page 27

research
03/06/2023

A System for Generalized 3D Multi-Object Search

Searching for objects is a fundamental skill for robots. As such, we exp...
research
12/04/2020

Spatial Language Understanding for Object Search in Partially Observed Cityscale Environments

We present a system that enables robots to interpret spatial language as...
research
10/19/2021

Towards Optimal Correlational Object Search

In realistic applications of object search, robots will need to locate t...
research
01/18/2014

Modelling Observation Correlations for Active Exploration and Robust Object Detection

Today, mobile robots are expected to carry out increasingly complex task...
research
02/24/2023

From Occlusion to Insight: Object Search in Semantic Shelves using Large Language Models

How can a robot efficiently extract a desired object from a shelf when i...
research
09/13/2023

Language-Conditioned Observation Models for Visual Object Search

Object search is a challenging task because when given complex language ...
research
02/09/2020

Grasping and Manipulation with a Multi-Fingered Hand

This thesis is concerned with deriving planning algorithms for robot man...

Please sign up or login with your details

Forgot password? Click here to reset