Fit to Measure: Reasoning about Sizes for Robust Object Recognition

by   Agnese Chiatti, et al.

Service robots can help with many of our daily tasks, especially in those cases where it is inconvenient or unsafe for us to intervene: e.g., under extreme weather conditions or when social distance needs to be maintained. However, before we can successfully delegate complex tasks to robots, we need to enhance their ability to make sense of dynamic, real world environments. In this context, the first prerequisite to improving the Visual Intelligence of a robot is building robust and reliable object recognition systems. While object recognition solutions are traditionally based on Machine Learning methods, augmenting them with knowledge based reasoners has been shown to improve their performance. In particular, based on our prior work on identifying the epistemic requirements of Visual Intelligence, we hypothesise that knowledge of the typical size of objects could significantly improve the accuracy of an object recognition system. To verify this hypothesis, in this paper we present an approach to integrating knowledge about object sizes in a ML based architecture. Our experiments in a real world robotic scenario show that this combined approach ensures a significant performance increase over state of the art Machine Learning methods.


Towards a Framework for Visual Intelligence in Service Robotics: Epistemic Requirements and Gap Analysis

A key capability required by service robots operating in real-world, dyn...

A Hybrid SLAM and Object Recognition System for Pepper Robot

Humanoid robots are playing increasingly important roles in real-life ta...

Mass Estimation in Manipulation Tasks of Domestic Service Robots using Fault Reconstruction Techniques

Manipulation is a key capability in domestic service robots, as can be s...

Learning Illuminant Estimation from Object Recognition

In this paper we present a deep learning method to estimate the illumina...

SizeNet: Object Recognition via Object Real Size-based convolutional networks

Inspired by the conclusion that human choose the visual cortex regions w...

Commonsense Spatial Reasoning for Visually Intelligent Agents

Service robots are expected to reliably make sense of complex, fast-chan...

Combining Shape Features with Multiple Color Spaces in Open-Ended 3D Object Recognition

As a consequence of an ever-increasing number of camera-based service ro...

Please sign up or login with your details

Forgot password? Click here to reset