MinkSORT: A 3D deep feature extractor using sparse convolutions to improve 3D multi-object tracking in greenhouse tomato plants

07/11/2023
by   David Rapado Rincon, et al.
0

The agro-food industry is turning to robots to address the challenge of labour shortage. However, agro-food environments pose difficulties for robots due to high variation and occlusions. In the presence of these challenges, accurate world models, with information about object location, shape, and properties, are crucial for robots to perform tasks accurately. Building such models is challenging due to the complex and unique nature of agro-food environments, and errors in the model can lead to task execution issues. In this paper, we propose MinkSORT, a novel method for generating tracking features using a 3D sparse convolutional network in a deepSORT-like approach to improve the accuracy of world models in agro-food environments. We evaluated our feature extractor network using real-world data collected in a tomato greenhouse, which significantly improved the performance of our baseline model that tracks tomato positions in 3D using a Kalman filter and Mahalanobis distance. Our deep learning feature extractor improved the HOTA from 42.8 44.77 57.63 training our deep learning feature extractor and demonstrated that our approach leads to improved performance in terms of three separate precision and recall detection outcomes. Our method improves world model accuracy, enabling robots to perform tasks such as harvesting and plant maintenance with greater efficiency and accuracy, which is essential for meeting the growing demand for food in a sustainable manner.

READ FULL TEXT
research
06/17/2016

DeepFood: Deep Learning-Based Food Image Recognition for Computer-Aided Dietary Assessment

Worldwide, in 2014, more than 1.9 billion adults, 18 years and older, we...
research
06/18/2020

Computer Vision with Deep Learning for Plant Phenotyping in Agriculture: A Survey

In light of growing challenges in agriculture with ever growing food dem...
research
09/15/2023

Personalized Food Image Classification: Benchmark Datasets and New Baseline

Food image classification is a fundamental step of image-based dietary a...
research
03/18/2019

MUSEFood: Multi-sensor-based Food Volume Estimation on Smartphones

Researches have shown that diet recording can help people increase aware...
research
12/31/2021

Energy-Aware Multi-Robot Task Allocation in Persistent Tasks

The applicability of the swarm robots to perform foraging tasks is inspi...
research
11/12/2019

Pose estimation and bin picking for deformable products

Robotic systems in manufacturing applications commonly assume known obje...

Please sign up or login with your details

Forgot password? Click here to reset