Point-to-set distance functions for weakly supervised segmentation

07/27/2020
by   Bas Peters, et al.
0

When pixel-level masks or partial annotations are not available for training neural networks for semantic segmentation, it is possible to use higher-level information in the form of bounding boxes, or image tags. In the imaging sciences, many applications do not have an object-background structure and bounding boxes are not available. Any available annotation typically comes from ground truth or domain experts. A direct way to train without masks is using prior knowledge on the size of objects/classes in the segmentation. We present a new algorithm to include such information via constraints on the network output, implemented via projection-based point-to-set distance functions. This type of distance functions always has the same functional form of the derivative, and avoids the need to adapt penalty functions to different constraints, as well as issues related to constraining properties typically associated with non-differentiable functions. Whereas object size information is known to enable object segmentation from bounding boxes from datasets with many general and medical images, we show that the applications extend to the imaging sciences where data represents indirect measurements, even in the case of single examples. We illustrate the capabilities in case of a) one or more classes do not have any annotation; b) there is no annotation at all; c) there are bounding boxes. We use data for hyperspectral time-lapse imaging, object segmentation in corrupted images, and sub-surface aquifer mapping from airborne-geophysical remote-sensing data. The examples verify that the developed methodology alleviates difficulties with annotating non-visual imagery for a range of experimental settings.

READ FULL TEXT

page 6

page 7

page 8

research
08/06/2021

Medical image segmentation with imperfect 3D bounding boxes

The development of high quality medical image segmentation algorithms de...
research
04/08/2023

Polygonizer: An auto-regressive building delineator

In geospatial planning, it is often essential to represent objects in a ...
research
09/10/2018

The AAU Multimodal Annotation Toolboxes: Annotating Objects in Images and Videos

This tech report gives an introduction to two annotation toolboxes that ...
research
09/21/2023

NeuralLabeling: A versatile toolset for labeling vision datasets using Neural Radiance Fields

We present NeuralLabeling, a labeling approach and toolset for annotatin...
research
07/14/2020

Deep Heterogeneous Autoencoder for Subspace Clustering of Sequential Data

We propose an unsupervised learning approach using a convolutional and f...
research
03/26/2023

CRRS: Concentric Rectangles Regression Strategy for Multi-point Representation on Fisheye Images

Modern object detectors take advantage of rectangular bounding boxes as ...
research
04/21/2020

A CNN Framenwork Based on Line Annotations for Detecting Nematodes in Microscopic Images

Plant parasitic nematodes cause damage to crop plants on a global scale....

Please sign up or login with your details

Forgot password? Click here to reset