Equivariance versus Augmentation for Spherical Images

02/08/2022
by   Jan E. Gerken, et al.
4

We analyze the role of rotational equivariance in convolutional neural networks (CNNs) applied to spherical images. We compare the performance of the group equivariant networks known as S2CNNs and standard non-equivariant CNNs trained with an increasing amount of data augmentation. The chosen architectures can be considered baseline references for the respective design paradigms. Our models are trained and evaluated on single or multiple items from the MNIST or FashionMNIST dataset projected onto the sphere. For the task of image classification, which is inherently rotationally invariant, we find that by considerably increasing the amount of data augmentation and the size of the networks, it is possible for the standard CNNs to reach at least the same performance as the equivariant network. In contrast, for the inherently equivariant task of semantic segmentation, the non-equivariant networks are consistently outperformed by the equivariant networks with significantly fewer parameters. We also analyze and compare the inference latency and training times of the different networks, enabling detailed tradeoff considerations between equivariant architectures and data augmentation for practical problems. The equivariant spherical networks used in the experiments will be made available at https://github.com/JanEGerken/sem_seg_s2cnn .

READ FULL TEXT

page 1

page 6

page 7

page 15

page 18

page 19

research
12/16/2019

Data augmentation approaches for improving animal audio classification

In this paper we present ensembles of classifiers for automated animal a...
research
01/09/2022

Invariance encoding in sliced-Wasserstein space for image classification with limited training data

Deep convolutional neural networks (CNNs) are broadly considered to be s...
research
01/11/2021

Spherical Transformer: Adapting Spherical Signal to ConvolutionalNetworks

Convolutional neural networks (CNNs) have been widely used in various vi...
research
12/16/2021

How to augment your ViTs? Consistency loss and StyleAug, a random style transfer augmentation

The Vision Transformer (ViT) architecture has recently achieved competit...
research
04/20/2022

SuperpixelGridCut, SuperpixelGridMean and SuperpixelGridMix Data Augmentation

A novel approach of data augmentation based on irregular superpixel deco...
research
08/28/2021

High performing ensemble of convolutional neural networks for insect pest image detection

Pest infestation is a major cause of crop damage and lost revenues world...
research
09/02/2020

Robust Object Classification Approach using Spherical Harmonics

In this paper, we present a robust spherical harmonics approach for the ...

Please sign up or login with your details

Forgot password? Click here to reset