Pretraining Image Encoders without Reconstruction via Feature Prediction Loss

03/16/2020
by   Gustav Grund Pihlgren, et al.
0

This work investigates three different loss functions for autoencoder-based pretraining of image encoders: The commonly used reconstruction loss, the more recently introduced perceptual similarity loss, and a feature prediction loss proposed here; the latter turning out to be the most efficient choice. Former work shows that predictions based on embeddings generated by image autoencoders can be improved by training with perceptual loss. So far the autoencoders trained with perceptual loss networks implemented an explicit comparison of the original and reconstructed images using the loss network. However, given such a loss network we show that there is no need for the timeconsuming task of decoding the entire image. Instead, we propose to decode the features of the loss network, hence the name "feature prediction loss". To evaluate this method we compare six different procedures for training image encoders based on pixel-wise, perceptual similarity, and feature prediction loss. The embedding-based prediction results show that encoders trained with feature prediction loss is as good or better than those trained with the other two losses. Additionally, the encoder is significantly faster to train using feature prediction loss in comparison to the other losses. The method implementation used in this work is available online: https://github.com/guspih/Perceptual-Autoencoders

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/10/2020

Improving Image Autoencoder Embeddings with Perceptual Loss

Autoencoders are commonly trained using element-wise loss. However, elem...
research
04/25/2016

Context Encoders: Feature Learning by Inpainting

We present an unsupervised visual feature learning algorithm driven by c...
research
02/08/2016

Generating Images with Perceptual Similarity Metrics based on Deep Networks

Image-generating machine learning models are typically trained with loss...
research
07/05/2018

Improving Unsupervised Defect Segmentation by Applying Structural Similarity to Autoencoders

Convolutional autoencoders have emerged as popular models for unsupervis...
research
05/05/2021

Perceptual Gradient Networks

Many applications of deep learning for image generation use perceptual l...
research
12/03/2021

Face Reconstruction with Variational Autoencoder and Face Masks

Variational AutoEncoders (VAE) employ deep learning models to learn a co...
research
11/12/2021

Contrastive Feature Loss for Image Prediction

Training supervised image synthesis models requires a critic to compare ...

Please sign up or login with your details

Forgot password? Click here to reset