Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimising Global Loss Functions

12/31/2015
by Vijay Kumar B G, et al.

Recent innovations in training deep convolutional neural network (ConvNet) models have motivated the design of new methods to automatically learn local image descriptors. The latest deep ConvNets proposed for this task consist of a siamese network that is trained by penalising misclassification of pairs of local image patches. Current results from machine learning show that replacing the siamese network with a triplet network can improve classification accuracy in several problems, but this has yet to be demonstrated for local image descriptor learning. Moreover, current siamese and triplet networks have been trained with stochastic gradient descent that computes the gradient from individual pairs or triplets of local image patches, which can make them prone to overfitting. In this paper, we first propose the use of triplet networks for the problem of local image descriptor learning. We also propose a global loss that minimises the overall classification error in the training set, which can improve the generalisation capability of the model. On the UBC benchmark dataset for comparing local image descriptors, we show that the triplet network produces a more accurate embedding than the siamese network, as measured by error on that dataset. We further demonstrate that training this triplet network with a combination of the triplet and global losses produces the best embedding in the field. Finally, we show that the central-surround siamese network trained with the global loss produces the best result in the field on the UBC dataset. Pre-trained models are available online at https://github.com/vijaykbg/deep-patchmatch
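
The abstract contrasts a loss computed from individual triplets with a global loss computed over the training set, but gives no formulas. The following is a minimal sketch of that contrast, not the authors' implementation. It assumes Euclidean distances between descriptors, a standard hinge-style triplet margin loss, and a batch-level loss that penalises the variances of the matching and non-matching distance distributions plus a hinge on the gap between their means. The function names, the `margin` and `weight` parameters, and the toy values are all hypothetical.

```python
import math
from statistics import fmean, pvariance


def l2(a, b):
    """Euclidean distance between two descriptor vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_margin_loss(anchor, positive, negative, margin=1.0):
    """Per-triplet hinge: the matching distance should be smaller than the
    non-matching distance by at least `margin` (hypothetical value)."""
    return max(0.0, l2(anchor, positive) - l2(anchor, negative) + margin)


def global_loss(pos_dists, neg_dists, margin=1.0, weight=1.0):
    """Batch-level loss (assumed form): shrink the spread of both distance
    distributions and push their means at least `margin` apart."""
    spread = pvariance(pos_dists) + pvariance(neg_dists)
    mean_gap = max(0.0, fmean(pos_dists) - fmean(neg_dists) + margin)
    return spread + weight * mean_gap


# Toy descriptors: the positive is close to the anchor, the negative is far.
anchor = [0.0, 0.0]
positive = [0.0, 1.0]
negative = [3.0, 4.0]
per_triplet = triplet_margin_loss(anchor, positive, negative)

# Toy distance samples for matching and non-matching pairs in one batch.
pos = [1.0, 1.0, 2.0, 2.0]
neg = [5.0, 5.0, 6.0, 6.0]
batch = global_loss(pos, neg)
```

In this toy case the per-triplet loss is already zero because the margin is satisfied, while the batch-level term still penalises the spread of each distance distribution. This illustrates why a loss computed over many samples can provide a training signal that individual triplets do not.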

research
06/08/2021

SDGMNet: Statistic-based Dynamic Gradient Modulation for Local Descriptor Learning

Modifications on triplet loss that rescale the back-propagated gradients...
research
11/17/2017

Learning Discriminative Affine Regions via Discriminability

We present an accurate method for estimation of the affine shape of loca...
research
11/11/2019

PoshakNet: Framework for matching dresses from real-life photos using GAN and Siamese Network

Online garment shopping has gained many customers in recent years. Descr...
research
12/19/2014

Fracking Deep Convolutional Image Descriptors

In this paper we propose a novel framework for learning local image desc...
research
05/13/2017

Revisiting IM2GPS in the Deep Learning Era

Image geolocalization, inferring the geographic location of an image, is...
research
02/04/2019

A Two-Stream Siamese Neural Network for Vehicle Re-Identification by Using Non-Overlapping Cameras

We describe in this paper a novel Two-Stream Siamese Neural Network for ...
research
02/14/2020

Spectrum Translation for Cross-Spectral Ocular Matching

Cross-spectral verification remains a big issue in biometrics, especiall...
