Generative Model-driven Structure Aligning Discriminative Embeddings for Transductive Zero-shot Learning

by Omkar Gune, et al.

Zero-shot learning (ZSL) is a transfer learning technique that aims to transfer knowledge from seen classes to unseen classes. This transfer is possible because of an underlying semantic space that is common to seen and unseen classes. Most existing approaches use labelled seen-class data to learn a projection function that maps visual data to semantic data. In this work, we propose a shallow but effective neural network-based model for learning such a projection function, which aligns the visual and semantic data in the latent space while simultaneously making the latent-space embeddings discriminative. Because this projection function is learned from seen-class data, the so-called projection domain shift arises when it is applied to unseen classes. We propose a transductive approach to reduce the effect of this domain shift, in which we use unlabelled visual data from unseen classes to generate corresponding semantic features for the unseen-class visual samples. These semantic features are initially generated using a conditional variational auto-encoder and are then used along with the seen-class data to improve the projection function. We experiment in both the inductive and transductive settings of ZSL and generalized ZSL and show superior performance on the standard benchmark datasets AWA1, AWA2, CUB, SUN, FLO, and APY. We also show the efficacy of our model for ZSL on different datasets when labelled data is extremely scarce.
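To make the idea of a visual-to-semantic projection concrete, the following is a minimal sketch of the generic baseline the abstract attributes to most existing approaches. It is not the model proposed in this paper: it substitutes a ridge-regression linear map for the neural network, omits the conditional variational auto-encoder and the transductive step entirely, and runs on synthetic data. All names (`fit_projection`, `nearest_prototype`, `prototypes`, `mixing`) and all sizes are assumptions chosen for illustration.

```python
import numpy as np

def fit_projection(X_seen, S_seen, reg=1.0):
    """Ridge regression: learn W so that X_seen @ W approximates S_seen.

    X_seen: (n, d) visual features; S_seen: (n, k) semantic vectors per sample.
    """
    d = X_seen.shape[1]
    gram = X_seen.T @ X_seen + reg * np.eye(d)
    return np.linalg.solve(gram, X_seen.T @ S_seen)

def nearest_prototype(X, W, prototypes):
    """Project samples to semantic space and pick the most cosine-similar prototype."""
    proj = X @ W
    proj = proj / (np.linalg.norm(proj, axis=1, keepdims=True) + 1e-12)
    protos = prototypes / (np.linalg.norm(prototypes, axis=1, keepdims=True) + 1e-12)
    return np.argmax(proj @ protos.T, axis=1)

# Synthetic data (assumed, for illustration only): every class has a semantic
# prototype, and visual features are a noisy linear function of that prototype.
rng = np.random.default_rng(0)
n_seen, n_unseen, k, d, per_class = 20, 5, 8, 32, 30
prototypes = rng.normal(size=(n_seen + n_unseen, k))
mixing = rng.normal(size=(k, d))

def sample(class_ids):
    labels = np.repeat(class_ids, per_class)
    feats = prototypes[labels] @ mixing + 0.1 * rng.normal(size=(labels.size, d))
    return feats, labels

X_seen, y_seen = sample(np.arange(n_seen))
X_unseen, y_unseen = sample(np.arange(n_seen, n_seen + n_unseen))

# Train on seen classes only, then classify samples from classes never seen in training.
W = fit_projection(X_seen, prototypes[y_seen])
pred = nearest_prototype(X_unseen, W, prototypes[n_seen:]) + n_seen
acc = float(np.mean(pred == y_unseen))
print(f"unseen-class accuracy: {acc:.2f}")
```

In this toy setup a single linear map generates the features of every class, so a projection fitted on seen classes carries over to unseen ones. Real visual features offer no such guarantee, which is the projection domain shift that the transductive approach described above is meant to reduce.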




