End-to-end Binary Representation Learning via Direct Binary Embedding

03/15/2017
by   Liu Liu, et al.
0

Learning binary representation is essential to large-scale computer vision tasks. Most existing algorithms require a separate quantization constraint to learn effective hashing functions. In this work, we present Direct Binary Embedding (DBE), a simple yet very effective algorithm to learn binary representation in an end-to-end fashion. By appending an ingeniously designed DBE layer to the deep convolutional neural network (DCNN), DBE learns binary code directly from the continuous DBE layer activation without quantization error. By employing the deep residual network (ResNet) as DCNN component, DBE captures rich semantics from images. Furthermore, in the effort of handling multilabel images, we design a joint cross entropy loss that includes both softmax cross entropy and weighted binary cross entropy in consideration of the correlation and independence of labels, respectively. Extensive experiments demonstrate the significant superiority of DBE over state-of-the-art methods on tasks of natural object recognition, image retrieval and image annotation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/04/2018

Discriminative Cross-View Binary Representation Learning

Learning compact representation is vital and challenging for large scale...
research
09/04/2018

Deep Priority Hashing

Deep hashing enables image retrieval by end-to-end learning of deep repr...
research
08/09/2017

SUBIC: A supervised, structured binary code for image search

For large-scale visual search, highly compressed yet meaningful represen...
research
12/16/2016

Deep Residual Hashing

Hashing aims at generating highly compact similarity preserving code wor...
research
03/24/2022

Steganalysis of Image with Adaptively Parametric Activation

Steganalysis as a method to detect whether image contains se-cret messag...
research
03/08/2018

Learning Effective Binary Visual Representations with Deep Networks

Although traditionally binary visual representations are mainly designed...
research
05/01/2023

SafeWebUH at SemEval-2023 Task 11: Learning Annotator Disagreement in Derogatory Text: Comparison of Direct Training vs Aggregation

Subjectivity and difference of opinion are key social phenomena, and it ...

Please sign up or login with your details

Forgot password? Click here to reset