Image classification by visual bag-of-words refinement and reduction

01/18/2015
by   Zhiwu Lu, et al.
0

This paper presents a new framework for visual bag-of-words (BOW) refinement and reduction to overcome the drawbacks associated with the visual BOW model which has been widely used for image classification. Although very influential in the literature, the traditional visual BOW model has two distinct drawbacks. Firstly, for efficiency purposes, the visual vocabulary is commonly constructed by directly clustering the low-level visual feature vectors extracted from local keypoints, without considering the high-level semantics of images. That is, the visual BOW model still suffers from the semantic gap, and thus may lead to significant performance degradation in more challenging tasks (e.g. social image classification). Secondly, typically thousands of visual words are generated to obtain better performance on a relatively large image dataset. Due to such large vocabulary size, the subsequent image classification may take sheer amount of time. To overcome the first drawback, we develop a graph-based method for visual BOW refinement by exploiting the tags (easy to access although noisy) of social images. More notably, for efficient image classification, we further reduce the refined visual BOW model to a much smaller size through semantic spectral clustering. Extensive experimental results show the promising performance of the proposed framework for visual BOW refinement and reduction.

READ FULL TEXT

page 19

page 22

research
09/30/2018

Improving Bag-of-Visual-Words Towards Effective Facial Expressive Image Classification

Bag-of-Visual-Words (BoVW) approach has been widely used in the recent y...
research
03/16/2017

From visual words to a visual grammar: using language modelling for image classification

The Bag--of--Visual--Words (BoVW) is a visual description technique that...
research
09/18/2017

E^2BoWs: An End-to-End Bag-of-Words Model via Deep Convolutional Neural Network

Traditional Bag-of-visual Words (BoWs) model is commonly generated with ...
research
10/17/2022

Natural Scene Image Annotation Using Local Semantic Concepts and Spatial Bag of Visual Words

The use of bag of visual words (BOW) model for modelling images based on...
research
07/19/2022

Shrinking the Semantic Gap: Spatial Pooling of Local Moment Invariants for Copy-Move Forgery Detection

Copy-move forgery is a manipulation of copying and pasting specific patc...
research
12/14/2015

Semantic-enriched Visual Vocabulary Construction in a Weakly Supervised Context

One of the prevalent learning tasks involving images is content-based im...
research
09/27/2013

An Efficient Index for Visual Search in Appearance-based SLAM

Vector-quantization can be a computationally expensive step in visual ba...

Please sign up or login with your details

Forgot password? Click here to reset