Visual Word Ambiguity

Top Cited Papers

26 June 2009

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 32 (7), 1271-1283
https://doi.org/10.1109/tpami.2009.132

Abstract

This paper studies automatic image classification by modeling soft assignment in the popular codebook model. The codebook model describes an image as a bag of discrete visual words selected from a vocabulary, where the frequency distributions of visual words in an image allow classification. One inherent component of the codebook model is the assignment of discrete visual words to continuous image features. Despite the clear mismatch of this hard assignment with the nature of continuous features, the approach has been successfully applied for some years. In this paper, we investigate four types of soft assignment of visual words to image features. We demonstrate that explicitly modeling visual word assignment ambiguity improves classification performance compared to the hard assignment of the traditional codebook model. The traditional codebook model is compared against our method for five well-known data sets: 15 natural scenes, Caltech-101, Caltech-256, and Pascal VOC 2007/2008. We demonstrate that large codebook vocabulary sizes completely deteriorate the performance of the traditional model, whereas the proposed model performs consistently. Moreover, we show that our method profits in high-dimensional feature spaces and reaps higher benefits when increasing the number of image categories.

Keywords

This publication has 33 references indexed in Scilit:

Learning class-specific affinities for image labelling
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2008
Multilevel Image Coding with Hyperfeatures
International Journal of Computer Vision, 2007
Towards optimal bag-of-features for object categorization and semantic video retrieval
Published by Association for Computing Machinery (ACM) ,2007
Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories
Computer Vision and Image Understanding, 2007
Robust Scene Categorization by Learning Image Statistics in Context
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2006
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2006
Latent Mixture Vocabularies for Object Categorization
Published by British Machine Vision Association and Society for Pattern Recognition ,2006
Creating efficient codebooks for visual recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision, 2004
A unifying view of image similarity
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002

Cited by 598 articles