Visual Word Ambiguity
Top Cited Papers
- 26 June 2009
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 32 (7), 1271-1283
- https://doi.org/10.1109/tpami.2009.132
Abstract
This paper studies automatic image classification by modeling soft assignment in the popular codebook model. The codebook model describes an image as a bag of discrete visual words selected from a vocabulary, where the frequency distributions of visual words in an image allow classification. One inherent component of the codebook model is the assignment of discrete visual words to continuous image features. Despite the clear mismatch of this hard assignment with the nature of continuous features, the approach has been successfully applied for some years. In this paper, we investigate four types of soft assignment of visual words to image features. We demonstrate that explicitly modeling visual word assignment ambiguity improves classification performance compared to the hard assignment of the traditional codebook model. The traditional codebook model is compared against our method for five well-known data sets: 15 natural scenes, Caltech-101, Caltech-256, and Pascal VOC 2007/2008. We demonstrate that large codebook vocabulary sizes completely deteriorate the performance of the traditional model, whereas the proposed model performs consistently. Moreover, we show that our method profits in high-dimensional feature spaces and reaps higher benefits when increasing the number of image categories.Keywords
This publication has 33 references indexed in Scilit:
- Learning class-specific affinities for image labellingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- Multilevel Image Coding with HyperfeaturesInternational Journal of Computer Vision, 2007
- Towards optimal bag-of-features for object categorization and semantic video retrievalPublished by Association for Computing Machinery (ACM) ,2007
- Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categoriesComputer Vision and Image Understanding, 2007
- Robust Scene Categorization by Learning Image Statistics in ContextPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene CategoriesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- Latent Mixture Vocabularies for Object CategorizationPublished by British Machine Vision Association and Society for Pattern Recognition ,2006
- Creating efficient codebooks for visual recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Distinctive Image Features from Scale-Invariant KeypointsInternational Journal of Computer Vision, 2004
- A unifying view of image similarityPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002