Modeling scenes with local descriptors and latent aspects

Top Cited Papers

1 January 2005

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 1 (15505499), 883-890 Vol. 1
https://doi.org/10.1109/iccv.2005.152

Abstract

We present a new approach to model visual scenes in image collections, based on local invariant features and probabilistic latent space models. Our formulation provides answers to three open questions:(l) whether the invariant local features are suitable for scene (rather than object) classification; (2) whether unsupennsed latent space models can be used for feature extraction in the classification task; and (3) whether the latent space formulation can discover visual co-occurrence patterns, motivating novel approaches for image organization and segmentation. Using a 9500-image dataset, our approach is validated on each of these issues. First, we show with extensive experiments on binary and multi-class scene classification tasks, that a bag-of-visterm representation, derived from local invariant descriptors, consistently outperforms state-of-the-art approaches. Second, we show that probabilistic latent semantic analysis (PLSA) generates a compact scene representation, discriminative for accurate classification, and significantly more robust when less training data are available. Third, we have exploited the ability of PLSA to automatically extract visually meaningful aspects, to propose new algorithms for aspect-based image ranking and context-sensitive image segmentation.

Keywords

This publication has 13 references indexed in Scilit:

Scale & Affine Invariant Interest Point Detectors
International Journal of Computer Vision, 2004
Video Google: a text retrieval approach to object matching in videos
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Selection of scale-invariant parts for object class recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Context-based vision system for place and object recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Indoor-outdoor image classification
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Image classification for content-based indexing
IEEE Transactions on Image Processing, 2001
Unsupervised Learning by Probabilistic Latent Semantic Analysis
Machine Learning, 2001
Content-based image retrieval at the end of the early years
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2000
10.1162/jmlr.2003.3.4-5.993
Applied Physics Letters, 2000
10.1162/153244303322533214
Applied Physics Letters, 2000

Cited by 231 articles