Robust Scene Categorization by Learning Image Statistics in Context
- 10 July 2006
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
We present a generic and robust approach for scene categorization. A complex scene is described by proto-concepts like vegetation, water, fire, sky etc. These proto-concepts are represented by low level features, where we use natural images statistics to compactly represent color invariant texture information by a Weibull distribution. We introduce the notion of contextures which preserve the context of textures in a visual scene with an occurrence histogram (context) of similarities to proto-concept descriptors (texture). In contrast to a codebook approach, we use the similarity to all vocabulary elements to generalize beyond the code words. Visual descriptors are attained by combining different types of contexts with different texture parameters. The visual scene descriptors are generalized to visual categories by training a support vector machine. We evaluate our approach on 3 different datasets: 1) 50 categories for the TRECVID video dataset; 2) the Caltech 101-object images; 3) 89 categories being the intersection of the Corel photo stock with the Art Explosion photo stock. Results show that our approach is robust over different datasets, while maintaining competitive performance.Keywords
This publication has 16 references indexed in Scilit:
- Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categoriesComputer Vision and Image Understanding, 2007
- Natural scene classification using overcomplete ICAPattern Recognition, 2005
- Object Recognition with Features Inspired by Visual CortexPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Modeling scenes with local descriptors and latent aspectsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Contextual Priming for Object DetectionInternational Journal of Computer Vision, 2003
- Indexing based on scale invariant interest pointsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2001
- Modeling the Shape of the Scene: A Holistic Representation of the Spatial EnvelopeInternational Journal of Computer Vision, 2001
- Image classification for content-based indexingIEEE Transactions on Image Processing, 2001
- ON IMAGE CLASSIFICATION: CITY IMAGES VS. LANDSCAPESPattern Recognition, 1998
- Finding Waldo, or focus of attention using local color informationIEEE Transactions on Pattern Analysis and Machine Intelligence, 1995