Learning bayesian classifiers for scene classification with a visual grammar

Abstract

A challenging problem in image content extraction and classification is building a system that automatically learns high-level semantic interpretations of images. We describe a Bayesian framework for a visual grammar that aims to reduce the gap between low-level features and high-level user semantics. Our approach includes modeling image pixels using automatic fusion of their spectral, textural, and other ancillary attributes; segmentation of image regions using an iterative split-and-merge algorithm; and representing scenes by decomposing them into prototype regions and modeling the interactions between these regions in terms of their spatial relationships. Naive Bayes classifiers are used in the learning of models for region segmentation and classification using positive and negative examples for user-defined semantic land cover labels. The system also automatically learns representative region groups that can distinguish different scenes and builds visual grammar models. Experiments using Landsat scenes show that the visual grammar enables creation of high-level classes that cannot be modeled by individual pixels or regions. Furthermore, learning of the classifiers requires only a few training examples.

Keywords

This publication has 12 references indexed in Scilit:

Probabilistic retrieval with a visual grammar
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
SCENE MODELING AND IMAGE MINING WITH A VISUAL GRAMMAR
Published by World Scientific Pub Co Pte Ltd ,2003
Interactive learning and probabilistic retrieval in remote sensing image archives
IEEE Transactions on Geoscience and Remote Sensing, 2000
Rotation-invariant texture classification using a complete space-frequency model
IEEE Transactions on Image Processing, 1999
Knowledge-based image retrieval with spatial and temporal constructs
IEEE Transactions on Knowledge and Data Engineering, 1998
Laplace's law of succession and universal encoding
IEEE Transactions on Information Theory, 1998
A characterization of the Dirichlet distribution through global and local parameter independence
The Annals of Statistics, 1997
VisualSEEk
Published by Association for Computing Machinery (ACM) ,1996
Textural Features for Image Classification
IEEE Transactions on Systems, Man, and Cybernetics, 1973
Parsing of Graph-Representable Pictures
Journal of the ACM, 1970

Cited by 119 articles