Learning bayesian classifiers for scene classification with a visual grammar
- 22 February 2005
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Geoscience and Remote Sensing
- Vol. 43 (3), 581-589
- https://doi.org/10.1109/tgrs.2004.839547
Abstract
A challenging problem in image content extraction and classification is building a system that automatically learns high-level semantic interpretations of images. We describe a Bayesian framework for a visual grammar that aims to reduce the gap between low-level features and high-level user semantics. Our approach includes modeling image pixels using automatic fusion of their spectral, textural, and other ancillary attributes; segmentation of image regions using an iterative split-and-merge algorithm; and representing scenes by decomposing them into prototype regions and modeling the interactions between these regions in terms of their spatial relationships. Naive Bayes classifiers are used in the learning of models for region segmentation and classification using positive and negative examples for user-defined semantic land cover labels. The system also automatically learns representative region groups that can distinguish different scenes and builds visual grammar models. Experiments using Landsat scenes show that the visual grammar enables creation of high-level classes that cannot be modeled by individual pixels or regions. Furthermore, learning of the classifiers requires only a few training examples.Keywords
This publication has 12 references indexed in Scilit:
- Probabilistic retrieval with a visual grammarPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- SCENE MODELING AND IMAGE MINING WITH A VISUAL GRAMMARPublished by World Scientific Pub Co Pte Ltd ,2003
- Interactive learning and probabilistic retrieval in remote sensing image archivesIEEE Transactions on Geoscience and Remote Sensing, 2000
- Rotation-invariant texture classification using a complete space-frequency modelIEEE Transactions on Image Processing, 1999
- Knowledge-based image retrieval with spatial and temporal constructsIEEE Transactions on Knowledge and Data Engineering, 1998
- Laplace's law of succession and universal encodingIEEE Transactions on Information Theory, 1998
- A characterization of the Dirichlet distribution through global and local parameter independenceThe Annals of Statistics, 1997
- VisualSEEkPublished by Association for Computing Machinery (ACM) ,1996
- Textural Features for Image ClassificationIEEE Transactions on Systems, Man, and Cybernetics, 1973
- Parsing of Graph-Representable PicturesJournal of the ACM, 1970