Clustering art
- 25 August 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2, 434-441
- https://doi.org/10.1109/cvpr.2001.990994
Abstract
We extend a recently developed method (K. Barnard and D. Forsyth, 2001) for learning the semantics of image databases using text and pictures. We incorporate statistical natural language processing in order to deal with free text. We demonstrate the current system on a difficult dataset, namely 10000 images of work from the Fine Arts Museum of San Francisco. The images include line drawings, paintings, and pictures of sculpture and ceramics. Many of the images have associated free text which varies greatly from physical description to interpretation and mood. We use WordNet to provide semantic grouping information and to help disambiguate word senses, as well as emphasize the hierarchical nature of semantic relationships. This allows us to impose a natural structure on the image collection that reflects semantics to a considerable degree. Our method produces a joint probability distribution for words and picture elements. We demonstrate that this distribution can be used: (a) to provide illustrations for given captions, and (b) to generate words for images outside the training set. Results from this annotation process yield a quantitative study of our method. Finally, the annotation process can be seen as a form of object recognizer that has been learned through a partially supervised process.Keywords
This publication has 12 references indexed in Scilit:
- Combining textual and visual cues for content-based image retrieval on the World Wide WebPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Learning the semantics of words and picturesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Blobworld: image segmentation using expectation-maximization and its application to image queryingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Normalized cuts and image segmentationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2000
- Multimodal browsing of images in Web documentsPublished by SPIE-Intl Soc Optical Eng ,1999
- Analysis of User Need in Image ArchivesJournal of Information Science, 1997
- PROGRESS IN DOCUMENTATION PICTORIAL INFORMATION RETRIEVALJournal of Documentation, 1995
- Unsupervised word sense disambiguation rivaling supervised methodsPublished by Association for Computational Linguistics (ACL) ,1995
- A simple rule-based part of speech taggerPublished by Association for Computational Linguistics (ACL) ,1992
- Introduction to WordNet: An On-line Lexical Database*International Journal of Lexicography, 1990