The effects of segmentation and feature choice in a translation model of object recognition
- 22 December 2003
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2, 675-684
- https://doi.org/10.1109/cvpr.2003.1211532
Abstract
We work with a model of object recognition where words must be placed on image regions. This approach means that large scale experiments are relatively easy, so we can evaluate the effects of various early and mid- level vision algorithms on recognition performance. We evaluate various image segmentation algorithms by determining word prediction accuracy for images segmented in various ways and represented by various features. We take the view that good segmentations respect object boundaries, and so word prediction should be better for a better segmentation. However, it is usually very difficult in practice to obtain segmentations that do not break up objects, so most practitioners attempt to merge segments to get better putative object representations. We demonstrate that our paradigm of word prediction easily allows us to predict potentially useful segment merges, even for segments that do not look similar (for example, merging the black and whiteKeywords
This publication has 12 references indexed in Scilit:
- Clustering artPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Combining textual and visual cues for content-based image retrieval on the World Wide WebPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Color- and texture-based image segmentation using EM and its application to content-based image retrievalPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Learning the semantics of words and picturesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statisticsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Blobworld: image segmentation using expectation-maximization and its application to image queryingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Mean shift: a robust approach toward feature space analysisPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Normalized cuts and image segmentationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2000
- Interpreting image databases by region classificationPattern Recognition, 1997
- A framework for performance characterization of intermediate-level grouping modulesIEEE Transactions on Pattern Analysis and Machine Intelligence, 1997