Decomposing a scene into geometric and semantically consistent regions

Top Cited Papers

1 September 2009

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 15505499,p. 1-8
https://doi.org/10.1109/iccv.2009.5459211

Abstract

High-level, or holistic, scene understanding involves reasoning about objects, regions, and the 3D relationships between them. This requires a representation above the level of pixels that can be endowed with high-level attributes such as class of object/region, its orientation, and (rough 3D) location within the scene. Towards this goal, we propose a region-based model which combines appearance and scene geometry to automatically decompose a scene into semantically meaningful regions. Our model is defined in terms of a unified energy function over scene appearance and structure. We show how this energy function can be learned from data and present an efficient inference technique that makes use of multiple over-segmentations of the image to propose moves in the energy-space. We show, experimentally, that our method achieves state-of-the-art performance on the tasks of both multi-class image segmentation and geometric reasoning. Finally, by understanding region classes and geometry, we show how our model can be used as the basis for 3D reconstruction of the scene.

Keywords

This publication has 12 references indexed in Scilit:

Robust higher order potentials for enforcing label consistency
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2008
Closing the loop in scene interpretation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2008
Multi-Class Segmentation with Relative Location Prior
International Journal of Computer Vision, 2008
LabelMe: A Database and Web-Based Tool for Image Annotation
International Journal of Computer Vision, 2007
Recovering Surface Layout from an Image
International Journal of Computer Vision, 2007
Using Multiple Segmentations to Discover Objects and their Extent in Image Collections
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2006
Histograms of Oriented Gradients for Human Detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
LOCUS: learning object classes with unsupervised segmentation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Robust Real-Time Face Detection
International Journal of Computer Vision, 2004
Multiple View Geometry in Computer Vision
Published by Cambridge University Press (CUP) ,2004

Cited by 416 articles