Learning to detect objects in images via a sparse, part-based representation
Top Cited Papers
- 20 September 2004
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 26 (11), 1475-1490
- https://doi.org/10.1109/tpami.2004.108
Abstract
We study the problem of detecting objects in still, gray-scale images. Our primary focus is the development of a learning-based approach to the problem that makes use of a sparse, part-based representation. A vocabulary of distinctive object parts is automatically constructed from a set of sample images of the object class of interest; images are then represented using parts from this vocabulary, together with spatial relations observed among the parts. Based on this representation, a learning algorithm is used to automatically learn to detect instances of the object class in new images. The approach can be applied to any object with distinguishable parts in a relatively fixed spatial configuration; it is evaluated here on difficult sets of real-world images containing side views of cars, and is seen to successfully detect objects in varying conditions amidst background clutter and mild occlusion. In evaluating object detection approaches, several important methodological issues arise that have not been satisfactorily addressed in the previous work. A secondary focus of this paper is to highlight these issues, and to develop rigorous evaluation standards for the object detection problem. A critical evaluation of our approach under the proposed standards is presented.Keywords
This publication has 19 references indexed in Scilit:
- Visual features of intermediate complexity and their use in classificationNature Neuroscience, 2002
- Learning to Recognize Three-Dimensional ObjectsNeural Computation, 2002
- Example-based object detection in images by componentsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2001
- A Computational Model for Visual SelectionNeural Computation, 1999
- Neural network-based face detectionIEEE Transactions on Pattern Analysis and Machine Intelligence, 1998
- Local grayvalue invariants for image retrievalIEEE Transactions on Pattern Analysis and Machine Intelligence, 1997
- Visual Object RecognitionAnnual Review of Neuroscience, 1996
- Recognition of Objects and Their Component Parts: Responses of Single Units in the Temporal Cortex of the MacaqueCerebral Cortex, 1994
- Eigenfaces for RecognitionJournal of Cognitive Neuroscience, 1991
- Hierarchical structure in perceptual representationCognitive Psychology, 1977