Learning to detect objects in images via a sparse, part-based representation

Top Cited Papers

20 September 2004

journal article
research article
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 26 (11), 1475-1490
https://doi.org/10.1109/tpami.2004.108

Abstract

We study the problem of detecting objects in still, gray-scale images. Our primary focus is the development of a learning-based approach to the problem that makes use of a sparse, part-based representation. A vocabulary of distinctive object parts is automatically constructed from a set of sample images of the object class of interest; images are then represented using parts from this vocabulary, together with spatial relations observed among the parts. Based on this representation, a learning algorithm is used to automatically learn to detect instances of the object class in new images. The approach can be applied to any object with distinguishable parts in a relatively fixed spatial configuration; it is evaluated here on difficult sets of real-world images containing side views of cars, and is seen to successfully detect objects in varying conditions amidst background clutter and mild occlusion. In evaluating object detection approaches, several important methodological issues arise that have not been satisfactorily addressed in the previous work. A secondary focus of this paper is to highlight these issues, and to develop rigorous evaluation standards for the object detection problem. A critical evaluation of our approach under the proposed standards is presented.

Keywords

This publication has 19 references indexed in Scilit:

Visual features of intermediate complexity and their use in classification
Nature Neuroscience, 2002
Learning to Recognize Three-Dimensional Objects
Neural Computation, 2002
Example-based object detection in images by components
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2001
A Computational Model for Visual Selection
Neural Computation, 1999
Neural network-based face detection
IEEE Transactions on Pattern Analysis and Machine Intelligence, 1998
Local grayvalue invariants for image retrieval
IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997
Visual Object Recognition
Annual Review of Neuroscience, 1996
Recognition of Objects and Their Component Parts: Responses of Single Units in the Temporal Cortex of the Macaque
Cerebral Cortex, 1994
Eigenfaces for Recognition
Journal of Cognitive Neuroscience, 1991
Hierarchical structure in perceptual representation
Cognitive Psychology, 1977

Cited by 627 articles