A Computational Model for Visual Selection

Abstract
We propose a computational model for detecting and localizing instances from an object class in static gray-level images. We divide detection into visual selection and final classification, concentrating on the former: drastically reducing the number of candidate regions that require further, usually more intensive, processing, but with a minimum of computation and missed detections. Bottom-up processing is based on local groupings of edge fragments constrained by loose geometrical relationships; these groupings have no a priori semantic or geometric interpretation. The role of training is to select special groupings that are moderately likely at certain places on the object but rare in the background. We show that the statistics in both populations are stable. The candidate regions are those that contain global arrangements of several local groupings. Although our model was not conceived to explain brain functions, it does cohere with evidence about the functions of neurons in V1 and V2, such as responses to coarse or incomplete patterns (e.g., illusory contours), and with the scale and translation invariance of responses in IT. Finally, the algorithm is applied to face and symbol detection.
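The following is an illustrative sketch, not the authors' implementation: edge fragments are extracted by simple gradient thresholding, a "local grouping" is a pair of nearby fragments whose type records only coarse orientations and relative position (loose geometry), and candidate regions are windows containing several of the grouping types assumed to have been selected during training. All thresholds, window sizes, and the `selected_types` set are hypothetical choices for illustration.

```python
import numpy as np

def edge_fragments(image, grad_thresh=0.2):
    """Return (row, col, orientation-bin) triples for strong-gradient pixels."""
    gy, gx = np.gradient(image.astype(float))
    mag = np.hypot(gx, gy)
    ori = np.arctan2(gy, gx)                                # radians in (-pi, pi]
    bins = ((ori + np.pi) / (np.pi / 4)).astype(int) % 8    # 8 coarse orientations
    rows, cols = np.nonzero(mag > grad_thresh * mag.max())
    return list(zip(rows, cols, bins[rows, cols]))

def local_groupings(fragments, max_dist=6):
    """Pairs of nearby fragments; the grouping type is the two orientation
    bins plus a coarse relative-position code (no precise geometry)."""
    groupings = []
    for i, (r1, c1, b1) in enumerate(fragments):
        for r2, c2, b2 in fragments[i + 1:]:
            dr, dc = r2 - r1, c2 - c1
            if abs(dr) <= max_dist and abs(dc) <= max_dist:
                rel = (int(np.sign(dr)), int(np.sign(dc)))  # coarse relative position
                groupings.append(((b1, b2, rel), (r1, c1)))
    return groupings

def candidate_regions(groupings, selected_types, window=20, min_types=3):
    """Windows containing a global arrangement of several selected groupings."""
    hits = {}
    for gtype, (r, c) in groupings:
        if gtype in selected_types:
            cell = (r // window, c // window)
            hits.setdefault(cell, set()).add(gtype)
    return [cell for cell, types in hits.items() if len(types) >= min_types]
```

In a full system, `selected_types` would be estimated from training data as those grouping types that are moderately likely at certain locations on object examples but rare in background images; the regions returned here would then be passed to the more intensive final classifier.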
