A Computational Model for Visual Selection
- 1 October 1999
- journal article
- Published by MIT Press in Neural Computation
- Vol. 11 (7), 1691-1715
- https://doi.org/10.1162/089976699300016197
Abstract
We propose a computational model for detecting and localizing instances from an object class in static gray-level images. We divide detection into visual selection and final classification, concentrating on the former: drastically reducing the number of candidate regions that require further, usually more intensive, processing, but with a minimum of computation and missed detections. Bottom-up processing is based on local groupings of edge fragments constrained by loose geometrical relationships. They have no a priori semantic or geometric interpretation. The role of training is to select special groupings that are moderately likely at certain places on the object but rate in the background. We show that the statistics in both populations are stable. The candidate regions are those that contain global arrangements of several local groupings. Whereas our model was not conceived to explain brain functions, it does cohere with evidence about the functions of neurons in V1 and V2, such as responses to coarse or incomplete patterns (e.g., illusory contours) and to scale and translation invariance in IT. Finally, the algorithm is applied to face and symbol detection.Keywords
This publication has 12 references indexed in Scilit:
- Neural network-based face detectionIEEE Transactions on Pattern Analysis and Machine Intelligence, 1998
- Example-based learning for view-based human face detectionIEEE Transactions on Pattern Analysis and Machine Intelligence, 1998
- Shape Quantization and Recognition with Randomized TreesNeural Computation, 1997
- Inferior Temporal Mechanisms for Invariant Object RecognitionCerebral Cortex, 1994
- Macaque VI neurons can signal ‘illusory’ contoursNature, 1993
- Columns for visual features of objects in monkey inferotemporal cortexNature, 1992
- Psychophysical support for a two-dimensional view interpolation theory of object recognition.Proceedings of the National Academy of Sciences, 1992
- Human image understanding: Recent research and a theoryComputer Vision, Graphics, and Image Processing, 1985
- Ferrier lecture - Functional architecture of macaque monkey visual cortexProceedings of the Royal Society of London. B. Biological Sciences, 1977
- The effects of contextual scenes on the identification of objectsMemory & Cognition, 1975