A model for size- and rotation-invariant pattern processing in the visual system
- 1 November 1984
- journal article
- research article
- Published by Springer Nature in Biological Cybernetics
- Vol. 51 (2), 113-121
- https://doi.org/10.1007/bf00357924
Abstract
The mapping of retinal space onto the striate cortex of some mammals can be approximated by a log-polar function. It has been proposed that this mapping is of functional importance for scale-and rotation-invariant pattern recognition in the visual system. An exact log-polar transform converts centered scaling and rotation into translations. A subsequent translation-invariant transform, such as the absolute value of the Fourier transform, thus generates overall size-and rotation-invariance. In our model, the translation-invariance is realized via the R-transform. This transform can be executed by simple neural networks, and it does not require the complex computations of the Fourier transform, used in Mellin-transform size-invariance models. The logarithmic space distortion and differentiation in the first processing stage of the model is realized via “Mexican hat” filters whose diameter increases linearly with eccentricity, similar to the characteristics of the receptive fields of retinal ganglion cells. Except for some special cases, the model can explain object recognition independent of size, orientation and position. Some general problems of Mellin-type size-invariance models-that also apply to our model-are discussed.Keywords
This publication has 36 references indexed in Scilit:
- Cortical Anatomy, Size Invariance, and Spatial Frequency AnalysisPerception, 1981
- Size Invariance: Reply to SchwartzPerception, 1981
- On invariant sets of a certain class of fast translation-invariant transformsIEEE Transactions on Acoustics, Speech, and Signal Processing, 1980
- Size and Position Invariance in the Visual SystemPerception, 1978
- The Fourier–Mellin transform and mammalian hearingThe Journal of the Acoustical Society of America, 1978
- Spatial mapping in the primate sensory projection: Analytic structure and relevance to perceptionBiological Cybernetics, 1977
- A class of translation invariant transformsIEEE Transactions on Acoustics, Speech, and Signal Processing, 1977
- Visual transformation of size.Journal of Experimental Psychology: Human Perception and Performance, 1975
- Visual transformation of size.Journal of Experimental Psychology: Human Perception and Performance, 1975
- Spatial-Frequency Discrimination in Human VisionJournal of the Optical Society of America, 1970