A model for size- and rotation-invariant pattern processing in the visual system

1 November 1984

journal article
research article
Published by Springer Nature in Biological Cybernetics

Vol. 51 (2), 113-121
https://doi.org/10.1007/bf00357924

Abstract

The mapping of retinal space onto the striate cortex of some mammals can be approximated by a log-polar function. It has been proposed that this mapping is of functional importance for scale- and rotation-invariant pattern recognition in the visual system. An exact log-polar transform converts centered scaling and rotation into translations. A subsequent translation-invariant transform, such as the absolute value of the Fourier transform, thus generates overall size- and rotation-invariance. In our model, the translation-invariance is realized via the R-transform. This transform can be executed by simple neural networks, and it does not require the complex computations of the Fourier transform, which is used in Mellin-transform size-invariance models. The logarithmic space distortion and differentiation in the first processing stage of the model are realized via “Mexican hat” filters whose diameter increases linearly with eccentricity, similar to the characteristics of the receptive fields of retinal ganglion cells. Except for some special cases, the model can explain object recognition independent of size, orientation and position. Some general problems of Mellin-type size-invariance models, which also apply to our model, are discussed.
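
The two-stage idea in the abstract can be illustrated with a minimal sketch. It is not the model from the article: the sample points, the scale factor `s`, the angle `phi`, and the helper names `log_polar` and `scale_and_rotate` are all assumptions chosen for illustration. The second stage uses the Fourier magnitude, which the abstract names as one example of a translation-invariant transform, rather than the R-transform used in the model.

```python
import numpy as np

def log_polar(points):
    """Map Cartesian points (x, y) to log-polar coordinates (log r, theta)."""
    x, y = points[:, 0], points[:, 1]
    return np.column_stack([np.log(np.hypot(x, y)), np.arctan2(y, x)])

def scale_and_rotate(points, s, phi):
    """Scale by s and rotate by phi (radians) about the origin."""
    c, sn = np.cos(phi), np.sin(phi)
    rotation = np.array([[c, -sn], [sn, c]])
    return s * points @ rotation.T

# Stage 1: centered scaling and rotation become a translation in log-polar space.
# The points are hypothetical; angles are chosen so theta does not wrap past pi.
points = np.array([[1.0, 0.5], [2.0, 1.0], [0.5, 1.5]])
s, phi = 2.0, 0.3
before = log_polar(points)
after = log_polar(scale_and_rotate(points, s, phi))
shift = after - before  # every row is approximately (log s, phi)

# Stage 2: a translation-invariant transform removes the shift.
# The Fourier magnitude is unchanged by a cyclic shift of the input.
signal = np.array([0.0, 1.0, 3.0, 2.0, 0.0, 0.0, 0.0, 0.0])
shifted = np.roll(signal, 3)
mag_a = np.abs(np.fft.fft(signal))
mag_b = np.abs(np.fft.fft(shifted))

print(np.allclose(shift, [np.log(s), phi]))
print(np.allclose(mag_a, mag_b))
```

Every point is displaced by the same vector (log s, φ) in log-polar coordinates, which is why a transform that ignores translation yields a representation independent of size and orientation. The scaling and rotation must be centered on the origin of the mapping, as the abstract states.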

