Scale-space theory: a basic tool for analyzing structures at different scales
- 1 January 1994
- journal article
- research article
- Published by Taylor & Francis in Journal of Applied Statistics
- Vol. 21 (1-2), 225-270
- https://doi.org/10.1080/757582976
Abstract
An inherent property of objects in the world is that they only exist as meaningful entities over certain ranges of scale. If one aims to describe the structure of unknown real-world signals, then a multi-scale representation of data is of crucial importance. This paper gives a tutorial review of a special type of multi-scale representation—linear scale-space representation—which has been developed by the computer vision community to handle image structures at different scales in a consistent manner. The basic idea is to embed the original signal into a one-parameter family of gradually smoothed signals in which the fine-scale details are successively suppressed. Under rather general conditions on the type of computations that are to be performed at the first stages of visual processing, in what can be termed ‘the visual front-end’, it can be shown that the Gaussian kernel and its derivatives are singled out as the only possible smoothing kernels. The conditions that specify the Gaussian kernel are, basically, linearity and shift invariance, combined with different ways of formalizing the notion that structures at coarse scales should correspond to simplifications of corresponding structures at fine scales-they should not be accidental phenomena created by the smoothing method. Notably, several different ways of choosing scale-space axioms give rise to the same conclusion. The output from the scale-space representation can be used for a variety of early visual tasks; operations such as feature detection, feature classification and shape computation can be expressed directly in terms of (possibly non-linear) combinations of Gaussian derivatives at multiple scales. In this sense the scale-space representation canserve as a basis for early vision. During the last few decades, a number of other approaches to multiscale representations have been developed, which are more or less related to scale-space theory, notably the theories of pyramids, wavelets and multi grid methods.Despite their qualitative differences, the increasing propularity of each of these approaches indicates that the crucial notion of scale is increasingly appreciated by the computer.vision community and by researchers in other related fields. An interesting similarity to biological vision is that the scale-space operators closely resemble receptive field profiles registered in neurophysiological studies of the mam- malian retina and visual cortex.Keywords
This publication has 61 references indexed in Scilit:
- Active detection and classification of junctions by foveation with a head-eye system guided by the scale-space primal sketchLecture Notes in Computer Science, 1992
- Singularity theory and phantom edges in scale spaceIEEE Transactions on Pattern Analysis and Machine Intelligence, 1988
- Edge FocusingIEEE Transactions on Pattern Analysis and Machine Intelligence, 1987
- A Computational Approach to Edge DetectionIEEE Transactions on Pattern Analysis and Machine Intelligence, 1986
- Uniqueness of the Gaussian Kernel for Scale-Space FilteringIEEE Transactions on Pattern Analysis and Machine Intelligence, 1986
- Pyramidal Systems for Computer VisionPublished by Springer Nature ,1986
- A Representation for Shape Based on Peaks and Ridges in the Difference of Low-Pass TransformIEEE Transactions on Pattern Analysis and Machine Intelligence, 1984
- The Laplacian Pyramid as a Compact Image CodeIEEE Transactions on Communications, 1983
- Fast filter transform for image processingComputer Graphics and Image Processing, 1981
- A Practical Guide to SplinesPublished by Springer Nature ,1978