Similarity Metric Learning for a Variable-Kernel Classifier
- 1 January 1995
- journal article
- research article
- Published by MIT Press in Neural Computation
- Vol. 7 (1), 72-85
- https://doi.org/10.1162/neco.1995.7.1.72
Abstract
Nearest-neighbor interpolation algorithms have many useful properties for applications to learning, but they often exhibit poor generalization. In this paper, it is shown that much better generalization can be obtained by using a variable interpolation kernel in combination with conjugate gradient optimization of the similarity metric and kernel size. The resulting method is called variable-kernel similarity metric (VSM) learning. It has been tested on several standard classification data sets, and on these problems it shows better generalization than backpropagation and most other learning methods. The number of parameters that must be determined through optimization are orders of magnitude less than for backpropagation or radial basis function (RBF) networks, which may indicate that the method better captures the essential degrees of variation in learning. Other features of VSM learning are discussed that make it relevant to models for biological learning in the brain.This publication has 12 references indexed in Scilit:
- Local Learning AlgorithmsNeural Computation, 1992
- Refinements to nearest-neighbor searching ink-dimensional treesAlgorithmica, 1991
- Constructing a generalizer superior to NETtalk via a mathematical theory of generalizationNeural Networks, 1990
- Fast Learning in Networks of Locally-Tuned Processing UnitsNeural Computation, 1989
- LEARNING ARM KINEMATICS AND DYNAMICSAnnual Review of Neuroscience, 1989
- Rules and exemplars in categorization, identification, and recognition.Journal of Experimental Psychology: Learning, Memory, and Cognition, 1989
- Locally Weighted Regression: An Approach to Regression Analysis by Local FittingJournal of the American Statistical Association, 1988
- Analysis of hidden units in a layered network trained to classify sonar targetsNeural Networks, 1988
- An Algorithm for Finding Best Matches in Logarithmic Expected TimeACM Transactions on Mathematical Software, 1977
- Nearest neighbor pattern classificationIEEE Transactions on Information Theory, 1967