Similarity Metric Learning for a Variable-Kernel Classifier

1 January 1995

journal article
research article
Published by MIT Press in Neural Computation

Vol. 7 (1), 72-85
https://doi.org/10.1162/neco.1995.7.1.72

Abstract

Nearest-neighbor interpolation algorithms have many useful properties for applications to learning, but they often exhibit poor generalization. In this paper, it is shown that much better generalization can be obtained by using a variable interpolation kernel in combination with conjugate gradient optimization of the similarity metric and kernel size. The resulting method is called variable-kernel similarity metric (VSM) learning. It has been tested on several standard classification data sets, and on these problems it shows better generalization than backpropagation and most other learning methods. The number of parameters that must be determined through optimization is orders of magnitude smaller than for backpropagation or radial basis function (RBF) networks, which may indicate that the method better captures the essential degrees of variation in learning. Other features of VSM learning are discussed that make it relevant to models for biological learning in the brain.
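
The abstract gives no formulas, so the following is only a minimal sketch of what a variable-kernel, metric-weighted nearest-neighbor classifier could look like, not the paper's implementation. Everything in it is an assumption made for illustration: the function name `vsm_predict`, the per-feature `weights` standing in for the similarity metric, the Gaussian kernel, the width parameter `r` that scales the kernel to the mean distance of the `k` nearest neighbors, and the toy data. The optimization step (tuning `weights` and `r`, for example by conjugate gradient on a cross-validation error) is omitted.

```python
import math


def vsm_predict(query, train_x, train_y, weights, r=1.0, k=3):
    """Return kernel-weighted class probabilities for one query point.

    Hypothetical sketch: `weights` scales each feature (the learned metric),
    and `r` scales the kernel width relative to the local neighbor distance.
    """
    # Weighted squared Euclidean distance from the query to every example.
    d2 = [
        sum((w * (q - x)) ** 2 for w, q, x in zip(weights, query, row))
        for row in train_x
    ]
    # Indices of the k nearest neighbors under the weighted metric.
    nearest = sorted(range(len(train_x)), key=lambda i: d2[i])[:k]
    # Variable kernel: the width follows the mean distance to those neighbors,
    # so it shrinks in dense regions and grows in sparse ones.
    mean_dist = sum(math.sqrt(d2[i]) for i in nearest) / len(nearest)
    sigma = max(r * mean_dist, 1e-12)
    # Subtract the smallest distance before exponentiating; this leaves the
    # normalized probabilities unchanged but avoids underflow to all zeros.
    d2_min = d2[nearest[0]]
    votes = {}
    for i in nearest:
        kernel = math.exp(-(d2[i] - d2_min) / (2.0 * sigma ** 2))
        votes[train_y[i]] = votes.get(train_y[i], 0.0) + kernel
    total = sum(votes.values())
    return {label: v / total for label, v in votes.items()}


# Toy data (made up for this example): two well-separated classes.
train_x = [[0.0, 0.0], [0.0, 1.0], [5.0, 5.0], [5.0, 6.0]]
train_y = ["a", "a", "b", "b"]
probs = vsm_predict([0.2, 0.1], train_x, train_y, weights=[1.0, 1.0])
print(probs)
```

In a sketch like this the only free parameters are one weight per input feature plus `r`, which is consistent with the abstract's remark that far fewer parameters need to be optimized than in backpropagation or RBF networks.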

This publication has 12 references indexed in Scilit:

Local Learning Algorithms
Neural Computation, 1992
Refinements to nearest-neighbor searching in k-dimensional trees
Algorithmica, 1991
Constructing a generalizer superior to NETtalk via a mathematical theory of generalization
Neural Networks, 1990
Fast Learning in Networks of Locally-Tuned Processing Units
Neural Computation, 1989
Learning Arm Kinematics and Dynamics
Annual Review of Neuroscience, 1989
Rules and exemplars in categorization, identification, and recognition.
Journal of Experimental Psychology: Learning, Memory, and Cognition, 1989
Locally Weighted Regression: An Approach to Regression Analysis by Local Fitting
Journal of the American Statistical Association, 1988
Analysis of hidden units in a layered network trained to classify sonar targets
Neural Networks, 1988
An Algorithm for Finding Best Matches in Logarithmic Expected Time
ACM Transactions on Mathematical Software, 1977
Nearest neighbor pattern classification
IEEE Transactions on Information Theory, 1967

Cited by 158 articles