Perfect metrics

Abstract
The authors describe an experiment in the construction of perfect metrics for minimum-distance classification of character images. A perfect metric is one that, with high probability, is zero for correct classifications and non-zero for incorrect classifications. Such metrics promise excellent reject behavior in addition to good rank ordering. The approach is to infer from the training data faithful but concise representations of the empirical class-conditional distributions. In doing this, the authors abandon many of the usual simplifying assumptions about the distributions, e.g., that they are simply connected, unimodal, convex, or parametric (e.g., Gaussian). The method requires unusually large and representative training sets, which the authors provide through pseudorandom generation of training samples using a realistic model of printing and imaging distortions. The authors illustrate the method on a challenging recognition problem: 3755 character classes of machine-print Chinese, in four typefaces, over a range of text sizes. In a test on over three million images, the perfect-metric classifier achieved better than 99% top-choice accuracy and is shown to be superior to a conventional parametric classifier.
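The reject behavior described above can be illustrated with a minimal sketch of minimum-distance classification with a reject rule. Everything here is an assumption for demonstration: the class means, the Euclidean metric, and the threshold value. The paper's perfect metrics are learned, non-parametric distances inferred from large training sets, which this toy example does not attempt to reproduce.

```python
import numpy as np

def classify(x, class_means, reject_threshold):
    """Assign x to the nearest class mean, or reject if too distant.

    Returns (label, distance); label is "reject" when the smallest
    distance exceeds the threshold.
    """
    dists = {label: np.linalg.norm(x - mu) for label, mu in class_means.items()}
    best = min(dists, key=dists.get)
    d = dists[best]
    # With a near-perfect metric, correct classifications have distance
    # close to zero, so a small threshold cleanly separates confident
    # answers from rejects.
    if d > reject_threshold:
        return "reject", d
    return best, d

# Hypothetical two-class example in 2-D feature space.
means = {"a": np.array([0.0, 0.0]), "b": np.array([1.0, 1.0])}
print(classify(np.array([0.05, 0.0]), means, reject_threshold=0.5))
print(classify(np.array([3.0, 3.0]), means, reject_threshold=0.5))
```

The first query lies very close to the mean of class "a" and is accepted; the second is far from every class mean and is rejected rather than force-classified.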
