Small sample issues for microarray‐based classification
Open Access
- 27 February 2001
- journal article
- Published by Wiley in Comparative and Functional Genomics
- Vol. 2 (1), 28-34
- https://doi.org/10.1002/cfg.62
Abstract
In order to study the molecular biological differences between normal and diseased tissues, it is desirable to perform classification among diseases and stages of disease using microarray-based gene-expression values. Owing to the limited number of microarrays typically used in these studies, serious issues arise with respect to the design, performance and analysis of classifiers based on microarray data. This paper reviews some fundamental issues facing small-sample classification: classification rules, constrained classifiers, error estimation and feature selection. It discusses both unconstrained and constrained classifier design from sample data, and the contributions to classifier error from constrained optimization and lack of optimality owing to design from sample data. The difficulty with estimating classifier error when confined to small samples is addressed, particularly estimating the error from training data. The impact of small samples on the ability to include more than a few variables as classifier features is explained.Keywords
This publication has 8 references indexed in Scilit:
- General nonlinear framework for the analysis of gene interaction via multivariate expression arraysJournal of Biomedical Optics, 2000
- Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression MonitoringScience, 1999
- Expression profiling using cDNA microarraysNature Genetics, 1999
- Exploring the Metabolic and Genetic Control of Gene Expression on a Genomic ScaleScience, 1997
- A Probabilistic Theory of Pattern RecognitionPublished by Springer Nature ,1996
- Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA MicroarrayScience, 1995
- The Origins of OrderPublished by Oxford University Press (OUP) ,1993
- On the Uniform Convergence of Relative Frequencies of Events to Their ProbabilitiesTheory of Probability and Its Applications, 1971