CMA – a comprehensive Bioconductor package for supervised classification with high dimensional data

Open Access

16 October 2008

journal article
research article
Published by Springer Nature in BMC Bioinformatics

Vol. 9 (1), 439
https://doi.org/10.1186/1471-2105-9-439

Abstract

For the last eight years, microarray-based classification has been a major topic in statistics, bioinformatics and biomedicine research. Traditional methods often yield unsatisfactory results or may even be inapplicable in the so-called "p ≫ n" setting where the number of predictors p by far exceeds the number of observations n, hence the term "ill-posed-problem". Careful model selection and evaluation satisfying accepted good-practice standards is a very complex task for statisticians without experience in this area or for scientists with limited statistical background. The multiplicity of available methods for class prediction based on high-dimensional data is an additional practical challenge for inexperienced researchers.

Keywords

This publication has 55 references indexed in Scilit:

Reducing the probability of false positive research findings by pre-publication validation – Experience with a large multiple sclerosis database
BMC Medical Research Methodology, 2008
SignS: a parallelized, open-source, freely available, web-based tool for gene selection and molecular signatures for survival and censored data
BMC Bioinformatics, 2008
Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models
BMC Bioinformatics, 2008
Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
Proceedings of the National Academy of Sciences, 2005
PLS Dimension Reduction for Classification with Microarray Data
Statistical Applications in Genetics and Molecular Biology, 2004
A Compendium to Ensure Computational Reproducibility in High-Dimensional Classification Tasks
Statistical Applications in Genetics and Molecular Biology, 2004
Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments
Statistical Applications in Genetics and Molecular Biology, 2004
KEGG: Kyoto Encyclopedia of Genes and Genomes
Nucleic Acids Research, 2000
A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting
Journal of Computer and System Sciences, 1997
Regularized Discriminant Analysis
Journal of the American Statistical Association, 1989

Cited by 83 articles