Addressing the Challenge of Defining Valid Proteomic Biomarkers and Classifiers

Open Access

10 December 2010

journal article
research article
Published by Springer Nature in BMC Bioinformatics

Vol. 11 (1), 594
https://doi.org/10.1186/1471-2105-11-594

Abstract

Background: The purpose of this manuscript is to provide, based on an extensive analysis of a proteomic data set, suggestions for proper statistical analysis for the discovery of sets of clinically relevant biomarkers. As tractable example we define the measurable proteomic differences between apparently healthy adult males and females. We choose urine as body-fluid of interest and CE-MS, a thoroughly validated platform technology, allowing for routine analysis of a large number of samples. The second urine of the morning was collected from apparently healthy male and female volunteers (aged 21-40) in the course of the routine medical check-up before recruitment at the Hannover Medical School. Results: We found that the Wilcoxon-test is best suited for the definition of potential biomarkers. Adjustment for multiple testing is necessary. Sample size estimation can be performed based on a small number of observations via resampling from pilot data. Machine learning algorithms appear ideally suited to generate classifiers. Assessment of any results in an independent test-set is essential. Conclusions: Valid proteomic biomarkers for diagnosis and prognosis only can be defined by applying proper statistical data mining procedures. In particular, a justification of the sample size should be part of the study design.

Keywords

This publication has 47 references indexed in Scilit:

Naturally Occurring Human Urinary Peptides for Use in Diagnosis of Chronic Kidney Disease
Molecular & Cellular Proteomics, 2010
Urinary Collagen Fragments Are Significantly Altered in Diabetes: A Link to Pathophysiology
PLOS ONE, 2010
Survival Prediction for Pancreatic Cancer Patients Receiving Gemcitabine Treatment
Molecular & Cellular Proteomics, 2010
Power and sample size estimation in microarray studies
BMC Bioinformatics, 2010
Identification and Validation of Urinary Biomarkers for Differential Diagnosis and Evaluation of Therapeutic Intervention in Anti-neutrophil Cytoplasmic Antibody-associated Vasculitis
Molecular & Cellular Proteomics, 2009
Capillary electrophoresis–mass spectrometry as a powerful tool in biomarker discovery and clinical diagnosis: An update of recent developments
Mass Spectrometry Reviews, 2008
CE‐MS analysis of the human urinary proteome for biomarker discovery and disease diagnostics
Proteomics – Clinical Applications, 2008
A unified approach to false discovery rate estimation
BMC Bioinformatics, 2008
Estimation of the disease-specific diagnostic marker distribution under verification bias
Computational Statistics & Data Analysis, 2008
Protein biomarker discovery and validation: the long and uncertain path to clinical utility
Nature Biotechnology, 2006

Cited by 121 articles