Challenges in biomarker discovery: combining expert insights with statistical analysis of complex omics data
- 27 August 2012
- journal article
- Published by Informa Healthcare in Expert Opinion on Medical Diagnostics
- Vol. 7 (1), 37-51
- https://doi.org/10.1517/17530059.2012.718329
Abstract
Introduction: The advent of high throughput technologies capable of comprehensive analysis of genes, transcripts, proteins and other significant biological molecules has provided an unprecedented opportunity for the identification of molecular markers of disease processes. However, it has simultaneously complicated the problem of extracting meaningful molecular signatures of biological processes from these complex datasets. The process of biomarker discovery and characterization provides opportunities for more sophisticated approaches to integrating purely statistical and expert knowledge-based approaches. Areas covered: In this review we will present examples of current practices for biomarker discovery from complex omic datasets and the challenges that have been encountered in deriving valid and useful signatures of disease. We will then present a high-level review of data-driven (statistical) and knowledge-based methods applied to biomarker discovery, highlighting some current efforts to combine the two distinct approaches. Expert opinion: Effective, reproducible and objective tools for combining data-driven and knowledge-based approaches to identify predictive signatures of disease are key to future success in the biomarker field. We will describe our recommendations for possible approaches to this problem including metrics for the evaluation of biomarkers.Keywords
This publication has 96 references indexed in Scilit:
- A statistical selection strategy for normalization procedures in LC‐MS proteomics experiments through dataset‐dependent ranking of normalization scaling factorsProteomics, 2011
- CA 125 and the detection of recurrent ovarian cancerCancer, 2010
- Systematic and integrative analysis of large gene lists using DAVID bioinformatics resourcesNature Protocols, 2008
- What are decision trees?Nature Biotechnology, 2008
- Combining multiple serum tumor markers improves detection of stage I epithelial ovarian cancerGynecologic Oncology, 2007
- Link test—A statistical method for finding prostate cancer biomarkersComputational Biology and Chemistry, 2006
- Proteome survey reveals modularity of the yeast cell machineryNature, 2006
- A Multigene Assay to Predict Recurrence of Tamoxifen-Treated, Node-Negative Breast CancerNew England Journal of Medicine, 2004
- PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetesNature Genetics, 2003
- A Gene-Expression Signature as a Predictor of Survival in Breast CancerNew England Journal of Medicine, 2002