Multiclass cancer diagnosis using tumor gene expression signatures
- 11 December 2001
- journal article
- research article
- Published in Proceedings of the National Academy of Sciences
- Vol. 98 (26), 15149-15154
- https://doi.org/10.1073/pnas.211566398
Abstract
The optimal treatment of patients with cancer depends on establishing accurate diagnoses by using a complex combination of clinical and histopathological data. In some instances, this task is difficult or impossible because of atypical clinical presentation or histopathology. To determine whether the diagnosis of multiple common adult malignancies could be achieved purely by molecular classification, we subjected 218 tumor samples, spanning 14 common tumor types, and 90 normal tissue samples to oligonucleotide microarray gene expression analysis. The expression levels of 16,063 genes and expressed sequence tags were used to evaluate the accuracy of a multiclass classifier based on a support vector machine algorithm. Overall classification accuracy was 78%, far exceeding the accuracy of random classification (9%). Poorly differentiated cancers resulted in low-confidence predictions and could not be accurately classified according to their tissue of origin, indicating that they are molecularly distinct entities with dramatically different gene expression patterns compared with their well differentiated counterparts. Taken together, these results demonstrate the feasibility of accurate, multiclass molecular cancer classification and suggest a strategy for future clinical implementation of molecular cancer diagnostics.
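The abstract describes a multiclass classifier built on a support vector machine, reports accuracy against a random baseline, and mentions low-confidence predictions. The following is a minimal sketch of how such a pipeline could be assembled, assuming a one-vs-rest scheme in which each class has its own binary classifier and the largest decision value wins. The abstract does not state that scheme, the confidence threshold, or how the 9% baseline was computed, so `ova_predict`, `chance_accuracy`, the threshold of 0.0, and the toy decision values are all hypothetical illustrations, not the authors' method or data.

```python
from collections import Counter


def ova_predict(scores, threshold=0.0):
    """Combine one-vs-rest outputs into a single multiclass call.

    `scores` maps a class label to the signed decision value of a
    hypothetical binary classifier for that class (positive = "is this class").
    Returns (label, confidence, is_high_confidence).
    """
    label = max(scores, key=scores.get)   # class with the largest decision value
    confidence = scores[label]            # assumption: top value doubles as confidence
    return label, confidence, confidence > threshold


def accuracy(true_labels, predicted_labels):
    """Fraction of samples whose predicted label matches the true label."""
    hits = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return hits / len(true_labels)


def chance_accuracy(true_labels):
    """Expected accuracy when guessing in proportion to class frequency.

    This is one common convention for a random baseline; it is an
    assumption here, not necessarily the one behind the reported 9%.
    """
    n = len(true_labels)
    return sum((count / n) ** 2 for count in Counter(true_labels).values())


# Made-up decision values for three samples (not data from the paper).
samples = [
    ("breast", {"breast": 1.2, "lung": -0.8, "colon": -1.1}),
    ("lung", {"breast": -0.4, "lung": 0.6, "colon": -0.9}),
    ("colon", {"breast": -0.2, "lung": -0.1, "colon": -0.3}),
]
truth = [label for label, _ in samples]
results = [ova_predict(scores) for _, scores in samples]
predicted = [label for label, _, _ in results]

print("predicted:", predicted)
print("accuracy:", accuracy(truth, predicted))
print("chance:", chance_accuracy(truth))
```

In this toy run the third sample has no positive decision value, so it is flagged as low confidence and is also misclassified, which mirrors the kind of behavior the abstract reports for poorly differentiated tumors.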
This publication has 24 references indexed in Scilit:
- Molecular classification of human carcinomas by use of gene expression signatures. 2001
- Chemosensitivity prediction by transcriptional profiling. Proceedings of the National Academy of Sciences, 2001
- Delineation of prognostic biomarkers in prostate cancer. Nature, 2001
- Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks. Nature Medicine, 2001
- Gene-Expression Profiles in Hereditary Breast Cancer. New England Journal of Medicine, 2001
- Identification of a Mouse Homolog of the Human BTEB2 Transcription Factor as a β-Catenin-Independent Wnt-1-Responsive Gene. Molecular and Cellular Biology, 2001
- A gene expression database for the molecular pharmacology of cancer. Nature Genetics, 2000
- Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature, 2000
- Regularization Networks and Support Vector Machines. Advances in Computational Mathematics, 2000
- Treatment of Patients with Cancer of an Unknown Primary Site. New England Journal of Medicine, 1993