Bayesian approach to discovering pathogenic SNPs in conserved protein domains
- 18 June 2004
- journal article
- research article
- Published by Hindawi Limited in Human Mutation
- Vol. 24 (2), 178-184
- https://doi.org/10.1002/humu.20063
Abstract
The success rate of association studies can be improved by selecting better genetic markers for genotyping or by providing better leads for identifying pathogenic single nucleotide polymorphisms (SNPs) in the regions of linkage disequilibrium with positive disease associations. We have developed a novel algorithm to predict pathogenic single amino acid changes, either nonsynonymous SNPs (nsSNPs) or missense mutations, in conserved protein domains. Using a Bayesian framework, we found that the probability of a microbial missense mutation causing a significant change in phenotype depended on how much difference it made in several phylogenetic, biochemical, and structural features related to the single amino acid substitution. We tested our model on pathogenic allelic variants (missense mutations or nsSNPs) included in OMIM, and on the other nsSNPs in the same genes (from dbSNP) as the nonpathogenic variants. As a result, our model predicted pathogenic variants with a 10% false‐positive rate. The high specificity of our prediction algorithm should make it valuable in genetic association studies aimed at identifying pathogenic SNPs. Hum Mutat 24:178–184, 2004.Keywords
This publication has 22 references indexed in Scilit:
- A comparative study of machine-learning methods to predict the effects of single nucleotide polymorphisms on protein functionBioinformatics, 2003
- The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003Nucleic Acids Research, 2003
- Accounting for Human Polymorphisms Predicted to Affect Protein FunctionGenome Research, 2002
- Prediction of deleterious human allelesHuman Molecular Genetics, 2001
- Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: structure-based assessment of amino acid variation11Edited by F. CohenJournal of Molecular Biology, 2001
- How many diseases does it take to map a gene with SNPs?Nature Genetics, 2000
- Genetic Studies of the Lac Repressor XV: 4000 Single Amino Acid Substitutions and Analysis of the Resulting Phenotypes on the Basis of the Protein StructureJournal of Molecular Biology, 1996
- Systematic mutation of bacteriophage T4 lysozymeJournal of Molecular Biology, 1991
- A Thermodynamic Scale for the Helix-Forming Tendencies of the Commonly Occurring Amino AcidsScience, 1990
- Complete mutagenesis of the HIV-1 proteaseNature, 1989