Physicochemical constraint violation by missense substitutions mediates impairment of protein function and disease severity
Top Cited Papers
Open Access
- 17 June 2005
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 15 (7), 978-986
- https://doi.org/10.1101/gr.3804205
Abstract
We find that the degree of impairment of protein function by missense variants is predictable by comparative sequence analysis alone. The applicable range of impairment is not confined to binary predictions that distinguish normal from deleterious variants, but extends continuously from mild to severe effects. The accuracy of predictions is strongly dependent on sequence variation and is highest when diverse orthologs are available. High predictive accuracy is achieved by quantification of the physicochemical characteristics in each position of the protein, based on observed evolutionary variation. The strong relationship between physicochemical characteristics of a missense variant and impairment of protein function extends to human disease. By using four diverse proteins for which sufficient comparative sequence data are available, we show that grades of disease, or likelihood of developing cancer, correlate strongly with physicochemical constraint violation by causative amino acid variants.Keywords
This publication has 32 references indexed in Scilit:
- Functional classification of proteins and protein variantsProceedings of the National Academy of Sciences, 2004
- A comparative study of machine-learning methods to predict the effects of single nucleotide polymorphisms on protein functionBioinformatics, 2003
- Human Gene Mutation Database (HGMD®): 2003 updateHuman Mutation, 2003
- Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex diseaseNature Genetics, 2003
- Human immunodeficiency virus reverse transcriptase and protease sequence databaseNucleic Acids Research, 2003
- Human non-synonymous SNPs: server and surveyNucleic Acids Research, 2002
- The IARC TP53 database: New online mutation analysis and recommendations to usersHuman Mutation, 2002
- Inference of functional regions in proteins by quantification of evolutionary constraintsProceedings of the National Academy of Sciences, 2002
- Accounting for Human Polymorphisms Predicted to Affect Protein FunctionGenome Research, 2002
- The functional importance of disease-associated mutationBMC Bioinformatics, 2002