Evolutionary constraints on structural similarity in orthologs and paralogs
Open Access
- 16 April 2009
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 18 (6), 1306-1315
- https://doi.org/10.1002/pro.143
Abstract
Although a quantitative relationship between sequence similarity and structural similarity has long been established, little is known about the impact of orthology on the relationship between protein sequence and structure. Among homologs, orthologs (derived by speciation) more frequently have similar functions than paralogs (derived by duplication). Here, we hypothesize that an orthologous pair will tend to exhibit greater structural similarity than a paralogous pair at the same level of sequence similarity. To test this hypothesis, we used 284,459 pairwise structure‐based alignments of 12,634 unique domains from SCOP as well as orthology and paralogy assignments from OrthoMCL DB. We divided the comparisons by sequence identity and determined whether the sequence‐structure relationship differed between the orthologs and paralogs. We found that at levels of sequence identity between 30 and 70%, orthologous domain pairs indeed tend to be significantly more structurally similar than paralogous pairs at the same level of sequence identity. An even larger difference is found when comparing ligand binding residues instead of whole domains. These differences between orthologs and paralogs are expected to be useful for selecting template structures in comparative modeling and target proteins in structural genomics.
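The binning step described in the abstract (dividing comparisons by sequence identity, then contrasting orthologs with paralogs inside each bin) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the record layout `(identity_percent, structural_similarity, relation)`, the 10-point bin width, the helper names `bin_by_identity` and `mean_difference_per_bin`, and the use of a plain difference of means. The abstract does not state which structural similarity measure or statistical test the authors used, so this is not their procedure.

```python
from collections import defaultdict
from statistics import mean


def bin_by_identity(pairs, bin_width=10):
    """Group hypothetical (identity_percent, similarity, relation) records
    into sequence-identity bins keyed by the lower edge of each bin."""
    bins = defaultdict(lambda: {"ortholog": [], "paralog": []})
    for identity, similarity, relation in pairs:
        # Place 100% identity in the top bin instead of opening a new one.
        start = min(int(identity // bin_width) * bin_width, 100 - bin_width)
        bins[start][relation].append(similarity)
    return bins


def mean_difference_per_bin(pairs, bin_width=10):
    """Return {bin_start: mean(ortholog) - mean(paralog)} for every bin
    that contains at least one pair of each kind."""
    result = {}
    for start, groups in sorted(bin_by_identity(pairs, bin_width).items()):
        if groups["ortholog"] and groups["paralog"]:
            result[start] = mean(groups["ortholog"]) - mean(groups["paralog"])
    return result


if __name__ == "__main__":
    # Toy records invented for illustration only; not data from the study.
    toy_pairs = [
        (35, 0.80, "ortholog"),
        (38, 0.60, "paralog"),
        (55, 0.90, "ortholog"),
        (52, 0.70, "paralog"),
        (95, 0.99, "ortholog"),  # no paralog in this bin, so it is skipped
    ]
    for start, diff in mean_difference_per_bin(toy_pairs).items():
        print(f"{start}-{start + 10}% identity: mean difference {diff:+.2f}")
```

A positive value in a bin means the orthologous pairs in that bin scored higher on the chosen similarity measure than the paralogous pairs. A real analysis would also need a significance test per bin, which this sketch leaves out.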