PIRSF Family Classification System for Protein Functional and Evolutionary Analysis
Open Access
- 1 January 2006
- journal article
- research article
- Published by SAGE Publications in Evolutionary Bioinformatics
- Vol. 2, 197-209
- https://doi.org/10.1177/117693430600200033
Abstract
The PIRSF protein classification system (http://pir.georgetown.edu/pirsf/) reflects evolutionary relationships of full-length proteins and domains. The primary PIRSF classification unit is the homeomorphic family, whose members are both homologous (evolved from a common ancestor) and homeomorphic (sharing full-length sequence similarity and a common domain architecture). PIRSF families are curated systematically based on literature review and integrative sequence and functional analysis, including sequence and structure similarity, domain architecture, functional association, genome context, and phyletic pattern. The results of classification and expert annotation are summarized in PIRSF family reports with graphical viewers for taxonomic distribution, domain architecture, family hierarchy, and multiple alignment and phylogenetic tree. The PIRSF system provides a comprehensive resource for bioinformatics analysis and comparative studies of protein function and evolution. Domain or fold-based searches allow identification of evolutionarily related protein families sharing domains or structural folds. Functional convergence and functional divergence are revealed by the relationships between protein classification and curated family functions. The taxonomic distribution allows the identification of lineage-specific or broadly conserved protein families and can reveal horizontal gene transfer. Here we demonstrate, with illustrative examples, how to use the web-based PIRSF system as a tool for functional and evolutionary studies of protein families.