Comparative Analysis of Chloroplast Genomes: Functional Annotation, Genome-Based Phylogeny, and Deduced Evolutionary Patterns
Open Access
- 1 April 2002
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 12 (4), 567-583
- https://doi.org/10.1101/gr.209402
Abstract
All protein sequences from 19 complete chloroplast genomes (cpDNA) have been studied using a new computational method able to analyze functional correlations among series of protein sequences contained in complete proteomes. First, all open reading frames (ORFs) from the cpDNAs, comprising a total of 2266 protein sequences, were compared against the 3168 proteins from the complete genome of Synechocystis PCC6803 to find functionally related orthologous proteins. Additionally, all cpDNA genomes were compared pairwise to find orthologous groups not present in cyanobacteria. Annotations in the cluster of orthologous proteins database and CyanoBase were used as references for the functional assignments. Following this protocol, new functional assignments were made for ORFs of unknown function and for ycfs (hypothetical chloroplast frames), which still lack a functional assignment. Using this information, a matrix of functional relationships was derived from profiles of the presence and/or absence of orthologous proteins; the matrix included 1837 proteins in 277 orthologous clusters. A factor analysis of this matrix, followed by cluster analysis, allowed us to obtain accurate phylogenetic reconstructions and to detect genes probably involved in speciation as phylogenetic correlates. Finally, by grouping common evolutionary patterns, we show that it is possible to determine functionally linked protein networks. This has allowed us to suggest putative associations for some unknown ORFs.
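The presence/absence profiling described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the genome names, cluster IDs, and helper functions below are hypothetical, the factor analysis step is omitted, and a plain Hamming distance stands in for the statistical treatment used in the paper. The sketch only shows how a profile matrix might be built, how genomes could be compared through it, and how orthologous clusters that share an identical evolutionary pattern could be grouped as candidates for functional linkage.

```python
from itertools import combinations

# Hypothetical toy data (not from the paper): genome -> orthologous clusters present.
genomes = {
    "genome_A": {"c1", "c2", "c3", "c4"},
    "genome_B": {"c1", "c2", "c3"},
    "genome_C": {"c1", "c4"},
    "genome_D": {"c1"},
}


def profile_matrix(genomes):
    """Return (genome names, {cluster: 0/1 presence tuple across genomes})."""
    names = sorted(genomes)
    clusters = sorted(set().union(*genomes.values()))
    matrix = {c: tuple(int(c in genomes[g]) for g in names) for c in clusters}
    return names, matrix


def hamming(a, b):
    """Number of positions at which two equal-length profiles differ."""
    return sum(x != y for x, y in zip(a, b))


def genome_distances(names, matrix):
    """Pairwise Hamming distance between genomes (columns of the matrix)."""
    rows = list(matrix.values())
    cols = {g: tuple(row[i] for row in rows) for i, g in enumerate(names)}
    return {(g, h): hamming(cols[g], cols[h]) for g, h in combinations(names, 2)}


def linked_clusters(matrix):
    """Group clusters sharing an identical presence/absence pattern."""
    groups = {}
    for cluster, profile in matrix.items():
        groups.setdefault(profile, []).append(cluster)
    return [sorted(members) for members in groups.values() if len(members) > 1]


names, matrix = profile_matrix(genomes)
distances = genome_distances(names, matrix)
linked = linked_clusters(matrix)
```

In this toy setting, `distances` could feed any hierarchical clustering routine to build a tree of genomes, and `linked` lists clusters that are gained and lost together. A real analysis would need a tolerance for near-identical profiles rather than exact matches.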