Selection of Conserved Blocks from Multiple Alignments for Their Use in Phylogenetic Analysis
Top Cited Papers
Open Access
- 1 April 2000
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 17 (4), 540-552
- https://doi.org/10.1093/oxfordjournals.molbev.a026334
Abstract
The use of some multiple-sequence alignments in phylogenetic analysis, particularly those that are not very well conserved, requires the elimination of poorly aligned positions and divergent regions, since they may not be homologous or may have been saturated by multiple substitutions. A computerized method that eliminates such positions and at the same time tries to minimize the loss of informative sites is presented here. The method is based on the selection of blocks of positions that fulfill a simple set of requirements with respect to the number of contiguous conserved positions, lack of gaps, and high conservation of flanking positions, making the final alignment more suitable for phylogenetic analysis. To illustrate the efficiency of this method, alignments of 10 mitochondrial proteins from several completely sequenced mitochondrial genomes belonging to diverse eukaryotes were used as examples. The percentages of removed positions were higher in the most divergent alignments. After removing divergent segments, the amino acid composition of the different sequences was more uniform, and pairwise distances became much smaller. Phylogenetic trees show that topologies can be different after removing conserved blocks, particularly when there are several poorly resolved nodes. Strong support was found for the grouping of animals and fungi but not for the position of more basal eukaryotes. The use of a computerized method such as the one presented here reduces to a certain extent the necessity of manually editing multiple alignments, makes the automation of phylogenetic analysis of large data sets feasible, and facilitates the reproduction of the final alignment by other researchers.Keywords
This publication has 50 references indexed in Scilit:
- The Complete Mitochondrial DNA Sequences of Nephroselmis olivacea and Pedinomonas minor: Two Radically Different Evolutionary Patterns within Green AlgaePlant Cell, 1999
- Complete Sequence of the Mitochondrial DNA of the Red Alga Porphyra purpurea: Cyanobacterial Introns and Shared Ancestry of Red and Green AlgaePlant Cell, 1999
- Phylogenetic information and experimental design in molecular systematicsProceedings Of The Royal Society B-Biological Sciences, 1998
- Elision: A Method for Accommodating Multiple Molecular Sequence Alignments with Alignment-Ambiguous SitesMolecular Phylogenetics and Evolution, 1995
- The Mitochondrial DNA of the Amoeboid Protozoon,Acanthamoeba castellanii: Complete Sequence, Gene Content and Genome OrganizationJournal of Molecular Biology, 1995
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Complete Sequence of the Mitochondrial DNA of the Chlorophyte Alga Prototheca wickerhamii: Gene Content and Genome OrganizationJournal of Molecular Biology, 1994
- A statistical method for detecting regions with different evolutionary dynamics in multialigned sequencesMolecular Phylogenetics and Evolution, 1992
- Gene organization deduced from the complete sequence of liverwort Marchantia polymorpha mitochondrial DNAJournal of Molecular Biology, 1992
- Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in hominoideaJournal of Molecular Evolution, 1989