ImOSM: Intermittent Evolution and Robustness of Phylogenetic Methods
Open Access
- 22 September 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 29 (2), 663-673
- https://doi.org/10.1093/molbev/msr220
Abstract
Among the criteria to evaluate the performance of a phylogenetic method, robustness to model violation is of particular practical importance as complete a priori knowledge of evolutionary processes is typically unavailable. For studies of robustness in phylogenetic inference, a utility to add well-defined model violations to the simulated data would be helpful. We therefore introduce ImOSM, a tool to imbed intermittent evolution as model violation into an alignment. Intermittent evolution refers to extra substitutions occurring randomly on branches of a tree, thus changing alignment site patterns. This means that the extra substitutions are placed on the tree after the typical process of sequence evolution is completed. We then study the robustness of widely used phylogenetic methods: maximum likelihood (ML), maximum parsimony (MP), and a distance-based method (BIONJ) to various scenarios of model violation. Violation of rates across sites (RaS) heterogeneity and simultaneous violation of RaS and the transition/transversion ratio on two nonadjacent external branches hinder all the methods recovery of the true topology for a four-taxon tree. For an eight-taxon balanced tree, the violations cause each of the three methods to infer a different topology. Both ML and MP fail, whereas BIONJ, which calculates the distances based on the ML estimated parameters, reconstructs the true tree. Finally, we report that a test of model homogeneity and goodness of fit tests have enough power to detect such model violations. The outcome of the tests can help to actually gain confidence in the inferred trees. Therefore, we recommend using these tests in practical phylogenetic analyses.Keywords
This publication has 48 references indexed in Scilit:
- INDELible: A Flexible Simulator of Biological Sequence EvolutionMolecular Biology and Evolution, 2009
- A call for likelihood phylogenetics even when the process of sequence evolution is heterogeneousMolecular Phylogenetics and Evolution, 2005
- In silico sequence evolution with site-specific interactions along phylogenetic treesBioinformatics, 2005
- An Empirical Assessment of Long-Branch Attraction Artefacts in Deep Eukaryotic PhylogenomicsSystematic Biology, 2005
- Maximum Likelihood Outperforms Maximum Parsimony Even When Evolutionary Rates Are HeterotachousMolecular Biology and Evolution, 2005
- Should we be worried about long-branch attraction in real data sets? Investigations using metazoan 18S rDNAMolecular Phylogenetics and Evolution, 2004
- Topological bias and inconsistency of maximum likelihood using wrong modelsMolecular Biology and Evolution, 1999
- BIONJ: an improved version of the NJ algorithm based on a simple model of sequence dataMolecular Biology and Evolution, 1997
- Robustness of maximum likelihood tree estimation against different patterns of base substitutionsJournal of Molecular Evolution, 1991
- Cases in which Parsimony or Compatibility Methods Will be Positively MisleadingSystematic Zoology, 1978