New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0
Top Cited Papers
Open Access
- 29 March 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Systematic Biology
- Vol. 59 (3), 307-321
- https://doi.org/10.1093/sysbio/syq010
Abstract
PhyML is a phylogeny software based on the maximum-likelihood principle. Early PhyML versions used a fast algorithm performing nearest neighbor interchanges to improve a reasonable starting tree topology. Since the original publication (Guindon S., Gascuel O. 2003. A simple, fast and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52:696–704), PhyML has been widely used (>2500 citations in ISI Web of Science) because of its simplicity and a fair compromise between accuracy and speed. In the meantime, research around PhyML has continued, and this article describes the new algorithms and methods implemented in the program. First, we introduce a new algorithm to search the tree space with user-defined intensity using subtree pruning and regrafting topological moves. The parsimony criterion is used here to filter out the least promising topology modifications with respect to the likelihood function. The analysis of a large collection of real nucleotide and amino acid data sets of various sizes demonstrates the good performance of this method. Second, we describe a new test to assess the support of the data for internal branches of a phylogeny. This approach extends the recently proposed approximate likelihood-ratio test and relies on a nonparametric, Shimodaira–Hasegawa–like procedure. A detailed analysis of real alignments sheds light on the links between this new approach and the more classical nonparametric bootstrap method. Overall, our tests show that the last version (3.0) of PhyML is fast, accurate, stable, and ready to use. A Web server and binary files are available from http://www.atgc-montpellier.fr/phyml/.Keywords
This publication has 32 references indexed in Scilit:
- Approximate Likelihood-Ratio Test for Branches: A Fast, Accurate, and Powerful AlternativeSystematic Biology, 2006
- Improving the efficiency of SPR moves in phylogenetic tree search methods based on maximum likelihoodBioinformatics, 2005
- TREEFINDER: a powerful graphical analysis environment for molecular phylogeneticsBMC Ecology and Evolution, 2004
- A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum LikelihoodSystematic Biology, 2003
- Accuracy and Power of the Likelihood Ratio Test in Detecting Adaptive Molecular EvolutionMolecular Biology and Evolution, 2001
- Likelihood-Based Tests of Topologies in PhylogeneticsSystematic Biology, 2000
- BIONJ: an improved version of the NJ algorithm based on a simple model of sequence dataMolecular Biology and Evolution, 1997
- PHYLOGENIES FROM MOLECULAR SEQUENCES: INFERENCE AND RELIABILITYAnnual Review of Genetics, 1988
- CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAPEvolution, 1985
- Evolution of Protein MoleculesPublished by Elsevier ,1969