PartitionFinder: Combined Selection of Partitioning Schemes and Substitution Models for Phylogenetic Analyses
Open Access
- 20 January 2012
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 29 (6), 1695-1701
- https://doi.org/10.1093/molbev/mss020
Abstract
In phylogenetic analyses of molecular sequence data, partitioning involves estimating independent models of molecular evolution for different sets of sites in a sequence alignment. Choosing an appropriate partitioning scheme is an important step in most analyses because it can affect the accuracy of phylogenetic reconstruction. Despite this, partitioning schemes are often chosen without explicit statistical justification. Here, we describe two new objective methods for the combined selection of best-fit partitioning schemes and nucleotide substitution models. These methods allow millions of partitioning schemes to be compared in realistic time frames and so permit the objective selection of partitioning schemes even for large multilocus DNA data sets. We demonstrate that these methods significantly outperform previous approaches, including both the ad hoc selection of partitioning schemes (e.g., partitioning by gene or codon position) and a recently proposed hierarchical clustering method. We have implemented these methods in an open-source program, PartitionFinder. This program allows users to select partitioning schemes and substitution models using a range of information-theoretic metrics (e.g., the Bayesian information criterion, Akaike information criterion [AIC], and corrected AIC). We hope that PartitionFinder will encourage the objective selection of partitioning schemes and thus lead to improvements in phylogenetic analyses. PartitionFinder is written in Python and runs under Mac OSX 10.4 and above. The program, source code, and a detailed manual are freely available from www.robertlanfear.com/partitionfinder.
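To make the information-theoretic comparison concrete, the following is a minimal Python sketch of how candidate partitioning schemes could be ranked by AIC, corrected AIC, and BIC. It is not PartitionFinder's code or API. The scheme names, log-likelihoods, parameter counts, and alignment length are hypothetical values chosen for illustration, and the function names are made up for this example. The sketch also assumes that a scheme is any grouping of predefined data blocks into subsets, in which case the number of possible schemes is the Bell number of the block count, which shows why the search space reaches the millions quickly.

```python
import math


def aic(ln_l, k):
    """Akaike information criterion: 2k - 2 lnL."""
    return 2 * k - 2 * ln_l


def aicc(ln_l, k, n):
    """Corrected AIC; requires n > k + 1."""
    return aic(ln_l, k) + (2 * k * (k + 1)) / (n - k - 1)


def bic(ln_l, k, n):
    """Bayesian information criterion: k ln(n) - 2 lnL."""
    return k * math.log(n) - 2 * ln_l


def bell_number(n):
    """Number of ways to group n data blocks into subsets (Bell triangle)."""
    row = [1]
    for _ in range(n):
        new_row = [row[-1]]
        for value in row:
            new_row.append(new_row[-1] + value)
        row = new_row
    return row[0]


def best_scheme(schemes, n, criterion="bic"):
    """Return the name of the scheme with the lowest score.

    schemes maps a name to (log-likelihood, number of free parameters);
    n is the sample size, taken here to be the number of alignment sites.
    """
    scorers = {
        "aic": lambda ln_l, k: aic(ln_l, k),
        "aicc": lambda ln_l, k: aicc(ln_l, k, n),
        "bic": lambda ln_l, k: bic(ln_l, k, n),
    }
    score = scorers[criterion]
    return min(schemes, key=lambda name: score(*schemes[name]))


# Hypothetical inputs: not results from the paper.
schemes = {
    "unpartitioned": (-10500.0, 10),
    "by_gene": (-10350.0, 30),
    "by_gene_and_codon": (-10280.0, 90),
}
n_sites = 3000

if __name__ == "__main__":
    for criterion in ("aic", "aicc", "bic"):
        print(criterion, best_scheme(schemes, n_sites, criterion))
    # 12 data blocks already allow 4,213,597 distinct groupings.
    print(bell_number(12))
```

With these made-up numbers, AIC and corrected AIC prefer the most heavily partitioned scheme, while BIC, which penalizes each extra parameter by ln(n) rather than 2, prefers partitioning by gene. The choice of metric can therefore change which scheme is selected.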