Bayesian Inference of Species Trees from Multilocus Data
Open Access
- 11 November 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 27 (3), 570-580
- https://doi.org/10.1093/molbev/msp274
Abstract
Until recently, it has been common practice for a phylogenetic analysis to use a single gene sequence from a single individual organism as a proxy for an entire species. With technological advances, it is now becoming more common to collect data sets containing multiple gene loci and multiple individuals per species. These data sets often reveal the need to directly model intraspecies polymorphism and incomplete lineage sorting in phylogenetic estimation procedures. For a single species, coalescent theory is widely used in contemporary population genetics to model intraspecific gene trees. Here, we present a Bayesian Markov chain Monte Carlo method for the multispecies coalescent. Our method coestimates multiple gene trees embedded in a shared species tree along with the effective population size of both extant and ancestral species. The inference is made possible by multilocus data from multiple individuals per species. Using a multiindividual data set and a series of simulations of rapid species radiations, we demonstrate the efficacy of our new method. These simulations give some insight into the behavior of the method as a function of sampled individuals, sampled loci, and sequence length. Finally, we compare our new method to both an existing method (BEST 2.2) with similar goals and the supermatrix (concatenation) method. We demonstrate that both BEST and our method have much better estimation accuracy for species tree topology than concatenation, and our method outperforms BEST in divergence time and population size estimation.
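The incomplete lineage sorting mentioned in the abstract can be illustrated with the simplest multispecies coalescent case: a species tree ((A,B),C) with one sampled lineage per species. The sketch below is not the paper's Bayesian MCMC method; it is a minimal simulation, under the standard assumption that the internal branch length is measured in coalescent units, that compares the simulated frequency of discordant gene trees with the well-known analytic value (2/3)·exp(-T). The function names `discordance_probability` and `simulate_discordance` are hypothetical and chosen only for this illustration.

```python
import math
import random


def discordance_probability(internal_branch: float) -> float:
    """Analytic probability that a three-taxon gene tree disagrees with the
    species tree ((A,B),C); the internal branch is in coalescent units."""
    return (2.0 / 3.0) * math.exp(-internal_branch)


def simulate_discordance(internal_branch: float, replicates: int, seed: int = 0) -> float:
    """Estimate the same probability by simulating one locus per replicate."""
    rng = random.Random(seed)
    discordant = 0
    for _ in range(replicates):
        # In the ancestor of A and B, the two lineages coalesce at rate 1.
        waiting_time = rng.expovariate(1.0)
        if waiting_time < internal_branch:
            continue  # A and B coalesced first, so the gene tree is concordant.
        # Otherwise all three lineages reach the root population, where each
        # of the three pairs is equally likely to coalesce first.
        first_pair = rng.choice(["AB", "AC", "BC"])
        if first_pair != "AB":
            discordant += 1
    return discordant / replicates


if __name__ == "__main__":
    for branch in (0.1, 0.5, 1.0, 2.0):
        simulated = simulate_discordance(branch, replicates=100_000)
        expected = discordance_probability(branch)
        print(f"T={branch:.1f}  simulated={simulated:.3f}  analytic={expected:.3f}")
```

Short internal branches, as in the rapid radiations the abstract describes, make discordant gene trees common, which is the reason a single gene tree is a poor proxy for the species tree in that regime.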