Maximum Likelihood Estimates of Species Trees: How Accuracy of Phylogenetic Inference Depends upon the Divergence History and Sampling Design
Open Access
- 20 August 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Systematic Biology
- Vol. 58 (5), 501-508
- https://doi.org/10.1093/sysbio/syp045
Abstract
The understanding that gene trees are often in discord with each other and with the species trees that contain them has led researchers to methods that incorporate the inherent stochasticity of genetic processes in the phylogenetic estimation procedure. Recently developed methods for species-tree estimation that not only consider the retention and sorting of ancestral polymorphism but also quantify the actual probabilities of incomplete lineage sorting are expected to provide an improvement over earlier summary-statistic based approaches that discard much of the information content of gene trees. However, these new methods have yet to be tested on truly challenging evolutionary histories such as those marked by recent rapid speciation where high levels of incomplete lineage sorting and discord among gene trees predominate. Here, we test a new maximum-likelihood method that incorporates stochastic models of both nucleotide substitution and lineage sorting for species-tree estimation. Using a simulation approach, we consider a broad range of species-tree topologies under 2 scenarios representing moderate and severe incomplete lineage sorting. We show that the maximum-likelihood method results in more accurate species trees than a summary-statistic based approach, demonstrating that information contained in discordant gene trees can be effectively extracted using a full probabilistic model. Moreover, we demonstrate that the shape of the original species tree (i.e., the relative lengths of internal branches) has a significant impact on whether the species tree is estimated accurately. In the speciation histories explored here, it is not just the recent origin of species that affects the accuracy of the estimates but the variance in relative species divergence times as well. Additionally, we show that sampling effort (number of individuals and/or loci) and sampling design (ratio of individuals to loci) are both important factors affecting the accuracy of species-tree estimates, which is again affected by the relative timing of divergence among species. The inherent difficulties of estimating relationships when species have undergone a recent radiation are discussed, and in particular, the limitations with maximum-likelihood estimates of species trees that do not consider uncertainty in the estimated gene trees of individual loci. Thus, despite substantial improvements over current summary-statistic based approaches, and the increased sophistication of procedures that incorporate the process of gene lineage coalescence, recent radiations still appear to pose daunting challenges for phylogeneticsKeywords
This publication has 34 references indexed in Scilit:
- Gene tree discordance, phylogenetic inference and the multispecies coalescentTrends in Ecology & Evolution, 2009
- STEM: species tree estimation using maximum likelihood for gene trees under coalescenceBioinformatics, 2009
- ESTIMATING SPECIES TREES USING MULTIPLE-ALLELE DNA SEQUENCE DATAEvolution, 2008
- Recent divergence with gene flow in Tennessee cave salamanders (Plethodontidae:Gyrinophilus) inferred from gene genealogiesMolecular Ecology, 2008
- Integrating Phylogenetic and Population Genetic Analyses of Multiple Loci to Test Species Divergence Hypotheses in Passerina BuntingsGenetics, 2008
- High-resolution species trees without concatenationProceedings of the National Academy of Sciences, 2007
- Widespread Discordance of Gene Trees with Species Tree in Drosophila: Evidence for Incomplete Lineage SortingPLoS Genetics, 2006
- Discordance of Species Trees with Their Most Likely Gene TreesPLoS Genetics, 2006
- A MULTILOCUS PERSPECTIVE ON REFUGIAL ISOLATION AND DIVERGENCE IN RAINFOREST SKINKS (CARLIA)Evolution, 2006
- Gene Trees in Species TreesSystematic Biology, 1997