Improvements to a Class of Distance Matrix Methods for Inferring Species Trees from Gene Trees
- 1 June 2012
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 19 (6), 632-649
- https://doi.org/10.1089/cmb.2012.0042
Abstract
Among the methods currently available for inferring species trees from gene trees, the GLASS method of Mossel and Roch (2010), the Shallowest Divergence (SD) method of Maddison and Knowles (2006), the STEAC method of Liu et al. (2009), and a related method that we call Minimum Average Coalescence (MAC) are computationally efficient and provide branch length estimates. Further, GLASS and STEAC have been shown to be consistent estimators of tree topology under a multispecies coalescent model. However, divergence time estimates obtained with these methods are all systematically biased under the model because the pairwise interspecific gene divergence times on which they rely must be more ancient than the species divergence time. Jewett and Rosenberg (2012) derived an expression for the bias of GLASS and used it to propose an improved method that they termed iGLASS. Here, we derive the biases of SD, STEAC, and MAC, and we propose improved analogues of these methods that we call iSD, iSTEAC, and iMAC. We conduct simulations to compare the performance of these methods with their original counterparts and with GLASS and iGLASS, finding that each of them decreases the bias and mean squared error of pairwise divergence time estimates. The new methods can therefore contribute to improvements in the estimation of species trees from information on gene trees.Keywords
This publication has 16 references indexed in Scilit:
- iGLASS: An Improvement to the GLASS Method for Estimating Species Trees from Gene TreesJournal of Computational Biology, 2012
- Fast and accurate methods for phylogenomic analysesBMC Bioinformatics, 2011
- Species Tree Inference by Minimizing Deep CoalescencesPLoS Computational Biology, 2009
- Estimating Species Phylogenies Using Coalescence Times among SequencesSystematic Biology, 2009
- Gene tree discordance, phylogenetic inference and the multispecies coalescentTrends in Ecology & Evolution, 2009
- Detecting hybrid speciation in the presence of incomplete lineage sorting using gene tree incongruence: A modelTheoretical Population Biology, 2009
- STEM: species tree estimation using maximum likelihood for gene trees under coalescenceBioinformatics, 2009
- Rooted triple consensus and anomalous gene treesBMC Ecology and Evolution, 2008
- Discordance of Species Trees with Their Most Likely Gene TreesPLoS Genetics, 2006
- Gene Trees in Species TreesSystematic Biology, 1997