Improvements to a Class of Distance Matrix Methods for Inferring Species Trees from Gene Trees

1 June 2012

journal article
research article
Published by Mary Ann Liebert Inc in Journal of Computational Biology

Vol. 19 (6), 632-649
https://doi.org/10.1089/cmb.2012.0042

Abstract

Among the methods currently available for inferring species trees from gene trees, the GLASS method of Mossel and Roch (2010), the Shallowest Divergence (SD) method of Maddison and Knowles (2006), the STEAC method of Liu et al. (2009), and a related method that we call Minimum Average Coalescence (MAC) are computationally efficient and provide branch length estimates. Further, GLASS and STEAC have been shown to be consistent estimators of tree topology under a multispecies coalescent model. However, divergence time estimates obtained with these methods are all systematically biased under the model because the pairwise interspecific gene divergence times on which they rely must be more ancient than the species divergence time. Jewett and Rosenberg (2012) derived an expression for the bias of GLASS and used it to propose an improved method that they termed iGLASS. Here, we derive the biases of SD, STEAC, and MAC, and we propose improved analogues of these methods that we call iSD, iSTEAC, and iMAC. We conduct simulations to compare the performance of these methods with their original counterparts and with GLASS and iGLASS, finding that each of them decreases the bias and mean squared error of pairwise divergence time estimates. The new methods can therefore contribute to improvements in the estimation of species trees from information on gene trees.

Keywords

This publication has 16 references indexed in Scilit:

iGLASS: An Improvement to the GLASS Method for Estimating Species Trees from Gene Trees
Journal of Computational Biology, 2012
Fast and accurate methods for phylogenomic analyses
BMC Bioinformatics, 2011
Species Tree Inference by Minimizing Deep Coalescences
PLoS Computational Biology, 2009
Estimating Species Phylogenies Using Coalescence Times among Sequences
Systematic Biology, 2009
Gene tree discordance, phylogenetic inference and the multispecies coalescent
Trends in Ecology & Evolution, 2009
Detecting hybrid speciation in the presence of incomplete lineage sorting using gene tree incongruence: A model
Theoretical Population Biology, 2009
STEM: species tree estimation using maximum likelihood for gene trees under coalescence
Bioinformatics, 2009
Rooted triple consensus and anomalous gene trees
BMC Ecology and Evolution, 2008
Discordance of Species Trees with Their Most Likely Gene Trees
PLoS Genetics, 2006
Gene Trees in Species Trees
Systematic Biology, 1997

Cited by 17 articles