Inferring Species Trees Directly from Biallelic Genetic Markers: Bypassing Gene Trees in a Full Coalescent Analysis
Top Cited Papers
Open Access
- 14 March 2012
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 29 (8), 1917-1932
- https://doi.org/10.1093/molbev/mss086
Abstract
The multispecies coalescent provides an elegant theoretical framework for estimating species trees and species demographics from genetic markers. However, practical applications of the multispecies coalescent model are limited by the need to integrate or sample over all gene trees possible for each genetic marker. Here we describe a polynomial-time algorithm that computes the likelihood of a species tree directly from the markers under a finite-sites model of mutation effectively integrating over all possible gene trees. The method applies to independent (unlinked) biallelic markers such as well-spaced single nucleotide polymorphisms, and we have implemented it in SNAPP, a Markov chain Monte Carlo sampler for inferring species trees, divergence dates, and population sizes. We report results from simulation experiments and from an analysis of 1997 amplified fragment length polymorphism loci in 69 individuals sampled from six species of Ourisia (New Zealand native foxglove).Keywords
This publication has 52 references indexed in Scilit:
- Bayesian Inference of Species Trees from Multilocus DataMolecular Biology and Evolution, 2009
- Gene tree discordance, phylogenetic inference and the multispecies coalescentTrends in Ecology & Evolution, 2009
- STEM: species tree estimation using maximum likelihood for gene trees under coalescenceBioinformatics, 2009
- Species delimitation and phylogeny of a New Zealand plant species radiationBMC Evolutionary Biology, 2009
- Importance sampling and the two-locus model with subdivided population structureAdvances in Applied Probability, 2008
- BEAST: Bayesian evolutionary analysis by sampling treesBMC Evolutionary Biology, 2007
- Almost Forgotten or Latest Practice? AFLP applications, analyses and advancesTrends in Plant Science, 2007
- Integration within the Felsenstein equation for improved Markov chain Monte Carlo methods in population geneticsProceedings of the National Academy of Sciences, 2007
- Neighbor-Net: An Agglomerative Method for the Construction of Phylogenetic NetworksMolecular Biology and Evolution, 2003
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981