Simultaneous Bayesian gene tree reconstruction and reconciliation analysis
- 7 April 2009
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 106 (14), 5714-5719
- https://doi.org/10.1073/pnas.0806251106
Abstract
We present GSR, a probabilistic model integrating gene duplication, sequence evolution, and a relaxed molecular clock for substitution rates, that enables genomewide analysis of gene families. The gene duplication and loss process is a major cause for incongruence between gene and species tree, and deterministic methods have been developed to explain such differences through tree reconciliations. Although probabilistic methods for phylogenetic inference have been around for decades, probabilistic reconciliation methods are far less established. Based on our model, we have implemented a Bayesian analysis tool, PrIME-GSR, for gene tree inference that takes a known species tree into account. Our implementation is sound and we demonstrate its utility for genomewide gene-family analysis by applying it to recently presented yeast data. We validate PrIME-GSR by comparing with previous analyses of these data that take advantage of gene order information. In a case study we apply our method to the ADH gene family and are able to draw biologically relevant conclusions concerning gene duplications creating key yeast phenotypes. On a higher level this shows the biological relevance of our method. The obtained results demonstrate the value of a relaxed molecular clock. Our good performance will extend to species where gene order conservation is insufficient.Keywords
This publication has 53 references indexed in Scilit:
- Birth-death prior on phylogeny and speed datingBMC Ecology and Evolution, 2008
- A burst of protein sequence evolution and a prolonged period of asymmetric evolution follow gene duplication in yeastGenome Research, 2007
- Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomesGenome Research, 2007
- Natural history and evolutionary principles of gene duplication in fungiNature, 2007
- Automatic genome-wide reconstruction of phylogenetic gene treesBioinformatics, 2007
- Relaxed Phylogenetics and Dating with ConfidencePLoS Biology, 2006
- The Yeast Gene Order Browser: Combining curated homology and syntenic context reveals gene fate in polyploid speciesGenome Research, 2005
- The modern molecular clockNature Reviews Genetics, 2003
- The rapid generation of mutation data matrices from protein sequencesBioinformatics, 1992
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981