iGTP: A software package for large-scale gene tree parsimony analysis
Open Access
- 23 November 2010
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 11 (1), 574
- https://doi.org/10.1186/1471-2105-11-574
Abstract
Background: The ever-increasing wealth of genomic sequence information provides an unprecedented opportunity for large-scale phylogenetic analysis. However, species phylogeny inference is obfuscated by incongruence among gene trees due to evolutionary events such as gene duplication and loss, incomplete lineage sorting (deep coalescence), and horizontal gene transfer. Gene tree parsimony (GTP) addresses this issue by seeking a species tree that requires the minimum number of evolutionary events to reconcile a given set of incongruent gene trees. Despite its promise, the use of gene tree parsimony has been limited by the fact that existing software is either not fast enough to tackle large data sets or is restricted in the range of evolutionary events it can handle. Results: We introduce iGTP, a platform-independent software program that implements state-of-the-art algorithms that greatly speed up species tree inference under the duplication, duplication-loss, and deep coalescence reconciliation costs. iGTP significantly extends and improves the functionality and performance of existing gene tree parsimony software and offers advanced features such as building effective initial trees using stepwise leaf addition and the ability to have unrooted gene trees in the input. Moreover, iGTP provides a user-friendly graphical interface with integrated tree visualization software to facilitate analysis of the results. Conclusions: iGTP enables, for the first time, gene tree parsimony analyses of thousands of genes from hundreds of taxa using the duplication, duplication-loss, and deep coalescence reconciliation costs, all from within a convenient graphical user interface.Keywords
This publication has 42 references indexed in Scilit:
- Maximum likelihood models and algorithms for gene tree evolution with duplications and lossesBMC Bioinformatics, 2011
- An ILP solution for the gene duplication problemBMC Bioinformatics, 2011
- Genome-Scale Phylogenetics: Inferring the Plant Tree of Life from 18,896 Gene TreesSystematic Biology, 2010
- Efficient genome-scale phylogenetic analysis under the duplication-loss and deep coalescence cost modelsBMC Bioinformatics, 2010
- Simultaneous Bayesian gene tree reconstruction and reconciliation analysisProceedings of the National Academy of Sciences, 2009
- STEM: species tree estimation using maximum likelihood for gene trees under coalescenceBioinformatics, 2009
- Gene Family Evolution by Duplication, Speciation, and LossJournal of Computational Biology, 2008
- Efficient inference of bacterial strain trees from genome-scale multilocus dataBioinformatics, 2008
- A Phylogenomic Approach to Bacterial Phylogeny: Evidence of a Core of Genes Sharing a Common HistoryGenome Research, 2002
- Gene Trees in Species TreesSystematic Biology, 1997