DPRml: distributed phylogeny reconstruction by maximum likelihood
- 28 October 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (7), 969-974
- https://doi.org/10.1093/bioinformatics/bti100
Abstract
Motivation: In recent years there has been increased interest in producing large and accurate phylogenetic trees using statistical approaches. However for a large number of taxa, it is not feasible to construct large and accurate trees using only a single processor. A number of specialized parallel programs have been produced in an attempt to address the huge computational requirements of maximum likelihood. We express a number of concerns about the current set of parallel phylogenetic programs which are currently severely limiting the widespread availability and use of parallel computing in maximum likelihood-based phylogenetic analysis.Results: We have identified the suitability of phylogenetic analysis to large-scale heterogeneous distributed computing. We have completed a distributed and fully cross-platform phylogenetic tree building program called distributed phylogeny reconstruction by maximum likelihood. It uses an already proven maximum likelihood-based tree building algorithm and a popular phylogenetic analysis library for all its likelihood calculations. It offers one of the most extensive sets of DNA substitution models currently available. We are the first, to our knowledge, to report the completion of a distributed phylogenetic tree building program that can achieve near-linear speedup while only using the idle clock cycles of machines. For those in an academic or corporate environment with hundreds of idle desktop machines, we have shown how distributed computing can deliver a ‘free’ ML supercomputer.Availability: The software (and user manual) is publicly available under the terms of the GNU general public licence from the system webpage at http://www.cs.may.ie/distributedContact: tom.naughton@may.ieKeywords
This publication has 22 references indexed in Scilit:
- Genomic Databases and the Tree of LifeScience, 2004
- Condor and the GridPublished by Wiley ,2003
- Distributed Java Platform with Programmable MIMD CapabilitiesLecture Notes in Computer Science, 2003
- Improvement of Distance-Based Phylogenetic Methods by a Local Maximum Likelihood Approach Using TripletsMolecular Biology and Evolution, 2002
- SETI@home-massively distributed computing for SETIComputing in Science & Engineering, 2001
- Phylogenetic AnalysisPublished by Wiley ,1998
- fastDNAml: a tool for construction of phylogenetic trees of DNA sequences using maximum likelihoodBioinformatics, 1994
- Two strikes against perfect phylogenyLecture Notes in Computer Science, 1992
- Dating of the human-ape splitting by a molecular clock of mitochondrial DNAJournal of Molecular Evolution, 1985
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981