Learning Multiple Evolutionary Pathways from Cross-Sectional Data
- 1 July 2005
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 12 (6), 584-598
- https://doi.org/10.1089/cmb.2005.12.584
Abstract
We introduce a mixture model of trees to describe evolutionary processes that are characterized by the ordered accumulation of permanent genetic changes. The basic building block of the model is a directed weighted tree that generates a probability distribution on the set of all patterns of genetic events. We present an EM-like algorithm for learning a mixture model of K trees and show how to determine K with a maximum likelihood approach. As a case study, we consider the accumulation of mutations in the HIV-1 reverse transcriptase that are associated with drug resistance. The fitted model is statistically validated as a density estimator, and the stability of the model topology is analyzed. We obtain a generative probabilistic model for the development of drug resistance in HIV that agrees with biological knowledge. Further applications and extensions of the model are discussed.Keywords
This publication has 30 references indexed in Scilit:
- Extended spectrum of HIV-1 reverse transcriptase mutations in patients receiving multiple nucleoside analog inhibitorsAIDS, 2003
- Diversity and complexity of HIV-1 drug resistance: A bioinformatics approach to predicting phenotype from genotypeProceedings of the National Academy of Sciences, 2002
- Geno2pheno: interpreting genotypic HIV drug resistance testsIEEE Intelligent Systems, 2001
- Distance-Based Reconstruction of Tree Models for OncogenesisJournal of Computational Biology, 2000
- Inferring Tree Models for Oncogenesis from Comparative Genome Hybridization DataJournal of Computational Biology, 1999
- Bayesian Network ClassifiersMachine Learning, 1997
- Ordered Appearance of Zidovudine Resistance Mutations during Treatment of 18 Human Immunodeficiency Virus-Positive SubjectsThe Journal of Infectious Diseases, 1992
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981
- Approximating discrete probability distributions with dependence treesIEEE Transactions on Information Theory, 1968
- Optimum branchingsJournal of Research of the National Bureau of Standards Section B Mathematics and Mathematical Physics, 1967