Assessing the Effect of Selection at the Amino Acid Level in Malaria Antigen Sequences Through Bayesian Generalized Linear Models
- 1 December 2008
- journal article
- Published by Informa UK Limited in Journal of the American Statistical Association
- Vol. 103 (484), 1496-1507
- https://doi.org/10.1198/016214508000000850
Abstract
We present a statistical approach for identifying residues in DNA sequences for which diversity may be maintained by natural selection. Bayesian generalized linear models (GLMs) are used to describe patterns of mutation in a DNA sequence alignment. Posterior distributions of key quantities, such as probabilities of nonsynonymous and synonymous mutations per site, are studied. Inference in this class of models is achieved through customary Markov chain Monte Carlo methods. Model selection is dealt with by means of a minimum posterior predictive loss approach. We describe how information on the evolutionary process underlying the sequences can be formally incorporated into the models through structured priors. The proposed methodology was designed to analyze several DNA sequences encoding the vaccine candidate apical membrane antigen-1 (AMA-1) of the human malaria parasite Plasmodium falciparum. The study of genetic variability in antigen sequences is relevant to determining whether a particular antigen is a viable target for a vaccine construct. Using a simulation study, we first compare the GLM-based approach to existing methods for detecting sites under selection that are based on stochastic models of sequence evolution. We then apply the proposed models to the AMA-1 sequence data, which allows us to identify residues with the greatest disparities between nonsynonymous and synonymous changes. Recent experimental evidence suggests that several of these residues are immunologically relevant, indicating that the proposed models may be used predictively to identify functionally significant residues in antigens for which experimental results are not yet available.Keywords
This publication has 17 references indexed in Scilit:
- A Dirichlet process model for detecting positive selection in protein-coding DNA sequencesProceedings of the National Academy of Sciences of the United States of America, 2006
- Structure of AMA1 from Plasmodium falciparum reveals a clustering of polymorphisms that surround a conserved hydrophobic pocketProceedings of the National Academy of Sciences, 2005
- Not So Different After All: A Comparison of Methods for Detecting Amino Acid Sites Under SelectionMolecular Biology and Evolution, 2005
- Assessing the effect of natural selection in malaria parasitesTrends in Parasitology, 2004
- Bayesian Estimation of Positively Selected SitesJournal of Molecular Evolution, 2004
- MRBAYES: Bayesian inference of phylogenetic treesBioinformatics, 2001
- Polymorphism in the gene encoding the apical membrane antigen-1 (AMA-1) of Plasmodium falciparum. X. Asembo Bay Cohort ProjectMolecular and Biochemical Parasitology, 2001
- Methods to Detect Selection in Populations with Applications to the HumanAnnual Review of Genomics and Human Genetics, 2000
- Model choice: a minimum posterior predictive loss approachBiometrika, 1998
- The Multinomial-Poisson TransformationJournal of the Royal Statistical Society: Series D (The Statistician), 1994