Models of coding sequence evolution
Open Access
- 8 October 2008
- journal article
- review article
- Published by Oxford University Press (OUP) in Briefings in Bioinformatics
- Vol. 10 (1), 97-109
- https://doi.org/10.1093/bib/bbn049
Abstract
Probabilistic models of sequence evolution are in widespread use in phylogenetics and molecular sequence evolution. These models have become increasingly sophisticated and combined with statistical model comparison techniques have helped to shed light on how genes and proteins evolve. Models of codon evolution have been particularly useful, because, in addition to providing a significant improvement in model realism for protein-coding sequences, codon models can also be designed to test hypotheses about the selective pressures that shape the evolution of the sequences. Such models typically assume a phylogeny and can be used to identify sites or lineages that have evolved adaptively. Recently some of the key assumptions that underlie phylogenetic tests of selection have been questioned, such as the assumption that the rate of synonymous changes is constant across sites or that a single phylogenetic tree can be assumed at all sites for recombining sequences. While some of these issues have been addressed through the development of novel methods, others remain as caveats that need to be considered on a case-by-case basis. Here, we outline the theory of codon models and their application to the detection of positive selection. We review some of the more recent developments that have improved their power and utility, laying a foundation for further advances in the modeling of coding sequence evolution.Keywords
This publication has 91 references indexed in Scilit:
- Directionality in the evolution of influenza A haemagglutininProceedings Of The Royal Society B-Biological Sciences, 2008
- A Maximum Likelihood Method for Detecting Directional Evolution in Protein Sequences and Its Application to Influenza A VirusMolecular Biology and Evolution, 2008
- Presence and role of cytosine methylation in DNA viruses of animalsNucleic Acids Research, 2008
- A Site- and Time-Heterogeneous Model of Amino Acid ReplacementMolecular Biology and Evolution, 2008
- Selecton 2007: advanced models for detecting positive and purifying selection using a Bayesian inference approachNucleic Acids Research, 2007
- PAML 4: Phylogenetic Analysis by Maximum LikelihoodMolecular Biology and Evolution, 2007
- Bursts of nonsynonymous substitutions in HIV-1 evolution reveal instances of positive selection at conservative protein sitesProceedings of the National Academy of Sciences, 2006
- A Dirichlet process model for detecting positive selection in protein-coding DNA sequencesProceedings of the National Academy of Sciences, 2006
- Hearing silence: non-neutral evolution at synonymous sites in mammalsNature Reviews Genetics, 2006
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981