Models of coding sequence evolution

Open Access

8 October 2008

journal article
review article
Published by Oxford University Press (OUP) in Briefings in Bioinformatics

Vol. 10 (1), 97-109
https://doi.org/10.1093/bib/bbn049

Abstract

Probabilistic models of sequence evolution are in widespread use in phylogenetics and molecular sequence evolution. These models have become increasingly sophisticated and combined with statistical model comparison techniques have helped to shed light on how genes and proteins evolve. Models of codon evolution have been particularly useful, because, in addition to providing a significant improvement in model realism for protein-coding sequences, codon models can also be designed to test hypotheses about the selective pressures that shape the evolution of the sequences. Such models typically assume a phylogeny and can be used to identify sites or lineages that have evolved adaptively. Recently some of the key assumptions that underlie phylogenetic tests of selection have been questioned, such as the assumption that the rate of synonymous changes is constant across sites or that a single phylogenetic tree can be assumed at all sites for recombining sequences. While some of these issues have been addressed through the development of novel methods, others remain as caveats that need to be considered on a case-by-case basis. Here, we outline the theory of codon models and their application to the detection of positive selection. We review some of the more recent developments that have improved their power and utility, laying a foundation for further advances in the modeling of coding sequence evolution.

Keywords

This publication has 91 references indexed in Scilit:

Directionality in the evolution of influenza A haemagglutinin
Proceedings Of The Royal Society B-Biological Sciences, 2008
A Maximum Likelihood Method for Detecting Directional Evolution in Protein Sequences and Its Application to Influenza A Virus
Molecular Biology and Evolution, 2008
Presence and role of cytosine methylation in DNA viruses of animals
Nucleic Acids Research, 2008
A Site- and Time-Heterogeneous Model of Amino Acid Replacement
Molecular Biology and Evolution, 2008
Selecton 2007: advanced models for detecting positive and purifying selection using a Bayesian inference approach
Nucleic Acids Research, 2007
PAML 4: Phylogenetic Analysis by Maximum Likelihood
Molecular Biology and Evolution, 2007
Bursts of nonsynonymous substitutions in HIV-1 evolution reveal instances of positive selection at conservative protein sites
Proceedings of the National Academy of Sciences, 2006
A Dirichlet process model for detecting positive selection in protein-coding DNA sequences
Proceedings of the National Academy of Sciences, 2006
Hearing silence: non-neutral evolution at synonymous sites in mammals
Nature Reviews Genetics, 2006
Evolutionary trees from DNA sequences: A maximum likelihood approach
Journal of Molecular Evolution, 1981

Cited by 66 articles