Tandem Repeats in Protein Coding Regions of Primate Genes
Open Access
- 1 June 2002
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 12 (6), 909-915
- https://doi.org/10.1101/gr.138802
Abstract
Tandem repeats in GenBank primate nucleotide sequences annotated as protein coding regions are analyzed. It is found that only trinucleotide repeats show repeat enrichment well above the threshold of statistical significance. The statistics are improved by a simultaneous search for repeats on both the amino acid and nucleotide levels. The results of the analyses of natural sequences are interpreted by comparing them with the results of the computer simulation of the model dedicated to protein coding regions. According to the simulation results, a limited set of trinucleotides, that is, cgg, ccg, cag, and gaa repeats coding for polyalanine, polyglycine, polyproline, polyglutamine, and polylysine are prone to proliferation. It is also found that within the repeat regions slippage is more frequent by a factor of 10 than point mutations, whereas the ratio of silent versus recognizable point mutations is approximately the same as elsewhere in coding regions. The trinucleotide repeats cover slightly more than 0.3% of the protein coding regions of genes.Keywords
This publication has 14 references indexed in Scilit:
- Initial sequencing and analysis of the human genomeNature, 2001
- How genomic and developmental dynamics affect evolutionary processesBioEssays, 2000
- Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositionsNucleic Acids Research, 2000
- Microsatellites in Different Eukaryotic Genomes: Survey and AnalysisGenome Research, 2000
- GenBankNucleic Acids Research, 2000
- Codon usage tabulated from international DNA sequence databases: status for the year 2000Nucleic Acids Research, 2000
- Tendency for local repetitiveness in amino acid usages in modern proteinsJournal of Molecular Biology, 1999
- Evolution of simple sequence repeatsComputers & Chemistry, 1996
- Tandemly repeated pentanucleotides in DNA sequences of eucaryotesNucleic Acids Research, 1994
- Analysis of Apparent 1/ f α Spectrum in DNA SequencesEurophysics Letters, 1993