Simple repetitive DNA sequences from primates: Compilation and analysis
- 1 February 1995
- journal article
- Published by Springer Nature in Journal of Molecular Evolution
- Vol. 40 (2), 120-126
- https://doi.org/10.1007/bf00167107
Abstract
Simple repeats composed of tandemly repeated units 1–6 nucleotides (nt) long have been extracted from a selected set of primate genomic DNA sequences. Of the 501 theoretically possible, different types of repeats only 67 were present in the analyzed database in at least two different size ranges over 12 nt. They include all simple repeats known to be polymorphic in the primate genome. A list of moderately expanding and nonexpanding oligonucleotide patterns has also been included. Furthermore, we have compiled statistical data with emphasis on the overall variability of the most abundant 67 types of repeats. We have demonstrated that the expandability of at least some simple repeats may be affected by the overall base composition and by flanking sequences. In particular, the occurrence of tandemly repeated CAG and GCC triplets in exons positively correlates with their G+C content. We also noted that in the vicinity of Alu sequences tetrameric repeats are more abundant than in the total genomic DNA. This paper can be used as a comprehensive guide in identification of the most abundant and potentially polymorphic simple repeats. It is also of broader significance as a step toward understanding the contribution of flanking sequences and the overall sequence composition to variability of simple repeats.Keywords
This publication has 24 references indexed in Scilit:
- Repetitive DNA in and around translocation breakpoints of the Philadelphia chromosomeGene, 1994
- Expansion of an unstable trinucleotide CAG repeat in spinocerebellar ataxia type 1Nature Genetics, 1993
- A novel gene containing a trinucleotide repeat that is expanded and unstable on Huntington's disease chromosomesCell, 1993
- Human genes containing polymorphic trinucleotide repeatsNature Genetics, 1992
- Survey of human and rat microsatellitesGenomics, 1992
- Molecular basis of myotonic dystrophy: Expansion of a trinucleotide (CTG) repeat at the 3′ end of a transcript encoding a protein kinase family memberCell, 1992
- Variation of the CGG repeat at the fragile X site results in genetic instability: Resolution of the Sherman paradoxCell, 1991
- Identification of a gene (FMR-1) containing a CGG repeat coincident with a breakpoint cluster region exhibiting length variation in fragile X syndromeCell, 1991
- Structure and polymorphism of human telomere-associated DNACell, 1990
- Informativeness of human (dC-dA)n · (dG-dT)n polymorphismsGenomics, 1990