Nonrandom Tripeptide Sequence Distributions at Protein Carboxyl Termini
Open Access
- 1 April 2003
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 13 (4), 617-623
- https://doi.org/10.1101/gr.667603
Abstract
The availability of complete genome sequences enables the statistical analysis of sequence features without significant database-imposed bias. The carboxyl termini of proteins often contain regions associated with protein targeting and enhanced translational termination. We analyzed the frequency of occurrence of C-terminal tripeptides in representative archaeal, bacterial, and eukaryotic genomes. The sequence distribution in prokaryotic genomes nearly matches that generated by the randomization of the observed tripeptide set. In contrast, eukaryotic genomes contain large numbers of overrepresented sequences. Some of these correspond to highly repeated sequences from either duplicated endogenous genes or transposon open reading frames. Gratifyingly, others represent previously known targeting signals or sequences associated with an increase in translational termination efficiency. However, a number of overrepresented tripeptides have not been previously noted and may represent novel functional sequences. For example, the sequence XSS may enhance translational termination efficiency in plants, whereas FWC may be a targeting or processing signal for certain amino acid permeases in yeast.
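The comparison described in the abstract, observed C-terminal tripeptide counts against counts from a randomized version of the same set, can be illustrated with a minimal Python sketch. The abstract does not specify the randomization procedure, so the one used here (shuffling each of the three positions independently across all proteins) is an assumption, and the function names `cterminal_tripeptides`, `randomized_tripeptides`, and `compare_to_random` are hypothetical, not taken from the paper.

```python
import random
from collections import Counter


def cterminal_tripeptides(proteins):
    """Return the last three residues of every protein with at least 3 residues."""
    return [p[-3:] for p in proteins if len(p) >= 3]


def randomized_tripeptides(tripeptides, rng):
    """Shuffle each tripeptide position independently across the whole set.

    Assumption: this preserves the per-position residue composition while
    breaking any association between positions. The paper may randomize
    differently.
    """
    columns = [list(col) for col in zip(*tripeptides)]
    for col in columns:
        rng.shuffle(col)
    return ["".join(residues) for residues in zip(*columns)]


def compare_to_random(proteins, n_shuffles=100, seed=0):
    """Map each observed tripeptide to (observed count, mean randomized count)."""
    rng = random.Random(seed)
    observed_list = cterminal_tripeptides(proteins)
    observed = Counter(observed_list)
    randomized_total = Counter()
    for _ in range(n_shuffles):
        randomized_total.update(randomized_tripeptides(observed_list, rng))
    return {
        tri: (count, randomized_total[tri] / n_shuffles)
        for tri, count in observed.items()
    }


if __name__ == "__main__":
    # Toy input for illustration only; not real genome data.
    toy_proteins = ["MKTAYSKL", "MAAASKL", "MGGGSKL", "MPPPHDEL", "MQ"]
    for tri, (obs, rand_mean) in sorted(compare_to_random(toy_proteins).items()):
        print(tri, obs, round(rand_mean, 2))
```

A tripeptide whose observed count is well above its mean randomized count would be a candidate for the "overrepresented" class; any specific threshold or significance test is left open here because the abstract does not state one.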