Conserved sequence features of inteins (protein introns) and their use in identifying new inteins and related proteins
Open Access
- 1 December 1994
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 3 (12), 2340-2350
- https://doi.org/10.1002/pro.5560031218
Abstract
Inteins (protein introns) are internal portions of protein sequences that are posttranslationally excised while the flanking regions are spliced together, making an additional protein product. Inteins have been found in a number of homologous genes in yeast, mycobacteria, and extreme thermophile archaebacteria. The inteins are probably multifunctional, autocatalyzing their own splicing, and some were also shown to be DNA endonucleases. The splice junction regions and two regions similar to homing endonucleases were thought to be the only common sequence features of inteins. This work analyzed all published intein sequences with recently developed methods for detecting weak, conserved sequence features. The methods complemented each other in the identification and assessment of several patterns characterizing the intein sequences. New intein conserved features are discovered and the known ones are quantitatively described and localized. The general sequence description of all the known inteins is derived from the motifs and their relative positions. The intein sequence description is used to search the sequence databases for intein‐like proteins. A sequence region in a mycobacterial open reading frame possessing all of the intein motifs and absent from sequences homologous to both of its flanking sequences is identified as an intein. A newly discovered putative intein in red algae chloroplasts is found not to contain the endonuclease motifs present in all other inteins. The yeast HO endonuclease is found to have an overall intein‐like structure and a few viral polyprotein cleavage sites are found to be significantly similar to the inteins amino‐end splice junction motif. The intein features described may serve for detection of intein sequences.Keywords
This publication has 57 references indexed in Scilit:
- Detecting Patterns in Protein SequencesJournal of Molecular Biology, 1994
- Large ATP synthase operon of the red alga Antithamnion sp. resembles the corresponding operon in cyanobacteriaJournal of Molecular Biology, 1992
- Prokaryotic polyprotein precursorsFEBS Letters, 1992
- Protein Splicing Converts the Yeast TFP1 Gene Product to the 69-kdDSubunit of the Vacuolar H + -Adenosine TriphosphataseScience, 1990
- Sequence logos: a new way to display consensus sequencesNucleic Acids Research, 1990
- Group I introns as mobile genetic elements: Facts and mechanistic speculations — a reviewGene, 1989
- Weights for data related by a treeJournal of Molecular Biology, 1989
- Information content of binding sites on nucleotide sequencesJournal of Molecular Biology, 1986
- Two intron sequences in yeast mitochondrial COX1 gene: Homology among URF-containing introns and strain-dependent variation in flanking exonsCell, 1983
- Homothallic switching of yeast mating type cassettes is initiated by a double-stranded cut in the MAT locusCell, 1982