Nucleotide Sequence of Simian Virus 40 DNA: Structure of the Middle Segment of the HindII + III Restriction Fragment B (Sixth Part of the T Antigen Gene and Codon Usage)

Abstract
We report here the nucleotide sequence of the simian virus 40 DNA region that lies between the EcoRII restriction endonuclease cleavage sites at map positions 0.214 and 0.281. The sequence was determined by partial chemical degradation of terminally labeled DNA fragments according to the procedure of Maxam and Gilbert. This region represents 6.7% of the SV40 genome and is located in the middle of HindII + III restriction fragment B. It is expressed as part of the early 19-S messenger RNA, which codes for the large-T antigen protein. Only one open reading frame for translation can be deduced from the message strand of the DNA and this reading frame connects in phase with the one of both neighboring fragments. This publication is the last in a series of papers about the T-antigen gene, and several properties of this gene and its product are discussed. The non-randomness of codon usage is similar to that previously discussed for the late part of the genome. Moreover, it appears that the choice of a third letter can be determined by the nature of the following codon; some codons which start with a pyrimidine are almost never preceded by an adenosine and some ANN-type codons are almost never preceded by a guanosine.