Nucleotide sequence of the genes III. VI and I of bacteriophage M13

Abstract
A DNA region of 2750 base pairs encompassing the genes III, VI and I of bacteriophage M13 has been sequenced by the Maxam-Gilbert procedure. By establishing the nucleotide changes introduced by several amber mutations, the coding region and the regulatory signals of each gene have been deduced. The genes appear to span 1275 base pairs (gene III; mol.wt. 44,748) 339 base pairs (gene VI; mol.wt. 12,264) and 1047 base pairs (gene I; mol. wt. 39,500). Their separating non-codogenic regions are extremely short, namely two and one base pair, respectively. The C-terminal end of gene I, however, intrudes 23 nucleotides into gene IV. From the nucleotide sequence it appears that the minor capsid protein of the phage, which is encoded by gene III is synthesized in a precursor form containing 18 extra amino acids at its N-terminal end. Furthermore, in this capsid protein two clusters of a fourfold repeat of the sequence Glu-Gly-Gly-Gly-Ser are apparent. Gene VI appears to code for a small, extremely hydrophobic polypeptide. Its total hydrophobic amino acids content of 51% suggests that this protein can only function in the host cell membrane.