Molecular organization of Junin virus S RNA: complete nucleotide sequence, relationship with other members of the Arenaviridae and unusual secondary structures

Abstract
In this study, overlapping cDNA clones covering the entire S RNA molecule of Junin virus, an arenavirus that causes Argentine haemorrhagic fever, were generated. The complete sequence of this 3400 nucleotide RNA was determined using the dideoxynucleotide chain termination method. The nucleocapsid protein (N) and the glycoprotein precursor (GPC) genes were identified as two non-overlapping open reading frames of opposite polarity, encoding primary translation products of 564 and 481 amino acids, respectively. Intracellular processing of the latter yields the glycoproteins found in the viral envelope. Comparison of the Junin virus N protein with the homologous proteins of other arenaviruses indicated that amino acid sequences are conserved, the identity ranging from 46 to 76%. The N-terminal half of GPC exhibits an even higher degree of conservation (54 to 82%), whereas the C-terminal half is less conserved (21 to 50%). In all comparisons the highest level of amino acid sequence identity was seen when Junin virus and Tacaribe virus sequences were aligned. The nucleotide sequence at the 5′ end of Junin virus S RNA is not identical to that determined of the other sequenced arenaviruses. However, it is complementary to the 3′-terminal sequences and may form a very stable panhandle structure (ΔG -242.7 kJ/mol) involving the complete non-coding regions upstream from both the N and GPC genes. In addition, a distinct secondary structure was identified in the intergenic region, downstream from the coding sequences; Junin virus S RNA shows a potential secondary structure consisting of two hairpin loops (ΔG -163.2 and -239.3 kJ/mol) instead of the single hairpin loop that is usually found in other arenaviruses. The analysis of the arenavirus S RNA nucleotide sequences and their encoded products is discussed in relation to structure and function.