Sequence analysis of the core gene of 14 hepatitis C virus genotypes.
- 16 August 1994
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 91 (17), 8239-8243
- https://doi.org/10.1073/pnas.91.17.8239
Abstract
We previously sequenced the 5' noncoding region of 44 isolates of hepatitis C virus (HCV), as well as the envelope 1 (E1) gene of 51 HCV isolates, and provided evidence for the existence of at least 6 major genetic groups consisting of at least 12 minor genotypes of HCV (i.e., genotypes I/1a, II/1b, III/2a, IV/2b, 2c, V/3a, 4a-4d, 5a, and 6a). We now report the complete nucleotide sequence of the putative core (C) gene of 52 HCV isolates that represent all of these 12 genotypes as well as two additional genotypes provisionally designated 4e and 4f that we identified in this study. The phylogenetic analysis of the C gene sequences was in agreement with that of the E1 gene sequences. A major division in the genetic distance was observed between HCV isolates of genotype 2 and those of the other genotypes in analysis of both the E1 and C genes. The C gene sequences of 9 genotypes have not been reported previously (i.e., genotypes 2c, 4a-4f, 5a, and 6a). Our analysis indicates that the C gene-based methods currently used to determine the HCV genotype, such as PCR with genotype-specific primers, should be revised in light of these data. We found that the predicted C gene was exactly 573 nt long in all 52 HCV isolates, with an N-terminal start codon and no in-frame stop codons. The nucleotide and predicted amino acid identities of the C gene sequences were in the range of 79.4-99.0% and 85.3-100%, respectively. Furthermore, we mapped universally conserved, as well as genotype-specific, nucleotide and deduced amino acid sequences of the C gene. The predicted C proteins of the different HCV genotypes shared the following features: (i) high content of proline residues, (ii) high content of arginine and lysine residues located primarily in three domains with 10 such residues invariant at positions 39-62, (iii) a cluster of 5 conserved tryptophan residues, (iv) two nuclear localization signals and a DNA-binding motif, (v) a potential phosphorylation site with a serine-proline motif, and (vi) three conserved hydrophilic domains that have been shown by others to contain immunogenic epitopes. Thus, we have extended analysis of the predicted C protein of HCV to all of the recognized genotypes, confirmed the existence of highly conserved regions of this important structural protein, and demonstrated that the genetic relatedness of HCV isolates is equivalent when analyzing the most conserved (i.e., C) and the most variable (i.e., E1) genes of the HCV genome.Keywords
This publication has 21 references indexed in Scilit:
- Classification of hepatitis C virus into six major genotypes and a series of subtypes by phylogenetic analysis of the NS-5 regionJournal of General Virology, 1993
- Characterization of the genomic sequence of type V (or 3a) hepatitis C virus isolates and PCR primers for specific detectionJournal of General Virology, 1993
- Typing of hepatitis C virus isolates and characterization of new subtypes using a line probe assayJournal of General Virology, 1993
- Analysis of the Putative E1 Envelope and NS4a Epitope Regions of HCV Type 3Biochemical and Biophysical Research Communications, 1993
- Sequence variability in the 5' non-coding region of hepatitis C virus: identification of a new virus type and restrictions on sequence diversityJournal of General Virology, 1993
- Two Distinct Subtypes of Hepatitis C Virus Defined by Antibodies Directed to the Putative Core ProteinHepatology, 1992
- Analysis of a new hepatitis C virus type and its phylogenetic relationship to existing variantsJournal of General Virology, 1992
- Typing hepatitis C virus by polymerase chain reaction with type-specific primers: application to clinical surveys and tracing infectious sourcesJournal of General Virology, 1992
- A new type of hepatitis C virus in patients in ThailandBiochemical and Biophysical Research Communications, 1992
- Nucleotide sequence of the genomic RNA of hepatitis C virus isolated from a human carrier: comparison with reported isolates for conserved and divergent regionsJournal of General Virology, 1991