|Home | About | Journals | Submit | Contact Us | Français|
The complete DNA sequence of herpes B virus (Cercopithecine herpesvirus 1) strain E2490, isolated from a rhesus macaque, was determined. The total genome length is 156,789 bp, with 74.5% G+C composition and overall genome organization characteristic of alphaherpesviruses. The first and last residues of the genome were defined by sequencing the cloned genomic termini. There were six origins of DNA replication in the genome due to tandem duplication of both oriL and oriS regions. Seventy-four genes were identified, and sequence homology to proteins known in herpes simplex viruses (HSVs) was observed in all cases but one. The degree of amino acid identity between B virus and HSV proteins ranged from 26.6% (US5) to 87.7% (US15). Unexpectedly, B virus lacked a homolog of the HSV γ134.5 gene, which encodes a neurovirulence factor. Absence of this gene was verified in two low-passage clinical isolates derived from a rhesus macaque and a zoonotically infected human. This finding suggests that B virus most likely utilizes mechanisms distinct from those of HSV to sustain efficient replication in neuronal cells. Despite the considerable differences in G+C content of the macaque and B virus genes (51% and 74.2%, respectively), codons used by B virus are optimal for the tRNA population of macaque cells. Complete sequence of the B virus genome will certainly facilitate identification of the genetic basis and possible molecular mechanisms of enhanced B virus neurovirulence in humans, which results in an 80% mortality rate following zoonotic infection.
Comparative genome analyses of closely related viruses offer insight into the protein coding capacities of viral genomes (18, 69), a means to biologically classify viruses (2, 19) and identify viral genes involved in virulence and pathogenicity (1, 37, 67). The number of completely sequenced viral genomes has been increasing rapidly and now exceeds 1,000, including 29 herpesvirus genomes (GenBank data, http://www.ncbi.nlm.nih.gov/PMGifs/Genomes/viruses.html).
B virus (Cercopithecine herpesvirus 1, monkey B virus) is a member of the subfamily Alphaherpesvirinae, which together with human herpes simplex virus types 1 and 2 (HSV-1 and HSV-2) constitutes the genus Simplexvirus. B virus generally causes only mild localized or asymptomatic infections in its natural hosts, Asian monkeys of the genus Macaca (33, 34, 74). In contrast, B virus infections in foreign hosts, humans or monkey species other than macaques, often result in encephalitis, encephalomyelitis, and death (53, 73, 74).
The genome organization of B virus is similar to that of HSV-1 and HSV-2: the unique long (UL) and unique short (US) segments flanked by inverted long (RL) and short (RS) repeat sequences are covalently joined in four possible isomeric configurations (26). Sequence analysis of the partial US regions of B virus and simian agent 8 virus demonstrated that human and primate viruses are colinear in this genomic segment (51). However, only a limited number of B virus gene sequences from the UL region have been published (3, 36, 62, 63), and nothing has been reported about the structure and the gene content of the repeat genomic elements.
The size of the B virus genome was estimated previously as 165 kb (26) and 162.5 kb (40), which is significantly larger than the HSV-1 and HSV-2 genomes (152 and 155 kb, respectively). Extra DNA might contain additional genes that are not present in human viruses and may provide insight about B virus pathogenicity in foreign hosts.
In this study, the complete genomic sequence of the B virus reference strain E2490, isolated from a rhesus macaque (Macaca mulatta), was determined, and viral DNA termini were located. The genomic structure and gene content were analyzed and compared to those of human herpesviruses HSV-1 and HSV-2.
B virus laboratory strain E2490 and two B virus clinical isolates were obtained from the National B Virus Resource Laboratory (Atlanta, Ga.). Viruses were propagated in Vero cells maintained in minimal essential medium (MEM) supplemented with 2% fetal bovine serum (Gibco). Viral DNA was purified from the infected-cell lysate by ultracentrifugation in an NaI density gradient as described previously (70). Escherichia coli Top10 (Invitrogen) cells were used for propagation of plasmids.
The BamHI, KpnI, SalI, PstI, and XhoI restriction fragments of B virus genomic DNA were cloned into the pUC19 vector digested with the same enzyme. The following three techniques were employed to generate full sequences of the clones: primer walking, unidirectional deletions, and random transposon insertions (Epicentre Technologies).
To obtain the sequence of the IRL-IRS regions, two shotgun libraries of random overlapping clones were prepared from the adjacent PstI fragments P13 and P1, covering this region (Fig. (Fig.1).1). DNA fragments of 13 kb (P13) and 4.8 kb (P1) were isolated after separation of PstI-digested genomic DNA on a 0.8% agarose gel. Each DNA fragment was then treated with DNase I from a DNaseI Shotgun kit (Novagen) according to the manufacturer's suggested protocol. The DNase I-treated fragments were fractionated on a 1.4% agarose gel, and then 300- to 1,000-bp fragments were eluted from the gel, blunt ended with T4 DNA polymerase, dA-tailed with Tth polymerase and a dA tailing kit (Novagen), and ligated into the pScreen T vector (Novagen). The ligation mixtures were transformed into E. coli NovaBlue cells (Novagen).
DNA sequencing was performed with BigDye Terminator (PE Applied Biosystems) sequencing chemistry on an automatic DNA sequencer ABI 377 (Applied Biosystems). Sequencing reactions were performed in the presence of Sequence Enhancer A (Gibco) to resolve sequence compressions due to the high G+C content of templates. A 3:1 mixture of regular and dGTP BigDye sequencing kits (Applied Biosystems) was used in order to overcome premature termination of sequencing reactions in G-rich regions. DNAStar SeqMan program (DNAStar Inc.) was used to assemble the complete genome sequence from the sequences of the overlapping plasmid clones. For adjacent fragments with no overlap, areas across the junction were amplified with B virus genomic DNA as the template, and the resulting PCR fragments were sequenced.
B virus genomic DNA (≈0.5 μg) was blunt ended by treatment with 5 U of T4 DNA polymerase (Novagen) as suggested by the manufacturer. After purification on a Qiaquick spin column (Qiagen), the blunt-ended DNA was digested overnight with restriction enzyme SphI and ligated into SmaI- and SphI-digested vector pUC19. E. coli NovaBlue (Novagen) competent cells were transformed with the ligation mixture.
SphI-digested genomic DNA was transferred from a 0.8% agarose gel onto a nylon membrane (Roche) by alkali blotting according to the instructions supplied with the membranes. Plasmid DNA was labeled with digoxigenin with a HighPrime labeling kit (Roche). Prehybridization, hybridization, and washing steps were carried out according to standard protocols (59). Chemiluminescence detection of the bound probe was performed with an ECL detection kit (Amersham Pharmacia Biotech).
Identification of open reading frames (ORFs), repeats, and DNA regulatory sequences was performed with DNAStar suite of programs. GenBank database searches were carried out with BlastN, BlastP, and BlastX with default settings. Multiple alignments between B virus, HSV-1, and HSV-2 genes and proteins were performed with and analyzed by DNAStar MegAlign program (version 4.0.3) with the PAM250 amino acid substitution matrix and Joltun Hein method (30) and the following parameters: gap opening penalty = 11 and gap extension penalty = 3. Identity values were determined from the generated alignments.
The GeneMark and GeneMark.hmm gene-finding programs were used to refine the procedure of gene identification (4, 41). The parameters of the statistical models were defined by training on the set of B virus ORFs encoding protein products homologous to known HSV-1 and HSV-2 proteins. These statistical methods are able to identify frameshifts and unusual gene starts in addition to detecting genes not found by similarity search. To prove that a homolog of the HSV neurovirulence factor is absent in the B virus genome, BlastP and PSI-Blast searches with ICP34.5 protein as the query were used against the whole B virus genome translated in six frames. In addition, a Hidden Markov Model profile was created from the conservative domain of the ICP34.5 protein and used to scan the translated B virus genome.
In a search for unique genes, the PSI-Blast and RPS-Blast programs were used to search for possible conserved domains in proteins predicted by statistical methods. Predicted proteins shorter than 20 residues were excluded from the analysis. Analyses of protein structures were performed with the DNAStar Protean program (version 4.0.3), signal peptide prediction program SignalP version 2.0.b2 (http://www.cbs.dtu.dk/services/SignalP-2.0/), and transmembrane prediction programs TMHMM version 2.0 (http://www.cbs.dtu.dk/services/TMHMM/), DAS (http://www.sbc.su.se/≈miklos/DAS/), and TMpred (http://www.ch.embnet.org/software/TMPRED_form.html).
The complete sequence of the B virus genome (GenBank accession number AF533768) and the L-terminal sequences of two clinical B virus isolates (GenBank accession numbers AY230747 and AY230748) have been deposited in the NCBI database.
The relative positions of all fragments used for B virus genome sequencing are shown in Fig. Fig.1.1. The UL and US genomic sequences were derived from the nucleotide sequences of the plasmid-cloned BamHI, KpnI, SalI, PstI, and XhoI overlapping genomic fragments. The sequences of the internal copies of the large and small repeat regions (IRL and IRS, respectively) were determined from the nucleotide sequences of short overlapping clones derived from shotgun libraries prepared from the adjacent P13 and P1 genomic fragments. Terminal copies of repeat elements (TRL and TRS) were assumed to be identical to the internal copies, as in the HSV-1 and HSV-2 genomes (18, 43, 44). An assembly of 1,389 single readings resulted in generation of the UL-IRL-IRS-US genomic sequence comprising 139,269 bp, with an average redundancy of 3.9. Exact UL-TRL, UL-IRL, US-IRS, and US-TRS boundaries were determined by sequence comparisons of PCR fragments amplified across these junctions.
To determine the first and last nucleotides in the B virus genome, genomic termini were defined. Initially, the locations of the internal a sequence and the joint between the B virus long and short internal repeat regions (L/S joint) were estimated after aligning the HSV-1 and B virus IRL-IRS sequences (not shown). Two SphI restriction sites were located approximately 1.8 kb (in the IRL region) and 1.5 kb (in the IRS region) from the predicted internal a sequence. Since the RL and RS internal and terminal copies in alphaherpesvirus genomes are identical, SphI restriction sites were expected to be located ≈1.8 kb and ≈1.5 kb from the B virus L and S genomic termini, respectively, and therefore could be used for cloning the B virus genomic ends.
Blunt-ended B virus genomic DNA was then digested with SphI and ligated into SmaI- and SphI-digested vector pUC19. All internal genomic fragments had SphI sticky ends on both sides, and consequently only terminal genomic fragments, which had one blunt end and another SphI end, were compatible with the prepared vector and successfully cloned by this procedure. As predicted, two sets of recombinant clones were generated with inserts of the estimated sizes. To confirm that the clones isolated included viral DNA ends, plasmid DNA from clones containing 1.5-kb and 1.8-kb inserts was used to probe SphI-digested B virus genomic DNA on Southern blots. Theoretically, if these clones indeed contain genomic ends, they will hybridize to the two SphI genomic fragments of 1.5 kb or 1.8 kb (terminal) and 3.1 kb (internal from the L/S junction). These fragments and two additional 2.1-kb and 3.3-kb fragments of lower intensity were detected (Fig. (Fig.2A).2A). The observed cross-hybridization patterns indicated that the two genomic ends had common sequences. The additional fragments are most likely L-terminal (2.1 kb) and junction (3.3 kb) fragments containing an extra copy of a sequence. The variable number of a sequence repeats has also been described at the L terminus and L/S junction in HSV-1 (57).
Nucleotide sequences adjacent to the vector cloning site SmaI (genomic termini) were derived from the inserts of 10 selected clones: six clones originated from the large terminal repeat (L-terminal clones) and four originated from the small terminal repeat (S-terminal clones) (Fig. (Fig.2B).2B). All contained identical sequences (223-bp long) present in direct orientation on both genomic ends and in reverse orientation at the L/S joint in either one (nine clones) or two (one clone) directly repeated copies. The B virus 223-bp terminal sequence is a homolog of the HSV-1 a sequence that is also present in other herpesvirus genomes. Two cis-acting elements, pac1 and pac2, that are required for cleavage and packaging of herpesvirus genomes exist in the a sequence (17, 68).
Although overall sequence homology between the a sequences of B virus, HSV-1, and HSV-2 was low, the sequence composition of the cleavage and packaging signals was very well conserved (Fig. (Fig.2C).2C). An A-rich motif followed by three CGCGGCG motifs formed the B virus pac2 signal. Interestingly, most herpesviruses have only one copy of the CGCGGCG motif, which contributes significantly to the efficiency of genome cleavage and packaging (45). The B virus pac1 contained three conserved motifs, an A-rich region flanked by two stretches of seven C residues and 13 C+G residues. Both pac1 and pac2 are located at conserved distances from the genomic ends (17), 31 bp from the L terminus and 34 bp from the S terminus of B virus DNA. There were no DR2 repeats between the pac1 and pac2 signals in the B virus a sequence, which were thus noticeably shorter than the HSV-1 a sequence.
The complete B virus genomic sequence was assembled according to the HSV-1 prototype genome structure (57). The genome of B virus is 156,789 bp in length, considerably shorter than earlier predictions based on the constructed B virus physical map (26) or direct measurements by electron microscopy (40). The 74.5% G+C base composition of the genome is very similar to earlier estimates of 75% determined by DNA buoyant density centrifugation (40). Table Table11 presents a comparison of the length and G+C content of B virus, HSV-1, and HSV-2 by genomic region. The G+C composition was elevated in all B virus regions compared to that in HSV types 1 and 2, with unique regions demonstrating the highest G+C content among known herpesviruses. The sizes of genomic segments were comparable among the viruses with the exception of RS, which in the B virus genome was ≈1.5 kb longer.
Ten sets of tandem reiterations were identified in the B virus genome. Genomic locations of sets, numbers of repeat units in each set, sizes, and sequences are given in Table Table2.2. Most reiterations were present at locations compatible with the locations of HSV-1 reiterations. Six sets were present in two copies in the genome due to location in the terminal and internal copies of the RS and RL segments. Only two reiteration sets were located in protein-coding regions, the UL36 and US4 (gG) genes.
Both origins of B virus replication, oriL and oriS, were tandemly duplicated and present in locations corresponding to HSV oriL and oriS locations. Thus, six origins of DNA replication exist: two copies of oriL (oriL1 and oriL2) in the UL region and two copies each of oriS (oriS1 and oriS2) in the terminal and internal RS regions. Duplications of either the oriL or oriS sequence were also found in HSV-1 strain ANG and HSV-2 strain HG52, respectively (18, 24), but concurrent duplication of both origins has not been described previously for herpes simplex viruses. All B virus origins share the core element of a 94-bp perfect palindrome (Fig. (Fig.3)3) containing two predicted binding sites for the origin-binding protein (OBP), box I, and box III (22, 27, 52). Furthermore, the nucleotide sequences of the B virus oriS and oriL core elements were extremely conserved and almost identical to HSV oriL but differed from HSV oriS (25, 42). Whether the existence of six nearly identical origins of replication in the B virus genome has any functional significance or is just an artifact due to repeated passage of the virus in cell culture currently remains unknown.
Both major approaches to gene identification, extrinsic and intrinsic, were used to identify and characterize B virus genes. Extrinsic methods, such as Blast, identify protein-coding genes by detecting similarity of translated protein sequences to the primary structure of a known protein. The intrinsic approach, an ab initio statistical method such as GeneMark, identifies protein-coding regions by detecting specific frequency patterns in nucleotide order, including the codon usage pattern. These two types of methods have complementary strengths in terms of sensitivity and specificity. Extrinsic, similarity search methods have high specificity but may miss some genes, up to 30 to 40%, which could be recovered by the intrinsic, statistical method. The B virus genomic sequence can be efficiently analyzed by similarity search methods due to extensive knowledge of the closely related viral species HSV-1 and HSV-2. Still, unique virus-specific genes could be missed. Therefore, in addition to the similarity search, we used ab initio methods, GeneMark and GeneMark.hmm, trained on the set of genes confirmed by similarity search. These methods were modified to allow a noncanonical translation initiation codon, GTG (46), that could be used in this highly G+C-rich genome.
All but one B virus gene were identified on the basis of sequence homology to HSV-1 and HSV-2 genes and named correspondingly. Table Table33 summarizes their locations in the genome, predicted sizes, and percent identities with the corresponding HSV-1 and HSV-2 gene products. Seventy-two genes existed as a single copy within unique genomic regions, whereas two genes, ICP0 and ICP4, appeared twice due to duplication of the large and small repeat regions where they reside. Two ORFs (RL2 and UL15) are predicted to have introns matching ones in the corresponding HSV genes.
It was inferred that the B virus UL1 and UL2 genes use the GTG codon as a start codon at genomic sequence positions 9072 and 9814, respectively. This prediction was made statistically and was corroborated by the presence of conserved regions upstream of the canonical ATG. Both predicted translational initiation GTG codons aligned with the ATGs of the HSV UL1 and UL2 genes. Conservation of the UL1 GTG start codon was confirmed by sequence analysis of the 5′ region of the UL1 gene from two B virus clinical isolates (data not shown).
The extent of amino acid identity between B virus and HSV polypeptides varied from 26.6% (US5) to 87.7% (UL15). The conservation of specific protein domains observed in HSV-1 and HSV-2 was mirrored when HSV-1 and B virus were compared. The following proteins were significantly conserved in B virus: DNA cleavage and packaging proteins, i.e., UL15, UL28, UL32, and UL33; capsid proteins, i.e., UL18, UL19, and UL38; proteins involved in DNA replication, i.e., UL2, UL5, UL29, and UL30; and glycoprotein B. The three least-conserved proteins in B virus were US4, US5, and US12. Similar levels of conservation have been described for homologous proteins in many other mammalian herpesviruses (58).
The B virus proteins could be divided into three groups based on the degree of similarity to HSV-1 and HSV-2 proteins. The largest group (46 proteins) showed greater similarity to HSV-2 proteins, e.g., DNA cleavage and packaging proteins. The second group (20 proteins), with capsid proteins among others, showed higher levels of similarity to HSV-1 proteins. In seven proteins, differences in similarity to HSV-1 or HSV-2 homologs were marginal, and these proteins formed the third group. The proteins that enable HSV-1 to replicate and reactivate more efficiently at orofacial sites and HSV-2 at genital sites are unknown, but the fact that B virus replicates and is reactivated with similar efficiencies at both sites may be explained by the similarities of selected B virus proteins with either HSV type 1 or 2 proteins. These hybrid properties in B virus raise many questions about alphaherpesvirus evolution.
In recent years, the existence of additional HSV-1-specific genes has been proposed and substantiated by experimental data. UL20.5 is located between UL20 and UL21, ORF P and ORF O map antisense to the 34.5 gene, and UL43.5 and UL27.5 map antisense to UL43 and UL27, respectively (8, 55, 56, 71, 72). Like HSV-2 (18), B virus has no equivalents to these proposed genes.
One well-established gene, the HSV γ134.5 (RL1) homolog, was observed to be absent in B virus. This conclusion was reached after repeated searches with state-of-the-art computational genome analysis tools (see Materials and Methods). To determine that this observation was not limited to the laboratory strain of B virus, we cloned and sequenced L-terminal SphI restriction fragments from two low-passage clinical isolates, one of which was derived from a rhesus macaque (54) and the other post mortem from a zoonotically infected human patient (16). Sequence comparison of these fragments with the corresponding fragment of the laboratory strain did not reveal any significant differences: only single nucleotide substitutions and variations in the number of copies of short reiterations were found in the clinical isolates relative to the laboratory strain. Since the absence of the γ134.5 gene homolog was verified in three independent B virus strains, we concluded that it was a genuine feature of the B virus genome.
The protein product of the HSV-1 γ134.5 gene, infected-cell protein 34.5 (ICP34.5), is a neurovirulence factor with at least two known, distinct functional activities (9, 10, 12-14, 64, 66). One function, encoded by the carboxyl-terminal domain, negates the antiviral effect of induced protein kinase R by redirecting the host protein phosphatase 1α to dephosphorylate translation initiation factor eIF2α, preventing protein synthesis shutoff in infected cells (9, 11, 12, 29, 39). Another function, mapped to both amino-terminal and carboxyl-terminal domains, somehow enables the virus to replicate in the peripheral and central nervous systems of experimentally infected animals (10, 64). Deletion of γ134.5 leads to complete neuroattenuation of highly neurovirulent HSV-1 strains. In the absence of γ134.5, other genes may supply these functions, given the striking similarities between the replication characteristics of B virus and the human simplex viruses.
We predict that B virus currently uses compensatory strategies to block host responses to infection, similar to those described for HSV-1. For example, the HSV-1 US11 protein can inhibit protein kinase R activation and compensate for the absence of the ICP34.5 function in deletion mutants if expressed early in infection (5, 6, 28, 48). It was proposed that the US11 gene encodes an alternative mechanism to preclude the shutoff of protein synthesis that is currently inactive in HSV-1 (5). In addition, B virus might have evolved unique mechanisms to prevent termination of protein synthesis and elude the DNA replication blocks imposed by neuronal cells. To examine this possibility, a search for B virus-specific genes was performed.
Two major sequence differences between the B virus and HSV-1 and HSV-2 genomes were detected. The B virus RS region contains an additional ≈1.5 kb of sequence between the S terminus and the ICP4 gene homolog, while the RL region is shorter in B virus than in the human viruses, with no sequence homology to the HSV ICP0 flanking regions. However, no potential genes were identified in these regions despite rigorous analyses.
A putative two-exon gene (UL53A) was identified by statistical analysis in the B virus UL region on a complementary strand (positions 112186 to 112495 and 112755 to 113344). This putative gene had a codon usage pattern compatible with codons present in established B virus genes. The UL53A protein is a hydrophilic basic protein (pI 11.9) with an amino acid composition biased to Pro and Arg residues. The first 21 residues were predicted to be a signal peptide by the SignalP program, and residues 222 to 240 were predicted to be a transmembrane domain by two out of three programs applied (TMpred and DAS), but probability scores were not high enough to classify this protein as a membrane protein. PSI-Blast analysis detected sequence similarity to domains in three neuronal proteins encoded in mammalian genomes: neural cell adhesion molecule NCAM-180 (GenBank accession number P13595, E value of 0.16), calcineurin inhibitor cabin 1 (GenBank accession number AAD40846, E value of 0.014), and brain calcium channel α 1A subunit (GenBank accession number AAB64179, E value of 0.007). However, the similarity was insufficient to confidently characterize the gene at this time, and pending experimental evidence will be critical to determine whether this gene, which is missing in HSV-1 and HSV-2, provides B virus with a unique mechanism to attack neural cells.
The G+C content of the protein-coding regions in B virus, 74.4%, is only slightly lower than the G+C content of the noncoding regions, 75.4%. These data indicate a strong mutation pressure toward G and C nucleotides in B virus DNA evolution. Obviously, the genetic drift toward the abundance of G's and C's has been compensated for by positive selection for conservative A's and T's in the first (26% A and T) and second (44% A and T) positions of a codon in protein-coding regions and by positive selection for A's and T's in a few evolutionarily conserved regions (promoters, repeats, and other regulatory sites) in noncoding DNA. The G+C content of the third position of codons in B virus genes is extremely high, 93.1%. This bias, created by mutation pressure, seems to be a driving force in the formation of the codon usage pattern in B virus.
Genomes of primates tend to have low G+C content, and the logical question arises whether the highly G+C-biased codon usage in B virus matches the proportions of the isoacceptor tRNA in host cells. Unfortunately, no experimental data are currently available regarding the tRNA pool in macaque cells. Still, relative amounts of the isoacceptor tRNAs could be predicted based on the frequencies of codon use in rhesus macaque genes because a strong direct correlation between these parameters has been demonstrated in a number of organisms (23, 32, 49).
A codon usage catalog of rhesus macaque genes was generated based on 288 protein-coding sequences from the GenBank database and compared with the codon usage in B virus genes (Table (Table4).4). As expected, the overall codon usage patterns of B virus and the macaque hosts differed substantially. However, remarkably, the favored codons of the B virus genes mirrored the most frequently used codons in the rhesus macaque genes (except Ser and Arg, both encoded by six codons). Moreover, these most frequent codons in B virus were almost the only codons used for each amino acid. This observation correlates with intriguing data from a number of unicellular and multicellular organisms that highly expressed genes utilize preferable codons in each synonymous group, the optimal codons (20, 21, 23, 61). Thus, the combined effects of directional mutation pressure and selection for preferential codons put the B virus genome in a favorable state to efficiently employ the host cell translation machinery for expression of viral proteins.
The complete genomic sequence of the B virus E2490 laboratory strain demonstrates that the genome organization and gene repertoire are similar but not identical to those of the HSV-1 and HSV-2 genomes. Interestingly, B virus lacks the gene homolog encoding the neurovirulence factor ICP34.5. We postulate that B virus utilizes different mechanisms to block host antivirus responses and facilitate replication in neurons. The enhanced neurovirulence of B virus in human hosts may reflect the inability of human cells to mount efficient antiviral cellular responses against these divergent strategies. The other implication of our findings is that because B virus does not possess the γ134.5 gene homolog, acquisition of this gene by human viruses is an even more recent evolutionary event than was suggested previously (6, 7).
Comparative analysis of the B virus and HSV genomes is essential to understand the mechanisms of B virus pathogenesis in humans. The complete sequence of the B virus reference strain also supplies much-needed information to determine the genetic basis of phenotypic and pathogenic differences among B virus isolates derived from different macaque subspecies. With this information, new antiviral and vaccine strategies can be designed to target critical viral components in order to control this deadly zoonotic agent.
This work was supported by Public Health Service grants R01 RR03162 and P40 RR05162 from the NIH's National Center for Research Resources and in part by grant GM-47853 from the National Institutes of Health.