Human immunodeficiency virus type 1 (HIV-1) establishes a stable latent reservoir in resting memory CD4
+ T cells that persists in patients on highly active antiretroviral therapy (HAART) and that is able to produce replication-competent virus following cellular activation (
7–
9,
14,
61). The latent reservoir is a major barrier to virus eradication (
15,
49,
50). In this stable reservoir, the provirus is transcriptionally silent (
10,
22). This latent reservoir may contribute to the rapid rebound of viremia after cessation of HAART (
26,
63). Deliberately inducing the reactivation of latent HIV-1 may be required to effectively target the reservoir and achieve eradication. Understanding the mechanisms that maintain HIV-1 latency is critical to this therapeutic strategy.
Mechanisms that maintain HIV-1 latency
in vivo are incompletely understood. It is widely accepted that the lack of active forms of key cellular transcription factors (
3,
13,
16,
31,
42,
58) and of the HIV-1 Tat protein and its cellular cofactors (
12,
23,
25,
29,
47,
51) limits the initiation and elongation, respectively, of viral transcription in resting CD4
+ T cells (
32,
60). In addition, DNA methylation and repressive histone modifications, especially the formation of a single nucleosome located at the viral promoter, have been postulated to promote transcriptional silencing of integrated proviruses (
2,
11,
21,
30,
52,
55,
59). Posttranscriptional mechanisms may also play a role (
24,
33,
43).
In the analysis of the regulation of HIV-1 gene expression, it is important to consider the nature of HIV-1 integration sites. Pioneering studies by Bushman and colleagues demonstrated that HIV-1 integrates preferentially into active cellular genes in
in vitro infections of transformed cell lines (
35,
46). However, in an elegant cell line model of HIV-1 latency, integration sites were found in chromosomal regions disfavoring transcription, such as centromeric regions (
27). This led to the notion that latency was determined primarily by the site of integration. However, the first analysis of HIV-1 integration sites in infected individuals found that HIV-1 proviruses integrated into transcriptionally active cellular genes in resting CD4
+ T cells from patients on HAART (
18). The same pattern has been observed in peripheral blood mononuclear cells (PBMC) from untreated individuals (
36). In these studies, integrated proviruses were detected by specific PCR strategies, but the replication competence of the proviruses was not assessed. Given that only a small fraction of the integrated proviruses in resting CD4
+ T cells from infected individuals appear to be capable of producing infectious virus following cellular activation (
8), the possibility remained that integration sites had a different character in the subset of cells harboring replication-competent viral genomes. It has also been unclear whether the observed pattern reflects the likelihood of initial integration into particular regions or subsequent selection of cells carrying particular types of integration events. Nevertheless, the finding that HIV-1 genomes reside within cellular genes raised the possibility that an additional mechanism, transcriptional interference, could also play a role in latency.
Transcriptional interference is defined as direct suppression of one transcription unit by another in
cis (
48). Two adjacent promoters have been shown to transcriptionally interfere with each other through perturbation of the association of the transcription initiation complex, dislodgement of a sitting promoter-bound complex, or collision between transcription elongation complexes moving in opposite directions (
1,
6,
38,
39,
44). Transcriptional interference has been demonstrated in an experimental system with two tandem HIV-1 long terminal repeats (LTRs) in HeLa cells (
17). In addition, transcription of the 5′ LTR has been shown to interfere with transcription initiating at the 3′ LTR in Jurkat cells (
34). In an elegant study in a cell line model, Lenasi et al. demonstrated that host gene transcription suppressed expression of a provirus integrated within the gene (
34). Both positive and negative effects of host gene transcription on expression of integrated proviruses have been noted in studies on individual cell clones (
20,
34). In a cell line model of HIV-1 latency, lower levels of HIV-1 gene expression have been observed when integration into highly expressed host genes occurs (
35). Host gene transcription through the LTR upstream of the HIV-1 transcription start site has been detected both in resting CD4
+ T cells from patients on HAART (
18) and in primary CD4
+ T cells infected
in vitro (
34). However, whether host gene transcription interferes with expression from an integrated provirus contained within that gene has not yet been demonstrated in any primary-cell system or
in vivo study.
In an effort to more accurately model the
in vivo regulation of HIV-1 gene expression, several groups have recently developed primary CD4
+ T cell models of HIV-1 latency (
4,
5,
37,
45,
62). The model developed by Yang et al. is of particular interest because it utilizes a prolonged culture period in cytokine-free media to allow infected T lymphoblasts to return to a fully quiescent state similar to that of resting memory CD4
+ T cells
in vivo. In the present study, we have used this primary-cell model to study the viral integration sites in latently infected resting CD4
+ T cells that are capable of upregulating HIV-1 gene expression following cellular activation. Serial analysis of gene expression (SAGE) was carried out to determine transcription levels of host genes harboring integrated proviruses. By comparing viral integration sites and the levels of expression of the host gene, we have identified new features of the integration sites of latent HIV-1, which may provide a better understanding of the mechanism of HIV-1 latency.