We have used four ChIP-chip approaches to establish a list of genes regulated directly by CIITA in B cells and DC. In addition to MHC-II, MHC-I and Ii genes, only nine new bona fide target genes were discovered. The current number of genes that have been demonstrated to be regulated directly by binding of CIITA does therefore not exceed twenty five. At all of these genes, the recruitment of CIITA is strictly dependent on binding of RFX to an X box sequence situated near the promoter. In most cases, this X box is situated in a characteristic S-Y enhancer. One of the new target genes encodes the RFX5 subunit of RFX, indicating the existence of a positive feedback loop for enhancing CIITA recruitment. Finally, seven of the nine new target genes are known or likely to be implicated in cellular processes contributing to antigen presentation (see below). Taken together, these results show that CIITA is dedicated for the regulation of a remarkably compact and highly specialized gene expression module devoted to antigen presentation. This is consistent with the fact that all phenotypes documented in BLS patients can be attributed to defects in antigen presentation. Furthermore, the finding that CIITA recruitment is strictly dependent on RFX at all validated target genes is in agreement with the observation CIITA deficient patients can not be distinguished phenotypically from RFX-deficient patients.
Robust and reproducible binding of CIITA was observed at all previously known and newly validated target genes in nine independent ChIP-chip experiments. No other bona fide target genes were identified by any of the four strategies. Genes at which signals were weaker and less reproducible turned out to be false positives. This suggests that most target genes of CIITA have been identified. It is nevertheless possible that certain target genes have been missed. There could be additional targets in specific cell types. In this respect it may be relevant that the
COL1A2 promoter was not found to be occupied by CIITA in B cells or DC (
Figure S4 and
Figure S5) although earlier ChIP experiments had suggested that CIITA regulates this gene in IFNγ induced cells
[22]. It is also possible that certain targets were missed for technical reasons. For example, the relevant CIITA associated sequences could be refractory to PCR amplification, hybridize inefficiently or have been excluded by the NimbleGen array design program. Finally, additional target genes could be controlled by distant CIITA-dependent enhancers that are situated outside of the 5 kb promoter regions present on the NimbleGen arrays. To address the latter possibility we performed ChIP-chip experiments using a high density array of our own design carrying the entire extended human MHC as well as other selected regions of interest, including several of the target genes identified here (see
Materials and Methods). ChIP-chip experiments performed with Raji and DC confirmed binding of CIITA to all target genes present on the array but did not - with the exception of known enhancers found in the vicinity of MHC-II genes
[36],
[37] - lead to the identification of any novel intergenic binding sites (
Table S2,
Figure S6). This contrasts with the high density of STAT1, p53 and NF-kB binding sites found by ChIP-chip in chromosome 22, many of which are situated far from transcription start sites. Intergenic CIITA-binding sites do thus not appear to be frequent, although we can of course not exclude their existence in other regions of the genome.
The functions of two of the new target genes are unknown.
KIAA0841 encodes a protein containing no characteristic sequence motifs providing clues to its function.
ZNF672 encodes a transcription factor belonging to the Kreuppel zinc-finger family
[38]. However, the genes and functions that
ZNF672 might regulate are unknown. For the remaining seven new target genes there is a known or potential link within antigen presentation by MHC-II or MHC-I molecules.
TRIM26 and
FLJ45422 are situated with the class I region of the MHC. For
RAB4B and
RFX5, a key role in antigen presentation has already been established.
RFX5 plays a pivotal role in activating MHC-II expression because it encodes the largest DNA-binding subunit of RFX and is essential for recruiting CIITA to its target genes
[30]–
[33].
RAB4B encodes an isoform of the small GTPase RAB4. RAB4 is associated with early and recycling endosomes, and regulates recycling of membranes and proteins from these compartments back to the cell surface
[39]. These recycling processes play important roles in various antigen presentation processes, including MHC-II restricted presentation of peptides derived from antigens internalized by receptor-mediated uptake in B cells, cross-presentation of endocytosed antigens by MHC-I molecules in DC and the presentation of intact proteins by DC to the antigen receptors of B cells
[40]–
[44]. RAB4 has been implicated directly in the MHC-II restricted presentation of antigens internalized by receptor-mediated uptake in B cells
[44]. For
MYBPC2,
TRIM26,
PSMD3,
TPP1 and
FLJ45422, a role in antigen presentation is suggested by the nature of the protein and/or the cellular processes in which they function.
FLJ45422 encodes a protein of unknown function exhibiting similarity to MHC-I molecules.
TPP1 encodes a lysosomal protease
[45] that could influence the generation of peptides presented by MHC-II molecules.
PSMD3 encodes a regulatory subunit of the proteasome, a large protease complex implicated in the generation of peptides presented by MHC-I molecules
[46].
MYBPC2 encodes an immunoglobulin superfamily member
[47] that can bind to myosin and filamentous actin, and modifies the actin-stimulated ATPase activity of myosin. Although
MYBPC2 is expressed abundantly in muscle and is best known for its role in muscle contraction
[48], it is also expressed at lower level in other cell types and its tight regulation by CIITA suggests that it could have additional functions in antigen presenting cells. Importantly, the actin cytoskeleton and/or actin-based myosin motors have been implicated in MHC-II trafficking and receptor-driven antigen presentation in B cells, the formation of immune synapses between antigen presenting cells and T cells, and antigen capture and presentation by MHC-I and MHC-II molecules in DC
[49]–
[51]. Finally,
TRIM26 encodes a member of the tripartite motif (TRIM) family of ubiquitin E3 ligases
[52]. Members of this family are implicated in diverse biological processes. They promote the ubiquitination of specific substrate proteins, thereby controlling their abundance by proteasome mediated degradation or their activity, intracellular trafficking or subcellular localization by proteasome-independent mechanisms. It is tempting to speculate that TRIM26 may regulate either the generation of specific antigenic peptides by the proteasome, or the abundance, activity or subcellular localization of specific proteins implicated in antigen presentation.
The
CIITA gene is frequently silenced by epigenetic mechanisms in tumors. It has been proposed that the loss of MHC-II expression and/or a reduction in MHC-I expression resulting from the silencing of
CIITA might allow tumors to evade immune surveillance
[23],
[53]. Our finding that CIITA is remarkably specific for genes implicated in antigen presentation is consistent with the hypothesis that the association between silencing of
CIITA and tumorigenicity reflects a reduction in the antigen presentation capacity of the tumor cells. Although there is a well established link between the loss of MHC-I expression and escape from immune surveillance
[54], it remains unclear how direct MHC-II mediated antigen presentation by tumor cells contributes to anti-tumor responses
in vivo. Therefore, an alternative possibility that has to be kept in mind is that defective CIITA expression could represent a selective advantage for tumors because it contributes to the expression of non-MHC genes. In this respect, certain of the new target genes may be relevant. For instance, an altered transcription program due to reduced
ZNF672 expression, deregulated vesicular traffic due to reduced
RAB4B expression, perturbed intracellular actin-based transport due to reduced
MYBPC2 expression, and altered ubiquitin-dependent degradation or regulation of specific proteins due to lower
TRIM26 expression, could all contribute to the development of tumors.
Two of the new target genes,
RAB4B and
TPP1, had been suggested to be regulated by CIITA in earlier studies. Both were among the genes that were found by microarray experiments to be downregulated in CIITA-deficient cells
[14].
RAB4B was also singled out by a bioinformatic screen designed to identify genes containing S-Y motifs
[32]. In contrast, more than 70 other genes suggested previously to be regulated by CIITA
[13]–
[22] were not found to be direct targets in our ChIP-chip experiments (
Figure S4). For several of the most interesting candidates we were moreover unable to confirm binding of CIITA to their promoters by classical ChIP experiments (
Figure S5). Finally, in a previous study using CIITA-deficient and CIITA-transgenic mice no direct control of these genes by CIITA could be documented
[55]. The influence of CIITA on the expression of these genes is therefore likely to be mediated by indirect mechanisms. For several genes, an indirect mechanism involving sequestration of the general co-activator CBP by CIITA has been proposed
[16],
[17],
[21]. Certain of the genes could be regulated by one of the two transcription factors - RFX5 and ZNF672 - shown here to be controlled by CIITA. Finally, there is growing evidence that ubiquitination and regulatory subunits of the proteasome can play key roles in transcriptional regulation
[56]. The modulation of
TRIM26 and
PSMD3 expression by CIITA could thus have indirect impacts on the transcription of certain genes.
The remarkably focused role of CIITA emphasized here contrasts with results derived from large-scale binding studies for other transcription factors. Good illustrations are provided by Foxp3, Stat1, cMyc and p53. Over 1000 Foxp3 target genes were identified by scanning for binding sites in the promoter regions of 16'000 mouse genes
[1]. Large scale mapping of Stat1 binding sites in chromosome 22 or selected (ENCODE) regions of the human genome have pointed to hundreds if not thousands of target genes
[2],
[3]. More than 300 cMyc target sites were identified in chromosomes 21 and 22
[4]. Finally, p53 binds to at least 500 target sites in the human genome
[5]. Similar large numbers of targets have been reported for all other transcription factors for which large-scale binding studies have been reported. The high degree of specificity observed here for CIITA is thus unprecedented. Why CIITA presents this unique degree of specificity is unknown. One explanation may reside in the finding that CIITA recruitment appears to be strictly dependent on the assembly of a well-defined multifactor enhanceosome complex on a relatively large (65–70 bp) composite regulatory module (the S-Y motif) that is tightly constrained with respect to its sequence content and architecture
[32]. Such S-Y modules are likely to be much less frequent in the genome than binding sites for individual transcription factors. A second explanation may lie in the fact that CIITA has quite an unusual origin for a nuclear transcription factor. It is the only transcriptional activator belonging to the mammalian nucleotide-binding domain and leucine-rich repeat (NLR) containing family, a large group of proteins exerting cytoplasmic functions implicated in cell death, inflammation and innate immunity
[57]–
[59]. The ability to activate transcription of a specific set of genes in the adaptive immune system may represent a recently evolved specialization acquired by an ancestral NLR protein originally having a completely different cytoplasmic function.