In the present study we have identified a novel specific element (SSE) found in the promoters of all 52
Pcdh genes, positioned immediately upstream of each CSE. In contrast to the CSE, SSEs are rather unique sequences and much less conserved between paralogs of the same gene family. We have shown that
Pcdh promoter activity was governed by the combined activity of the newly identified SSE and the CSE. Thus, while CSE is indeed crucial for transcription, it is definitely not sufficient. Using DNA binding assay we found a nuclear complex that specifically interacts with αSSE–CSE
in vitro. Subsequently, we have isolated and identified CCTC-binding-factor (CTCF) as the protein that binds and specifically binds and regulates the SSE–CSE in each promoter region of the
Pcdh. First, we found that CTCF binds in a highly specific manner to the promoter region of several representative genes from the
Pcdhα and
Pcdhγ families
in vitro. Second, knocking down CTCF down-regulated the expression of endogenous active
Pcdh genes as well as
Pcdh promoter-driven reporter gene. These findings suggest that CTCF binds to the promoters of the clustered
Pcdh genes and is acting as a master positive regulator of all the
Pcdh genes. Our findings are consistent with previous reports that show that binding of CTCF to the target site is methylation-independent (
40,
41).
CTCF is a diverse protein and has a unique structure of 11 zinc-finger-DNA binding domains which are conserved among vertebrates. This distinct structure gives CTCF an exceptional degree of flexibility for DNA binding site recognition, which led to the description of CTCF as a ‘multivalent’ transcription factor (
42). CTCF is known to perform numerous functions including enhancer blocking, X-chromosome inactivation, gene imprinting, monoallelic gene expression and promoter activation or repression (
42–46). It has been demonstrated that CTCF can mediate contact between CTCF binding sites, to stabilize intra- and inter chromosomal long range interactions to affect transcription (
38,
47,
48). Conditional CTCF Knock out models have demonstrated that CTCF affects transcription in enhancers of both beta and alpha Globin loci (
49,
50), the HOX cluster (
51), the XITE of the X-chromosome (
52) and the EBV (
53). Another recent study, which investigates the transcriptional activity in CTCF mutant limbs, has shown that the CTCF sites in RNA splicing may regulate the production of specific alternative transcript variants (
51).
CTCF is known to interact with diverse protein partners that determine its specific function; hence, the regulation of CTCF activity might be achieved by neighboring factors bound to DNA. These partners factors include the RNA polymerases I, II and III, another zinc finger factor VEZF1 and the factors YY1, SMAD, TR and Oct4. Each of these seems to influence, modulate or determine the function of CTCF (
54). Of particular interest to this study is YY1 which was shown to interact with CTCF and to function together in X-chromosome inactivation (
52). YY1 is a ubiquitous four-zinc-finger transcription factor that has been implicated in biological processes such as embryogenesis, differentiation, cell proliferation and tumorigenesis (
55). Surprisingly, in the course of our research YY1 were found to bind specifically to the SSE–CSE, most likely through a YY1 binding site that partially overlap the CSE of α6, α3 and α12. Although YY1 and CTCF appeared in our experiments as distinct complexes on EMSA, it is possible that under physiological conditions they bind cooperatively to the promoters of
Pcdhα genes.
Considering that the
Pcdh gene cluster contains several promoter elements in tandem, with only one or two are active, and the fact that
Pcdh expression is monoallelic, it is possible that in addition to its ability to activate
Pcdh transcription, it also has a central role in insulating nearby promoters in the monoallelic expression of
Pcdh genes. Depending on the promoter context and cell background, CTCF may repress or activate transcription; however, its repression function predominates. We found that CTCF acts primarily as a positive transcription factor for the
Pcdh genes. We cannot conclude from our results that the SSE dictates cell type specific expression. However, if we take into account the fact that CTCF is associated with most of the
Pcdh promoters
in vivo (
56,
57) (according to the ChIP-seq data found in the UCSC genome browser), as well as in conserved enhancers HS5-1 (
28) and HS16–20 (
58), it may well be possible that CTCF also serves as a repressor that acts to silence the inactive
Pcdh genes. However in these studies the exact site to which CTCF binds and the functional significance of CTCF for
Pcdh expression were not addressed. This repression can be achieved through different conformation of DNA/CTCF-complexes (which CTCF binding to itself) that allow it to form chromatin hubs by selective intra- and interchromosomal interactions bridging together specific subsets of genomic CTCF sites. The divergence of the CTCF-binding sequences (as a result of unique SSEs) can serve as ‘CTCF CODE’ (
59,
60) encrypting interactions with co-partner in a site specific manner (through the SSEs) and establishing structure-functional 3D organization involved in regulating the expression of individual
Pcdh genes. The nature and composition of this co-partner is yet to be determined. Profiling the gene expression pattern of several different cell lines following profiling the DNA binding location of the CTCF and other candidate proteins will give us a predictive mechanistic model for the specific regulation of the
Pcdh gene expression.
In summary, the expression of distinct
Pcdh mRNAs in individual neurons is regulated by the activation or repression of subsets of promoters preceding individual genes. The choice of a gene included in a
Pcdh mRNA appears to be a direct consequence of promoter selection. The mechanism of differential
Pcdh promoter activation or repression in individual neurons is therefore likely to be a key step in regulating the cell-specific expression of distinct
Pcdh genes. Our work is, therefore, another step forward in the effort to understand the mechanism for
Pcdh gene expression, which may have significant implications on mammalian brain development. These findings may also have implications to human neuronal diseases linked to aberrant
Pcdh function such as autism (
61), bipolar disorder, schizophrenia (
62), auditory deficiencies (
63) and tumors (
64).