Cis-regulation of Neurog1 expression
We demonstrate that cis-regulatory regions for
Neurog1 are contained within 208 kb flanking the coding region of
Neurog1 with the
N1457-nGFP transgenic mice reliably reporting domains of
Neurog1 expression throughout development. Neurog1 is expressed in neuronal progenitor cells in many different regions throughout the central and peripheral nervous systems in a pattern largely non-overlapping to another bHLH factor Ascl1 (
Ma et al., 1997;
Sommer et al., 1996). As such, the temporal and spatial control of
Neurog1 expression is complex and includes multiple discrete regulatory regions spread over more than 20 kb surrounding the coding exon. Over the past decade, work in zebrafish and mouse has identified multiple cis-regulatory sequences for
Neurog1 using sufficiency assays in transgenic animals (
Blader et al., 2004;
Blader et al., 2003;
Gowan et al., 2001;
Murray et al., 2000;
Nakada et al., 2004). Here, using BAC transgenic mice to test the requirement of these sequences in directing neural specific expression in vivo, we find additional complexities including an apparent redundancy in enhancer activity, new activities for a previously defined enhancer, and localization of an enhancer for ventral neural tube expression. The current understanding of how
Neurog1 is regulated is summarized below.
Studies in zebrafish uncovered three enhancers within an 8.4 kb 5′ proximal genomic region, LSE, ANPE, and LATE, which are conserved to mouse and have distinct activities when assayed as reporter transgenes in vivo (for review see
Strahle and Rastegar, 2008). The LSE directs expression to the lateral stripes of the neural plate during primary neurogenesis in cells that give rise to the Rohan Beard neurons, with later activity directing expression to the telencephalon. In contrast, the ANPE directs expression to the anterior neural plate, and the LATE directs expression after completion of primary neurogenesis to the neural tube, hindbrain, and diencephalon. When tested in mouse the zebrafish LATE also had activity in the lateral telencephalon. None of these enhancers were reported to direct expression to the dorsal root ganglia or to the motor neuron domain in the ventral neural tube—two other areas of
neurogenin 1 expression, although this information is within the 8.4 kb 5′ genomic region tested.
Early studies on
Neurog1 regulation in mouse showed that the proximal 4.5 kb 5′ of
Neurog1 was inefficient at directing expression to
Neurog1 domains in transgenic mice (
Gowan et al., 2001;
Murray et al., 2000). This proximal 5′ region was also recently reported to undergo chromatin remodeling during activation of
Neurog1 in retinoic acid induced differentiation of P19 cells (
Wu et al., 2009). In contrast, a more efficient regulatory region found further 5′ (TgN1-2) that contains sequence conserved to the zebrafish LSE, ANPE, and LATE could direct expression of reporter transgenes efficiently to both dorsal and ventral neural tube, more anterior domains in the hindbrain and midbrain, some cranial sensory ganglia, and the olfactory epithelium. Notably, enhancer activity for directing expression to the telencephalon and dorsal root ganglia was absent (
Gowan et al., 2001). A subsequent study further delineated a 0.8 kb region that includes homology to the zebrafish LATE element that directs expression specifically to the dorsal neural tube (here called TgN1-13dnt) (
Nakada et al., 2004). In the current study, we delineate another enhancer, distinct from any of those described above, that is sufficient to direct expression to
Neurog1 cells in the ventral neural tube. The ventral neural tube enhancer has less conservation between species than sequence for the other enhancers since it is restricted to mammals. Taking these studies together, it is clear that there is a cassette-like organization of discrete regulatory regions that work together to direct the diverse pattern of
Neurog1-specific expression.
The regulatory regions identified to date are only part of the story, since we cannot recapitulate the full
Neurog1 pattern without using a large genomic region as tested in the BAC. One promising approach in the literature is to identify tissue specific enhancers using ChIP-Seq to localize co-activators such as p300 (
Visel et al., 2009). Since Visel et al used E11.5 forebrain and midbrain, tissues where
Neurog1 is actively expressed, we examined the published data set for p300 occupied sites within the 208 kb in the BAC RP23 457E22. With forebrain tissue there was a p300 binding region about 40 kb 3′ of
Neurog1 (chr13: 56,309,425-56,310,401), a region not specifically tested in the transgenic mice here. In contrast, p300 in midbrain tissue occupies a region (chr13: 56,359,200-56,359,676) that is included within the identified dorsal neural tube enhancer tested in TgN1-13dnt, a region with homology to fish. Paradoxically, TgN1-13dnt alone did not show activity in the midbrain suggesting other sequences and complexes lacking p300 are required for efficient expression in midbrain. So while the use of ChIP-Seq with co-activators like p300 to identify tissue specific enhancers is powerful in identifying some important enhancer regions, using the current technology it is unlikely to identify all enhancers.
The studies described above define activity of regulatory sequences by determining whether they are sufficient to direct reporter gene expression to a particular tissue. In the current study we asked whether the identified enhancers are required for expression in particular tissues. We predicted that deletion of sequence that includes the dnt element as well as homology to the ANPE from the 208 kb
Neurog1 BAC transgene would have lost GFP expression in the dnt but not lose expression in other tissues like dorsal root ganglia. Unexpectedly we found a uniform attenuation of enhancer activity in all domains. This suggests the sequence deleted, while not sufficient to direct expression to all
Neurog1 domains (see TgN1-2), is required generally for expression of the locus (see
N1457-nGFPΔ
R2). Furthermore, deletion of this region revealed the existence of a weaker tissue specific secondary enhancer within the 208 kb BAC since the complete
Neurog1 pattern was detected but just at much lower levels. The existence of secondary enhancers in developmental genes has recently been reported and is suggested to be an important mechanism driving animal diversity (
Hong et al., 2008;
Jeong et al., 2006;
Markstein et al., 2002;
Zeitlinger et al., 2007). Indeed, a recent study tested the requirement for ultraconserved elements by knocking them out in the mouse genome (
Ahituv et al., 2007). Of the four elements tested, none resulted in notable abnormalities in expression of the locus or in negative consequences for viability. It remains to be determined if the decrease in expression resulting from deletion of R2 or R3 () would be sufficient to disrupt nervous system development.
There has been little advance in identifying factors that regulate through the
Neurog1 enhancers. Pax6 is a good candidate for an upstream regulator since its expression in the telencephalon and neural tube in mouse overlaps that of
Neurog1. Indeed, Pax6 can bind zebrafish LATE in vitro, and LATE transgene activity in mouse telencephalon and zebrafish diencephalon requires Pax6 (
Blader et al., 2004). However, the sequence in the mouse
Neurog1 gene that is conserved with zebrafish LATE does not have activity in mouse telencephalon, but rather is restricted to directing expression to the dorsal neural tube (TgN1-13dnt) where Pax6 is not present. Thus, although we cannot rule out a role for Pax6 in regulating mouse
Neurog1, it does not appear to be functioning through the same enhancer in zebrafish and mice, nor does there appear to be a Pax6 site within the newly localized ventral neural tube enhancer. Gli factors are also candidates for regulating
Neurog1 expression, particularly in the ventral neural tube since they are downstream effectors of Shh signaling and are active in patterning the ventral neural tube (
Vokes et al., 2007). Mutating two gli consensus sites within the ventral neural tube enhancer appeared to attenuate enhancer activity, although the overall pattern of expression remained. Identifying the full complement of upstream signaling events and transcription factors that regulate through the distinct
Neurog1 enhancers remains an area where little is known.
Neurog1 lineage in the developing spinal cord
The bHLH proteins such as Neurog1 act in balance with the Notch pathway to control the timing of differentiation of neural progenitors (
Bertrand et al., 2002;
Lee and Pfaff, 2001). In addition, in combination with homeodomain factors, bHLH factors define distinct progenitor cells in the developing spinal cord, and act to confer diversity to the emerging nervous system (
Bertrand et al., 2002;
Briscoe et al., 2000;
Helms and Johnson, 2003;
Shirasaki and Pfaff, 2002). Defining the fate of Neurog1 progenitor cells in the ventral spinal cord has been complicated by multiple factors including 1) its transient expression in progenitor cells, 2) its broad expression in ventral progenitor domains, and 3) presumptive redundancy with the closely related Neurog2 making phenotypic analysis of mutants more difficult. In particular, although
Neurog1 mutants have defects in proximal cranial ganglia, olfactory neurons, and inner ear (
Cau et al., 2002;
Ma et al., 2000;
Ma et al., 1998), defects in other neural regions such as dorsal root ganglia, spinal cord, and dorsal telencephalon require
Neurog1/Neurog2 double mutants to detect a phenotype (
Fode et al., 2000;
Gowan et al., 2001;
Ma et al., 1999;
Scardigli et al., 2001). Our data using in vivo genetic fate mapping places Neurog1 preferentially in progenitors to interneurons including dI2, dI6, V0, V1, and V2, with a minor contribution to motoneurons. Although Neurog1 is present in these progenitor cells, no obvious phenotype has been seen in the ventral spinal cord in
Neurog1 mutant mice (
Scardigli et al., 2001)(HIQ and JEJ, unpublished data) possibly due to compensation by
Neurog2. In contrast, although
Neurog1 can compensate for the loss of
Neurog2 in the initiation of neurogenesis, it does not compensate for the regulation of homeodomain markers like Hb9 and Isl1 in motoneuron development (
Scardigli et al., 2001).
Neurog1 functions in many regions of the developing nervous system. In the CNS, its two accepted roles include 1) a general role in initiating neuronal differentiation while suppressing glial lineages, and 2) its particular role in specifying glutamatergic type neurons. We note that
Neurog1 lineage cells are almost exclusively neurons, consistent with its function in neurogenesis (HIQ and JEJ, unpublished). In contrast, with respect to the second role of Neurog1, it is worth noting that it is not restricted to glutamatergic neuronal lineages. For example, using the same
N1457-Cre transgenic line reported here, it was recently shown that in the cerebellum the
Neurog1 lineage maps to a subset of Purkinje cells which are GABAergic (
Lundell et al., 2009). Future studies using these mice will aid in identifying the full complement of
Neurog1 derived cell types in the animal.
With this study we extended our understanding of the complex transcriptional mechanisms that have evolved to direct the precise temporal and spatial expression of Neurog1, and defined the cell types within the neural tube that belong to the Neurog1 lineage. Our understanding of these mechanisms is far from complete, particularly with respect to identifying the upstream factors functioning through the identified enhancers. Controlling the precise levels, timing, and pattern of Neurog1 expression is critical for generating a normal nervous system as its function is necessary for generating the correct numbers and diversity of neurons required for the accurate assembly of neuronal circuits.