|Home | About | Journals | Submit | Contact Us | Français|
The enormous progress in genomic research over the past decade has had a huge impact on experimental directions designed to understand gene function and regulation in mammalian cells. Research has rapidly shifted from analysis of individual genes to using microarray expression analysis and similar global approaches to decipher expression changes and their regulation among thousands of genes. While this progress is impressive, our understanding of how gene expression is organized and coordinated within the context of the three dimensional structure of the cell nucleus has lagged behind. With the development of more sophisticated microscopy and computer imaging approaches, however, a new view of the cell nucleus is emerging based on genomic function. This new “functional view” of the cell nucleus has already revealed discrete domains in the nucleus where genomic functions are arranged such as sites of DNA replication, transcription, splicing and DNA repair (Berezney 2002; Berezney, et al. 1996; Dundr and Misteli 2001; Meaburn and Misteli 2007; Stein, et al. 2007; Stein, et al. 2004; Stein, et al. 2003; Strouboulis and Wolffe 1996).
Correlating with these functional domains is the hierarchial organization of genomic DNA in the cell nucleus from nucleosomes to chromatin fibers to chromatin loop domains to higher order arrangements leading up to the entire chromosomes which maintain discrete territorial arrangements within the interphase nucleus (Berezney 2002; Cremer, et al. 2006; Razin, et al. 2007). Considerable progress has been made in deciphering higher order chromatin domains in the cell nucleus through investigation of functional sites of DNA replication. By combining 3-D microscopy and computer imaging approaches with appropriate pulse and pulse-chase experiments, it was demonstrated that chromatin is arranged into ~1 mbp higher order domains during DNA replication which are maintained throughout the cell cycle and in subsequent daughter cells as a fundamental feature of higher order chromatin arrangement in the interphase cell nucleus (Berezney, et al. 1995; Jackson and Pombo 1998; Ma, et al. 1998; Sparvoli, et al. 1994). Moreover these 1 mbp chromatin domain units have been readily identified in living cells using appropriate in vivo labeling or microinjection approaches which have further revealed their dynamic nature (Pliss, et al. 2009; Zink, et al. 1998).
These findings provided the basis for further experimentation and extension of these studies in several directions. For example, by simultaneously labeling DNA replication and transcription sites in mammalian cells and applying 3-D microscopy and imaging techniques, our group has determined that replication and transcription sites in the cell nucleus are sequestered into separate higher order zones each of which contains numerous chromatin domains (Berezney 2002; Wei, et al. 1998). Moreover, the individual chromatin domains and their higher order arrangement into nuclear zones is maintained following extraction of cells for nuclear matrix (Berezney and Wei 1998; Wei, et al. 1998; Wei, et al. 1999). This further indicated the importance of higher order nuclear architecture in the organization and perhaps regulation of genomic function in the cell nucleus.
This review focuses on our recent investigations of the spatio-temporal relationships of DNA replication and transcription sites and their functional components in the cell nucleus. Combining pulse-chase experiments with 3-D microscopy and newly developed computer imaging algorithms has enable us to elucidate functional neighborhoods of DNA replication/transcription factories within the interphase cell nucleus.
Our earlier studies of nuclear zones of replication and transcription in the cell nucleus focused on early S phase (Wei, et al. 1998) when most of the actively transcribed genes in the cell are replicated (Goldman, et al. 1984; Hatton, et al. 1988; MacAlpine, et al. 2004; Schubeler, et al. 2002). This immediately raised a fundamental issue: “If chromatin domains are involved with DNA replication in early S phase, when and where will gene transcription occur within these newly replicated chromatin domain and does this involve a highly coordinated spatio-temporal programming of DNA replication and transcription?” As a first step in addressing this question, we performed pulse-chase-pulse dual labeling experiments of replication and transcription sites. Replication sites (RS) in exponentially growing HeLa cells were labeled with BrdU or CldU, chased between 0 and 24 hours followed by labeling of transcription sites (TS) either in vivo with FU or in permeabilized cells (in situ labeling) with BrUTP (Fig. 1). The rationale behind these experiments is that at different times following DNA replication (green labeling), transcription (red labeling) will occur within subpopulations of the replicated chromatin domains. We would then be able to visualize those chromatin domains that were actively transcribing their genes by the merged red and green or yellow color. Much to our surprise, extensive experimentation did not reveal significant levels of yellow color throughout the extensive chase period. Instead, separate sites of green replication and red transcription were always observed (Malyavantham, et al. 2008a).
The inability to observe conversion of RS to TS by a color change to yellow sites led us to develop a computer imaging approach to directly measure the spatial position of labeled TS in relation to the RS-labeled chromatin domains (Malyavantham, et al. 2008a). We term this “nearest neighbor proximity analysis”. Following application of a high resolution segmentation program for the individual sites (Bhattacharya, et al. 2005), our proximity program determines the distance between each individual site (center-to-center distance between centroids) within one population of sites (e.g, the TS) to its nearest neighbor in another population of sites (e.g., the RS). The percent of total TS that are within a certain distance or proximity of their nearest neighbor early RS chromatin domain is then determined over a range of pixel values (Malyavantham, et al. 2008a; Malyavantham, et al. 2008b).Our analysis demonstrated that over 70% of the TS centroids are within 7 pixels (0.5 μ) of the nearest neighbor RS centroids and over 80% are within 10 pixels (0.7 μ). Since RS and TS sites average ~0.5 μ in diameter (Ma, et al. 1998; Wei, et al. 1998; Wei, et al. 1999), these results show a strikingly high degree of proximity for the great majority of the TS to RS chromatin domains (Malyavantham, et al. 2008a). For example, a 0.5 μ distance between sites indicates an average edge to edge distance between the two sites of zero pixels. In other words, the two sites are in complete juxtaposition. Virtually identical proximity associations were measured independent of the chase period (0-24 hours) between RS and TS labeling (Malyavantham, et al. 2008a). These findings were directly supported by visualization of the RS and TS centroids following segmentation. The TS centroids were preferentially associated along the borders of the early RS chromatin domains but never directly co-localized with the RS centroids and virtually identical results were observed independent of the chase time (Malyavantham, et al. 2008a).
The proximity of RS and TS to RNA polymerase II was then investigated using our imaging and computer program approaches. Much to our initial surprise we found that the proximity distributions were nearly identical with no preference of the overall population of RNA polymerase II sites to either TS or RS chromatin domain sites (Malyavantham, et al. 2008a). We conclude that the overall populations of RNA pol II sites are positioned approximately midway between the centers of the early RS chromatin domains and the TS. It is intriguing to consider that pol II sites may represent linchpins for the close spatial arrangement and coordinate expression of replicating chromatin domains and transcription sites in the cell nucleus. This possibility is further suggested in the last section of this review on functional neighborhoods.
Based on our proximity data and visualization of centroids following segmentation of RS chromatin domains and TS, we propose that chromatin activated for transcription, dynamically unfolds or “loops out” of densely packed discrete chromatin domains (Fig. 2 A & B) where it engages in active transcription within “transcription factories” (Jackson, et al. 2000; Pombo, et al. 2000). It is possible that the local configuration changes from more condensed (discrete) regions of individual chromatin domains, into more diffuse (decondensed) regions occurs during activation of gene transcription. Several studies have indicated the unfolding/folding of chromatin domains during the activation or inactivation of both artificial and endogenous gene loci (Bickmore, et al. 2004; Chambeyron and Bickmore 2004; Chambeyron and Bickmore 2004; Mahy, et al. 2002; Mahy, et al. 2002; Tumbar and Belmont 2001; Volpi, et al. 2000). Our findings extend this analysis to the global level of the genome and suggest that chromatin unfolding may be a fundamental event required for transcriptional activation.
It has been suggested that gene transcription occurs stochastically in bursts of activity throughout the cell cycle (Chubb, et al. 2006; Levsky, et al. 2002; Levsky and Singer 2003; Raj, et al. 2006; Ross, et al. 1994). Using multi-FISH approaches, this process has been directly visualized using a series of gene probes (Levsky, et al. 2007; Levsky, et al. 2002). Thus, the constant high level of transcription associated with the early RS chromatin domains may be a manifestation of burst transcription in which different subsets of genes within each chromatin domain are unfolded and activated for transcription at different times over the cell cycle (Fig. 2 B). The final result of this stochastic process at the level of individual chromatin domains is that all the active genes within the domain are transcribed at different times and at the required levels for the given cell.
Using FISH studies of individual genes, several investigations have demonstrated that distally located genes along a chromosome or between chromosomes cluster together at transcription factories (Fraser 2006; Ling, et al. 2006; Mitchell and Fraser 2008; Nunez, et al. 2008; Osborne, et al. 2004). Finding high levels of proximity of transcription sites to the early S chromatin domains is consistent with these results and suggests that assembly/activation of dynamic transcription factories containing distally located genes may be a fundamental event involved in the global regulation of genomic transcription.
To further explore this possibility we performed proximity measurements of TS compared with two temporal (two hour chase between pulses) and spatially distinct populations of early S labeled RS (Malyavantham, et al. 2008a). Since this by definition must represent two different subsets of actively transcribed genes, our finding of identical proximity of transcription sites to each spatio-temporal population of labeled chromatin domains, is likely reflecting the participation of genes from different chromatin domains (i.e., distal genes) at common transcription factories as illustrated in Figure 2 C.
Our model (Fig. 2 A) also predicts that transcription can occur simultaneously with replication at chromatin domains but without colocalization at the discrete RS due to the unfolding of the transcriptionally active regions of the chromatin domain. The chromatin domains are either active or inactive for replication and transcription giving rise to several possible states as depicted in the schematic. For simplicity, we depicted the position of pol II sites as equidistant from both replicational and transcriptional activities as indicated by the proximity values (Fig. 2 A). To produce the large number of transcripts of abundantly transcribed genes, it is conceivable that these genes are maintained in an unfolded (diffuse) state for prolonged periods of time within a transcription factory. In contrast, genes that are transcribed only a very limited number of times per cell cycle (e.g. 1-10 copies), may need to be in a more diffuse chromatin state for transcription for only very limited number of times in each cell cycle. This less abundantly transcribed class of genes, which composes the vast majority of transcribed genes in a typical mammalian cell (Chubb, et al. 2006; Shav-Tal, et al. 2006), may, therefore, reversibly extend into the transcription factories surrounding the early RS replicated chromatin domains for active transcription and fold back into the discrete chromatin domains following termination of transcription (Fig. 2 A). Several previous studies employing FISH have demonstrated massive chromatin unfolding of individual genes following transcriptional activation (Chambeyron and Bickmore 2004; Mahy, et al. 2002; Ragoczy, et al. 2003; Tumbar and Belmont 2001; Volpi, et al. 2000).
The close proximity of early S replicated chromatin domains to transcription sites and of RNA polymerase II to both populations of these sites led us to consider whether defined neighborhoods of replication/transcription activity are present in the cell nucleus. This is consistent with previous findings of higher order zoning of replication and transcription in the cell nucleus (Berezney 2002; Berezney, et al. 2005; Berezney and Wei 1998; Cook 1998; Wei, et al. 1998). In addition, our studies of TS and chromatin domain centroids following segmentation, suggested that RNA polymerase II mediated transcription in the cell nucleus is concentrated in the regions along the periphery of the chromatin domains where multiple chromatin domains could participate in providing genes for coordinate expression in so-called transcription factories or regions of enriched gene transcription (Fig. 2 C). We, therefore, extended our studies of nearest neighbor proximity to fundamental protein components that are directly involved in either DNA replication (PCNA), transcription (RNA polymerase II) as well as HP1γ which is enriched in the euchromatic active gene regions of chromatin (de Wit, et al. 2007; Lomberk, et al. 2006; Malyavantham, et al. 2008b; Minc, et al. 2000) and might, therefore, be a suitable marker for defining functional neighborhoods in the cell nucleus.
The investigation demonstrated a high degree of proximity of all these factors to each other. This supports our view of defined functional neighborhoods for DNA replication/transcription function (Malyavantham, et al. 2008b). Moreover, detailed analysis revealed a defined order in the degree of proximities for the different components that was highly reproducible (Malyavantham, et al. 2008b). For example, PCNA, which decorates RS, was always the component with the least degree of proximity when determining the nearest neighbors to TS, RNA polymerase II or HP1γ while pol II had the highest relative proximity in nearest neighbor analysis to TS, HP1γ and PCNA (Malyavantham, et al. 2008b).
We propose that these distinctive proximity values for each pair of factors may reflect a precision in the higher order arrangement in these combinatorial arrays that constitute the functional neighborhoods. A model illustrating these relationships is shown in Figure 3. Key to the organization and functional expression of these combinatorial arrays may be the RNA pol II sites which appear to be positioned mid-way between the chromatin domains and sites of newly transcribed RNA. They may, aside from their fundamental role in transcription elongation, serve as “linchpin” for the proper assembly of the combinatorial arrays of functional factors and for the coordination of replication and transcription occurring both within individual chromatin domains and among several adjacent ones (Fig 2). This model also suggests the tethering of these combinatorial arrays to a nuclear matrix infrastructure. This is supported by findings demonstrating that these functional components and their higher order arrangements are maintained following extractions for nuclear matrix (Nakayasu and Berezney 1989; Wei, et al. 1999). In addition, proximity analysis for the nuclear matrix associated proteins matrin 3 and SAF-A (Belgrader, et al. 1991; Fackelmayer, et al. 1994; Kipp, et al. 2000; Nakayasu and Berezney 1991), demonstrates high levels of proximity to TS, PCNA, pol II and HP1γ. These levels, however, are significantly lower compared to the strikingly high proximities of matrin 3 to SAF-A sites (Malyavantham, et al. 2008b). This is consistent with a close association of the combinatorial arrays to an overall nuclear matrix structure as illustrated in Figure 3.
Further evidence supporting this view is provided by application of the minimal spanning tree (MST) algorithm (Dussert, et al. 1988; Dussert, et al. 1987). This pattern recognition imaging approach connects the discrete sites identified following segmentation into a network. The line path of the MST network follows aspects of nearest neighbor relations between the sites and a minimal distance for the entire pathway. The MST network generated for matrin 3 and SAF-A was in close alignment and demonstrated many sites of overlap (Malyavantham, et al. 2008b). In contrast, the MST patterns formed by pol II sites were spatially less closely aligned and showed no visible overlap with the network patterns of matrin 3 or SAF-A (Malyavantham, et al. 2008b). These findings further support the view that matrin 3 and SAF-A are components of an extensive nuclear matrix architecture in the cell nucleus and that the functional combinatorial arrays for DNA replication and transcribed are anchored to the matrix at a much more limited number of sites.
There is growing realization that the spatio-temporal positioning of functional sites and factors within the cell nucleus are critical determinants of genomic function and regulation within the interphase cell nucleus. Previous studies from our group identified higher order functional zones of replication and transcriptional activity in chromatin domains replicated in early S phase where actively transcribed genes are enriched. It was proposed that these higher order zones represent the modular basis for the coordinate regulation of genomic replication and transcription in the cell.
In this article we review our recent findings concerning the spatio-temporal dynamics of early S phase replicated chromatin domains, sites of gene transcription and associated protein factors. Our findings led us to propose that an important step in activation for transcription in the cell nucleus is the unfolding of regions within the discrete chromatin domains so that transcriptionally active chromatin can interact with the machinery for transcription such as RNA polymerase II and the multitude of additional factors involved in transcription and its regulation. Regulation of transcription from this global perspective likely involves temporal programming for the coordinate and differential unfolding of chromatin regions in accordance with the transcriptional programming of the cell. For example, active genes from different chromatin domains could undergo transcription at common sites or transcription factories. In addition, genes regions from within the same chromatin domains could be active at different transcription factories. All this is likely made possible by the spatial arrangement of the protein factors involved in the replicational and transcriptional processes which form combinatorial arrays that define the functional landscape of the cell nucleus.
Our results demonstrate a degree of precision for the higher order spatial arrangement of these combinatorial arrays of functional components. It is thus conceivable that alterations in the spatial organization of these arrays could lead to dysfunction or altered regulation of replicational or transcriptional processes at the global level. Studies of the nuclear matrix associated proteins matrin 3 and SAF-A further suggest that anchoring to the nuclear matrix architecture may be a critical determining factor in the combinatorial properties and for the higher order arrangement of these genomic functional complexes into 3-D network-like arrays.
Studies reported from the authors’ laboratory were funded by NIH Grant GM 072131-27
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.