|Home | About | Journals | Submit | Contact Us | Français|
The origin recognition complex (ORC) was first discovered in the baker’s yeast in 1992. Identification of ORC opened up a path for subsequent molecular level investigations on how eukaryotic cells initiate and control genome duplication each cell cycle. Twenty years after the first biochemical isolation, ORC is now taking on a three-dimensional shape, although a very blurry shape at the moment, thanks to the recent electron microscopy and image reconstruction efforts. In this chapter, we outline the current biochemical knowledge about ORC from several eukaryotic systems, with emphasis on the most recent structural and biochemical studies. Despite many species-specific properties, an emerging consensus is that ORC is a ATP-dependent machine that recruits other key proteins to form pre-Replicative Complexes (pre-RCs) at many origins of DNA replication, enabling the subsequent initiation of DNA replication in S phase.
At the most fundamental level, the concept that at least one origin of DNA replication and multiple protein factors are required to initiate the physical process of genome duplication is conserved across the three domains of life (Kawakami and Katayama, 2010, Mendez and Stillman, 2003). In eukaryotes, an origin of DNA replication is a stretch of DNA sequence where the Origin Recognition Complex(Bell and Stillman, 1992) (ORC) binds and subsequently recruits other factors to establish a pre-replicative complex (pre-RC) (Bell and Dutta, 2002, Bielinsky and Gerbi, 2001, DePamphilis, 2003, Diffley and Labib, 2002, Kawakami and Katayama, 2010, Mendez and Stillman, 2003, Scholefield et al., 2011, Cocker et al., 1996). In budding yeast, the genetically defined replication origins are usually near and often overlap with the replication start sites (Marahrens and Stillman, 1992, Rao et al., 1994, Bielinsky and Gerbi, 1998, Brewer and Fangman, 1987, Theis and Newlon, 1994), but in mammalian species the start sites have not been genetically characterized in sufficient detail, but start sites have been reported new ORC binding sites (Abdurashidova et al., 2003, Bielinsky and Gerbi, 2001). In Drosophila, in which origins of DNA replication in a metazoan species been studied in most detail, ORC binding sites are located near actual sites of initiation of DNA replication and also correspond to start sites of transcription (Austin et al., 1999, Kim et al., 2011, Royzman et al., 1999, Xie and Orr-Weaver, 2008, Beall et al., 2002, Bielinsky et al., 2001, Chesnokov et al., 1999, Gossen et al., 1995, MacAlpine et al., 2010, Sher et al., 2012). Simple organisms such as virus and bacteria use a single origin of replication (Kawakami and Katayama, 2010). In eukaryotes, due to their vastly expanded genome size and the hierarchical structure of the chromosomes, there are hundreds to tens of thousands replication origins, depending on the organisms (Gilbert, 1998, Ryba et al., 2010, Cvetic and Walter, 2005, DePamphilis et al., 2006, Gilbert, 2010). The existence and utilization of the great number of origins is likely to ensure that the large genomes can be duplicated within a reasonable time frame and certainly within a single cell division cycle.
Aside from the fact that origins are located near to ORC binding sites and promote initiation of DNA replication, there is little consensus among eukaryotic species as to what constitutes an origin of replication. This is further complicated because ORC lacks sufficient DNA binding specificity to predict the location of origins of DNA replication, even in the budding yeasts where there is some sequence specificity to ORC binding (Chang et al., 2011). In this regard, the yeast S. cerevisiae is perhaps an exception rather than the normal. In S. cerevisiae, the replication origins are well defined and ScORC binds to the genetically defined origins, but even here the consensus sequence is rather variable (Chang et al., 2011, Marahrens and Stillman, 1992, Theis and Newlon, 2001, Bell and Stillman, 1992). The S. cerevisiae origins of replication are autonomously replicating sequences (ARS), 100 – 150 bp long, and constitute three of four elements termed A, B1, and B2, with an auxiliary element in some origins called B3 (Marahrens and Stillman, 1992). Element A contains the AT-rich 11-bp ARS consensus sequence (ACS) that is the most conserved and A and B1 contribute to ORC binding specificity (Bell and Stillman, 1992, Deshpande and Newlon, 1992, Van Houten and Newlon, 1990). The B2 element contains the double strand DNA unwinding element (DUE) where DNA replication starts and this element is required for loading the pre-RC and DNA helicase component MCM2-7 [mini-chromosome maintenance subunits 2-7] (Zou and Stillman, 2000, Wilmes and Bell, 2002). B3 is an accessary sequence 22-bp long and at the ARS1 origin binds the transcription factor Abf1 (Marahrens and Stillman, 1992). There are several hundred origins in S. cerevisiae and they all share the same ACS sequence and the general three-element architecture. In other eukaryotic organisms such as Schizosaccharomyces pombe, Drosophila melanogaster, Xenopus laevis, and Homo sapiens, the origin sequence pattern is not so well defined, except for the fact that they are generally contain AT rich sequences. It is clear now that, in these organisms, certain features outside ORC may be more important than ORC in defining replication origins. These additional determinants may include the local chromatin structure such as nucleosome positioning (Aggarwal and Calvi, 2004, Calvi et al., 2007, Chang et al., 2011, Eaton et al., 2010, Lipford and Bell, 2001, MacAlpine et al., 2010, Zou et al., 2006), chromatin modifications (Eaton et al., 2011, Liu et al., 2012, Weber et al., 2008, Hartl et al., 2007), transcription regulation (Karnani et al., 2010, MacAlpine et al., 2004), extra protein or RNA partners (Norseen et al., 2008, Thomae et al., 2008, Bartke et al., 2010, Shen et al., 2010), and potentially the physical properties, such as the rigidity or the malleability of the DNA fragment (Cao et al., 2008, Huang and Kowalski, 1993, Natale et al., 1993).
In contrast to the great divergence in the number and sequence of the eukaryotic replication origins, ORC, the ATP-dependent molecular machine that binds to those origins and helps to execute DNA replication, is well conserved throughout evolution, at least at the amino acid sequence level (Gavin et al., 1995, Tugal et al., 1998, Speck et al., 2005, Clarey et al., 2006). There are, however, considerable differences between species with regard to the stability and composition of ORC subunits during the cell division cycle, a topic that is addressed below with a discussion of selected species. Furthermore, ORC has functions and activities well beyond DNA replication (Sasaki and Gilbert, 2007). Here we limit our discussion to the principal role of ORC in replication initiation, although a summary of ORC activities and structures is summarized at the end of this review.
ORC is composed of six protein subunits Orc1-6, named initially in the yeast according to their molecular masses, with Orc1 being the largest subunit (120 kDa) and Orc6 the smallest (50 kDa) (Bell et al., 1995, Bell and Stillman, 1992). ORC subunits in other eukaryotes are named according to their function and amino acid sequence conservation with their yeast counterparts (Fig 1). ORC is an ATPase and its binding to origin DNA is usually ATP dependent (Klemm et al., 1997, Lee and Bell, 2000, Speck et al., 2005, Takenaka et al., 2004, Bell and Stillman, 1992). Despite the overall conservation, ORC has sufficiently evolved to warrant us to discuss ORC from different species individually. In the following sections, we will briefly survey four ORCs from S. cerevisiae, S. pombe, D. melanogaster, and H. sapiens, outlining some biochemical and structural studies haven been reported.
The most distinctive feature of ScORC is that it forms a hetero-hexamer consisting of the Orc1, Orc2, Orc3, Orc4, Orc5 and Orc6 subunits, forming a stable throughout the cell division cycle that constitutively binds all of the origins of replication (Gibson et al., 2006). ScORC binds in a DNA sequence specific manner, although compared to sequence specific transcription factors, ScORC has low binding affinity for its highly variable recognition site, which consists primarily of the A and B1 elements of the double strand DNA origins (Rao and Stillman, 1995, Rowley et al., 1995). ScORC exhibits a high affinity for single strand DNA (ssDNA) (Kd ≈10-8) in a sequence nonspecific manner and without the requirement for ATP, as long as the ssDNA is longer than 80 bases (Lee et al., 2000, Clarey et al., 2006, Lee and Bell, 1997, Speck et al., 2005)
Among the six subunits, the first five subunits, Orc1 through Orc5, all contain a predicted AAA+ domain, and are essential for DNA binding, although only Orc1, Orc2, Orc4, and Orc5 appear to have direct contact to the origin DNA (Clarey et al., 2006, Lee and Bell, 1997, Speck et al., 2005). This suggests that Orc3 may function to glue the DNA-contacting subunits together as a stable complex, but not bind DNA directly. Orc6 is the only subunit that does not contain a predicted AAA+ domain, and does not bind DNA. However, Orc6 is an essential subunit for DNA replication because Orc6 recruits multiple Cdt1 molecules during repeated loading of the replicative helicase core the Mcm2-7 hexamer (Asano et al., 2007, Chen and Bell, 2011, Chen et al., 2007, Takara and Bell, 2011).
The largest subunit Orc1 is unique among ORC subunits, because it has a N-terminal 235-residue bromo-adjacent homology (BAH) domain that interacts with the C-terminal region of silencing regulator Sir1 (Zhang et al., 2002, Bose et al., 2004, Fox et al., 1997, Gardner et al., 1999, Hou et al., 2005, Ozaydin and Rine, 2010, Triolo and Sternglanz, 1996). The BAH domain is not essential, but its presence in Orc1 can influence origin binding specificity, as it alos does in human cell ORC (Muller et al., 2010, Noguchi et al., 2006). This latter specificity may be related to the known nucleosome binding properties of the BAH domain present in the Orc1-related protein called Sir3 (Armache et al., 2011, Hickman and Rusche, 2010). Sir3 is a regulator of the silent mating type genes in yeast and maintains certain mating type gene loci transcriptionally silent. The BAH domain of Orc1 binds to the Sir1 protein, which is also required for efficient silencing of the silent mating type loci. In the absence of Sir1 or the Orc1 BAH domain, different epigenetic states of the mating type gene expression are established (Bell et al., 1993, Bell et al., 1995, Pillus and Rine, 1989, Pillus and Rine, 2004, Zhang et al., 2002). The crystal structure of the Sir3 AAA+ domain has recently been determined and although it does not have an ATPase activity (unlike the Orc1 AAA+ domain), the Sir3 AAA+ domain has evolved to bind to the partner Sir4 silent information regulator protein and to chromatin containing non-methylated H3K79 residues (Ehrentraut et al., 2011). Thus, after duplication of the Orc1 gene, the Sir3 allele evolved to acquire diverse biochemical functions for an AAA+ protein, even though it retained the same overall structural features of the AAA+ domain.
Both Orc1 and Orc5 can bind ATP (Klemm et al., 1997, Klemm and Bell, 2001, Takehara et al., 2008), but the ATPase activity of ScORC primarily resides in the Orc1 subunit, and relies on the presence of an arginine finger in Orc4 (Bowers et al., 2004, Randell et al., 2006). Orc1 has a predicted classic AAA+ ATPase domain, with functional Walker A and Walker B motifs and the ATPase activity controlled by the insertion of an arginine residue present in the Orc4 subunit into the active site of the Orc1 ATP binding site (Bowers et al., 2004). Orc1 ATPase activity is required for loading of multiple subunits of the MCM2-7 hexamer and is blocked by origin-specific double-stranded DNA (Klemm et al., 1997, Randell et al., 2006). Thus the ATPase activity of ORC is required for DNA replication. Many ORC subunits, including Orc1 have a predicted winged-helix (WH) domain that may contribute to DNA binding (Clarey et al., 2006, Speck et al., 2005), just like the DnaA protein uses both its AAA+ and its Helix-turn-helix domains for DNA sequence specific DNA binding (Kawakami and Katayama, 2010).
ScORC is subject to cyclin dependent kinase (CDK) activity regulation and is a substrate for the cell cycle regulatory kinase (Nguyen et al., 2001, Weinreich et al., 2001, Wilmes et al., 2004). ScOrc2 and ScOrc6 are phosphorylated by CDK only during S and G2 phases of the cell cycle. Phosphorylation on Orc2 enhances Orc5 to bind to ATP (Makise et al., 2009). Phosphorylation of Orc6 by CDK prevents it from interacting with Cdt1, thus regulating the Mcm2-7 helicase loading (Chen and Bell, 2011). CDK phosphorylation sites in ORC can be altered without much phenotypic consequence, unless additional sites in Cdc6 and MCM2-7 are simultaneously altered, leading to over-replication of DNA from certain origins of DNA replication in the genome (Nguyen et al., 2001).
The Orc1 subunit, in addition to having a primary amino acid sequence related to Sir3, is also highly related to Cdc6. In fact many archaea species only have a single Orc1/Cdc6 protein that binds to the origin in a DNA sequence-dependent manner (see Chapter 4) and has amino acid sequence similarity to both Orc1 and Cdc6 (Capaldi and Berger, 2004, Duncker et al., 2009, Gaudier et al., 2007, Tada et al., 2008). In S. cerevisiae, the Cdc6 protein is an AAA+ ATPase that is required for MCM2-7 loading onto the pre-RC, but then Cdc6 is degraded by ubiquitin-mediated proteolysis at the G1 to S phase transition, dependent on the activation of the S-phase cyclin-CDKs (Clb5-Cdc28 and Clb6-Cdc28) (Cocker et al., 1996, Liang et al., 1995, Santocanale and Diffley, 1996, Duncker et al., 1999, Perkins et al., 2001, Drury et al., 2000, Drury et al., 1997, Piatti et al., 1996). The destruction of Cdc6, while leaving ORC intact, is part of the mechanism that ensures that the pre-RC cannot be reassembled in S and G2 phases, thereby limiting the initiation of DNA replication to once per cell division.
Recent electron microscopic (EM) studies of the intact ScORC revealed a bipartite structure about 12 nm wide and 16 nm long (Fig 2) (Speck et al., 2005). Assuming that DNA binds along the length of the protein complex, the length of the ScORC structure is sufficient to interact with the observed 48-bp DNA fragment in the presence of ATP, as observed in the DNase I footprint assay (Bell and Stillman, 1992, Speck and Stillman, 2007). Systematic subunit mapping with a maltose binding protein fused at the N-terminus or the C-terminus of the individual ORC subunits showed that Orc1, 4, and 5 are located to one domain of the bi-globular structure, whereas Orc2 and Orc3 are located at the other end (Chen et al., 2008). Therefore, the most likely arrangement of the first five subunits is Orc1-Orc4-Orc5-Orc3-Orc2, with Orc1 and Orc2 at the two extremes of the pseudo-ringed structure. Such an architecture is consistent with in vivo and in vitro subunit interaction assays revealing direct contact between Orc2 and Orc3, and between Orc4 and Orc5, between Orc1 and Orc4, and between Orc4-5 and Orc2-3 (Chen et al., 2008, Matsuda et al., 2007). The physical position of Orc6 was mapped by comparing the EM structures of the intact ORC and Orc1-5 sub-complex missing Orc6 (Chen et al., 2008). In the 3D difference density map, two density peaks were identified: a larger one at the lower Orc2-3 lobe and a smaller one near the upper Orc1-4-5 lobe. This observation suggests that Orc6 binds mainly with Orc2-3 at the lower lobe but reaches up to Orc1-4-5 lobe. A weak interaction between Orc6 and Orc5 was indeed found in the yeast two-hybrid analysis (Matsuda et al., 2007).
The S. cerevisiae Cdc6 is another important replication initiation factor beyond ORC. It is predicted to contain an AAA+ domain, including the DNA-binding initiator-specific motif (ISM), and the DNA-binding winged helix domain, features well conserved among known replication initiators (Dueber et al., 2011, Dueber et al., 2007, Gaudier et al., 2007). In the absence of Cdc6, ORC is merely an origin binder and cannot load the MCM2-7 complex. In order to establish a pre-RC at an origin, ORC has to be transformed from a passive origin binder or origin marker to the active replicative helicase loader. It appears that Cdc6 throws a molecular switch in ORC that enables such transformation (Lee and Bell, 2000, Speck et al., 2005). Cdc6 binding to ORC in vitro introduces an extended pre-RC-like footprint on several replication origins. Formation of the Pre-RC signature DNA footprint is dependent on specific origin sequence, and the intact ATPase activity of ORC as well as that of Cdc6 (Speck et al., 2005, Speck and Stillman, 2007). The ATP hydrolysis activity of ORC is inhibited upon binding to the specific origin DNAs, and the ATP hydrolysis activity of Cdc6 is suppressed upon binding to ORC that is bound on specific origin DNAs. Presumably, the ATP molecules in the ORC-Cdc6-DNA assembly are preserved for use in the subsequent recruitment and loading of MCM2-7 helicase that are likely to be a series of energy-requiring molecular events (Evrin et al., 2009, Remus et al., 2009, Lee and Bell, 2000, Randell et al., 2006). EM studies showed that Cdc6 binds to the side of the bipartite ORC, in close contact with Orc1, forming an apparently ring-like feature in the complex (Fig. 2). The ring-like feature may function as the landing pad for the ring-shaped MCM2-7 helicase core complex. Since MCM2-7 is first loaded on the dsDNA rather than ssDNA (Evrin et al., 2009, Remus et al., 2009), it is unlikely that ScORC will melt the dsDNA. Therefore, the function of initial dsDNA melting will then have to be executed by the replicative helicase.
Following the determination of the structure of ORC and the ORC-Cdc6 structure using transmission electron microscopy, a recent study has shown a higher resolution structure of ORC-Cdc6 bound to an origin DNA [ARS1] (Sun et al., 2012). This structure shows that upon binding Cdc6 in a ATP-dependent manner, the BAH domain of Orc1 moves considerably to the back of the ring-like complex and new density, most likely the Orc6 subunit, protrudes to the front of the complex. Orc6 has been shown to bind to two copies of the Cdt1 protein and thus the major conformational change in Orc1 may facilitate the binding of two Ctd1-MCM2-7 heptamers to the origin DNA. Although the DNA was not visible in the cryo-EM structure of ORC-Cdc6-DNA complex, modeling of the archaeal Orc1/Cdc6 crystal structure into the individual ORC-Cdc6 subunits allows the prediction that the DNA forms a highly bent conformation within the ORC-Cdc6 complex, explaining the extended protection from nuclease in DNA footprinting experiments. The bending of the DNA is also consistent with other EM studies of ORC bound to DNA (Chastain et al., 2004).
It was thought that SpORC, like ScORC, remains bound to chromatin throughout the cell cycle (DePamphilis, 2005, Kong and DePamphilis, 2002). But a recent study showed that SpORC behaves much like the metazoan ORC rather than ScORC and binds replication origin periodically during the cell cycle, with the binding peaking at M to G1 transition stage (Wu and Nurse, 2009). Like in the budding yeast, SpORC, SpCdc18 and the SpCdt1 are required fro MCM loading and pre-RC assembly upon exit from mitosis (Kearsey et al., 2000, Kong and DePamphilis, 2002, Moon et al., 1999, Nishitani and Nurse, 1997, Ogawa et al., 1999, Takahashi et al., 2003).
SpOrc1, 2, and 5 subunits are highly conserved with their counterparts from S. cerevisiae (Fig. 1) (Moon et al., 1999). SpOrc4 is unique among ORC proteins in that it has an N-terminal extension containing nine AT-hook motifs that are not found in budding yeast or metazoan Orc4 homologs (Chuang and Kelly, 1999). The SpOrc4 AT hooks specifically bind the minor groove of AT-rich DNA tracts, and are necessary and sufficient for the DNA binding activity of SpORC (Chuang et al., 2002, Gaczynska et al., 2004, Lee et al., 2001). It appears that the SpOrc4 AT hooks are solely responsible for the DNA binding activity of the entire SpORC complex, as deletion of the Orc4 AT hooks not only abolish the DNA binding of Orc4, but that of the SpORC as well (Gaczynska et al., 2004). However, the AT-hook mediated initial binding of SpORC to origin DNA is salt sensitive, and this interaction is gradually converted to a salt-stable binding state in which the topology of origin DNA is changed into a negatively-supercoiled or under-wound state (Houchens et al., 2008). In agreement with this suggested DNA topology change, an atomic force microscopy measurement of SpORC bound to the autonomously replicating sequence 1 containing DNA fragments revealed shortening of the DNA length by 140 bp, a length sufficient for wrapping around SpORC by two turns (Gaczynska et al., 2004). SpCdc18, the homolog of ScCdc6, interacts with Cdt1 and together they further enhance the binding stability of SpORC on origin DNA, as if the SpCdc18-Cdt1 binary complex is an additional origin determinant in S. pombe (Kelly et al., 1993, Houchens et al., 2008). No structural characterization of the purified SpORC has been reported so far. Because of the unique AT hooks in SpOrc4, the mechanism of origin recognition of SpORC may be different from that of other eukaryotic systems.
DmORC can be isolated from a Drosophila embryo nuclear extract as a stable complex (Gossen et al., 1995). But the presence of DmORC is cell cycle dependent, and is regulated by the degradation of Orc1 via the ubiquitin proteasome pathway at the late M phase (Araki et al., 2003). DmOrc1 is resynthesized during late G1-phase. DmORC is also an ATPase, and like ScOrc1, DmOrc1 is essential for ATP hydrolysis and for ATP-dependent DNA binding (Chesnokov et al., 2001). Different from ScOrc6 that is not required for DNA binding, the DmOrc6 is required for the DNA binding of DmORC and it is an integral part of the complex (Chesnokov et al., 2001). The DmOrc6 alone has DNA binding activity, likely due to the predicted TFIIB-like DNA binding domain in the smallest subunit (Liu et al., 2011). Mutations to the predicted DNA binding region abolish its DNA binding activity (Liu et al., 2011). DmOrc6 contains a C-terminal domain that is important for cytokinesis and binds to the septin protein that mediates closure of the cytokinesis furrow at the end of cell division, and this feature seems to be conserved among metazoans (Chesnokov et al., 2003, Huijbregts et al., 2009, Prasanth et al., 2002).
Under EM, DmORC is an elongated structure with dimension of 170 Å by 115 Å, similar to ScORC (Clarey et al., 2006) (Fig. 3). The notable feature of DmORC is a spiral crescent that encompasses a 25-Å channel. Because there has been no biochemical or structural reports on the subunit arrangement, it is not clear if DmORC shares the same architecture with ScORC. But the overall dimension and basic shape of the EM reconstruction of the two protein complexes appear to be comparable, although not exactly the same. Similar features include the open-ring and the middle location of Orc5 that was mapped in ScORC by MBP-fusion approach, and in DmORC by a specific antibody (Fig. 3B).
DmORC binds DNA with little sequence specificity. DmORC localizes to open chromatin regions that are depleted of nucleosomes (MacAlpine et al., 2004, MacAlpine et al., 2010). Interestingly, DmORC binds the negatively supercoiled DNA 30-fold better than a linear or relaxed DNA and therefore DmORC may target the topology rather than the sequence of the origin DNA (Remus et al., 2004). Because of the lack of sequence specificity, it has been unclear how DmORC binds origin DNA: which subunit does or does not directly contact DNA or what is the size of the DNA footprint of DmORC. An AFM study showed shortening of a linear DNA by ~130 bp upon DmORC binding. This length is similar to the 140-bp DNA shortening by the SpORC. Therefore, DmORC may also wrap around DNA, much like the SpORC (Clarey et al., 2006).
DmORC also is subject to CDK-mediated phosphorylation (Baldinger and Gossen, 2009). DmOrc1 and DmOrc2 each contain several phosphorylation sites. Hyper-phosphorylation at these sites does not affect the integrity of DmORC, but prevents DmORC from binding to DNA (Remus et al., 2005). This is different from the yeast ORC where phosphorylation does not appear to affect their DNA binding activity. At the single molecule EM level, the hyper-phosphorylated DmORC is structurally indistinguishable from the dephosphorylated version (Fig. 3) (Clarey et al., 2006). Therefore, it can be concluded that phosphorylation does not introduce substantial conformational changes in DmORC. To reconcile the phosphorylation-induced interference with the DNA binding, one could imagine that extensive phosphorylation will significantly alter the physicochemical property of the surface of DmORC, even in the absence of large structural changes. And this may interfere with DmORC interaction with the DNA.
The location of DmORC to the entire genome has been reported using a chromatin-immunoprecipitation assay and mapping the associated DNA fragments using microarrays or deep sequencing (Calvi et al., 2007, MacAlpine et al., 2004, MacAlpine et al., 2010, Spradling, 1999). This association occurs as cells exit mitosis (Baldinger and Gossen, 2009). DmORC is associated with regions of the genome enriched with the histone H3 variant H3.3 that is associated with transcribed regions and indeed DmORC is associated with transcription initiation sites (MacAlpine et al., 2010). The complex is also associated with origins of DNA replication that are amplified within a single cell division cycle during embryonic development, notably for the production of egg shell proteins in the follicle cells (Austin et al., 1999, Kim et al., 2011, Kim and Orr-Weaver, 2011, Xie and Orr-Weaver, 2008). Here, DmORC interactions with chromatin occur in regions enriched with the transcription factor E2F (Bosco et al., 2001, Royzman et al., 1999). The E2F transcription factor is associated with a larger complex called the Myb–MuvB (MMB)/dREAM complex that contains the Myb related protein that is also required for DNA replication and is also associated with developmentally regulated gene expression (Georlette et al., 2007, Beall et al., 2002). Thus ORC may associate with specific chromatin structures that differ in different cell types, although this aspect of ORC binding has not been well investigated.
Mutations in DmORC subunits cause defects in DNA replication as expected, but cells are also observed to arrest in mitosis, although this has been attributed to DNA damage as a result of incomplete DNA replication (Chesnokov et al., 2001, Loupart et al., 2000, Pflumm and Botchan, 2001). DmORC subunits, however, localize to centromeric heterochromatin and also bind the HP1 protein that is associated with heterochromatin (Badugu et al., 2005, Huang et al., 1998, Pak et al., 1997, Shareef et al., 2003, Shareef et al., 2001).
Human cells contain origins of DNA replication that are perhaps as many as, or even more than the number of genes in the genome; although their usage is not well understood (Falaschi et al., 2007). Only a little over a dozen origins have been studied with any depth and even these have not been well characterized. One of the first specific origins identified was the lamin B2 origin (Abdurashidova et al., 2000). The recombinant human Orc4 alone was shown to bind the lamin B2 origin of DNA in vitro and in the ATP-independent manner (Stefanovic et al., 2003), perhaps reminiscent of the SpOrc4 in the origin DNA binding capacity as an individual ORC subunit, although the human protein contains no discernible AT hooks. However it is unlikely that this biochemical interaction is functionally significant for DNA replication. Human ORC binds to the latent replication origin of Epstein-Barr virus in B cells where it is required for the maintenance of the EBV plasmid (Chaudhuri et al., 2001, Dhar et al., 2001b, Julien et al., 2004). ORC also has been reported to bind in HeLa S3 cells to intergenic AT-rich regions (Ladenburger et al., 2002) and to the DBF4 promoter locus that is an efficient replication origin (Romero and Lee, 2008). This origin of replication contains two initiation zones and two ORC binding sites that are approximately 400 bp apart. The two-zone origin appears to promote a novel mode of asymmetric bidirectional replication at the DBF4 origin. The recombinant and purified human ORC has intrinsic DNA binding activity with preference for AT-rich sequences (Vashee et al., 2003). The purified human ORC seems to be capable of promoting the initiation of DNA replication from any DNA sequence in vitro, with no preference for human origin sequences, when DNA is added to ORC-depleted Xenopus egg extracts (Vashee et al., 2003). This specificity of initiation of DNA replication reflects the lack of DNA sequence specificity for DNA injected into activated Xenopus eggs (Harland and Laskey, 1980) and may say more about the nature of the egg system then origin specificity in human cells.
The six subunits of human ORC were first identified by sequence similarity to their yeast counterparts ((Dhar et al., 2001a, Dhar and Dutta, 2000, Gavin et al., 1995, Siddiqui and Stillman, 2007, Tugal et al., 1998, Vashee et al., 2001); See also fig. 1). The assembly of ORC and the stability of the complex are both ATP dependent. In the absence of ATP, the full complex doesn’t assemble, and in the absence of ATP the assembled complex is so fragile that it can’t survive the glycerol gradient fractionation (Ranjan and Gossen, 2006). ATP can be replaced by ATPγS for the purpose of assembly or maintenance of structural integrity, suggesting ATP is a structural cofactor for ORC assembly. The assembly of human ORC is a stepwise process in vitro: First, Orc2 and Orc3 form a binary complex, then the binary complex recruits Orc5. The newly formed ternary complex subsequently recruits Orc4, forming the quaternary complex. Incorporation of Orc4 into the growing complexes is ATP dependent (Ranjan and Gossen, 2006, Siddiqui and Stillman, 2007). The Orc2-5 quaternary complex in turn recruits the Orc1. ATP binding by Orc1 is not essential for this step, but ATP binding of Orc4 is essential for assembly of both Orc1-5 and Orc2-5 (Siddiqui and Stillman, 2007). It is possible that Orc4 is physically located between Orc1 and Orc5, and ATP binding converts Orc4 into an assembly competent configuration with which Orc1 can interact from one side, and Orc5 interacts from the other. Mutations in ATP binding sites of Orc4 and Orc5 impair complex assembly. Thus human ORC is unique in that ATP is not only required for the function in replication initiation, but also in the assembly and stability of the complex. Although the ATP binding motif in Orc1, Orc4, and Orc5 subunits are important for replication activity (Giordano-Coltart et al., 2005), the ATPase activity of human ORC is largely contained in Orc1.
The HsORC is a very dynamic complex in vivo and indeed the period of the cell division cycle during which ORC exists as a complete complex may be temporally restricted to G1 phase (Kreitz et al., 2001, Mendez et al., 2002, Tatsumi et al., 2003). The HsOrc1 subunit is degraded at the G1 to S phase transition in a Skp2-ubiquitin ligase-dependent manner, only to be re-appear as cells enter mitosis (Mendez et al., 2002, Tatsumi et al., 2003). The Orc2, Orc3, Orc4, Orc5 and Orc6 subunits are displaced form chromatin as cells progress through S phase, but a heterodimer of Orc2 and Orc3 remain bound to the centromere and function during mitosis (Craig et al., 2003, Prasanth et al., 2004, Siddiqui and Stillman, 2007). Thus the HsORC is a very dynamic complex with respect to the cell division cycle. It has also been reported to be associated with centrosomes and there it controls the cyclin E-CDK2-dependent duplication of centrioles (Hemerly et al., 2009). Orc6 and Orc1 each contain their own NLS, and are targeted to nucleus independently of Orc2-5. Such a mechanism allows for the formation of different sub-complex for the different functions (Ghosh et al., 2011). Additional proteins other than ORC subunits, such as the WD-repeat protein ORCA (LRWD1) may enhance the human ORC binding to origins and facilitate pre-RC assembly (Bartke et al., 2010, Chakraborty et al., 2011, Shen et al., 2010, Vermeulen et al., 2010). The LRWD1 protein is interesting since it is associated with repressive marks on histone H3, such as H3K9 and H3K27 methylation, suggesting that the ORC may recognize specialized chromatin structures via histone modifications.
Overexpressing all subunits yields only Orc1-5 sub-complex, with Orc6 only loosely associated with the complex. The human Orc6 joins Orc1-5 only at the G1/M phase, and dissociates from Orc1-5 in the S phase. Orc6 contains a middle domain with a structure similar to the helical domain of the TFIIB transcription factor (Liu et al., 2011). Interestingly, this middle domain of Orc6 binds to dsDNA. Orc6 is shown to interact directly with Orc3, as result that is different to the interaction between S. cerevisiae Orc2 and Orc6 (Siddiqui and Stillman, 2007). Other than that, the subunit interaction pattern of the human ORC appears to be similar to that of the ScORC, suggesting a conserved ORC architecture across evolution.
A prominent feature of human ORC is the transient association of both Orc1 and Orc6 with the Orc2-5 quaternary core complex (Dhar et al., 2001a, Siddiqui and Stillman, 2007, Vashee et al., 2003, Vashee et al., 2001). In addition to sites that correspond to origins of DNA replication, ORC also binds to heterochromatin and interacts with the heterochromatin protein HP1 (Chakraborty et al., 2011, Duncker et al., 2009, Lidonnici et al., 2004, Prasanth et al., 2010, Wallace and Orr-Weaver, 2005). Human Orc1-5, together with HP1, localizes to heterochromatin, and may be involved in organizing higher order chromatin structure (Prasanth et al., 2010). Interestingly, the association of the Orc2, Orc3, Orc4 and Orc5 subunits at sites of heterochromatin are very dynamic, with a half-life of association in vivo of approximately 4-5 seconds. This dynamic association of ORC subunits is similar to the dynamic association of HP1 to heterochromatin. In stark contrast, the Orc1 subunit, once bound, is stably bound to heterochromatin, suggesting that this subunit has a distinct interaction with either DNA or HP1. Both the Orc1 and Orc3 subunits bind directly to HP1, perhaps explaining the different chromatin binding kinetics, but Orc1 must interact with other components of the chromatin other than HP1.
Crystal structures of archaeal ORC proteins in complex with DNA have provided tantalizing clues about how the eukaryotic ORC may interact with DNA (Berquist and DasSarma, 2003, Dueber et al., 2007, Duncker et al., 2009, Gaudier et al., 2007, Grainge et al., 2003, Wigley, 2009). But clearly, the crystal structure of ORC is a critical missing piece of information that when available will greatly advance the field. Many eukaryotic ORC have been expressed in various heterologous systems and then purified. It is hoped that the next several years may see a series of crystal structures of the eukaryotic replication initiators, likely in the form of individual ORC subunits or as the stable sub-complexes. However, given the large size, the dynamic nature, and interaction with extended stretch of DNA sequences, the structure of the entire ORC, and its further assembly with other replication factors, such as the replicative helicases, could be exceedingly difficult for such crystallographic approaches. We anticipate that single particle cryo-EM will continue to play an important role in elucidating the molecular mechanism of eukaryotic replication initiation. Indeed, the anticipated crystal structures of the individual subunits or sub-complexes will facilitate the interpretation of cryo-EM studies of the various replication initiation complexes, including ORC, Cdc6, Cdt1 bound to MCM2-7 and the MCM2-7 double hexamer. And without a doubt, EM and crystallographic observations will raise deeper mechanistic questions that will prompt more specific biochemical experiments.
Research in the authors’ labs is supported by grants from the National Institutes of Health, particularly CA13106 and GM45436.
Huilin Li, Department of Biochemistry and Cell Biology, Stony Brook University, Stony Brook, NY 11794, USA, And, Biology Department, Brookhaven National Laboratory, Upton, NY 11973, USA, Email: vog.lnb@ilh, Tel: 631-344-2931, Fax: 631-344-3407.
Bruce Stillman, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA, Email: ude.lhsc@namllitsb, Tel: 516-367-8383.