Keywords

A Brief History

The ORC was identified in Saccharomyces cerevisiae (herein referred to as ScORC) as a stable, six-subunit heteromer that binds to an essential element within budding yeast origin DNA [1]. This element includes a conserved sequence called the ACS, for ARS consensus sequence, because it can be found in all chromosomal fragments that can provide for autonomous replication of bacterial plasmids in budding yeast [2, 3]. The ACS is required for yeast origin function in both plasmid and native chromosomal contexts [48]. Thus by analogy to the prokaryotic cellular initiator dnaA binding to the E. coli chromosomal origin, oriC, and several viral initiators binding to their cognate sites, there was strong reason to hypothesize that the ACS was a prime target for the binding of the eukaryotic initiator. The importance of having a defined, functional sequence linked to origins cannot be overemphasized—in no other eukaryote were origins defined at a level to be useful for this type of direct biochemistry and this fact remains true. The identification of ScORC as a protein complex that protected the ACS, by a DNaseI footprinting assay, and required it for DNA binding therefore provided compelling evidence that it was, or was strongly linked to, the eukaryotic initiator that functioned directly at the DNA replication origins of the cellular chromosomes of eukaryotes. ScORC was also intriguing because it bound ATP, indicating another similarity to previously characterized bacterial and viral initiators (reviewed in [9]). Following ScORC’s biochemical discovery, temperature-sensitive mutant alleles of two ScORC genes, ScORC2 and ScORC5, were isolated in forward genetic screens, allowing for their rapid cloning [1012]. ScORC6 was identified and cloned using a one-hybrid protein-DNA interaction assay, and the remaining yeast ORC genes were cloned using degenerate primers designed from protein sequence [13, 14]. Orthologs of each of the six ScORC genes have since been identified in many other organisms, and sequence and structural information about ORC subunits from archeal and eukaryotic model organisms, as described in the following sections, extend the analogy between ORC and other initiators of DNA replication to the structural level (discussed in several reviews [1517]).

Primary and Secondary Structural Features of ORC Subunits

The ORC is a complex comprised of six distinct subunits. To understand how these individual subunits interact to form a functional ORC, this section will first consider what is known about some of the key primary and secondary features of the subunits. Five of the six ScORC subunits, Orc1-5 contain a region with substantial sequence similarities to the defining domain of the AAA+ family of ATP-binding and hydrolysis proteins and these subunits will be discussed together [16, 18]. Orc6, which has a distinct structure, will be discussed separately.

The primary structure of the five AAA+ ScORC subunits is diagrammed in Fig. 9.1a. Each subunit is named based on its relative molecular weight, so Orc1 is the largest subunit and Orc6 is the smallest subunit of ORC. This size differential is generally conserved, though in metazoans Orc3 is similar in size or slightly larger than Orc2, and metazoan Orc6 is considerably smaller than ScOrc6. Orc1-5, and particularly Orc1, share sequence similarity with Cdc6, a seventh AAA+ protein that joins with ORC in G1-phase to form the active AAA+ oligomeric core bound to origin DNA. It is this ORC-Cdc6-DNA complex that functions to load the MCM complex [18].

Fig. 9.1
figure 1

(a) Schematics of primary structure of S. cerevisiae Orc1-5 subunits adapted from [16]. Perfects matches to Walker A (WA) and B (WB) motifs are colored in black. Deviations from consensus motif signatures of the AAA+ domain are colored in grey. (b) Schematic of the AAA+ secondary fold for the Initiator clade adapted from [19]. WA and WB motif positions indicated in red, and the Initiator Specific Motif (ISM) (two similarly sized alpha helices following beta strand 2) is outlined in red. (c) Alignments of the WA and WB motifs of Orc1, Orc4, and Orc5

The AAA+ Domain

Because the AAA+ domains of Orc1-5 subunits and Cdc6 together form the active MCM complex loading machine, a discussion of this domain in general and its variation within individual Orc subunits is warranted. AAA+ (ATPases Associated with diverse cellular Activities) proteins define a superfamily of proteins that share features of ATP binding and hydrolysis that allow them to participate in a vast array of cellular processes [18]. Both primary and secondary structural characteristics define this domain [19]. Key primary characteristics include the Walker A and Walker B motifs, associated with ATP binding and hydrolysis, respectively (Fig. 9.1). The key secondary characteristic is the core sheet of parallel beta strands arranged in a 51432 topology (Fig. 9.1b). While Orc1-5 share secondary and primary features with the AAA+ family, there is variation among these subunits with respect to the catalytic core motifs within the Walker A and Walker B boxes both within and between species [18] (Fig. 9.1c). In budding yeast, only two subunits, ScOrc1 and ScOrc5, contain perfect matches to the Walker A signature (GXXXXGK[T/S]), consistent with the experimental finding that only the Orc1 and Orc5 subunits of ScORC bind ATP [20]. In metazoans, a perfect match to the Walker A signature can also be found in Orc4. Metazoan Orc1, Orc4, and Orc5 each bind ATP (Figs. 9.1c and 9.2) [21, 22].

Fig. 9.2
figure 2

Alignments of the AAA+ domains of Orc1, Orc4, and Orc5: Positions of relative amino acids are indicated. For Orc1 the beta strands are numbered to aid in comparison to Fig. 9.1b

While ScOrc1-5 each contain a recognizable AAA+ domain, and both ScOrc1 and ScOrc5 can bind ATP, only ScOrc1 contains a perfect match to the Walker B signature, hydrophobic-hydrophobic-DE (hhDE). Consistent with this observation, Orc1 is the subunit within ScORC that hydrolyzes ATP and in fact is the major ATPase of ORC in all eukaryotes examined. Interestingly, Orc2 contains a highly conserved sequence at the predicted position of the Walker A motif, though it does not match the canonical Walker A motif, suggesting that it performs an important but as yet undefined function of ORC [18]. Orc3 is the most diverged of the Orc1-5 subunits [16, 18]. In summary, experimental work, including a recently solved crystal structure of Drosophila melanogaster ORC (DmORC), and sequence conservation provide strong evidence that the AAA+ domains of the Orc1-5 subunits drive the assembly of a functional complex [23]. The fundamental role of ATP binding and hydrolysis by ORC that is essential to MCM complex loading onto chromosomal DNA is performed by the Orc1 subunit.

Initiator Specific Motif (ISM) and Winged-Helix (WH) Domain

Fundamental to ORC’s catalytic role in loading the MCM complex is its ability to directly contact double-stranded DNA. By binding DNA, ORC serves to position it for MCM complex loading. Two lines of evidence reveal that these ORC-DNA contacts, which do not necessarily play a role in the selection of specific chromosomal sites by ORC (see below), are likely performed by two motifs present in the Orc1-5 subunits. One motif is the ISM that is part of the AAA+ fold in these subunits (Fig. 9.1b). Specifically, the Orc1-5 subunits are members of the DnaA/Cdc6/ORC Initiator clade within the AAA+ superfamily whose defining feature is the presence of an extra helical insertion, resulting in two similarly sized alpha helices after strand 2 [9, 10, 19] (Fig. 9.1b). In dnaA, the bacterial initiator, the ISM forms a wedge structure that helps push the subunits of the dnaA homo-oligomer into a spiral that binds single-stranded DNA (ssDNA) and thus contributes to origin DNA melting [9, 24, 25]. In an archeal Orc1 (S. solfataricus) the ISM forms a different, more parallel arrangement that contacts double-stranded DNA [26]. It has been proposed that the eukaryotic Orc1-5 subunits contain similar ISMs that adopt a conformation in which each subunit ISM interfaces with the duplex DNA (dsDNA) substrate [9, 26, 27]. This conformation may be adapted for the role of ORC/Cdc6 in stabilizing both the DNA substrate and the MCM helicase for loading onto double-stranded DNA [28]. Notably, the recently solved crystal structure of DmORC, which will be discussed more in sections that follow, and modeling using the S. solfataricus Orc1-1 structure, reveals a central channel through the DmORC that forms from the ISMs of Orc2-5 and the β-hairpin wings of the WH domains of Orc1 and Orc3-5 that can fully accommodate dsDNA [23].

It is worth noting here that in vitro and in the absence of ATP, ScORC can bind single-stranded DNA (ssDNA) and does so in a distinct conformation, indicating that there are forms of ScORC that favor ssDNA binding [29]. However, no biological role for this interaction has been demonstrated and current models do not predict that such an activity is necessary for the MCM complex loading reaction [3032].

The second motif likely involved in catalytically relevant DNA contacts (i.e., regulation of ATPase activity; regulation of MCM complex loading) is the winged-helix (WH) DNA binding domain. The Orc1-5 subunits contain regions predicted to form winged-helix (WH) DNA binding domains C-terminal to their AAA+ domains (Fig. 9.1a). The WH domains, predicted based on a secondary structural signature, can be found in multiple Orc orthologs, including archeal Orc1/Cdc6 orthologs. For archeal Orc1s, the WH domains participate in extensive interactions with DNA and contribute to sequence-specific binding, although an unexpectedly small number of protein side-chain-DNA-base contacts suggest that context or deformability plays a larger role in origin specificity than sequence [26, 27, 33]. Regardless, these observations led to the obvious hypothesis that the WH domains of eukaryotic ORC subunits contact DNA in a similar manner to archeal Orc1 that forms a lobster claw-like structure, with the ISM and WH domains forming the two halves of the claw that bind DNA. However, the WH domains of eukaryotic ORCs have not been studied in great detail, perhaps because they do not appear to contribute substantially to ORC-origin specificity across species. Interestingly, while the isolated WH domains of human Orc1 (HsOrc1) bind DNA, the WH domain of ScOrc1 does not but instead forms oligomers in solution [34]. It is posited that the WH domain of ScOrc1 may have evolved instead for protein-protein interactions important to the mechanism of transcriptional silencing. After the genome duplication in budding yeast, the SIR3 gene, an ORC1 paralog, evolved for dedicated transcriptional silencing, and indeed the Sir3 WH domain is important for both Sir3 oligomerization and silencing in yeast [34, 35]. Nevertheless, these observations indicate that a WH domain of an ORC subunit can function in mediating protein-protein interactions. The recent crystal structure of DmORC reveals that the WH domains of the Orc1-5 subunits are integral to the stability of the complex because they mediate extensive contacts with the AAA+ domains on adjacent subunits [23]. In fact these contacts rely on the canonical DNA recognition helices of the WH domains interacting extensively with the AAA+ domains and thus made unavailable for DNA binding.

Other DNA Binding Domains in ORC

The ORC-DNA contacts mediated by the ISM and WH domains discussed above are likely conserved and critical for the catalytic role of ORC in loading the MCM complex onto DNA. However, ORC is also important for selecting specific positions on chromosomes to serve as MCM complex loading sites. It is likely that for most eukaryotic ORCs, with the possible exception of budding yeast, that these ORC-DNA contacts are quite distinct from the contacts discussed above that are intimately connected to ORC’s mechanism in MCM complex loading. In addition, it appears that these types of contacts are not well conserved. For example, S. pombe Orc4 (SpOrc4) contains a distinct N-terminal extension with nine AT-hook motifs important for ORC-origin binding and replication function in vivo, but the binding mediated by the AT-hook appears to be separable from tighter ORC-DNA interactions needed for ORC’s catalytic function [36, 37]. ScORC is the only eukaryotic ORC that shows sequence-specific DNA binding to origins in vitro and this binding, but not nonspecific DNA binding, can regulate the Orc1 ATPase activity [20]. Thus for ScORC, there appears to exist a connection or overlap between ORC-DNA interactions critical for ORC’s selection of chromosomal origins and ORC-DNA interactions with functional roles in MCM complex loading. However even in budding yeast, recent in vitro studies indicate that the specific sequence of yeast origin DNA, that is the conserved ORC binding site, is not intrinsically essential to ORC’s ability to establish functional replication origins [38, 39]. Regardless the mechanism by which particular ORC-DNA interactions function “catalytically” in the MCM complex loading reaction and their relationship to ORC-origin interactions required primarily for localizing ORC to appropriate chromosomal regions for this reaction to occur remain open questions. These questions have been raised to a new level of interest by the crystal structure of DmORC that suggests a stable form of the ATP-bound complex exists in an auto-inhibited form that prevents ORC from engaging with the double-stranded DNA in a catalytically useful way [23]. In particular, perhaps ORC-DNA, or ORC-chromatin contacts important for origin selection by ORC help relieve this auto-inhibited form of ORC such that it can encircle dsDNA.

For metazoans, the most prominent models for origin selection by ORC invoke specific interactions between certain types of chromatin structures and ORC, as described below and later in this review. Thus the prominent model is that origin selection mechanisms in metazoan cells do not involve substantial and direct sequence-specific contacts between ORC and origin DNA. However, a recent study of HsORC-DNA binding challenges an overly rigid take on this model [40]. In particular, HsORC, via discrete region of the Orc1 protein N-terminal to the AAA+ domain, shows in vitro binding preference for G-quadruplex (G4)-preferable G-rich ssDNA or RNA. Notably this observation is not the first example of ORC-origin interactions involving nucleic acids other than dsDNA [4144]. However, what is particularly significant about these biochemical observations is that G4-preferable elements are common features in human DNA replication origins [4547]. In summary there appear to be multiple acceptable mechanisms for localizing ORC to chromosomal regions in a way that will allow for catalytically relevant ORC-DNA contacts to form subsequently that direct the MCM complex loading reaction. It is possible that these differing selection mechanisms influence ORC dynamics with the DNA in a manner that is relevant to the efficiency of MCM complex loading, as discussed later.

Orc1 Bromo Adjacent Homology (BAH) Domain

Many observations suggest that metazoan ORC relies on specific contacts with chromatin to select and bind to the chromosomal positions that will serve as MCM complex loading sites. The largest subunit of Orc1 contains a BAH domain, a protein module found in many nuclear proteins that likely functions, at least in part, through direct interactions with nucleosomes [4851]. This domain is defined primarily at the level of secondary structure. While it can be found in most Orc1 orthologs, from yeast to humans, there is limited conservation between yeast and metazoan Orc1BAH domains at the sequence level (Fig. 9.3). Nevertheless accumulating evidence supports a role for the Orc1BAH domain in ORC-chromatin interactions in both metazoans and yeast.

Fig. 9.3
figure 3

(a) Schematic of the H3K20me2 peptide interactions with the Orc1BAH domain adapted from [51]. (b) Alignments of the Orc1BAH domains from the indicated species, asterisks refer to key residues involved in forming the hydrophobic cage that binds K20me2 on histone H3 peptide

In human cells the Orc1BAH domain contributes to re-binding of Orc1 to chromosomes in G1-phase and the replication of a plasmid that depends on the Epstein-Barr virus oriP [52]. The metazoan Orc1BAH domain binds a histone H3 N-terminal peptide dimethylated on lysine 20 (H3K20me2) [51] (Fig. 9.3). Mutations that cause defects in this interaction reduce ORC binding to an origin in vivo and cause growth defects in a zebrafish model that mimic Meier-Gorlin Syndrome (MGS), a form of human dwarfism associated with defects in several proteins that function in pre-RC assembly [53]. These data provide a direct link between ORC and specific chromatin modifications relevant to ORC-origin binding. More recent studies demonstrate that the MGS -associated substitution in the Orc1BAH domain, R105Q, reduces an Orc1BAH-nucleosome interaction in vitro [54]. All together, these data provide evidence that a specific metazoan Orc1-BAH-nucleosome interaction is important for ORC-chromatin binding and ORC’s replication function in metazoans.

The paradigm for ScORC-origin binding is that interactions between ORC and specific sequences within yeast origin DNA are key determinants. While this model is generally correct and contributes predictive power with respect to identifying yeast origins, accumulating evidence reveals that ORC-chromatin interactions likely play a positive role in yeast ORC-origin binding just as they do in metazoans [55, 56]. While the yeast Orc1BAH domain does not bind H3K20me2, it does bind chromatin [50]. Given recent structural studies of the similar Sir3BAH domain bound to nucleosomes, it is likely that the yeast Orc1BAH domain binds nucleosomes directly [48]. In addition, while a yeast mutant lacking the Orc1BAH domain (i.e., ORC-bahΔ) shows no growth defects, ORC association with chromatin is reduced, and multiple individual origins show substantially reduced ORC binding and defects in activation in vivo [14, 56]. Thus the ScOrc1BAH domain is important for ORC-origin binding, but ORC-origin binding capability must exist in excess of what is required for yeast cell viability. Consistent with this interpretation, viable mutations in genes encoding proteins that participate in pre-RC assembly are synthetically lethal when combined with an orc1bahΔ mutation [56]. However, no specific ScOrc1BAH-histone interaction has been reported, and ScORC lacking an Orc1BAH domain can still bind origins in chromatin in vitro, though this result may indicate that the appropriate ScOrc1BAH-dependent nucleosome configuration or modification is currently missing from the in vitro system and/or that other regions of ScORC may contact chromatin to facilitate origin binding [5658]. While the specifics of the molecular interactions may differ, these observations provide evidence that the Orc1BAH domain is a conserved chromatin-binding module from yeast to humans important for ORC-origin binding.

The Orc6 Subunit

Orc6 is the only subunit of ORC that is not an AAA+ protein, and thus it must use distinct mechanisms to incorporate into ORC. While yeast Orc6 co-purifies as a stoichiometric component of ScORC, it is not essential for ScORC-origin DNA binding [1, 59]. However, experiments using a conditional yeast Orc6 mutant provide strong evidence that Orc6 is required in ORC for stable MCM complex loading and DNA replication [60]. This function may relate to the ability of Orc6 to bind to Cdt1, an MCM chaperone needed for stable loading of the MCM complex onto DNA in the presence of ATP, though data are conflicting with respect to the precise role of this interaction [60, 61]. While in yeast Orc6 is the only subunit dispensable for ScORC-origin binding, metazoan Orc6 is required for ORC-origin binding as well as replication [62].

In comparison to the AAA+-domain-containing ORC subunits, Orc6 is less conserved between yeast and metazoans [16]. However, a recent study presents revised Orc6 sequence alignments to reveal greater similarity in Orc6 domain structure between yeast and metazoans than previously thought [63]. In addition, experiments show that yeast and metazoan Orc6 actually use a similar mechanism for binding to ORC [41].

Orc6 is conserved among metazoan species and contains three distinct domains. Two of these domains are similar to regions in Transcription Initiation Factor II B (TFIIB, domains A and B, Fig. 9.4a) [62]. The third region of similarity occurs in an otherwise novel C-terminal domain (Orc6 CTD). Importantly, similarity to all three of these regions can also be located within ScOrc6, once a large insertion between the TFIIB domains is correctly accounted for [41]. The region of highest conservation within the Orc6 CTD contains a missense substitution in human Orc6 (Y232S) associated with MGS. The corresponding substitution in DmOrc6 (Y225S) reduces Orc6 binding to Orc1-5 in vitro and MCM loading in vivo [63]. This metazoan Orc6-(Orc1-5) interaction is mediated by an interface between a highly conserved C-terminal region of Orc6 and a distinct region of Orc3 (Fig. 9.4b). These results align well with data that show that human Orc6 and Orc3 interact [64, 65]. While these data are consistent with the human Orc6 MGS-mutant’s (Y232S) ) effects on DNA replication causing MGS-associated phenotypes, it must be noted that Orc6, including the Orc6CTD (and other subunits of ORC), has additional nonreplicative functions in metazoans that may also contribute to MGS phenotypes [6674].

Fig. 9.4
figure 4

(a) Schematics of the primary structure for DmOrc6 and ScOrc6 adapted from [63]. Region of the Orc6 CTDs that contain the “MGS” implicated Y residue and contact Orc3 are highlighted for Homo sapiens, Drosophila melanogaster, and Saccharomyces cerevisiae. (b) Cartoon models depicting the basic subunit organization of ScORC and DmORC as adapted from [63]. These models are based on the analyses of EM structures as described in the text [63]; a recent crystal structure of DmORC revised the order of the subunits from Orc1-Orc4-Orc5-Orc2-Orc3 in the AAA+ core to Orc1-Orc4-Orc5-Orc3-Orc2, and shows that ATPγS bound DmORC forms a ring with an enclosed central channel that could accommodate dsDNA [23]

ScOrc6’s interaction with the Orc1-5 AAA+ core complex appears to involve both yeast-specific features and metazoan-like features. Independent studies provide evidence that ScOrc6 interacts with ScOrc2 or ScOrc3 to incorporate into ORC. In particular the C-terminal region of ScOrc6 binds ORC through an interaction with ScOrc3, but ScOrc6 also binds ScOrc2 under conditions where it cannot bind ScOrc3 [60, 75]. A ScOrc6 mutant carrying the MGS substitution (Y418S) abolishes an Orc6-Orc3 interaction in vitro, but not ScOrc6’s interaction with the Orc1-5 AAA+ core [63]. Together these studies suggest that ScOrc6 contains at least two independent interfaces that help it incorporate into ScORC, a conserved Orc6 CTD that binds ScOrc3 and a yeast-specific region that binds ScOrc2 (Fig. 9.4b). Nevertheless, these studies identify strong similarities between Orc6 and its structure, both alone and in the context of ORC, between yeast and metazoans. Interestingly, the yeast-specific insertion plays an important functional role in yeast ORC, as it contains cyclin-dependent kinase (CDK) sites and an S-cyclin binding motif, both of which help regulate ORC’s activities in yeast [76, 77]. While the core structure and mechanical functions of ORC are probably highly conserved from yeast to metazoans, strategies to regulate its activity appear to vary considerably (e.g., [64]).

Architecture and Conformational States of the ORC

As is true for many AAA+ proteins, the Orc1-5 subunits acquire their functionality by coming together in an oligomeric complex, ,with the ATP-binding pockets of individual subunits participating in the subunit-subunit interfaces [19]. The arrangement of the ATP-binding pockets at subunit interfaces plays a role in ATPase activity of AAA+ oligomeric complexes [78]. In particular, X-ray crystal structures of AAA+ oligomers suggest an arginine of one subunit, referred to as an “arginine finger,” interacts with the ATP-binding pocket of the neighboring subunit forming interactions with the bound ATP, promoting ATP hydrolysis [78]. ScOrc1 and ScOrc4 follow this paradigm, with the R267 residue of ScOrc4 being essential for ATP hydrolysis by ScOrc1 in ORC [79]. Interestingly, while both EM structures of ScORC and DmORC and the recent crystal structure of DmORC reveal that Orc1 and Orc4 are adjacent subunits, consistent with these experimental findings (Fig. 9.4b), the recent crystal structure of DmORC indicates that the arginine finger of DmOrc4 is 40 angstroms away from nucleotide binding cleft of DmOrc1. This and additional observations strongly suggest that DmOrc1 must undergo a large conformational shift so that DmOrc4 can trigger its ATPase activity [23]. Such changes could be achieved in part by ORC binding to DNA and/or other proteins that participate in the MCM complex loading reaction, including Cdc6 and MCM itself. In addition the DmORC crystal structure suggests that ATP binding and hydrolysis by DmOrc4, regulated by an arginine finger from the neighboring subunit of DmOrc5, may execute important DmORC functions in some organisms, consistent with metazoan Orc4s containing both consensus Walker A and B motifs. Because ScOrc4 lacks a Walker A motif, whatever this role of metazoan Orc4 may be, it is not essential in budding yeast and therefore likely more critical for regulation than core mechanism.

Substantial efforts to define the structures of ScORC and DmORC by electron microscopy (EM), and more recently an X-ray crystal structure of DmORC, have generated images of ORC that provide clues about how it may function in the MCM complex loading reaction [18, 23, 28, 63, 75, 80, 81]. Single-particle reconstructions of negative-stained EM complexes in which the heterologous Maltose Binding Protein is fused to individual Orc subunits allowed the subunit organization within ScORC and DmORC to be determined with impressive accuracy, though the recent crystal structure of DmORC reveals that the subunit arrangement of Orc2-Orc3 is actually Orc3-Orc2 [23, 63, 81] (Fig. 9.4b). Regardless, all together these studies indicate that both the subunit organization and basic shape of ScORC and DmORC are similar, as predicted based on the fundamental role ORC plays at DNA replication origins (Fig. 9.4b) [63].

For DmORC, high levels of ATPγS were required to provide the level of resolution necessary to locate the subunits in the EM images and make a useful comparison to ScORC, suggesting that the ATP-bound DmORC forms a more stable and homogenous conformation. Based on EM analysis the ORCs from both organisms form a two-lobed structure in which roughly one half contains the subunits with ATP-binding and/or hydrolysis capabilities in the following order: Orc1-Orc4-Orc5, and the other half contains the remaining subunits in the following order: (Orc2-Orc3)-Orc6 [63] (Fig. 9.4b). The crystal structure of DmORC shows that ATP-bound ORC reveals a ring structure with a clear central channel for duplex DNA and, as mentioned above, revises the placement of Orc2 and Orc3 within ORC1-5 to Orc1-Orc4-Orc5-Orc3-Orc2 [23]. Interestingly, this structure provides no entry point for duplex DNA, suggesting metazoan ORC must undergo conformational changes even in the ATP-bound state to allow for DNA binding. There are two distinct tiers to DmORC, with one being formed by both canonical and noncanonical interactions between adjacent AAA+ domains and the other being formed by WH domain interactions between the AAA+ domains of neighboring subunits. These interactions lead to the WH domains forming a “collar” that is offset from the AAA+ ring. It is the flexible linker regions between the AAA+ and WH domains on most subunits that allow for these interactions to occur in this way. While there is as yet no reported crystal structure for ScORC, and potentially important differences may exist between DmORC and ScORC, the overall similarity in shape and organization derived from completely independent EM studies is substantial and comforting. In addition, all of these data are consistent with previous examinations of ORC organization as well as subunit-subunit interactions in these and other systems, including human ORC [59, 64, 65, 82, 83].

EM structural studies to address possible conformational changes in ORC have been performed with ScORC and include examining the effect of relevant ScORC partners, including Cdc6 and origin DNA and, most recently, a partially assembled putative intermediate in the MCM loading reaction, the OCCM (for ORC-Cdc6-Cdt1-MCM) on ORC structure [18, 28, 75, 81]. In each of these situations ORC’s structure is determined in the presence of ATPγS to stabilize intermediates. The ORC-dependent MCM loading machine consists of ORC and Cdc6 together on origin DNA, such that the loading complex contains six related AAA+ protein subunits and Orc6. In yeast, ORCs remain bound to most origins throughout the cell cycle, and Cdc6 joins ORC only in G1-phase when the MCM complex is loaded onto chromosomal DNA [84, 85]. In vitro, Cdc6 preferentially binds ORC when ORC is bound to origin DNA, and upon doing so creates an extended ORC-Cdc6 DNaseI footprint that includes sequences adjacent to the ORC binding site but not required for ORC binding per se [18]. In fact this extended footprint contains an origin element required for MCM complex loading and looks similar to the extended pre-RC (G1-phase) footprint initially defined at yeast origins by in vivo footprinting [8587]. Together these observations suggest that distinguishing features of the in vivo G1-footprint result from the ORC-Cdc6 complex participating in MCM complex loading, rather than the loaded MCM complex itself. The formation of this ORC-Cdc6-origin DNA complex, including the diagnostic extended DNaseI footprint, requires ATP and a functional ORC binding site, that is defined as an ORC site that can support origin function in vivo and not merely ORC binding in vitro [18]. ORC-1A is an ORC containing a defective Orc1 subunit (a K485A substitution in Orc1). ORC-1A is lethal in vivo and does not support formation of the diagnostic ORC-Cdc6-origin DNA complex in vitro, providing evidence that this distinctive ternary complex is functionally relevant to the essential MCM complex loading reaction [18, 20].

A single-particle cryo-EM-derived structure of the ScORC-origin DNA and ScORC-Cdc6-origin DNA complexes in the presence of ATPγS indicates that conformational changes in ORC accompany these assemblies [75]. First, ORC binding to origin DNA stabilizes the ORC structure considerably and induces a rotation of the Orc1-Orc4-Orc5 region relative to the rest of ORC. Second, upon Cdc6 binding, additional conformational changes in ORC are observed, including a shift in the Orc1 N-terminal domain, which contains the nucleosome-binding BAH module. This shift may accommodate new Cdc6-Orc1 contacts. Orc6 is also rearranged in the complex upon binding Cdc6 such that it juts out of the central portion of the ring’s surface and comes into contact with Orc1, which is supported by additional Orc1-Orc6 interaction data. This change is proposed to position Orc6 to help recruit the MCM complex via direct contacts with MCM-bound Cdt1. Recent studies with purified proteins suggest that this interaction is not intrinsically essential for recruitment of MCM to ORC-Cdc6-DNA, though both Orc6 and Cdt1 are required for a complete reaction that culminates in stable loading of the MCM complex [60, 61]. These major conformational changes to Orc1 and Orc6 within ScORC observed from the EM studies are consistent with earlier limited protease digestion patterns of ORC compared to ORC-Cdc6 [88]. The bottom line is that ScORC conformations are altered considerably by nucleotide binding, origin DNA, and the key protein partner Cdc6.

More recently, ScORC structure by cryo-EM has been examined in a trapped and putative intermediate in the MCM complex loading reaction on origin DNA termed the OCCM for ORC-Cdc6-Cdt1-MCM [28]. The formation of this intermediate is accompanied by additional changes in the conformation of ORC-Cdc6 in which the relatively flat complex changes to a more dome-like shape, with the concave face directed toward the Cdt1-MCM complex. While the initial interpretation of this structure was that the AAA+ domains reach toward the MCM subunits within the Cdt1-MCM complex, the recent model based on the crystal structure of DmORC suggests that the WH collar on ORC contacts the AAA+ ring on the MCM [23]. Based on the EM structural analysis of the OCCM, the movement of ScOrc6 from the middle of the ScORC-Cdc6 complex seems to be essential to allow for the binding of the Cdt1-MCM complex and to help promote additional transitions in ScOrc3 and ScCdc6. Interestingly, one outcome of these transitions is that Orc4 and Orc1 come into closer contact, consistent with the notion that these conformational changes promote Orc4’s ability to stimulate ATP hydrolysis by Orc1, a reaction essential for efficient MCM complex loading in vitro as well as cell viability [79]. Together with the crystal structure of DmORC that shows DmOrc4 too far away from DmOrc1 to stimulate Orc1 ATPase, these data suggest that ORC’s catalytically important ATPase activity is highly regulated by steps in the MCM complex loading reaction. Another conformational change is that the ORC-Cdc6 complex forms a right-handed spiral structure within the OCCM that has a 34 angstrom helical rise from bottom (Orc3) to top (Cdc6), matching the helical pitch of B-form dsDNA. Thus, an attractive hypothesis is that these matching structures allow ORC-Cdc6 to hold the DNA substrate for loading of the MCM complex [28].

As an AAA+ initiator protein, the observed conformational changes in ORC in response to ATP binding and interactions with DNA and other proteins would be predicted to drive the mechanical work that culminates in the loading of a replication-competent MCM complex onto DNA. The challenge, of course, is to link these conformational states to discrete biochemical steps in the MCM loading reaction, and currently a strong opportunity for meeting this challenge exists in the yeast system where the MCM complex loading reaction, and most recently the entire origin reaction, from ScORC binding to origin activation, can be reconstituted with recombinant proteins [61, 8992]. While it makes sense that the structural changes described to date cause the changes in protein-protein and protein-DNA interaction affinities necessary to drive the G1-phase MCM complex loading cycle, precisely how ORC’s conformational states fit into this scheme remains an area of intense research and debate [93, 94]. The recent reports about both ScORC in the OCCM and the high resolution crystal structure of DmORC described above raise the possibility that the MCM complex loading step(s) promote ORC-Cdc6 architectural changes that modulate both the ORC and Cdc6 ATPases. Interestingly, recent studies show that the ATP hydrolysis by subunits of MCM complex substrate itself is required for MCM complex loading [89, 90]. Thus, while there may be an intrinsic rate of ATP hydrolysis by ATP-bound ORC-Cdc6 on DNA that could conceivably function in proofreading (i.e., dissolution of any incomplete MCM complexes that may form) as proposed, perhaps another and not mutually exclusive possibility is that a fully loaded MCM complex (i.e., double hexamer of MCMs) signals its “maturation” to its ORC-Cdc6 loading complex and promotes ORC-Cdc6 ATPase activity and release from DNA [61]. In this model, a loaded MCM complex is made competent for S-phase activation (i.e., now released from ORC-Cdc6 and free to move away for the “loading site” on double-stranded DNA) and, at the same time, ORC and Cdc6 are “reset” such that they can now participate in another independent MCM complex loading reaction at the same or another origin [79]. The end result would be a cycle that could continue to load MCM complexes onto chromosomes through G1-phase as long as Cdt1-MCM complexes were available. For a cycle to work efficiently and to load multiple MCM complexes, a “mature” MCM complex would have to move out of the way to allow another loading event. This movement on double-stranded DNA is not hard to envision given the demonstrated passive sliding of the MCM complex on naked DNA in vitro and the dynamic behavior of nucleosomes observed near origins [30, 32, 95]. The bottom line based on several lines of accumulating evidence, in terms of ORC, is that ORC must bind and release its origin DNA substrate to establish one complete MCM complex loading reaction. Mechanisms that facilitate re-binding of released ORC would promote multiple MCM complex loading reactions in a given G1-phase. The recent models based on the crystal structure of DmORC raise the possibility that origin binding dynamics could be regulated by the rate that ATP-bound DmORC is able to alter its conformation to access double-stranded DNA (i.e., the “on” reaction) [23]. Specific interactions between ORC subunits and chromatin structures neighboring origins are suggested as one possible mechanism for achieving such a conformational alteration.

Roles for ATP Binding and Hydrolysis by ORC

The specific roles for ATP binding and hydrolysis events catalyzed by ORC have been studied extensively in yeast using genetic tools, purified proteins and extracts. In one basic approach, a defined mutant form of a ScORC subunit is generated. The mutant subunit is then expressed in vivo as the sole source of that subunit and/or as a component of an overproduced ORC (i.e., where all subunits are overproduced to make an excess of the desired mutant ORC complex). In addition, the mutant subunit is reconstituted with the remaining ScORC subunits in an expression system (Baculovirus infected Sf9 cells or a yeast strain optimized for protein expression and purification) to facilitate purification of a defined mutant form of ScORC. The biochemical functions of the mutant ScORC are then compared to that of a wild-type ScORC using a variety of in vitro assays. Similar in vitro expression approaches can be used to study metazoan ORCs in vitro as well, as mentioned below.

Because only Orc1 and Orc5 were shown to bind ATP within ScORC, an early study addressed their ATP-binding functions in the context of ORC by generating mutant versions of each of these subunits in which their Walker A motifs were altered to abolish their ATP-binding function [20]. Two distinct mutant forms of ScORC can be assembled that either cannot bind ATP at the Orc1 subunit (ScORC-1A) or cannot bind ATP at the Orc5 subunit (ScORC-5A). ScORC-1A but not ScORC-5A cannot bind a yeast origin in vitro, indicating that ATP binding by ORC via the Orc1 subunit but not the Orc5 subunit is essential for specific ScORC-origin binding. In addition an orc1-A mutant allele fails to complement an orc1Δ mutation in vivo, while an orc5-A mutant allele complements an orc5Δ mutation. Interestingly, the biochemical role of ATP binding by ScOrc5 remains unknown, even though ATP binding by this subunit is conserved. A clue comes from a genetic screen to identify mutant alleles that enhance gross chromosomal rearrangements (GCRs) in yeast that isolated orc5-70, an allele that encodes a G42R substitution in ScOrc5’s Walker A motif [96]. While orc5-70 causes temperature-sensitive growth defects, it causes no detectable defects in origin activation per se, in contrast to the temperature-sensitive orc5-1 allele [11, 84, 97]. However, an increase in origin number per chromosome exacerbates the elevated GCR caused by orc5-70, suggesting that whatever defect in ScORC this allele causes is related in some way to ScORC behavior at origins.

To examine the role of ATP hydrolysis by ScORC, ,which occurs through the ScOrc1 subunit, a similar approach was taken. One type of allele was identified in ORC1 through a dominant negative overexpression screen that assessed the effect of mutations in the region encoding the Orc1 Walker B motif predicted to be important for ATP hydrolysis [98]. Two alleles were isolated that affect the same amino acid, orc1-d1(D569Y) and orc1-d2(D569F). The goal of this screen was to isolate a “clean” defective version of ScORC in which Orc1 ATPase was nonfunctional but ATP binding was unaffected. While this goal was not achieved—an ScORC-d1 mutant complex has a 10-fold reduction in the Km for ATP and a 16-fold defect in kcat for ATPase activity—the analyses of these mutants provide important insights into ORC function. Interestingly the orc1-d alleles support viability and complement an orc1Δ mutation, indicating that a substantial reduction in intrinsic ATPase activity of ScORC is tolerated in vivo. Consistent with this observation, under saturating levels of ATP, a mutant ScORC-d1 binds yeast origin DNA indistinguishably from wild-type ScORC. Thus the fundamental role for ORC in origin binding appears unperturbed in vivo, indicating that MCM complexes must be loaded onto origin DNA by ScORC-d mutants. However, overproduction of an orc1-d subunit together with overproduction of each of the other Orc subunits produces a dominant negative growth phenotype. Thus an excess of ORC-d1 complexes is dominant negative over wild-type ORC. (Overexpression of orcd-1 alone is not dominant negative because an ORC-d1 complex can provide enough ATPase activity to support viability—only when excess ORC-d1 is made, such that it accumulates in the nucleus is a dominant negative effect observed.) However, this dominant negative effect of ORC-d1 can be bypassed by overexpressing CDC6. The interpretation is that ORC-d1, essentially acting as an excess of ORC in the ATP-bound state, is titrating limiting levels of Cdc6 away from any ORC (wild type or ORC-d1) bound to origins. Cdc6 has the highest affinity for the ORC-DNA-ATP complex [18]. Thus this observation is consistent with a model whereby ORC’s ATPase is required to release ORC-Cdc6 from DNA and possibly each other, thus recycling the proteins for further rounds of MCM complex loading [79, 99].

While the Cdc6 overexpression rescue of the dominant negative effect of ScORC-d1 makes sense, an unanswered question is why ScORC-d1 or ScORCd2, when expressed at normal levels and as the sole source of ORC, provides for normal levels of cell growth. At least two different models may explain this observation. The first is that, in the context of Cdc6, origin DNA, and MCM-Cdt1, the ORC-d1 (or -d2)-Cdc6 complex has no substantial defect in whatever level of ATPase activity is required by this complex to participate in a normal MCM complex loading reaction. Related to this point, it must be acknowledged that the requirement and roles of ATP hydrolysis by the ORC-Cdc6 complex for MCM complex loading remain incompletely understood. Indeed, recent reports show that, in a purified system, the Cdc6- ATPase is not required for loading a replication-competent MCM complex in vitro, even though the Cdc6-ATPase is required for viability and MCM complex loading in a crude extract [89, 90, 100]. Nevertheless, this first model posits that, in the biologically relevant context, ORC-d1 or ORC-d2 has little to no defect in ORC ATPase activity despite showing weak intrinsic ATPase activities as isolated complexes in vitro. A second possibility is that, while MCM complex loading at origins can still occur in cells in which ORC-d1 or ORC-d2 is the only source of ORC, the MCM loading cycle may be substantially less efficient. However, perhaps in yeast cells, as in mammalian cells, levels of loaded MCM complexes (i.e., numbers of potential origins) are in excess of what is needed to complete an unchallenged S-phase [101103].

A second mutant of version ORC defective in ATPase activity is called ORC-4R to signify that the gene encoding the Orc4 subunit contains a missense mutation in the codon for arginine 267 (i.e., orc4-R267A, orc4-R267K, orc4-R267E), the arginine “finger” discussed above needed for Orc1’s ATPase activity [79]. In contrast to the orc1-d alleles, orc4-R alleles do not support yeast viability. In addition, in vitro ORC-4R complexes show no ORC ATPase activity but can bind ATP similarly to wild-type ORC, indicating Orc1’s ability to bind ATP remains intact. As predicted by this result, ORC-4R binds origin DNA similarly to WT ORC both in vivo and in vitro. However, ORC-4R reduces MCM complex loading efficiency in an in vitro MCM complex loading extract in which wild-type ORC is depleted and replaced with recombinant ORC-4R. Interestingly, in vitro, MCM complex loading, as measured by salt-resistant origin DNA-associated MCM, is not abolished by ORC4-R, only reduced. These and other data lead to a model that posits that ATP hydrolysis by ORC is required for multiple rounds of MCM complex loading reactions onto DNA in vitro, but is not essential for loading per se. These data also support a model in which the ORC ATPase activity functions to recycle the ORC-Cdc6 loading machine and, perhaps, also release the loaded MCM complex for “activation competence” in S-phase, as discussed above. With respect to these ideas, it would be useful to know the cell cycle arrest point of orc-4R mutants, and whether ORC and/or MCM are associated with origins at this arrest stage. ORC4-R may remain “stuck” at origins, or it may leave, but load inadequate levels of MCM complexes to support replication. Suppressors of orc-4R lethality might also be informative, though not necessarily trivial (or even possible) to obtain.

In terms of ATP binding and ATPase activity, metazoan ORC, as represented by studies of DmORC, shows strong similarities to ScORC even though DmORC DNA binding shows considerably lower preference for any particular DNA sequence [21, 104, 105]. In particular, both tight DNA binding and ATPase activity of ORC depend on DmOrc1 [21]. Furthermore, chromatin binding by ORC as well as DNA replication in a cell-free extract depends on an intact Orc1, i.e., an Orc1 containing wild type and functional Walker A and Walker B motifs. Analogous defects in Orc4 or Orc5 have only minimal effects. In substantial contrast to ScORC, however, DmOrc6 is essential for DNA and chromatin binding by ORC. In general, however, there are strong functional similarities between DmORC and ScORC in terms of ATP binding and hydrolysis.

Studies of human ORC ( HsORC), ), however, suggest that, while mechanistic aspects of Orc1-ATP binding and hydrolysis are conserved once ORC is bound to an origin and presumably participating in the MCM complex loading reaction, differences in terms of regulation of HsORC are apparent [22, 64, 65]. As with ScORC and DmORC, Orc1 is required to support ATP hydrolysis of HsORC. However, in contrast to DmORC, recombinant HsORC requires the Walker A motifs of Orc1, Orc4, and Orc5 to associate with chromatin and support replication in a cell-free extract derived from Xenopus laevis [22]. A partial explanation for these somewhat unexpected observations may be provided by the subsequent studies of recombinant HsORC that show its assembly into a stable complex requires ATP binding by the Orc4 and Orc5 subunits and follows an ordered pathway [64, 65]. In particular, HsOrc4 binds to a stable subcomplex of HsOrc2-3-5 only in the presence of ATP, and intact Walker A motifs in Orc4 and Orc5 are necessary. Moreover, Orc1 association with an Orc2-5 subcomplex also requires ATP, though Orc1 does not interact with Orc4 on its own, regardless of ATP. Thus a model is proposed whereby ATP binding by the Orc4 and Orc5 subunits controls an ordered assembly of HsORC, with Orc1 joining the complex last in G1-phase to allow ORC to participate in the MCM complex loading reaction. This assembly process may proceed in reverse during S-phase, after an origin is activated, and be important for the S-phase mediated degradation of Orc1 in human cells and the inactivation of ORC activity until the following cell division is complete and the cells re-enter G1-phase [106, 107].

Regulation of ORC Activity

The eukaryotic cell division cycle demands a precise coordination between chromosome duplication in S-phase and chromosome segregation in M-phase. In particular, every chromosome has to be duplicated completely and only once during a normal S-phase or chromosome breakage and aneuploidy can result after M-phase. In terms of DNA replication origins, this rule means that any given region of a chromosome, including a region containing a potential origin, is replicated only once during a given cell cycle, regardless of whether the origin-containing region is replicated from the functional activation of that origin or a replication fork emanating from a neighboring origin. In either situation, the origin must be inactivated and made incapable of firing until cell division is complete and the new daughter cells enter their own S-phases. To achieve this goal, multiple mechanisms exist that inhibit the MCM complex loading reaction during S-phase and promote it only during G1-phase. Thus, the proteins required for the MCM complex loading reaction, including ORC, Cdc6, Cdt1, and MCM itself, are prime targets for these inhibitory mechanisms. While the specific mechanisms vary between organisms, and the pathways emphasized for inhibition vary as well, the ultimate result is the same—the MCM complex loading reaction is strictly confined to G1-phase. A complete discussion of this topic is beyond this chapter’s scope, but ORC activity, because of its central role in the MCM complex loading reaction, is one relevant target and mechanisms for its regulation, some of which are mentioned above, are briefly summarized here.

As mentioned above, ScORC binds origins throughout most of the cell cycle, though recently a class of origins was discovered that lacks robust ORC association in G2-phase [108110]. Nevertheless, ScORC does not appear to be regulated in general or substantially at the level of ORC-origin binding during the cell cycle, at least compared to metazoans. Instead, as cells enter S-phase, ScORC is phosphorylated by S-phase cyclin-dependent kinases (CDKs) on the Orc2 and Orc6 subunits [76]. In addition, one S-phase cyclin, Clb5, binds directly to an RXL motif in Orc6 [111]. While phosphorylation does not appear to reduce ORC-origin binding, it inhibits ScORC’s function in the MCM complex loading reaction [61, 77]. Thus while S-phase CDKs activate targets needed for origin activation during S-phase, they simultaneously phosphorylate and inhibit targets that function in the MCM complex loading reaction.

Evidence from studies of DmORC suggests both similarities and differences in terms of ORC regulation by CDKs. In particular DmORC is also a target CDK-dependent phosphorylation, though the target subunits are Orc1 and Orc2. In vitro studies show that DmORC’s ATPase activity but not ATP-binding affinity is reduced by this phosphorylation. Moreover, ATP-dependent ORC-DNA binding is also inhibited, suggesting a mechanism different from that used by ScORC. Finally, as mentioned above, HsORC activity in MCM complex loading is regulated by the degradation of Orc1 during S-phase [106]. Thus the assembly of a complete HsORC functional for MCM complex loading is confined to G1-phase by this mechanism and possibly ATP-dependent ordered assembly of the remainder of the complex as well [64, 65].

Selecting Chromosomal Origins

While ORC binding is not sufficient to establish an origin, it is obviously an essential step. The mechanisms that recruit ORC to specific chromosomal regions and their relationship to the catalytic role of ORC in MCM complex loading remain incompletely understood. For example, while it is reasonable to think that key aspects of the ISM and WH domain contacts with origin DNA have fundamental catalytic roles and therefore are similar at all origins, the mechanisms responsible for localizing ORC to a chromosomal region where it has the opportunity to make such contacts in the first place appear to vary considerably between organisms and even between origins within a single cell. This variability creates substantial challenges to understanding precisely how ORC selects origins across species, and how these selection mechanisms might affect origin activity. However, this variability may also create opportunities for achieving cell-type-specific modulation of ORC function as well as increasing our basic understanding of the relationships between chromatin structure and origin activity.

Because ScORC shows binding specificity for a conserved sequence element found within yeast origins, origin selection is relatively well understood in S. cerevisiae. Indeed, an ORC binding site can be identified in most yeast origins [3]. However, even in S. cerevisiae, an ORC binding site is not sufficient to identify a yeast origin. For example, a tight ORC binding site in vitro does not necessarily predict an efficient origin in vivo [57, 112]. In fact, a large number of budding yeast origins actually have surprisingly weak ORC binding sites such that the intrinsic strength of the ORC-DNA interaction at these origins is insufficient to explain the strength of ORC-origin binding in vivo [57]. In addition, a number of highly efficient ORC binding sites can be found in the yeast genome that do not function as origins or even bind ORC in vivo [57]. Thus, while ORC binding site specificity certainly aids in mechanistic studies of yeast origins, there is as yet no clear understanding of ORC-DNA contacts critical for ORC’s catalytic roles versus contacts used “simply” to localize or stabilize ORC to a particular chromosomal region. In budding yeast this issue is challenging because the same DNA element that localizes ORC to origins appears to have catalytic functions as well.

Moreover, accumulating evidence in budding yeast suggests that, as in metazoans, ORC accessory factors, in the form of specific chromatin structures and/or other DNA binding proteins, may have direct and positive roles in promoting selective ScORC-origin binding. In fact, the best predictor of a functional ScORC binding site in vivo is not the closest sequence match to a consensus ORC site but rather an ORC binding site contained within a defined nucleosome configuration in which the predicted ORC site is relatively free of nucleosomes [55]. As mentioned above, the ScOrc1BAH domain, a putative nucleosome-binding module, is needed for wild-type levels of ScORC-origin binding at most yeast origins, and origins that are particularly dependent on the Orc1BAH domain for ScORC binding are enriched for a distinctive local nucleosome configuration that places two nucleosomes closer to the Orc1 subunit side of ORC [56]. Recently, another distinct Orc1BAH-independent group of yeast origins have been identified that likely rely on the presence of additional proteins, including chromatin, to either properly remodel otherwise weak ORC sites or to provide for direct ORC-protein interactions [57]. While the specific ORC-accessory interactions that promote ORC-origin binding at these yeast origins have not been defined, a recent study demonstrates a physical interaction between an evolutionarily conserved transcription factor, a member of the forkhead transcription factor family conserved from yeast to metazoans, named Fkh1 and ScORC [113]. In addition, forkhead protein binding sites are enriched near many S. cerevisiae DNA replication origins and contribute to their normal early activation during S-phase [113, 114]. Therefore, while sequence specificity of ScORC-double-stranded origin DNA interactions still makes it exceptional among eukaryotes, accumulating evidence makes it clear that, as for ORCs in other organisms, accessory factors, including nucleosomes, play a role in promoting functional ORC-origin binding.

In metazoans, including human cells and excluding the rapidly dividing cells of early embryos, the guiding paradigm for origin selection by ORC is that ORC-origin binding specificity is achieved by ORC accessory factors, including specific modification states of chromatin that directly bind regions of ORC as discussed above (e.g., K3K20me2 binding by Orc1BAH). Consistent with this model, ORC interacting proteins that have a key role in recruiting HsORC to origins have been identified. HMGA1a, a high-mobility group chromatin-associated protein, interacts directly with HsORC and, when tethered to a heterologous DNA binding domain, can recruit ORC to a heterologous DNA region and establish a functional origin [115]. Thus the HMGA1a-ORC contacts govern the specificity of ORC-origin binding, but establishing the functional MCM complex loading site has little, if any, requirement for specific ORC-DNA contacts. The interaction between HMGA1a and ORC is robust within AT-rich heterochromatin, consistent with a proposal that such ORC-protein interactions may be particularly critical in difficult-to-replicate heterochromatin, an abundant component of metazoan genomes [116]. ORCA/LRWD1 (ORC associated/leucine rich repeats and WD containing 1) is a more recently described ORC accessory factor with similar roles to HMGA1a in recruiting ORC to DNA [117, 118]. Again, the most abundant co-localization between ORCA and ORC occurs within heterochromatic regions. ORCA also interacts with other regulators of MCM complex loading such as Cdt1. Depletion experiments in embryonic stem cells and human primary cells show that ORCA is required for normal levels of chromatin-associated MCM. A recent review discusses many other ORC accessory factors that have been described in metazoans, several of which appear to function after the ORC-origin binding step and facilitate the MCM complex loading reaction [119]. What is clear from these studies is that metazoan ORC relies on a number of accessory factors to perform its role in establishing origins in complex genomes. The relatively stringent requirement for a specific sequence for ORC-origin binding in the small compact genome of budding yeast may have evolved to insure origins would not be disrupted by or interfere with gene transcription, though even in budding yeast robust gene transcription makes some origins more sensitive to defects in MCM complex loading components compared to others [120]. Regardless, in larger genomes with substantial gene-free regions, there is an increased opportunity for ORC-origin interactions that may have relieved some of the evolutionary pressure for sequence-specific binding by ORC. Despite differences between ORC-origin binding in budding yeast and metazoans, it is worth noting that ORC binding regions in chromatin occur within chromatin structures that appear, at least in terms of local nucleosome organization, similar to the structure described for yeast origins [55, 121123].

An important question is whether these varied ORC-origin selection mechanisms have any impact on ORC’s catalytic role in loading MCM complexes onto chromosomal DNA and/or other steps relevant to origin activation. Interestingly, independent studies in S. pombe and S. cerevisiae indicate a link between ORC-origin binding dynamics and origin activation time during S-phase, even though no functional role for ORC in origin activation during S-phase is known or proposed [57, 110, 124]. The multiple origins that replicate eukaryotic chromosomes are not activated simultaneously during S-phase, but rather individual origins each activate with a distinct probability and at a characteristic point during S-phase, with some origins activating soon after S-phase begins and others activating later after much of the genome has been duplicated. In S. pombe, origins that bind to ORC early after M-phase, presumably because they have a higher affinity for ORC, also show earlier MCM recruitment and are activated earlier in the subsequent S-phase compared to origins that bind ORC later [124]. One interpretation is that an origin with high affinity for ORC increases the probability of a successful MCM complex loading event occurring, thus allowing for multiple MCM complexes to be loaded at such an origin in a given G1-phase. Because the MCM complex is the target for limiting S-phase activators, origins that have received more MCM complexes during G1-phase can more effectively compete for these limiting activators and hence activate soon after S-phase begins [125, 126]. In S. cerevisiae, a recent study provides a complimentary correlative observation [110, 127, 128]. In particular, while budding yeast origins are thought to be bound by ScORC throughout the cell cycle, a recent study used a more sensitive measure for ORC binding in vivo than used in prior genome-wide studies to identify a subset of yeast origins that fail to associate with ORC immediately following the completion of replication in G2-phase. The origins that show this delay in ORC binding are enriched for origins that activate relatively late during S-phase. Of course, other models that do not invoke reiterative loading of MCM complexes are reasonable. For example, whatever chromatin structure promotes early ORC binding to an origin may also promote access of that origin to limiting S-phase factors, and the correlation between ORC binding and activation time may merely reflect a “shared response” to the same chromatin cues. However, some experimental evidence argues against this interpretation of the correlative data [124].

Another series of studies focused on ScORC-origin binding interactions may seem to contradict these findings at first glance, but on closer consideration all of these data may be reconciled [57, 112]. In these studies, yeast origins that bind ORC tightly because they possess an ORC DNA binding site with a high-intrinsic affinity for ScORC, as measured in vitro, are enriched for later activating origins, a result that is the opposite of that described for SpORC above. However, it is important to recall that SpORC selects origins via an AT-hook on the Orc4 subunit acting essentially as an ORC-origin tether. Therefore, the SpORC catalytic DNA contacts, presumably mediated by the ISM and WH domains conserved in all ORCs, are distinct from the contacts SpORC uses to select an origin site (more accurately thought of as selecting an MCM complex loading site). Based on the role of accessory ORC factors in ORC-origin binding in metazoans as discussed above, it seems likely that a similar division of labor occurs for metazoan ORC, except that the ORC-origin tethering may be achieved through ORC-protein or ORC-chromatin contacts as opposed to an AT-hook-DNA interaction. In contrast to these situations, ScORC, at least at these high-affinity DNA-dependent origins, appears to rely on the same DNA element for both origin selection and for ORC’s catalytic role in MCM complex loading. Thus for ScORC an intrinsically tight ORC binding site may inhibit the kind of ORC-DNA dynamics required for efficient MCM complex loading (i.e., efficient binding and release). In support of this interpretation, budding yeast origins that also bind ScORC tightly in vivo but instead show weak intrinsic ORC-origin DNA interactions are enriched for origins that are activated early and efficiently in S-phase. These origins are referred to as “chromatin-dependent” because they use factor(s) extrinsic to the ORC binding site, such as ORC-chromatin contacts, to “tether” ORC to the origin. Thus at these origins ScORC is positioned to have a dynamic relationship with the DNA that controls its catalytic function, similarly to how the SpOrc4 AT-hook positions SpORC. The bottom line is that ORC may require a dynamic interaction with its catalytically relevant DNA contacts (i.e., it must bind and release and rebind efficiently) to either maximize the rate of an MCM complex loading cycle in G1-phase, as suggested by biochemical data or, alternatively, “release” a loaded MCM complex for activation during S-phase [99]. An ORC-origin tethering mechanism distinct from catalytically important ORC-DNA interactions may facilitate such dynamics.

While the mechanisms that control origin activation timing are mechanistically interesting and relevant to genome stability and cell differentiation in their own right, several studies now provide evidence that the number of MCM complexes loaded onto a chromosome directly reflects the number of potential origins on that chromosome, and these potential origins are essential for chromosomal stability under conditions of replicative stress, including the levels of stress encountered during the typical lifespan of a multicellular organism [101103, 129133]. Thus specific ORC-origin selection mechanisms may facilitate distinct ORC-DNA dynamics that in turn establish the rate for an MCM complex loading cycle within a particular chromosomal region. An efficient cycle would help establish the “excess” number of potential origins, in the form of loaded and competent MCM complexes, which appears so critical to the stable inheritance of eukaryotic genomes.