Jump to: Page Content, Section Navigation, Site Navigation, Site Search, Account Information, or Site Tools.
|
|
ReportsStructural Basis of DNA Replication Origin Recognition by an ORC Protein
DNA replication in archaea and in eukaryotes share many similarities. We report the structure of an archaeal origin recognition complex protein, ORC1, bound to an origin recognition box, a DNA sequence that is found in multiple copies at replication origins. DNA binding is mediated principally by a C-terminal winged helix domain that inserts deeply into the major and minor grooves, widening them both. However, additionalDNA contacts are made with the N-terminalAAA+ domain, which inserts into the minor groove at a characteristic G-rich sequence, inducing a 35° bend in the duplex and providing directionality to the binding site. Both contact regions also induce substantial unwinding of the DNA. The structure provides insight into the initial step in assembly of a replication origin and recruitment of minichromosome maintenance (MCM) helicase to that origin.
Cancer Research UK Clare Hall Laboratories, London Research Institute, Blanche Lane, South Mimms, Potters Bar, Herts EN6 3LD, UK.
* To whom correspondence should be addressed. E-mail: Dale.Wigley{at}cancer.org.uk Archaeal DNA replication and repair processes share closer similarity to those in eukaryotes than to those in eubacteria (1), albeit with fewer proteins and hence complexity. Furthermore, whereas some archaea have a single replication origin (2), others have multiple origins, more like the situation in eukaryotes (3, 4). Archaeal replication origins are recognized by proteins with homology to eukaryotic origin recognition complex (ORC) and Cdc6 proteins (often annotated as ORC/Cdc6 but referred to here as ORC). The number of ORC proteins varies between archaea but is usually one or two proteins, although it can be as many as 14(4). Crystal structures of two archaeal ORC proteins have been determined (5, 6). The proteins comprise two domains: an AAA+ domain (7) and a C-terminal domain of the winged helix (WH) family, a structural motif commonly used to bind to specific nucleic acid sequences (8). The isolated AAA+ domain retains adenosine triphosphatase (ATPase) activity, and the WH domain binds to DNA (6, 9). Analysis of archaeal genome sequences revealed a series of short (13 bp) conserved repeats located close to ORC genes that were proposed to be a signature for archaeal replication origins (10), which was confirmed experimentally in Pyrococcus abysii (11, 12). The later study revealed that two longer repeats were located on either side of an AT-rich region named a duplex unwinding element (DUE). Similar repeats flank a DUE in a region of chromosomal DNA from Halobacterium that conveys autonomous replication to plasmids in that species (13). These extended repeat sequences were the origin recognition box (ORB) elements later identified at a replication origin in Sulfolobus solfataricus (14). The ORC1 protein of S. solfataricus footprints at these ORB elements in both P. abyssi and Halobacterium, demonstrating their conservation across species. Curiously, not all of the origins in Sulfolobus contain full-length ORB elements but instead have shorter sequences called mini-ORBs (14), also found at a proposed origin in Methanobacterium thermoautotrophicum (10). The situation in Sulfolobus is complicated further because the three origins are quite different from one another and they all contain binding sites for multiple ORC proteins (14, 15). Using the sequence of the conserved ORB elements, Robinson et al. proposed the location of a replication origin for A. pernix (14), which was later confirmed (16). At this origin (Ori1), there are four ORB elements arranged in pairs on either side of a DUE (Fig. 1A). Only one of the two ORC proteins in A. pernix (ORC1) binds at the origin. Consequently, A. pernix Ori1 can be considered an archetype for archaeal replication origins such as the origin in Pyrococcus (12), the oriC1 origin of Sulfolobus (14), at least one of the origins in Halobacterium (13), and the five origins identified in Haloferax volcanii (4), all of which contain full-length ORB elements. Similarities with the eukaryotic system also give insight into conserved elements of eukaryotic origin recognition.
To gain an understanding of replication origin assembly, we determined the crystal structure of the A. pernix ORC1 protein complexed with a 22–base pair (bp) canonical ORB element (Fig. 1B). The structure of ORC1 is similar to that of other archaeal ORC proteins (5, 6) comprising AAA+ and WH domains. In common with both previous archaeal ORC protein structures, adenosine diphosphate (ADP) co-purifies with the protein (5, 6). The main difference between ORC1 and ORC2 is the region connecting the AAA+ and WH domains. In ORC2, the flexibility of this region allows the protein to adopt several conformations (6). By contrast, the linker in ORC1 is more rigid, a structural difference that is a key element in the interaction between ORC1 and DNA.
The principal contact between ORC1 and its DNA target is with the WH domain, as predicted by crystal structures (5, 6) and biochemical data (6, 9, 16–18). The WH domain interacts with the ORB sequence in a canonical mode (Fig. 1), with the In addition to the recognition helix, the wing plays an important part in the interaction between ORC1 and DNA. The wing is longer than that seen in most WH domains, resulting in an extended contact region that spans five base pairs, including the TCC motif (Fig. 1). The wing inserts deeply into the minor groove, causing it to widen by over 5 Å, and direct contacts are made with the DNA bases. This feature is unusual because the wing in other WH domains merely contacts the phosphodiester backbone over the minor groove. A consequence of this widening of the minor groove is that the protein induces substantial unwinding of the duplex at the binding site (Fig. 2).
The symmetry of WH domain binding-site sequences commonly permits binding of two proteins (8). Mutations in the ORB dyad sequence abolish binding of Sulfolobus ORC1 to ORB elements (14). However, the inverted repeat is located at one end of the conserved ORB element rather than at its center, and there is an additional G-rich sequence flanking it. The crystal structure explains this puzzling feature.
Unexpectedly, there is also a substantial contact between the AAA+ domain and the DNA. A short loop at the end of an Although our crystallization conditions contained a ratio of ORC1:DNA of 2:1, our DNA substrate only had a single ORC1 molecule bound. ORC1 forms dimers at higher protein concentrations, and dimerization requires the AAA+ domain (16). Although full-length ORC1 protein is poorly soluble when not bound to DNA, we used the more soluble WH domain to evaluate the stoichiometry of binding at ORB elements using isothermal calorimetry. These data revealed that a single WH domain binds to an ORB element (Fig. 3A), which can be explained by the crystal structure. In canonical WH domain dimers, the wing does not insert into the minor groove. However, in ORC1, insertion of the wing into the minor groove causes it to widen by 5 Å and the associated major groove to narrow by up to 2 Å. Similarly, binding of the helix widens the adjacent major groove by 2 Å. Hence, binding of a symmetric pair at the site requires the major groove to be 4 Å wider than observed, and the recognition helix of a second WH domain cannot be accommodated at the site. Similarly, the associated minor groove would need to widen by over 3 Å to accommodate the wing.
Given the paucity of sequence-specific contacts, particularly those that are not palindromic within the ORB4 sequence, it is unclear how the protein distinguishes sufficiently between the two possible modes of binding. However, of the two possible binding orientations, one is favored by the interaction of the AAA+ domain with the G-rich sequence, providing a directionality of ORC1 binding at an ORB site. This is important because the WH domain of ORC1 interacts with the minichromosome maintenance (MCM) helicase (19). Most archaeal origins characterized to date have a pair of inverted ORB elements placed on either side of a DUE in an orientation that would place the WH domains, and hence also the MCM complexes, facing the DUE (4, 12–14, 16), which in A. pernix is the location of the replication start site (16). Consequently, the G-rich sequence in a full-length ORB element controls the arrangement of ORC1 proteins in an orientation appropriate to interact with MCM helicase during replication initiation (Fig. 4).
To evaluate the contacts observed in our crystal structure, we used DNA footprinting for ORC1 protein and the WH domain alone (Fig. 3). At the 5' end of the top strand, contacts with the WH domain near the dyad repeat produce footprints that are identical in both cases. The 3' end of the ORC1 footprint extends four bases further than that of the WH domain alone because of contacts with the AAA+ domain. On the bottom strand, there is a small region of sensitivity at the center of the footprint. The structure shows that this strand is protected except for the two unprotected bases that are located between the two contact regions made by the WH domain (Fig. 3). The 3' ends of both footprints are identical but differ at the 5' end because of contacts made by the AAA+ domain, extending the footprint by two bases in agreement with the structure. Consequently, the footprints are consistent with the contacts that we observe in the structure, with a single ORC1 molecule binding at an ORB element. Although we only saw a single ORC1 molecule binding at an ORB, once the initial binding events have taken place then a higher order assembly process begins, the culmination of which is unwinding of the DUE (16). The nature of these later events remains unclear but could involve binding of additional ORC1 molecules combined with structural changes in the origin DNA itself. Whatever these changes may be, they likely involve communication between ORC1 monomers, and this may be manifest as the dimer that we saw for the unbound protein at high protein concentrations (16). Consequently, although the structure we present here enhances our understanding of the initial stage in this complex process, further work will be required to uncover the structural changes that take place as an origin assembles.
Supporting Online Materialwww.sciencemag.org/cgi/content/full/317/5842/1213/DC1 Fig. S1 Table S1
Received for publication 11 April 2007. Accepted for publication 26 June 2007.
The editors suggest the following Related Resources on Science sites:In Science Magazine
THIS ARTICLE HAS BEEN CITED BY OTHER ARTICLES:
|
Science. ISSN 0036-8075 (print), 1095-9203 (online)