30
Modeling DNA Replication Intermediates Debjani Roy*, Brian E. Hingerty*, Robert Shapiro# and Suse Broyde*a =X- *13iology Department, New York University, New York, NY 10003-5181 $Health Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831-8077 #chemistry Department, New York University, New York, NY 10003-5180 ORNL/CP-101136 a Corresponding Author: (212) 998-8231, Fax (2 12) 995-4015, E-mail [email protected] The submitted manuscript has been authored by a contractor of the U. S. Government under contract number DE-AC05-960R22464. Accordingly, the U.S. Government retains a non-exclusive royalty free license to publish or reproduce the published form of this contribution, or allow others to do so for U. S. Government purposes.

Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

Modeling DNA Replication Intermediates

Debjani Roy*, Brian E. Hingerty*, Robert Shapiro# and Suse Broyde*a

=X-

*13iology Department, New York University,New York, NY 10003-5181

$Health Sciences Division, Oak Ridge National Laboratory,Oak Ridge, TN 37831-8077

#chemistry Department, New York University,

New York, NY 10003-5180

ORNL/CP-101136

a Corresponding Author: (212) 998-8231, Fax (2 12) 995-4015, E-mail [email protected]

The submitted manuscript has been authored by a contractor of the U. S. Government under contract number DE-AC05-960R22464.Accordingly, the U.S. Government retains a non-exclusive royalty free license to publish or reproduce the published form of this contribution,or allow others to do so for U. S. Government purposes.

Page 2: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

DISCLAIMER

This report was prepared as an account of work sponsoredbyanagency of the United States Government. Neither theUnited States Government nor any agency thereof, nor anyof their employees, make any warranty, express or implied,or assumes any legal liability or responsibility for theaccuracy, completeness, or usefulness of any information,apparatus, product, or process disclosed, or represents thatits use would not infringe privately owned rights. Referenceherein to any specific commercial product, process, orservice by trade name, trademark, manufacturer, orotherwise does not necessarily constitute or imply itsendorsement, recommendation, or favoring by the UnitedStates Government or any agency thereof. The views andopinions of authors expressed herein do not necessarilystate or reflect those of the United States Government orany agency thereof.

Page 3: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

DISCLAIMER

Portions of this document may be illegiblein electronic image products. Images areproduced from the best available originaldocument.

Page 4: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

b ABSTR4CT

While there is now available a great deal of itiorrnation on double stranded DNA horn

X-ray crystallography, high resolution NMR and computer modeling, very little is known about

structures that are representative of the DNA core of replication intermediates. DNA replication

occurs at a single strand/double strand junction and bulged out intermediates near the junction

can lead to frameshifl mutations. The single stranded domains are particularly challenging. Our

interest is focused on strategies for modeling the DNA of these types of replication

intermediates. Modeling such structures presents special problems in addressing the multiple

minimum problem and in treating the electrostatic component of the force field. We are testing a

number of search strategies for locating low energy structures of these types and we are also

investigating two different distance dependent dielectric functions in the coulombic term of the

force field. We are studying both unmodified DNA and DNA darnaged by aromatic arnines,

carcinogens present in the environment in tobacco smoke, barbecued meats and automobile

exhaust. The nature of the structure adopted by the carcinogen modified DNA at the replication

fork plays a key role in determining whether the carcinogen will cause a mutation during

replication that can initiate the carcinogenic process. In the present work results are presented

for unmodified DNA.

1

Page 5: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

.

INTRODUCTION

DNA replication takes place at a fork containing DNA single strand-double strand

junctions, together with polymerase and a number of other proteins. Little is known about the

confirmational preferences of the DNA in such structures. In this work, we are modeling

intermediates representing the DNA core of the replication structures present at one arm of the

replication fork.

It has long been proposed that slippage mechanisms are responsible for frameshill

mutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence

5’G1GzC~GdC~Cb3’ is a NarI restriction enzyme recognition site (4,5) and contains three G’s

(G1G&JG4C~CJ. The sequence is a mutational hotspot for -2 deletions induced by chemical

carcinogens, notably 2-(acetyl) aminofluorene (WF). Covalent modification by &4.F induces -2

frameshift mutations only when the modification is at GJ (6) even though each of the G’s shows

equal reactivity towards the reactive precursor.

A specific slippage mechanism has been proposed to explain these double base

frameshifts at the NarI sequence (7). According to this model the GC double base deletion

results from a slippage mechanism during replication (8). In this model the replication apparatus

first incorporates a cytosine opposite the modified guanine. Then template-primer slippage

occurs and a misaligned intermediate forms wherein the two terminal bases in the primer (3 ‘-CG-

5’) hydrogen bond with a repeated downstream complementary 5’-GC-3’ dinucleotide in the

template. Continued synthesis from thk intermediate now containing an unpaired GC

dinucleotide leads to a complementary strand two bases shorter than the template strand. This

model predicts that -2 deletion mutagenesis should occur at the G~ but not the G1 or G~ positions,

because, given the sequence at the NarI site, only the former would allow formation of a

misaligned intermediate stabilized by two correct terminal base pairs.

Timsit et al. (9) have solved the crystal structure of a DNA dodecamer duplex containing

the NarI sequence. They found a novel and intriguing structural theme in the central NarI

sequence portion of the 12-mer which suggested an intrinsic propensity for the NarI Gdhotspot to

undergo slippage. Specifically, the G1.C1, base pair is disrupted; the partner Cl, is unstacked and

the pair is highly propeller twisted. In qddhion, the neighboring C~ .GIOis highly propeller

2

Page 6: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

twisted and C~ also participates in a three centered hydrogen bond to Gg (from N4 of C~ to 06 of

G9).

The objective of the present work is to.model the DNA of replication intermediates using

the NarI sequence in unmodified DNA. Little experimentation has been done on the structures of

these intermediates, and how to search for plausible models computationally is a challenging

problem. In the present study we present results of an effort to model computationally such

intermediates, including tests of two different dielectric functions in the electrostatic component

of the force field. We are currently exploring various treatments for this function.

We studied the following intermediates which model a fragment of DNA undergoing

replication at one arm of a replication f~rk.

.x-

Intermediate I. 5’ G1 Gz C~ G,C~ C, A,3’

3’ G,0G,T85’

In Intermediate I replication has proceeded to the base 3‘ to the Gd hotspot.

Intermediate 11. 5’ G, G2 C~ G, C5 C, A, 3’

3’,

In Intermediate II the replication machinery incorporates the correct base cytosine opposite to the

G, hotspot, which may lead to a misaligned intermediate, shown as Intermediate III.

Intermediate 111. G, C,5’ G1 Gz Cq CG A,3’

3’ C,, G,O & T*5’

In Intermediate III a misaligned intermediate has formed where the terminal two bases of the

primer (3’ C11G105’) hydrogen bond with a repeated downstream complementary 5’G2 Cq3’ in the

template, from which extension gives rise to the double-base frarneshift mutation

(3’C,1G,0G,T85’).

3

Page 7: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

METHODS

We employ the torsion angle space molecular mechanics program DUPLEX (1O) to

compute energy minimized structures of the replication intermediates.

A feature of DUPLEX that played a major role in the current work is the hydrogen bond

penalty function which provided an important tool for searching for the different types of

structures. This penalty fimction or pseudopotential energy, F, is added to the energy, and is

employed to locate minimum energy conformations of any designated hydrogen bonding pattern,

or a denatured site if the function is not employed at a given position. The penalty function has

the form.

F= Wi~~[(d-dO)2 +(l+cos(z))’+~)].,-

It can be applied to a chosen Donor-Hydrogen-Acceptor pair in Watson-Crick, Hoogsteen or

any other type of selected hydrogen bonding scheme. If the fiction is not applied to a given

base pair, a denatured site can result. W is an adjustable weight and values of W are in the range

of 5-50 (kcal/mole-A2) depending on n, the number of targeted hydrogen bonds. d is the value of

the current Donor-Acceptor distance and dOis the value of the ideal target distance. The current

angle around the hydrogen atom, the Donor-Hydrogen-Acceptor angle is ~ . p = \ c1 - C212where

c1 is a unit vector perpendicular to the plane of one base and Cz is a unit vector perpendicular to

the plane of its partner. In an ideal hydrogen bond, d equals dO,~ = 180°, c1 is parallel to c1, and

F, which ‘is summed over all n hydrogen bonds at all residues, equals O. The hydrogen bond ~

penalty function F is employed in the first stage. A second minimization is then carried out in

which the fimction is released so that final structures are unrestrained energy minima.

The search strategy employed was as follows: starting structures were an ideal B form

DNA (11) except that both anti (~=240°) and syn, (%=60°) orientations were employed for the Gd

mutational hotspot. The hydrogen bond penalty fimction was then employed in the following

way: at residues with normal Watson-Crick pairing we employed the fiction to seek out that

type of bonding. In bulged structures the function was employed only for the paired bases, and

the non-paired bases were not targeted for any hydrogen bonding. In Intermediate 11where there

is a C (Cll) opposite to the Go hotspot, five different hydrogen bonding schemes were sought for

the GA.C1lbase pair. Three of them are Watson-Crick (Gt anti), I-Ioogsteen (Gi syn), and Wedge

4

Page 8: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

.

(G, syn) (Figure 1). The other two involved syn G, opposite anti Cl,, and anti G, opposite anti

Cl], with no hydrogen bonding between Gq and Cl,.

The energy minimizations were done in stages. The hydrogen bond penalty function

with weight of 15 was employed in all first stage minimizations to guide in the location of the

selected hydrogen bonding patterns. A second minimization was performed for each resulting

structure from the frost minimization without the hydrogen bond penalty function. Those

structures that employed protonated bases (Hoogsteen and Wedge hydrogen bonded structures,

Figure 1) were deprotonated and minimized again to permit comparison of their energies with

those of the unprotonated conformers (12). The energy minimized structures resulting from each

intermediate were ranked according to energy. It should be emphasized that the hydrogen

bonding scheme which is targeted by the hydrogen bond penalty function may or may not be

achieved during the stages of minimization, and a final unrestrained structure may have. a

different hydrogen bonding pattern than the one that was initially searched for.

DUPLEX employs a consistent force field for nucleic acids based on one devised in the

Olson laboratory (13, 14) which contains the usual Van der Wards, electrostatic and torsional

contributions to the energy, E,~, E~l, EtO,,as well as a number of special terms. These are needed

to properly model sugar conformations(15) and phosphodiester rotational preferences (16). The

original treatment for hydrogen bonding devised by Olson(17) is also employed.

E~l is an especially important term in the force field because electrostatics play a central

role in determining the molecular structure of the DNA. To model the interpenetration of

solvent as an atom pair separates, the dielectric constant is allowed to increase from its value

within DNA (believed to be in the range of 2 to 4 (13)). However, there is considerable

uncertainty as to the form the increasing dielectric function should take with increasing

interatomic distance. In the present work we employ two different treatments. In the first

treatment the E.l term is the standard coulomb potential, modified with an exponential screening

term P (13,14).

E,l = ZZ (332 qi~ e-srij) / s’rij Screened Coulomb’s Law

i<j

~ = ~l/e -Brij

We term this Dielectric Function A.

5

Page 9: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

The choice for the value of ~ in the screening term of the dielectric function allows any selected

concentration of counterion to be represented (13). The term &’/e-Drij produces a distance

dependent dielectric function which increases as rti increases. ~ is assigned a value of 0.1 in the

present work which corresponds to screening by a monovalent salt concentration of about 0.1 M

(18,19). The choice of d, a variable parameter, governs the exact shape of the function. In the

current work we employ a value of 1 (Function A) on the basis of earlier experience in our

laboratory (20). In the second treatment we tested the dielectric function devised by Hingerty et

al. (21):

E(r) =78 - 77( r/2.5 )2 [er’2s/( er’25-1 )2]

-r We term this Dielectric Function B.

The dktance dependence of these two functions is given in Figure 2. In addition, reduced partial

charges on the pendant phosphate oxygens are employed to model charge neutralization by

counterion condensation (22).

For each of the replication intermediates (Intermediates I-III) energy minimization was

carried out using Functions A and B separately. The energy minimized structures obtained from

Function A were subsequently minimized again using Function B, and energy minimized

structures obtained from Function B were subsequently minimized again using Function A. In

addition, in several cases structures containing N-2-aminofluorene (AF) modification obtained

by energy minhnization searches or by computer graphics modeling (23) were used as starting

structures for another set of minimizations following removal of the AF. The total number of

trials for each intermediate is given in Figure 3.

Computations were carried out on a Cray C90 Supercomputer at the National Science

Foundation San Diego Supercomputer Center and on a Cray C90 at the Department of Energy

National Energy Research Supercomputer Center. The computer graphics program INSIGHT II

by Molecular Simulations Inc. was used for visualization.

6

Page 10: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

.

RESULTSIntermediate I: 5’ G, G2 C~ GdC~ CG A, 3’

3’

..*

GIOGgT85’

Results are summarized in Tables 1 and 4. All minima obtained are included in Table 1

without an energy cutoff. The two most important structures for this intermediate (Table 1, #1,

Figure 4A and Table 1, #2, Figure 4B) are very similar. The double stranded region is normally

paired. The unpaired G1 is normal anti and stacked with the adjacent Cj.GIO base pair on its 3’

side; GAis unstacked on its 5’ side. There is a bend between Gi and C~. C~, G2 and Gl are nicely

stacked. tie single stranded region of the template strand interacts with residues of the duplex

region of the primer. The sugar-phosphate between G, and Gz interacts with the Gg and T8.-.

residues. Higher energy structures are summarized in Table 4 and illustrated in FigWe 4.

Intermediate 11: 5’ G1 G2 CJ Gd C~ CGA, 3’

3’

Results are summarized in Tables 2 and 5. Computed structures to 25 kcal/mol are

included in Table 2. The most important structure (Table 2, #l, Figure 5A) has the following

features: All four base pairs in the double stranded region are intact. The G1 hotspot is normal

anti and nicely stacked with the adjacent C~.GIObase pair on its 3‘ side. The C~ is not stacked

with the GOC,l base pair. The Gz and C~ in the single stranded region are nicely stacked with

each other but G, and Gz are not stacked, instead, G1is parallel to C, ~and partly stacked with it.

Higher energy structures are summarized in Table 5 and illustrated in Figure 5.

G, C5Intermediate 111: 5’ G1 G2 C~ Cb A, 3’

. .

3’ C1l GIO “ “Gg T* 5’

Results are summarized in Tables 3 and 6. All minima obtained are included in Table 3

without an energy cutoff.’l%e two most important structures for this intermediate (Table 3, #1,

Figures 6A and Table 3, #2, Figure 6B) have the following features:

7

Page 11: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

G} is only partly stacked with Gz in both structures. Gz is properly paired with its partner

C,l. CJ is properly paired with GIOand the two base pairs are well stacked. In structure #1 Gt is

anti and protrudes into the major groove ; in structure #2 G1 is syn and also protrudes into the

major groove. The unpartnered C~ is stacked with Gg on one face; the other face is mainly

exposed. The Cb.G~and A,.Tg base pairs are normal. Stacking between the Cc.Gg base pair and

A7.T8 base pair is fair in both structures. The helix axis between the two base paired regions is

shifted in both cases, resulting in a bend. Descriptions of a number of higher energy structures

are summarized in Table 6 and illustrated in Figure 6. Figure 7 shows low energy structures of

Intermediates 1, H, and III

-DISCUSSION

Search Strategies: Reasonable sets of starting structures were employed for our energy

minimizations, includlng canonical B forms, structures built by computer graphics modeling and

structures obtained from carcinogen-DNA studies with removal of the carcinogen. In addition

various hydrogen bondhg possibilities were considered for Intermediate II at the GJ hotspot.

The syn as well as anti conformations were considered for Gd in all intermediates. Moreover,

structures found in the search employing Dielectric Function A were used as starting structures in

Dielectric Function B minimizations and vice versa. A total of 44 trials were made for the three

intermediates.

Dielectric Function: A gratifying aspect of our results is that the most important

structures for each category of intermediate were similar irrespective of which dielectric fimction

was employed in the searches, although the energy rankings of similar structures diverged

considerably in the higher energy forms. It is possible that different dielectric treatments might

be more suitable for differing structural types (normal duplexes, single strand/double strand

junctions, bulges) but it will be necessary to have experimental benchmarks in the same sequence

and length context available to permit such discrimination. Moreover other dielectric function

variants may need to be tested (13,24,25,26).

8

Page 12: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

Relevance of Higher Energy Structures: We have included among our results

structures of fairly high energy. We feel that in the presence of relevant proteins including

polymerases, single strand binding proteins, topoisomerases and others in normal replication, and

repair enzymes with modified DNA, structures that are higher energy in their absence could

become favored. Examples of protein induced alterations to DNA structure are accumulating in

the crystallographic literature (27,28). We note that among our structures of Intermediate H , for

example, the structure most plausible for normal replication with normal alignment of the

template is not the lowest energy form but has an energy of about 16 kcal/mol. In addition,

minima for the small fragments treated here may be more or less favored at an authentic

replication fork where the DNA is essentially infinitely long.

. .

Bending. An interesting feature of all the important structures of the intermediates

modeled in this work is that the DNA is bent. In the cases of Intermediates I and II there is a

strong bend near the next to be replicated templating base. This is in line with a model based on

the crystal structure of human DNA polymerase ~ complexed with DNA (29) which contains a

90° bend in the single stranded template. It was proposed by Pelletier et al. (29) that such a bend

of the template at the active site could increase the enzyme’s sensitivity to mismatch prior to.

catalysis, by minimizing &e importance of indiscriminate base stacking between the primer

terminus and an incoming nucleotide until a check has been made against the template for proper

base pairing . In the case of the bulged Interinediate III, bending is known to be associated with

bulged structures (30-41) although the biological significance, if any, is not known.

Bulged Intermediates versus Normal Extension: Replication Fidelity and

Carcinogen Modification: Comparison of the energies of the G4 anti bulged structure

(Intermediate HI) versus its G, anti normal counterpart (Intermediate II) shows that the unbulged ~

form (Intermediate H) is strongly favored, as would be expected to maintain ftithful replication

even in sequences such as iVarI which accommodate slippage to form a bulge (Table 7).

However, such slipped structures can become stabilized in the presence of a covalently bound

carcinogen (42-47). on the other hand when the Gq is syn the energies of Intermediates II and III

are very close (Table 7). This suggests that if a carcinogen like AAF which induces the syn

9

Page 13: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

.

conformation were covalently bound to GAthe slipped Intermediate 111may be much closer in

energy to the normally extended one than when the Gq is anti.

CONCLUSION

Our modeling studies of, replication interrnediates for the NarI mutational hotspot

sequence of E. coli has yielded some plausible structures that are in line with faithful replication

at the G~ template when this base is undamaged by carcinogen. On the other hand when the GAis

syn the energies of normally extended Intermediate 11 and slipped Intermediate III are much

closer than when GAis anti. This suggests that if a carcinogen like AAF which induces the syn

conformation were covalently bound to the Gd hotspot the slipped Intermediate 111may be close

-m energy to the normally extended one.

ACKNOWLEDGEMENTS

This research is supported by NIH Grants CA 75449, CA 28038

Grant DE-FG02-90ER60931 to S.Broyde and R. Shapiro, and DOE

960R22464 with Lockheed Martin Energy Research to B.E. Hingerty.

RR-06458 and DOE

Contract DE-AC05-

10

Page 14: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

REFERENCES1. Streisinger, G., Okada, Y., Ernrich, J., Newton, J., Tsugita, A., Terzaghi, E., and Inouye, M.

Cold Spring Harbor Symp. Quant. Biol. 31,77-84 (1966).2. Masurekar, M., Kreuzer, K. N., and Ripley, L. S. Genetics 127,453-462 (1991).3. Kaiser, V. L., and Ripley, L. S. %oc. Nat]. Acad. Sci. USA. 92,2234-2238 (1995).4. Fuchs, R.P.P., Schwartz, N. and Daune, M. Nature 294,657-659 (198 1).5. Koffel-Schwartz. N., Verdier. J.M., Bichara, M., Freund. A.M., Daune, M. and Fuchs R.P.P.

J Mol. Biol. 177,33-51 (1984).

6. Burnouf, D., Koehl, P. and Fuchs, R. P. P. Proc. Natl. Acad Sci. USA. 86,4147-4151(1989).

7.

8.9.

10.-11.12.

Sch&per, R. M., Koffel-Schwartz, N. and Fuchs R. P. P. Carcinogenesis (Lend.)1 l,10S7-1095 (1990).Kunkel, T. A. Biodzemisz?y 29,8003 (1990).Timsit, Y., Westhof, E., Fuchs, R. P. P., Moras, D. Nature 341,459-462 (1989).

Hingerty, B.E., Figuero~ S., Hayden, T. and Broyde, S. BiopoZymers 28, 1195-1222 (1989).Arnott, S., Dover, S., and Wonacott, A. Acts Cryst. B25, 2192 (1969).Shapiro, R., Hingerty, B. E. and Broyde, S. J Biomol Struct. Dyn. 7,493-513 (1989).

13. Olson, W. K. and Srinivasan, A. R. in Landolt-Bornstein Numerical Data and FunctionalRelationships in Science and Technology, Group VII. Biophysics. Volume 1D, W. Saenger,Editor, Springer Verlag, Berlin, pp. 415-435 (1990).

14. Srinivasan, A. R. and Olson, W. K. Fed. Proc., Fed. Am. Sot. Exp. Biol. 39,2199 (1980).15. Olson, W. J. Am. Chem. Sot. 104,270-286 (1982).16. Srinivasan, A. R., Yathin@ N., Rae, V.S.R. and Trakash, S. Biopol. 19, 165-191 (1980).17. Olson, W. Biopolymers 17, 1015-1040 (1978).

18. Fenley, M.O., Manning, G.S. and Olson, W. K. Biopolymers 30, 1191-1203 (1990).19. Fenley, M. O., Olson, W. K., Tobias, I. and Manning, G.S. Biophysical Chemis@ 50,255-

271 (1994).20. Shapiro, R., Ellis, S. and Broyde, S. manuscript in preparation.21. Hingerty, B.E., Ritchie, R. H., Ferrell, T. L. and Turner, J. E. Biopolymers 24,427-439

(1985).22. Manning. G.S. Quart. Rev. Biophys. 11, 179-246 (1978).23. Roy, D., Hingerty, B. E. and Broyde, S. unpublished.24. Olson, Wilma K. and %-inivavasan, A. R. Computer Science Information Project e-book

Oak Ridge National Laboratories (1997)25. Ramstein, J., Lavery, R. Proc. NatL Acad. USA. 85,723 1-7235(1988).26. Lavery, R. Structure and expression, Volume 3, DNA Bending and Curvature, Olson, W. K.,

Sarma, M. H., Sarma, R. H., Sundaralingam, M., eds., Adenine Press, Schenectady, NZ pp.

27.

28.

29.

30.31.

191-211 (1988).Dickerson, R. E. Oxford University Press, Oxford (S. Neidle, editor) in press (1997).

Berman, H. M., W. K. Olson, D. L. Beveridge, J. Westbrook, A. R. Srinivasan, and B.Schneider. Biophys J. 63, 751-759(1992).Pelletier, H., Saway~ M. R., WoMe, W., Wilson, S. H. and Kraut, J. Biochemistry 35,

12742-12761 (1996).Woodson, S. A., and Crothers, D. M. Biochemistry 27,3130-3141 (1988).

Hsie~ C. H. and Grifl%h, J. D. Proc. Natl. Acad. USA. 86,4833-4837 (1989).

11

Page 15: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

32.33.34.35.36.37.38.39.

40.41,

Rice, J. A. and Crothers, D. M. Biochemistry 28,4512-4516 (1989).Bhattacharyy~ A. and Lilley, D. M. J. Nucleic Acids Res. 17,6821-6840 (1989).Bhattacharyya, A. Murchie, A. I. H. and Lilley, D. M. J. Nature 343,484-487 (1990).Tang, R. S. and Draper, D. E. Biochemistry 29,5232-5237 (1990).Wang, Y. H. and Griffith, J. Biochemistry 30, 1358-1363 (1991).

Rosen, M. A., Shapiro, L. and Patel,D.J.Biochemis@31, 4015-4026 (1992).Wang, Y. H., Baker, P. and Griffith, J. J. BioL them. 267,4911-4915 (1992).

Gohlke, C., Murchie, A. I. H., Lilley, D. M. and Clegg, R. M. Proc. Natl. Acad. USA.91,11660-11664 (1994).Zacharias, M. and Hagerman, P. J. J Mol. Biol. 247,486-500 (1995).Lilley, D. M. J. Proc. Natl. Acad. USA. 92,7140-7142 (1995).

42. Garci~ A., Lambert, I. B. and Fuchs, R. P. P. Proc. Natl. Acad. Sci. USA. 90,5989-5993

(1993).,

43. Milhe’, C., Dhalluin, C., Fuchs R. P. P. and Lefevre, J. F. Nucleic Acids Res. 22,4646-4652

(1994).

“44. Milhe’, C., Fuchs R. P. P. and Lefevre, J. F. Eur J Biochem. 235,120-127 (1996).45. Y% N. Q., Smimov, S., Cosman, M., Bhanot, S., Ibanez, V., and Geacintov, N. E. Structural

Biolo~: The State of the Art, Proceedings of the Eighth Conversation in the DisciplineBiomoiecuIar Stereodynamics, State Universi~ of New York at Albany, June 22-26, 1993(Sarma R. H., Sarma. M., Eds.) Vol. 2. pp 349-366, Adenine Press, Schenectady, NY. (1994),

46. Mao, B., Cosman, M., Hingerty, B.E., Broyde, B., Patel, J. P. Biochemistry 34,6226-

6238 (1995).47. Mao, B., Flingerty, B. E., Broyde, B., Patel, J. P. Biochemistry 34, 1664’ -16653 (1995).

12

Page 16: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

TABLE 1Energy minimized conformations of Intermediate ~ 5’ G1 G2 CJ G4C5 C6 A, 3’

3’. . .

GIOGgT85’

Dielectric Function AMk# x Domain Figure #

(KC:v:ol)1 0.0 anti 4A, 7A

2 2.2 anti 4B.-3 5.1 Syn 4C4 10.5 syn 4D5 11.3 syn 4E

Dielectric Function BMin# x Domain Figure #

(KcaYmol)1 0.0 anti 4B2 7.4 anti b

3 17.3 Syn 4E4 21.2 Syn 4C5 25.3 Syn 4D

‘All computed structures shown. Structures with the same figure number are virtually identical.‘TJot shown, structure is a somewhat distorted variant of 4A.

13

Page 17: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

TABLE 2Energy minimized conformations of Intermediate 11’ 5’ G, Gz C~ Gd C~ CG& 3’

. . . .

3’ Cll GIOGyT~ 5’

Dielectric Function AMin# x Domain Figure #

(Kc~UEmol)1 0.0 anti 5A, 7B

.< 2 16.2 anti 5B

3 22.4 anti 5C

Dielectric Function BMin# AE x Domain Figure #

(Kcal/mol)1 0.0 anti 5A, 7B2 16.4 anti 5B

3 24.5 syn 5D

‘Computedidentical.

structures to 25 kcal/mol shown. Structures with the same figure number are virtually

14

Page 18: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

TABLE 3 GdC,

Energy minimized conformations of Intermediate 11~ 5’ GI G2 C~ Cb A, 3’. . . .

3’ Cll GIO Gg Tg 5’

Dielectric Function A

Min# AE x Domain Figure #(Kcal/mol)

1 0.0 anti 6A, 7C

2 0.2 Syn 613

3 12.3 syn 6C

4 18.1 anti 6D

5 19.5 anti 6E

Dielectric Function B

Min# AE x Domain Figure.#(Kcal/mol)

1 0.0 anti b

2 5.9 syn 6C

3 9.7 syn 6B

4 23.4 anti 6D

5 26.1 anti 6E

‘All computed structures shown. Structures with the same figure number are virtually identical.!Not shown, structure is a somewhat distorted variant of 6A.

15

Page 19: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

TABLE 4Structure characterization of Intermediate I :5’ GI G2 C~ G~C~ CG A, 3’

3’. . .

G,OGg Tg 5’

~igure ,Xof Gd Hydrogen A7.T~ Pair Single Stranded Region Stacking in# Bonding Single

in Double StrandedStranded Region Region

4A Anti All three are Normal Sharp bend between GJ and CJ CJ,G2 and G,intact are stacked

4B Anti All three are Normal Sharp bend between Gd and C~ C~ ,G2and G,intact are stacked

4C Syn All three are Normal U-turn between G1 and C~ C3 ,Gz and G1intact are stacked

4D Syn All three are Normal U-turn between C~ and Gq C~ ,Gz and G,intact are stacked

4E Syn Two are intact Disrupted Sharp bend between Gq and C~ Gz and G, arestacked

16

Page 20: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

TABLE 5Structure characterization of Intermediate II :5’ GI G* CJ GdC~ Cb A, 3’

. . . .

3’ C1lGIOG9T8 5’

‘igmx#

5A

5B

5C

5D

Anti

Anti

Anti

Syn

Iydrogen BondingrnDouble Stranded

Region

All four are intact

All four are intact

All four are intact

“Ihree are intact

Normal

Normal

Normal

Only onehydrogen

bond isformed

stacking

No stacking

Stacking

No stacking

SingleStranded

Region

Sharp bendbetween G,

andC~

Sharp bendbetween Gq and

C3

Shallow bendbetween Gd and

c,

Sharp bendbetween Gi and

C3

31is parallel to C1,md stacked with it;

C~ and Gzarestacked

C~, Gzand G1arestacked

G2and G1are notstacked; C~ and Gz

are stacked

Cg,Gzand G1arestacked. G~is

parallel to Cll butnot stacked with it

17

Page 21: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

.

TABLE 6 G, C,Structure characterization of Intermediate III: 5' GlG2 CJ

?igure til Stacking G2.C1l C3”G3 Unpartnered Gq Unpartnered C~ Cb.Gg A7.T8# with GI Pair Pair Pair Pair

6A Poor Normal Normal Anti One face stacked Normal NormalMajor groove with Gg

6B Poor Normal Normal Syn One face Normal Normal

Major groove stacked with Gg

6C Partly Normal Normal Syn Stacked with Distorted DistortedStacked Major groove CG.G9

6D None Normal Normal Anti, Stacked with C~ Stacked with Normal DistortedCGand G,

6E Poor Normal Normal Anti, Stacked with C~ Stacked with Gd Normal Distorted

18

Page 22: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

.

TABLE7 Energies of Intemediate II@xtended) versus Intermediate III@ulged)in lowest energy anti and syn conformations

Energies (kcalhnol)’

Intermediate II Intermediate HI Intermediate II Intermediate IIIAnti Anti SYn Syn

Dielectric Function A -327.5 (0.0) -295.6 (0.0) -294.0 (33.5) -295.~ (0.2)Dielectric Function B -281.7 (0.0) -~55.o (().0) -257.2 (24.5) -249.1 (5.9)

-..

aAbsolute energies are followed by AE, the energy of the given structure relative to thelowest energy form forthat intermediate, inparentheses. See Tables 2and3.

19

Page 23: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

FIGURE

Figure 1.(a)

Figure 2.

Figure 3.

Figure 4.

Figure 6.

Figure 7

LEGEND

Protonated hydrogen bonding patterns investigated.Hoogsteen and (b) Wedge

Dielectric functions studied.

Total number of trials employed for energy minimization.

Energy minimized conformations of Intermediate I (See Tables #1 and #4).Stereo views.

Energy minimized conformations of Intermediate II (See Tables #2 and #5).Stereo views.

Energy minimized conformations of Intermediate III (See Tables #3 and #6).Stereo views.

Space-filling models for low energy conformations ofA. Intermediate I B. Intermediate 11 C. Intermediate III (See Tables 1-3)Stereo views.Color code: template strand - yellow, primer strand - blue; Gt hotspot - red;

unpaired C~ - green (Intermediate III)

20

Page 24: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

.

CYTeSLmE WANME

(al

Susan

CTTOSSME

SIJsAa

CUANIRE

SUGAR

Cx●

SU6AR

(b)

Page 25: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

.

.

/,/

Dielectric Functions

.~.~-x-x-x-x”x-x-x-x”x-~”~-------

70 .

60 .

50 .

&413 ..

30 .

20

..z-

10

0

Page 26: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

\

UAS W@ law“w! UAS UeeJs “qq l]UW

UAs J]uv ON 4)6POM ‘600H ON ~“m UAS jylv

Page 27: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

,

A.

B.

c.

. D.

Ii w

A.

B.

c.

D.

E

Page 28: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

A.

B.

c.

ew-

FD.

= w

A.

B.

c.

FD. v

-.

Page 29: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

.-

A.

B.

A.

B.

c. c.

D.

%

E.

D.

E.

Page 30: Modeling DNA Replication Intermediates/67531/metadc674939/m2/1/high_res_d/3524.pdfmutations such as double base (-2) deletions during replication of DNA (1,2,3). The sequence 5’G1GzC~GdC~Cb3’

?

?

r-

-ii