40
DNA-protein electrostatic recognition: analysis of Protein Data Bank structures of DNA-protein complexes A. G. Cherstvy 1 , A. B. Kolomeisky 2 , and A. A. Kornyshev 3 1 Institut für Festkörperforschung, Theorie-II, Forschungszentrum Jülich, D-52425 Jülich, Germany 2 Department of Chemistry, Rice University, Houston, Texas 77005 , USA 3 Department of Chemistry, Faculty of Natural Sciences, Imperial College London, SW7 2AZ, London, UK

DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

DNA-protein electrostatic recognition: analysis of Protein Data Bank structures of DNA-protein complexes

A. G. Cherstvy1, A. B. Kolomeisky2, and A. A. Kornyshev3

1 Institut für Festkörperforschung, Theorie-II, Forschungszentrum Jülich, D-52425 Jülich, Germany

2 Department of Chemistry, Rice University, Houston, Texas 77005 , USA

3 Department of Chemistry, Faculty of Natural Sciences, Imperial College London, SW7 2AZ, London, UK

Page 2: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Introduction to protein-DNA interaction and recognition

• DNA-protein recognition is vital for many biological processes (e.g., gene expression and regulation)

• Extreme diversity of proteins: humans ∼500 000 proteins, ∼ 25 000 genes.

• Protein classes: gene regulatory (transcription factors), repair proteins, structural proteins (histoneshistoneshistoneshistones), processing proteins (RNA Poly), etc.

• Main interactions: hydrogen bonding (HB), electrostatic electrostatic electrostatic electrostatic (DNA/proteins), hydrophobic, van der Waals forces.

• Protein recognition motifs: helix-turn-helix, zinc finger, leucine zipper.

• Complex and rather probabilistic code of DNA-protein recognition.

• Protein-DNA binding affinity: DNA sequenceDNA sequenceDNA sequenceDNA sequence, pH, [salt], T, helper proteins, DNA 3D conformation, etc.

• Physical mechanisms behind electrostatic DNA-protein interactions.

Page 3: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Electrostatic DNA-protein interactions: lac repressor

Enormous dependence of lac repressor association binding constant K on [salt]

Upon sliding, condensed cations are removed in front and they bind back on DNA behind the protein.

specific: 7-8 non-specific: 11

R B. Winter et al., Biochem., 20 6961 (1981) M.T. Record et al., Biochem., 16 4791 (1977)

Winter & von Hippel: Electrostatic DNA-protein interactions are largely sequence non-specific !?

Page 4: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Hydrogen bonds with DNA bases: DNA-protein recognition code

• HB donors and acceptors determine the unique code of DNA-protein HB interactions; HB strength is 1-5 kBT

• HB formation preferences in DNA-protein complexes: Arg NH1/NH2 and Lys NZ with O6 and N7 of Guanine, Asn and Gln with Adenine, Glu and Asp with Cytosine.

N. M. Luscombe et al., Nucl. Acids Res., 29 2860 (2001)

Page 5: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Electrostatic potential of lac repressor

• Protein residues Lysine (pKa=10), Arginine (12), Histidine (6.5) are in close proximity to DNA phosphates

• DNA-induced charge patterns on proteins that are recognized by DNA?

Non-specific: 1osl.pdb Specific 1l1m.pdb

PositiveNegative

MDL Chime 6 Protein Explorer 2.80

Page 6: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Positive protein charges “love” DNA: sequence specificity of interactions?

Oxygens OD1/OD2 of Asp and OE1/OE2 of Glu.

Nitrogens NZ on Lys, NH1/NH2 on Arg, and

ND1 on His NCP stability ([salt])

Page 7: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

W. Kabsch et al., Nucl. Acids Res., 10 1097 (1982)W. K. Olson et al., PNAS, 95 11163 (1998)

B-DNA charge and structure non-ideality-1 e0 per each 1.7 Å along DNA axis

Major groove

Minor groove

H=34Å

2a=20Å

DNA corrugated structure is recognized by proteins

Page 8: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

R

Protein

DNA

s1 s2

sph

Model

• Extract atomic coordinates from PDB files of protein-DNA complexes (Math 6)

• Identify closest protein N+ charges, R~lB~7Å

• s1,2 on the samesamesamesame DNA strand; DNA direction

• Histogram of s1-s2 distribution

• If uniform distr. ⇒ no DNA sequence specificity

• Two-peaks distr. ⇒ protein N+ follow DNA P-

• As 3D DNA structure is sequence specific, individual P- are tracked by Lys and Arg

• Complementary DNA-protein interaction lattices

• Sequence-specific electrostatic interactionssph~7Å

Page 9: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Arginine: N CA C O CB CG CD NE CZ NH1 NH210th and 11th atoms are N

Lysine: N CA C O CB CG CD CE NZ9th atom is N

Histidine: N CA C O CB CG ND1 CD2 CE1 NE27th atom is N, charged or neutral

Protein positive residues and DNA negative charges

DNA: PO4- groups

Page 10: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Nucleosomes: DNA-wrapping proteins of eukaryotes

K. Luger et al., Nature, 389 251 (1997)

Page 11: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

-4 -2 0 2 4 60

5

10

15

20

s1-s2, Þ

N+

,1ao

i

-4 -2 0 2 4 602468

101214

s1-s2, Þ

N+

,1f6

6

-4 -2 0 2 4 60

5

10

15

s1-s2, Þ

N+

,1kx

3

-4 -2 0 2 40

2

4

6

8

10

12

s1-s2, Þ

N+

,2cv

5

-4 -2 0 2 4 60

2

4

6

8

10

s1-s2, Þ

N+

,2nq

b-4 -2 0 2 4 6

02468

101214

s1-s2, Þ

N+

,1eq

z

-4 -2 0 2 4 60

2

4

6

8

10

12

s1-s2, Þ

N+

,1p3

4

-4 -2 0 2 4 60

2

4

6

8

10

s1-s2, Þ

N+

,1m

18

-4 -2 0 2 40

10

20

30

40

50

s1-s2, Þ

N+

,Sum

-4 -2 0 2 4 602468

101214

s1-s2, Þ

N+

,2nz

d

-4 -2 0 2 4 60

2

4

6

8

10

12

s1-s2, Þ

N+

,2f8

n

-4 -2 0 2 4 602468

101214

s1-s2, Þ

N+

,1u3

5

-4 -2 0 2 4 60

5

10

15

20

s1-s2, Þ

N+

,1kx

5

-4 -2 0 2 4 60

2

4

6

8

10

12

s1-s2, Þ

N+

,2fj

7

-4 -2 0 2 4 602468

101214

s1-s2, Þ

N+

,2py

o 147 bp: 1kx5, 2fj7, 2pyo

145 bp: 1nzd, 2f8n, 1u35

Canonical 146 bp NCP: 1aoi, 1f66, 1kx3, 2cv5, 2nqb, 1eqz. 1p34, 1m18

Sum of all complexes: frog, human, fruit fly, chicken NCPs

Results for s1-s2 in NCPs: 75-100 N+ close, 160-230 in totaldepleted

Page 12: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

-6 -4 -2 0 2 4 60

5

10

15

20

25

s1-s2, Þ

N+

,Sum

Prokaryotic DNA-bending proteins also reveal two peaks

Complexes analyzed:2np2, 1ihf, 1p51,1p71,

1p78, 1ouz, 1owf, 1owg

U-turn like severe bending of DNA

Page 13: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Main DNA-binding motifs of proteins

Helix-turn-Helix,λλλλ repressorrepressorrepressorrepressor, 1lmb.pdb:

2 α-helices in major groove, HB with DNA bases,ES with phosphates

Zinc finger, Zif268Zif268Zif268Zif268, 1aay.pdb: 3 α-helices in major groove,

each finger recognizes 3 bps,HB+ES

Leucine zipper, GCN4GCN4GCN4GCN4, 1ysa.pdb: 2 consecutive major grooves are recognized by 2 long bound α-

helices, HB+ES

PositiveNegative

Page 14: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

-4 -2 0 2 40

10

20

30

40

s1-s2, Þ

N+

,Sum

-4 -2 0 2 4 60

5

10

15

20

25

s1-s2, Þ

N+

,Sum

-4 -2 0 2 405

101520253035

s1-s2, Þ

N+

,Sum

Basic DNA-binding protein motifs: uniform distributions and no sequence-specificity

zinc fingers leucine zippers lac-, lambda-, 434-, cro-, arc-repressor-like complexes

1aay, 1a1l, 1p47, 1jk1, 1jk2, 1a1f, 1a1g, 1a1j, 1a1k, 1a1h, 1a1i, 1zaa, 1g2f, 1g2d, 1f2i, 1llm, 1mey, 1ubd, 1tf3, 2jp9,

2gli, 3dfx

1ysa, 2c9l, 2c9n, 2h7h, 1d66, 1fos,

1gu5, 1hjb,

Repressors (1osl, 1l1m, 2bjc, 1cjg; 1lmb, 3bdn, 6cro, 1lli, 1rio; 1par,

1bdt, 1bdv, 2bnz, 2cax; 1au7, 2or1, 1per, 3cro, 1rpe, 2p5l, 1gt0, 1hf0, 1ic8, 1o4x, 2r1j) and CAP proteins (1cgp, 1zrc, 1zrd, 1zre, 1zrf, 1o3q,

1o3r, 1o3s, 1j59, 1run, 2cgp),

Page 15: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Conclusions and outlook

• For largelargelargelarge DNA-protein complexes, NCP and HU, tracking of individual DNA phosphates. • DNA-induced sph~7 Å charge periodicity along DNA-protein interfaces.• Up to 100 charge-charge contacts, large 10-30 kBT energy profit due to complementarityof DNA-protein charge lattices. • Recognition of native and strongly bound DNA sequences.• Nucleosome positioning on DNA, together with sequence-specific DNA bending code.• 146 vs. 145/147 bp DNA NCPs. Different DNA affinities to histones? No data.

• For smallsmallsmallsmall complexes, with 5-10 ES contacts, no statistical preference and weak/no sequence specificity of ES interactions. • Electrostatics is weak and other interactions contribute to recognition (HB).

• Interplay of HB+ES : future research.

Page 16: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Thank you

Grant CH 707/2-1

• A. G. Cherstvy, A. B. Kolomeisky, and A. A. Kornyshev, J. Phys. Chem. B, 112 4741 (2008).

• A. G. Cherstvy, J. Phys. Chem. B, 113 4242 (2009).

Page 17: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

E. Segal et al., Nature 442 772 (2006)

NCP positioning code

Page 18: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

hydrogen bonding hydrophobic electrostatic

Lac repressor contacts with DNA

C. G. Kalodimos et al., Science, 305 386 (2004)

Electrostatic contacts are believed to be sequence-nonspecific, while hydrogen bonding is highly sequence specific

Page 19: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Spiraling RNA Polymerase: protein binding requires DNA

groove tracking

K. Sakata-Sogawa et al., PNAS, 101 14731 (2004)

Sliding vs. Spiralingelectrostatic barriers vs. hydrodynamic friction

• Theory of Schurr for lac spiraling: 100 times stronger hydrodynamic drag and smaller D1: D1=5×10-9 cm2/s

• Old experiments (Blomberg): D1=3×10-9 cm2/s

M. J. Schurr, Biophys. Chem., 9 41 (1979)

A.G. Cherstvy and R.G. Winkler, J. Chem. Phys., 120 9394 (2004)

Page 20: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

h

R

h

Protein charge array

DNA charge array

z

K

δm

∆n

Model of DNA-protein recognition: charge complementarity

• Random charge displacements mimic bp specific nonideality of DNA/protein structure

• Long-range correlations zn=nh+∆n

• Recognition region -- similar charge variations ∆n = δm -- stronger DNA-protein attraction

• Potential well near the homology region

PositiveNegative

Page 21: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Artificial charge periodicity in protein DNA-binding domains

• Periodicity of ≈ 7 Å and ≈34 Å is expected from PDB data analysis.

• Next step: backbone elasticity + DNA helicity + PDB files analysis + computer simulations of protein diffusion

34 Å

7 Å

PO4−

Page 22: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

np

r

λ konkoff

protein

DNA

target

x

nads

Macroscopic qualitative model of protein diffusion in DNA coil

• Every cycle: 3D diffusion in solution + 1D sliding along DNA

• [Protein] in solution cp=np/V and on DNA cads=nads/V

• Volume of DNA coil ∼Lr2

PositiveNegative

Page 23: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Mechanisms of protein diffusion on DNA

J. Marko et al., Nucl. Acids Res., 32 3040 (2004)

Sliding/1D diffusion Hopping Inter-segmental transfer (loop-facilitated process)

Actual diffusion is a combination of these basic steps

Page 24: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Time of target search: 3D + 1D

0 0

exp[ ( )]exp[ ( ')] '

( )

x z

c

G zdz G z dz

D z

λ βτ β

+

= −∫ ∫

expon p p eff

eff

off ads ads B

k n c Ey y

k n c k T

= = =

3

1

, 0( )

,

D z xD z

D x z x λ

< <=

< < +

0, 0( )

, .eff

z xG z

E x z x λ

< <= − < < +

3 1

2 2

12 2c

eff

x

D y

x

D D

λτ

λ= + +

1/

c

ads

L

n

α

τ τλ

van Kampen: Mean First Passage Time for 1 cycle

Diffusion coefficient profile

Non-equilibrium protein adsorption constant on DNA;equilibrium: y=kon/koff

Free energy profile: no DNA bp specificity

Total search time along DNA of length L: α=1: random protein attachment every stepα>1: super-diffusion

3D + 1D + correlation term (missing previously)protein unbinding before travelling length λ on DNA

Page 25: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

2 2pn Lx Lr V= =

3 12 2

21 1 2,on p off ads

free ads

D Dk c k c

xτ τ λ= = = =

ads

r yd

nλ =

11

3

1 1 22

p

p ads ads adsp

nLr r L

D n n r n d nn yd

αλτ

λ λ

− = + +

2

3 3

12 2S

p p

Lr

D ac D anτ = =

11

3/ 2

21 p

S ads d pa s adsn y

n ya L

r n yd n d r n ydd

αττ

− = + +

1

3

1D

dD

= <<

Total search time vs. Smoluchovski time

L scales out

Rates of protein binding and unbinding

Optimal sliding length λ

Smoluchovski 3D diffusion rate to a drain of radius a

Final ratio of search times

12

S p

a d

r dy n

ττ

+= At equilibrium, yeff=1, d<<1,

correlation term

/ px r n= Length of 3D path

Page 26: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Minimal search time at intermediate y and np values

a=1 nm, r=30 nm, α=1, na=1000, np=1, d=0.001

Weak attraction to DNA

Strong attraction: long λineffective 1D search only

Unbinding drift is strong, λ is short

Always exist proteins close to the target in solution

• As nads grows, τ decreases -- parallel search of DNA by many proteins

• Dotted curves: without correlation term -- wrong results

d=0.001

• Diffusion times faster than Smoluchovski

Page 27: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Part 2: Electrostatic key-lock mechanism of protein-DNA recognition

Page 28: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Electrostatic DNA-protein interaction and recognition energy

( ) ( ) ( )0

22 20

0m n

Niqh m n iqiqz

el

m N nc

eW dq K q R e e e

δκε π

∞∞ − −∆

−∞=− =−∞

= − + ∑ ∑∫

g=2π/h, <∆n2>=∆2, <δm

2>=δ2, Ω2=δ2+∆2

( ) ( ) [ ]

( ) ( ) ( )

2 2 2

2 2

22 2 2 / 20

0 0 01

22 2 / 20

0 * 00

22 cos

2cos 1

n g

el long rangenc

q

c

eW K R K n g R e ngz

b

edq K q R q z z

M

M e

κ κε

κπε

∞− Ω

−=

∞ − Ω

= − + + −

+ − −

( )

( )5/ 22

2 2

2

2 22

long range B

B c

W z l

k T R

RM

z

zεε

− Ω ∆

+ ∆

−∆ ∆≈ −

z=z0

zz=z*

M=2N+1 charges

General electrostatic interaction energy

Averaged

Simple form for Ω<<1, κ=0

0 *z z z∆ = − ( )20 /B Bl e k Tε=

Recognition energy

Page 29: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Electrostatic recognition energy ∆W

• Well is accompanied by the barriers• Well depth is several kBT

• Narrow wells: no “funnels” for protein diffusion • Screening makes wells shallower• Well depth d grows linearly with M• d scales as 1/R3 at κ=0 and as e-κR with salt

o o o2 211, 10A, 2, 3.4A, , 1AcM R hε δ= = = = = ∆ Ω =

o

1/ 7 Aκ =

0κ =

max 3/ 2z R=

0 *z z z∆ = −

Page 30: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Protein residence time in the well

• Wells of ∼kBT in depth slow down protein diffusion

• Enough time to provoke protein conformational changes (µs- ms) and to induce stronger protein binding to DNA

• ES DNA-protein recognition is the first step of protein docking

• Stronger Hydrogen Bonding interactions can be formed afterwards

Kramers <escape time>

van Kampen M-F-P-Time

τc

Page 31: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Thank you

cond-mat: 0708.0021

Page 32: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Funny energy barriers: Coulomb case

δ

R

∆z

( )

( ) ( )( )

( )( )

( )

2 2

2 20 0

2 2 22

2 2200

3/ 2 5/ 22

2

2 2 2

2

( )

0

2

2

el el el

c c

TaylorExpansion

el

c c

R R z

e eW W W

R z R z

ee zW

R z

R z R zδ

δ δ

δε ε δ

δδ

ε ε

= + ∆ +

∆ = − = −+ ∆ + ∆ +

∆∆ ≈

+ ∆

− ∆+

+ ∆

At R2>2∆z2 fluctuations of charges always reduce their attraction energy

R

∆z

δ

||||||||

-4 -2 0 2 4

-0.20

-0.15

-0.10

-0.05

0.00

x and z

z only

x only

W (

arb

. u

nits)

z (arb. units)

Computer Simulations

||

⊥ + ||

Page 33: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Electrostatic DNA-protein interactions: lac repressor

Enormous dependence of lac repressor association binding constant K on [salt]

Upon sliding, condensed cations are removed in front and they bind back on DNA behind the protein.

specific: 7-8 non-specific: 11

R B. Winter et al., Biochem., 20 6961 (1981) M.T. Record et al., Biochem., 16 4791 (1977)

Electrostatic DNA-protein interactions are largely sequence non-specific? [complex]

M[DNA][protein]

K = ⋅

Page 34: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

1 5 10 50 100 5001000

Ll1

5

10

50

100

500

1000

t t S

Simple computer test

Compl

ete D

NA cov

erage

First p

assa

ge ti

me to t

he ta

rget

• single protein hopping randomly to left/right• random target location• random protein attachment point • average over 5 runs• L=20000, λ = 50, 100, 200

Page 35: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Interaction-induced folding and conformational adaptation

C. G. Kalodimos et al., Science, 305 386 (2004)

Page 36: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Lac repressor: D1 << D3

• Brownian Protein Motion with large D1 variations • Extract D1 from Mean Square Displacements of proteins• Experiment (Austin): D1: D1=2×10-10 cm2/s • Experiment D3: D3=4×10-7 cm2/s

1MSD( 1) 2n D t= ≈ ∆

R. Austin et al., Phys. Rev. Lett., 97 048302 (2006)

Page 37: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

DNA loops formed by lac repressor

Page 38: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Electrostatic potential of RNA Polymerase II

R. D. Kornberg et al., Science, 292 1863 (2001)

Page 39: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Looping uncharged elastic rods: buckling instability

Elasticity theory: 2D and 3D elastica, Euler and Kirchhoff -- local balance of forces and moments

Excess twist energy Etw turns into loop bending energy Eb

Looping of submarine cables [J. Coyne, IEEE J. Ocean. Ing., 15 72 (1990)]

FF

2ΛA

C=kBTltw -- twist modulus, ltw=750 ÅB= kBTlp -- bend modulus, lp=500 Å

Λ=√B/F, A2=1-C2τ2/(4BF)K2(s)=4FA2/cosh[As]2 -- curve curvature

Eb=4FAΛ -- loop bending energy

F0>C2τ2/(4B) -- force to keep cable straight

∆L=4AΛ -- cable slack upon looping

Twist rate τ

Tw =Lτ

Every loop removes about 2π of the excess twist Tw: τ = τ 0 - 2π/L

Page 40: DNA-protein electrostatic recognition: analysis of Protein Data Bank structures …dnatec09/presentations/2009_05... · 2009-05-18 · DNA-protein electrostatic recognition: analysis

Looping charged rods: limitations of OSF theory

Optimal loop shape in 3D is a complicated problem: non-locality, self-contacts.

Numerical summation of the Debye-Hückel potentials along the loop contour

∆Eel=Eellooped – Eel

straight

Eel of loops with Debye-Hückel interactions: OSF electrostatic rod stiffening works only

for large loops R>>1/κ with no close contacts

lp→ lp,el = lp + lB/(4κ2h2)

08 Bl nκ π=( )2

r

el

eE r e

r

κ

ε−=

1/κ≈10Å in physiological solutionScreened interactions

of charges

h is charge-charge separation

Applicability of OSF to tight DNA loops