21
LETTER doi:10.1038/nature13709 A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing Sarah B. Miller 1 {*, F. Zehra Yildiz 1 *, Jennifer A. Lo 1 , Bo Wang 1 & Victoria M. D’Souza 1 To prime reverse transcription, retroviruses require annealing of a transfer RNA molecule to the U5 primer binding site (U5-PBS) region of the viral genome 1,2 . The residues essential for primer annealing are initially locked in intramolecular interactions 3–5 ; hence, annealing requires the chaperone activity of the retroviral nucleocapsid (NC) protein to facilitate structural rearrangements 6 . Here we show that, unlike classical chaperones, the Moloney murine leukaemia virus NC uses a unique mechanism for remodelling: it specifically targets mul- tiple structured regions in both the U5-PBS and tRNA Pro primer that otherwise sequester residues necessary for annealing. This high- specificity and high-affinity binding by NC consequently liberates these sequestered residues—which are exactly complementary—for inter- molecular interactions. Furthermore, NC utilizes a step-wise, entropy- driven mechanism to trigger both residue-specific destabilization and residue-specific release. Our structures of NC bound to U5-PBS and tRNA Pro reveal the structure-based mechanism for retroviral primer annealing and provide insights as to how ATP-independent chaperones can target specific RNAs amidst the cellular milieu of non-target RNAs. *These authors contributed equally to this work. 1 Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts 02138, USA. {Present address: Department of Biology, Georgetown University, Washington DC 20057, USA. a U G A U A G C C G G C U C U G GGG U A U U U U A A G UC G U A G U U G U G C U C C A G G A C A C C C GCCC G A G G C U C G C G A C G G G G C G A U 53Anti-PBS Anti-PAS 10 23 31 41 51 61 71 tRNA Pro Anticodon D-loop TΨC loop 9 37 16 59 56 G GC U A GU A U A U G C G U C U U C C U U G C G G A G G U G C C U U G G G G G C C U G U C C G G A U GC 53GC U U UU A U CG UG CG A U A CG CG GC 100 110 122 130 140 150 160 PBS PAS U5-PBS Site V1 Site V2 128 146 156 98 Site T1 G110 5180 A111 U109 C108 U107 G147 C106 U146 U145 3G110 A111 U109 C108 C106 5U107 U146 U145 G155 G100 U104 U102 G155 U146 U145 U156 G100 G110 U107 Trp 35 Tyr 28 G100 U156 G155 A122 A98 A111 C108 180° G110 150 G U A U G U U C U C C U G G G C G 110 150 53G U A U U U U C U C C U G G G C G 110 150 53G U A U C C U G G G C G 110 A U 530 20 40 60 80 100 120 Infectivity (%) WT G110U DM HI b c d e f 13 C (p.p.m.) 107.6 5.8 6.0 6.2 C CN (H1) 128 128* 1:0 1:0.5 1:0.9 1 H (p.p.m.) Site T2 Site T3 18 Figure 1 | Structure, function and NC interaction of U5-PBS. a, b, Secondary structures of U5-PBS (a) and tRNA Pro (b) with complementary PBS/anti-PBS and PAS/anti-PAS sequences shown in blue and green, respectively. The two internal loops in U5-PBS are boxed, and the first, second and third high-affinity NC binding site sequences are shown in red, orange and yellow, respectively. The tRNA Pro residue numbering reflects the canonical numbering scheme for all tRNAs and, hence, the number 17 is omitted. c, NMR structure of the free U5-PBS RNA. Insets: top, zoom-in view of the sequestration of U 145 and 11 U 146 residues by U 109 and C 108 of the UCUG 110 NC binding site; middle, G 110 aromatic ring is turned outside of the helix in a conformation poised for NC interaction; bottom, the minor, extruded conformation of G 155 is shown. PAS residues are sequestered by alternating G–C and G–U base pairs in both conformations. d, One-dimensional slice of a 1 H– 13 C spectrum depicting the NC tail-mediated increase of the minor conformation (indicated by an asterisk) of the tetraloop C 128 residue. On the basis of the population distribution in the pre-bound state, we estimate the extruded conformation to have a free energy of ,1.4 kcal mol 21 greater than the stacked conformation. e, Structure of NC bound to UCUG 110 via the zinc finger (the Trp 35 and Tyr 28 stacking interactions are in black) showing that the protein tails can extend to contact the UCAG 130 tetraloop and residues in and near the (G 97 A, G 155 U) internal loop. f, Left, secondary structures of the wild-type (WT), G110U and deletion mutant (DM). Right, reduction of viral infectivity is observed in mutants (n 5 6). Error bars indicate standard deviations (n 5 6 for both packaging and infectivity experiments). As a negative control, heat-inactivated (HI) virions were used for infection. A packaging assay was also done to confirm that the mutations do not affect genome encapsidation (see Extended Data Fig. 4g). 00 MONTH 2014 | VOL 000 | NATURE | 1 Macmillan Publishers Limited. All rights reserved ©2014

A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

Embed Size (px)

Citation preview

Page 1: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

LETTERdoi:10.1038/nature13709

A structure-based mechanism for tRNA andretroviral RNA remodelling during primer annealingSarah B. Miller1{*, F. Zehra Yildiz1*, Jennifer A. Lo1, Bo Wang1 & Victoria M. D’Souza1

To prime reverse transcription, retroviruses require annealing of atransfer RNA molecule to the U5 primer binding site (U5-PBS) regionof the viral genome1,2. The residues essential for primer annealing areinitially locked in intramolecular interactions3–5; hence, annealingrequires the chaperone activity of the retroviral nucleocapsid (NC)protein to facilitate structural rearrangements6. Here we show that,unlike classical chaperones, the Moloney murine leukaemia virus NCuses a unique mechanism for remodelling: it specifically targets mul-tiple structured regions in both the U5-PBS and tRNAPro primer

that otherwise sequester residues necessary for annealing. This high-specificity and high-affinity binding by NC consequently liberates thesesequestered residues—which are exactly complementary—for inter-molecular interactions. Furthermore, NC utilizes a step-wise, entropy-driven mechanism to trigger both residue-specific destabilization andresidue-specific release. Our structures of NC bound to U5-PBS andtRNAPro reveal the structure-based mechanism for retroviral primerannealing and provide insights as to how ATP-independent chaperonescan target specific RNAs amidst the cellular milieu of non-target RNAs.

*These authors contributed equally to this work.

1Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts 02138, USA. {Present address: Department of Biology, Georgetown University, Washington DC 20057, USA.

a

U

G

AU

A

G

CC

GG

C

UC

U

G

G G G

UA

U U

UU

A

A

G

U C

G

U

A

G

UU

G

U

G C

U

CC

AG

G

A

C

A

C

CC

G C C CGA

GG

CU

C

GC

GA

CG

GG

G

C

G

A

U

5′

3′

Anti-PBS

Anti-PAS

10

23

31

41

51

61

71

tRNAPro

Anticodon

D-loop

TΨC loop

9

37

16

59

56

G

G CU AG UA

U

A

U G

C

GUC

U U

CCU

UG

C

GG

AG

GU

G

CC

UU

GGG

GGC

CU

GUCC

GG

AU

G C

5′ 3′

G CU UU UA U

C G

U GC GA U

A

C GC GG C

100

110

122

130

140

150

160

PBS

PAS

U5-PBS

Site V1

Site V2

128

146

15698

Site T1

G110

5′

180

A111

U109

C108

U107

G147

C106

U146

U145

3′

G110A111

U109

C108

C106

5′

U107

U146

U145

G155

G100

U104

U102

G155

U146

U145

U156

G100

G110

U107

Trp 35

Tyr 28

G100

U156

G155

A122

A98

A111

C108

180°

G110

150

G UA U

G

U

UC

U

CCU

GG

G

CG

110

150

5′3′

G UA U

U

U

UC

U

CCU

GG

G

CG

110

150

5′3′

G UA U

CCU G

GG

C G

110A U

5′ 3′0

20

40

60

80

100

120

Infe

ctivity (%

)

WT G110U DM HI

b

c

d

e

f

13C

(p

.p.m

.)

107.6

5.86.06.2

CCN(H1′) 128

128*

1:01:0.51:0.9

1H (p.p.m.)Site T2

Site T3

18

Figure 1 | Structure, function and NC interaction of U5-PBS.a, b, Secondary structures of U5-PBS (a) and tRNAPro (b) with complementaryPBS/anti-PBS and PAS/anti-PAS sequences shown in blue and green,respectively. The two internal loops in U5-PBS are boxed, and the first, secondand third high-affinity NC binding site sequences are shown in red, orange andyellow, respectively. The tRNAPro residue numbering reflects the canonicalnumbering scheme for all tRNAs and, hence, the number 17 is omitted. c, NMRstructure of the free U5-PBS RNA. Insets: top, zoom-in view of the sequestrationof U145 and 11U146 residues by U109 and C108 of the UCUG110 NC bindingsite; middle, G110 aromatic ring is turned outside of the helix in a conformationpoised for NC interaction; bottom, the minor, extruded conformation of G155

is shown. PAS residues are sequestered by alternating G–C and G–U base pairsin both conformations. d, One-dimensional slice of a 1H–13C spectrum

depicting the NC tail-mediated increase of the minor conformation (indicatedby an asterisk) of the tetraloop C128 residue. On the basis of the populationdistribution in the pre-bound state, we estimate the extruded conformation tohave a free energy of ,1.4 kcal mol21 greater than the stacked conformation.e, Structure of NC bound to UCUG110 via the zinc finger (the Trp 35 and Tyr 28stacking interactions are in black) showing that the protein tails can extend tocontact the UCAG130 tetraloop and residues in and near the (G97A, G155U)internal loop. f, Left, secondary structures of the wild-type (WT), G110U anddeletion mutant (DM). Right, reduction of viral infectivity is observed in mutants(n 5 6). Error bars indicate standard deviations (n 5 6 for both packaging andinfectivity experiments). As a negative control, heat-inactivated (HI) virionswere used for infection. A packaging assay was also done to confirm that themutations do not affect genome encapsidation (see Extended Data Fig. 4g).

0 0 M O N T H 2 0 1 4 | V O L 0 0 0 | N A T U R E | 1

Macmillan Publishers Limited. All rights reserved©2014

Page 2: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

Retroviruses preferentially use specific host tRNAs as primers for thefirst step of reverse transcription; for example, human immunodeficiencyvirus requires tRNALys 3, while Moloney murine leukaemia virus (MLV)uses tRNAPro 1,2. Two distinct sequences in the tRNA anneal to com-plementary sequences in the retroviral U5-PBS domain to form the ini-tiation complex: the 39 end of the tRNA acceptor stem anneals to the18-nucleotide PBS sequence7, while a portion of the tRNA TYC arm base-pairs with a primer activation signal (PAS)8,9 (Fig. 1a, b). However, forprimer annealing to occur, favourable intramolecular associations involv-ing both the PBS and PAS in U5-PBS and the complementary anti-PBSand anti-PAS sequences in tRNA must first be disrupted by NC chap-erone proteins. Mechanistically, all NC proteins have thus far been thoughtto function as classical, ATP-independent chaperones, using both theirzinc-finger domain(s) and unstructured tails for this process10,11 (ExtendedData Fig. 1a). ATP-independent chaperones are known to permit RNAmolecules to access higher energy conformations and then allow refold-ing by rapidly dissociating from the RNA during the process12. Thesetransitory, low-affinity interactions generally necessitate the coating ofan RNA with many molecules of chaperone12 (Extended Data Fig. 1b). Inaddition, and in contrast, to the transient interactions of NC proteins13–15,the NC zinc fingers are also capable of sequence-specific, high-affinitybinding to RNA16. However, this mode of interaction has, until now,been thought to be used exclusively for the recognition of the genomevia interaction with theY-genome packaging signal during viral assembly(Supplementary Discussion 1). To gain insights into the mechanism ofNC-mediated primer annealing in MLV, a prototypical retrovirus, wesolved structures of both the genomic U5-PBS RNA and the tRNAPro

primer, both in the free form and in complex with MLV NC proteins,by NMR spectroscopy.

The free U5-PBS is a largely linear molecule capped by a structuredtetraloop (UCAG130) and contains one single-nucleotide bulge (A122) andtwo internal loops ((UCUGA111, UU146) and (GA98, GU156)) (Fig. 1a, c,Extended Data Figs 2, 3 and Extended Data Table 1). In the absence ofNC, the (UCUGA111, UU146) internal loop maintains a distinct, foldedconfiguration in which residue C108 of the NC binding site sequesters the59 end of the PBS sequence (11U146) via an intramolecular ribose zipperinteraction17,18 (superscript denotes the PBS position from 59 to 39) (Fig. 1cand Extended Data Fig. 3e–g). Residue U109 of the NC binding site alsobase pairs with U145, which is the first template residue read by reversetranscriptase. Continuous intramolecular base stacking interactions fromU144 to 12G147 further serve to tether the 59 end of the PBS inside the inter-nal loop. On the other side of the bulge, continuous nuclear Overhauserenhancements (NOEs) indicate the stacking of C106 with C108 and, hence,extrusion of residue U107. Importantly, residue G110 exhibits a syn gly-cosidic torsion angle and faces the major groove, making it poised forinteraction with NC (Fig. 1c). In the NC-bound structure, the NC zincfinger binds the UCUG110 sequence of the internal loop (dissociationconstant (Kd) 5 33 6 3 nM; Extended Data Fig. 4a) in a mode similarto that previously described for UCUG309 in the MLVY-genome pack-aging signal19–21 (see Supplementary Discussion 1 and 2). Notably, sinceNC-binding residues are initially involved in sequestering the first tem-plate (U145) and the first PBS (11U146) residues, the well-defined internalloop structure is mutually exclusive with NC binding. Thus, NC bindingliberates U145 and 11U146 for primer annealing and reverse transcriptioninitiation (Fig. 1e).

Whereas the NC tails are not involved in binding the Y-packagingsignal, in U5-PBS, the NC tails specifically remodel the (GA98, GU156) inter-nal loop and the UCAG130 capping tetraloop (Fig. 1e). In the absence ofNC, the six PAS residues in the lower stem, 11G99 to 16U104 (subscriptwith plus sign denotes the PAS position from 59 to 39), form base pairswith PBS residues (Fig. 1c and Extended Data Fig. 2a), and hence are notavailable for primer annealing. The preceding (GA98, GU156) internal loop,however, exists in multiple conformations; in the major form, a con-tinuous internal stacking of all residues leads to the formation of tan-dem A98–110G155 and G97–111U156 non-canonical base pairs, while inthe minor, destabilized form, the 110G155 and 111U156 PBS residues are

extra-helical (Fig. 1c and Extended Data Figs 2a, 3h–j). Similarly, the cap-ping UCAG130 tetraloop forms a YNMG-type structure22 (Extended DataFig. 3a), with the C128 base either stacked on A129 or, in a small popula-tion, extruded from the structure (Fig. 1d and Extended Data Fig. 3c).Interaction with the NC tails alters the equilibria between the two con-formations in favour of the minor, destabilized conformations (Fig. 1d)and hence leads to release of the 10th and 11th (110G155 and 111U156)PBS residues and the C128 tetraloop residue. The destabilization of thelatter is important for cooperative binding of a second NC to the tetra-loop (Extended Data Fig. 4a; see Supplementary Discussion 3). Thus,the positively charged NC tails do not globally destabilize U5-PBS butinstead specifically target residues inherently predisposed for destabili-zation. Furthermore, because PAS residues immediately follow the desta-bilized internal loop (Fig. 1a), the NC tails also specifically perturb residues11G99, 12G100, and 13G101 (Extended Data Fig. 3k). Interestingly, bothNC tails (Ala 1–Arg 17 and Arg 4–Leu 56) remain disordered, indicatingthat destabilization must occur via transient interactions that are never-theless residue-specific owing to the inherent accessibility of the parti-cular RNA residues and the orientation constraints imposed by the zincfinger binding to UCUG110 (Fig. 1e and Extended Data Fig. 3l). In liveviruses, a G110U mutant designed to liberate the U145 and 11U146 resi-dues and a deletion mutant designed to sequester them exhibited only

c

A66

C67

Anticodon

Acceptor stem

3′

G9

U7

Tyr 28

NC-1

G6

U7

G9

U8

Trp 35

Tyr 28

G45

G46

U22

U13

G10

U25

G45

A

GG

U

GA

UGU

37Anti-PAS46

AnticodonD-loop

9

18

Variable loop

55

AUC

G

U

G

G

Site T2

Site T3

8

23

The elbow

UG

15′

3′

Anti-PBS

677

TΨ C-loop

aAcceptor stem

6

66

G

AC G6

U7

G9

U8C67

A66

b

A21

G23

C12

A14

Site T1

Figure 2 | Structure and NC interaction of site T1 in tRNAPro. a, Cartoonrepresentation of the L-shaped tRNAPro structure. Dashed lines show thelong-range elbow and base triple interactions with the D-stem loop. Solidlines denote the covalent linkages. b, Three-dimensional model of the freetRNAPro showing site T1 residues GUUG9 (red) involved in intramolecularinteractions. c, Structure of the NC–tRNAPro complex. The aromatic NCresidues Tyr 28 and Trp 35 are shown in black. The zinc finger interaction withthe GUUG9 binding site is shown in red. The tails are excluded for simplicitysince they form random coils and do not specifically interact with the tRNA(Extended Data Fig. 9f). Top inset shows a close-up of the zinc fingerinteraction, with G9 inserted into the hydrophobic pocket of the NC proteindue to stacking interactions of Trp 35. Bottom inset shows the interaction of thevariable loop with the core of the tRNA molecule that is maintained afterinteraction with NC-1.

RESEARCH LETTER

2 | N A T U R E | V O L 0 0 0 | 0 0 M O N T H 2 0 1 4

Macmillan Publishers Limited. All rights reserved©2014

Page 3: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

56% and 7%, respectively, of wild-type MLV infectivity (Fig. 1f). Whilethe severe infectivity defect of deletion mutant virions confirms the impor-tance of high-affinity NC binding in releasing the 59 end of the PBS, thepartial defect observed for G110U virions indicates that tail-mediatedinteractions are also required for optimal function.

In tRNAPro, NC binding occurs first at GUUG9 (site T1; 4 6 2 nM)followed by the anticodon loop AGGG37 (site T2; 13 6 3 nM) and then theD-stem loop UAUG23 (site T3; 834 6 343 nM) (Extended Data Fig. 4b).Importantly, titration of MLV NC into the human immunodeficiencyvirus (HIV) primer, tRNALys 3, did not lead to NMR chemical shift per-turbations, thus confirming the specificity of MLV NC for tRNAPro (Ex-tended Data Fig. 1f, g; for assignment strategy of tRNAPro see ExtendedData Figs 5–8 and Supplementary Discussion 4). The structure of thefirst NC bound to tRNAPro shows the zinc finger making extensive con-tacts with the GUUG9 sequence that links the acceptor stem with theD-stem loop, with G9 stacking within the zinc finger pocket (Fig. 2,Extended Data Table 1 and Extended Data Fig. 9a). Importantly, beforeNC binding, all four GUUG9 residues are involved in intramolecularinteractions: G6 and U7 are part of the acceptor stem, and U8 and G9

are involved in core tertiary interactions (Fig. 2a, b and Extended Data

Figs 6, 8a). Similar to U5-PBS, because these contacts are mutually exclu-sive with NC interactions, NC binding leads to major RNA remodellingevents. First, since residues G6 and U7 are initially base paired with anti-PBS residues 210C67 and 211A66 (superscript with minus sign denotesthe anti-PBS position from 39 to 59), respectively, NC binding releasesthese sequestered anti-PBS residues (Fig. 2c and Extended Data Fig. 9b, c).Second, because U8 and G9 are involved in core tertiary interactions viatriple base formation with the D-stem loop (U8:A14–A21, G9:C12–G23)(Fig. 2a, b and Extended Data Fig. 8a), NC binding disrupts these coretertiary interactions (Extended Data Fig. 9c). As a result of this rearrange-ment, D-stem residues A21 and G23, which are part of the UAUG23 siteT3 sequence, are made partially available for the third NC binding event.Globally, while the helical arrangement between the TYC-stem and theacceptor stem is lost, the helical stacking between the D-stem and anti-codon stem is preserved (Extended Data Fig. 9d, e), as is the variable loopinteraction and the TYC-loop:D-loop interface (the ‘elbow’) (Fig. 2a, cand Fig. 3a).

In comparison with sites T1 and T3 (see later), there is a marked mech-anistic difference in the remodelling activity of the NC that binds thesecond site, T2—this NC uses its tails to achieve residue-specific desta-bilization. After the first NC binding event, the second NC accesses theresidual elbow structure by anchoring its zinc finger to the distal AGGG37

sequence in the anticodon loop, with the G37 base stacking inside the zincfinger (Fig. 3b and Extended Data Fig. 9g). While the elbow interactionis maintained, the NC tails specifically perturb D-loop and TYC-loopresidues G16 and A59, respectively (Fig. 3c, d), which are in close proximityto each other (Fig. 3b). Prior to NC binding, these residues are extrudedout of their respective loops and are hence available for NC interaction.Thus, as in the interaction with U5-PBS, the NC tails do not cause globaldestabilization of tRNAPro but instead target specific, already accessibleresidues for remodelling.

We also structurally characterized the third NC binding event usingthe tRNAPro–T1MT2M construct (see Supplementary Discussion 5). Ourstructures show that NC zinc finger binding to UAUG23 in the D-stemdisrupts the entire helix and, because the D-stem architecture is crucialfor the D-loop–TYC interaction, eliminates the residual elbow tertiarystructure (Fig. 3a, e and Extended Data Fig. 9h, i). Consequently, theinteractions between D-loop residues G18 and G19 and TYC-loop anti-PAS residues 21C56 and 22U55 are lost, leading to the release of these

G23

G16

A66

C67

NC-1

AnticodonD-loop

3′

NC-1

NC-2

TΨC-loopC56

A34

U55

A66

C67

Tyr 28

G36

G37

Trp 35

G16

A59G16

U47

TΨC-loop

U47

7.87.9

3735

36

55

295.7

5.8

7.87.9

3735

36

55

1H (p.p.m.)

29

7.87.9

55

29

60 60 60

60

4747 47

47

55 55 55

55

8

55

1:0.2

1:2.5 1:3.0

1:1

16

2336

37 35

*

*

*

1H (p.p.m.) 1H (p.p.m.)

1:0 1:21:1

6.0 5.8 5.6 6.0 5.8 5.6 6.0 5.8 5.6

6.0 5.8 5.6 6.0 5.8 5.6

106

108

1H (p

.p.m

.)13C

(p

.p.m

.)

106

108

16

1:1

1:2

1:1.4

1:1.6

1:2.4

7.88.0 7.6

125

127

127

127

127

127

16

16

1:1 1:1.9

59

57

5757

59

1:0 1:1.3

5.65.6

8.2

8.0

8.0

1H (p

.p.m

.)

a

b

d

e

f

T1m,T2m,

1H (p.p.m.)

c1:0

13C

(p

.p.m

.)

Anticodon

NC-2

NC-3

C56

U55

U20U20

Trp 35Trp 35

U20

Trp 35

G35G35G35

Figure 3 | Structure and NC interaction with sites T2 and T3 in tRNAPro.a, A portion of the two-dimensional nuclear Overhauser effect spectroscopy(NOESY) spectrum with H8/H19 correlations for tRNAPro showing theperturbation of anticodon residues only after titrations above 1.0 equivalent ofNC, thus confirming the sequential binding mode. b, 1H–13C two-dimensionalU-labelled heteronuclear multiple quantum coherence (HMQC) spectrashowing that whereas the U8 resonance perturbation occurs upon the additionof one equivalent of NC, the perturbation of D-loop–TYC signals occur onlyupon the third NC binding, thus indicating that the elbow contacts aremaintained during the first two binding events. c, 1H–13C two-dimensionalHMQC spectra showing selective perturbation of the extruded G16 residue inthe D-loop as evidenced by a marked chemical shift change is shown byasterisks. d, Regions of two-dimensional NOESY for NC complexes withtRNAPro and tRNAPro–T1MT2M. Top and middle panels show that theprotected A57 in the TYC loop is not perturbed, but the extruded A59 is affectedupon the second NC binding. In the T1MT2M mutant (bottom panel), NCbinding to the third site disrupts the TYC–D-loop interaction, resulting in achemical shift change for residue A57. Thus, the lack of A57 perturbation inthe native tRNAPro also demonstrates that the elbow region is maintained afterthe second NC binding. e, Structure of NC bound to tRNAPro sites T1 andT2 via the zinc fingers. The structures show that the NC-2 protein tails canextend to contact the D-loop–TYC-loop elbow region. Inset shows theproximity of the extruded G16 and A59 residues. For clarity, only the tails of theNC-2 protein are shown. f, Model of three NCs bound to tRNAPro (based on thestructure of NC bound to T1MT2M tRNAPro; see Extended Data Fig. 9i),showing the loss of the elbow tertiary interactions upon binding of the thirdNC, which leads to the release of anti-PAS sequences. For clarity only thezinc-finger portion of the NC proteins are shown.

LETTER RESEARCH

0 0 M O N T H 2 0 1 4 | V O L 0 0 0 | N A T U R E | 3

Macmillan Publishers Limited. All rights reserved©2014

Page 4: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

sequestered anti-PAS residues (subscript with minus sign denotes theanti-PAS position from 39 to 59) (Fig. 3d). NC binding to site T3 thusserves to dismantle the residual tRNA tertiary structure before primerannealing. Importantly, destabilization of the D- and TYC-loop residuesby the anticodon site T2 NC tails is maintained even after the elbow con-tacts are dismantled by the third NC binding event (Fig. 3c), prohibitingthe freed TYC-loop from forming an intrinsically stable structure23 (seeExtended Data Fig. 6) and ensuring that the released anti-PAS residueswill remain accessible during primer annealing.

Our data demonstrate how MLV NC ‘captures’ specific portions ofboth the U5-PBS and tRNAPro through high-affinity interactions withresidues that are normally engaged in intramolecular stabilizing inter-actions and results in the subsequent ‘release’ of these sequestered residues,thereby reducing the energetic barrier for primer–template complex for-mation (Fig. 4). The combinations of liberated and pre-exposed residueswithin tRNAPro and U5-PBS are exactly complementary and thereforepoised for intermolecular base pairing. Furthermore, the complemen-tarity of liberated sequences to regions that are already exposed in thecounterpart RNA allows remodelling to occur with a limited number ofNC molecules (Fig. 4). Indeed, the presence of four NC molecules is suf-ficient for formation of a functional U5-PBS–tRNAPro complex (Extended

Data Fig. 4c, d). Importantly, because the NC binding sites are perfectlypositioned in close proximity to, but not overlapping with, the RNA-annealing sequences, subsequent dissociation of NC from the annealedcomplex is not required24. In fact, the presence of NC has been shownto be important for the elongation step of reverse transcription25,26.

In addition to defining previously undiscovered roles for high-affinityNC binding events in the retroviral lifecycle, our study has implications forthe location, timing and specificity of primer annealing (see Supplemen-tary Discussion 6). Like MLV NC, some other RNA chaperones andremodellers bind with high affinity to their substrates; however, they typ-ically require the input of additional energy for subsequent dissociation27.The MLV NC-mediated capture-and-release mechanism described hereis distinct from mechanisms used by other known ATP-independentRNA chaperones28,29: During the capture-and-release RNA remodel-ling, NC uses high-affinity interactions to bind a limited number of siteswith high specificity. Furthermore, unlike typical chaperones, which causeglobal destabilization to allow access to higher energy conformations, themechanism of NC-mediated remodelling in primer annealing involvesthe formation of stable, lower energy complexes with RNA that cause stra-tegic local destabilization of the regions important for annealing. Con-sistent with this, the thermodynamic analyses of all U5-PBS–NC andtRNAPro–NC interactions show high binding affinities with entropicallydriven profiles (see Supplementary Discussion 7). This entropy-driven,capture-and-release remodelling thus represents the first example, toour knowledge, of a new mechanism by which RNA chaperones can spe-cifically select their specific targets from a sea of cellular RNAs.

Online Content Methods, along with any additional Extended Data display itemsandSourceData, are available in the online version of the paper; references uniqueto these sections appear only in the online paper.

Received 19 February; accepted 24 July 2014.

Published online 7 September 2014.

1. Harada, F., Peters, G. G. & Dahlberg, J. E. The primer tRNA for Moloney murineleukemia virus DNA synthesis. Nucleotide sequence and aminoacylation oftRNAPro. J. Biol. Chem. 254, 10979–10985 (1979).

2. Wain-Hobson, S., Sonigo, P., Danos, O., Cole, S. & Alizon, M. Nucleotide sequence ofthe AIDS virus, LAV. Cell 40, 9–17 (1985).

3. Mougel, M. et al. Conformational analysis of the 59 leader and the gag initiation siteof Mo-MuLV RNA and allosteric transitions induced by dimerization. Nucleic AcidsRes. 21, 4677–4684 (1993).

4. Paillart, J. C.et al.First snapshots of theHIV-1 RNA structure in infectedcells and invirions. J. Biol. Chem. 279, 48397–48403 (2004).

5. Wilkinson, K. A. et al. High-throughput SHAPE analysis reveals structures in HIV-1genomicRNAstrongly conservedacrossdistinctbiological states.PLoSBiol.6,e96(2008).

6. Levin, J. G., Mitra, M., Mascarenhas, A. & Musier-Forsyth, K. Role of HIV-1nucleocapsid protein in HIV-1 reverse transcription. RNA Biol. 7, 754–774 (2010).

7. Cordell, B., Stavnezer, E., Friedrich, R., Bishop, J. M. & Goodman, H. M. Nucleotidesequence that binds primer for DNAsynthesis to the avian sarcoma virus genome.J. Virol. 19, 548–558 (1976).

8. Beerens, N., Groot, F. & Berkhout, B. Initiation of HIV-1 reverse transcription isregulated by a primer activation signal. J. Biol. Chem. 276, 31247–31256 (2001).

9. Beerens, N. & Berkhout, B. Switching the in vitro tRNA usage of HIV-1 bysimultaneous adaptation of the PBS and PAS. RNA 8, 357–369 (2002).

10. Thomas, J. A. & Gorelick, R. J. Nucleocapsid protein function in early infectionprocesses. Virus Res. 134, 39–63 (2008).

11. Rein, A. Nucleic acid chaperone activity of retroviral Gag proteins. RNA Biol. 7,700–705 (2010).

12. Woodson, S. A. Taming free energy landscapes with RNA chaperones. RNA Biol. 7,677–686 (2010).

13. De Rocquigny, H. et al. Viral RNA annealing activities of human immunodeficiencyvirus type 1 nucleocapsid protein require only peptide domains outside the zincfingers. Proc. Natl Acad. Sci. USA 89, 6472–6476 (1992).

14. Hargittai, M. R., Mangla, A. T., Gorelick, R. J. & Musier-Forsyth, K. HIV-1nucleocapsid protein zinc finger structures induce tRNALys,3 structural changesbut are not critical for primer/template annealing. J. Mol. Biol. 312, 985–997(2001).

15. Prats, A. C. et al. Viral RNA annealing activities of the nucleocapsid protein ofMoloney murine leukemia virus are zinc independent. Nucleic Acids Res. 19,3533–3541 (1991).

16. Rein, A., Henderson, L. E. & Levin, J. G. Nucleic-acid-chaperone activity of retroviralnucleocapsid proteins: significance for viral replication. Trends Biochem. Sci. 23,297–301 (1998).

17. Tamura, M. & Holbrook, S. R. Sequence and structural conservation in RNA ribosezippers. J. Mol. Biol. 320, 455–474 (2002).

PAS/anti-PAS Site 1

PBS/anti-PBS

Site 2

Site 3

N

N

N

N

NH2

O

OH

O

H

H

H

H

PO

O-

O

-O

tRNAPro–1

Anticodonloop

D-loopT C-loop

Acceptor stem

NH

N

N

O

NH2

NO

HO

H

H

H

H

O

P

O

-O

HO

NH

O

O

N

O

HO

O

H

H

H

H

P

O

-O

O

O -

U5-PBS

+10

+11

N

N

N

N

NH2

O

OH

O

H

H

H

H

PO

O-

O

-O–1

NN

H2

O

N

O

OH

O

H

H

H

HP

OO

-

OH

N

N NN

NH

2

O

OH

O

H

H

H

H

P O

O

O-

–10

–11

NCNC

T1 binding liberates

sequestered anti-PBScomplementary

–1

–10

–11

NH

O

O

N

O

HO

O

H

H

H

H

P

O

-O

NH

O

O

N

OHO

O

H

H

H

HP

O

-OO

O -

N

NN

N

NH2

O

OHO

H

H

H

H

PO

O-

O

-O

OP

O-

OO

-

H

H

O

OH

H

H

N

NH

2

O N

O

OH

OH

H

H

H

N

N

N

N

NH

2

P

OO

-

T3 binding liberates

sequestered anti-PAS

P

O

-OO

–1–2

O

NH

N

N

O

NH2

NO

HO

H

H

H

H

O

P

O

-O

HO

NH

O

O

N

O

HO

O

H

H

H

H

P

O

-O

O

O -

N

NN

N

NH

2

O

OH

O

H

H

H

H

P

OO

-

OH

- O

NH

O

ONO

HOO

HH

H

H

PO

-O

O

O-

+1

+10

+11

+3

+1+2

V2 binding for efficient RT

elongation?

RT initiation

T2 NC binding continues to destabilizeD-loop and T C-loop

complementary

O

NH

N

N

O

NH2

NO

HO

H

H

H

H

O

P

O

-O

HO

NH

O

O

N

O

HO

O

H

H

H

H

P

O

-O

O

O -

N

NN

N

NH

2

O

OH

O

H

H

H

H

P

OO

-

OH

- O

NH

O

ONO

HOO

HH

H

H

PO

-O

O

O-

NCNC

V1 binding liberates

sequestered

+1

+10

+11

+3

+1+2

PBS and PAS

T2 NC binding destabilizes

tertiary structure

N

N

N

N

NH2

O

OH

O

H

H

H

H

PO

O-

O

-O

-1

NN

H2

O

N

O

OH

O

H

H

H

H

P

OO

-

OH

N

N NN

NH

2

O

OH

O

H

H

H

H

P O

O

O-

–10

–11

NCNC

NCNC

NCNC

Figure 4 | Capture-and-release mechanism for NC-mediated remodelling ofMLV U5-PBS and tRNAPro. A mechanistic model for NC zinc-finger- andtail-mediated remodelling via a step-wise, residue-specific release and residue-specific destabilization of sequestered PBS/PAS and anti-PBS/anti-PASsequences in U5-PBS and tRNAPro, respectively. Upon NC binding to U5-PBS,the PBS residues 11U146, 110G155 and 111U156 are released, and the PASresidues 11G99, 12G100 and 13G101 are destabilized. Reciprocally, in tRNAPro,NC binding to site T1 frees anti-PBS residues 210C67 and 211A66 (21A76 isalready available in free tRNAPro), and binding to site T2 and site T3 renders theanti-PAS residues 21C56, 22U55 and 23U54 accessible for annealing. NCbinding to the second site in U5-PBS may be important for primer extension byreverse transcriptase (RT) since stem loops with stable tetraloops have beenshown to stall reverse transcription.

RESEARCH LETTER

4 | N A T U R E | V O L 0 0 0 | 0 0 M O N T H 2 0 1 4

Macmillan Publishers Limited. All rights reserved©2014

Page 5: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

18. Nonin-Lecomte, S., Felden, B. & Dardel, F. NMR structure of the Aquifex aeolicustmRNA pseudoknot PK1: new insights into the recoding event of the ribosomaltrans-translation. Nucleic Acids Res. 34, 1847–1853 (2006).

19. D’Souza, V. & Summers, M. F. Structural basis for packaging the dimeric genomeof Moloney murine leukaemia virus. Nature 431, 586–590 (2004).

20. Dey, A., York, D., Smalls-Mantey, A. & Summers, M. F. Composition and sequence-dependent binding of RNA to the nucleocapsid protein of Moloney murineleukemia virus. Biochemistry 44, 3735–3744 (2005).

21. D’Souza, V. et al. Identification of a high affinity nucleocapsid protein bindingelement within the Moloney murine leukemia virus y-RNA packaging signal:implications for genome recognition. J. Mol. Biol. 314, 217–232 (2001).

22. Theimer, C. A., Finger, L. D. & Feigon, J. YNMG tetraloop formation by adyskeratosis congenita mutation in human telomerase RNA. RNA 9, 1446–1455(2003).

23. de Smit, M. H. et al. Structural variation and functional importance of a D-loop–T-loop interaction in valine-accepting tRNA-like structures of plant viral RNAs.Nucleic Acids Res. 30, 4232–4240 (2002).

24. Martin-Tumasz, S., Richie, A. C., Clos, L. J. II, Brow, D. A. & Butcher, S. E. A noveloccluded RNA recognition motif in Prp24 unwinds the U6 RNA internal stem loop.Nucleic Acids Res. 39, 7837–7847 (2011).

25. Gonsky, J., Bacharach, E. & Goff, S. P. Identification of residues of the Moloneymurine leukemia virus nucleocapsid critical for viral DNA synthesis in vivo. J. Virol.75, 2616–2626 (2001).

26. Liu, S., Harada, B. T., Miller, J. T., Le Grice, S. F. & Zhuang, X. Initiation complexdynamics direct the transitions between distinct phases of early HIV reversetranscription. Nature Struct. Mol. Biol. 17, 1453–1460 (2010).

27. Fedorova, O., Solem, A. & Pyle, A. M. Protein-facilitated folding of group II intronribozymes. J. Mol. Biol. 397, 799–813 (2010).

28. Semrad, K. Proteins with RNAchaperone activity: a worldofdiverse proteins with acommon task-impediment of RNA misfolding. Biochem. Res. Int. 2011, 532908(2011).

29. Dethoff, E. A., Chugh, J., Mustoe, A. M. & Al-Hashimi, H. M. Functional complexityand regulation through RNA dynamics. Nature 482, 322–330 (2012).

Supplementary Information is available in the online version of the paper.

Acknowledgements We wish to thank the laboratory of S. Goff for the pNCS plasmid,C. Salguero for the artistic rendition of the model, as well as M. Summers and R. Gaudetfor critical reading of the manuscript and the Merck Research Fellowship and DamonRunyon Cancer Research scholarship for funding.

Author Contributions V.M.D’S., S.B.M. and F.Z.Y. conceived and designed theexperiments. J.A.L. and S.B.M. performed and analysed the isothermal titrationcalorimetry (ITC) experiments, V.M.D’S., S.B.M. and F.Z.Y. did the structural analysis,and B.W. and S.B.M. performed and analysed the virological experiments. S.B.M., F.Z.Y.and V.M.D’S. wrote the manuscript.

Author Information Coordinates and restraints for the final ensembles of the MLVU5-PBS and tRNAPro structures have been deposited in the Protein Data Bank underaccession numbers 2MQT, 2MQV, 2MS1 and 2MS0. Reprints and permissionsinformation is available at www.nature.com/reprints. The authors declare nocompeting financial interests. Readers are welcome to comment on the online versionof the paper. Correspondence and requests for materials should be addressed toV.M.D’S. ([email protected]).

LETTER RESEARCH

0 0 M O N T H 2 0 1 4 | V O L 0 0 0 | N A T U R E | 5

Macmillan Publishers Limited. All rights reserved©2014

Page 6: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

METHODSSample preparation. The MLV NC protein was prepared as described previously21.The HIV NC protein was made in an analogous manner to that of the MLV NC pro-tein; the NC sequence was amplified from pNL4-3 plasmid30, cloned into pGEX-6p1(GE Healthcare) vector, and the purified fusion protein was cleaved from glutathi-one S-transferase (GST) through the use of PreScission Protease. NMR experimentswere used to confirm the correct folding of the proteins. The HIV and MLV U5-PBSRNA samples were also made using methods previously described21. tRNAPro andtRNALys 3 DNA templates were constructed through annealing and ligation of oli-gonucleotides designed to contain a T7 promoter and two 29-O-methoxy modifiednucleotides at the 59 end of the template strand to maintain 39 end homogeneity31.For the U5-PBS construct used for NMR studies, a single base-pair swap (G96C:C157G) was made to ameliorate the spectral overlap problem that arose owing tofive consecutive guanosine residues in a row. This swap does not change the sec-ondary or tertiary structure of the U5-PBS domain (Extended Data Fig. 2b). TheG110U mutation was structurally characterized to ensure that the mutation doesnot give rise to an alternate structure that sequesters the 59 end of the PBS (data notshown).Isothermal titration calorimetry. Prior to all ITC experiments, NC proteins andRNAs were exchanged into ITC buffer (25 mM NaCl, 1 mM MgCl2, 0.1 mM ZnCl2,10 mM Tris, pH 7.0). For each ITC experiment, reaction heats (mCal s21) were mea-sured for 1.5–2ml titrations of 70–90mM NC into 5mM RNA at 30 uC using an iTC200machine (MicroCal). Titration curves were analysed using ORIGIN (OriginLab).NMR data acquisition, resonance assignment and structure calculations. ForNMR experiments, RNA samples were resuspended in NMR buffer (10 mM Tris-HCl, pH 7.5, and 10 mM NaCl). NMR data for tRNAPro were collected with andwithout the presence of 1 mM MgCl2 and 100–150 mM NaCl. For NC complexstudies, the samples were prepared in buffers containing 1 mM MgCl2. NMR datawere acquired using a Bruker 700 MHz spectrometer equipped with a cryoprobe.Spectra were recorded at 35 uC, with the exception of data for the imino region,which were also collected at 10 uC. Non-exchangeable assignments were made usingtwo-dimensional NOESY, two-dimensional HMQC and three-dimensional HMQC-NOESY experiments using both unlabelled and nucleotide-specific labelled (G–,U–, A–, and C–15N,13C-labelled and C/U/A-deuterated) or protonated samples.Residual dipolar coupling (RDC) data for U5-PBS were collected using Pf1 phage(12 mg ml21, ASLA Biotech) as has been described32.

Initial structures were calculated as described26 using manually assigned restraintsin CYANA28. In contrast to all long range triple bases in tRNAPro, the two base pairsbetween the D-loop and TYC-loop were not observable by NMR: these hydrogenbonds were modelled on the basis of our knowledge of tRNA structures. The crystalstructure of tRNAPhe was used as a guide for soft phosphate–phosphate distancerestraints within a particular stem33. For U5-PBS–NC and tRNAPro–NC complexes,the ten best CYANA models were then used for final structure calculations in AMBERsimilar to what has been described34. Specifically, the refinement included 50,000steps, with temperature increasing from 0 K to 500 K over the first 12,500 steps,remaining at 500 K over the next 32,500 steps, and then decreasing to 0 K over thenext 5,000 steps. These calculations incorporated all upper limit restraints used inCYANA but not the angle restraints. The individual structures generated were thenused for tensor fitting, and the above structure calculation process was repeatedwith the RDC restraints (for U5-PBS) along with a final minimization that included8,000 steps. In the final minimization step for the U5-PBS structure, loose hydrogenbond restraints used for the ribose zippers were removed. Several rounds of in vacuoAMBER calculations were done until the distance violations were less than 0.5 A.Molecular images were generated with PyMOL (http://www.pymol.org).Mutagenesis of pNCS vector for MLV virus production. The pNCS plasmidencoding the MLV genome was a gift from the laboratory of S. Goff. The presenceof the repeated U5 sequence at both ends of the proviral DNA required use of theNdeI restriction endonuclease to cleave the pNCS plasmid into two pieces, a 3 kbfragment and a 9 kb fragment. This allowed for selective introduction of the muta-tions only into the U5 sequence in the 59 long terminal repeat (LTR). The 9 kb frag-ment was circularized by self-ligation and used as a template to introduce the G110U,deletion mutation orYC331G mutation using a QuickChange II XL Site-DirectedMutagenesis Kit (Agilent Technologies). Mutant plasmids were then digested with

NdeI and re-ligated with the 3 kb fragment to form the complete mutant pNCS.DNA sequencing confirmed both the orientation of the insert and the presence ofthe desired mutation.MLV virion infectivity and genome packaging assay. 293T and Rat2 cells weregrown at 37 uC and 5% CO2 in DMEM supplemented with 10% fetal bovine serum(FBS) and 5% penicillin-streptomycin. The 293T cells were used for transfection-mediated viral production, while Rat2 cells were used for infectivity assays. Trans-fection of 293T cells with pNCS was performed at a cell confluency of ,80% usingFugene-6 (Roche). Virion-containing supernatant was harvested 48 h after transfec-tion and filtered through a 0.45mm filter. Viruses were quantified with the use ofexogenous Luciferase RNA template (Promega) provided in excess during qPCRreactions. Purified, exogenous MLV reverse transcriptase (RT) (Promega) was usedto generate a linear standard curve (R2 . 0.98) for the calculation of RT activity. SYBRGreen-based qPCR was carried out as described35, and dissociation curve measure-ments were performed at the conclusion of each run to confirm primer specificity.Equal numbers of wild-type or mutant virions were used to infect 293T cells, as hasbeen described36,37. As a negative control, separate aliquots of wild-type virions wereincubated at 80 uC for 15 min to render the virus non-infectious. The RT-based virusquantification assay was used to quantify the viral yield resulting from infection.

For the genome packaging assay, total virion RNA was extracted from viral super-natant with the QIAamp Viral RNA Mini Kit (Qiagen) using 50–150ml of viral super-natant per RNA extraction column. Three independent extractions were pooled andviral genomic RNA was quantified by qPCR using a TaqMan probe-based assay sys-tem as has been described38 with an AABI 7900 sequence detection apparatus (AppliedBiosystems). Negative control reactions lacking viral RNA template yielded negli-gible signals, and standard curves were generated using threshold cycle (Ct) valuesfrom a range of different input viral DNA (pNCS) concentrations. A two-sample,one-tailed unpaired t-test was performed to compare mean values of RT activityfor the using the statistical package Stata (StataCorp), assuming unequal variancesand using a 99% confidence interval.

For heat-annealing, 250 pmol of MLV U5-PBS RNA was incubated with 250 pmoltRNAPro at 50 uC for 5 min followed by 85 uC for 15 min in 50 mM Tris-HCl (pH 7.2),50 mM KCl and 5 mM MgCl2. For NC-mediated annealing, the same amounts ofRNA were incubated with five molar equivalents of NC at 37 uC for 16 h. After anneal-ing, 100 U RNase inhibitor and 750 U MLV reverse transcriptase were added to eachsample, along with Cy3-labelled dideoxy CTP (ddCTP) to a final concentration of0.4 nM. Following ethanol precipitation, the RNA was digested by RiboshredderRNase and resolved on a denaturing gel. DNA terminated with Cy3-ddCTP wasvisualized with an imager.

30. Adachi, A. et al. Production of acquired immunodeficiency syndrome-associatedretrovirus in human and nonhuman cells transfected with an infectiousmolecular clone. J. Virol. 59, 284–291 (1986).

31. Miller, J. T., Khvorova, A., Scaringe, S. A. & Le Grice, S. F. Synthetic tRNALys,3 as thereplication primer for the HIV-1HXB2 and HIV-1Mal genomes. Nucleic Acids Res.32, 4687–4695 (2004).

32. Hansen, M. R., Mueller, L. & Pardi, A. Tunable alignment of macromolecules byfilamentous phage yields dipolar coupling interactions. Nature Struct. Biol. 5,1065–1074 (1998).

33. D’Souza, V., Dey, A., Habib, D. & Summers, M. F. NMR structure of the101-nucleotide core encapsidation signal of the Moloney murine leukemia virus.J. Mol. Biol. 337, 427–442 (2004).

34. Spriggs, S., Garyu, L., Connor, R. & Summers, M. F. Potential intra- andintermolecular interactions involving the unique-59 region of the HIV-1 59-UTR.Biochemistry 47, 13064–13073 (2008).

35. Pizzato, M. et al. A one-step SYBR Green I-based product-enhanced reversetranscriptase assay for the quantitation of retroviruses in cell culturesupernatants. J. Virol. Methods 156, 1–7 (2009).

36. Auerbach, M. R., Shu, C., Kaplan, A. & Singh, I. R. Functional characterization of aportion of the Moloney murine leukemia virus gag gene by genetic footprinting.Proc. Natl Acad. Sci. USA 100, 11678–11683 (2003).

37. Lim, D., Orlova, M. & Goff, S. P. Mutations of the RNase H C helix of the Moloneymurine leukemia virus reverse transcriptase reveal defects in polypurine tractrecognition. J. Virol. 76, 8360–8373 (2002).

38. Onafuwa-Nuga, A. A., King, S. R. & Telesnitsky, A. Nonrandom packagingof host RNAs in moloney murine leukemia virus. J. Virol. 79, 13528–13537(2005).

RESEARCH LETTER

Macmillan Publishers Limited. All rights reserved©2014

Page 7: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

LETTER RESEARCH

Macmillan Publishers Limited. All rights reserved©2014

Page 8: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

Extended Data Figure 1 | Specific, higher-affinity interactions of MLV NCwith monomeric MLV U5-PBS and tRNAPro and its implications. a, MLVNC sequence depicting the zinc finger moiety with amino- and carboxy-terminal tails. b, Cartoon representation displaying the general mechanismof ATP-independent chaperones. c, Cartoon representation of the E- andB-forms of U5-PBS genomic RNA, including the secondary structure of theY-genome packaging signal, with the high-affinity sites coloured in red(see Supplementary Discussion 1). d, ITC data for MLV NC binding to nativeU5-PBS-B (black) and site 1 mutant (V1M, red) forms. Top, raw ITC datafor 1.5ml titrations of 80mM NC into 5mM RNA at 30 uC. Bottom, datafollowing peak integration, with continuous black line representing the fit fora one-site binding model. Elimination of site V1 in the U5-PBS B-formcompletely abolishes the binding. e, f, ITC data showing weak binding ofHIV NC to HIV U5-PBS RNA and tRNALys 3 primer. g, Portions oftwo-dimensional NOESY spectra collected in D2O displaying an overlay of free

tRNALys 3 primer (black) and 0.5 equivalents of MLV NC (red). Lack of anychemical shift perturbation indicates the absence of any interactions betweenthe two molecules. h, Models for primer annealing in MLV and HIV-1: in MLV(left), high-affinity binding of the NC domain (red) of Gag to the U5-PBSregion of the viral genome and to the tRNAPro primer promotes tRNAannealing in the cytosol of the host cell before virion budding. Furthersupporting this model is the evidence that MLV virions are only slightlyenriched for the tRNAPro primer (see Supplementary Discussion 1). In HIV-1(right), NC does not bind with high affinity to the U5-PBS domain or to theprimer tRNALys 3. Furthermore, the tRNALys 3 primer is highly enriched inHIV-1 virions (see Supplementary Discussion 1), which leads to primerannealing after virion budding, mediated by weak, non-specific interactions ofHIV-1 NC with the viral and primer RNAs. For the sake of simplicity, only oneof the two packaged retroviral genome copies is shown.

RESEARCH LETTER

Macmillan Publishers Limited. All rights reserved©2014

Page 9: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

(b)

a

(b)

G94.H1

G95.H1

G94.H1

G99

.H1

A98.H62

U156.H3

G100.H1

G155.H1

U153.H3

G97

.H1

U15

6.H

3

A98.H61

G-UG-CA-GG-U

100

99

98

97

153

154

155

156

U5PBS

11.0 10.0 9.0

9.0

10.0

11.0

12.0

H, p.p.m

H, p

.p.m

PBS

PAS

site-V1

site-V2

bb cU5-PBS C96:G157U5-PBS G96C:C157G

A98

.H62

G99

.H1

G97

.H1

G97.H1

U15

6.H

3

H, p.p.m

H, p.p.m

H, p

.p.m

H, p

.p.m

N ,

p.p.

m15

LETTER RESEARCH

Macmillan Publishers Limited. All rights reserved©2014

Page 10: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

Extended Data Figure 2 | Structural characterization of U5-PBS. a, Top,portion of the 1H–1H two-dimensional NOESY spectrum showing imino-to-imino NOEs for the U5-PBS RNA. Data were collected at 10 uC in 10 mM NaCland 10 mM Tris (pH 5.0). Imino-to-imino connectivities for the upper stemare shown in orange, while those for the lower stem are shown in blue.Inset, secondary structure of the U5-PBS construct used for structural andbiochemical studies. Grey residues indicate non-native positions: theG96C:C157G base pair represents a base-pair swap alteration that was madeto aid in unambiguous assignments of the otherwise five consecutive Gs andthe terminal G–C base pair was added for the purposes of transcriptionalefficiency and for SmaI digestion of the DNA template. Bottom, portions of the1H–15N two-dimensional HSQC spectra for 15N, 13C-labelled U5-PBS. Theimino resonances for U- and G-labelled samples are shown, as are aminoresonances for the A-labelled sample. The A98 amino resonances had unusuallydownfield chemical shifts due to interaction with the G155 inside the helix.Formation of the stacking interactions of G155 is shown on the right with azoom-in inset. The stacking is also confirmed by a G155 to U156 walk in the D2O

two-dimensional NOESY spectrum (data not shown). b, Portion of the 1H–1Htwo-dimensional NOESY imino spectrum overlay for U5-PBS constructs withand without the G96C:C157G base-pair swap. No change in the imino-to-imino connectivities were observed for the RNA except at the expectedmutation site (shown in boxed region). In fact, the GA98, GU156 bulgeimmediately above the G–C swap has the exact chemical shifts, line widthsand so on in both constructs, demonstrating that they have similar secondarystructures. Importantly, similar binding affinities with the NC protein wereobtained for both constructs. c, U5-PBS NMR ensemble alignments: theensemble of the lowest-energy AMBER structures is shown. They are aligned bythe top portion (left) or bottom portion (right) of the molecule. NC bindingsites V1 (UCUG110) and V2 (UCAG130) are shown in red and orange,respectively. Left, alignment of residues 112–125 and 132–144, yielding a rootmean squared deviation (r.m.s.d.) value of 0.6 6 0.2 A. Right, alignment ofresidues 94–106 and 147–159, excluding G155, which appears from NOE datato exhibit multiple conformations. The resulting r.m.s.d. for alignment of thebottom residues is 1.2 6 0.5 A.

RESEARCH LETTER

Macmillan Publishers Limited. All rights reserved©2014

Page 11: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

G1

47

.H8

C1

08

.H6

1.h

2

5.4

5.5

5.5

5.7

7.77.8 7.6

U1

09

.H6

5.6 5.45.8

U1

46

.H6

A1

1

U CN

C CN

U146.C1’

C108.C1’

C1

08

.H1

U1

46

.H1

’¹H (p.p..m)

¹H (p

pm

)

H, p

.p.m

1.0

3.0

3.0 1.0H, p.p.m

e

i

g

k

a

U121

d

5.506.006.507.007.508.00H (ppm)

8.10

8.20

8.30

H (p

pm)

5.506.006.507.007.508.00H (ppm)

4.20

4.30

H (

pp

m)

6.08.0 7.07.5 6.5 5.5

4.2

4.3

8.18.28.3

123.

H1’

123.

H6

122.

H8

122.

H1’

121.

H1’

121.

H5

120.

H1’

121.

H6

U121.h2'

A122.h2

135.

H1’

f

l

U1

46

.H6

U1

46

.H1

U1

46

.H5

6.0

6.0

8.0

U146.C5

U146.C6

U146.C1’

G1

30

.H3

C1

28

.H1

A1

29

.H8

C1

28

.H5

C1

28

.H6

C1

28

.H5

C1

28

.H5

’’

C1

28

.H2

C128.C1’

C128.C6

A129.C1’

A129.C8

c

(b)

h

U107U146

U145U109

C108

G110

G UAU

G

U

UC

U

CC

UG

GG

CG

110

150

5’

3’

G

U

UC

0

UU

0 GG

(b)d

U121

A122

A122G135

A129C128

j

H, p

.p.m

, p.p

.m, p

.p.m

H, p

.p.m

b

H, p

.p.m

H, p

.p.m

C, p.p.m13

H, p

.p.m

C, p.p.m13

¹H (p.p..m)

¹H (p.p..m)

LETTER RESEARCH

Macmillan Publishers Limited. All rights reserved©2014

Page 12: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

Extended Data Figure 3 | Structural characterization of U5-PBS in the freeand bound form. a, Three-dimensional NOESY-HMQC strip plot of the13C-edited 1H–1H planes for C128 and A129 aromatic and H19 protons. NOEsfrom A129 H8 to the preceding C128 ribose protons are indicative of stackinginteraction between the two residues. In contrast, the minor conformation thatis populated upon NC binding does not give rise to any inter-residue NOEs(data not shown). b, Structure of the UCAG tetraloop. The U–G hydrogenbonds, characteristic of UNCG tetraloops, are shown in green. c, Lack of NOEsbetween U121 and A122 indicate a break in regular stacking interactions betweenthese two residues. Strip plots of 13C-edited 1H–1H planes for U121 H29 andA122 H2 from three-dimensional NOESY-HMQC spectra are shown. The U121

H29–C123 H6 and A122 H2–G135 H19 long-range NOEs are indicated in red.d, The A122 bulge forms a triple base interaction. Hydrogen bonds are shown ingreen, and the A122 H2–G135 H19 distance is indicated by a solid black line.e, Strip plots of 13C edited 1H–1H planes from three-dimensional NOESYHMQC spectra of U5-PBS, showing interresidue NOE connectivities. Black,dashed lines show sequential inter-residue connectivities, while the orange linesrepresent long-range interactions. The strong G110 H8-to-H19 intraresidueNOE is indicative of a syn glycosidic torsion angle. Residues in the PBS and NCbinding site are labelled with blue and red, respectively. f, Lowest-energyAMBER structures of the (UCUGA111, UU146) internal loop, showing the

flexibility of U107 due to lack of inter-residue NOEs. Alignment of residues108–111 and 144–146 yields an r.m.s.d. of 0.8 6 0.3 A. g, Portions fromthree-dimensional NOESY-HMQC spectra collected for selectively labelled13C,15N-U5-PBS. As evidence for the ribose-zipper motif, intense H19–H19

inter-residue connectivities are observed between U146 and C108, which arelocated on opposite strands. h, Portions from three-dimensional NOESY-HMQC spectra collected for selectively labelled 13C,15N-U5-PBS in complexwith NC. Both the ribose and aromatic protons of U146 do not give rise to anyinter-residue NOEs, indicating the lack of U146 interaction with neighbouringresidues. An inset of the secondary structure of the bulge region is shown tovisualize the availability of the released uracils. i, j, 1H–13C two-dimensionalHMQC spectra collected with 0 (black) and 0.9 (red) equivalents of NC.Perturbation of G155, U156 signals towards the minor (indicated by an asterisk)conformation are shown by dashed lines. Blue asterisk indicates the emergenceof new freed uridine resonances after NC addition. k, Specific perturbationof G100 is seen by selective loss of imino NOE to G101. l, A portion of two-dimensional NOESY spectra collected in D2O for a 1:1 Y-packaging signal–NC complex (black) and U5-PBS–NC complex (red). The complete match ofthe two data sets indicates that, similar to the Y–NC interaction19, the NC tailscontinue to exist in a random coil confirmation upon complex formation.

RESEARCH LETTER

Macmillan Publishers Limited. All rights reserved©2014

Page 13: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

0.0 1.0 2.0 3.0 4.0

30 40 50 60 70 80Time (min)

Molar Ratio

tRNA -T2MPro

0.0 0.5 1.0 1.5 2.0 2.5-14-12-10-8-6-4-20

-0.16

-0.12

-0.08

-0.04

0.0020 30 40 50

Time (min)

μcal

/sec

Molar Ratio

kcal

/mol

e of

inje

ctan

t

Kd: 357 ± 78 nMN: 1.3 ± 0.1

0.0 0.5 1.0 1.5 2.0 2.5

-12-10-8-6-4-20

-0.16

-0.12

-0.08

-0.04

0.0020 30 40 50

Time (min)

μcal

/sec

Molar Ratio

kcal

/mol

e of

inje

ctan

t

Kd: 374 ± 54 nMN: 1.1 ± 0.2

G

31

GA

CU

GC

CG

GU

A G

U U

41

C G

AU5’ 3’

31

GA

CU

GC

CG

GU

A G

41

C G

AU5’ 3’

GΨ Ψ

m1

40 60Time (min)

20

0 1 2 3Molar Ratio

4

0.00

-0.04

-0.08

0.0

-4.0

μca

l/sec

KC

al/M

ole

of I

njec

tant

0.0

-1.0

-2.0

0 1 2 3 4

-2.0

0 1 2 3

0.0

-1.0

-2.0

-15

-10

-5

0

5

10

V1 V2 Ψ-packaging

signal

ΔG

ΔH

−ΤΔS

375 +_ 52

33 +_ 3

1422 +_ 237 X

X Xsite V1 ( nM)

site V2 (Kd, nM)

MV1 Native MV1 MV2 Molar Ratio

KC

al/M

ole

MV1

MV1 MV2 Native

Kd,

-8.0

-4.0

0.0

-0.16

-0.08

0.00

MT1

MT1 MT2 ΔG

ΔH

−ΤΔS

tail-mediated loss in enthalpy

tail-mediated gain in entropy

-15

-10

-5

0

5

10

T1 T2 T3 SL

834 +_ 343 439 +_ 55 704 +_ 85

X

60 80Time (min)

4 0

μca

l/sec

KC

al/M

ole

of I

njec

tant

0 1 2 3 4 5 6

0 1 2 3 4

0 1 2 3 4

0.0

-4.0

-6.0

-8.0

Molar Ratio Molar Ratio

0.0

-4.0

-6.0

-8.0

153 +_ 28 X

X

X

MT1 Native MT2

4 +_ 2

13 +_ 3

MT1 MT2

11 +_ 7

315 +_ 130

KC

al/M

ole

Native

site T1 ( nM)

site T2 (Kd, nM)

Kd,

site T3 (Kd, nM)

a

b

c e f

d

0

20

40

60

80

100

120

WT G110U DM NTC

% g

en

om

e p

ack

ag

ing

Ψ control

g

LETTER RESEARCH

Macmillan Publishers Limited. All rights reserved©2014

Page 14: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

Extended Data Figure 4 | Thermodynamic analysis of NC-U5-PBS andNC-tRNAPro interaction. a, b, ITC data for NC interactions with the U5-PBSdomain (monomeric form; see Supplementary Discussion 1) (a) andtRNAPro (b), along with their respective mutant RNAs (V1M 5 G110U;V2M 5 G130A; T1M 5 G9A; T2M 5 G35A,G36A,G37A; T3M was not madesince G23 is part of a critical base triple). The Kd values are the averages ofthree independent experiments, n 5 3. Continuous lines represent the fit fortwo- and three-site binding models for U5-PBS and tRNAPro, respectively.As expected, V1M is fit with a one-site binding model, while V1MV2M abolishesNC binding. Similarly, T1M and T2M are fit with two-site binding models, whileT1MT2M is fit with a one-site binding model. Entropy values were calculatedusing T 5 303 K. The net gain in entropy (green) and loss in enthalpy (red) areshown for site T2 in full-length tRNAPro in comparison with the isolatedanticodon SLAC RNA. The favourable enthalpy resulting from zinc-fingerinteraction with the anticodon loop in tRNAPro (DH 5 214.2 kcal mol21) isoffset by loss of enthalpy in RNA elements beyond the anticodon stem (lossof 9.8 kcal mol21) and, consequently, a significant entropic compensation(gain of 11.5 kcal mol21) is observed (see Supplementary Discussion 7).c, Titration of NC into a mixture of tRNAPro and U5-PBS. Lanes 1, 2, 3 and 4

correspond to 1, 2, 3 and 4 equivalents of NC. Addition of four equivalentsof NC to a mixture of tRNAPro and U5-PBS led to a near complete annealing ofthe molecule. For this experiment, the U5-PBS dimeric construct was usedbecause, for a functional assay, the entire 18-nucleotide primer binding siteis required. NMR studies on this construct, however, showed that the constructis in equilibrium with the monomeric form. This experiment demonstratesthat only a few NC molecules are required to yield a functional complex.d, Reverse transcription of heat-annealed versus NC-annealed U5-PBS–tRNAPro initiation complexes. The 10-nucleotide DNA product is obtained byreverse transcriptase terminating transcription after incorporating ddCTP atposition 10 (and subsequent removal of the tRNA nucleotides by RNasetreatment). e, ITC data for NC binding to tRNAPro T2M (AGGG37 is mutated toAAAA37). f, ITC data confirming that base modifications are not required forNC binding. ITC data (n 5 3) were fit well with a one-site binding model.g, A packaging assay confirms that the mutations do not affect genomeencapsidation (n 5 6). As a positive control, the observed decrease for Y(C331G) control is similar to that previously observed37, and the non-templatecontrol (NTC) exhibits no viral RNA signal.

RESEARCH LETTER

Macmillan Publishers Limited. All rights reserved©2014

Page 15: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

7.2

7.4

7.6

7.8

8.0

8.2

43

42

41

40

39

31

30

29

28

2726

44

34

32

33

3635

37

42

.h2

44

.h2

43

42

44 44

.h2

42

.h2

41

40

38

34

32

37

31

30

2827

26

29

39

3536

33

38

G

UU

U

CC GA

AU

G G GG

GGC

5’

3’

anticodon stemloop

7.2

7.4

7.6

7.8

8.2

6.0 5 .9 5 .8 5 .7 5 .5 5 .4 5 .3 5 .2

31

1741

37

15’

3’

51

61

71

749

22 25 27

44

46

47

U

A

C

H p

.p.m

anticodon stemloop

UU

U

CC GA

AU

G G GG

GGC

31

4137

27

44

UC

GC

GA

GC

C

45

9

a

b

8.0

1

H p

.p.m

1

H p.p.m1

6.0 5 .9 5 .8 5 .7 5 .5 5 .4 5 .3 5 .2H p.p.m1

Extended Data Figure 5 | Assignment of anticodon stem loop in tRNAPro.a, b, Two-dimensional 1H–1H NOESY spectra for the isolated anticodon stemloop (a) and full-length tRNAPro (b) constructs, with blue lines indicating

inter- and intra-residue aromatic-to-H19 NOE connectivities (see alsoSupplementary Discussion 4).

LETTER RESEARCH

Macmillan Publishers Limited. All rights reserved©2014

Page 16: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

22 25 27

46

48

b

a

C

GGCUC

UG

GGG

UU

A A

C

U

CC

AG

G

AC

AC

CC

GCCC

G

A

C

5’

3’

TΨC loop

anti-PAS

anti-PBS

61

66

71

31

1841

37

C

GGCUC

UG

GGG

UU

A A

C

U

CC

AG

G

AC

AC

CC

GCCC

G

A

C

15’

3’

51

61

66

71

9749

7.2

7.4

7.6

7.8

8.0

8.2

8.4

1

2

3

4

5

6

49

51

52

53

54

6056

57

57

58

59

59

61

62

6366

6567

68

69

70

71

72

76

69

.h2

66

.h2

5075

58

.h2

64

74

52

54

53

51

50

62

63

65

64

61

58

.h2

1

2

4

1

3

5

6

7.h549

7

66

67

68

69

70

71

66

.h2

75

69

.h2

7475

49.h5

55

6.0 5 .9 5 .8 5 .7 5 .5 5 .4 5 .3 5 .2

H p

.p.m

H p.p.m

7.2

7.4

7.6

7.8

8.0

8.2

8.4

1

1

H p

.p.m

1

H p.p.m6.0 5 .9 5 .8 5 .7 5 .5 5 .4 5 .3 5 .21

1

58

56

60

49

Extended Data Figure 6 | Assignment of the acceptor–TYC-stem loopconstruct confirms proper coaxial stacking in the context of the full-lengthtRNAPro tertiary structure. a, b, Two-dimensional 1H–1H NOESY spectrafor the isolated acceptor–TYC-stem loop (a) and full-length tRNAPro

(b) constructs, with black and red lines indicating inter- and intra-residuearomatic-to-H19 NOE connectivities in the acceptor stem and TYC-stem loop,respectively (see also Supplementary Discussion 4).

RESEARCH LETTER

Macmillan Publishers Limited. All rights reserved©2014

Page 17: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

2827

44

.h2

12

24

.h2

25

26

22

21

11

7.2

7.4

7.6

7.8

8.0

8.2

6.0 5 .9 5 .8 5 .7 5 .5 5 .4 5 .3 5 .2

31

8

1741

37

1

5’

3’

5161

71

7

49

22 25 27

44

46

48

GU

UUU U

GCG G

GG A A

U

A

CA

D stemloop

10

7.4

7.6

7.8

8.0

8.2

12

13

24

.h2

loop Gs

5 .9 5 .7 5 .5 5 .3 5 .1

D stemloop

GU

UUU U

GCG G

GG

A

A A

5’

3’C

GC

GGC

a b

c

H p.p.m15.0

7.5

8.0

5.45.2

1

D-stemloop, 45 °C

tRNApro , 35 °CtRNApro , 40 °CtRNApro , 45 °C

53

4318

19

16

7.7

7.8

7.9

5.8 5.7 5.6 5.8 5.7 5.6

G37

G35

G36 G36

G37

G35

G16

1

7.5

7.6

5.2 5.15 5.2 5.15H ppm1

H p

.p.m

1

U54

U54

C31

d

e

20

7.0

7.1

5.7 5.5 5.7 5.5

G18

G43 G43G- sampleH

GU- sampleH

13

45

H p.p.m1

H p

.p.m

1

H p.p.m1

H p

.p.m

1

H p.p.m1

H p

.p.m

1

H p.p.m1

H p

.p.m

1

H p.p.m1

Extended Data Figure 7 | Assignment of the D-stem loop. a, b, Spectra forthe isolated D-stem loop (a) and full-length tRNAPro (b) constructs, with linesindicating inter- and intra-residue aromatic-to-H19 NOE connectivities. Thechemical shifts of residues in the D-stem loop are drastically altered in thecontext of the full-length tRNA due to the formation of multiple long-rangeinteractions between the D-stem loop and other regions in the tertiary structureof tRNAPro. c, Extreme sensitivity of the tRNAPro elbow tertiary interaction totemperature. At higher temperature, the D-stem loop loses its long-range

interaction with the TYC-loop, and chemical shifts resemble those for theD-stem loop in the isolated form (see Supplementary Discussion 4).d, e, Sensitivity of the tRNAPro elbow tertiary interaction to magnesiumdivalent salt. The data show the downfield shift of the U54 base in the TYC-loopand the characteristic signal for G16, G18 that arises as the D-loop becomesstructured upon elbow interaction formation. In contrast, the bottom rightpanel shows the insensitivity of the isolated TYC loop to divalent salts (see alsoSupplementary Discussion 4).

LETTER RESEARCH

Macmillan Publishers Limited. All rights reserved©2014

Page 18: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

14.0 13.5 13.0 12.5 12.0 11.5 11.0 10.5

11.6 11.4 11.2 11.0 10.8

10.6

10.8

11.0

11.2

11.4

11.6

7.7 7 .6 7 .5 7 .4 7 .3 7 .2 7 .1

13.6

13.8

14.0

14.21

A58.h2

14.0 13.5 13.0 12.5 12.0 11.5 11.0 10.5

21-13

4513

45-21

54-53

A44.h2 A42.h2

A69.h2

10

26-43

11-46

70-2

4-70

4-67

28-43

28-41

30-39

65-6430-41

45-10 11-1025-10

45-25

11-23

11-25

9-10

52-51

53-52

22-13

4613

U54

U26 U28

U4 U25U22 U13

G70 G45 G39

G53

G30 G9

G43

G65G64 G51

G68 G41G10

G46

G2

G52

U11

25-9

G23

a

H p

.p.m

1

14

.0

13

.5

13

.0

12

.5

1

2.0

1 1

.5

1 1

.0

10

.5

H p.p. m1

1

14.0 13.5 13.0 12.5 12.0 11.5 11.0 10.5

160

144

H p.p.m

N p

.p.m

15

H p

.p.m

1

H p.p.m1

9-46

52

53

51

65

64

6

70

43

41 30 38

41

.h2

2

1

10

18

19

239

16

45

46

15

68

7.2

7.4

7.6

7.8

8.0

57

.h2

8 .2

H p

.p.m

1

5 .9 5 .8 5 .7 5 .5 5 .4 5 .3H ppm1

1

b

5.9 5 .8 6.1 6 .0 H p.p.m

A5

8.h

1’

A7

6.h

1’

G53.h8

7.2

7.4

7.6

A5

8.h

1’

G18.h8

G19.h8

8.40

8.45

c

G

UU

U

CC GA

AU

G G G GGC

31

GU

1741

37

C

GGCUC

UG

GGG

UU

A A

C

U

CC

AG

G

AC

AC

CC

GCCC

G

A

C

15’

3’

51

61

71

749

22 25 27

44

46

48

U

A

CGU

UUU U

GCG G

GG A A

A

GG

UC

7.9

7.7

dU60

U8

8.25

8.3

7.9

7.7

U13

A14

G15C48

H p.p.m

H p

.p.m

H p.p.m

H p.p.m1

H p

.p.m

1

Extended Data Figure 8 | Structural characterization of tRNAPro.a, Imino resonances for full-length tRNAPro indicate proper tertiary structureformation. Portions of the 1H–15N two-dimensional HSQC spectra for U- (top)and G-labelled (centre) full-length tRNAPro constructs, with iminoassignments indicated. A portion of the 1H–1H two-dimensional NOESYspectrum showing imino-to-imino NOEs is shown at the bottom. b, Two-dimensional NOESY of a GH sample of the T2M construct wherein only theguanosines were protonated and the other three nucleotides were deuterated.This mutant was chosen to unambiguously assign the G resonances from theD-loop and the variable loop by reducing the spectral complexity from the

G-rich anticodon loop. c, Strips from fully protonated and G-protonated two-dimensional NOESY spectra showing direct evidence for the D-loop–TYCinteraction (see Supplementary Discussion 4). The left panel shows datacollected at low temperature that allow us to confirm the long-rangeassignment because the A58 H19 NOE spin diffuses to the aromatic protonof G19 via G18. d, Strips from fully protonated sample showing the NOEwalk from the D-stem to the loop residue G15 that forms the critical Levitt basepair with C48. This arrangement leads to an unusual downfield shift of the C48

H19 (see Supplementary Discussion 4).

RESEARCH LETTER

Macmillan Publishers Limited. All rights reserved©2014

Page 19: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

7.0 6 .8 6 .6 7.0 6 .8 6 .67.0 6 .8 6 .6

Y28he

A34h

1’

W35

h4

W35

hd

W35

h7W

35h6

W35

h5

5.4

NC-tRNApro : 1:1 NC-tRNApro : 2:1

(free)

(NC-bound)

g

W35

hd

W35haG37h1’

NC-SL: 1:1

7.4 7 .0 6 .6

6.6

5.8

8.2

h

W35.H5

W35.H4

W35.H6

W35.H7

W35.H2

Y28.HD

W3

5.H

A

U2

0.H

5

W3

5.H

A

U2

0.H

5

Y2

8.H

E

G2

3-H

8

G2

3-H

8

7.0

7.0

7.9 7.9

6.5

5.5 5.5

tRNAPro -TM1TM2-MLVNC

complex

UAUG23MLVNC

complex

W35

h4

W35

h7W

35h6

W35

h5

W35

hd

H p.p.m1

i

UAUG MLVNC

complex

f

a

6.6

6.6

7.4 7.2 7.0 6.8 6.6 6.4 6.2 6.0

NC:r(GUUG)

NC:tRNA T2MPro

Tyr 28.hd

Tyr 28.hd

U7.h5

Tyr 2

8.he

Trp35.hd

Trp35

.h5

7.3 7.2 7.1 7.0 6.9

6.6

6.7

6.8

6.9

7.0

7.007.05

7.9

Tyr28hd

Trp35h5

Trp35hd

Trp3

5h7

Trp3

5h6

Trp3

5h4

Tyr2

8he

G9h

8

G9h1’

6.8

5.7

b c

d

e

G6

G30

G19

G23G9

G68

G15

7.6

7.8

5.605.65

1: 0.1

G

(H8)H

T2 M,

U26

U25

7.12

8.1 7.87.98.0

1

G65

7.35

A6

9.h

2C

67

.h6

C67.h1’

A66.h1’

G70.h1’

C5.h1’

C67.h5

G53

G10

H p.p.m

H p.p.m

H p.p.m

1

1

1

H p

.p.m

1

H p

.p.m

1

H p.p.m1

H p.p.m1

H p

.p.m

1

H p

.p.m

1

H p

.p.m

1

23

H p.p.m1

H p

.p.m

1H

p.p

.m1

H p

.p.m

1

H p.p.m1

H p

.p.m

1

LETTER RESEARCH

Macmillan Publishers Limited. All rights reserved©2014

Page 20: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

Extended Data Figure 9 | Structural characterization of NC-tRNAPro

interactions. a, The packing of U7 against the Tyr 28 side-chain protons leadsto a downfield shift of the H5 and H19 protons of U7. Intermolecular NOEscharacteristic of the fourth G of an NNNG motif entering the hydrophobicpocket of the NC protein zinc finger are also observed for G9 of the GUUG9.An r(GUUG) construct essentially gives the same NOE pattern as the tRNAPro

GUUG9 upon interacting with NC, and was used as a guide for assignments.This technique has been used previously for the genome-Y-packaging–NCcomplex. The tRNAPro T2M construct was used for assignments since it has noeffect on site T1 binding but allows for unambiguous assignments; that is, itensures that no NC is bound to the second site at 1:1 titration. Nevertheless, wehave confirmed via HMQCs that identical site T1 binding chemical shifts areobserved in the native and the T2M mutant. b, Portion of two-dimensionalNOESY spectra collected with 0 (black) and 0.3 (red) NC equivalents.Selective perturbation of the helical walk between the 10th and 11th anti-PBSresidues, 210C67 and 211A66, is shown. In contrast, the typical minor grooveconnectivity of the A69 H2 shows the preservation of adjacent intramolecularassociations of the seventh (27G68), eighth (28A69) and ninth (29G70) anti-PBSresidues. c, A portion of two-dimensional NOESY spectrum showing H8/H19

correlations for the selectively protonated T2M–GH tRNAPro sample. Datafor the T2M construct are shown to demonstrate that the observed structuralchanges are due solely to NC binding to site T1 rather than site T2. Addition ofonly 0.1 NC equivalent results in immediate perturbation of specific signals(G6 and G9), confirming the location of the first NC binding site. d, e, Portion of

two-dimensional NOESY spectra collected with 0 (black) and 0.3 (red) NCequivalents, showing the preservation of the stacking interaction between theD-stem and anticodon stem (e), whereas the walk between the acceptor stemand TYC stem is disrupted (d). f, A portion of two-dimensional NOESYspectra collected in D2O for a 1:1Y-packaging-signal–NC complex (black) andtRNAPro T2M–NC complex (red). The complete match of the two data setsindicates that, similar to theY–NC interaction, the NC tails continue to exist ina random coil conformation upon complex formation19. g, Characteristicintermolecular NOEs from the G37 residue of the anticodon AGGG37 bindingsite to the NC zinc finger. The isolated anticodon (SL) construct essentiallygives the same NOE pattern as the site T2 tRNAPro binding upon interactingwith NC. However, this pattern is observed only upon titration of NC over 1:1in the tRNAPro construct, demonstrating the sequential binding involved inNC-mediated remodelling. h, Characteristic intermolecular NOEs from theG23 residue from the UAUG23 binding site to the NC zinc finger. The tRNAPro

T1MT2M construct NOE pattern was an exact match with that of the NC–r(UAUG)23 complex, explicitly confirming the third NC binding site, site T3.The delta proton of Tyr 28 proton of the NC protein displays an intermolecularNOE to the downfield-shifted H5 proton of U20, which indicates packing ofRNA protons against the Tyr 28 side-chain protons. Furthermore, the presenceof characteristic intermolecular NOEs between the H8 proton of G23 andthe Trp 35 aromatic H5 and H6 protons indicates specific insertion of G23 intothe hydrophobic pocket of the zinc knuckle domain of the NC protein.i, Structure of NC bound to tRNAPro T1MT2M.

RESEARCH LETTER

Macmillan Publishers Limited. All rights reserved©2014

Page 21: A structure-based mechanism for tRNA and retroviral RNA remodelling during primer annealing

Extended Data Table 1 | Statistics for the NMR-derived structures for U5-PBS and tRNAPro in the free form and in various NC-complexedstates, respectively

1H-1H distance restraints Intraresidue Interresidue H-bond restraints (4/H-bond) Torsion angle restraints

768 342 426 261 270

159 20 139 22 -

49 - 36 13 -

639 342 297 130 -

159 20 139 22 -

49 - 36 13 -

Mean AMBER energy (kcal mol-1) Mean constraint energy (kcal mol-1) Distance violations (> 0.5 Å) Dihedral angle violations (> 5°)

°

- - - - - -

-19316 125.78 1 0 0 0.54

(residues 26-31,39-46) (residues 10-14,20-24)

(residues 49-53,61-65)

1H-1H distance restraints Intraresidue Interresidue H-bond restraints (4/H-bond) Torsion angle restraints

758 339 419 261 270

320 40 280 46 -

97 - 71 26 -

629 339 290 132 -

320 40 280 46 -

97 - 71 26 -

Mean AMBER energy (kcal mol-1) Mean constraint energy (kcal mol-1) Distance violations (> 0.5 Å) Dihedral angle violations (> 5°)

°

- - - - - -

-22220 186.17 0 0 0 0.38

(residues 26-31,39-46) (residues 10-14,20-24)

(residues 49-53,61-65)

CN:SBP5U SBP-5U

CYANA model

AMBER structure

U5PBS NC U5PBS:NC

NMR-derived restraints 1H-1H distance restraints Intraresidue Interresidue H-bond restraints (4/H-bond) Torsion angle restraints RDC restraints

696 324 372 282 316 -

555 324 372 141 - 32

675 333 342 272 302 -

162 20 115 27 - -

64 - 42 22

Statistics for AMBER structure Mean AMBER energy (kcal mol-1) Mean constraint energy (kcal mol-1) Distance violations (> 0.5 Å) Dihedral angle violations (> 5°) RDC violations (> 0 Hz)

15,961 36 0 0 0

- - - -

Target function (Å2) Mean ± Std Dev Minimum Maximum

- - -

4.32 ± 0.56 3.35 5.29

Restraint violations2 Av. sum of upper distance viol. (Å) Av. max. upper distance viol. (Å) Av. sum of VDW viol. (Å) Av. max. VDW viol. (Å) Av. sum of torsion angle viol. (°) Av max. torsion angle viol. (°) Av. sum of lower distance viol. (Å) Av. max. lower distance viol. (Å)

- - - - - - - -

27± 5 0.4 ± 0.04 13.9 ± 1.3 0.61 ± 0.17 3 ± 3 4.84 ± 7.35 3 ± 1 0.21 ± 0.02

Structure Convergence (Å) Top (residues 114-142) bottom (residues 94-106,147-154,156-159) Internal loop (UCUGA111, UU146) NC (Arg21-Pro43)

0.6 ± 0.2 1.2 ± 0.5 0.8 ± 0.3 -

0.75 ± 0.23 1.5 ± 0.15 - 0.41 ± 0.12

*Pairwise r.m.s.d. was calculated among ten refined structures.

LETTER RESEARCH

Macmillan Publishers Limited. All rights reserved©2014