Upload
others
View
2
Download
0
Embed Size (px)
Citation preview
Indian Journal of Biochemistry & Biophysics Vol. 38, October 2001 , pp.289-293
Mini Review
A unique group of self-splicing introns in bacteriophage T4
Asad U Khan *l, Malik Ajama\uddin' and Masood Ahmad#
I Interdisciplinary Biotechnology Unit. A M U Aligarh 202 002, Indi a
# Biochemistry Department Faculty of Life Sciences, A M U Aligarh 202 002, India
Received 1 February 2001: accepted 18 May 2001
We describe in this review, the salient splicing features of group I introns of bac teriophage T4 and propose, a hypothetical model to fit in the self-splic ing of nrdB intron of T4 phage. Occurrence of non-coding sequences in prokaryotic cell s is a rare event while it is common in ellkaryotic cells, especially the higher eukaryotes. Therefore. T4 bacteriophage can serve as a good model system to study the evoluti onary aspects of splicing of introns. Three genes of T4 phage were fou nd to have st retches of non-coding sequences which belonged to the group IA type introns of self-splici ng nature.
Discovery of introns in 1977 heralded a new era in the study of molecular biology of eukaryotic gene expression I. The complexity of gene organization, the combinatorial possibilities of assembling different coding exons from an RNA precursor, and the novelty of the RNA splicing process clearly indicate that eukaryotic molecular biology would be fasc inatingly different from that of prokaryotic system2
.3 . But Mon'issey and Tollervey have highlighted a striking similarity between prokaryotic and eukaryotic systems4 th at ribosomal RNA processing in eukaryotes, archaebacteria and bacteria is of common origin can be shown by the fact that the pre-rRNA processing is fo und in archaeon, Solfolobus acidocaldarius5
.6
.
In general, prokaryotic genes do not have introns because of energetic demands of rapid generation time, coupled with single origin of replication7
. Therefore, it is thought that introns and other non-coding DNA sequences have been kept out of prokaryotes. But an exception to this is the occurrence of three introns in bacteriophage T4. Group I introns have been found in mitochondrial and chloroplast genomes of plants, fungi, a few nuclear rRNA, bacteriophages and also in some fungal plasmids8
. Based on all known group I intron sequences, comparative folding analyses and computer search for structure with minimum free energy of folding were performed and generic
d d . d . d9 10 secon aryan tertiary structures were enve . .
* Author for correspondence at hi s present address: Biochemi stry Department, RWJMS , University of Medicine and Dentistry New Jersey 675 Hoes Lane, Pi scataway, New Jersey 08854-5635. USA; E mail: [email protected]
T4 introns The bacteriophage T4 genome contains three
introns (Fig. I) , all of which reside in genes involved in nucleotide biosynthesi s. The first T4 intron was discovered in 1984 in the tel gene coding for thymidylate synthase t I , whereas the other two introns are in the nrdB gene coding for small subunit of ribonucleotide reductase l2 and the nrelD gene coding for small subunit of anaerobic ribonucleoside triphosphate reductase l3
. The reason of the occurrence of introns only in the genes involved in nucleotide biosynthesis is still unanswered. These introns differ in sequence and size; the nrelD and tel introns comprise 1033 and 1016 nucleotides respectively, whi le IlrdB intron has only 598 nucleotides. All these introns comprise Open Reading Frames (ORFs). The ORFs are not homologous and occur at three different positions, at least two of which looped out from a secondary structure. Autocatalysis of these introns is enabled by the folding of introns into characteristic secondary structures required for intron excision and exon ligation 10. 14. The critical ribozyme secondary structure comprises 10 RNA pairing regions (PI-PIO) which alongwith conserved primary sequence elements namely P, Q, Rand S identify the T4 .. I . 15 t 6 Th P4 1I1tervemng sequences as group 1I1tron · . e .. P6 domain directs the folding of Tetrahymena ribozyme core 17. It is not established that whether the tertiary structure of group I intron is essential for catalytic activity. Basic features of predicted three dimensional (3 D) tertiary structure have been studied by different techniques like photochemical crosslinking reactions, affinity cleavage reagents, NMR
d I · h 18·22 spectroscopy an computer a gont m .
290
A.nrd B
INDIAN J BIOCHEM. BIOPHYS., VOL. 38, OCTOBER 2001
U U A U U C-G
P 5 C-G U A GU '1
A G A A A A A G C AA t
_A-U C-G G-C U-G A-U G-U
PI 9-C P2 A-U U-A C-G A-U . P3 P4 C-G PG AAG PGa/79.
5'U9-CCUUUAUCGAAAACUCCUUUAAU~:~-GCC GUUUU GC A· I I I I I I • I I I I I I I I I ORF U· GAGGGG UGG CAAAACG, G.
P8 / P7 CU
A A '-256 J UCCCUAA AUCAGC:
C I I I , I I \" I I, I C GGGAUUG UGAUA UAG U C U G G A C-G C A G-C-G-G-C GAAC A
U-A U-A G-U U UAAAGC-GAUG.C-G P9 G-C P9.1 U-AP9.2U-A A
U-G A-·.U P7.1 C-G C-G U- A C C G G -C A A-U A A G C G
P7.2 A-U G U U G U-A U G A-U g-G-C C U C U G-C AA G -C JJ
C A -,.... U-A G A A G a GAPS C-G U A A A
U-A U G C C U C
C A A G A A A I
B. td UU
A G A A U G -JJ G UG
3'
'l- C G A G-U 9 - C A .. G-<;'" U G-C 9-U A-U G-C
PI ~:::~ ~:~ P3 P4 g:g PS uAAAU PSa /38 l 5' C-G-U-ACUUGUAAUCUAUCUAAAU-Yr Y~Y~y~~CU A·
IIIIII CG AC;'UCUUCGORFU'
P B . .............A G A U A G P 7 uC U AA A U \ JG' UC ~ A 735
U GUUCAAC AUCAGCA , I I I I I I /' I II I · I
UAACAAGUUGG AGAUA UAGUC UGCUC-G-C-G--G-CUGAACA U-A U G G-U LI
C AAGCGAUGAU G-C G C P9.1 U-A P9. 2 A P7. 2
CUG
G G-C p7.1 P9 C-G'G-A-U- C A-U t.
A-U G-:G-CC A-U U U A-U lJ G-C G U U-A A A G- C G
U A UU G C U A A C ~ GA C G A U A II
A U UG U G UAA C-G AAC
c. suny )'
G G U-A CUP 5 C-G A C U-A A A C C
-+U-A A A U-G A A 9-C A I
a-U G · G U C PI 9-C G C- PG AAA PGA
U -A P 3 A_C G_U U U A 5'uaUGUAAAAUCUGCCUAAAU--GC CAGC ' O
I I I I I I I I I I I I
YAGACGG CCC ~AA~ U CC
llA U
PB / /' P7 . A UCUCAAC AUCAGCA
A 111111 1"111 I AGAG\J.UG A--AGAUA UAGUC UGAAC-G-C-G-G-CUGAACA
~ U-A U-G U-A C CAAACC-GAUGA-U G-CP91 G - CP9.2 A - U A
C-G G-C P9 C-G . U-A G-C U P7. 2 C-G G-C P7.1 A-U U-A A-U U
A-U G-C U-A U-A G C G G-C A-U G-C A-U A A a
C A A G· GA ' U - A A A U
GA U A Uc. '(" G 9
L G U FIG C 03' UG ORF UAG AA 31 754
L ~y~ J ----=------
Fi g. I-Secondary structure for intron sequences from the IIrdB , Id and SUllY (nrclD) genes
KHAN el aJ.: A UNIQUE GROUP OF SEL-SPLICING INTRONS IN BACTERIOPHAGE T4 291
Self splicing of group I intron
Cech and coworkers23 found that a ribosomal precursor RNA of Tetrahymena could remove its own 413 nucleotides long intron in the absence of proteins in vitro. This reaction proceeds by two consecutive trans-esteri fication reactions and requires a di valent
t· (M 2+ M 2+) . ca IOn g or n as well as guanosIne (or a guanosine phosphate)24. Divalent cation is needed for the formation of stable secondary structure of group I . t 25.26 R I' h b f In ron . ecent y It as een ound that Cobalt ion [Co(III)(NH3)6]3+ (ref. 27-29) and other monovalent cation (Lt or even NH/) at higher concentration can
b · h M 2+ su stltute t e g (ref. 30). These findings reveal that the fundamental requirement for RNA catalysis is simply one of a relatively positive charge. The 3' end of the 5' exon forms a hybrid with an internal guide sequence (lGS) in the 5' end of the intron. The transesterification reaction is initiated by the external guanosine which is held in a guanosine binding site of the intron hydrogen bridges31. The 3'-OH of this g.uanosine attacks the phosphorus atom at the 5' splice site and forms 3',5'-phosphodiester bond with the first nucleotide of intron. In the second trans-esterification step, the free 3' hydroxyl group at the 3' splice site brings about the ligation of exons and removal of the linear intron (linear intervening sequences), while the extra guanosine remains at its 5' end. During this reaction, the conserved 3' guanosine from the intron remains bound at the G-binding site. Subsequently, the intron itself undergoes a self-catalyzed transesterification reaction. The conserved terminal 3' guanosine residue still bound in the G-binding site attacks a phosphorus atom near the end of the molecule resulting in intron cyclization (cyclic intervening sequences) and removal of short 5' 01 igonucleotide32.
Self-splicing of group I intron is very sensitive to mu~ation in core of this structure but many of the penpheral stem-loop structures can be deleted without loss of splicing function in vivo. These structures may, however, play a role in stabilizing the intron or providing binding sites for proteins which facilitate or ~egulate self splicing in vivo. Introns themselves may In fact encode a protein, involved in their own structural stabi lization33.
Splicing of T4 introns The T4 introns have the ability to self-splice in
't 13.34 A I" b Vl ro . utocata YSls IS ena led by folding of the intron into characteristic secondary structure that facilitates a series of trans-esterification reactions
necessary for intron excision and ex on ligation8. The T4 td intron contains four long range base paired regions in P3 , P6, P7 and PI 0 . The remaining base paired regions are stem-loop structure including P7.1 and P7.2 which coupled with specific variants in the P, Q, Rand S sequence elements further sub classify the T4 introns as group IA 8.15 .
The secondary structure model for group I intron was initially postulated on the basis of phylogenetic comparisons of intron sequences from various organisms which predicted a common secondary structure35.36. Since then, mutational analysis of autocatalytic group I intron in Tetrahymena, T4 and yeast mitochondria has identified many splicing defective cis-acting intron mutations that would destabilize p,redicted RNA pairing regions, thus generally supporting the model. In T4, for example, the sequence changes for 31 splicing defecti ve td intron mutants (19 phage mutants, 12 mutations from a cloned td gene) have been determined and the vast majority disrupt proposed intron RNA helices 37.39.
Single base changes in predicted RNA pairing regions which disrupt splicing, provides proof that a nucleotide is essential for the splicing reaction40.41. Genetic verification that a base pair (and thus a pairing region) exists, can be demonstrated by the isolation of a pseudorevertant that harbours a compensatory mutation at the nucleotide predicted to base pair with the original mutant nucleotide. The double mutant would be predicted to regenerate a WatsonCrick base pair in an essential RNA helix, thus permitting intron RNA to fold, resulting in the restoration of some degree of autocatalytic splicing42. Such experiments have been carried out using si te~pecific mutagenesis to verify the existence of group I Intron RNA pairing region in Tetrahymena and yeast
' t h d ' 43-45 F . ml oc on na . or the T4 Introns, however, only the P6 pairing region of the td intron has been proven by isolation of such second site suppressor mutations38.
Conclusion Our study demonstrated that 3' region of ORF in
the nrdB intron could also play significant role in I·· 40 sp ICIng . In contrast, genetic studies on td and other
group I introns demonstrated that intron ORF is not required for active splicing conformation39. This could probably be due to the small size of ORF of nrdB intron (294 bases) compared to that of td intron (778 bases) which looped out of the secondary structure models for RNA folding26
,46 . Present study of the nrdB mutants of phage T4 and their revertants
292 INDIAN J BlOCH EM. BIOPHYS., VOL. 38, OCTOBER 200 1
A A A
G P9.1 A A <?~~y A A CUGG
U Pl U-A (_
G-u C-g P9 G-c , A A
UUCC - 9 5 ~~y~y~~ uuucu U C GUACGUC
r---~-----, A A- - - - - - -) - - - - - - - - - -, UMU J116 C r - - - - - - )- - - U I ~
U-A U ~ A- U G-C G·U A C-G
G G-CGAACA G- U U U-A. A U-A P9.2 C G- C G A-U g<-G-C U
A G 8 A A C
U G C 3'
UA G CC
P4 C-G P7.2G G - C C-G P6 C C A - U P7 U-A C-G G GACU C- G
G ' U AA CA A A CUGGAU ~=~ G-C G A "-
A A G- C C G A A A U- A U- A UCGMAA G ~~y ~ J8J7
A G-C A U- A A - U C - G, U CUG G U A P611 A-U U-A ,U ~P7.1 /U P5 U-A - A - U P2 C- G '-
C-G U -::- A C-G C-G P3 C-G G-C G-C U· G "A- S
U A C-G G AU' G A-U
U U (ORF "I I ~ ~ _ _ _: g = ~ PO U 79mt 256mt L.. _____ , ~ - 8
LAUG~ UcC
P4-P6 P1 -P3 P7-P9.2
Fig. 2-A proposed model for tertiary structure of nrdB intron involved in self-splicing. [Solid lines show connectivities with each RNA domain whi le dashed lines indicate con nect ivities within the intact intron. Arrows superimposed on lines give 5 ' to 3' polarity . The 5' and 3' exon sequences are shown in lower case le tters and both sites are marked with arrows. Base-paired segments are designated as "P" and sec tion join the two paired region as "1")
further leads us to conclude that the conserved sequences of nrdB intron that are important in premRNA spli cing, especially the extreme ends of nrdB intron, were more critical for auto-catalysis , Moreover, the exon sequences immediately adjacent to the sp lice site were also important for splice site se lection40
.42. Finally, the data also strongly suggest that the cleavage of intron might require a well defined secondary structure which on folding would probably acquire a typical tertiary structure involving the P9.0 and PI 0 loops. These secondary and lor tertiary structures are crucial for the splicing of nrdB intron like other group I type of introns47
.
In view of the above discussion, we would like to propose a pLausible model to show the in volvement of various domains of nrdB intron resulting in the formation of typical tertiary structure (Fig. 2 ). This hypothetical model for the auto-catalysis of nrdB
intron is very close to that proposed earlier in case of 'T' I . 48 1 elra lymena \Otron . References
I Berezney R & Coffey D S (1977) J Cell Bioi 73, 616-620 2 Green M R ( 1986) Annll Rev Genet 20, 671-708 3 Padgett R A, Grabowski P J, Konarska M M, Seiler S &
Sharp P A( 1986) Annll Rev Biochem 55, I 119-1150 4 Morrissey J P & Tollervey D (1995) Trellds Biochelll Sci 20,
78-82 5 Potter S, Durovic P & Denni s P P (1995) Science 268, 1056-
1060 6 Doolittle R F ( 1995) Proc Natl Acad Sci USA 92, 2421-2423 7 Darnell J E & Doolittle W F (1996) Proc Natl Acad Sci USA
83,1271 -1 275 8 Belfort M ( 1990) Annu Rev Genet 24, 363-385 9 Brion P & Westhof F (1997) Anllu Rev Biophys Bioll1ol StrllCl
26,113-137 10 Wiltop-koning T H & Schumperli D ( 1994) Ellr J Biochem
219,25-42 II Chu F K, Maley G F, Maley F & Belfort M (1984) Proc Natl
Acad Sci USA 81,3049-3053 12 Sjoberg B M, Hahne S, Mathews C Z, Mathews C K, Rand K
N & Gait M J (1986) EMBO J 5,2031-2036
KHA el of.: A UN1QUE GROUP OF SEL-SPLICING INTRONS IN BACTERIOPHAGE T4 293
13 GOlt J M, Shub D A & Belfort M (1986) Cell 47, 81-87 33 Lambow itz A M (1989) Cell 56, 323-326 14 Cech T R (1990) All/III Rev Biochem 59, 543-568 34 Chu F K, Maley G F, West D K. Bel fort , M & Maley F 15 Sa ldanha R, Mohr G, Belfort M & Lambow itz A M ( 1993) (1986) Cell 45, 157- 166
FASEB J 7, 15-24 35 Shub D A, Gott J M, Xu M Q, Franz Lang B, Michel F. 16 Cech T R (1988) Gelle 73, 259-271 Tomaschenski J, Pedersen-lane J & Bel fort M (1988) Proc 17 Doherty E A, Herschlag D & Doudna J A( 1999) Biochemis/t )' Nail Acad Sci USA 85. 1151-1155
38, 2982-2990 36 Cate. J H, Good ing A R, Podell E, Zhou K, Golden B L. 18 Gultyaev A P, Vanbatenburg F H D & Pleij C W A (1995) J Kundrat C E, Cech. T R & Doudna J A (1996) Sciellce 273,
Mol Bioi 250. 37-5 1 1678-1685 19 Allai n F H-T & Varani G ( 1995) J Mol Bioi 250, 333-353 37 Brawn M D, Povinelli C M & Hall D H ( 1993) Biochelll 20 Wang J-F, Downs W D & Cech T R ( 1993) Sciellce 260, 504- Genel 3 1, 505-521
508 38 Ehrenman K. Schroeder R, Chandry P S, Hall D H & Belfort 21 Wang J-F & Cech T R (1992) Sciellce 256. 526-529 M (1989) Nllcleic Acid Res 17,9147-9163 22 Burke J M, Esherick J S, Burfeind W R & King J L ( 1990) 39 Hall D H, Povinelli, C N, Ehrenman K, Pedersen-lane J, Chu
Nal llre (Lo lldoll) 344. 80-82 F & Belfort M. ( 1987) Cell 48, 63-7 l 23 Cech T R. Zaug A J & Grabowsk i P J ( 198 1) Cell 27, 487- 40 Khan A U, Lal S K & Ahmad, M ( 1998) Biochem Biophys
496 Res CO III//1lI11 242, 10-15 24 Menger M. Tusch l T, Eckstein F & Porschke D (1996) 4 1 Lal S K & Hall D H ( 1993) Biochem Biophys Res COllI/ill III
BiochemisT"v 35. 147 10- 14716 196, 943-949
25 Wu M & Tonoco I Jr ( 1998) Proc NaTl Acad Sci USA 95, 42 Khan A U, Ahmad M & Lal S K (2000) Biochelll Biophys Res
11555- 11 560 COlllm1l1l 268. 359-364 26 Cate J H. Hanna R L & Doudna J A ( 1997) Nat Sl rtiCI Bioi 4, 43 Couture S, Ellington A D, Gerber A S, Cherry J M, Doudna J
553-558 A. Green R. Hanna M, Pace U, Raj agopal J & Szostak J W 27 Hampel A & Cowan J A ( 1997) Chem Bioi 4, 5 13-5 17
28 esbi tt S. Hcgg L A & Fedor M J ( 1997) Chem Bioi 4, 6 19- ( 1990) J Mol Bio12 15, 345-358
630 44 FloI' P J, Flanegan J B & Cech T R. ( 1989) EM BO J 8, 339 1-
29 Young K J. Gi ll F & Grasby J A (1997) Nllcleic Acid Res 25, 3399 45 Burke. J M (1988 ) Gelle 73 . 273-294
3760-3766 46 Sc lavi B. Sullivan M, Chance M R, Brenowitz M & Woodson 30 Murray J B. Seyhan A A. Walter N G. Burke J M & Scott W
G ( 1998) Chem Bioi 5. 587-595 S A ( 1998) Sciellce 279, 1940-1943
31 Blum B. Stu rm N R. Simpson A M & Simpson L ( 1991) Cell 47 La! S K & Hall D H ( 1997) Biochim Biophvs Acta 1350. 89-65. 543-550 97
32 Herschlag D & Cech T R ( 1990) Biochemis/t)' 29. 10 159- 48 Doherty E A & Doudna J A (1997) Biochell1isl/), 36. 3159-10171 3 169