12
Copyright Ó 2008 by the Genetics Society of America DOI: 10.1534/genetics.108.094318 Insulator and Ovo Proteins Determine the Frequency and Specificity of Insertion of the gypsy Retrotransposon in Drosophila melanogaster Mariano Labrador, 1 Ky Sha, 2 Alice Li 3 and Victor G. Corces 4 Department of Biology, Johns Hopkins University, Baltimore, Maryland 21218 Manuscript received July 23, 2008 Accepted for publication September 11, 2008 ABSTRACT The gypsy retrovirus of Drosophila is quite unique among retroviruses in that it shows a strong preference for integration into specific sites in the genome. In particular, gypsy integrates with a frequency of .10% into the regulatory region of the ovo gene. We have used in vivo transgenic assays to dissect the role of Ovo proteins and the gypsy insulator during the process of gypsy site-specific integration. Here we show that DNA containing binding sites for the Ovo protein is required to promote site-specific gypsy integration into the regulatory region of the ovo gene. Using a synthetic sequence, we find that Ovo binding sites alone are also sufficient to promote gypsy site-specific integration into transgenes. These results indicate that Ovo proteins can determine the specificity of gypsy insertion. In addition, we find that interactions between a gypsy provirus and the gypsy preintegration complex may also participate in the process leading to the selection of gypsy integration sites. Finally, the results suggest that the relative orientation of two integrated gypsy sequences has an important role in the enhancer-blocking activity of the gypsy insulator. R ETROVIRAL DNA integration into the host ge- nome is an essential step for production and replication of viral RNA. It has been traditionally difficult to study the factors controlling selection of integration sites, since most retroviruses integrate throughout the genome with no apparent DNA sequence specificity (Bushman et al. 2005). However, detailed analysis of multiple genomic integration sites in vivo has revealed that retroviruses have a strong preference for certain geno- mic regions. In particular, retroviruses integrate preferen- tially into actively transcribed DNA, which will thereafter facilitate transcription of the provirus (Scherdin et al. 1990). The distribution of retroviral integration sites along chromosomes suggests that open chromatin favors retroviral insertion, since integration events are favored in transcriptionally active chromatin and are rare in DNA sequences associated with heterochromatin (Lewinski et al. 2005; Yant et al. 2005). However, chromatin state or DNA accessibility could not be the only factor influenc- ing integration, since different retroviruses manifest pref- erences for integration that are unlikely to be only the result of chromatin organization. For example, both HIV and murine leukemia virus (MLV) integrate in actively transcribing DNA, but HIV integrates with equal fre- quency throughout all transcribed DNA, whereas MLV integrates preferentially into transcription start sites (Schroder et al. 2002; Wu et al. 2003). The mechanism of integration of retrotransposons is fundamentally identical to that of retroviruses (Bushman 2003). However, constraints imposed by small genome sizes have led some retrotransposons to the acquisition of mechanisms for site-specific integration. The best exam- ples of site-specific integration are found in non-LTR retrotransposons such as the Drosophila telomeric ele- ments HeT-A, TART , and TAHRE or rDNA elements such as the Drosophila R1 and R2 non-LTR retrotransposons (Yang et al. 1999; Christensen and Eickbush 2005; George et al. 2006). A number of examples involving integrases from yeast LTR retrotransposons have also shown that retroviral-like integrases have evolved to acquire strong site-specific integration properties (Sandmeyer 2003; Zhu et al. 2003; Brady et al. 2008; Gao et al. 2008). For example, the integration of Saccharomyces cerevisiae retrotransposons Ty1 and Ty3 is associated with RNA polymerase III transcription (Yieh et al. 2000; Bachman et al. 2005; Mou et al. 2006) and the Tf1 retrotransposon from Schizosaccharomyces pombe integrates near RNA polymerase II promoters (Singleton and Levin 2002; Bowen et al. 2003). In other examples, transposable elements are targeted to heterochromatic sites by tethering mechanisms involving interactions be- tween the integrase and DNA binding proteins. Target- ing of the Ty5 retrotransposon from S. cerevisiae to heterochromatin, for example, requires a six-amino-acid motif at the C terminus of the Ty5 integrase that interacts with the heterochromatin protein Sir4 (Gai and Voytas 1 Present address: Department of Biochemistry, Cellular and Molecular Biology, The University of Tennessee, Knoxville, TN 37996. 2 Present address: Department of Biology, MIT, 77 Massachusetts Ave., Cambridge, MA 02139. 3 Present address: Mount Sinai School of Medicine, New York, NY 10029. 4 Corresponding author: Department of Biology, Emory University, 1510 Clifton Rd. NE, Atlanta, GA 30322. E-mail: [email protected] Genetics 180: 1367–1378 (November 2008)

Insulator and Ovo Proteins Determine the Frequency and

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Insulator and Ovo Proteins Determine the Frequency and

Copyright � 2008 by the Genetics Society of AmericaDOI: 10.1534/genetics.108.094318

Insulator and Ovo Proteins Determine the Frequency and Specificity ofInsertion of the gypsy Retrotransposon in Drosophila melanogaster

Mariano Labrador,1 Ky Sha,2 Alice Li3 and Victor G. Corces4

Department of Biology, Johns Hopkins University, Baltimore, Maryland 21218

Manuscript received July 23, 2008Accepted for publication September 11, 2008

ABSTRACT

The gypsy retrovirus of Drosophila is quite unique among retroviruses in that it shows a strong preferencefor integration into specific sites in the genome. In particular, gypsy integrates with a frequency of .10% intothe regulatory region of the ovo gene. We have used in vivo transgenic assays to dissect the role of Ovoproteins and the gypsy insulator during the process of gypsy site-specific integration. Here we show that DNAcontaining binding sites for the Ovo protein is required to promote site-specific gypsy integration into theregulatory region of the ovo gene. Using a synthetic sequence, we find that Ovo binding sites alone are alsosufficient to promote gypsy site-specific integration into transgenes. These results indicate that Ovo proteinscan determine the specificity of gypsy insertion. In addition, we find that interactions between a gypsyprovirus and the gypsy preintegration complex may also participate in the process leading to the selection ofgypsy integration sites. Finally, the results suggest that the relative orientation of two integrated gypsysequences has an important role in the enhancer-blocking activity of the gypsy insulator.

RETROVIRAL DNA integration into the host ge-nome is an essential step for production and

replication of viral RNA. It has been traditionally difficultto study the factors controlling selection of integrationsites, since most retroviruses integrate throughout thegenome with no apparent DNA sequence specificity(Bushman et al. 2005). However, detailed analysis ofmultiple genomic integration sites in vivo has revealedthat retroviruses have a strong preference for certain geno-mic regions. In particular, retroviruses integrate preferen-tially into actively transcribed DNA, which will thereafterfacilitate transcription of the provirus (Scherdin et al.1990). The distribution of retroviral integration sitesalong chromosomes suggests that open chromatin favorsretroviral insertion, since integration events are favoredin transcriptionally active chromatin and are rare in DNAsequences associated with heterochromatin (Lewinski

et al. 2005; Yant et al. 2005). However, chromatin state orDNA accessibility could not be the only factor influenc-ing integration, since different retroviruses manifest pref-erences for integration that are unlikely to be only theresult of chromatin organization. For example, both HIVand murine leukemia virus (MLV) integrate in activelytranscribing DNA, but HIV integrates with equal fre-

quency throughout all transcribed DNA, whereas MLVintegrates preferentially into transcription start sites(Schroder et al. 2002; Wu et al. 2003).

The mechanism of integration of retrotransposons isfundamentally identical to that of retroviruses (Bushman

2003). However, constraints imposed by small genomesizes have led some retrotransposons to the acquisition ofmechanisms for site-specific integration. The best exam-ples of site-specific integration are found in non-LTRretrotransposons such as the Drosophila telomeric ele-ments HeT-A, TART, and TAHRE or rDNA elements suchas the Drosophila R1 and R2 non-LTR retrotransposons(Yang et al. 1999; Christensen and Eickbush 2005;George et al. 2006). A number of examples involvingintegrases from yeast LTR retrotransposons have alsoshown that retroviral-like integrases have evolved to acquirestrong site-specific integration properties (Sandmeyer

2003; Zhu et al. 2003; Brady et al. 2008; Gao et al.2008). For example, the integration of Saccharomycescerevisiae retrotransposons Ty1 and Ty3 is associated withRNA polymerase III transcription (Yieh et al. 2000;Bachman et al. 2005; Mou et al. 2006) and the Tf1retrotransposon from Schizosaccharomyces pombe integratesnear RNA polymerase II promoters (Singleton andLevin 2002; Bowen et al. 2003). In other examples,transposable elements are targeted to heterochromaticsites by tethering mechanisms involving interactions be-tween the integrase and DNA binding proteins. Target-ing of the Ty5 retrotransposon from S. cerevisiae toheterochromatin, for example, requires a six-amino-acidmotif at the C terminus of the Ty5 integrase that interactswith the heterochromatin protein Sir4 (Gai and Voytas

1Present address: Department of Biochemistry, Cellular and MolecularBiology, The University of Tennessee, Knoxville, TN 37996.

2Present address: Department of Biology, MIT, 77 Massachusetts Ave.,Cambridge, MA 02139.

3Present address: Mount Sinai School of Medicine, New York, NY 10029.4Corresponding author: Department of Biology, Emory University, 1510

Clifton Rd. NE, Atlanta, GA 30322. E-mail: [email protected]

Genetics 180: 1367–1378 (November 2008)

Page 2: Insulator and Ovo Proteins Determine the Frequency and

1998; Xie et al. 2001). It has also been proposed that thechromodomain (CHD domain) from certain transpos-able element chromointegrases targets the retrotranspo-son for insertion into sites bearing the specific epigeneticmarks recognized by the CHD domain (Gao et al. 2008).

In contrast to mammalian retroviruses, Drosophilaretroviruses such as gypsy, ZAM, or Idefix, display a highrate of site-specific integration into certain regions ofthe genome (Mevel-Ninio et al. 1989; Desset et al. 1999;Leblanc et al. 1999; Conte et al. 2000). The mechanismsgoverning this specificity however are poorly under-stood, but the genetic tools available in Drosophilaprovide a unique opportunity to analyze retroviral siteintegration specificity in higher eukaryotes. In particu-lar, gypsy insertions into the ovo locus occur in the germline of �10% of the female offspring from motherscarrying permissive mutations in the flamenco ( flam)locus (Prud’homme et al. 1995; Dej et al. 1998). Gypsyintegrations take place specifically into a sequence of�1.3 kb spanning the 59 regulatory region of the ovogene (Dej et al. 1998). The flam locus is located in theheterochromatin of the X chromosome (Prud’homme

et al. 1995) and produces a long noncoding RNA thatcontrols transcription of the gypsy retrovirus throughthe piwiRNA pathway (Brennecke et al. 2007).

The process of gypsy transposition is maternallyregulated, involving maternally inherited gypsy particlesthat originate in the developing oocyte of flam mutantfemales. These females fail to produce the flam RNA,allowing the transcription of euchromatic gypsy ele-ments in the follicle cells surrounding the oocyte duringoogenesis. Transcription of gypsy in follicle cells leads tothe formation of virus particles that infect the oocyteand subsequently participate in the integration of gypsyin the germ line of the resulting embryo after fertiliza-tion (Song et al. 1994, 1997). These integration eventstake place preferentially in the ovo gene, whose productis necessary for the development of the female germ lineand the normal progression of oogenesis (Lu et al.1998). The ovo gene encodes two isoform proteins, Ovo-A and Ovo-B, which have a common DNA-bindingdomain but different N-terminal domains. Ovo-B posi-tively regulates the ovo promoter, whereas Ovo-A func-tions as a negative regulator of the ovo promoter(Andrews et al. 2000). Adult females homozygous fora null mutation of the ovo gene do not develop germ linecells. The ovoD1 allele is caused by a point mutation thatcreates a new in-frame methionine codon in the 59

region of ovo, adding an extra amino terminus domainto Ovo-B that is normally only present in the wild-typeOvo-A protein (Mevel-Ninio et al. 1996). The ovoD1

allele is dominant negative and causes female sterilityeven when heterozygous. The sterility is due to theexpression of OvoD1B protein, which is made at the sametime of development as Ovo-B but has the repressoractivity of Ovo-A; the presence of OvoD1B is sufficient toarrest oogenesis at stage 4 (Lu et al. 1998).

Insertion of gypsy into the ovoD1 allele in a heterozy-gous female reverts to fertility by preventing theexpression of the OvoD1B protein, although the re-version occurs only in those germ cells in which gypsyis inserted into the ovoD1 sequence. The ability of gypsyto integrate specifically into ovo sequences was analyzedby Dej et al. (1998). These studies concluded that gypsyintegrates in at least seven different target sites localizedwithin a 200-bp sequence present in the promoterregion of the ovo gene. Close analysis of these sitesreveals a very relaxed consensus sequence consisting ofsix alternating pyrimidines and purines. The weakconservation of the observed target sequence suggeststhat gypsy site-specific integration is not due to a directinteraction of the gypsy integrase with these sequences.Instead, it has been proposed that Ovo proteins maymediate gypsy insertion specificity by promoting protein–protein interactions between Ovo or an associated pro-tein and the gypsy preintegration complex (Labrador

and Corces 2001).The gypsy retrovirus of Drosophila also exhibits the

interesting property of blocking enhancers from acti-vating promoters when gypsy is inserted between them.This property is referred to as insulator activity andresides in the Suppressor of Hairy wing [Su(Hw)]binding sites present in a 350-bp sequence located inthe 59-UTR of the gypsy retroviral genome (Geyer et al.1986; Spana et al. 1988; Gerasimova and Corces 1998,2001; Gerasimova et al. 2000). In addition, gypsy insu-lators are also able to buffer transgenes from positioneffects by preventing heterochromatin from spreadingthrough the chromatin fiber (Roseman et al. 1993).There is mounting evidence suggesting that gypsy insu-lators function by creating chromatin domains mostprobably defined by the interaction between adjacentinsulator sites in chromosomes. Molecular evidence forsuch interactions has been obtained by measuring thedistance between adjacent gypsy insertions in wild-typeand in su(Hw) mutant cells (Gerasimova et al. 2000).These experiments revealed that the two gypsy sequenceswere significantly closer during interphase when theSu(Hw) protein was present. It has been proposed thatsuch interactions might create chromatin domains bylooping out the DNA contained between two interactinginsulators. Additional evidence in support of this modelhas been provided by showing the presence of DNAloops attached at their base to the nuclear matrix by thegypsy insulator in the nucleus of Drosophila imaginaldisc cells (Byrd and Corces 2003). Interaction betweengypsy insulators is also supported by data showing thattwo adjacent insulators were able to cancel each other,no longer exerting their enhancer blocking effect whenlocated between the enhancer and the promoter of areporter gene (Cai and Shen 2001; Muravyova et al.2001; Kuhn et al. 2003).

The molecular basis for interactions between individ-ual insulators is not well understood but it has been

1368 M. Labrador et al.

Page 3: Insulator and Ovo Proteins Determine the Frequency and

suggested that Modifier of mdg4 [Mod(mdg4)] andCP190, both components of the insulator complex,might facilitate such interactions by mediating protein–protein contacts between the BTB domains present inthe two proteins (Gause et al. 2001; Ghosh et al. 2001;Golovnin et al. 2007). These two properties of gypsy, site-specific integration and insulator activity, have been thesubject of intense but unrelated studies during the pasttwo decades (Mevel-Ninio et al. 1989; Bucheton 1995;Prud’homme et al. 1995; Dej et al. 1998; Gerasimova andCorces 2001; Labrador and Corces 2001; Capelson

and Corces 2004). Here, we attempt to analyze these twoproperties simultaneously in an effort to understand howinsulators might mediate genome organization andhow this organization may influence retroviral selectionof integration sites through the genome. We have previ-ously developed an assay to show that the 59 regulatoryregion of ovo is able to recruit gypsy insertions indepen-dently of its position in the genome (Labrador andCorces 2001). We have also provided genetic evidencesuggesting that the Ovo protein is directly implicated insuch recruitment. We now take advantage of the abilityof inducing two consecutive gypsy insertions into a yellowreporter gene to analyze the role that interactions be-tween a gypsy provirus and the gypsy preintegrationcomplex may play in the selection of retrovirus integra-tion sites and the effect of the relative orientation ofinteracting proviruses on the enhancer-blocking activityof the gypsy retrovirus.

MATERIALS AND METHODS

Cloning DNA sequences containing Ovo binding sites: Thethree constructs ovoWT, ovoEN, and ovoDIM were obtained usingpolymerase chain reaction (PCR) following standard proce-dures (94� for 1.0 min, 60� for 1.5 min, and 72� for 2.0 min).Each sequence was cloned into the NotI site of the Casper-yellowvector as described (Labrador and Corces 2001). ovoWT wasconstructed by PCR amplification using genomic DNA fromthe Oregon R wild-type strain as template. The primers OVO-F(59-GATGGGTCGCGGCCGCTTGAAAACAGCAGAAAAATA-39)and OVO-R (39-ATACTAATTAAGCATAAATTCGCCGGCGTATTGTAC-59) were used to amplify ovoWT. ovoEN and ovoDIM were eachbuilt from a set of 50–60 overlapping oligonucleotides as follows.

OVO-dim1: 39-CTACCCAGCGCCGGCGAACTTTTGTCGTCTTTTTATAT-59

OVO-dim2: 59-CAGCAGAAAAATATGCCAATTTGTTTTGAATTTAACAGATTTT-AATAAATAGTTTTAACTTAATGG-39

OVO-dim3: 39-CAAAATTGAATTACCAAGTCGTGTAAAATTTGAAGGGAT-TCATATACGTATCGGGA-59

OVO-dim4: 59-ATATGCATAGCCCTGTTTTAATTTTTAATTTCAAGTTAATAA-CTTTTATTCACATCATACT-39

OVO-dim5: 39-ATAAGTGTAGTATGAGTCTATCATATAAAATCACAT-GAGAAAACTCTATTAAAGTAGTG-59

OVO-dim6: 59-AGATAATTTCATCACTTGTTGATTCGTCGTTGGCAACTCTGC-ACCCACGATTCCAAATAGGATTA-39

OVO-dim7: 39-AAGGTTTATCCTAATCTTACTGTGTGTGGGTACCTTAACTTC-TTACATTCCTTCTTTCC-59

OVO-dim8: 59-GTAAGGAAGAAAGGGAGTGTGATCGAAAGTCCGTTCC-TTTGCTCAAATAGATAGCAATCGTC-39

OVO-dim9: 39-CTATCGTTAGCAGGCTCGCTTGCCTGTCTGTTTAAAGACTCTT-AGCGTGAAGAAACGAAGAGAG-59

OVO-dim10: 59-TTCTTTGCTTCTCTCATTTTCGGTGATTTTAGAATGCTTGC-TTATTGTGTGTGCACTCGAAAGTTCTATT-39

OVO-dim11: 39-GAGCTTTCAAGATAATCCAAGGTGTCCCAAAAATATGT-ATACTAATTAAGCATAAATTCGCCGGCGTCTTGTAC-59

OVO-dim12: 59-TTTATACATATGATTAATTCGTATTTAAGCGGCCGCAGCAGA-TGCTACTTAACGT-39

OVO-en1: 59-AAATGACGATGGGTCGCGGCCGCTTGAAAACAGCAGAAAAA-TAAAGCCGTTAAAATT-39

OVO-en2: 39-ATTTCGGCAATTTTAACTTTTTCACGTCAAATTTACATTGAC-AATTATACTCGTCATATAGT-59

OVO-en3: 59-TGAGCAGTATATCACGACTACAGTTAGAATTAGCTCTACGG-ATCCT-39

OVO-en4: 39-GAGATGCCTAGGAAAAATGTCAATGTATCGTCTCAGGAACC-TAAAAGGCAACGAAAAAA-59

OVO-en5: 59-TTCCGTTGCTTTTTTATTGAGGCTGTGTGCAAAATAAAGCCG-TTAAAATTGAAGGTTCCAC-39

OVO-en6: 39-AACTTCCAAGGTGCCAATTTACATTGACAATTATACCGA-TAAATATGTCGTCCGAGGA-59

OVO-en7: 59-CAGCAGGCTCCTTTTTACAGTTACATATGATTAATTCGTAT-TTAAGCGGCCGCAGCAGATGCTACTTAACGT-39

OVO-en8: 39-TTGTACAATAAATGATA-59

We used a two-step PCR amplification as previously de-scribed (Dillon and Rosen 1990). In the first PCR step, theovoEN template was assembled using 1.0 ml (100 ng/ml) of eachof the OVO-en1–OVO-en8 overlapping oligonucleotides plus0.5 ml Ex Taq polymerase (TaKaRa Biomedicals). This PCRstep consisted of seven cycles at 94� for 1.0 min, 50� for 1.5 min,and 72� for 2.0 min. The ovoDIM template was assembled inidentical fashion using OVO-dim1–OVO-dim12. Once theovoEN and ovoDIM templates were assembled, they were subjectedto the second round of PCR amplification. The second-stepPCR amplification was identical to that used to amplify ovoWT

from genomic DNA, using the same end primers containingNotI restriction sites. After the second-step PCR amplification,cloning of ovoEN and ovoDIM into the Casper-yellow vector wasidentical to the procedure used to clone ovoWT into the NotIsites of Casper yellow. The Casper-yellow plasmids containingovoWT, ovoEN, and ovoDIM were sequenced to confirm that all threecorrespond to the expected sequences shown in Figure 1. Theorientation of ovoWT, ovoEN, and ovoDIM in the Casper-yellow vectoris opposite to the direction of transcription of the yellow gene.Nucleotide numbers indicating the location of insertionsthroughout the manuscript are as described by Geyer andCorces (1987). To designate transgenes carrying the Casper-yellow plasmids containing ovo we use the terminology P[yovo;w1]. The super index ovoWT, ovoEN, and ovoDIM was added todenote the particular ovo sequences cloned into yellow.

Induction of gypsy mobilization using flam females: Stockswere maintained on standard cornmeal medium at 25� in ahumidity-controlled incubator. The flam flies were maintainedas a y v f mal flam/FM3 stock. This stock segregates homozygousflam flies and heterozygous flam/FM3 females. Only heterozy-gous females were used to maintain the stock each generation.Virgin homozygous y v f mal flam were collected from the stockfor crosses requiring homozygous flam flies. ovoD1 mutants weremaintained in a stock by crossing ovoD1 males to femalescarrying attached X chromosomes. Flies carrying P[yovo; w1]transgenes display a yellow wild-type phenotype and weremaintained in a y w67c background (Labrador and Corces

2001). To induce gypsy integrations into the P[yovo; w1] trans-genes, virgin females homozygous for y v f mal flam werecrossed to P[yovo; w1] transgenic males. Since a single insertionin the germ line can give rise to several mutant flies, singlefemale crosses were performed instead of mass crosses. Virgin

Retroviral Insertion Specificity 1369

Page 4: Insulator and Ovo Proteins Determine the Frequency and

F1 females with genotype y v f mal flam/y w67c; P[yovo; w1] wereindividually crossed to y w67c males. To detect gypsy insertionsinto the P[yovo; w1] transgene, the offspring of this cross (F2

generation) was systematically screened for individuals with amutant yellow phenotype as follows. Individual y v f mal flam/yw67c; P[yovo; w1] females crossed to several y w67c males weretransferred to new vials containing fresh food every 3–4 days. Aminimum of three vials was screened per cross and crosses withlow numbers of offspring were discarded. The yellow pheno-type of the F2 progeny containing the P[yovo; w1] transgene wasscreened under a dissecting microscope and 90–105 flies werecounted and screened per cross. This process was repeated twoto three times per vial to allow new flies to eclose and thenumber of flies counted each time was annotated in the vial.Only independent mutant flies (from different mothers) wererecorded and crossed in new vials for further analysis. As apositive control, to test whether flam permissive femalesproduced active virus particles, virgin y v f mal flam femaleswere crossed to ovoD1 males and the offspring were screened forfertile females as described previously (Labrador and Corces

2001). We used the ovoD1 reversion assay (Prud’homme et al.1995) to determine the gypsy mobilization activity of the y v fmal flam stock. Five to ten y v f mal flam virgin females werecrossed with ovoD1 males and kept in the same vial for 3–5 days.After this time, parental flies were removed from the vials andthe offspring larvae were allowed to develop. Groups of 5–10ovoD1/ovo1 females from the offspring were transferred to newvials and crossed to wild-type males. ovoD1/ovo1 females aresterile unless the ovoD1 allele carries a new insertion of the gypsyelement integrated into the ovoD1 regulatory region. Afterseveral days, vials were examined for the presence of eggs orlarvae. All the mothers in vials containing eggs were dissectedunder the microscope and females displaying functionalovaries were considered fertile and counted as a gypsy in-tegration event into the ovoD1 allele.

Analysis of gypsy insertions by PCR and DNA sequencing:Genomic DNA was extracted from 10–20 flies for the detectionof gypsy integrations into P[yovo; w1] transgenes. DNA extrac-tion was carried out using the potassium acetate quick prep asdescribed (Dej et al. 1998). The primers used to amplify gypsyinsertions were gypsy-P3 CTTTGCCGAAAATATGCAATG andgypsy-P1 CAACATGACCG-AGGAGCGGTCATAAAC locatedoutside the LTRs in the 59 and 39 ends of gypsy, respectively,and yP0030 GCCCGATTACCACATTGAG, yP1400 GTTGCACAAAA-TTACCGGC, yP1450 CTGTGGGTGCAATGATTAG,yP1120 TCATTGCCGCAAGC-TCTG, and yP2900 CGCCACGGTCCACAGAAGAG, which are located at different pointsin the regulatory sequences of the yellow gene and thereforecould be present adjacent to new gypsy insertions. A combina-tion of P1 or P3 with a yellow primer will detect the insertion ofgypsy in either orientation. Alternative combinations of P3 orP1 with alternative primers in the yellow gene can detect thesame insertion at the opposite end of gypsy and was used toconfirm the results obtained with the first combination ofprimers. Approximately 50 ng of genomic DNA were used perPCR amplification. Amplified DNA fragments containing gypsyinsertion sites were directly sequenced using gypsy-LTR3AAATCGCTATCGCCACAAGGC or gypsy-LTR5 GCAGCGTGAAG-CAACACTCCC as sequencing primers.

RESULTS

Ovo protein binding sites promote gypsy insertioninto a yellow transgene: We have previously suggestedthat binding of the Ovo protein to the regulatory regionof the ovo gene could account for the specificity of

integration into this region of the gypsy retrotransposon(Labrador and Corces 2001). In previous experi-ments, a DNA fragment containing �1.3 kb from theregulatory region of the ovo gene, including the pro-moter, was inserted into the Casper-yellow transformationvector. Transgenes containing these ovo gene sequencesturned into highly specific targets for gypsy insertions,which occurred at a rate similar to that observed at theendogenous ovo gene (Dej et al. 1998; Labrador andCorces 2001).

To provide molecular evidence supporting the hypoth-esis that Ovo proteins are capable of targeting gypsyinsertions into Ovo binding sites, we first asked whether asubset of the ovo sequences used in previous experiments,containing binding sites for the Ovo protein, would alsoinduce a high rate of gypsy insertions. To this end we usedPCR to specifically amplify a 508-bp DNA fragment(ovoWT ) containing nucleotides 512–1020 from the ovogene regulatory sequence (Mevel-Ninio et al. 1996).This fragment lacks the promoter region of the Ovo-Aisoform but still contains the promoter region of the Ovo-B isoform and includes five Ovo protein binding sites(Figure 1B). We then cloned the ovoWT fragment into theNotI site at nucleotide 1980 of the Casper-yellow vector(Figure 1A), which is located between the wing blade andbody enhancers and the promoter of the yellow gene(Labrador and Corces 2001). To determine whetherthe binding of Ovo proteins was necessary to specificallyrecruit gypsy insertions into Casper-yellow transgenes, wegenerated a synthetic DNA sequence identical to ovoWT

but containing specific point mutations into the fiveexisting Ovo binding sites. The Ovo binding sites inFigure 1 were identified following the descriptions by Lu

et al. (1998).The induced point mutations result in sequences that

strongly deviate from Ovo protein consensus bindingsites, whereas the flanking sequences remain identical tothose in ovoWT (Figure 1C). We refer to this sequence asovo DIM. The ovoDIM fragment was also cloned into theCasper-yellow vector and plasmids containing ovoWT andovoDIM were independently microinjected into y w; D2-3Sb/TM6 embryos. Transgenic lines obtained from bothconstructs show a wild-type yellow phenotype in a y1

mutant background, indicating that the yellow ovoWT andovoDIM transgenes (P[yovoWT; w1] and P[yovoDIM; w1], respec-tively) are able to express the Yellow protein in wingblades, body cuticle, and bristle tissues (see Figure 2B).

To test the role of Ovo binding sites in the specifica-tion of gypsy insertion, two independent transgenic linesfrom each construct were selected and transgenic maleshomozygous for the insertion were crossed to flamfemales. Gypsy elements are highly transcribed in thefollicle cells surrounding the developing oocytes ofthese females, producing virus particles that will infectthe oocyte and subsequently will integrate into the germline DNA of the offspring (Pelisson et al. 1994; Song

et al. 1994, 1997). Insertions of gypsy into P[yovo; w1]

1370 M. Labrador et al.

Page 5: Insulator and Ovo Proteins Determine the Frequency and

transgenes can be detected in the F2 generation by thechanges in the yellow phenotype induced by the gypsyinsulator after integration of gypsy into the regulatoryregion of the P[yovo; w1] transgene (Labrador andCorces 2001). For each experimental cross, controlsusing the ovoD1 reversion assay (Mevel-Ninio et al. 1989;Prud’homme et al. 1995) were performed to ensure thatgypsy was actively transposing in the flam stock (Table 1).We analyzed an average of 100 F2 flies per each y v f malflam/y w; P[yovo; w1] F1 parent. The actual number of F2

descendants counted for each cross was 95–105 and thisvariability was randomized by using a large number of F1

progenitors and discarding low yield crosses equallyamong all samples. Statistical significance was calcu-lated using the x2 contingency tables, assuming that theprobability of gypsy insertions into transgenes wasuniform across vials within each sample.

Results from these experiments indicate that flies con-taining P[yovoWT; w1] transgenes show a high frequencyof gyspy insertions into yellow, whereas flies containingP[yovoDIM; w1] transgenes did not yield a single gypsy in-sertion. Table 1 indicates the number of flies analyzedand the frequencies of gypsy insertions in the P[yovo; w1]transgenes from different transgenic lines. From 395 F1

females analyzed, a total of 43 new insertions occurredinto P[yovoWT; w1] trangenes (10.85%), whereas analysis of253 F1 P[yovoDIM; w1] females did not reveal any new gypsyinsertion (Table 1). Therefore, these results suggest thatremoval of all Ovo binding sites from the ovoWTsequenceeliminates the ability of gypsy to specifically integrate intoP[yovo; w1] transgenes, suggesting that the frequency and

specificity of gypsy insertions into the regulatory regionof the ovo gene depends on the presence of the Ovoprotein.

A synthetic DNA sequence containing an array ofeight Ovo binding sites flanked by random interveningsequences actively promotes gypsy insertion into ayellow transgene: Although the experiments describedabove strongly suggest a requirement of Ovo proteins tospecifically recruit gypsy insertions, we could not discarda role of the flanking sequences in this process. It ispossible that these sequences could recruit additionalproteins with an active role in targeting gypsy forintegration. Therefore, we asked whether the presenceof Ovo binding sites flanked by random DNA sequencesin a synthetic sequence will also specifically induce ahigh rate of gypsy insertion. We recreated the Ovobinding sites previously identified in the ovo regulatoryregion (Lu et al. 1998) to generate a 314-bp syntheticDNA sequence, named ovoEN (Figure 1D). The ovoEN

sequence has eight Ovo binding sites identical to thosefound in wild-type ovo sequences (Figure 1A) exceptthat they are flanked by random DNA sequences (Figure1D). The ovoEN sequence was inserted into the NotI site ofthe same Casper-yellow transformation vector used in theprevious experiments (Figure 2A) and the resultingCasper-yellowovoEN DNA was microinjected into y w ; D2-3St/TM6 Drosophila embryos.

Two transgenic stocks carrying yellowovoEN transgenes(P[yovoEN; w1]) were subsequently analyzed to test theability of the ovoEN sequence to specifically promote newgypsy integrations (Table 1). The phenotype of trans-

Figure 1.—Ovo DNA sequences cloned intothe yellow regulatory region of the different P[yovo;w1] transgenes. (A) Schematic drawing of the yel-low gene with the location of ovo sequences. Theblack box indicates the upstream regulatory re-gion and the white boxes indicate exons 1 and2 of the yellow gene. Tissue-specific wing (W),body (B), and bristle (Br) enhancers are indi-cated as ovals. The open triangle indicates theOvo sequences inserted between wing and bodyenhancers and the yellow promoter at nucleotide1980 of the yellow gene. (B) DNA fragment 508 bplong from the wild-type regulatory region of theovo gene containing five Ovo binding sites. (C)Synthetic sequence identical to the one shownin B but with the Ovo binding sites mutated toprevent binding of Ovo proteins. (D) Synthetic314-bp sequence containing eight Ovo bindingsites separated by random DNA sequences.

Retroviral Insertion Specificity 1371

Page 6: Insulator and Ovo Proteins Determine the Frequency and

genic flies carrying P[yovoEN; w1] transgenes was indistin-guishable from that of wild type, indicating that thesynthetic ovoEN sequence has no effect on the expressionof the yellow transgene (Figure 2B). Gypsy insertions intothe P[yovoEN; w1] transgene were induced after activa-tion of gypsy transposition in crosses with flam mutantfemales. Insertions were identified by their effect on theyellow phenotype in adult flies. Gypsy insertions intothe P[yovoEN; w1] transgene were identified in 24 of 488(4.92%) analyzed females, further supporting the hy-pothesis that the presence of Ovo proteins is responsible

for recruiting new gypsy insertions into the transgene.The lower frequency of gypsy insertion into ovoEN vs.ovoWT sequences may perhaps reflect a role of the se-quences flanking the Ovo binding sites in the bindingaffinity of the Ovo proteins.

Gypsy insertions also occur into sequences flankingOvo binding sites: Analysis of the phenotypes emergingfrom new gypsy insertions into P[yovo; w1] transgenesunexpectedly revealed that some insertions did not takeplace into ovo sequences. For example, flies from thestrain P[yovoEN; w1] 2.4 ywing have wild-type black pigmen-

Figure 2.—Gypsy-induced yel-low phenotypes depend on theintegration sites as well as the rel-ative orientation of gypsy. (A) Theyellow reporter gene in the Cas-per-yellowovo transformation vectorshowing the synthetic DNA se-quence containing eight Ovobinding sites. (B) y w male con-taining the P[yovoEN; w1] 2.4 trans-gene shows black wing blade,black body, and black bristles.(C and E) Integration of gypsy be-tween the wing and body en-hancers results in yellow wingsand black body phenotype. (D)Integration of gypsy between thebody enhancer and the promoterresults in yellow wings and bodyphenotype. (F) Integration of asecond gypsy between the bodyenhancer and promoter, in oppo-site orientation to the preexistingone (C and E), results in blackwings and yellow body, indicatingthat the wing enhancer is capableof bypassing the enhancer-block-ing activity of the two insulators.(G) Integration of a second gypsybetween the body enhancer andpromoter, in the same orienta-tion as the preexisting gypsy inser-tion (C and E), results in yellowwings and yellow body, suggestingthat both enhancers, wing andbody, are prevented from activat-ing the yellow promoter by theenhancer-blocking activity of thetwo insulators. (H) Two insertionsin opposite orientations with inte-gration sites between the wing andbody enhancers result in wild-typeyellow phenotype.

1372 M. Labrador et al.

Page 7: Insulator and Ovo Proteins Determine the Frequency and

tation in the body but yellow wing blades. This pheno-type could be explained if a gypsy insertion occurredbetween the wing and body enhancers, with the gypsyinsulator preventing the wing enhancer from activatingtranscription in wing tissues, but allowing the bodyenhancer to activate transcription in the body cuticle(Figure 2E). The insertion of gypsy between the wingand body enhancers would be unexpected, since theovoEN sequences are placed between the body enhancerand the promoter of yellow. The location of the gypsyinsertion site in the P[yovoEN; w1] 2.4 ywing transgene wasdetermined using PCR. Gypsy-P1 and -P3 primers (seematerials and methods) were selected from 59 and 39

non-LTR gypsy sequences, respectively, and primersyellow-Pn were selected from sequences present in the59 regulatory region of the yellow gene. PCR productswere obtained using sets of two primers such that onewould match yellow gene sequences and the secondwould match gypsy sequences. Using this procedure,each combination of primers successfully amplifying aspecific DNA fragment provides information on theorientation and the approximate insertion site of thegypsy insertion in the P[yovo; w1] transgene. The preciselocation of gypsy integration sites was then obtained bysequencing of the amplified DNA using gypsy LTR-P3 orgypsy LTR-P1 primers. The gypsy insertion in the P[yovoEN;w1] 2.4 ywing transgene occurred between the wing andbody enhancers, as predicted by the phenotype (Figure2E), at nucleotide 1111 in the upstream regulatorysequences of the yellow gene and outside of the ovoEN

sequences (Geyer and Corces 1987).Results from previous work had suggested that gypsy

insertions occur only within the DNA fragment contain-ing the binding sites for the Ovo protein, indicating thatsomehow targeting of the gypy preintegration complexand integration into DNA were part of the same process(Dej et al. 1998; Labrador and Corces 2001). In-

terestingly, integration of gypsy in the P[yovoEN; w1] 2.4ywing transgene in this case took place 869 bp away fromthe insertion site of the ovoEN sequences, suggesting thattargeting of gypsy preintegration complexes and gypsyintegration itself might take place in two independentevents during the process of gypsy integration intogenomic DNA. To obtain additional insights into themechanisms determining the specificity of gypsy in-tegration, we sequenced the insertion sites of a total of85 insertion events (Figure 3 and Table 2). Resultsconfirmed the previous observation and showed thatinsertions into P[yovo; w1] transgenes occurred up to 869bp upstream and up to 441 bp downstream of the actualovo sequence (Figure 3). Interestingly, whereas inser-tions in yovoWT occur both into the ovo sequences or intothe flanking yellow sequences, insertions induced by theovoEN sequence fall mostly into flanking yellow DNA.

In addition to the Ovo binding sites, there may beother sequences present in ovoWT but not in ovoEN thatmight be responsible for the observed difference in thedistribution of gypsy integration sites when transgenescarrying these two sequences are compared. For exam-ple, the Ovo-B promoter is present in the ovoWTsequenceand is missing in the ovoEN sequence. Although there isno experimental evidence to support this explanation, itis possible that the ability to activate transcription by theOvo-B promoter in the ovoWT sequence could involvelocal changes in chromatin structure in a manner thatfavors gypsy integration. These data suggest that, withoutother regulatory or promoter regions, Ovo proteins mayrecruit gypsy preintegration complexes to the vicinityof Ovo binding sites. The specific integration site may beselected later by the gypsy preintegration complexeswithin a few hundred nucleotides from the Ovo bindingsites by a still unknown mechanism.

It has been shown previously that gypsy has a prefer-ence for integration into the sequences TATATA andTACATA, later defined as YRYRYR, in which Y is apyrimidine and R is a purine (Dej et al. 1998). Given thatintegration sites into this sequence are significantlymore abundant than in any other sequence, our resultsconfirm the preference of gypsy for integration intoYRYRYR sequences, with 61 of the 85 sequences ana-lyzed conforming to the consensus (Table 2). However,a number of other sequences are also capable offunctioning as target sites for gypsy integration, suggest-ing that, although important, DNA sequence is not theonly factor determining integration site selection. In-terestingly, the pattern of insertion sites is significantlydifferent at each side of the ovo sequences (Figure 3).Clusters of insertions into sequences distal to the pro-moter occur at intervals of 145, 159, 246, and 319nucleotides, suggesting that these intervals may corre-spond to nucleosomal DNA that is protected and notaccessible to the gypsy integrase. On the other hand,insertions into sequences proximal to the yellow pro-moter occur in a more arbitrary fashion. A large cluster

TABLE 1

Frequency of gypsy insertions into P[yovo; w1] transgenes

LinesFemalestested

Newinsertionsa

Frequency(%)

ovoD1 controlsb 2134 141 6.61P[yovoWT; w1] 2.1 214 24 11.21P[yovoWT; w1] 1.1 181 19 10.50P[yovoDIM; w1] 2.1 132 0 0P[yovoDIM; w1] 1.2 121 0 0P[yovoEN; w1] 2.4 250 14 5.60P[yovoEN; w1] 4.2 238 10 4.20P[yovoEN; w1] 2.4 ywing 128 17 13.28

P[yovoEN; w1] 2.4 vs. P[yovoEN; w1] 2.4 ywing x2 ¼ 6.63; P ¼ 0.01.P[yovoEN; w1] 4.2 vs. P[yovoEN; w1] 2.4 ywing x2 ¼ 8.76; P ¼ 0.0031.

a Number of females that produced at least one progenywith a new gypsy insertion into a yellow transgene.

b Controls using the ovoD1 assay were performed in parallelwith experiments using transgenic lines to determine the ac-tivity of gypsy in flam flies.

Retroviral Insertion Specificity 1373

Page 8: Insulator and Ovo Proteins Determine the Frequency and

of multiple integrations is observed at 55 bp from theOvo binding sequences and small clusters or singleinsertion sites are found at intervals of only 145, 30, 46,118, and 47 nucleotides. Except for the 145-bp intervalbetween insertions at nucleotides 2036 and 2170, whichmay suggest the presence of a nucleosome, the remain-ing insertion sites are randomly scattered, suggestingthat positioned nucleosomes may be absent from this

region at the time of retroviral integration. Therefore,our results suggest that in addition to the recruitment ofgypsy preintegration complexes by Ovo proteins otherfactors such as the specific DNA sequence of the in-sertion site, nucleosomal arrangement, or other DNAbinding proteins may also play an important role in thefinal determination of the integration site.

The presence of a gypsy provirus results in highfrequency of secondary gypsy insertions: Interactionsbetween Ovo proteins and components of the gypsy pre-integration complex are likely responsible for targetinggypsy insertions to the vicinity of Ovo binding sites.Since gypsy insulator proteins mediate interactionsbetween individual insulators to create chromatinloops (Gerasimova et al. 2000; Cai and Shen 2001;Muravyova et al. 2001; Byrd and Corces 2003; Kuhn

et al. 2003), it is possible that the presence of a gypsyprovirus in a P[yovo; w1] transgene will increase thefrequency of secondary gypsy insertions. To test thispossibility, we compared the insertion rates into yellow-ovo sequences present in transgenes containing orlacking a copy of a gypsy provirus. We used the transgenicline P[y ovoEN; w1] 2.4 ywing, which contains a gypsy insertionin the yellow transgene regulatory region between thewing and body enhancers and displays wild-type blackpigmentation in the body but yellow wing blades.Starting with this transgenic line we then induced gypsymobilization using the same flam assay described above.We set up 128 F1 y w f ml flam/P[yovoEN; w1] 2.4 ywing singlefemale crosses and, as in previous assays, we analyzed anaverage of 100 offspring from each single female cross(range 95–105), screening for changes in the yellowphenotype (Table 1).

Transgenes with new insertions resulted in three newphenotypes when starting from flies with an initial ywing

phenotype (Figure 2): flies with a y2 phenotype (yellowwing blades, yellow body, and black bristles), flies withwild-type phenotype (black wing blades, black body, and

Figure 3.—gypsy integrationsites and orientation in the up-stream regulatory region of P[yovo;w1] transgenes. A total of 85 inte-gration sites were characterized atthe sequence level (see Table 2).Filledcircles correspond to integra-tion events. Arrowheads on top ofcircles indicate orientation of theinsertions with the arrowheadpointing toward the 39 end of gypsy.Ovo sequences were cloned atnucleotide 1980 in the upstreamregulatory sequence of yellow. In-tegration sites within the ovosequences are colored. Boldfacesegments below indicate distances

in nucleotides between the first and the last insertion before and after ovo DNA sequences. Numbers in brackets indicate distance be-tweenadjacentintegrationsites.Whiteboxes illustratethepositionofwing(W)andbody(B)enhancers.Theyellowpromoter is indicatedby a broken arrow. Drawing is not to scale.

TABLE 2

Target site sequences and integration sites intoP[yovo; w1] transgenes

Target site duplication Integration site No. of integrations

TATATGa 1835 21TGTATGa 2035 15TATATGa 2031 14TATAAA 491 in ovoWT 6TGTATAa 489 in ovoWT 4TATACAa 2170 3TATATC 1838 2TGTATAa 2029 2TATACT 1842 2TGTTTG 1430 2CTGTAT 252 in ovoEN 2TATAAA 1111 1TATTAA 2374 1TATAAC 2220 1CGTTAT 2210 1TGTAGA 1676 1TGTATT 2256 1TATACAa 2180 1ACGTTA 2209 1TATGCT 2036 1GTGTTT 1429 1GCACGA 2421 1TGCACAa 454 in ovoWT 1

a Target site duplications conform to the preferred consen-sus YRYRYR for gypsy integration.

1374 M. Labrador et al.

Page 9: Insulator and Ovo Proteins Determine the Frequency and

black bristles), and ybody flies (black wing blades, yellowbody, and black bristles). We obtained 17 such eventswith a final frequency of 13.28%. A Fisher exact testindicates that the number of new gypsy insertions inP[yovoEN; w1] 2.4 ywing is significantly higher than the samenumber in the parental transgenic stock P[yovoEN; w1] 2.4(Table 1). Since the initial P[yovoEN; w1] transgene isinserted in the same genomic location in both stocks,the observed differences are not due to chromosomalposition effects. The results suggest that the higherfrequency of second gypsy integrations may be due tointeractions between the preexisting gypsy element inthe genomic copy of the transgene and proteins presentin the gypsy preintegration complex.

Insulator function depends on the orientation ofgypsy provirus sequences: To confirm that the newphenotypes described in the previous section are dueto new gypsy integration events, we mapped the integra-tion sites and orientation of the putative new insertions.All new phenotypes contain a single additional new gypsyinsertion. Interestingly, each phenotype correlates eitherwith the orientation or with a specific integration site ofthe second gypsy insertion (Figure 2). We obtained a totalof 10 independent new insertions resulting in a ybody

phenotype and all of them originated from a secondinsertion, in an orientation opposite to the first one,between the body cuticle enhancer and the yellow pro-moter (Figure 2F). In these flies the body cuticleenhancer is flanked by two gypsy insertions in oppositeorientations and they display wild-type coloration of alltissues except for the body cuticle. We obtained only oneevent of a y1 phenotype, which originated from a secondgypsy insertion between the body cuticle enhancer andthe wing enhancer but integrated in an orientationopposite to the first one (Figure 2H). Finally, we obtained6 independent new insertions resulting in y2 phenotypes(Figure 2G). Interestingly, all 6 independent new linescarried a second gypsy insertion between the body en-hancer and the promoter, the same arrangement ob-served in ybody strains.

The only difference between flies with ybody pheno-types and y2 phenotypes is the relative orientation of thetwo gypsy insertions. In flies displaying a ybody phenotypethe two copies of gypsy are in opposite orientations(Figure 2F) whereas in flies with a y2 phenotype the twocopies of gypsy are in the same orientation (Figure 2G).These results show that two copies of the gypsy provirusinserted in the regulatory region of the yellow gene havethe same properties as two paired insulators in exper-imental transgenes (Cai and Shen 2001; Muravyova

et al. 2001; Kuhn et al. 2003). The presence of a singlegypsy provirus prevents the wing and body cuticleenhancers from activating the yellow promoter due tothe presence of the gypsy insulator (Figure 2D). Howeverwhen a second insertion occurs in the opposite orien-tation, distal enhancers can bypass both gypsy insertionsand their respective insulators (Figure 2, F and H).

Although the specific mechanism explaining thisproperty is still unknown, current models suggest thatthis phenomenon is the result of molecular interactionsbetween the two adjacent insulators and that theseinteractions loop out the intervening DNA. Enhancersoutside the resulting chromatin loop are capable ofactivating transcription of distal promoters, but en-hancers trapped within the chromatin loop are blockedby the interacting insulators. Our results confirm thismodel when the two copies of gypsy are in oppositeorientation and suggest that such interactions can occurat distances much longer than the distances so farreported (usually a few hundred nucleotides). Whenthe wing enhancer bypasses two gypsy insertions inopposite orientation as in Figure 2F and once thedistance between the two insertion sites and the size ofthe two gypsy sequences (7.4 kb) is taken into account,the distance between the two gypsy insulators is .14 kb.Interestingly, these interactions between insulators seemto be abrogated when the two copies of the gypsy provirusare in the same orientation. Since the only geneticdifference between gypsy insertion arrangements origi-nating the ybody and y2 phenotypes is the relative orienta-tion of the two gypsy elements, the results suggest thatthe relative orientation of gypsy sequences may affect theability of the two insulators to pair and thus influence theenhancer blocking properties of gypsy insulators.

DISCUSSION

The gypsy retrovirus of Drosophila may offer valuableclues as to how retroviruses develop strategies tospecifically select integration sites into the genome.Results shown here suggest that interactions betweenthe gypsy preintegration complex and, most likely, Ovoproteins are sufficient to promote site-specific integra-tion of gypsy into the ovo locus of Drosophila. Alterationof Ovo binding sites from a wild-type ovo gene fragmentabolishes the ability of gypsy to specifically integrate intoadjacent sequences. In addition, a synthetic DNAsequence carrying eight Ovo binding sites flanked byrandom DNA sequences is sufficient to function as ahighly specific target for integration of the gypsyretrovirus. Although direct interactions between Ovoproteins and the gypsy preintegration complex have notbeen substantiated, the data point to a mechanism bywhich Ovo proteins may tether the gypsy preintegrationcomplexes to their binding sites.

It is tempting to speculate that the gypsy integrase mayactually interact with the Ovo proteins and that such in-teraction may target integration to genome sites enrichedin Ovo. Evidence demonstrating tethering of integrases asa mechanism capable of targeting retroviruses to specificDNA binding sites has been shown in experiments usingfusion proteins in which the DNA binding domain ofphage l-repressor was fused to the integrase of the HIVretrovirus and successfully showed preferential integra-

Retroviral Insertion Specificity 1375

Page 10: Insulator and Ovo Proteins Determine the Frequency and

tion into target DNA near l-repressor-binding sites(Bushman 1994). Similarly, experiments with yeast retro-transposons have shown that such interactions may occurbetween the retrotransposon integrase and proteins thattarget the integration to their cognate chromosomal DNAbinding sites (Gai and Voytas 1998; Xie et al. 2001;Sandmeyer 2003; Zhu et al. 2003; Gao et al. 2008).

It has been reasoned that retrotransposons in smallgenomes such as that of yeast may develop tetheringmechanisms of site-specific integration by stimulatinginteractions between the integrase encoded by the retro-transposon and endogenous proteins, thus minimizingthe chances of deleterious mutations induced by retro-transposon integration events. In larger genomes suchmechanisms appear infrequently, probably due to thelack of selective pressure from the host genome. Inhumans for example, sequences related to interspersedretroviruses occupy .50% of the genome and onlyrelatively low frequencies of integration events in specifictarget spots have been reported (Farnet and Bushman

1997; Suzuki et al. 2004; Bushman et al. 2005; Shun et al.2007). In Drosophila only a number of specializednon-LTR retrotransposons have acquired specificity of in-tegration associated with specialized chromosomal re-gions such as telomeres or ribosomal DNA ( Jakubczak

et al. 1992; Casacuberta and Pardue 2005). Nonverte-brate retroviruses such as ZAM, Idefix, and gypsy appearto be an exception when compared with their vertebratecounterparts, since some degree of sequence specificityand targeted site integration has been described in allthree (Mevel-Ninio et al. 1989; Desset et al. 1999;Leblanc et al. 1999; Conte et al. 2000). The high rate ofinsertion of gypsy into the ovo gene and the role apparentlyplayed by the Ovo proteins provide an excellent tool tostudy the integration mechanism and how retrovirusesmay acquire integration site specificity in vivo.

Interestingly, even though Ovo proteins appear tohave a role in targeting gypsy to their binding sites, gypsyinsertion sites do not necessarily occur into the Ovobinding sequences themselves. From a total of 85sequenced insertion sites, only 13 (15%) occurred intothe DNA fragment containing Ovo binding sites; theremaining integration sites fall within an interval of.1300 bp flanking the DNA containing the Ovobinding sites. The analysis of insertion sites suggeststhat the targeting and the integration mechanisms areuncoupled, with the precise integration sites distributedin a nonrandom manner. Results shown here confirmprevious observations suggesting that gypsy has a pref-erence for integration into YRYRYR sequences. How-ever, a variety of other sequences appear to be able tofunction as integration sites. This disparity makes itdifficult to draw a clear conclusion as to what is themechanism ultimately involved in selecting target sites;however, it is tempting to speculate a role for nucleo-some positioning as one of the factors determining theselection of insertion sites by the gypsy retrovirus.

Several indirect lines of evidence suggest such a role.For example, integration frequencies are significantlyhigher between the promoter of the yellow gene and theOvo binding sites, indicating a preference that probablyreflects a difference in chromatin structure. Forty-twoindependent integration events occurred into a frag-ment of 441 bp located between Ovo binding sequencesand the promoter of the yellow gene, whereas only 30integrations occurred in a 869-bp DNA fragmentlocated distal to the promoter and upstream of theOvo binding sites. This asymmetry does not appear toreflect sequence differences or viability effects, suggest-ing an epigenetic basis for integration site selection. Inaddition, insertion sites found distally to the yellow pro-moter and upstream of the Ovo binding DNA sequencesappear to be spatially distributed in four intervals.140 bp, whereas insertions proximal to the promoterare distributed in a random manner, with only one largegap of 145 bp and apparently lacking meaningful spacingintervals. A possible interpretation of these results is thatgypsy integration preferentially occurs at specific pointsof either the nucleosome or the linker DNA. In the dis-tal interval, one spacing .140 bp could reflect nucleo-some positioning, whereas nucleosomes may be absentor not positioned in the promoter proximal region of thetransgene.

Results presented here also suggest that a preexistinggypsy insertion significantly increases the chances of newgypsy insertions into adjacent sequences by more thantwofold. We speculate that such enhancement of in-sertion frequency might be the result of a tetheringmechanism mediated by protein–protein interactionsbetween the gypsy element located in the chromosomalDNA and the gypsy preintegration complex during thenormal process by which a new copy of the retrovirus isinserted into the chromosome. Since two copies of thegypsy insulator have been shown to be able to interactwith each other (Gerasimova et al. 2000; Byrd andCorces 2003), it is tempting to speculate that inter-actions between gypsy insulators are responsible for theincreased frequency of gypsy insertions. Nevertheless, wecannot rule out the possibility that gypsy sequences otherthan the insulator or proteins associated with the gypsyelement itself are responsible for the observed inter-actions. If the high frequency of secondary insertions isdue to interactions between gypsy insulator proteinspresent in the provirus and in the preintegrationcomplex, the results would lend support to proposedmodels suggesting that individual insulators located indifferent regions of a chromosome can interact to formchromatin loops.

The analysis of phenotypes resulting from doubleinsertions allows further elaboration of this model andoffers additional insights into the mechanisms by whichinsulators affect enhancer–promoter interactions. Forexample, we have shown that a wing enhancer distal totwo adjacent gypsy insertions is capable of bypassing the

1376 M. Labrador et al.

Page 11: Insulator and Ovo Proteins Determine the Frequency and

activity of the two insulators when the two copies of thegypsy provirus are inserted in opposite orientation.These results demonstrate that interactions such asthe ones determined genetically in transgenes, involv-ing pairs of 400-bp gypsy insulators (Cai and Shen 2001;Muravyova et al. 2001; Kuhn et al. 2003), also occurbetween pairs of gypsy insulators embedded in the gypsyprovirus and suggest that establishing such interactionsis part of the normal life cycle strategy used by theretrovirus. Interestingly, when two gypsy insertions occurin the same orientation, distal enhancers are unable tobypass the two insulators and are blocked from activat-ing the promoter, contrary to what it has been observedwith direct repeats of insulator sequences (Cai andShen 2001; Muravyova et al. 2001; Kuhn et al. 2003).The main difference between the two sets of experi-ments is the presence of additional DNA sequences inthe gypsy provirus. These sequences may be able to forma stem-loop structure when the two copies of gypsy arearranged in opposite, but not when they are in the same,orientation. A similar role has been suggested for therelative orientation of insulator sequences betweeninteracting Mcp insulators (Kyrchanova et al. 2007).The stem-loop structure would allow interactions be-tween insulator proteins present in the two copies ofgypsy with opposite orientations but a direct tandemarrangement of the two copies of the provirus wouldpreclude such interactions. These observations supportthe hypothesis that interactions between paired insu-lators are required to bypass insulator function andallow enhancer–promoter communication.

This work was supported by U.S. Public Health Service AwardGM35463 from the National Institutes of Health.

LITERATURE CITED

Andrews, J., D. Garcia-Estefania, I. Delon, J. Lu, M. Mevel-Ninio

et al., 2000 OVO transcription factors function antagonisticallyin the Drosophila female germline. Development 127: 881–892.

Bachman, N., M. E. Gelbart, T. Tsukiyama and J. D. Boeke,2005 TFIIIB subunit Bdp1p is required for periodic integrationof the Ty1 retrotransposon and targeting of Isw2p to S. cerevisiaetDNAs. Genes Dev. 19: 955–964.

Bowen, N. J., I. K. Jordan, J. A. Epstein, V. Wood and H. L. Levin,2003 Retrotransposons and their recognition of pol II pro-moters: a comprehensive survey of the transposable elementsfrom the complete genome sequence of Schizosaccharomycespombe. Genome Res. 13: 1984–1997.

Brady, T. L., C. L. Schmidt and D. F. Voytas, 2008 Targeting inte-gration of the Saccharomyces Ty5 retrotransposon. Methods Mol.Biol. 435: 153–163.

Brennecke, J., A. A. Aravin, A. Stark, M. Dus, M. Kellis et al.,2007 Discrete small RNA-generating loci as master regulatorsof transposon activity in Drosophila. Cell 128: 1089–1103.

Bucheton, A., 1995 The relationship between the flamenco geneand gypsy in Drosophila: how to tame a retrovirus. Trends Genet.11: 349–353.

Bushman, F., M. Lewinski, A. Ciuffi, S. Barr, J. Leipzig et al.,2005 Genome-wide analysis of retroviral DNA integration. Nat.Rev. Microbiol. 3: 848–858.

Bushman, F. D., 1994 Tethering human immunodeficiency virus 1integrase to a DNA site directs integration to nearby sequences.Proc. Natl. Acad. Sci. USA 91: 9233–9237.

Bushman, F. D., 2003 Targeting survival: integration site selectionby retroviruses and LTR-retrotransposons. Cell 115: 135–138.

Byrd, K., and V. G. Corces, 2003 Visualization of chromatin do-mains created by the gypsy insulator of Drosophila. J. Cell Biol.162: 565–574.

Cai, H. N., and P. Shen, 2001 Effects of cis arrangement of chromatininsulators on enhancer-blocking activity. Science 291: 493–495.

Capelson, M., and V. G. Corces, 2004 Boundary elements and nu-clear organization. Biol. Cell 96: 617–629.

Casacuberta, E., and M. L. Pardue, 2005 HeT-A and TART, twoDrosophila retrotransposons with a bona fide role in chromo-some structure for more than 60 million years. Cytogenet. Ge-nome Res. 110: 152–159.

Christensen, S. M., and T. H. Eickbush, 2005 R2 target-primed re-verse transcription: ordered cleavage and polymerization steps byprotein subunits asymmetrically bound to the target DNA. Mol.Cell. Biol. 25: 6617–6628.

Conte, C., V. Calco, S. Desset, P. Leblanc, B. Dastugue et al.,2000 Impact of multiple insertions of two retroelements, ZAMand Idefix at an euchromatic locus. Genetica 109: 53–59.

Dej, K. J., T. Gerasimova, V. G. Corces and J. D. Boeke, 1998 A hot-spot for the Drosophila gypsy retroelement in the ovo locus.Nucleic Acids Res. 26: 4019–4025.

Desset, S., C. Conte, P. Dimitri, V. Calco, B. Dastugue et al.,1999 Mobilization of two retroelements, ZAM and Idefix, in anovel unstable line of Drosophila melanogaster. Mol. Biol. Evol.16: 54–66.

Dillon, P. J., and C. A. Rosen, 1990 A rapid method for the con-struction of synthetic genes using the polymerase chain reaction.Biotechniques 9(298): 300.

Farnet, C. M., and F. D. Bushman, 1997 HIV-1 cDNA integration:requirement of HMG I(Y) protein for function of preintegrationcomplexes in vitro. Cell 88: 483–492.

Gai, X., and D. F. Voytas, 1998 A single amino acid change in theyeast retrotransposon Ty5 abolishes targeting to silent chromatin.Mol. Cell 1: 1051–1055.

Gao, X., Y. Hou, H. Ebina, H. L. Levin and D. F. Voytas,2008 Chromodomains direct integration of retrotransposonsto heterochromatin. Genome Res. 18: 359–369.

Gause, M., P. Morcillo and D. Dorsett, 2001 Insulation ofenhancer-promoter communication by a gypsy transposon in-sert in the Drosophila cut gene: cooperation between suppres-sor of hairy-wing and modifier of mdg4 proteins. Mol. Cell. Biol.21: 4807–4817.

George, J. A., P. G. DeBaryshe, K. L. Traverse, S. E. Celniker andM. L. Pardue, 2006 Genomic organization of the Drosophilatelomere retrotransposable elements. Genome Res. 16: 1231–1240.

Gerasimova, T. I., K. Byrd and V. G. Corces, 2000 A chromatin in-sulator determines the nuclear localization of DNA. Mol. Cell 6:1025–1035.

Gerasimova, T. I., and V. G. Corces, 1998 Polycomb and trithoraxgroup proteins mediate the function of a chromatin insulator.Cell 92: 511–521.

Gerasimova, T. I., and V. G. Corces, 2001 Chromatin insulatorsand boundaries: effects on transcription and nuclear organiza-tion. Annu. Rev. Genet. 35: 193–208.

Geyer, P. K., and V. G. Corces, 1987 Separate regulatory elementsare responsible for the complex pattern of tissue-specific and de-velopmental transcription of the yellow locus in Drosophila mel-anogaster. Genes Dev. 1: 996–1004.

Geyer, P. K., C. Spana and V. G. Corces, 1986 On the molecularmechanism of gypsy-induced mutations at the yellow locus ofDrosophila melanogaster. EMBO J. 5: 2657–2662.

Ghosh, D., T. I. Gerasimova and V. G. Corces, 2001 Interactionsbetween the Su(Hw) and Mod(mdg4) proteins required forgypsy insulator function. EMBO J. 20: 2518–2527.

Golovnin, A., A. Mazur, M. Kopantseva, M. Kurshakova, P. V. Gulak

et al., 2007 Integrity of the Mod(mdg4)-67.2 BTB domain is crit-ical to insulator function in Drosophila melanogaster. Mol. Cell.Biol. 27: 963–974.

Jakubczak, J. L., M. K. Zenni, R. C. Woodruff and T. H. Eickbush,1992 Turnover of R1 (type I) and R2 (type II) retrotransposableelements in the ribosomal DNA of Drosophila melanogaster. Genet-ics 131: 129–142.

Retroviral Insertion Specificity 1377

Page 12: Insulator and Ovo Proteins Determine the Frequency and

Kuhn, E. J., M. M. Viering, K. M. Rhodes and P. K. Geyer, 2003 Atest of insulator interactions in Drosophila. EMBO J. 22: 2463–2471.

Kyrchanova, O., S. Toshchakov, A. Parshikov and P. Georgiev,2007 Study of the functional interaction between Mcp insula-tors from the Drosophila bithorax complex: effects of insulatorpairing on enhancer-promoter communication. Mol. Cell. Biol.27: 3035–3043.

Labrador, M., and V. G. Corces, 2001 Protein determinants of in-sertional specificity for the Drosophila gypsy retrovirus. Genetics158: 1101–1110.

Leblanc, P., B. Dastugue and C. Vaury, 1999 The integration ma-chinery of ZAM, a retroelement from Drosophila melanogaster,acts as a sequence-specific endonuclease. J. Virol. 73: 7061–7064.

Lewinski, M. K., D. Bisgrove, P. Shinn, H. Chen, C. Hoffmann et al.,2005 Genome-wide analysis of chromosomal features repres-sing human immunodeficiency virus transcription. J. Virol. 79:6610–6619.

Lu, J., J. Andrews, D. Pauli and B. Oliver, 1998 Drosophila OVOzinc-finger protein regulates ovo and ovarian tumor target pro-moters. Dev. Genes Evol. 208: 213–222.

Mevel-Ninio, M., E. Fouilloux, I. Guenal and A. Vincent,1996 The three dominant female-sterile mutations of the Dro-sophila ovo gene are point mutations that create new translation-initiator AUG codons. Development 122: 4131–4138.

Mevel-Ninio, M., M. C. Mariol and M. Gans, 1989 Mobilization ofthe gypsy and copia retrotransposons in Drosophila melanogasterinduces reversion of the ovo dominant female-sterile mutations:molecular analysis of revertant alleles. EMBO J. 8: 1549–1558.

Mou, Z., A. E. Kenny and M. J. Curcio, 2006 Hos2 and Set3 pro-mote integration of Ty1 retrotransposons at tRNA genes in Sac-charomyces cerevisiae. Genetics 172: 2157–2167.

Muravyova,E.,A.Golovnin,E.Gracheva,A.Parshikov,T.Belenkaya

et al., 2001 Loss of insulator activity by paired Su(Hw) chromatininsulators. Science 291: 495–498.

Pelisson, A., S. U. Song, N. Prud’homme, P. A. Smith, A. Bucheton

et al., 1994 Gypsy transposition correlates with the productionof a retroviral envelope-like protein under the tissue-specific con-trol of the Drosophila flamenco gene. EMBO J. 13: 4401–4411.

Prud’homme, N., M. Gans, M. Masson, C. Terzian and A. Bucheton,1995 Flamenco, a gene controlling the gypsy retrovirus of Dro-sophila melanogaster. Genetics 139: 697–711.

Roseman, R. R., V. Pirrotta and P. K. Geyer, 1993 The su(Hw)protein insulates expression of the Drosophila melanogasterwhite gene from chromosomal position-effects. EMBO J. 12:435–442.

Sandmeyer, S., 2003 Integration by design. Proc. Natl. Acad. Sci.USA 100: 5586–5588.

Scherdin, U., K. Rhodes and M. Breindl, 1990 Transcriptionallyactive genome regions are preferred targets for retrovirus inte-gration. J. Virol. 64: 907–912.

Schroder, A. R., P. Shinn, H. Chen, C. Berry, J. R. Ecker et al.,2002 HIV-1 integration in the human genome favors activegenes and local hotspots. Cell 110: 521–529.

Shun, M. C., N. K. Raghavendra, N. Vandegraaff, J. E. Daigle,S. Hughes et al., 2007 LEDGF/p75 functions downstream frompreintegration complex formation to effect gene-specific HIV-1integration. Genes Dev. 21: 1767–1778.

Singleton, T. L., and H. L. Levin, 2002 A long terminal repeat ret-rotransposon of fission yeast has strong preferences for specificsites of insertion. Eukaryot. Cell 1: 44–55.

Song, S. U., T. Gerasimova, M. Kurkulos, J. D. Boeke and V. G. Corces,1994 An env-like protein encoded by a Drosophila retroelement:evidence that gypsy is an infectious retrovirus. Genes Dev. 8: 2046–2057.

Song, S. U., M. Kurkulos, J. D. Boeke and V. G. Corces,1997 Infection of the germ line by retroviral particles producedin the follicle cells: a possible mechanism for the mobilization ofthe gypsy retroelement of Drosophila. Development 124: 2789–2798.

Spana, C., D. A. Harrison and V. G. Corces, 1988 The Drosophilamelanogaster suppressor of Hairy-wing protein binds to specificsequences of the gypsy retrotransposon. Genes Dev. 2: 1414–1423.

Suzuki, Y., H. Yang and R. Craigie, 2004 LAP2alpha and BAF col-laborate to organize the Moloney murine leukemia virus preinte-gration complex. EMBO J. 23: 4670–4678.

Wu, X., Y. Li, B. Crise and S. M. Burgess, 2003 Transcription startregions in the human genome are favored targets for MLV inte-gration. Science 300: 1749–1751.

Xie, W., X. Gai, Y. Zhu, D. C. Zappulla, R. Sternglanz et al.,2001 Targeting of the yeast Ty5 retrotransposon to silent chro-matin is mediated by interactions between integrase and Sir4p.Mol. Cell. Biol. 21: 6606–6614.

Yang, J., H. S. Malik and T. H. Eickbush, 1999 Identification of theendonuclease domain encoded by R2 and other site-specific,non-long terminal repeat retrotransposable elements. Proc. Natl.Acad. Sci. USA 96: 7847–7852.

Yant, S. R., X. Wu, Y. Huang, B. Garrison, S. M. Burgess et al.,2005 High-resolution genome-wide mapping of transposon in-tegration in mammals. Mol. Cell. Biol. 25: 2085–2094.

Yieh, L., G. Kassavetis, E. P. Geiduschek and S. B. Sandmeyer,2000 The Brf and TATA-binding protein subunits of the RNApolymerase III transcription factor IIIB mediate position-specificintegration of the gypsy-like element, Ty3. J. Biol. Chem. 275:29800–29807.

Zhu, Y., J. Dai, P. G. Fuerst and D. F. Voytas, 2003 Controlling in-tegration specificity of a yeast retrotransposon. Proc. Natl. Acad.Sci. USA 100: 5891–5895.

Communicating editor: M. J. Simmons

1378 M. Labrador et al.