16
J. gen. Viral. (1989), 70, 919-934. Printed in Great Britain Key words: SVDV/nucleotide sequence~amino acid sequence 919 The Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease Research Division, National Institute of Animal Health, Jyosuihonchyo, Kodaira-shi Tokyo, 187 Japan and 2National Institute of Animal Health, Tsukuba-shL Ibaraki-ken, 305 Japan (Accepted 5 December 1988) SUMMARY The complete nucleotide sequence of the genome of the enterovirus swine vesicular disease virus (SVDV ; H/3 '76) isolated from a healthy pig has been determined using molecular cloning and DNA sequencing techniques. The RNA genome was 7400 nucleotides long, excluding the poly(A) tract, and appeared to encode a single polyprotein of 2185 amino acids. The predicted amino acid sequence of the polyprotein showed close homology (around 90 ~) to that of the previously sequenced coxsackie- viruses B1, B3 and B4, and also showed homology (around 60~) to that of poliovirus. This homology allows us to predict the possible cleavage sites of the polyprotein and to identify other features of structural and functional significance, which seem to be important to the biological integrity of the virus. A detailed analysis of homology between SVDV and coxsackieviruses shows that non-structural proteins are highly conserved whereas the structural proteins are less well conserved. The 5' and 3' non- coding regions are also conserved, although there are several divergent nuclcotide stretches. These stretches may differentiate SVDV from coxsackieviruses. INTRODUCTION Swine vesicular disease (SVD) was first observed in Italy in 1966 (Nardelli et al., 1968). Subsequent outbreaks occurred in Hong Kong (Mowat et al., 1972), the United Kingdom (Dawe et al., 1973), a number of other European countries (Burrows et al., 1974) and in Japan (Kodama, 1976). SVD is an infectious disease of pigs characterized by the appearance of vesicles on the tongue, in the mouth and on the feet and hocks, lesions which are indistinguishable from those caused by foot-and-mouth disease virus. The causative agent, SVD virus (SVDV) belongs to the enterovirus genus, in the family Picornaviridae, and has a close serological relationship to coxsackievirus B5 (CB5) (Graves, 1973). However, SVDV can be distinguished from CB5 by cross-neutralization and immunodiffusion tests (Brown et al., 1973, 1976). Pigs inoculated with CB5 develop antibody to both SVDV and CB5 but do not develop the disease (Garland & Mann, 1974). Recently the complete nucleotide sequences of three coxsackie B viruses (CBs), CB 1, CB3 and CB4 have been determined (Iizuka et al., 1987; Lindberg et al., 1987; Jenkins et aL, 1987). Like other picomaviruses, the genome of the CB is a single-stranded (ss) RNA with a positive polarity, and is composed of approximately 7400 nucleotides. A small protein, VPg, is attached at the 5' terminus, and a poly(A) tract is located at the 3' terminus. Little information is available on the nucleotide sequence of the SVDV genome except for 75 nucleotides at the Y terminus (Porter et al., 1978). In this study, we have cloned the genome of SVDV into a plasmid and determined its complete nucleotide sequence. We also carried out comparisons of its nucleotide and deduced amino acid sequences with the sequence of CBs and some other picornaviruses. The results indicate that SVDV is closely related to the CBs. 0000-8624 © 1989 SGM

The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

Embed Size (px)

Citation preview

Page 1: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

J. gen. Viral. (1989), 70, 919-934. Printed in Great Britain

Key words: SVDV/nucleotide sequence~amino acid sequence

919

The Complete Nucleotide Sequence of Swine Vesicular Disease Virus

By T O R U I N O U E , .1 T A I K O S U Z U K I 2 AND K I I C H I S E K I G U C H I 2

1Exotic Disease Research Division, National Institute of Animal Health, Jyosuihonchyo, Kodaira-shi Tokyo, 187 Japan and 2National Institute of Animal Health, Tsukuba-shL

Ibaraki-ken, 305 Japan

(Accepted 5 December 1988)

SUMMARY

The complete nucleotide sequence of the genome of the enterovirus swine vesicular disease virus (SVDV ; H/3 '76) isolated from a healthy pig has been determined using molecular cloning and DNA sequencing techniques. The RNA genome was 7400 nucleotides long, excluding the poly(A) tract, and appeared to encode a single polyprotein of 2185 amino acids. The predicted amino acid sequence of the polyprotein showed close homology (around 90 ~) to that of the previously sequenced coxsackie- viruses B1, B3 and B4, and also showed homology (around 60~) to that of poliovirus. This homology allows us to predict the possible cleavage sites of the polyprotein and to identify other features of structural and functional significance, which seem to be important to the biological integrity of the virus. A detailed analysis of homology between SVDV and coxsackieviruses shows that non-structural proteins are highly conserved whereas the structural proteins are less well conserved. The 5' and 3' non- coding regions are also conserved, although there are several divergent nuclcotide stretches. These stretches may differentiate SVDV from coxsackieviruses.

INTRODUCTION

Swine vesicular disease (SVD) was first observed in Italy in 1966 (Nardelli et al., 1968). Subsequent outbreaks occurred in Hong Kong (Mowat et al., 1972), the United Kingdom (Dawe et al., 1973), a number of other European countries (Burrows et al., 1974) and in Japan (Kodama, 1976). SVD is an infectious disease of pigs characterized by the appearance of vesicles on the tongue, in the mouth and on the feet and hocks, lesions which are indistinguishable from those caused by foot-and-mouth disease virus.

The causative agent, SVD virus (SVDV) belongs to the enterovirus genus, in the family Picornaviridae, and has a close serological relationship to coxsackievirus B5 (CB5) (Graves, 1973). However, SVDV can be distinguished from CB5 by cross-neutralization and immunodiffusion tests (Brown et al., 1973, 1976). Pigs inoculated with CB5 develop antibody to both SVDV and CB5 but do not develop the disease (Garland & Mann, 1974).

Recently the complete nucleotide sequences of three coxsackie B viruses (CBs), CB 1, CB3 and CB4 have been determined (Iizuka et al., 1987; Lindberg et al., 1987; Jenkins et aL, 1987). Like other picomaviruses, the genome of the CB is a single-stranded (ss) RNA with a positive polarity, and is composed of approximately 7400 nucleotides. A small protein, VPg, is attached at the 5' terminus, and a poly(A) tract is located at the 3' terminus. Little information is available on the nucleotide sequence of the SVDV genome except for 75 nucleotides at the Y terminus (Porter et al., 1978).

In this study, we have cloned the genome of SVDV into a plasmid and determined its complete nucleotide sequence. We also carried out comparisons of its nucleotide and deduced amino acid sequences with the sequence of CBs and some other picornaviruses. The results indicate that SVDV is closely related to the CBs.

0000-8624 © 1989 SGM

Page 2: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

920 T. INOUE, T. SUZUKI AND K. SEKIGUCHI

METHODS

Virus and cells. SVDV (H/3 '76) was isolated from the faeces of a clinically healthy pig in Hokkaido, Japan in 1976 by Dr M. Kodama (Kodama et al., 1980a, b). The virus was less pathogenic for pigs than isolates from diseased pigs, and vesicle formation was restricted to the sites of inoculation. The virus was plaque-purified twice and grown in IBRS-2 cell monolayers in roller bottles.

Viruspurifieation. Infected cells were harvested 16 h post-infection when cells exhibited c.p.e. Virus was purified by a modification of the procedure described by Nomoto et al. (1979). Briefly, the clarified infected culture medium was made up to 2 ~ Sarkosyl and 20 mM-disodium EDTA and centrifuged for 2 fi at 27000 r.p.m, at 5 °C. The pellet was suspended in 0-05 i-Tris-HC1 pH 7-4, 0.3 M-NaC1, 1 mM-disodium EDTA, 1 ~o Sarkosyl, and the suspension was sedimented through 30~ glycerol containing 0-3 M-NaC1, 0.01 M-Tris-HC1 pH 7.4, 1 mi-disodium EDTA, 1 ~ Sarkosyl for 2 h at 35000 r.p.m, at 5 °C. The pellet was resuspended in TNE (0.01 i-Tris-HC1 pH 7-4, 0.1 M-NaCI, 1 mi-disodium EDTA) and banded by isopycnic centrifugation in CsC1 (p = 1-34 g/ml) for 16 h at 40000 r.p.m, at 7 °C.

RNA extraction and characterization. RNA was obtained from purified virus by the method of Grubman et al. (1979) with several modifications. The purified virus was treated with proteinase K/SDS and extracted with phenol/chloroform. The extracted RNA was recovered by precipitation in ethanol, and the viral RNA was purified by sucrose density gradient centrifugation and oligo(dT)-cellulose chromatography. Viral RNA was subsequently analysed by 1 ~ agarose gel electrophoresis. The fundamental integrity of the RNA was shown by an infectious RNA assay on monolayers of IBRS-2 cells (Grubman et al., 1979).

Preparation o f S V D V cDNA clones. Double-stranded (ds) cDNA was synthesized from viral RNA using a cDNA synthesis kit (Amersham). The first strand RNA--cDNA hybrid was primed with oligo(dT) and synthesized using reverse transcriptase. The second strand was synthesized by replacing the RNA strand of the RNA-cDNA hybrid using RNase H and DNA polymerase I. T4 DNA polymerase was subsequently added to the reaction mixture to remove any small 3' overhang on the first strand cDNA. The ds cDNA was phenol-extracted and tailed with oligo(dC) using terminal transferase, as described by Deng & Wu (1981).

The dC-tailed ds cDNA was annealed to dG-tailed PstI-cleaved pBR322 cloning vector (Bethesda Research Laboratories; BRL). The annealed DNA was then used to transform competent ceils of Escherichia coli strain HB101 (BRL).

Restriction fragments of the ds cDNA obtained by digestion with appropriate restriction enzymes were purified by 1 ~ agarose gel electrophoresis and inserted into the corresponding site of the M13 cloning vector. The ligated DNA was used to transform competent E. colt JMI09 cells.

The recombinant clones were screened by colony and Southern blot hybridizations (Maniatis et al., 1982) using oligo(dT)-primed 32p-labelled single-stranded (ss) cDNA as a probe. Recombinant plasmids were isolated from strongly hybridizing colonies (Birnboim & Doly, 1979) and further characterized by restriction enzyme mapping. For the genome region close to the 5" terminus, a synthetic oligonucleotide (20-met; nucleotides 665 to 684; Takara Syuzo) was used as a primer to synthesize ds eDNA.

Nucleotide sequence analTsis. Inserts from recombinant plasmids were digested with various restriction enzymes, and each fragment was subcloned into M13mpl8 or mpl9 replicative form DNA cleaved at compatible sites. Subclones that harboured a large insert were treated with exonuclease III and Sl nuclease to construct stepwise deletion sets (Henikoff, 1984). Each fragment was sequenced by the dideoxynucleotide chain termination method of Sanger et al. (1977).

The primer extension method was employed for sequencing the extreme 5' terminal region of the genome, whose sequence was not contained in any of the cloned cDNAs obtained. A synthetic oligonucleotide that was complementary to a region of the viral RNA (nucleotides 91 to 110) was radiolabelled using polynucleotide kinase and [~-32p]ATP (Geliebter, 1987). RNA sequencing was carried out by the dideoxynucleotide chain termination method using avian myeloblastosis virus reverse transcriptase. Alternatively, the 32p-labelled oligonucleotide primer was annealed to viral RNA and a primer extension reaction was carried out using reverse transcriptase. The resulting cDNA product was separated on an 8 ~ denaturing polyacrylamide gel and sequenced by the method of Maxam & Gilbert (1977).

Computer analysis. Nucleotide sequences were compiled and analysed using an NEC PC9801 personal computer and the Microgenie and Genetyx suites of programs.

RESULTS AND DISCUSSION

Cloning o f the S V D V g e n o m e

T h e ex t r ac t ed S V D V R N A , ana lysed by 1 ~ aga rose gel e lec t rophores i s , s h o w e d a species o f the expec t ed size, a n d t he b io logica l in tegr i ty of t he R N A was d e m o n s t r a t e d by g e n e r a t i n g in fec t ious S V D V (6 x l0 3 p . f .u . /gg R N A ) . T h e v i ra l R N A was c o n v e r t e d to ds c D N A as de sc r ibed in M e t h o d s ; t he resu l t ing c D N A was n e a r ful l - length.

Page 3: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

5' non- P_g..coding /

I1ml

P-1

In I 1C I

0 1 2

Nucleotide sequence o f S V D V

P-2

ID Y 2 A I2BI

3 4

P-3

2C ~(3AI[ 3C I 3B

5 6

3D

921

3' non- coding

I poly(A)

I MP--P-ff6N

4 - - J - - 4

4 - - - * •

4 - - -4 4

• 4

tl 1 I 11 t

I I I

SVDV i cDNA I i L

PBRSI4

4

F: t 4 I 4 I I

4 14 , 4

!: il ,L

II I I I I

i

] I PBRS19

p i • ~ > - - - - - ~ 0

I , 4 I ~ 4-- - - - -4

e l

Fig. 1. Restriction cleavage sites on SVDV and strategy for sequence analysis. Genome organization and nucleotide length of the SVDV genome are shown at the top of the figure (scale bar is in kb). Restriction cleavage sites of SVDV cDNA and cDNA clones obtained are shown below the SVDV genome. Open triangles on SVDV cDNA lane indicate the site of synthetic oligonucleotide primers. Arrows in brackets indicate the regions where nucleotide sequences of cloned cDNA were analysed. The dots by the arrows represent the restriction sites used for subcloning, the bars represent the sites chosen for the stepwise deletion sets. The closed triangle at the bottom of the figure represents the nucleotide sequence determined by using primer-extended products.

The ds c D N A was tailed with d C M P residues and annealed to dG-ta i led PstI-cleaved pBR322. The recombinant mixture was used to transform competent E. coli HB 101 cells. Thirty- five ampicill in-sensit ive colonies were obtained. The harboured plasmids were cut with PstI and the released fragments were analysed by Southern blot hybridizat ion using SVDV ss c D N A as a probe. Finally, two clones (PBRS14 and PBRS19) were obtained whose inserts were 5.5 kb and 0.8 kb respectively. Clone PBRS19 contained the 3' poly(A) tail consisting of 47 A M P residues.

Clone MPS453 which spans the genomic gap between PBRS14 and PBRS19 was prepared by ligation of an SphI/SacI fragment of ds c D N A to the M13 cloning vector replicative form carrying compat ible cohesive ends.

To obtain the clone MPS671 covering the 5' end of the viral genome, ds c D N A was prepared by the use of a synthetic oligonucleotide (nucleotides 665 to 684) as a primer. An EcoRI l inker was added to each terminus of the ds c D N A and digested with EcoRI and PstI. The resulting fragments carrying one EcoRI cohesive end and one PstI cohesive end were ligated to the M13 cloning vector carrying compat ible cohesive ends. These c D N A clones covered the whole viral genome except for 24 nucleotides at the extreme 5' terminus. These clones and ds c D N A were mapped with restriction enzymes and aligned using restriction sites and sequence data. The restriction map is shown in Fig. 1 together with the sequencing strategy.

Nucleotide sequence

Fragments generated from c D N A inserts were subcloned into M13 cloning vectors and sequenced using the dideoxynucleotide chain terminat ion method (Sanger et al., 1977). To obtain the sequence of the extreme 5' terminal region of the genome, pr imer extension

Page 4: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

922 T. I N O U E , T. S U Z U K I AND K. S E K I G U C H I

10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 IAIAAA/C.AGQ31G,9(~IU~U~IJACAQJr,~IJWCkCG~ACC~UGUGCroCC'~ U U U r o A C U U ~ ~

160 170 180 190 200 210 220 2,30 240 250 2.60 270 280 290 300 e..,AACUG~ UUAC0~G~UGAG~AC~AAU " MAC~UUCG~ . A G ~ ~

310 320 330 340 350 360 370 380 390 400 410 420 430 ~ 450 AACCC~AGUGUAGAUCAGGCcGAUGAGU~Ac~G~UG~GGUGQcU~UGGGGUAA~U~AU~GGAUG~U~AA~AcUGACAUGG~

460 470 480 490 500 $I0 520 530 540 550 560 5-/0 580 590 600

mm

610 620 630 640 650 660 670 680 670 700. 710 720 730 740~F IA (VP4) UGAGAGAUOSIlMCCAUAUUGCUAUUGGAU UGGCCJ~CC~ UGACGAAUAGAACAGUUGCUUACCUGUUUGU UAG UCtJ~ U A U C A C U ~ ~ U U ~ U ~ ~ ~

H G A

760 770 780 790 ~0 810 820 ~0 840 850 860 870 880 890 900 UCAAGUGUCAACACJULAAGACCGGUGCUCAUGAGACCAGCUUGAGUGCAGCGGGCAAUUCAG U C A U U C A U U A C A C A A A O U J A A A C U A C U ~ ~ ~ QV S TQ K T G A H E T S LS A AGN 5 V I H Y T H I N Y Y K OAASN 5 AN RQ 0 F TQO P G K F

J

910 920 930 940 ~ I B (VP2) 970 980 990 1000 1010 1020 1030 1040 1050 ~UGAJUkGA~U~G UCJU~UAUGCCUGCCCUOU~UUCCCI?J~UCAGCA~AG UG UGGCUA~JkGUGACJ~UM~U(~7,A~MUU~~~UGGIJAGUUGG

T E p V ~ 0 [ HV ~S ~p ALNS p S ~ E EC~Y S 0 RV RS IT IGHS T IT T~ECA~ V VV G

1060 1070 1080 1090 1100 1110 1120 1130 1140 1150 1160 1170 1180 1190 I~ AUAUGGGGUGUGGCCAACUUA~UUGAAGGAUGAAGk~.J~ACA~GA GAUGUGGCC~gG UGCkGG UUUUACACGCIJCGAJ~K~GUGAUGUGGCJLACAGAGUUCACCAGGCtJC~G~ yG V~ p T y L K 0 E E A TA E D Q p T Q p 0 V A T C R F Y TL E S VH QG O S S P G ~QW K F P 0

I~I0 I790 I~0 1240 1250 I~0 12"/0 I~0 I~70 1300 1310 1320 1330 134~) I~0 CGCGUUG U(~CAACAUG~GCUAUU~Gr~CkAAJ~U~GCJ~GUACCACUACCUUG~oAGAGCCGCd~Jk~,GAU~ UGCJkG U~UCCA/~AUU~UG U C I J ~ U A U G ~ ~ ~

A L S N H G L F G ~ N H Q Y H Y LGRA~ yT[ HVQCNAS K F H ( I G C L L V V C V P E A E M G C

1360 1370 1380 1390 1400 1410 1420 1430 1440 1450 1460 1470 1400 1490 1500 UGCCAcGUUGGCCAAUAAGCCUGACCC~UUULAGCCUGAGU/ULAGGGGAAAUAGCCAACAUG~UUGAAUcCCA/u~ACUCCACCGGGGAAJ~CGGC~G UGCJU~GCUAA~A~~ ~ U ~ U ~ A U A T L A N K P 0 P K S L S K G E I A N ~ F E S Q H S T ~ ETAVQAN V [ M AG ~G V~ VG N kT [

1510 1520 1530 1540 1550 1560 1570 1580 1590 1600 1610 1620 1630 1640 1650 CUU~GUGGAUcAACUUGCGCACUAkCAACAGCGC1JACGAUUGUCAUGCcAUAUAUAAACAG~UGCcCAUGGACAACAUGUUcAG~UUUU~U~U~U~

F p H Q~ I N L R T N N S A T I V ~ p Y I N 5 V P H 0 N H F R H N H F T LHV I P F A P L S Y S TG i

1660 1670 1680 1690 1700 1710 17'20 1730 ~ ' tC (VP3) 1760 1770 1780 1770 1800 GG(~jACCAcGUACG~G~UG~,`GCCJUk~0GCGCUGMUAUA/~k3GGCUGCGU(~JAGCCGGUA/~``AJ~UiJ~GUCG~A~CAJXCC`JkG~UCU~UCCGJ~~UC A TT y V p I TV T VA P HC A E y N G L R L A G K'Q ~ I P T L S T PG S H 0 F L T S 0 0 F g S P S

1810 1820 1830 1840 1850 1860 1870 1880 1890 1900 1910 1920 1930 1940 1950 AGcCAUGCCACAAUUCGAUGUCACUCcUGAGAJJGGAUAUUCCAGGACAAGU~UGAUGGAGAUUGCAGAAGUAGAUUC~GUGGUGCcUG U ~ U G A U G UCAAU ~ ~ ~

A~ PQ F D V T P E ~ D I p G Q V N N L B E [ A ( V 0 5 V V P VNN T E G KVBS I E A YO [ P V ~

1960 1970 1980 t990 2000 2010 2020 2030 2040 2050 2060 2070 2080 2090 2100 AUCGAAUCCAACCA,e~G~LI¢UCAGGUUUUUGGGU~AGUGUG U U A A k C A G G A C U U U G C U G G G A G A A A U ( ~ U A A A C L I A C U ~ U U G G U ~ ~ ~

SN P TN G 8 QV F G F P L T PO AN $ V L N 8 T L L G E I LN Y YAH W $ G S I KL ~ F~FCG $

2110 2120 2130 2140 2150 2160 2170 2180 2190 ?900 7910 ?'790 2230 2240 2250 ~ A A A A AUGC~AGGUACUCA¢GUGAUCUGGGAUGUGGGUCUACAAUCGAGCUGOGUAUIJGUGUA~AGUCA

A~ATG K F L LAY S P P GAG APT T 8 K E A ~ I G THV I W OVG L~S S C V L C I PW I S ~

2260 2270 2280 2290 2300 2.t10 2.320 2.330 2-!40 2350 2360 2370 2380 2390 2400 MCC.~T.ACUACAGGUAUGUAGUMUGGAUGMU,kCAC~CUGGUGGAIJACAUMCU U G C U G G ~ U A U U G U G G ~ U M G A U C U U G ~ U U ~ ~ W ~

T It Y R Y V V M 0 E Y T A G G Y I T C ~ Y O. T N I V V P ~ 0 ~ O, S 0 C K I k C F A S /~ C 14 D F s ~f R J

2410 2420 2430 2,t40 f ]D (VPI) 2470 2480 2490 2500 2510 2520 2530 2540 2530 ~ A C A C A C C C t , ' ~ A W U U L A C A G G A U A A U U ~ A ~ C G U C G C O G A U A C U A U U G G G A G C G G A C ¢ ~ ~

N L t~ 0 T P F I KG ON F F 9 G P PG E V V E R A I A R V A 0 T Z G S G P V N $ ES I P A L T A A E

2560 2570 2580 25'90 2600 2610 .?629 2630 2,640 ~50 2660 ?,.6~ 2680 2690 J A ~ 3 A G G C A ~ U G ~ A U C A ~ U C G A C A G ~ U G C A ~ g C U ~

T 6 H T $ G V V P $ 0 T II Q T I t It V K N Y H S R S E 5 T V E N F L C R S A C V F Y T T Y K N II D $ 0

2710 27"20 2730 2740 Z/SO 2?60 27"70 2?80 2790 2800 2810 2820 2830 2840 2850

G D N F A Y V Y [ N T R G V k 9 L R R K L E N F T Y k R F 0 L E L T F V I T $ T Q. E d[P T ¥ R ~ I 0

28~ 7870 '2880 2890 29~0 2910 292O 2930 2040 2950 '2960 '2970 2988 2990 3000 ~ U G O A O G O A C O U C C A ~ ~ U C C A C C A . , ' ~ C C A A Q U Q ~ ~

T P V L T H 0 I H Y V P P G G P V P T K Y N $ Y S WQ T S T N P S V F W T EG S A P P R M $ | P F I

3010 3020 3030 30~ 30SO 3060 3070 3080 3090 3100 311Q 3120 3130 3140 3150 U G G r . J Q J A G G ~ U I J O U A U Q A O G G G U ~ ~

0 I G N k Y $ M F Y D G W k R F 0 K Q G T Y G T $ T L N N PIG T L Y I'l R H V N 0 G G P G P I V $ T V

Page 5: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

Nucleotide sequence of S VD V 923

3160 3170 3|M 3190 3200 3210 ~ 3~0 ~40 3250 3260 3'2"/'(] 3~ 3~ V L~ A C G A A U ~ A C O ~ C G U C A A A A O ; U G G G ~ U G A A ~

R I Y F K P K H V K T ~ V P R P'P R L C Q Y Q K A G N V N F E P T G V T E G R T 0 I T i M K T T G A

3310 3320 3330 3~ 3350 3360 3370 3380 33~ ~ 3410 3420 ~k58 3440 ~58 ~ ~ U ~ U U G G C J ~ A C U A I I A G A G I k I G ~ ~ ~ ~ F G O 0 S G A V Y V G N Y R V V N I HI- A T R A D WON CVQ E DY 0 R 0 L L V S T T TA 0 G CD T

3470 ~0 ~ 3500 3~I0 3~ ~58 3540 ~0 3~ 3570 3,$80 35~ ~Ut~.~CZT.A£.~G ~AU ~ ~AGUGU GL~CA~UGAGr-~go~L~-'CI~GUGGAGGU UCAAGAGAG~UAOU ~ U ~

I A R C 0 O T A G V Y F C A S R N K H Y P V T F E G P G L V E VO E S E Y Y P K K H 0 S H V L L A A m

3620 3630 3640 3650 3660 3670 ~,.,. 3680 ~ ~ 3710 ~ 3730 3740 ~2B ~I0 UGGAUUUGC~AGAGCcGGGUGAUUGUGGAGGGAUUC1jcAGA~UGGGGUGAUUGGCAUAG~UACCAU``~UUGUUGGUUUUG~GA~~~

G F A E P G D C G G I L R C 0 H G V I G I V T P ~ G G E G V V G F A O V R 0 L L W L E D 0 A ~ E 0 8 V

3760 3770 3780 3790 3800 3810 3820 3830 3840 ~58 3860 3870 3880 3890 3900 UAGGGAUUAUGUGGAACAACU~C-CAAUGC~U~GOJCA¢~3AU~UUUG~AJ~AGGUU~AGgOG'~ ~UOAAGAUAGUGO~aC

R 0 Y V E Q L ~ N A F ~ S G F T N 0 I C E ~ V T L L K E S L I 0 0 0 5 I L E K $ L I ( A l V K I V S A

3910 3~0 3930. 3940 3950 3960 3970 3980 39~3 4000 4010 4020 4058 4040 I~ 2C ~AUCG ~AGM, AU~CGAUGACCU~UU~~IIIJAAU~A~UA~'IJ~~L~UCCCC,,~~OAGUGG "~" L V I VV R N H 0 0 L I TV T A T L A L I ~ C T T S PW RVL KO K V$ 0 y y G I p ~A E RQ N $ 0

4060 4(]70 4080 4090 41(](3 4110 4120 4130 4140 4150 4160 4170 4180 4190 4~0 ~ U OC-J~AGAUGACCAA~ U ~ ~AUAGCCJ~UCC/~A/LAGUUC, J ~ ~ U ~ ~

W L K K F T E Pl T N A C K G i'~ E W I k I K I 0 ]( F I E W L K V K I L P E V K E K H E F L ~ R L K 0 L

4210 4770 4230 4240 4250 4260 4~0 4280 4290 4580 4310 4320 4~0 434.(] 4350 ACCACUC1JUGG,iUkAGUCAAAUAGCJ~AUUGAG~GAG~UOJ~AAAG UGA~J~GGAC, C A A ~ A U U O J C 1 J ~ U G U ~ U ~ ~ U ~ ~ ~ P L L E S Q I A T I E 0 S A P S Q S D 0 E Q I F S N V 0 Y F A H Y C R I( y k p I. y k A E A K R V F S

4~0 4370 4380 4390 44(10 ~I0 ~20 4430 4440 4450 4460 4470 4480 ~913 4500 ~U~AUG/~C`e~UUA~UA~GUUC`J~`~GUCC/~A~C~``GUAU~/U~cCUGU~GUCUCU~UGGCA£,CCCJ~~[A~CG(~J~ L E i(K M S N Y I Q F K S K C ~ I E PV C L L L H G S PG AG KSVATN L I G R S L A r K LNS S

4510 4520 4530 4540 4550 4560 4570 4580 ¢590 4600 ~I0 ,I~20 4630 ~ ~58 G G~ UACU ~ACC,~"CAG AU ~ ACC.J~U U U ~A~G U UA~.a.e~Cj~ U ~UCj~Ucj~U~C~AC~ ~ UG C~j~G~/~.C.GU~G A ~ LTG.UCITd ~ U UCI~G ~L~.G [ICLICC,,I£~(~ UU(~ACU ~ ~ U V Y S I P P 0 ~ D H F 0 G Y K 0 0 A VV I PI 0 0 L C Q N P OG KOV S L F C 0 H V S S V 0 F V p p M

~0 4670 ~0 4690 4700 4710 4720 4730 47~ 4750 4760 4/'/0 4780 4790 4800 C-GCGGCGC~/~GA/IJIN~r~CAUUCIJAU ~UIJOGU~UCC.ACCAj~UU~ ~ M C , J~U~UTdUCCJ~U A A L E E K G I L F T S P F V L A S TN A G S V N A P T VS 05 RA L V R R F H F O HN I E V V$ ~I

4010 ~ 4830 ~ ~58 ~ ~7~) 4080 ~ 4900 4910 4920 4930 4940 4~0 GUAUAGC~,AGAA~GUAAGAUCAA~UGCCUA~ U U A N U ~ A U G U G A U G A G G A G U G U U G ( ~ G U C A A C U U C , ~ U ~ G ~ ~ ~ ~ ~ ~

YS QN G K I N M P~AV K TC 0 E ECC P V N F K KCCP L VCG KA I 0 F I 0 RR T~ VR Y$ I.

4960 4970 4980 4990 5000 5010 5029 t 3A 5050 5060 58~ 5080 5090 5100 / ~ A C J ~ U U ~ A N ~ O U ~ U A C J U ~ ~ U ~ ~ DT~ L V T E M F R E Y N H R H S VG A T L E A L F ~G P PVY R E I K I SVA p E T p p p P AVAO

5110 5120 5130 5140 5150 5160 5170 5180 5190 ~ 5210 5220 5230 52~ NO CUIJACUGAAAUCJ~GUAG~ ~U~/~£dJ~~I~GCUUAU~ UCC4~UU(~C.~£C~UACJ~£dUJ U~UAUGUUUGOU~'IJCUAA(~A~UUUGUC~r~u

L L K $ Y D S E AV R E Y C K E K G ~L I P E V D S T CO I E K HVN RAF I C L~ AI. TT F V$ V

$260 5270 5280 $290 t 3B (VPg) 5320 5~0 5340 $350 S360 I~ 3C (protease) 5~0 u ~ o ~ o ~ o = ~ = ~ . ~ u u ~ u , ~ o u ~ . ~ u ~ . ~ , x u ~ ~ ~ AG I I Y I I Y K LFAG F ~ G A Y T G H PNO K P K V P T L R ~ A K V Q G P A FE F A V A M M K R

5410 5420 5430 5440 5450 5460 5470 5480 5490 5500 5"510 5520 5"530 5'540 5550

N A S TV K T E Y G EF TIt LG I Y 0 R W AV L P R HAK PG PTI LNN 00 V VG V LO AK ELV

55~ 55"70 5580 5590 5600 5610 5620 5630 5640 5660 5660 5670 5680 5690 5700

O K 0 G T N L E L T L L K L N R N E K F R O I R G F L A R E E V E AII E A V L A I N T $ K F p N ply

5710 ~ ~ 5"740 5750 5760 5"/'70 5780 5790 58~ 5810 5820 58.30 5840 ~ 0 CAOAC~GUGGGCCr.,GGUAACCOu~UAUG~ ~ ~J~ ~ ~ ~ ~ ~ ~ m ~ ~ ~ ~ ~ 0 ~

Z P V G R V T O Y G F I N L G G T P T K R PIL ~ Y N F P T R A G ~ C Q G V L ~ $ T O K V L G [ H V 6 I I

G N G H O G F S A A L L R N Y F N E E 0 G E I E F V E S S ~ O A Q F P V I N T p $ K T K L E P S V F

6010 6020 6030 6040 6050 6060 ~ 6080 ~ 61~ 6110 61~ 6130 61~ 6150 ur.J~cr, A c G o ~ ~ ~

H H VF E GN K E P AV L R N G 0 P RL K A N F E E A I F SKy IGIIVN THV 0 EyB MEAVOY

Page 6: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

924 T. I N O U E , T. S U Z U K I AND K. S E K I G U C H I

6160 6170 6180 6190 6200 6210 6220 6230 6240 6250 6260 6270 62,80 6290 6300 U UAUGCAC.O ~ UA~CJXC U G G/kCJ~G~4VCOCJ~.ICAr~(~J~ AU(ICi)G ~UAiF~GC,~b'IJO ~ C, CtlCG ASkC-OJ~CII~ A~X~t)~ UGC,AC.~U UACOu"t!~UGI~C-GOCCU G~GUALk~Jk/VB~AG AC-,~ U

Y A G O. L A T L 0 I S T E P B K L E ~ ~ V Y G I E G L E A L D t I T 8 ~ G Y P Y V ,~ L G I '( ~( R 0 I

6310 6320 6330 6~0 6350 6360 6370 6380 6390 6400 6410 6420 6430 6440 6450

I S K K T ~ 0 t T K L K E C H O K Y G LN L P MV T Y V KO EL RS AO KV AK G KS RL I E AS S

6460 6470 6480 6490 6500 6510 6520 6530 6540 6550 6560 6570 6580 6590 6600

L N D 8 V A Yl R Q T F G N L Y K T F H L N P G I V T G $ A V G C 0 P 0 V F kJ S K I P V H L D G H L I

6610 6620 6630 6640 6650 6660 6670 6680 6690 6700 6710 67'20 6730 6740 6750 A~UU~UI~J&~AUGACGC~J~UG~GUUU~U~~~A~UA£,A U~/~JAO(~J~U~UUOCCAC~A~GU~/C.JU~.J~

A F D Y S G Y D A S L S P V V F TC l K L L L E K L G Y T H K E T N Y I 0 Y L 0 N S H H L Y ~ 0 K H

6760 6770 6780 6790 6800 6810 6820 6830 6840 6850 6860 6870 6880 6890 6900 C U A C U U U ~ U G A G G G G C G G C A U G C C A U C A G G A U G C U C A G G C ~ A C U / ~ A U A U U U A A U U C C A U G A U ~ U C A U M ~ ~ U U ~ U ~ ~ ~

Y F V R G G M P S G C S G T S I ~ N S M I N N I I [ R T l M t l(V Y KG I D l 0 Q F R M I A Y G 0 0

6910 6920 6930 ~40 ~50 6960 6970 6980 6990 7000 7010 7020 7030 7040 7050

V I A S y p W p IDA S LL A E k G K G Y G L I MTP A D KG ECF N EVTQTN V T F L K R Y F R

7060 7070 7080 7090 7100 7110 7120 7130 7140 7150 7160 7170 7180 7190 7200 GG~GAU~&~GUACCCAUUUU~GUCC~I/GCCk~UG~AUAU~tK)CAUUAGGUGG/~II~~~~~~A~

A 0 E Q y p F L V H P V M P M K 0 I H E 8 I R W T K 0 P K N TQ 0 H V R$ L CL L A WHN G E H E Y

7210 7220 7730 7240 7250 7260 7270 7280 7299 7300 7310 7"520 7330 7340 7"550 ~ u t n t ~ u ~ u A J ~ G A ~ ~ ~ ~ ~ ~ G A O t J C ~ I J b ~ J U k A I U ~ U ~ U A ~ G U O k A U C ~ ~ t k l M

E 8 F I R K I R S V P V G RC L S L P k F S T L R R KW L 0 S F . . . . "

7360 7370 7380 7390 7400 Ctr, AACWGAgAA~UC-Cr~UAg~UAAAU~C~UAU~G -- p ol y ( A )

Fig. 2. The nucleotide sequence of the SVDV genome and the predicted amino acid sequence. The R N A sequence is deduced from the D N A sequence of cDNA. The putative cleavage sites determined by al ignment with CBs and PV1 are shown by arrows.

sequencing was performed using a 32p-labelled synthetic primer (nucleotides 91 to 110) as described in Methods.

The complete nucleotide sequence and predicted amino acid sequence of SVDV genome are shown in Fig. 2. The total size of the SVDV genome excluding the poly(A) tract is 7400 nucleotides. The base composition of the SVDV genome without the poly(A) tail showed a high adenine content (A, 28-1~; G, 25.2~; U, 22.6~; C, 24.1~) and a low frequency of the dinucleotide C-G (3.7~), which are both consistent with other enteroviruses (Toyoda et al., 1984; Iizuka et al., 1987).

Computer-aided translation of the RNA sequence in all three reading frames revealed a single long open reading frame (ORF) consisting of 6555 nucleotides (2185 triplet codons). The ORF starts at an AUG codon (nucleotide position 743) and ends at a UAA codon (nucleotide position 7298).

Comparisons among the genomes of CB1, CB3 and CB4 were carried out at the nucleotide sequence level for the non-coding regions, and at the amino acid sequence level for the coding regions, and these are shown schematically in Fig. 3 and 4 respectively. The predicted amino acid sequence of the coding region of SVDV exhibits a high degree of homology to CBs (90.0 ~, 88.1~ and 88.3~ to that of CB1, CB3 and CB4 respectively) and also shows homology to poliovirus type 1 (PV1) (57.5~). This high degree of homology has enabled us to predict probable cleavage sites between the majority of the viral proteins. All of the proteolytic cleavage sites in PV1 and CBs are conserved in SVDV with the exception of that at the 1D/2A and 2B/2C junctions (Table 1). For the lD/2Ajunction, a T/G pair in SVDV is conserved in CB1 but not in PVI and CB4. For the 2B/2C junction, Q/N in SVDV is conserved in CB1 and CB4 but not in PV1. Lindberg et al. (1987) have proposed that in CB3 the 2B/2C cleavage occurs either at a Y/G or a Q/N pair. We therefore conclude that in SVDV Q/N and T/G pairs probably function as the cleavage sites for 2B/2C and 1D/2A junctions, respectively, although N- and C-terminal protein sequence analysis is required in order to identify the cleavage sites precisely.

Page 7: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

Nucleotide sequence o f S VD V 925

5' Non-coding

1 100 200 300 400 I t t 1 t

i n | l i D , ; | l i l i J I B , £ t m m D i n R l | H | U l i l l l i l l | m | n IIIH l i L I B m P

i i l u i d i idle g • i L l U l U m l i l i m i i J E C B D D B B I a l B B B K U . i B i n a H u m

i g N O r e D m iN m m l ~ i ~ i 3 DO ~ ic~mmmommmm [:~ m ~ : : : ~ m m m i

i m am m m m ~ ~ ~ m C:~ ~ ~ m ~ = m m m m ; ' m e i m m l m m m

m i i m .

3' Non-coding

500 600 700 7300 7400

t I I I 1

I l l i i l

011 I

n o / l l l I W U l • U l l l l l l l J k l L l i l l l III l • I l l I I I I I I III • n l l l i w D i l l | i u l m l l l

l C : i I t i i ~ r - h E i e l ] i ~ I n

Fig. 3. Alignment comparison of nucleotide sequence homology of the non-coding regions of SVDV and CBs genomes [the details for CB1 are taken from Iizuka et al. (1987), for CB3 from Lindberg et al. (1987) and for CB4 from Jenkins et al. (1987)]. The nucleotides are arranged to obtain the highest level of homology. Positions of common sequences among SVDV and CBs are indicated as closed boxes. Positions of common sequences only among CBs are indicated as open boxes. The positions of nucleotides that are common among CBs but not SVDV are indicated by small arrows. The uppermost band indicates homologous regions in which more than one nucleotide is common, the second more than three, the third six, the fourth nine and the fifth 12 nucleotides. Nucleotide numbers are indicated above the uppermost bands. The comparison was performed using the Microgenie program.

Table 1. Comparison o f the predicted cleavage sites o f the S VD V polyprotein with those o f CB1, CB3, CB4 and PVI*

Boundary SVDV CB1 CB3 CB4 PVI

1A (VP4)/I B(VP2) N/S N/S N/S N/S N/S 1B (VP2)/IC(VP3) Q/G Q/G Q/G Q/G Q/G 1C (VP3)/1D(VPI) Q/G Q/G Q/G Q/G Q/G 1D (VP1)/2A (protease?) T/G T/G Q/S? Y/G Y/G

Z/R? Y/R? Q/N?

2A (protease?)/2a Q/G Q/G Q/G Q/G Q/G 2B/2C Q/N Q/N Y/G ? Q/N Q/G

Q/N? 2C/3A Q/G Q/G Q/G Q/G Q/G 3A/3B (VPg) Q/G Q/G Q/G Q/G Q/G 3B (VPg)/3C (protease) Q/G Q/G Q/G Q/G Q/G 3C (protease)/3D (polymerase) Q/G Q/G Q/G Q/G Q/G

* Details taken from: Iizuka et al. (1987) for CB 1 ; Lindberg et al. (1987) for CB3; Jenkins et al. (1987) for CB4; Kitamura et al. (1981) for PV1.

The 5' non-coding region

The 5' non-coding region of SVDV consists of 742 nucleotides, a number comparable to that of PVs (742 to 744) and CBs (740 to 743). The overall homologies of this region to CBs and PVs are 82 to 83 ~o and 68 to 70~o respectively (Table 2). An alignment comparison of SVDV with three CBs indicates similar features to those shown by PVs (Toyoda et al., 1984). From these features, the 5' non-coding region can be divided into three classes of sub-sequence, namely

Page 8: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

P1

P2 P3

1000

I

I [

I I

I I

il I1

00

'200

|I

A (

VP

4)

iiB

(V

P2)

l

or

el

ei

a

m

illill

mill

Be

II mail

m i

m~

i

I i

N

i leO

Il

lu

me

0

m)

ia

ll

mi

INN

Bi

n

In

l

lm

a

lm

a

mm

m m

e

H/

m

i

i iB

m

1500

20

00

2500

30

00

I i

130~

) ,

i ,4

010

i i

1506

i

I '6

00

i i

'700

I

i 18

0b

i

llC

(V

P3)

I1

D (

VP

1)

I II

II

I ~

ll

ll

l

I ~1

I

Ill

I I

Illl

II

III

I I

i I

,m

g I

IJ

li

ml

l

~m

I.

i I.

MII

i

lO

ll

I

Ill

I ii

i H

il

l

il

l

i I

~ l

i I

II

HI

am

It

l

i

Ola

illi

l I

..

..

..

..

..

..

i :

mi

d

, :

..

..

..

..

..

..

..

.

: .

..

..

..

..

.

: .

..

..

..

..

.

. .l

i

i e

lm

I I

imam

~mmm

ameRinD

i imummmi

me

aim

mn

mE

if

n

c:~

~ i

me

i

~ immmmim

n i

IQ

I

i i

N

il

I

+ m

+ +

+ ~

+ +

#+

+ +

3500

40

00

I 1'9

010

I i

I t

i I

i J

'I00

0 'l I00

12

A (

prot

ease

?)

12B

: I

2C

illl

ml

ill

il

l

am

DI

ml

i

l

J

ni

le

a

lu

oe

mm

.m

il

rome

im

am

u m

e

i q

ml

ii

im

ai

ml

l

mg

ii

n~ml

m C

LU

BS

R

Hi

i

Na

B

-=

.

..

..

PG

DC

GG

ILR

CQ

HG

4500

50

00

, ,

I ,

, ,

, I

'120

0 '1

300

'140

0

BR

im

i mmmmmmmi

ii

l

i

,imlnmm

+++

+

5500

60

00

6500

I '

'150

0 '

1,16

60

, ,

,176

0 I

, ,

, ,

I ,

, ,

, q

800

~ 190

0 '2

000

~3A

I 13

B13

C (

prot

ease

) 13

D (

poly

mer

ase)

i

ll

i~

iH

li

ll

i

ii

Ni

Oi

li

~

li

lR

IL

IN

gl

l

li

gi

iN

ig

Hi

l

ida

D

H

I R

il

e

in

gl

e

&in

n

il

m

ai

n

in

i i

N

iN

il

li

Ki

l

..

..

..

..

..

.

: =

'=:-

=

;=._

i

el

l

ie

I

ll

e

l

i ell

m~

m i

i am

) l

mn

am mm~ m

um

mm

a a

m G

OC

GG

V

HV

GG

NG

G

DD

7000

1

I I

I

'210

0

ii__ "

:'::

m t

Fig

. 4.

P

redi

cted

am

ino

aci

d se

quen

ce h

omol

ogy

of p

olyp

epti

des

amo

ng

SV

DV

and

CB

gen

omes

, re

pres

ente

d as

in

Fig

. 3.

Con

serv

ed a

min

o a

cids

am

on

g S

VD

V

and

CB

s ar

e in

dica

ted

as c

lose

d bo

xes.

Am

ino

aci

d se

quen

ce b

lock

s of

hom

olog

y on

ly a

mo

ng

CB

s ar

e in

dica

ted

as o

pen

boxe

s. P

osit

ions

of

amin

o a

cids

tha

t ar

e co

mm

on

to C

Bs

but

not

SV

DV

are

ind

icat

ed b

y sm

all a

rrow

s at

the

bot

tom

. T

he u

pper

mos

t ba

nd i

ndic

ates

hom

olog

ous

regi

ons

in w

hich

mor

e th

an o

ne a

min

o a

cid

is

com

mon

, th

e se

cond

mor

e th

an t

wo,

the

thi

rd m

ore

than

fou

r, t

he f

ourt

h m

ore

than

six

, an

d th

e fi

fth

mor

e th

an e

ight

. T

he n

ucle

otid

e an

d am

ino

aci

d re

sidu

e nu

mbe

rs a

re i

ndic

ated

abo

ve a

nd b

elow

the

sca

le b

ar r

espe

ctiv

ely.

Pro

pose

d fu

ncti

onal

ly i

mpo

rtan

t co

nser

ved

sequ

ence

s ar

e sh

own

belo

w P

2 an

d P

3 (s

ee t

ext)

. D

etai

ls a

re t

aken

fro

m t

he s

ame

sour

ce a

s th

ose

in F

ig.

3.

Page 9: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

Nucleotide sequence o f S V D V 927

Table 2. Nucleotide sequence homology between non-coding region o f S V D V and other picornaviruses*

Nucleotide region CB1 CB3 CB4 PVI PV2 PV3 BEV HRV2 HRV14 HAV EMCV TMEV FMDV

5'Non-coding 82.9 83.6 82.0 67.9 69.0 69.7 56.1 53.7 54.0 42.5 - 31.8 - 3'Non-coding 79.8 76.9 81.9 37.9 36-9 35.9 33-7 24.0 16.5 33.0 36.6 31.2 35.8

* Sequence homology is expressed as percentages. The percentage homology among CBs and PVs of the 5' non- coding regions are 87-2 + 5.7 and 83.i _+ 2.3 respectively; 3' non-coding regions are 89-6 _+ 3.5 and 98-I +_ 0-8 respectively. Details of the other virus sequences are taken from: Iizuka et al. (1987) for CBI; Lindberg et al. (1987) for CB3; Jenkins et al. (1987) for CB4; Kitamura et al. (1981) for PV 1 ; Earle et al. (1988) for BEV; Callahan etal. (1985) for HRV14; Skern etaL (I985) for HRV2; La Monica etal. (1986) for PV2; Stanwayetal. (1984b) for PV3; Forss et al. (1984) for FMDV; Palmenberg et al. (1984) for EMCV; Pevear et al. (1987) for Theiler's murine encephalomyelitis virus (TMEV); Najarian et al. (1985) for hepatitis A virus (HAV).

highly conserved regions (nucleotides 1 to 90, 430 to 490 and 510 to 564), highly variable regions (nucleotides 120 to 170 and 675 to 742) and the remainder, relatively variable regions (Fig. 3).

The highly conserved regions demonstrate much higher homology than the overall homology and are considered to be important to some aspects of viral replication (Toyoda et al., 1984; Kuge & Nomoto, 1987). The highly variable regions show a low degree of homology between SVDV and CBs. The region next to the VP4 coding region (nucleotides 675 to 742) has no SVDV-specific nucleotides that are different from the nucleotides conserved among CBs (indicated by small arrows at the bottom of Fig. 3). The corresponding region is low in homology among the genomes of CBs and PVs and is missing in the genome of human rhinoviruses (HRVs) (Callahan et al., 1985; Skern et al., 1985; Stanway et al., 1984a). This region of the genome has been considered to be a possible spacer (Toyoda et al., 1984). Mutations (insertion or deletion) in this region had no effect on viral replication and therefore it may not play an essential role in replication in cell culture (Kuge & Nomoto, 1987). Another highly variable region (nucleotides 120 to 170) includes several SVDV-specific nucleotides. In the relatively conserved region most of the common sequences are conserved as short blocks. However, comparison of homology among CBs except SVDV shows additional blocks of homology. These blocks are indicated as open boxes in Fig. 3 and seem to represent differences between SVDV and CBs. The relatively variable region containing these blocks may determine some specific characteristics that distinguish SVDV from CBs.

Two specific nucleotide sequences, the U-rich region (nucleotides 562 to 576) and the eight base consensus sequence ( C U U A U G G U ; nucleotides 587 to 594) are conserved in this region and might act as important signals to maintain the rates of viral replication (Kuge & Nomoto, 1987).

The predicted stem-loop structure close to the 5" end of the R N A of CB3 (Tracy et al., 1985) is conserved in SVDV. More R N A secondary structure was predicted by a computer search. There are nine potential translation start codons positioned before the codon that initiates the large O R F at nucleotide position 743. However, the potential translation products initiated from these A U G s are unlikely to function in viral replication because of the short ORF (two to 50 amino acids), the lack of conservation in the size and position among other picornaviruses, and the absence of efficient flanking sequences for ribosome recognition (Kozak, 1986).

Structural proteins

The P1 region of the polyprotein consists of four capsid proteins, 1A (VP4), 1B (VP2), 1C (VP3) and 1D (VP1). The amino acid sequence similarities between the predicted protein of SVDV and those of CBs, PVs and other picornaviruses are shown in Table 3. The comparative amino acid sequence homology of the whole polyprotein between SVDV and CBs is shown in Fig. 4. The homology of the P1 region to other enteroviruses is lower than that of the P2 or P3 regions. VP4 is the most conserved of the structural proteins; it is not exposed on the outer surface of the virion and there is probably little immune pressure to mutate. In the other three

Page 10: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

928 T. INOUE, T. S U Z U K I AND K. S E K I G U C H I

Table 3. Amino acid sequence homology between the proteins of S VD V and other picornaviruses*

Protein CB1 CB3 CB4 PV1 PV2 PV3 BEV HRV2 HRV14 HAV EMCV TMEV F M D V

1A (VP4) 92-8 91.3 91-3 65.2 65.2 66.7 50-7 56.5 58.6 - 14.6 16-3 12.5 k J

(95-2 + 3-0) (93-7 + 1-7) IB (VP2) 80-6 80.6 82.8 50.2 55.4 52.6 52-7 52.8 65.2 - 25.8 31-0 24.4

k _ _ - - - / k - - - - y W

(79.4 _+ 2.0) (78.4 + 3.7) 1C (VP3) 84-5 82.8 78.6 52.9 54.6 54.6 46.5 50.2 48.5 29.9 30.6 26.7

k - - ,/ k - - - - ~r Y"

(77-9 _+ 1-0) (84-6 _+ 1-5) 1D (VP1) 77.1 73.5 70.1 43.6 44.5 42-5 35.9 37.9 36.6 - 17.8 15.9 16.8

k _ _ J

(73.9 + 4-0) (76.1 + 2.5) 2A (protease?) 91-3 82.1 86-7 56.3 57.6 58.3 62-0 34.7 42.1 12-7 10.6 14.4 7.3

k _ _ J

(87-4 + 3-0) (92-0 + 1-2) 2B 94-9 93-9 93.9 48.5 51.5 50.5 63.6 39.4 55.6 14-6 18.2 20-5 18.8

~r -v

(97-3 + 0.6) (90.7 + 3.1) 2C 98.8 96.4 97.0 60.9 63-0 62.4 61.9 45.8 58.3 25.2 30.9 30-4 29.6

(97-6 + 0.6) (96.0 +_ 0.4) 3A 91.0 92.1 91-0 48.3 50.6 49.4 48.3 33.7 51.7 15.7 12.8 16.0 12.4

k J

(92.5 + 1.7) (97.3 + 0.6) 3B (VPg) 100-0 90.9 100-0 77.3 72.7 72-7 34-8 40.9 52.2 12.0 36.4 27.3 29-2 + 7.2

v ~ y

(93-9 + 5.3) (94.0 + 2.7) 3C (protease) 94.5 92.9 95-6 54.9 61.2 60.7 59.8 48.1 53.0 16.8 24.2 19-8 21.2

k _ _ ) , ( y

(95.5 + 1.3) (91.6 + 3.6) 3D (polymerase) 95.7 95.0 96.3 73.6 74-2 74.0 70.0 56-3 64.7 24.4 30.9 32.2 30.9

k _ _ ) -¢- y

(95-0 + 0.4) (97-7 _+ 0.3)

* Sequence homologies are expressed as percentages. The values in parentheses represent the percent homology among CBs or PVs. Details of the other virus sequences are taken from the same sources as indicated in Table 2.

structural proteins, VP 1, VP2 and VP3, the highest degree of variation occurs in VP 1. In spite of divergence in the sequence of this region, highly conserved regions are distributed as blocks in the variable region. A similar distribution of highly conserved blocks is observed in the comparison between SVDV and PV 1 (Fig. 5 e). This pattern is very similar to the alignment comparison among PVs (Toyoda et al., 1984). The amino acid sequence of SVDV is aligned with that of PV1 to give maximal sequence homology. The hydrophilicities of SVDV and PV1 polyprotein sequences are represented together in Fig. 5 and are very similar. Computer- predicted protein secondary structure of SVDV is similar to that of PV1 determined by crystallography by Hogle et al. (1985). The/3-sheet structures which are well conserved in SVDV and PV1 almost totally overlap on the highly conserved blocks and hydrophobic valleys of the hydrophilicity profile. The structurally essential regions, especially the fl-sheet forming/~-barrel structures which form the viral core structure, seem to be common to SVDV and PV1. Thus the SVDV virion seems to have a very similar structure to PV 1. From the similarity of the structures, we predicted the antigenic determinant sites of SVDV on the basis of the secondary structure of PV 1. These regions, which have hydrophilic and poorly conserved properties, are located on the outer surface of virion and at loop-out sequences. The antigenic sites are marked as 'cl'; other structurally important regions, such as the canyon floor which is situated near the receptor, and the hinge which connects each subunit, are also indicated in Fig. 5.

Page 11: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

1 A (

VP

4)

(")j

--~

"

v't

L

1 50

(b

) .

..

.

: ,

1 50

(c

) ~

-

--

(d)

1B (

VP

2)

cl2b

cl

3bl

c13b

2 -]

100

--

200

300

~"

L-3

100

200

300

t-,,,,

,~l,,,

,,,~

1,4..1

,,,,,,

,,~

t--,,I

.....t

......

I, I

i=-I

In

tern

al i

nse

rtio

n

}-...

-',=,

,t ~

I II

[

A1

A2

B

C

D

E1

F G

H

I

1C (

VP

3)

c12c

(a)

- ~,

,t~

" -

~'~

--=.

, "

", .

....

:',

"

&

~ "-

,"r

....

....

(b)

; .

, ,

. ,~

,

? ~

__

400

__

50

0 ..

....

....

....

...

(c)

.,., =

Inte

rnal

(d

) in

sert

ion

~ ~

....

..

P'"

I '

I ~

I-,.

,,~

,,,~

~

~ ~

B

C

D

E1

F G

H

I

ID (

VP

I)

clld

1

cl I

d2

cl3d

cl

4d

{ w.:-

....c

:_:.~

~

_,~

.~.,

^ ~.

~',

~A

~,

A

..~

~-.

~ ~

_~

~.

~_

~i

~

600

700

8~

~

- 0~

. ~

L

(b)

~ -

- ~

-~

- 60

0 70

0 80

0 (c

) .

..

..

..

--

"'

::

..

..

..

..

..

.

: ""

Inte

rnal

in

sert

ion

inte

rnal

(d)

~ ~

B

~ ~'~

C ~ ..

..... ~

D

~E

1 ~ ..

......

~F

t

G

I in

sert

ion

J'~

I

I I

Fig

. 5.

C

ompa

riso

n of

pre

dict

ed s

truc

ture

of

caps

id p

rote

ins

betw

een

SV

DV

and

PV

1 M

aho

ney

(K

itam

ura

et

al.,

1981

). (

a) C

om

par

iso

n o

f hy

drop

hili

city

bet

wee

n S

VD

V a

nd P

V1

(GE

NE

TY

X:H

YD

O

prog

ram

). S

olid

lin

e in

dica

tes

SV

DV

and

dot

ted

line

ind

icat

es P

V1.

Pos

itiv

e va

lues

rep

rese

nt h

ydro

phil

ic r

esid

ues

and

nega

tive

val

ues

repr

esen

ts h

ydro

phob

ic o

nes.

(b)

The

am

ino

acid

nu

mb

ers

of P

V 1

and

SV

DV

are

ind

icat

ed a

bove

and

bel

ow t

he li

ne r

espe

ctiv

ely.

Th

ey a

re a

rran

ged

to s

how

am

ino

acid

hom

olog

y an

d si

mil

arit

y be

twee

n S

VD

V a

nd P

V1.

Com

pari

son

was

per

form

ed b

y th

e M

icro

geni

e pr

ogra

m.

(c)

Pre

dict

ed a

min

o a

cid

sequ

ence

ho

mol

ogy

of th

e P1

reg

ions

of

SV

DV

and

PV

1. T

he f

igur

e is

rep

rese

nted

as

in F

ig.

4. H

omol

ogy

anal

ysis

was

per

form

ed b

y th

e M

icro

geni

e pr

ogra

m.

(d)

The

pro

tein

se

cond

ary

stru

ctur

e of

PV

1 (

soli

d li

ne,

fl-s

heet

; br

oken

lin

e, a

-hel

ix).

The

lin

e re

pres

ents

the

X-r

ay c

ryst

allo

grap

hic

anal

ysis

by

Hog

le e

t al

. (1

985)

. T

hese

dat

a ar

e al

igne

d ac

cord

ing

to a

min

o a

cid

hom

olog

y be

twee

n S

VD

V a

nd P

V1.

Str

uctu

rall

y an

d an

tige

nica

lly

impo

rtan

t si

tes

are

also

ind

icat

ed.

Th

e ab

brev

iati

on '

cl' m

ean

s cl

uste

r of

neu

tral

izin

g an

tibo

dy d

eter

min

ants

as

desc

ribe

d by

Hog

le e

t al

. (1

985)

. A

lpha

beti

c ch

arac

ters

ind

icat

e th

e fl

-she

et s

truc

ture

s as

des

crib

ed i

n R

oss

man

n e

t at

. (1

985)

.

e~

¢%

t-J

~D

Page 12: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

930 T. INOUE, T. SUZUKI AND K. SEKIGUCHI

Non-structural proteins

The P2 and P3 regions are highly conserved among enteroviruses, especially between SVDV and CBs. Except for 2A, all the proteins have more than 90~ homology (Table 3). Protein 2A has been shown to function as a protease responsible for cleavage of the polyprotein 1D/2A at the Y/G cleavage site in PV (Toyoda et al., 1986). The homology of 2A between SVDV and CB1 is higher than that of SVDV to CB3 or SVDV to CB4. This may mean that CB1 and SVDV have the same substrate cleavage site at T/G, although this is not the case for comparisons of CB4 and CB3, and CB4 and PV1 (Jenkins et aL, 1987).

The region near the C-terminal of the 2A protein, P G D C G G X L X C X H G (residues 107 to 119), which is similar to the possible active site of protease 3C, is completely conserved among SVDV, PVs, CBs, bovine enterovirus (BEV) and HRVs (Fig. 4).

A comparison between the 2C protein of SVDV and CBs shows a remarkably high degree of homology (96 to 99~) (Table 3). It is thought that the 2C protein attaches to vesicular membranes and is associated with viral RNA synthesis (Bienz et al., 1983; Takegami et al., 1983), although its exact function is obscure.

The VPg (3B) sequence of SVDV is identical to those of CB1 and CB4 at the amino acid level. It differs from those of CB5 and CB3 by only one and two amino acids respectively (Lindberg et al., 1987).

The 3C polypeptide is a protease that cleaves at certain Q/G sites in the polyprotein. These are highly conserved sequences in the SVDV 3C region in comparison with other picornaviruses. In particular, two stretches of amino acids, G Q C G G V (residues 145 to 150) and H V G G N G (residues 161 to 166), in the C-terminal region of the protein are conserved in the proteases of PVs, CBs, BEV and HRV14 (the other HRVs have YQCGGV in place of the former sequence). The Cys 147 and His 161 residues (SVDV and PV1) are conserved in picornaviruses and plant comoviruses. These residues are possibly part of the active site of the protease, which is considered to be a CYsteine protease (Argos et al., 1984).

In the polymerase region, 3D, the homologies between SVDV and CBs are 95 to 96~, compared to 73.6 ~ between SVDV and PV 1. The homologous regions are distributed along the polymerase region. The amino acid sequence YGDD (residues 327 to 330) is conserved among picornaviruses including SVDV, and in plant comoviruses. The element GDD is also identified in other RNA viruses, for example vesicular stomatitis virus, the haemagglutinating virus of Japan, flaviviruses, plant and bacterial viruses and retroviruses (Kamer & Argos, 1984; Ishihama & Nagata; 1988). These stretches might be a possible active site or nucleic acid recognition region for the polymerase. Two other sequences, PSD (residues 228 to 290) and FLKR (residues 374 to 377), are conserved among picornaviruses and cowpea mosaic virus.

The 3' non-coding region

The translation of the polyprotein RNA is terminated at position 7298 by the sequence UAA, which is followed by a non-coding region of 102 nucleotides prior to the poly(A) tract. The 75 nucleotides next to the 3' poly(A) of the SVDV genome have been reported by Porter et al. (1978) and are identical in our strain of SVDV. The sequence homology of SVDV to other picornaviruses, except CBs, is generally low (Table 2). This is partly a reflection of the difference in sequence length. For example, the number of nucleotides of the 3' non-coding region are 70 to 71 for PVs, 42 to 46 for HRVs, and 125 for encephalomyocarditis virus (EMCV). Although the sequence homologies of SVDV to CBs having nearly the same size of 3' non-coding region (100 to 102 nucleotides) are high (76.9 to 81.9~), the homology value among the different CB serotypes (89.6~o) is higher than that of SVDV to CBs.

The alignment comparison of the 3' non-coding region is shown in Fig. 3. There are some SVDV-specific regions that are different from CBs; these are indicated as open boxes as shown in the 5' non-coding region.

In the 3' non-coding region of enteroviruses, a characteristic RNA secondary structure has been reported by many investigators. For SVDV, a number of RNA secondary structures are possible in this region including part of the 3D region (about 100 nucleotides) as shown in Fig. 6. This region is composed mainly of five independent stem-loop structures (Fig. 6a). These

Page 13: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

(a)

3 ~-~

.~-~

-"~-

'~

73

50

"~'~

-- ~

UG

AACCG

AA

,~"

CC

,CU

AC

CG

CA

C

~

Illl

llJJ

ll

73 O

0 ,,:

C~,

GGGAU

GGCG

U U

~-

n~

'~

GAAAAUAG

U

~,t

op

m _

_ ,~

&C

U

CC

G

A

~

Ill

1 ~n~

v. ~ ~

,~G~C

u I1

1111

I11

,,~

740

0

t2

--

l.

D

l_.~

--

t.

a

5

(b)

Nuc

leot

ide

posi

tion

5

5'

4 4'

St

op

CB

4 72

16

~I£,G

C

~ ~

---

~ -[

(~U

A

U--

- A

[

U

r~

7~

4d~A

~

j4.5,

PVI

7279

~A

AA

U£A

GG

P

GCU

UA

- --

A

A

AC-

A --

O

UA -

--C

CC

C U

A

..

...

~ii~

BE

V

72

64

~

AG

AU

CC

~

U

U

C--

- ~

AG

CC

CC

AU

UI

U

--A

- G

--

~

A

AC

~

..

... C

:!i~i~

HR

V2

6

98

7

C

U

AAG

CU -

--

U

UG

CAC

A

A

C

A G

AUUU

GC

--U

I~

--

- U

AU

A

AA

A .

....

iiiiiiiii

HR

VI4

7097

....

A

A U

---

A

AA~

% U

~ AA

UU

--A

AA

-A

G

UA

UA

G--

- C

GC

CU

UUAG

G ii~

!iiii!

SVD

V

CB

I

CB

3

CB

4

PVI

BEV

HR

V2

7063

.

..

..

..

..

..

..

-~

(~--

"'_

___?

UA

..

..

UAG-

-

..v14

71

69

iiiill

ii~i

iilll

......

......

......

.:~

-~A

c ...

.. ~-

-

7299

-

- ]

-AAA

] ..

...

O U(

; "C

U CA

"

(%

7296

~3

-

[~

3~

Lo

..

...

~, --c

i, ,~

. <

....

. U

-U

AG

U-U

IJ~A

-

U L

-

--

GU

GG

AG--U

UUA-

Fig

. 6.

(a

) C

ompu

ter-

pred

icte

d R

NA

se

cond

ary

stru

ctur

e of

the

3'

ter

min

al

regi

on

of t

he

SV

DV

ge

nom

e.

The

str

uctu

re

was

ca

lcul

ated

by

the

pr

ogra

m

G E

N E

TY

X :

SE

CST

AG

=

- 37

6-1

kJ/m

ol.

(b)

Ali

gnm

ent

of 3

" te

rmin

al r

egio

n of

sev

eral

pic

orna

viru

ses

(det

ails

are

tak

en f

rom

the

sou

rces

ind

icat

ed i

n T

able

2).

Del

etio

ns a

re m

arke

d by

a d

ash

( ) a

nd a

bla

nk s

pace

ind

icat

es n

o ch

ange

fro

m t

he b

ase

in t

he S

VD

V.

Box

es a

roun

d se

quen

ces

indi

cate

bas

es i

nvol

ved

in f

orm

atio

n of

the

pre

dict

ed s

tem

s. S

eque

nce

hom

olog

y an

alys

is w

as p

erfo

rmed

by

the

prog

ram

GE

NE

TY

X

: MA

XH

AM

an

d th

e R

NA

sec

onda

ry s

truc

ture

s ar

e ca

lcul

ated

by

the

prog

ram

GE

NE

TY

X

:SE

CST

.

Page 14: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

932 T. INOUE, T. SUZUKI AND K. SEKIGUCHI

Table 4. Free energy of computer-predicted R N A secondary structure o f the 3" terminal region*

Minimum free energy (kJ/mol) Stem-loop r

no. SVDV CBI CB3 CB4 PVI BEV HRV2 HRV14

It -13-8 -13-8 -22.6 -31-8 -2.5 -51.9 - - 2t -102.1 -87.0 -93.3 -111-3 -40-6 -59-4 -34.7 -19.2 3t -60-7 -0.8 -3.3 -18-4 . . . . 4:~ - 80-8 - 56.5 - 74-5 - 25.9 -138.9 -135.6 5J; -72.4 -41.0 -40.2 -77-8

*Each stem-loop no. correlates with Fig. 6(a). Free energy was calculated by the program GENETYX :SECST.

"~ 3" Non-coding region. :~ 3D protein-coding region.

structures are also predicted for other enteroviruses and rhinoviruses. The alignment and the position of each predicted stem structure are represented in Fig. 6 (b). The free energy of each stem-loop structure is shown in Table 4. The stem-loop no. 2 corresponds to the previously mentioned stem-loop structure and its free energy is very low. This structure is also observed in HRV but its free energy is not always low. The stem-loop no. 3 observed in SVDV is not obvious in CBs, and the corresponding stem-loop is deleted in PV1, BEV and HRVs. Stem-loop no. 4 and 5 in SVDV which are located in the 3D region are also observed in CBs but not in BEV, PV or HRV. The function of the 3' non-coding region has not been determined and the functional significance of its secondary structure is obscure. However, the secondary structure may contribute to R N A replication (Sarnow et al., 1986) and stabilization, such as resistance against attack by endonuclease. On the other hand, these proposed structures in the 3' non-coding region seem to reveal some differences among enteroviruses. In the picornaviruses deletions of insertions that contribute to evolutionary divergence may occur at a stem-loop structure.

SVDV has been found to be closely related to CB5 in its physicochemical, biochemical and serological characteristics, while immunodiffusion, neutralization and R N A hybridization have shown differences between them (Harris et al., 1977). CB5 shows considerable variation (Brown & Wild, 1974), but the variation of SVDV is smaller (Harris & Brown, 1975). Graves (1973) suggested that SVDV was originally derived from human CB5. Garland & Mann (1974) were critical of this hypothesis because they failed to infect SVDV-susceptible pigs with CB5. Knowles et al. (1979) proposed that SVDV should be considered a porcine strain of CB5 on the basis of production of c.p.e, and antigenic analysis. St~lhandske et al. (1984) have also considered SVDV as a variant of CB5. However, the evidence that SVDV is a variant of CB5 is inconclusive.

In this paper we have reported the complete nucteotide sequence of the genome of an SVDV strain of low pathogenicity, and have demonstrated that the genomic organization is similar to that of other picornaviruses. Comparisons of nucleotide and predicted amino acid sequence between SVDV and other picornaviruses indicate that SVDV is closely related to the other enteroviruses and rhinoviruses. The homology between SVDV and CBs is remarkably high and large parts of the sequence are conserved. Although the complete nucleotide sequence of CB5 has not been determined, our results indicate a close relationship between SVDV and CBs.

Whether SVDV is a variant of CB5 wilt not be known until the complete nucleotide sequences of CB5 and other strains of SVDV, including a pathogenic one, are established. Our results also suggest the presence of SVDV-specific sequence blocks, which are conserved among CBs but not in SVDV. As nucleotide sequences, these blocks are located in relatively variable regions of the 5" non-coding region and in the 3' non-coding region. As amino acid sequences, they are observed in the 2A region. The SVDV-specific blocks may hold a clue to understanding not only the relationship between SVDV and CBs but also more precise molecular features of picornaviruses.

Page 15: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

Nucleotide sequence o f S VD V 933

R E F E R E N C E S

ARGOS, P., KAMER, G., NICKLIN, M. J. H. & WIMMER, E. (1984). Similarity in gene organization and homology between proteins of animal picornaviruses and a plant comovirus suggest a common ancestry of these virus families. Nucleic Acids Research 12, 7251-7267.

BIENZ, K., EGGER, D., RASSER, Y. & BOSSART, W. (1983). Intracellular distribution of poliovirus proteins and the induction of virus-specific cytoplasmic structures. Virology 131, 39-48.

BIRNBOIM, H. C. & DOLY, J. (1979). A rapid alkaline extraction procedure for screening recombinant plasmid DNA. Nucleic Acids Research 7, 1513 1523.

BROWN, F. & WILD, V. (1974). Variation in the coxsackievirus type B5 and its possible role in the etiology of swine vesicular disease. Intervirology 3, 125-128.

BROWN, F., TALBOT, P. & BURROWS, R. (1973). Antigenic differences between isolates of swine vesicular disease virus and their relationship to coxsackie B5 virus. Nature, London 245, 315-316.

BROWN, F., WILD, T. F., ROWE, L. W., UNDERWOOD, B. O. & HARRIS, T. J. R. (1976). Comparison of swine vesicular disease virus and coxsackie B5 virus by serological and R N A hybridization methods. Journal of General Virology" 31, 231 237.

BURROWS, R., MANN, J. A. & GOODRIDGE, D. (1974). Swine vesicular disease: comparat ive studies of viruses isolated from different countries. Journal of Hygiene 73, 109-117.

CALLAHAN, P. L., MIZUTANI, S. & COLONNO, R. J. (1985). Molecular cloning and complete sequence determination of R N A genome of h u m a n rhinovirus type 14. Proceedings of the National Academy of Sciences, U.S.A. 82, 732- 736.

DAWE, V. S., FORMAN, A. J. & SMALE, ¢. J. (1973). A preliminary investigation of the swine vesicular disease epidemic in Britain. Nature, London 241, 540-542.

DENG, G. & WE, R. (1981). An improved procedure for utilizing terminal transferase to add homopolymers to the 3' termini of D N A . Nucleic Acids Research 9, 4173-4188.

EARLE, J. A. P., SKUCE, R. A., FLEMING, C. S., HOEY, E. M. & MARTIN, S. J. (1988). The complete nucleotide sequence of a bovine enterovirus. Journal of Genera/ Virology 69, 253-263.

FORSS, S, STREBEL, K., BECK, E. & SCHALLER, H. (1984). Nucleotide sequence and genome organization of foot-and- mouth disease virus. Nucleic Acids Research 12, 6587-6601.

GARLAND, A. J. M. & MANN, J. A. (1974). At tempts to infect pigs with coxsackie virus type BS. Journal of Hygiene 73, 85 96.

GELIEBTER, J. (1987). Dideoxynucleotide sequencing of R N A and uncloned cDNA. Focus 9 (1), 5-8. Gaithersburg: Bethesda Research Laboratories.

GRAVES, J. I~. (1973). Serological relationship of swine vesicular disease virus and coxsackie B5 virus. Nature, London 245, 314-315.

GRUBMAN, M. J., BAXT, B. & BAeHRACH, n. L. (1979). Foot-and-mouth disease virion R N A : studies on the relation between the length of its Y-poly(A) segment and infectivity. Virology 97, 22-31.

HARRIS, T. J. R. & BROWN, r. (1975). Correlation of polypeptide composition with antigenic variation in the swine vesicular disease and coxsackie B5 viruses. Nature, London 258, 758-760.

HARRIS, T. J. R., DOEL, T. R. & BROWN, F. (1977). Molecular aspects of the antigenic variation of swine vesicular disease and coxsackie B5 viruses. Journal of General Virology 35, 299-315.

nENIKOFF, S. (1984). Unidirectional digestion with exonuclease III creates targeted breakpoints for D N A sequencing. Gene 28, 351-359.

HOGLE, J. M., CHOW, M. & FILMAN, D. J. (1985). Three-dimensional structure of poliovirus at 2.9A resolution. Science 229, 1358-1365.

IIZUKA, M., KUGE, S. & NOMOTO, A. (1987). Complete nucleotide sequence of the genome of coxsackievirus B1. Virology 156, 64-73.

ISnlHAMA, A. & NAGATA, K. (1988). Viral R N A polymerases. CRC Critical Reviews in Biochemistry 23, 27-76. JENKINS, O., BOOTH, J. D., MINOR, P. D. & ALMOND, J. W. (1987). The complete nucleotide sequence of coxsackievirus

B4 and its comparison to other members of the Picornaviridae. Journal of General Virology 68, 1835-1848. KAMER, G. & ARGOS, P. (1984). Primary structural comparison of RNA-dependent polymerases from plant, animal

and bacterial viruses. Nucleic Acids Research 12, 7269-7282. KITAMURA, N., SEMLER, B. L., ROTHBERG, P. G., LARSEN, G. R., ADLER, C. J., DORNER, A. J., EMINI, E. A., HANECAK, R.,

LEE, J. J., VAN DER WERF, S., ANDERSON, C. W. & WIMMER, E. (1981). Primary structure, gene organization and polypeptide expression of poliovirus RNA. Nature, London 291, 547-553.

KNOWLES, N. J., BUCKLEY, L. S. & PEREIRA, H. G. (1979). Classification of porcine enteroviruses by antigenic analysis and cytopathic effects in tissue culture : description of 3 new serotypes. Archives of Virology 62, 201-208.

KODAMA, M. (1976). Outline of studies on swine vesicular disease in Japan. Bulletin de l'O~ce International des Epizooties 86, 423 432.

KODAMA, M., OGAWA, T., SAITO, T., TOKUDA, G., SASAHARA, J. & KUMAGAI, T. (1980a). Swine vesicular disease virus isolated from healthy pigs in non-epizootic periods. L Isolation and identification. National Institute of Animal Health Quarterly (Japan) 20, 1-10.

KODAMA, M., SAITO, T., OGAWA, T., TOKUDA, G., SASAHARA, J. & KUMAGAI, T. (1980b). Swine vesicular disease viruses isolated from healthy pigs in non-epizootic periods. II. Vesicular formation and virus multiplication in experimentally inoculated pigs. National Institute of Animal Health Quarterly (Japan) 20, 123-130.

Page 16: The Complete Nucleotide Sequence of Swine … Complete Nucleotide Sequence of Swine Vesicular Disease Virus By TORU INOUE, .1 TAIKO SUZUKI 2 AND KIICHI SEKIGUCHI 2 1Exotic Disease

934 T. I N O U E , T. S U Z U K I AND K. S E K I G U C H I

KOZAK, M. (1986). Point mutat ions define a sequence flanking the A U G initiator codon that modulates translation by eukaryotic ribosomes. Cell 44, 283-292.

KUGE, S. & NOMOTO, A. (1987). Construction of viable deletion and insertion mutan ts of the Sabin strain of type 1 poliovirus; function of the 5' noncoding sequence in viral replication. Journal of Virology 61, 1478 1487.

LA MONICA, N., MERIAM, C. & RACANIELLO, V. R. (1986). Mapping of sequences required for mouse neurovirulence of poliovirus type 2 Lansing. Journal of Virology 57, 515-525.

LINDBERG, A. M., ST,~LHANDSKE, P. O. K. & PETI'ERSON, U. (1987). Genome of coxsackievirus B3. Virology 156, 50-63. MANIATIS, T., FRITSCH, E. F. & SAMBROOK, J. (1982). Molecular Cloning: A Laboratory Manual. New York: Cold

Spring Harbor Laboratory. MAXAM, A. M. & GILBERT, W. (1977). A new method for sequencing DNA. Proceedings of the National Academy of

Sciences, U.S.A. 74, 560-564. MOWAT, G. N., DARBYSHIRE, J. H. & HUNTLEY, J. F. (1972). Differentiation of a vesicular disease of pigs in Hung

Kong from foot-and-mouth disease. Veterinary Record 90, 618 621. NAJARIAN, R., CAPUT, D., GEE, W., POTTER, S. 2t., RENARD, A., MERRYWEATHER, J., NEST, G. V. & DINA, D. (1985).

Primary structure and gene organization of human hepatitis A virus. Proceedings of the National Academy of Sciences, U.S.A. 82, 2627 2631.

NARDELLI, L., LODETTI, E., GUALANDI, G. L., BURROWS, R., GOODRIDGE, D., BROWN, F. & CARTWRIGHT, B. (1968). A foot and mouth disease syndrome in pigs caused by an enterovirus. Nature, London 219, 1275-1276.

NOMOTO, A., KAJIGAYA, S., SUZUKI, K. & IMURA, N. (1979). Possible point mutat ion sites in LSc, 2ab poliovirus R N A and a protein covalently linked to the 5'-terminus. Journal of General Virology 45, 107 117.

PALMENBERG, A. C., KIRBY, E. M., JANDA, M. R., DRAKE, N. L., DUKE, G. M., POTRATZ, K. F. & COLLETT, M. S. (1984). The nucleotide and deduced amino acid sequences of the encephalomyocardit is viral polyprotein coding region. Nucleic Acids Research 12, 2969-2985.

PEVEAR, D. C., CALENOFF, M., ROZHON, E. & LIPTON, H. L. (1987). Analysis of the complete nucleotide sequence of the picornavirus Theiler 's murine encephalomyelitis virus indicates that it is closely related to cardioviruses. Journal of Virology 61, 1507-1516.

PORTER, A. G., FELLNER, P., BLACK, D. N., ROWLANDS, D. J., HARRIS, T. J. R. & BROWN, F. (1978). 3 '-Terminal nucleotide sequences in the genome R N A of picornaviruses. Nature, London 276, 298-301.

ROSSMANN, M. G., ARNOLD, E., ERICKSON, J. W., FRANKENBERGER, E. A., GRIFFITH, J. P. HECHT, H. J., JOHNSON, J. E., KAMER, G., LUO, M., MOSSER, A. G., RUECKERT, R. R., SHERRY, B. & VRIEND, G. (1985). Structure of a human common cold virus and functional relationships to other picornaviruses. Nature, London 317, 145-153.

SANGER, F., NICKLEN, S. & COULSON, A. R. (1977). D N A sequencing with chain-terminat ing inhibitors. Proceedings of the National Academy of Sciences, U.S.A. 74, 5463-5467.

SARNOW, P., BERNSTEIN, H. D. & BALTIMORE, D. (1986). A poliovirus temperature-sensitive R N A synthesis mutan t located in a noncoding region of the genome. Proceedings of the National Academy of Sciences, U.S.A. 83, 571-575.

SKERN, T., SOMMERGRUBER, W., BLAAS, D., GRUENDLER, P., FRAUNDORFER, F., PIELER, C., FOGY, I. & KUECHLER, E. (1985). H u m a n rhinovirus 2: complete nucleotide sequence and proteolytic processing signals in the capsid protein. Nucleic Acids Research 13, 2111 2126.

ST,~d.HANDSKE, P. O. K, LINDBERG, M. & PETI'ERSSON, U. (1984). Replicase gene of coxsackievirus B3. Journal of Virology 51, 742-746.

STANWAY, G., HUGHES, P. J., MOUNTFORD, R. C., MINOR, P. D. & ALMOND, J. W. (1984a). The complete nucleotide sequence of a common cold virus: h u m a n rhinovirus 14. Nucleic Acids Research 12, 7859-7875.

STANWAY, G., HUGHES, P. J., MOUNTFORD, R. C., REEVE, P., MINOR, P. D., SCHILD, G. C. & ALMOND, J. W. (1984b). Comparison of the complete nucleotide sequences of the genomes of the neurovirulent poliovirus P3/Leon/37 and its attenuated Sabin vaccine derivative P3/Leon 12a I b. Proceedings of the National Academy of Sciences, U.S.A. 81, 1539 1543.

TAKEGAMI, T., KUHN, R. J., ANDERSON, C. W. & WIMMER, E. (1983). Membrane-dependent uridylylation of the genome-linked protein Vpg of poliovirus. Proceedings of the National Academy of Sciences, U.S.A. 80, 7447-7451.

TOYODA, H., KOHARA, M., KATAOKA, Y., SUGANUMA, T., OMATA, T., IMURA, N. & NOMOTO, A. (1984). Complete nucleotide sequences of all three poliovirus serotype genomes: implication for genetic relationship, gene function and antigenic determinants. Journal of Molecular Biology 174, 561-585.

TOYODA, H., NICKLIN, M. J. H., MURRAY, M. G., ANDERSON, C. W., DUNN, J. J., STUDIER, F. W. & WIMMER, E. (1986). A second virus-encoded proteinase involved in proteolytic processing of poliovirus polyprotein. Cell 45, 761 770.

TRACY, S., LIU, H-L. & CHAPMAN, N. M. (1985). Coxsackievirus B3 : primary structure of the 5' non-coding and capsid protein-coding regions of the genome. Virus Research 3, 263-270.

(Received 9 August 1988)