9
DNA Sequence-The Journal ofSequencing andMapping, Vol. 5, pp. 41-49 Reprints available directly from the publisher Photocopying permitted by license only 01 994 Harwood Academic Publishers GmbH Printed in the United States of America Multiple secondary plant product UDP- glucose glucosyltransferase genes expressed in cassava (Manihof escdenfa Crantz) cotyledons JANE HUGHES and MONICA A. HUGHES Department of Biochemistry and Genetics, Medical School, University of Newcastle upon Tyne, Newcastle upon Tyne NE2 4HH, UK Database Accession Nos. X77459 (pCGT1 ), X77460 (pCGT4), X77461 (pCGTZ), X77462 (pCGT5), X77463 (pCGTG), X77464 (pCGT7) Six different putative UDP-glucose glucosyltransferase clones were isolated from a cassava cotyledon cDNA library probed with an Acc I-Bgl II restriction fragment from a UDP-glucose flavonoid 3-0-glucosyltransferase from Anfirrhinum majus. The heterologous probe contained a glucosyltransferase consensus signature amino acid sequence which was also present in the cassava cDNA clones. Nucleotide and derived amino acid se- quences are presented for two of the clones. Northern analysis showed different patterns of expression for the six genes in de- veloping seedling tissues, indicating temporal and tissue-specific regulation. A comparative analysis was made of the six cassava clone derived amino acid sequences and other reported UDP- glucosyltransferase genes. Highly conserved residues in plant genes from three species allow redefinition of essential residues within the signature sequence for secondary plant product me- tabolism glucosyltransferase genes. KEY WORDS cassava, cDNA, cotyledon, flavonoids, UDP-glu- cose glucosyltransferase INTRODUCTION Cassava (Manihor esculenta Crantz) is a major tropi- cal root crop with an estimated world annual pro- duction of 150 million tonnes of fresh roots (Hershey, 1993). It is the major crop plant of tropi- cal Africa, with average African per capita con- Address for correspondence: Professor M.A. Hughes, Department of Biochemistry and Genetics, Medical School, University of Newcastle upon Tyne, Newcastle upon Tyne NE2 4HH UK. sumption at over 100 kg per annum (Hahn, 1989). The biology of cassava is seriously under-researched and there is considerable scope for the improve- ment of agronomic and quality characteristics. Cassava is cyanogenic, that is hydrocyanic acid (HCN) is released from all tissues following me- chanical damage. HCN release is brought about by the sequential action of a P-glucosidase and an a- hydroxynitrilase on two structurally related cyanoglucosides (linamarin and lotaustralin) (Poulton, 1990). These cyanogenic glucosides are synthesised in leaf tissue (including the cotyledons) and transported throughout the plant (Koch et a/., 1992). The cyanogenic glucosides are synthesised from two precursor amino acids (valine and isoleucine) to form two unstable cyanohydrins (2- hydroxy-2-methylpropionitrile and 2-hydroxy-2- methylbutyronitrile) which are glucosylated by a UDP-glucosyltransferase to produce the stable cyanoglucosides. Ungerminated seeds of cassava contain very small quantities of cyanoglucoside (1-1 0 nmolheed) but during germination rapid syn- thesis occurs in the hypogeal cotyledons so that after 10 days the seedlings contain about 8 pol of cyanoglucoside per seed (unpublished data). Given the rapid synthesis of cyanoglucosides in the cotyle- dons, the cyanogen ic U DP-glucose gl ucosyltrans- ferase is expected to be present in this tissue. Glycosylation of a number of secondary plant compounds, including flavonoids, steroidal alka- 41 Mitochondrial DNA Downloaded from informahealthcare.com by University of Toronto on 11/24/14 For personal use only.

Multiple secondary plant product UDP-glucose glucosyltransferase genes expressed in cassava (Manihot esculenta Crantz) cotyledons

Embed Size (px)

Citation preview

Page 1: Multiple secondary plant product UDP-glucose glucosyltransferase genes expressed in cassava (Manihot esculenta Crantz) cotyledons

DNA Sequence-The Journal ofSequencing andMapping, Vol. 5, pp. 41-49 Reprints available directly from the publisher Photocopying permitted by license only

0 1 994 Harwood Academic Publishers GmbH Printed in the United States of America

Multiple secondary plant product UDP- glucose glucosyltransferase genes expressed in cassava (Manihof escdenfa Crantz) cotyledons JANE HUGHES and MONICA A. HUGHES

Department of Biochemistry and Genetics, Medical School, University of Newcastle upon Tyne, Newcastle upon Tyne NE2 4HH, UK

Database Accession Nos. X77459 (pCGT1 ), X77460 (pCGT4), X77461 (pCGTZ), X77462 (pCGT5), X77463 (pCGTG), X77464 (pCGT7)

Six different putative UDP-glucose glucosyltransferase clones were isolated from a cassava cotyledon cDNA library probed with an Acc I-Bgl I I restriction fragment from a UDP-glucose flavonoid 3-0-glucosyltransferase from Anfirrhinum majus. The heterologous probe contained a glucosyltransferase consensus signature amino acid sequence which was also present in the cassava cDNA clones. Nucleotide and derived amino acid se- quences are presented for two of the clones. Northern analysis showed different patterns of expression for the six genes in de- veloping seedling tissues, indicating temporal and tissue-specific regulation. A comparative analysis was made of the six cassava clone derived amino acid sequences and other reported UDP- glucosyltransferase genes. Highly conserved residues in plant genes from three species allow redefinition of essential residues within the signature sequence for secondary plant product me- tabolism glucosyltransferase genes.

KEY WORDS cassava, cDNA, cotyledon, flavonoids, UDP-glu- cose glucosyltransferase

INTRODUCTION

Cassava (Manihor esculenta Crantz) is a major tropi- cal root crop with an estimated world annual pro- duction of 150 m i l l i on tonnes of fresh roots (Hershey, 1993). It i s the major crop plant of tropi- cal Africa, with average African per capita con-

Address for correspondence: Professor M.A. Hughes, Department of Biochemistry and Genetics, Medical School, University of Newcastle upon Tyne, Newcastle upon Tyne NE2 4HH UK.

sumption at over 100 kg per annum (Hahn, 1989). The biology of cassava is seriously under-researched and there is considerable scope for the improve- ment of agronomic and quality characteristics.

Cassava i s cyanogenic, that i s hydrocyanic acid (HCN) i s released from all tissues following me- chanical damage. HCN release is brought about by the sequential action of a P-glucosidase and an a- hydroxynitrilase on two structurally related cyanoglucosides ( l inamarin and lotaustralin) (Poulton, 1990). These cyanogenic glucosides are synthesised in leaf tissue (including the cotyledons) and transported throughout the plant (Koch et a/., 1992). The cyanogenic glucosides are synthesised from two precursor amino acids (val ine and isoleucine) to form two unstable cyanohydrins ( 2 - hydroxy-2-methylpropionitrile and 2-hydroxy-2- methylbutyronitrile) which are glucosylated by a UDP-glucosyltransferase to produce the stable cyanoglucosides. Ungerminated seeds of cassava contain very small quantities of cyanoglucoside (1-1 0 nmolheed) but during germination rapid syn- thesis occurs in the hypogeal cotyledons so that after 10 days the seedlings contain about 8 p o l of cyanoglucoside per seed (unpublished data). Given the rapid synthesis of cyanoglucosides in the cotyle- dons, the cyanogen ic U DP-gl ucose gl ucosyltrans- ferase i s expected to be present in this tissue.

Glycosylation of a number of secondary plant compounds, including flavonoids, steroidal alka-

41

Mito

chon

dria

l DN

A D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Tor

onto

on

11/2

4/14

For

pers

onal

use

onl

y.

Page 2: Multiple secondary plant product UDP-glucose glucosyltransferase genes expressed in cassava (Manihot esculenta Crantz) cotyledons

I. HUGHES AND M. A. HUGHES 42

loids and cyanohydriris, occurs at the end of their biosynthetic pathway (Sun and Hrazdina, 1991; Stapleton et a/., 1992; Reay and Conn, 1974). The most commonly used sugar is glucose and the reac- tion i s catalysed by a UDP-glucose. glucosyltrans- ferase to produce a stable water soluble compound that is often transported to the vacuole (Hrazdina and Wagner, 1985; F’oulton, 1990). A number of secondary plant compound UDP-glucosyltrans- ferases from different species have been studied [Hrazdina, 1988; Heilemann and Strack, 1991; lshikura et a/ . , 1993; U l lmann et a/., 1993; Vellekoop et a/., 1993). Two higher plant UDP-glu- cose glucosyltransferase genes have been cloned and their sequences reported in the literature: the UDP-glucose flavonoid glucosyltransferase specified by the bronze locus in maize (Ralston et a/., 1988; Furtek et a/., 1988) and a related gene from barley (Wise et dl . , 1990). Given the reported specificity of these enzymes and the large number of potential substrates, a wide range of different glucosyltrans- ferases may be expected to occur within a single plant species (Harborne, 1988). Here we report the use of a heterologous probe derived from a dicotyle- donary plant, Antirrhinurn rnajus (Martin et a/., 1991 ), to identify glucosyltransferase clones in a cassava cotyledon cDNA library. We present the nucleotide sequence of two cDNA clones and a comparative analysis of the derived amino acid se- quences of six cassava cDNA clones with other re- ported gtucosyltransferase genes.

RESULTS & DISCUSSION

cDNA Library Construction and Screening A hgt 10 cDNA library made from mRNA extracted from the cotyledons of 10 day old light grown cas- sava seedlings (Hughes et a/., 1992) was screened with a heterologous probe constructed from a 0.9 kb Acc I-Bgl II restriction fragment from a flavonoid glucosyltransferase from Antirrhinum rnajus kindly provided by Ur. C. Martin, John lnnes Institute (Martin et a/. , 1991). The nucleotide and derived amino acid sequences of the Antirrhinurn flavonoid glucosyltransferase clone used as a probe in the ini- tial library screen have not been published. The de- rived amino acid sequence of this fragment contains a region with high homology to a proposed gluco- syltransferase consensus signature sequence (PROSITE, Bairoch, 1991). Six independent clones

were selected and subcloned into the plasmid vec- tor pCem SZf(-), and confirmed by restriction map- ping and sequencing to be different from each other. The derived amino acid sequences of all of these clones contain the proposed glucosyltrans- ferase signature.

Southern blot analysis of genomic DNA from 54 cassava accessions, digested wi th Eco R 1 and probed with the s ix putative glucosyltransferase cDNA clones, gave different sized fragments and different levels of polymorphism for each clone, confirming that they represent different genes (per- sonal communication, Dr. H.R. Haysom).

Sequence Analysis Restriction maps of the six putative glucosyltrans- ferase clones from cassava are shown in Fig. 1. Four clones do not contain the complete coding se- quence but the position of the predicted stop codons for each clone in relation to the glucosyl- transferase signature sequence indicates the con- served 3’ location of this proposed UDP-binding site (Hundle, 1992). There i s a conserved Eco RI site within the PROSITE region in five of the six clones, and an Ndel site in two of the clones. None of the other restriction sites is conserved.

The nucleotide sequences of the two longest clones, pCGTl and pCGT5, are presented in Fig. 2, together with the derived amino acid sequences. pCGT5 is considered to be full length, as the tran- script size estimated from Northern blot analysis is 1.6 kb (Table 1 ). This clone contains a polyA tail of 54 residues and the open reading frame shown pre- dicts a protein of 487 amino acid residues with a calculated M, of 54,379. pCGT1 has an open read- ing frame that predicts a similar sized protein of 449 amino acid residues with a calculated M, of 50,280, but does not appear to contain all of the 3’ non-cod- ing region of the 1.6 kb transcript (Table 1).

It has been proposed that the final step i n the biosynthesis of flavonoid and cyanogenic gluco- sides, that i s the addition of glucose to the aglycone, takes place in the lumen of the endoplasmic reticu- lum prior to transport to the vacuole, and that the glucosyltransferases involved are loosely associated w i th the endoplasmic re t icu lum membrane (Hrazdina and Wagner, 1985). Sequestration into the endoplasmic reticulum depends upon an amino terminal signal sequence consisting of 13-30 amino acid residues with an uninterrupted stretch of 7 or 8 hydrophobic residues and a more polar carboxy ter- minal region that defines the cleavage site (von

Mito

chon

dria

l DN

A D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Tor

onto

on

11/2

4/14

For

pers

onal

use

onl

y.

Page 3: Multiple secondary plant product UDP-glucose glucosyltransferase genes expressed in cassava (Manihot esculenta Crantz) cotyledons

CASSAVA GLUCOSYLTRANSFERASE GENES 43

Hind 111 EcoRI Ndel pCGT1

pCGT2

pCGT4

X

Eco RI

~~ *

EcoRI EcoRI

Pstl EcoRI I

kb

1.4

1.1

0.8

Nco I Nsil Sspl Kpnl Eco RI ssp I I 1 1 1

X * n n 1.7 pCGT5 -I

pCGT6

pCGT7

Nsi I Srp I Eco RI Nde I Nde I * n 1.3

Pst I Barn HI Kpn 1 ssp I 1 .o *

Figure 1 ipCGTl and pCGT5). *: stop codon. A: polyA tail. The PROSITE glucosyltransferase signature region is underlined.

Restriction site maps of cassava glucosyltransferase cDNA clones. pCGTl-7: cassava cDNA clones. x: first methionine codon

Heijne, 1988). Hydrophobicity plots (Kyte and Doolittle, 1982) and sequence analysis of the de- duced N-terminal amino acids suggest that the pre- dicted proteins of pCGTl and pCGT5 may have amino terminal domains with the properties of a sig- nal sequence. The potential cleavage sites are be- tween cysteine (1 8) and histidine (1 9) for pCGTl, and leucine (29) and glycine 130) for pCCT5.

Comparative Analysis of Derived Amino Acid Sequences The PROSITE (Bairoch, 1991 ) UDP-glucosyltrans- ferase signature sequence i s based upon a con- served domain of 44 amino acid residues located in the C-terminal region of three cloned genes, namely a UDP-glucose flavonoid 3-0-glucosyltransferase from Zea mays (Furtek et a/., 19881, a mammalian U DP-gl ucuronosy ltransferase (Dutton, 1 980), and an ecdysteroid UDP-glucose glucosyltransferase from the baculovirus, Autographa ca l i forn ica (O’Reilly and Miller, 1990). In addition to other mammalian glucuronosyltransferase clones, two fur- ther sequences containing the glucosyltransferase signature are recorded on the Swiss-Prot and NBRF-

PIR protein sequence databases, namely, a second plant flavonoid 3-O-glucosyltransferase from barley (Wise et a/., 1990) and a zeaxanthin glucosyltrans- ferase from the non-photosynthetic bacterium, Erwinia herbicola (Hundle et a/., 1992). The agly- cone substrates of this group of glucosyltransferase enzymes are diverse and the glycosylated products include anthocyanidins, the insect moulting hor- mone ecdysone, the steroid hormone bilirubin and the carotenoid pigment zeaxanthin. A factor com- mon to all the enzymes is that none is involved in primary metabolism.

The proposed PROSITE UDP-glucosyltransferase signature sequence is shown in Fig. 3 together with the homologous region of the derived amino acid sequences of the six putative glucosyltransferase clones from cassava. The equivalent region from the amino acid sequences of the flavonoid 3 - 0 gluco- syltransferase genes from maize and barley are in- cluded for comparison. Identical and equivalent amino acid residues are indicated in bold type. Of the twenty-three residues specified by Bairoch (7 991), seventeen correspond exactly with the plant gene sequences (=) and a further three contain a sin-

Mito

chon

dria

l DN

A D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Tor

onto

on

11/2

4/14

For

pers

onal

use

onl

y.

Page 4: Multiple secondary plant product UDP-glucose glucosyltransferase genes expressed in cassava (Manihot esculenta Crantz) cotyledons

59

EIT

R~

C

CGC

16:

cnc

nsT

CIC TC

C ni

c nc

? &i

n cn

nic

TTC

nnT n

R1 T

CI sin

STT

ncc

LS

LS

RC

HS

LS

IT

VL

IF

NN

SV

VT

119

TCC

nnn 6

11 C

AT n

nc TA

T 61

1 ~

RT

ic

i cnG

nii

~C

T icc

icc T

CC n

ni CG

T ci

c C

~A

TIC

~~

SK

VH

NY

VD

SO

I~

SS

SN

RL

RF

179

All

111

C16

CCC

hC(L

6111

6116

llCl 6

GL 1

111

1Cl 1

\61

1lC

ICl 1

Cl 1

16 A

ll 66

6 hh

11 C

RG

5S

l~

LP

RD

El

GI

SS

FS

SL

IC

KO

239

nnn c

cc cn

i GTT

nnn

snn

TCT

STS

nTG

nnG

nic

nci G

nG T

IT G

~T

icn

1161

611

GnG

TCG

75

I( P

H V

K E

S V

n K

IT

f F

6S

S V

L S

299

CCT

C66

TlG

616

661

TIC

AT1

GTT

SAT

RTG

Tic

T6C

RCP

GCG

RTG

RTA

GbT

616

SCG

RAT

PS

PR

LV

~~

F

I V

D~

F

c T

nn

i~

vi

~

359

6nn

111

GST

STT

ccn

ICI in

c AT

^ IN in

c nc

6 rcG

661

SCA

Gci

111

CIC nn

i TIC

AT&

HS

E

F 6

v P

SV

I

F v

T s

G~

AF

LN

F

n

419

CTT

cni G

IG cn

A nn

6 n

il cn

i Gnc

GR

~ ~n

n

nni T

TT n

Rc cc

c nc

i GRC

TTC

nnc

GCG

icn

IJ

SL

HV

QK

I

HD

E E

N F

H P

I I

F n

ns

479

~R

T

661

6116

Tin

cnn

STT

cc6

661

Tin

GTA

nic

TC

~

TTT

CCT

TCT

nnG G

CT R

TG C

CT n

ci

15

SD

6E

LO

VP

GL

VN

SF

PS

Kl

HP

T

539

ccn

nu 11

6 AC

C nn

n cn

n 16

s in

ccc

CCT

cTn

cii G

nn nn

T nc

n ns

n n6

n in

c GC

A 6n

n I

~S

~I

LS

KO

Y~

PP

LL

EN

TR

RY

GE

599

GCT

nn

~

EST

sii n

in n

in n

ni nc

s TT

C TIC

GAG

ciG

Gnn

TCC

cni

GCS

nil s

nri T

CT T

TC

I~

S~

K~

V

I

IN

I

F

F E

LE

SH

AI

E

SF

659

nnn E

IC CC

I cc

n nic

inc

ccc

sin

6sn

ccc

nic i

in G

nc 6

16 n

66 ic

n nn

i s6n

n6n

nnc

21

5K

DP

PI

VP

V6

PI

LD

VR

SW

ER

N

719

nci n

ni cn

n sn

n ni

c RT

G cn

n 16

s CIT

6ni

~n

i

cnn

CCT

ccn

icn ICC

Gi

n 61

s ITC

ii

n Z

SS

TN

OE

II

OY

LD

DO

PP

SS

VV

TL

839

6hS

6Rl

A61

GGG

CAT

C6A

Tic

TTA

TGG

TCT

CIA

GCC

GK

CAC

C66

6CG

CC6

6ET

111

CTA

27

5E

DS

EH

RF

LY

SL

ID

HR

IP

6F

l

899

6AL

1C1

CCG

A61

GLC

lAT

GAG

GA1

CIA

CIA

6M

6lC

1IA

CCl

6AA

6SA

T1C

116

6AA

A61

2

95

ES

PS

DV

ED

LQ

EV

LP

EG

FL

ER

1019

6C

C hC

C 66

4 661

1 TT

A 61

T TC

T CA

C A6

1 66

A T6

E AA

T TC

T RT

A TT

A 6A

6 R6

C AT

A 16

6 11

1 3

35

AT

6G

LV

SH

SG

YH

SI

LE

SI

YF

1079

~

CA

EIC

CCA

611

6cc

Acn

166

ccn

nis

Tni

~C

A

6nn

cnc

cAn

TIC An

1 6c

c TT

T cn

n A1

6 ~

~~

GV

PV

~T

WP

N~

~E

OO

FW

~F

Q~

1139

GI

G hi

1 GA

G TT

G 66

A TT

A GC

ll GT

1 GR

A A1

1 RR

6 Ri

G GR

T Th

l R6

R AR

T 6R

C A6

1 G6

A 6R

I ~

~~

VI

EL

GI

AV

II

K~

DV

~~

DS

~E

1259

116

11 AS

G RA

G A6

6 6T

G M

G GA

G AT

G R6

1 CA

I AM

A6C

R6A

661

6CT

TlA

1116

611

6 6G

T 66

4 ~

~S

RR

KK

VK

E~

SE

KS

RG

RL

~E

C~

1319

TC

T Tc

n Tn

c TG

T TS

G Ti

n GA

T Rn

T CT

R nrc

nnn

Gni

nTG

m nn

n IR

C nc

n 611

11 Gn

i nn

i 43

5 s

s v

c w

L D

H L

I K

D n

I

K 8

Figu

re 2

N

ucle

otid

e an

d de

rived

am

ino

acid

seq

uenc

es o

f (a)

pC

CT

l an

d (b

) pC

CT

5. T

he P

RO

SITE

glu

cosy

ltran

sfer

ase

sign

atur

e se

quen

ce is

und

erlin

ed.

Mito

chon

dria

l DN

A D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Tor

onto

on

11/2

4/14

For

pers

onal

use

onl

y.

Page 5: Multiple secondary plant product UDP-glucose glucosyltransferase genes expressed in cassava (Manihot esculenta Crantz) cotyledons

(b) n

TIT ccn

ci6

cii

CCT

CTT

nni i

ic nT

c GCC

AN

6611

n6c

nci

6ni

cin nn

c icn

nn6

59

ccc cn

r nrn

cin T

in c

in ic

n n~

i cci 6

6c i

ic 66

11 cn

c cic

nrc cc

n cri

cic

6nn

crc

119

EM

nnn

ccc n

in CI

T nc

n CIC

icc nn

c rrc

Gni

6rc

nci

nrn IIC

1116

616

661

icc Gn

c

179

ncA T

CA E

CC G

ci G

rin c

ci cn

n GTT

CTC

CEI ic

n GC

C ni

6 nc

i ccn

nnn

crc i6

c MA

nT

c ~

OT

SA

~E

PO

VC

RS

~~

TP

KC

CE

I

239

nIc cn

n cic

ccn

ccn c

ci An

c nT

i rcc

icc c

ic n

ic 6n

c ccn

Grin

6cc

ncc

6in

T~

T ncc

~O

IO

LP

PP

~I

SC

LI

DP

E~

TV

C~

299

CEI

cii i

n 6i

i ii

6 n

it nC

n Gr

in ni

c nsG

CCR

GCT

TTC

c66

1x6

6cn

sin ICC

~C

T cic

~

O~

LF

VL

~R

EI

RP

~T

RI

~V

SI

I

359

nn6

TTT

crin

cc6

GCII

GCC

nin

nii s

ic ~

AC

CT

C iii

~6

n ncr G

nn TC

T cis

GAG

6in

~C

T

~~

OK

FR

PA

AI

IV

OC

F~

TC

SL

~V

~

419

Ann

~n

n

CIT G

~C

nic

Gcn

nnn

ini G

TG in

c ni

b GC

T ic

i nni

GCA

TGG

TIT Ti

n 6c

i CT

T I

~O

KC

L~

I~

NY

VY

I~

SN

~W

FL

LI

419

nci n

in in

i Gin

ccc

nii c

rn 6

ni n

nn 6

16 6

16 r

an G

sn cn

6 iii

GTT

cii c

nG n

nG 61

16

I M

GS

TD

LN

SK

~O

PN

IV

LL

SS

P6

LG

HL

lP

VL

EL

30

6 11

RI

VT

LC

N F

D V

T1

f f!

V 6

S 0

l~

Ol

IY

VP

IL

DK

EV

EG

Ef

V1

0I

IE

S39

CCC

1116

Mil

AT1

CCT

661

T6C

b66

CC6

ST1

C6G

6CC

GM

G46

614

61C

GST

CCT

AT6

CTG

17

0P

HK

I P

CC

RP

VR

TE

EV

VD

PH

L

599

cnc

cu ncn

nnr

cnn

cnn

ini i

cc 60

6 in

i TTT

CGC

Tin

66i

nic

6116

nrc cc

n nc

n ~C

T

659

snc GI al

a iIn

ni6

nac

nc6

16c san

6c1

CTT

can c

cn nc

n nc

n ITC

GC

~ sci i

rG n

6n

21

0~

6

IL

MN

TW

E~

LE

P i i

F 6

nt

~

719

ELI

GIG

nnri

iic CIG

EN

CCA

EM s

ci nn

6 si

n CC

G 61

1 iii

CCG

n1i s

rii c

ci ci

G AC

G

779

n~n

cn

6 EC

C 66

n cc

6 TG

C 6s

i icn

nni i

6i 6A

6 rin

crc E

AI is6

Tin

GAC

cnn c

nn cc

c ~

SO

RO

~C

PC

GS

~C

E~

~D

W~

D~

OP

190

D R

1

N 0

P r

'S

E

Y F

R 1

6 I

E I

P 1

A

23

OD

VK

FL

GR

Vn

KV

PV

fP

IG

PL

R

899

hlC

CIE

Cll

6CT

166

CCC

CTl C

AE C

EC M

C C

IC C

AE I

CC 1

11 4

11 I

CE C

IC EI

T CC

C C

M

29

01

E

L~

W~

LE

RS

~~

RF

IW

VV

RP

959

ccc n

cc 6i

n nn

c ncn

CCA

MI

Gcn

CCI iii

111

nci c

nn 6

66 6

~ S

C~

6cn

cni 6

nc n

if

~~

OP

~V

II

TG

D~

~F

FT

PC

DE

~D

D~

1019

TC

A 66

6 TA

C TI

C CC

T 64

6 GG

6 TI

C CT

G AC

C RE

6 A

ll C

A6 A

AC 6

16 6

66 T

IC 6

16 E

TC C

CA

~3

OS

6Y

FP

E6

fL

11

10

~V

61

VV

P

1019

cnn

KG

n~

c ccn cn

n nic

cnc n

ic RT

G n6

c cn

r ccn

icn

616

6~

n

6111

TTT

irn

rcn cn

c

1139

i6

i 6ci

16s

nni

TCT

tin T

TG 6

116 A

N nic

ncn G

CA 6

61 6

16 cc

c nir

nir 6

c6 1

66 cc

n 3

70

c~

~n

sv

tc

s1

T

~G

VP

I

I

nn

p

~~

OI

Y~

EQ

R~

HA

T~

~T

IE

L~

V~

V

1259

A

G~

ccn

nnG

nni T

in CC

G GC

G An

n 6n

n 6i

n CT

G nn

c R

~C

G

~G

Gn

G ni

n 611

6 ns

6 ni

6 ni

i ~

~O

RP

II

NL

P~

K

EV

VW

RE

E I

E~

HI

35

0P

WS

PQ

IH

IM

SH

PS

VC

VF

LS

H

1199

Ri

ll TA

T GC

T GA

S Ch

G 46

6 AT

G Ah

1 6C

G AC

G Cl

G IT

6 AC

6 6R

6 GA

G CT

b 66

C 61

1 6C

k 61

G

1559

ni

n ri

n ni

c in

i iic

ini c

ni TC

A ic

i GTC

TAT

611

TTC

iin

ins

nnn

nni n

in nn

6 c4

6

P

Ln

Figu

re 2

N

ucle

otid

e an

d de

rived

am

ino

acid

seq

uenc

es of

(a)

pC

CT

l and

(b) p

CC

T5. T

he P

RO

SITE

glu

cosy

hran

sfer

ase s

igna

ture

sequ

ence

is u

nder

lined

Mito

chon

dria

l DN

A D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Tor

onto

on

11/2

4/14

For

pers

onal

use

onl

y.

Page 6: Multiple secondary plant product UDP-glucose glucosyltransferase genes expressed in cassava (Manihot esculenta Crantz) cotyledons

46 J. HUGHES AND M. A. HUGHES

Table 1 dons

Summary of putative UDP-glucose glucosyltransferase cDNA clones from cassava cotyle-

Expression

Length Transcript size Seedlings

Clone kb kb Cotyledon Hypocotyl Root Leaves

pCGT1 1.4 1 .b + - + -

pCGT4 0.8 1.6 (+I (4 (+) (+I pCGT5 1.7 1.6 (+) (4 (-) (4 pCGT6 1.3 1.5 + - - + pCGT7 1 .o 1.7 + + + +

pCGT2 1.1 1.6 + - + +

Rrackets indicate (OW levels of transcript

pCGTl Pca2 pc-4 pcoT5

pcGT7 pCGT6

zm Hv

PROSITE

PSPG

l0LPQVAVLAH PASGGLVSHSQPlMS I LESIWFOVPVA'IWPMY~ WQVAVLAEPAI QG FVSECGRQNSVLPSLWQ ATWPMYIWQ WSPQVLILSBPAI OAF F T H C m S TLEGI SAGVPIVACPLFAEQ ~PQIH~EPSVdVFLSHCOPONSVIgSf TAQVPI IAWPIYAEQ OQAPQVAILEHPAIOGFVSEC~SILESIWFSVPSATWPLYATLO WLPQVEILEEAALQVFVTBCGPQNS ILESIV-I C R P F m Q

WXXQXXZLXHXXXXAF'LSXSGXXSXXXSLXXXLPLXXXPLLSDQ I I T T T T I I 1 IITE 7 1 V A A A AV V V W A iuI M G G G GM M M MMG

FF

WSPQIXILXHPSXGXF%SHXGWNSILESLXXSVPIXXXPLYADQ A V VM AA LVT A V M G I G V I FGE

F T V M M F

Figure 3 Comparison of derived amino acid sequences of eight plant glucosyltransferase genes with the proposed PROSITE glucosyltrans- ferasc signature sequence. Identical and equivalent amino acid residues are shown in bold type. PROSITE sequence comparison: conserved speciiied residues (=, +); conserved unspecified residues (:), pCTCl-7: cassava cDNA clones. Zm. flavonoid 3-0-glucosyltransferase from Zed mays (Furtek eta/., 1988). Hv: flavonoid 3-0-glucosyltransferase from Hordeum vulgare (Wise eta/., 1990). PROSITE: proposed gluco- 5vhransterase signature sequence (Bairoch, 1991 ), PSPG: consensus sequence for plant secondary product glucosyltransferase genes.

gle mismatch (+). Twelve of the twenty-one unspec- iiied residues (X) also have identical or equivalent residues in seven or eight of the clones in Fig. 3 (:). Only two of the remaining twelve residues show no dement o f amino acid conservation. Amino acid \equence homology between these eight plant clones decreases rapidly on both the N and C termi-

nus sides of this highly conserved region. The UDP- glucosyltransferase signature sequences of the 3 non-plant genes (Dutton, 1980; Hundle et a/., 1992; O'Reilly and Miller, 19901, are not conserved to the same extent as the plant genes. A modified PROSITE UDP-glucosyltransferase signature 5e- quence (PSPG) i s proposed for secondary plant

Mito

chon

dria

l DN

A D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Tor

onto

on

11/2

4/14

For

pers

onal

use

onl

y.

Page 7: Multiple secondary plant product UDP-glucose glucosyltransferase genes expressed in cassava (Manihot esculenta Crantz) cotyledons

CASSAVA GLUCOSYLTRANSFERASE GENES 47

product metabolism (Fig. 3). This sequence contains fourteen residues with identity compared with ten in the original PROSITE sequence and includes four conserved prolines.

Homology within the group of six cassava clones, and between these genes and the f i v e other recorded genes containing the glucosyltransferase signature was further investigated with PROSIS (Pharmacia) and MACAW (Schuler et a/., 1991 ) soft- ware. Homology between the plant and non-plant genes, confined primarily to the glucosyltransferase signature region, is confirmed by the more powerful amino acid sequence data analysis programme MACAW, which compares multiple sequences and defines regions of homology at different levels of stringency (data not shown). Fig. 4 shows the rela- tive similarity between each pair of sequences (in- cluding regions outside the signature sequence) based upon the percentage homology score deter- mined by the PROSIS homology search programme. A high level of homology i s found between the maize and barley flavonoid glucosyltransferase genes (73%). pCGT7 is more similar (43% and 42%) to these monocotyledon flavonoid biosynthesis genes than to the other five cassava clones and may therefore also represent a flavonoid glucosyltrans- ferase gene. Three of the cassava clones, pCGT1, pCGT2 and pCGT6 form another closely related group (60-67%). Partial amino acid sequences from potato solanidine UDP-glucose glucosyltransferase reported by Stapleton et a/ . (1992) did not contain

pCGTl pCGT2 pCGT4 pCGTS

pCGTl - 67 33 29 35 34 pCGT2 -

39 pCGT4 - pCGT5 - pCGT6 pCGT7 Zm H v Ac R l Eh

the PROS ITE glucos y I transferase signature sequence and no significant homology was found between these sequences and the cassava glucosyltransferase clone derived amino acid sequences.

Northern Analysis of Expression Northern blots prepared from total RNA extracted from cassava seedling cotyledons, hypocotyls and roots at 5 defined developmental stages were probed with the putative cassava glucosyltransferase clones. Stage 0 is the seed plus newly emerged radi- cle, at stage 1 the cotyledons are still enclosed in the seed coat but this is split, at stage 2 the cotyle- dons are free of the seed coat, at stage 3 the seedling is morphologically similar to stage 2 but has green cotyledons following transfer to the light and at stage 4 the seedling is approximately 10 days old and a small true leaf is present. Three distinct patterns of expression were found (Fig. 5). pCGT1 and pCGT2 have maximum expression in stage 2 cotyledons, with low levels in hypocotyls and in- creasing levels in roots throughout this period of de- velopment. pCGT6 is expressed primarily in stage 3 and stage 4 cotyledons with very low levels in hypocotyts and no measurable expression in roots. pCGT7 is expressed in all tissues at uniform levels at all stages of development (data not shown). pCGT4 and pCGT5 appear to represent rare transcripts compared with the other four clones because sig- nals on the Northern blots were always low. Further, three of the genes, pCGTl, pCGT2 and

pCGT6 pCGT7

60 31 60 29 31 35 35 32

35 - -

Zm Hv Ac R l Eh

33 43 23 18 25 35 36 23 30 28 32 31 26 27 38 34 32 29 26 19 26 31 22 25 17 43 42 24 28 21 - 73 29 30 26

- 26 24 37 - 23 29

- 25 -

Figure 4 Pairwise sequence comparison of glucosyltransferase cDNA clone derived amino acid sequences (PROSIS, YO homology). pCGT1-7: putative UDP-glucose glucosyltransferase clones from Manihot esculenta. Zrn: flavonoid 3-0-glucosyltransferase from Zed mays (Furtek et al., 1988). Hv: flavonoid 3-0-glucosultransferase from Hordeum vulgare (Wise et a/., 1990). Ac: ecdysteroid UDP-glucose glucosyltransferase from Autographa californica (O’Reilly and Miller, 1990). R1: UDP-glucuronosyltransferase from rat liver (Mackenzie, 1986). Eh: zeaxanthin glucosyltransferase from Erwinia herbicola (Hundle, 1992).

Mito

chon

dria

l DN

A D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Tor

onto

on

11/2

4/14

For

pers

onal

use

onl

y.

Page 8: Multiple secondary plant product UDP-glucose glucosyltransferase genes expressed in cassava (Manihot esculenta Crantz) cotyledons

48 J. HUGHES AND M. A. HUGHES

kb 0 1C 1H 1R 2C 2H 2R 3C 3H 3R 4C 4H 4R Probe

1.6 pCGT2

* pCGT6 (iic 1.5

Figure 5 Northern blot analysis of total RNA from developing cassava seedling tissues. Key to wells: Developmental stages; 0, radicle emerged. 1, cotyledons enclosed in split seed coat. 2, cotyledons emerged from seed coat. 3, as stage 2, cotyledons green following 24 hrs in light. 4, first true leaf present, approximately ten days after germination. The seedlings were transferred from the dark to 12 hour light conditions between stages 2 and 3. T: total seedling. C: cotyledon. H: hypocotyl. R: root.

pCGT6, were isolated more than once during the Ii- brary screens. Northern blot analysis of total RNA from young cassava leaf tissue showed that pCGT2, pCCT4, pCCT6 and pCGT7 are expressed in leaves whereas pCGT1 and pCGT5 are not (data not shown). Kuhasek ef a/. (1 992) showed that enzymes involved in flavonoid biosynthetic pathways i n Arcdidopsis are coordinately regulated by a devel- opmental timing mechanism during germination. Northern blot analysis of the putative glucosyltrans- terase genes reported here suggests both temporal and tissue-specific regulation of expression in devel- oping cassava seedlings.

The cyanogenic glucosides, linamarin and lotaus- tralin, are not synthesised in seedling roots (Koch et a/., 19923. pCGTl , pCGT2 and pCGT7 can therefore be eliminated as cyanogenic glucosyltransferase clones because they are expressed in roots. The low level of expression of pCGT4 and pCGT5 also make i t un l i ke ly that these clones represent the cyanogenic enzyme, which has a high level of activ- ity in seedling cotyledons. It remains possible that pCGT6 is involved in cyanogenesis, since it has high levels of expression in cotyledons, which are the pri- mary site of cyanoglucoside synthesis in seedlings.

'The expression data determined by Northern blot analysis is summarised in Table 1 together with de- tails of cDNA clone and transcript sizes. The multi- plicity of glucosyltransferase genes with different developmental profiles expressed in a single tissue icotyledons) of young cassava seedlings demon- strates the complexity of this group of enzymes and suggests the production of individual glucosyltrans- ierase proteins with specific functions in the metab- olism of secondary plant products.

MATERIALS AND METHODS

Plant material Cassava seeds from the plant CM1223-11 were supplied by Dr. C. Hershey, CIAT, Cali, Colombia, and grown as described in Hughes et a/., 1992.

Selection of cDNA clones A cDNA library was constructed in the hGTlO vector using Not I/Eco RI adaptors, with mRNA extracted from cotyledons as de- scribed in Hughes et al., 1992. The library was initially screened with a heterologous probe derived from a 0.9 kb Acc I-Bgl II re- striction fragment from a flavonoid 3-0-glucosyltransferase cDNA clone from Antirrhiniurn majus (Martin et a/., 1991~. A second screen was carried out using a putative 1.1 kb cassava glucosyltransferase cDNA clone selected during the first screen. Probes were radiolabelled with a Random Primed DNA Labelling Kit (Boehringer-Mannheim). Standard procedures were used to prepare replica filters and Southern blots (Samhrook et a/., 1989). Filters were washed at 60°C at low stringency: twice tor ten min- utes in 4X SSC, 0.1% SDS and twice for ten minutes in 2 X SSC, 0.1% SDS (20X SSC is 0.3M sodium citrate, 3M sodium chloride pH 7.0). Selected cDNA inserts from recombinant phage were subcloned into the Not I site of the plasmid vector pGEM 5ZF(--) (Promega).

DNA sequencing Both strands of the cDNA clones were independently sequenced by the dideoxy chain termination method (Sanger, 1977) using a Sequenase Version 2 Kit (United States Biochemical Corporation). Double-stranded DNA was sequenced directly from the pCEM SZf(-) plasmid using standard universal forward and reverse primers (USB). Overlapping deletions were generated where possible using a Nested Deletion K i t (Pharrnacia-LKB), Subcloned restriction fragments or specific synthetic 17-mer oligonucleotide primers were used where nested deletions were unobtainable.

Computer analysis DNA sequence data was analysed with DNASIS software from Pharmacia. Analysis of the derived amino acid sequences was performed with the following programmes: PROSIS (Pharmacia), PROSITE (Bairoch, 1991), MACAW [Schuler ef a/., 1991). Homology searches were carried out with the CenBank and EMBL DNA sequence databases, and the NBRF-PIR and Swiss Prot protein sequence databases (May, 1992).

Mito

chon

dria

l DN

A D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Tor

onto

on

11/2

4/14

For

pers

onal

use

onl

y.

Page 9: Multiple secondary plant product UDP-glucose glucosyltransferase genes expressed in cassava (Manihot esculenta Crantz) cotyledons

49 CASSAVA GLUCOSYLTRANSFERASE GENES

Northern blot analysis of RNA Total RNA was extracted from plant tissues by the guanidine thio- cyanate method of Broglie et a/. (1984). 10 pg of total RNA per sample was separated on 1.5% agarose gels in the presence of formaldehyde, then transferred to Hybond N membranes (Amersham). Northern blotting and hybridisation were carried out by standard procedures (Sambrook eta/., 1989). Probes were prepared from cDNA clones as above. Filters were washed at 42°C at high stringency: twice for ten minutes in 2X SSPE, 0.1% SDS, once for fifteen minutes in 1X SSPE, 0.1% SDS, and twice for ten minutes in 0.1 X SSPE, 0.1 % SDS (20X SSPE is 3.6M NaCI, 0.2M NaP04 pH7.7, 0.02M EDTA).

ACKNOWLEDGEMENTS

We wish to thank Kathleen Kelly and Martin Fletcher for technical assistance.

Dr. J . Hughes i s supported by the EC-funded Casanova Project, grant no. ECSTD3 TS3*-CT9Z- 01 08.

(Received 10th lanuary 1994)

REFERENCES

Bairoch, A. (1991). PROSITE: a dictionary of sites and patterns in proteins. Nucl. Acid Res. 19, 2241-2245.

Bro lie, R., Coruzzi, G., Keith, B. and Chua, N-H. (1984). hfolecular biology of C4 photosynthesis in Zea mays: differen- tial localization of proteins and mRNAs in two leaf cell types. Plant Mol. Eiol. 3, 4311144.

Dutton, G.T. (1980). In Glucuronidation of drugs and other com- pounds, Dutton, G.]. (ed.)l-78, CRC Press, Boca Raton.

Furtek, D., Schiefelbein, J.W., Johnston, F. and Nelson, O.E., Jr. (1 988). Sequence comparisons of three wild-type Bronze1 al- leles from Zed mays. Plant Mol. Eiol. l l, 473481.

Hahn, S.K. (1 989). An overview of African traditional cassava processing and utilization. Outlook Agric. 18, 11 0-1 18.

Harbourne, J.B. (ed.) (1988). The Flavonoids, Advances in Research since 1980. Chapman Hall, London.

Heilemann, J . and Strack, D. (1 991). Flavonol glucosyltransferase from Norway Spruce needles. Phytochem. 30, 1773-1 776.

Hershey, C.H. (1 993). Cassava Manihot esculenta Crantz. In Genetic Improvement of Vegetable Crops, (eds.) G . Kalloo and 8.0 . Bergh. Chpt. 46, pp. 669-691, Pergamon Press, N.Y.

Hrazdina, G. and Wagner, G.J. (1 985). Compartmentation of plant phenolic compounds: sites of synthesis and accumula- tion. Annu. Proc. Phytochem. SOC. Europe 25, 120-1 33,

Hrazdina, C. (1988). Purification and properties of a UDP glu- cose: flavonoid 3-0-glucosyltransferase from Hippeastrum petals. Biochem. Biophys. Acta 955, 301-309.

Hughes, M.A., Brown, K., Pancoro, A., Murray, B.S., Oxtob , E. and Hughes, J. (1992). A molecular and biochemical anarysis of the structure of the cyanogenic P-glucosidase (linamarase) from cassava (Manihot esculenta Crantz). Arch. Biochem.

Hundle, B.S., OBrien, D.A., Alberti, M., Beyer, P. and Hearst, J.E. (1 992). Functional expression of zeaxanthin glucosyltrans-

Eiophys. 295, 2 73-2 79.

ferase from Erwinia herbicola and a proposed uridine diphos- phate binding site. Proc. Natl. Acad. Sci. USA 89, 9321-9325.

Ishikura, N., Yang, Z-Q. and Teramoto, S. (1993). UDP-D-glu- cose: flavonol 3-0-and 7-0-glucosyltransferases from young leaves of Paederia scandens var. maire;. Zeitschrifi fur naturfor C48,563-569.

Koch, B., Nielsen, V.S., Halkier, B.A., Olsen, C.E. and Moller, B.L. (1 992). The biosynthesis of cyanogenic glucosides in seedlings of cassava (Manihot esculenta Crantz). Arch. Biochern. Biophys. 292, 141-1 50.

Kubasek, W.L., Shirley, B.W., McKillop, A,, Goodman, H.M., Briggs, W. and Ausubel, F.M. (1 992). Regulation of flavonoid biosynthetic genes in germinating Arabidopsis seedlings. The Plant Cell4, 1229-1236.

Kyte, J. and Doolittle, R.F. (1 982). A simple method for displaying the hydropathic character of a protein. j . Mol. Biol. 157,

Martin, C., Prescott, A., Mackay, S., Bartlett, J. and Vrijlandt, E. (1 991 ). Control of anthocyanin biosynthesis in flowers of Antirrhinum majus. The Plant journal 1 (1 1, 37-49.

O’Reilly, D.R. and Miller, L.K. (1990). Regulation of expression of a Baculovirus ecdysteroid UDPglucosyltransferase gene. 1. Viro/. 64, 1321-1328.

Poulton, J.E. (1 990). Cyanogenesis in plants. Plant Physiol. 94,

Ralston, E.J., English, 1.). and Dooner, H.K. (1988). Sequence of three bronze alleles of maize and correlation with the genetic fine structure. Genetics 119, 185-1 97.

Reay, P.F. and Conn, E.E. (1974). The purification and properties of a UDP glucose: aldehyde cyanohydrin b-glucosyltransferase from Sorghum seedlings. J. BioLChem. 249, 5826-5830.

Sambrook, J., Fritsch, E.F. and Maniatis, T. (1 989). Molecular cloning: A laboratory manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.

Sanger, F., Nicklen, S. and Coulson, A.R. (1977). DNA sequenc- ing with chain-terminating inhibitors. Proc. Natl. Acad. Sci.

Schuler, G.D., Altschul, S.F. and Lipman, D.J. (1991). A work- bench for multiple alignment construction and analysis. Proteins Struct. Funct. Genet 9, 180-1 90.

Stapleton, A., Allen, P.V., Tao, H.P., Belknap, W.R. and Friedman, M. (1992). Partial amino acid sequence of potato solanidine UDP-glucose glucosyltransferase purified by new anion-exchange and size exclusion media. Prot. Expression & Purif. 3, 85-92.

Sun, Y. and Hrazdina, C. (1991). Isolation and characterization of a UDP glucose: flavonol 03-glucosyltransferase from illumi- nated red cabbage (Erassica oleracea cv Red Danish) seedlings. Plant Physiol. 95, 570-576.

Ullmann, P., Ury, A., Rimmele, D., Benveniste, P. and Bouvier- Nave, P. (1 993). UDP-glucose sterol P-D-glucosyltransferase, a plasma membrane-bound enzyme of plants: Enzymatic proper- ties and lipid dependence. Biochimie 75, 71 3-723.

Vellekoop. P., Lugones, L . and van Brederode, J . (1993). Purification of a UDP-glucose: flavone 7-0-glucosyltransferase from Silene latifolia using a specific interaction between the enzyme and phenyl-sepharose. Febs Letters 330, 36-40.

Von Heijne, G. (1 988). Transcending the impenetrable: how pro- teins come to te-rms w i th membranes. Eiochimica et Biophysica Acta 947, 307-333.

Wise, R.P., Rohde, W. and Salamini, F. (1990). Nucleotide se- quence of the bronze1 homologous gene from Hordeum vul- gare. Plant Mol. Biol. 14, 277-279.

105-1 32.

401 -405.

74, 5463-5467.

Mito

chon

dria

l DN

A D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsity

of

Tor

onto

on

11/2

4/14

For

pers

onal

use

onl

y.