14
1. Introduction 2. Brief overview of “protein-based” NMR methods used in drug discovery 3. Biological systems used for isotopic labeling 4. Advances in specific/selective isotope labeling techniques 5. In-cell NMR: pharmaceutical NMR applications for cellular biochemistry and drug delivery 6. Expert opinion Review Recent applications of isotopic labeling for protein NMR in drug discovery Hidekazu Hiroaki Nagoya University, Graduate School of Pharmaceutical Sciences, Nagoya, Japan Introduction: Nuclear magnetic resonance (NMR) applications in drug discov- ery are classified into two categories: ligand-based methods and protein- based methods. The latter is based on the observation of the 1 H- 15 N HSQC spectra of a protein with and without lead compounds. However, in order to take this strategy, isotopic labeling is an absolute necessity. Given that each 1 H- 15 N HSQC signal corresponds to a residue of the target protein, signal changes provide specific information on whether a compound will fit into a pocket. Thus, this protein-based method is particularly suitable for fragment- based approaches, such as “SAR-by-NMR” and “fragment-growing.” Alternatively, the information from a protein interface may be used to develop inhibitors for protein--protein interactions. Areas covered: This review discusses at the experimental procedures for preparing isotopically labeled protein and introduces selected topics on atom-specific and residue-selective isotope labeling, which may facilitate the development of PPI/PA inhibitors. Furthermore, the author reviews the recent applications of “in-cell” NMR spectroscopy, which is now considered as an important tool in drug delivery research. Expert opinion: Many recent advances in labeling methods have succeeded in expanding NMR’s potential for drug discovery. In addition to those methods, another new technique called “in-cell NMR” allows the observation of protein--ligand interactions inside living cells. In other words, “in-cell NMR” may become a pharmaceutical NMR technique for drug delivery. Keywords: amino acid type-selective labeling, fragment-based approach, in-cell NMR, inverse labeling, protein-based method, SAIL method Expert Opin. Drug Discov. (2013) 8(5):523-536 1. Introduction Nuclear magnetic resonance (NMR) spectroscopy has recently become an indis- pensable analytical method for drug discovery and drug design because it may pro- vide considerable information on molecular interactions as well as molecular structures at the atomic level. The number of NMR application methods dedicated to pharmaceutical sciences has increased drastically during the past decade due to the advances in methods and increased sensitivity of spectrometers. NMR applications in drug discovery are roughly classified into two categories [1-4]. The first one is “ligand-based methods,” which include T1 and T2 differences, transferred-NOE [5], SHAPES strategy [6], DOSY [7], affinity NMR [8], STD [9], Water-LOGSY [10], ILOE [11], and INPHARMA [12]. All these approaches focus on the differences in the NMR signals derived from ligands, which make it unnecessary to use isotopically labeled target protein samples. The second one is “protein-based methods” for which isotopic labeling is neces- sary. After introducing suitable stable isotopes ( 13 C, 15 N, or both), heteronuclear correlation spectra such as HSQC are measured. It is known that the NMR 10.1517/17460441.2013.779665 © 2013 Informa UK, Ltd. ISSN 1746-0441, e-ISSN 1746-045X 523 All rights reserved: reproduction in whole or in part not permitted Expert Opin. Drug Discov. Downloaded from informahealthcare.com by Universitat de Girona on 11/10/14 For personal use only.

Recent applications of isotopic labeling for protein NMR in drug discovery

Embed Size (px)

Citation preview

Page 1: Recent applications of isotopic labeling for protein NMR in drug discovery

1. Introduction

2. Brief overview of

“protein-based” NMR methods

used in drug discovery

3. Biological systems used for

isotopic labeling

4. Advances in specific/selective

isotope labeling techniques

5. In-cell NMR: pharmaceutical

NMR applications for cellular

biochemistry and drug delivery

6. Expert opinion

Review

Recent applications of isotopiclabeling for protein NMR in drugdiscoveryHidekazu HiroakiNagoya University, Graduate School of Pharmaceutical Sciences, Nagoya, Japan

Introduction: Nuclear magnetic resonance (NMR) applications in drug discov-

ery are classified into two categories: ligand-based methods and protein-

based methods. The latter is based on the observation of the 1H-15N HSQC

spectra of a protein with and without lead compounds. However, in order to

take this strategy, isotopic labeling is an absolute necessity. Given that each1H-15N HSQC signal corresponds to a residue of the target protein, signal

changes provide specific information on whether a compound will fit into a

pocket. Thus, this protein-based method is particularly suitable for fragment-

based approaches, such as “SAR-by-NMR” and “fragment-growing.”

Alternatively, the information from a protein interface may be used to

develop inhibitors for protein--protein interactions.

Areas covered: This review discusses at the experimental procedures for

preparing isotopically labeled protein and introduces selected topics on

atom-specific and residue-selective isotope labeling, which may facilitate

the development of PPI/PA inhibitors. Furthermore, the author reviews the

recent applications of “in-cell” NMR spectroscopy, which is now considered

as an important tool in drug delivery research.

Expert opinion: Many recent advances in labeling methods have succeeded in

expanding NMR’s potential for drug discovery. In addition to those methods,

another new technique called “in-cell NMR” allows the observation of

protein--ligand interactions inside living cells. In other words, “in-cell NMR”

may become a pharmaceutical NMR technique for drug delivery.

Keywords: amino acid type-selective labeling, fragment-based approach, in-cell NMR, inverse

labeling, protein-based method, SAIL method

Expert Opin. Drug Discov. (2013) 8(5):523-536

1. Introduction

Nuclear magnetic resonance (NMR) spectroscopy has recently become an indis-pensable analytical method for drug discovery and drug design because it may pro-vide considerable information on molecular interactions as well as molecularstructures at the atomic level. The number of NMR application methods dedicatedto pharmaceutical sciences has increased drastically during the past decade due tothe advances in methods and increased sensitivity of spectrometers.

NMR applications in drug discovery are roughly classified into twocategories [1-4]. The first one is “ligand-based methods,” which include T1 andT2 differences, transferred-NOE [5], SHAPES strategy [6], DOSY [7], affinityNMR [8], STD [9], Water-LOGSY [10], ILOE [11], and INPHARMA [12]. All theseapproaches focus on the differences in the NMR signals derived from ligands, whichmake it unnecessary to use isotopically labeled target protein samples.

The second one is “protein-based methods” for which isotopic labeling is neces-sary. After introducing suitable stable isotopes (13C, 15N, or both), heteronuclearcorrelation spectra such as HSQC are measured. It is known that the NMR

10.1517/17460441.2013.779665 © 2013 Informa UK, Ltd. ISSN 1746-0441, e-ISSN 1746-045X 523All rights reserved: reproduction in whole or in part not permitted

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 2: Recent applications of isotopic labeling for protein NMR in drug discovery

chemical shift is highly sensitive to the chemical and stericenvironment of an atom. HSQC spectra provide informationon whether a small molecule binds to a target protein andwhich residues in the target protein’s pocket interact withthe small molecule. This residue-specific or, sometimes,atomic level information can be efficiently utilized to furtherdrug design. Therefore, protein-based methods are used notonly for initial drug screening but also for validating the hitcompounds and subsequent step-by-step lead optimization.Another possible application is developing inhibitors of

protein--protein interactions (PPIs) and protein assemblies(PAs). In this post-genomic era, many intracellular proteinsin the human genome have been shown to participate incertain PPI networks, which are target candidates fornext-generation drug discovery. In addition, intrinsic fibrousproteins such as cytoskeletal components and membraneskeletons as well as pathogenically fiber-forming proteinssuch as a-synuclein and Ab(1-40/42) peptides should becometherapeutic targets for unmet medical needs. Because protein-based NMR methods provide considerable information onPPIs and PAs, advances in this area will realize a new horizonfor drug discovery of PPI/PA inhibitors.In this review, experimental procedures for preparing isoto-

pically labeled protein samples have been overviewed. Selectedtopics on atom-specific and residue-selective isotope labelinghave also been introduced, which may facilitate the develop-ment of PPI/PA inhibitors. In addition, recent applicationsof “in-cell” NMR spectroscopy, which is now considered asa new research tool in drug delivery, have been discussed.

2. Brief overview of “protein-based” NMRmethods used in drug discovery

2.1 Theoretical background and limitationsMost protein-based methods rely on two- or higher-dimensional NMR for heteronuclear chemical shift correla-tion. This is simply because 1H 1D NMR spectra of targetproteins are heavily overlapped thereby less informative.

This signal overlapping is solved by introduction of anotherdimension, either 15N or 13C. The natural abundance of15N and 13C are 0.4% and 1.1%, respectively. Thus, additionof other compounds does not interfere with the 1H-15N and1H-13C HSQC spectra of target proteins.

The methodologies for heteronuclear multidimensionalNMR of proteins were developed for high-resolutionstructural determinations. The typical steps in these studiesinclude the following: i) preparing a 15N-uniformly labeledprotein, ii) measuring 1H-15N HSQC, iii) making a go/no-godecision to continue the project, iv) choosing a labeling strat-egy, v) preparing either a 1H/15N/13C or 2H/15N/13C-labeledprotein based on the labeling strategy, vi) accumulating fulldatasets of 3D/4D NMR, vii) making sequential assignmentsfor backbone NH signals, viii) making nearly complete assign-ments for the side chain signals, ix) collecting NOEs and otherstructural restraints derived by NMR experiments as much aspossible, and x) calculating the solution structure that fulfillsall NMR-derived experimental data (Figure 1).

For modern drug discovery using NMR spectroscopy, theinformation obtained in step (vii) along with HSQC spectraof the target protein is used for its residue-specific sequentialassignments. The obtained 3D structure of the target proteinis also useful for further structure-guided drug design;however, this can be replaced with X-ray-derived orhomology-modeled structures. Hence, obtaining completesequential assignments of NH signals is not absolutely necessary.

Several limitations of these protein-based approaches shouldbe taken into consideration. First, any of the solution NMRtechniques are only applicable for relatively smaller proteins(i.e., molecular weights < 30 K). This limitation arises as aresult of the property of magnetization relaxation of slowlytumbling molecules, such as large proteins. Second, not onlythe theoretical molecular mass but also an increase in theapparent molecular mass of a protein may hamper the qualityof HSQC spectra. Nonspecific self-association of proteins isone of the major causes. Third, the presence of chemicalexchanges and equilibria between two or more states also ham-pers spectral quality. This may occur when a protein includesdynamic motions in its molecule. Fourth, NMR measure-ments are not highly sensitive, and a high concentrationprotein sample (~ 10 mg/ml) is required. For the latter threereasons, extensive optimization of a sample’s conditions isusually required; otherwise, informative and analyzable HSQCspectra cannot be obtained. In addition to these demerit, itshould be noted that the NMR-derived information ofdrug--protein interaction is rather qualitative than quantitative.Although Kd can be calculated with a saturation curve derivedfrom NMR titration experiments, it is only available forrelatively weak interaction such as sub-micromolar order.

2.2 Featured applications of the protein-based NMR

approach: NMR-assisted FBDDA trend in lead generation is the fragment-based approach.This approach is started to find one or several small molecular

Article highlights.

. Recent advances on HSQC-based NMR methods fordrug discovery as well as isotopic labeling methodsare reviewed.

. Not only Escherichia coli expression systems but alsoBrevibacillus, Kluyveromyces lactis, and Pichia pastoris,plant cells, insect cells, and mammalian cells are nowavailable for isotopic labeling.

. Amino acid-specific labeling, inverse labeling, andsegmental labeling techniques are used to simplify thespectra thereby focusing on protein pockets of interest.

. 13C-fractional labeling, alternating Ca labeling,methyl-specific labeling, and SAIL methods are used toexpand the limit for observing larger proteins.

This box summarizes key points contained in the article.

H. Hiroaki

524 Expert Opin. Drug Discov. (2013) 8(5)

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 3: Recent applications of isotopic labeling for protein NMR in drug discovery

weight compounds called fragments that weakly bind to thebinding site of a target protein. Two of these are selectedand linked together with an appropriate linker moiety,resulting in a high-affinity drug candidate (Figure 2A).Alternatively, one fragment molecule is modified to growstep-by-step, gaining further affinity until all pocket vacanciesare filled (Figure 2B). In many cases, both strategies are usedin combination.

NMR-assisted FBDD is a concept in which the informa-tion derived from a target protein in the presence of acompound is fed back to design and synthesize the nextmolecule. One of the pioneering efforts of this concept is“SAR-by-NMR” that was proposed by Shuker et al.(Figure 2A) [13]. First, 1H-15N HSQC spectra were recorded.The resonances around the mutual drug-binding pockets Aand B should be uniquely identified. Subsequently, byNMR titrating experiments, the first ligand that binds topocket A is identified using the HSQC signals surroundingthe pocket as a guide. This molecule (ligand a) is modifiedto form ligand b with higher affinity. Subsequently, in thepresence of ligand b, a compound (ligand c) that binds topocket B is screened and optimized to form ligand d. Finally,ligands b and d are connected by an appropriate linker moi-ety. With this approach, NMR signals of backbone amidesare used to monitor how well the fragment molecules fitinto the cavities. The potential of this method was firstdemonstrated by Abott’s group in discovering a nonpeptidicstromelysin inhibitor [14]. Recently, the same group has

reported the application of this method for selectiveBcl-2 inhibitor [15]. The original method only describes theapplication using 1H-15N HSQC, although a similarexperiment using 1H-13C HSQC with a 13C-labeled proteinhas also been considered.

3. Biological systems used for isotopiclabeling

3.1 Escherichia coli expression systemsSimilar to many other biotechnology applications, E. coliprotein expression systems are the first choice for the practicalpreparation of isotopically labeled protein samples [16,17]. Oneof the widely used approaches is to use a plasmid that carriesan inducible T7 RNA polymerase promoter to express thegene of interest, which is subsequently used to transformBL21(DE3) E. coli cells [18]. Alternatively, ptac/plac-promoter-driven or pCold (cold shock protein) [19] promoter-drivenexpression vectors with appropriate purification tags (glutathi-one-S-transferase, maltose-binding protein, and thioredoxin)are used.

In all cases, NMR minimal media containing 2 -- 4 g/l ofglucose (12C6- or 13C6-) as the sole carbon source and0.5 -- 1 g/l of ammonium chloride (14N- or 15N-) as thesole nitrogen source and supplemented with some vitamins,minerals, and nucleosides are used as the harvesting medium.Because the metabolic pathways of E. coli have been almostcompletely analyzed, many variations of selective/specific

Signal assignments restraints

Distancesdihedralangles

go/no-godecision

3D/4DNMR

15Nsample

15N/13Csamples

9.0(1H) 8.0 7.0 (ppm)

(ppm)

125

120

115

110

(15N)

Figure 1. Schematic drawing of the standard solution NMR protocol for protein structure determination (see text). Usually,

the first step is to obtain uniformly 15N-labeled protein samples, and go/no-go decision is made according to the quality of1H-15N HSQC spectra as a guide. After analyses of several 3D/4D-NMR data, full assignments of back-bone amide proton

resonances are achieved. Further structural restraints (inter-proton distances and dihedral angles) are accumulated and

subsequently used for the structure calculation step.

Recent applications of isotopic labeling for protein NMR in drug discovery

Expert Opin. Drug Discov. (2013) 8(5) 525

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 4: Recent applications of isotopic labeling for protein NMR in drug discovery

isotope labeling are now obtained using wild-type andgenetically modified E. coli strains (see Section 4).

3.2 Brevibacillus choshinensisA new prokaryotic protein-expressing host species has beenidentified. Brevibacillus choshinensis HPD31-SP3 is a gram-negative bacterium [20]. The origin of this strain was formerlyknown as Bacillus brevis, which was reclassified and renamedBrevibacillus. Related strains are well-known hosts for highlevel protein secretion and low extracellular proteaseactivity [21]. A B. choshinensis expression system was furtherdeveloped and fine-tuned by the Higeta Shoyu Co., a soysauce brewer in Choshi, Japan. Tanio et al. demonstratedthat 15N-labeled bFKBP12 was secreted into the medium [22]

and also recommended using a commercially available semi-synthetic medium (C. H. L. medium, Chlorella IndustryCo., Japan) rather than an M9-based medium because oflower protein production. The authors successfully expandedtheir method to amino acid type-specific (AATS) labeling [23].A remarkable feature of a Brevibacillus expression system liesin its protein secretion pathway. Protein samples that containdisulfide bonds may be available using this system.

3.3 Pichia pastoris and other yeastsBacterial protein expression systems have many advantages,such as the ease of molecular genetic analysis and plasmidhandling, faster growth rates, and lower media costs.Nevertheless, heterologous protein production often fails forseveral reasons, particularly when expressing eukaryotic andmammalian genes. This phenomenon can be explained bythe following reasons: i) differences in the architectures andtranslation speeds of ribosomes, ii) presence or absence ofsubcellular components, iii) differences in the numbersof molecular chaperones, and iv) the presence or absence ofactivity for post-translational modifications [24].So far, an economical expression system using eukaryotic

cells has been long-awaited. Yeast can be used for this pur-pose. For example, recently, Kamiya et al. demonstrated the

overproduction of 13C-labeled homogenous oligosaccharides,which may become another source for pharmaceuticalresearch [25]. In contrast to Saccharomyces cerevisiae, the mostbasic model organism, Schizosaccharomyces pombe appears tobe more suitable for heterologous protein production [26]. Nev-ertheless, to our best knowledge, no isotopic labeling applicationof Sc. pombe has been reported. Williopsis mrakii, a fermentingbudding yeast for Japanese sake, was used to express 15N-labeledWm toxin, although this protein was not heterologous [27].

A minimal medium for yeasts can be used to harvest cul-tures. Thus, 15N-labeling is cost-effective, whereas 13C-label-ing is possible, but expensive. This is because the yeastmedium contains 10 -- 20 times more glucose than theE. coli M9 medium.

Therefore, P. pastoris, a methyltrophic yeast, is one of thebest choices in this category. In structural biology, P. pastorisis widely used for the large-scale production of membraneproteins, such as GPCRs. Isotopic labeling of NMR samplesby Pichia has been developed and established [28]. A Pichiaexpression system may be used in combination with analcohol oxidase 1 (AOX1) promoter. Pichia can grow in amedium with NH4Cl as the sole nitrogen source, and a15N-labeled protein sample is easily obtained at a reasonablecost, although, a problem may arise when attempting toobtain 13C-labeled proteins. An optimized protocol withregard to lowering the medium cost has been proposed [29].

3.4 Kluyveromyces lactisKluyveromyces lactis is another yeast that was recently estab-lished as a host for protein expression studies [30,31]. K. lactisis a well-known yeast used in the industry for the commercialproduction of rennet. K. lactis is also known as a killer yeastbecause it secrets a toxin (g toxin) that kills other yeast speciessuch as S. cerevisiae. Sugiki et al. successfully utilized a K. lactisexpression system to produce isotopically labeled proteinsamples with an acceptably low cost, even for 13C-labeling.

One of the major advantages of a K. lactis expression systemcompared with that of P. pastoris is its promoter for

Pocket A Pocket B

Ligand c Ligand d

A. B.

Figure 2. The two major strategies of "protein-based" approaches of NMR spectroscopy in drug discovery: (A) SAR-by-NMR

method, (B) fragment-growing method. Open and closed circles represent each NH groups in the target protein, whose

signals are either insensitive or sensitive to ligand binding, respectively. Filled arrows indicate the direction of the growth of

the initial fragment (light gray) by step-by-step lead optimization processes.

H. Hiroaki

526 Expert Opin. Drug Discov. (2013) 8(5)

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 5: Recent applications of isotopic labeling for protein NMR in drug discovery

gene expression. The strong LAC4 promoter, which inducescontinuous protein production in the presence of galactose inthe culture medium, was used for this purpose. Moreover,Sugiki et al. successfully optimized the culture conditions toreduce the total amounts of 13C-galactose used as a carbonand energy source. The original protocol used 2% (20 g/l)galactose [32]. Sugiki et al. reduced this to 5 g/l of 13C6-glucoseby employing a well-designed fed-batch harvesting protocol. Asa result, the cost of isotope (13C/15N) labeling the proteins ofinterest was as low as 20% of that used for P. pastoris. Becausethis cost is in the range of 100 -- 200% of that with E. coli,the K. lactis expression system should become more popular.

3.5 Insect, plant, and mammalian cellsIn addition to yeast expression systems, other highereukaryotes are now available as hosts for heterologous geneexpression in terms of isotopic labeling. However, the problemsof the cost and availability of isotopically labeled media persist.For insect cells, the well-known Sf9 cells with a baculovirusexpression system, which are widely used in molecular biologystudies, were also employed to prepare NMR samples [33-35].Uniform labeling and AATS labeling were achieved.

With regard to plant cells, Ohki et al. established an expres-sion system using plant cells [36]. Tobacco BY-2 cells wereused to demonstrate the production of sufficient amounts ofproteins, including dihydrofolate reductase, chicken calmodu-lin, porcine protein kinase C-dependent protein phosphatase-1 inhibitor, and bovine pancreatic trypsin inhibitor (BPTI).In this method, only 15N-labeled proteins are available byusing K15NO3 and 15NH4

15NO3 as nitrogen sources. It isnotable that BY-2 cells successfully produce BPTI with correctdisulfide bonds (three bonds in this protein).

The use of mammalian cells should also be noted, since theiruse for standard molecular biology experiments is very popular.In terms of isotopically labeled preparation of biologics andimmunoglobulins, mammalian expression systems are especiallyimportant, because their glycosylation profile is crucial for thepharmacological function (reviewed in [37]). So far, Sastry et al.recently demonstrated the preparation of isotopically labeled,NMR-competent samples for the outer domain of theHIV-1 gp120 glycoprotein using an adenovirus expressionvector [38]. This system was optimized for use with an NMR-ready synthetic medium for mammalian cells, CGM-6000(Cambridge Isotope Laboratory, Ltd.). Under optimized con-ditions, HEK293 cells could produce > 40 mg/l of proteinswith an average of > 80% isotope incorporation. This applica-tion will open the door for NMR structural biology usingmammalian expression systems, in addition to the reasonablecost for labeling.

3.6 Cell-free systemsSeveral cell-free protein synthesis systems and many commer-cially available kits have been developed [39]. Theoretically,there should be no difference between protein expressionwith natural (no label) and isotopically labeled amino acids.

Thus, the only issue for isotopic labeling is the productionrate. E. coli cell-free systems and wheat germ cell-free systemsare known for their use in highly automated protein produc-tion in several structural genomics projects. Based on thedegree of purification of cell-free extracts, which include theentire machinery of protein production along with other met-abolic enzymes, some of these are limited in their use foramino acid-specific labeling. Special care should be taken toavoid metabolic scrambling when chemically modified aminoacids, such as SAIL amino acids, are used [40].

4. Advances in specific/selective isotopelabeling techniques

Strategies for specific and selective isotope labeling techniquesare briefly summarized in this section (Figure 3). In general,these labeling strategies are roughly grouped into two categoriesbased on their purposes. The first group includes AATS label-ing, inverse labeling (AATS unlabeling), combinatorial AATSlabeling, and segmental labeling. These strategies are used tosimplify the measured spectra by decreasing the numbers ofobservable signals. These simplifications may contribute to acertain rationale for confirming already-made assignments ofsignals, thus contributing to the assignment process itself.

The second group of strategies includes metabolic frac-tional labeling methods and a chemical method for certainspecific moieties. The stereo-array isotope labeling (SAIL)method is one of the most extreme uses of chemical synthesis.This group of methods is primarily used to challenge andexpand upon the molecular weight limit of NMR methodol-ogy by improving the relaxation pathways of certain spins inthe protein of interest.

4.1 AATS labelingSeveral expression systems have been used to demonstrateAATS labeling [23,33,35,41-43]. To accumulate the complete setof AATS labeling for all amino acids, the system usingE. coli required a set of auxotrophs [44]. However, for someamino acids that are located at the terminus of a metabolicpathway, such as His, Lys, Met, and Ala, AATS labelingwith a prototrophic host strain is highly attainable [45].When using cell-free protein synthesis systems, AATS labelingappears to be much easier than when using expression sys-tems. However, some cell-free systems still contain active met-abolic enzymes; therefore, special care should be taken toavoid isotope scrambling [46].

Obviously, the greatest merit of this approach is its simplic-ity (Figure 3B). The residues of only the labeled amino acidgive HSQC signals without doubt. Thus, the process ofsequential assignment becomes simpler and more reliable. Insome difficult cases, such as low sample solubility or weakself-association, the quality of 3D NMR spectra is notsufficient to reliably complete the sequential assignments ofbackbone signals. AATS labeling can resolve this issue.

Recent applications of isotopic labeling for protein NMR in drug discovery

Expert Opin. Drug Discov. (2013) 8(5) 527

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 6: Recent applications of isotopic labeling for protein NMR in drug discovery

1H(amide) (ppm) 15

N (

pp

m)

15N A.

ALA

1H(amide) (ppm)

15N

(p

pm

)

15N B.

ALA

1H(amide) (ppm)

15N

(p

pm

)

15N C.

ALA

1H(amide) (ppm)

15N

(p

pm

)

15N

13VALC D.

1H(amide) (ppm)

15N

(p

pm

)

15NE.

1H(methyl - ppm)

13C

(p

pm

)

13C F.

1H(methyl - ppm)

13C

(p

pm

)

13C G.

M

M

M

M M

M

M

1H(methyl - ppm)

13C

(p

pm

)

13C / 2H H.

Figure 3. Schematic diagram for uniform and specific isotope labeling of a protein with its appearance in 2D NMR spectra.

Filled and double circles represent fully isotopically labeled amino acid residues with the indicate nuclei. Open and shaded

circles represent unlabeled and fractionally labeled amino acids, respectively. Circles with "M" denote residues with 13C-

methyl-specific labeling. (A) Uniform 15N-labeling, (B) 15N-AATS labeling (15N-Ala), (C) 14N-inverse-labeling (14N-Ala) in 15N

background, (D) combinatorial AATS labeling with 13C-Val and 15N-Ala. Only the signal corresponding to Val-Ala sequence

gives a doublet signal, (E) segmental labeling, (F) uniform 13C-labeling and the expanded CH3 region of 13C-HSQC spectrum.

Signals are broadened by 13C--13C coupling, (G) 13C-fractional labeling. proR methyl groups give doublet signals whereas proS

methyl groups are in singlet form, (H) 13C-methyl-specific labeling in 2H background. Signals are much sharper than (F).

H. Hiroaki

528 Expert Opin. Drug Discov. (2013) 8(5)

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 7: Recent applications of isotopic labeling for protein NMR in drug discovery

When combining a genetic strategy to prepare all Alasubstitution mutants for the target amino acid in the proteinof interest, the specific assignments for all certain aminoacid residues are possible without using any three- or higher-dimensional NMR techniques. Thus, the molecular weightlimitation of NMR methodology is also largely expanded(up to 100 K).

4.2 Amino acid-selective unlabeling (inverse labeling)Another approach is amino acid-selective labeling and unlab-eling. In this strategy, few 14N-amino acids are added to themedium with a 15N-background (Figure 3C). This methodwas originally developed by Shortle for simplifying HSQCspectra and for identifying amino acid residue types to aidin the subsequent sequential assignment process [47]. Werecently improved this protocol by combinatorial aminoacid inverse labeling without using auxotrophic E. coli hoststrains [48]. This method was optimized to obtain eight com-binations of amino acids, Ile/Leu/Val(-), Gly/Ser/Cys(-),Phe/Tyr(-), Ala(-), Lys(-), Arg(-), His(-), and Trp(-), whichwere inversely labeled in the standard E. coli host strainBL21(DE3) using a minimal M9 medium formulation.This technique was further expanded to facilitate sequentialassignments in a cost-effective manner [49].

4.3 Combinatorial AATS labeling for sequential

assignmentsAn advanced option for AATS labeling is simultaneouslyusing two or more differently labeled amino acids. For exam-ple, a combination of 13C-labeled Val and 15N-labeled Alaprovides unique 13C-coupled-15N signals only at the aminoacid sequence of Val-Ala (Figure 3D). Thus, combinatorial13C and 15N AATS labeling can be used for sequence specificassignments for specific amino acid pairs.

Kohno recently developed a method to produce dual aminoacid-selective 13C-15N labeled proteins by using an improvedwheat germ cell-free system [50]. This method opened thedoor to completing sequence-specific assignments of amidesignals for very large protein samples.

4.4 Segmental labelingSegmental labeling is another approach for simplifying thecomplex HSQC spectra of larger or multidomain proteins(Figure 3E) [51]. This method utilizes in vitro protein ligationof two individually prepared protein segments; one is isotopi-cally labeled, and the other is unlabeled. For protein ligation,the “split intein” approach is commonly used [52].

One of the major drawbacks of this strategy is that refold-ing of the ligated protein sample is absolutely necessary.Thus, applications for some fragile samples such as kinasesand other biologically important enzymes remain limited.A recent application of segmental labeling was optimized forimproving the solubility of smaller soluble protein samples [53].An appropriate fusion tag protein efficiently improves the

solubility of the protein of interest, which enables obtainingan NMR sample of sufficient concentration.

4.5 Metabolic fractional labeling and moiety-specific

labelingMetabolic fractional labeling is a method to achievestereo- and region-specific isotopic labeling of proteins usingisotopically enriched precursors of amino acids for biosyn-thetic pathways. For example, mixtures of 13C6- and12C6-glucoses (Figure 3G) [54], [1]-13C1-glucose [55,56], as wellas [1,3]13C2- and [2]13C2-glycerols [57,58] are in this category.Therefore, an E. coli expression system is often used with thesemethods. To the best of my knowledge, the first applicationof metabolic fractional labeling was the use of 13 -- 87%13C6- and 12C6-glucose mixtures for stereo-specific NMRassignments of prochiral methyl groups in Leu and Val resi-dues (Figures 3G, 4A) [54]. In addition, this approach is usedfor analyzing amino acid metabolic pathways in E. coli [59,60]

and successfully applied to backbone assignments [61].Alternatively, a method to selectively introduce 13CH3

groups of Val, Ile, and Leu within an otherwise fully2H-labeled background was developed (Figure 3H) [41,62]. Inthis case, 13C2-a-keto-b-methylvaleric acid was added to theE. coli culture medium. The isolated spin system of 13CH3

groups showed improved relaxation properties. A pulsesequence “methyl-TROSY” is employed to observe these13CH3 signals in very large proteins. In addition, this methodis suitable for protein--protein interaction interfaces in largerprotein complexes.

Recently, Takeuchi et al. demonstrated a method for alter-nating 13C labeling using either [1,3]13C2- or [2]

13C-glycerolas the carbon source in 15N/2H2O medium [58]. This methodprovided a protein sample in which a 13Ca atom was placedas the neighbor of 12C¢ and 12Cb; thus, their spin was isolated(Figure 4B). This method was effectively combined with13C-directly detected 13C-15N-2D NMR experiments(CaN). The prolonged relaxation properties of the isolated13Ca in larger proteins were advantageous for further NMRapplications, such as CACA-TOCSY experiments [63].

4.6 Chemical 13C-Me-Lys labelingChemically introducing 13C-labeled methyl groups ontoe-amino groups of lysine residues in unlabeled proteinsshould be noted [64]. Chemical methylation of lysines has along history for improving the resolution during X-raycrystallization [65] and was recently re-evaluated forhigh-throughput structural genomics [66,67]. So, this methodhas been expanded to protein NMR. In this case, a treatmentcalled “reductive methylation” is used.

In brief, 13C-formaldehyde is mixed with a target protein ina buffer at physiological pH, after which sodium cyanoboro-hydride is added to reduce the Schiff base complexes offormaldehyde and lysine. This reaction is so mild that itprovides the following advantages: i) methylation of lysine

Recent applications of isotopic labeling for protein NMR in drug discovery

Expert Opin. Drug Discov. (2013) 8(5) 529

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 8: Recent applications of isotopic labeling for protein NMR in drug discovery

residues does not significantly change a protein’s structureduring and after the reaction, ii) the methylated lysines retaintheir positive charges, iii) this method allows the introductionof NMR-active nuclei into proteins purified from organismsfor which there are no available isotope enrichment protocolspost-translation, and iv) the 13C methyl groups on lysines

have favorable relaxation properties, which makes itapplicable for large molecular weight proteins.

4.7 SAIL methodTo further improve the relaxation properties of the atoms ofamino acids, SAIL adopted a strategy of an extreme use ofchemical synthesis [40]. This method employs stereo- andregion-specific patterns of stable isotopes in which all2H- and 13C-labeling sites in uniformly 15N-labeled proteinsare extensively designed. Their occupancy level is nearlycomplete (~ 100%). Selected examples of SAIL amino acidsare illustrated in Figure 4C.

The concept for designing the labeling patterns of SAILamino acids is as follows: i) one of the two protons at CH2

is substituted to 2H (D) resulting in 13CHD; ii) two of theprotons at the Ala, Thr, and Met methyl groups are substi-tuted to 2H (D) resulting in 13CD2H; iii) one of the prochiralmethyl groups of Val and Leu are also labeled with 13CD2H;iv) the other prochiral methyl site is labeled with 12CD3;v) finally, for the six-membered aromatic rings of Phe andTyr, 13CH and 12CD moieties are applied at alternating posi-tions. As a result, all 13C atoms possess single 1H whose NMRsignals are not hampered by geminal 2JHH coupling as well asrelaxation via 1H--1H and 1H--13C dipolar couplings.

Typically, these SAIL amino acids are introduced intoproteins through cell-free protein synthesis [68-70]. This methodprovides an opportunity to determine the structures of largemolecular weight proteins with a reasonable effort [71,72].

5. In-cell NMR: pharmaceutical NMRapplications for cellular biochemistry anddrug delivery

5.1 In-cell NMR in bacterial expression systemsAs mentioned previously, heteronuclear correlation spectros-copy for 1H/15N and 1H/13C is merely observable withoutthe aid of isotopic labeling due to their low naturalabundance. This is the greatest advantage of HSQC-based/protein-based NMR methods used in drug screening becauseonly the signals from the target protein of interest are visible,even in a highly heterogeneous sample with numerousimpurities. One of the most extreme examples of this isin-cell NMR spectroscopy (reviewed by [73,74]).

The first example of in-cell NMR in living E. coli cells wasprovided by Serber et al. [75,76]. The principle of introducingstable isotopes into a protein of interest for in-cell NMR isvery similar to the protocol used for standard solution NMRby overexpressing an 15N-labeled protein sample using aT7-promoter-based system. The only major difference isoptimizing the timing for switching the culture mediumfrom nonlabeled to an isotopically enriched medium justbefore isopropyl-b-thioglucoside (IPTG) induction of targetprotein synthesis.

Because the efficiency of accumulation of the protein ofinterest is sufficiently high, it is well known that even a crude

13C6-glucose 12C6-glucose

Val

Leu

13% + 87 %

β

β

Val Ile Leu

Trp

A.

B.

C.

Figure 4. Examples of metabolic and chemical atom-specific

isotope labeling of amino acids. In the scheme, *C denotes13C atom. (A) 13C-fractional labeling using mixture of13C6- and 12C6-glucose by E. coli metabolic pathways. The

signal-splitting pattern can be used for stereo-specific signal

assignment of methyl groups, (B) 13C-alternating labeling

with [1,3]-13C2- and [2]-13C1-glycerol. (C) Selected examples

of SAIL amino acids.

H. Hiroaki

530 Expert Opin. Drug Discov. (2013) 8(5)

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 9: Recent applications of isotopic labeling for protein NMR in drug discovery

lysate of E. coli can provide a sufficient quality HSQCspectrum [16,77]. Thus, this technique has been expanded tomeasure a slurry sample of living bacterial cells that mayhave the overexpressed, isotopically labeled proteins. Becausethis methodology appears to be adequately robust, the bacte-rial in-cell NMR technique can be combined with many ofthe isotopic labeling techniques using E. coli cells describedin Section 4.

One of the most significant recent advances in bacterialin-cell NMR methods is the de novo NMR structure determi-nation of a protein from scratch. Sakakibara et al. successfullydetermined the structure of protein TTHA1718, a putativemetal binding protein from Thermus thermophilus, in livingE. coli cells [78]. The high viscosity and substantial instabilityof an in-cell NMR sample interfered with the complete resultsof a series of three-dimensional NMR spectra. They overcamethis problem by extensively using nonuniform samplingprofiles of multidimensional NMR spectra coupled withmaximum-entropy reconstruction rather than the usualFourier transformation [78,79].

Another remarkable application of bacterial in-cell NMRfor drug discovery is small molecule inhibitor library byin-cell NMR (SMILI-NMR) [80]. Before developing this,Shekhtman et al. developed a new variant of bacterial in-cellNMR method called STINT-NMR [81,82]. E. coli cellsharboring two different genes under the control of twodifferent promoters were used.

The first gene was allowed to be expressed in an isotope-labeled medium, after which the second gene was expressedin nonlabeled conditions. As a result, the first protein was vis-ible on NMR, whereas the second protein, which interactedwith the first protein, was not observable. Complex forma-tions of these two proteins could be monitored similar toin vitro protein--protein interaction experiments. This idea(SMILI-NMR) is applicable to examine the effects ofsmall molecules that potentially inhibit a protein--proteininteraction of interest [80].

5.2 In-cell NMR in Xenopus oocytesAnother application of in-cell NMR was successfully devel-oped using a Xenopus laevas oocyte system [83-85]. Xenopusoocytes are frequently used in drug discovery, particularlyfor the physiology of ion channels [86] and water channels [87].For these purposes, either mRNA or cDNA of the gene ofinterest is microinjected into oocytes, which then express thegene product. An automated, high-throughput system fordrug screening using Xenopus oocytes is now widely used [88].

Sakai et al. successfully made the first application of “in-cell NMR” for eukaryotic cells. 15N-uniformly labeled Ub(prepared with a standard E. coli expression system) wasmicroinjected into 200--250 oocytes at developmental stagesV--VI. Although up to 100 mM, 15N-labeled Ub accumula-tion in living oocytes caused severe signal losses. When com-paring wild-type and interaction-deficient Ub mutants, weconcluded that this signal loss was due to protein--protein

interactions inside the living oocytes [83,89]. We also demon-strated the monitoring of endogeneous enzyme activity of ade-ubiquinating protease that processed the C-terminalamino acid of 15N-Ub.

The application of Xenopus in-cell NMR for monitoringprotein--protein interactions was expanded to the study ofthe microtubule binding protein Tau [84]. Tau is an intrinsi-cally disordered, aggregation-prone protein that is stronglyassociated with neuronal degenerative diseases (tauopathiesand Alzheimer’s disease). The formation of cellular aggregatesof Tau, known as paired helical filaments (PHFs), and itssubsequent drop-out from neuronal microtubules is believedto underlie this molecular pathogenesis. Thus, the in vivobehavior of Tau is of special interest.

Alternatively, Hansel et al. established how to observeDNA and RNA NMR spectra inside living oocytes [85]. Thecritical point was chemically modifying natural nucleic acidsto avoid in vivo degradation.

All these papers have clearly shown the potential of in-cellNMR methodologies, which are applicable to studies indrug delivery.

5.3 In-cell NMR in mammalian cellsWith regard to its potential for evaluating drug deliverysystems, in-cell NMR using mammalian cells has been long-awaited. To date, two applications have been published [90,91];one of these applications was proposed by us. In both meth-ods, the difficulty was introducing isotopically labeled proteinsamples into living cells. In contrast to bacterial systems, anefficient, rapid protein expression system for the genestransfected into mammalian cells has not yet been established.Further, in contrast to Xenopus oocytes, microinjecting labeledproteins into mammalian cells in large amounts isnot realistic.

Thus, Inomata et al. overcame this problem by using atechnique called a protein transduction domain (PTD) tag.PTD, also known as a cationic membrane penetrating peptide(CPP), has emerged in applications for drug delivery ofmacromolecules and biomedicines, such as bioactive peptides,antibody drugs, and antisense oligonucleotides [92-94]. Theefficiency of protein transduction is boosted in combinationwith pyrenebutyrate [95]. HeLa and COS7 cells were shownto be competent for in-cell NMR methods in which 15N-labeled Ub, FKBP12, and GB1 were observed by NMR [90].Further, we succeeded in monitoring signal changes ofFKBP12 in the presence and absence of a binding drug,FK-506. This was the first biophysical evidence ofprotein--drug interactions at the atomic level in living cells.

Another means to deliver 15N-labeled thymosin b4 (Tb4)into 293F cells was demonstrated by Ogino et al. They useda pore-forming bacterial toxin, streptolysin O (SLO). SLOproduces pores with a diameter of 25 nm on the cell mem-brane, which are then resealed in a medium that contains Ca2+. Using this technique, they transduced Tb4 and successfullyobserved its HSQC spectra. In addition, post-translational

Recent applications of isotopic labeling for protein NMR in drug discovery

Expert Opin. Drug Discov. (2013) 8(5) 531

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 10: Recent applications of isotopic labeling for protein NMR in drug discovery

N-terminal acetylation of Tb4, which resulted in certainchemical shift changes, was observed.In conclusion, although mammalian in-cell NMR method-

ologies have not yet matured and are still under development,their potential for pharmaceutical applications is noteworthy.

6. Expert opinion

In drug discovery research, solution NMR spectroscopy withboth isotopically labeled and unlabeled proteins has enormouspotential for target protein validation, initial drug screening,hit evaluation, lead optimization, and even for ADMEimprovement. In this review, I did not focus on ligand-basedmethodologies. However, the increasing number of newapplications and techniques with ligand-based approachesmust not be negligible. In addition, both strategies, protein-based and ligand-based methods, are highly complementaryto each other. Thus, researchers must attempt to establish anexpression system for their target protein of interest to incor-porate isotopes and to increase the number of applications ofHSQC-based experiments.For example, not only chemical shift perturbation experi-

ments (i.e., SAR-by-NMR) but also many other physicaland physicochemical parameters such as relaxation times,H/D-exchanging rates, conformational exchanges on millisec-ond time scales, and residual dipolar couplings are nowbecoming accessible by introducing isotope labels. Thisfine-tuned information at individual residue or atomic levelresolution is suitable for characterizing relatively difficultdrug targets.For example, modifying and controlling PPIs and PAs by

low molecular weight compounds are attractive concepts fornext generation drugs. In such cases, X-ray crystallographymay not be efficient because protein complexes, fibrils, andaggregates are sometimes rather transient and heterologous,which precludes their crystallization. In the same context,intrinsically disordered proteins (IDPs) are also importanttargets that can be studied only by solution methods.The use of in-cell NMR spectroscopy for basic research on

drug delivery is another possible option. Currently, in-cellNMR is the only way to monitor a protein at work inside liv-ing organisms, which may provide new insights in the mech-anism of drug action. However, the very low signal-to-noiseratios of proteins inside cells hinder the rapid spread of thistechnology. In particular, for mammalian applications of in-cell NMR, there have been only two successful examples.

The method used for incorporating labeled proteins insideliving cells at a high concentration is the difficulty. Othermethods, such as overexpression of labeled proteins insideliving mammalian cells must be developed.

A recent advancement in the high-throughput, automatedstructure determinations of proteins provides another per-spective. This topic may only attract the attention of research-ers within the field of structural genomics. However, decadesafter the initiation of structural genomics research projects,two consortia (in Europe and the US, but not in Japan) con-tinue to develop and refine the pipeline for structure determi-nations by NMR [96,97]. A highly automated pipeline ofprotein structure determinations may become readily applica-ble to drug design.

In case of structure-guided drug design, the speed ofobtaining a structure of a protein--drug (candidate) complexis the rate-limiting factor. A part of this problem has alreadybeen solved by high-throughput X-ray crystallography ofdrug--target complexes. Some pharmaceutical companieshave their own synchrotron beam lines. The remainder ofthis problem will be solved soon by high-throughput NMRstructural determinations. This will be possible by combiningknown techniques to accelerate NMR experiments includingnonuniform sampling and reduced dimensionality techniquesfor rapid measurements of 3D/4D-spectra, SOFAST-HMQC- and BEST-HNCA-based pulse sequences also usedfor shortening data acquisition times, and an automatedprotocol for signal assignments from NMR data [98,99]. Theseapproaches can be used in place of high-throughput X-raycrystallography.

With the help of high-performance computing andaccumulation of a large number of examples of protein--drugcomplexes, it should not be surprising that the paradigmsused in rational drug design and discovery will change duringthe next five years.

Acknowledgments

The author thanks K Sugiki and H Takahashi for helpfuladvice. The author also thanks Enago (www.enago.jp) forthe English language review.

Declaration of interest

The author declares no conflict of interest and has received nopayment in preparation of this manuscript.

H. Hiroaki

532 Expert Opin. Drug Discov. (2013) 8(5)

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 11: Recent applications of isotopic labeling for protein NMR in drug discovery

BibliographyPapers of special note have been highlighted as

either of interest (�) or of considerable interest(��) to readers.

1. Pellecchia M, Sem DS, Wuthrich K.

NMR in drug discovery. Nat Rev

Drug Discov 2002;1(3):211-19.. A comprehensive overview of

“ligand-based” techniques.

2. Pellecchia M, Bertini I, Cowburn D,

et al. Perspectives on NMR in drug

discovery: a technique comes of age.

Nat Rev Drug Discov 2008;7(9):738-45.. A comprehensive overview of

“ligand-based” techniques.

3. Heller M, Kessler H. NMR spectroscopy

in drug design. Pure Appl Chem

2001;73(9):1429-36

4. Orita M, Warizaya M, Amano Y, et al.

Advances in fragment-based drug

discovery platforms. Expert Opin

Drug Discov 2009;4(11):1125-44.. This is a nice review on how FBDD is

coupled with other

biophysical methods.

5. Ni F. Recent developments in transferred

noe methods. Prog Nucl Magn

Reson Spectrosc 1994;26:517-606

6. Fejzo J, Lepre CA, Peng JW, et al. The

SHAPES strategy: an NMR-based

approach for lead generation in drug

discovery. Chem Biol 1999;6(10):755-69

7. Antalek B. Using pulsed gradient spin

echo NMR for chemical mixture analysis:

how to obtain optimum results.

Concepts Magn Reson

2002;14(4):225-58

8. Chen A, Shapiro MJ. Affinity NMR.

Analyt Chem 1999;71(19):669A-75A

9. Bhunia A, Bhattacharjya S, Chatterjee S.

Applications of saturation transfer

difference NMR in biological systems.

Drug Discov Today

2012;17(9-10):505-13.. Recent review for STD technique in

“ligand-based” techniques.

10. Dalvit C, Pevarello P, Tato M, et al.

Identification of compounds with

binding affinity to proteins via

magnetization transfer from bulk water.

J Biomol NMR 2000;18(1):65-8. A paper describing WATERLOGSY.

11. Becattini B, Culmsee C, Leone M, et al.

Structure-activity relationships by

interligand NOE-based design and

synthesis of antiapoptotic compounds

targeting Bid. Proc Nat Acad Sci USA

2006;103:33:12602-6

12. Sanchez-Pedregal VM, Reese M,

Meiler J, et al. The

INPHARMA Method: protein-Mediated

Interligand NOEs for pharmacophore

mapping. Angew Chem

2005;117(27):4244-7

13. Shuker SB, Hajduk PJ, Meadows RP,

Fesik SW. Discovering high-affinity

ligands for proteins: SAR by NMR.

Science 1996;274(5292):1531-4.. The original paper for

“SAR-by-NMR” method.

14. Hajduk PJ, Sheppard G,

Nettesheim DG, et al. Discovery of

Potent Nonpeptide Inhibitors of

Stromelysin Using SAR by NMR. J Am

Chem Soc 1997;119(25):5818-27

15. Petros AM, Huth JR, Oost T, et al.

Discovery of a potent and selective

Bcl-2 inhibitor using SAR by NMR.

Bioorg Med Chem lett

2010;20(22):6587-91

16. Cai M, Huang Y, Sakaguchi K, et al. An

efficient and cost-effective isotope

labeling protocol for proteins expressed

in Escherichia coli. J Biomol NMR

1998;11(1):97-102

17. Hewitt L, McDonnell JM. Screening and

optimizing protein production in E. coli.

Methods Mol Biol 2004;278:1-16

18. Studier FW, Rosenberg AH, Dunn JJ,

Dubendorff JW. Use of

T7 RNA polymerase to direct expression

of cloned genes. Methods Enzymol

1990;185:60-89

19. Qing G, Ma L-C, Khorchid A, et al.

Cold-shock induced high-yield protein

production in Escherichia coli.

Nat Biotechnol 2004;22(7):877-82

20. Yashiro K, Lowenthal JW, O’Neil TE,

et al. High-level production of

recombinant chicken interferon-gamma

by Brevibacillus choshinensis.

Protein Expr Purif 2001;23(1):113-20.. Brevibacillus expression system.

21. Udaka S, Yamagata H. High-level

secretion of heterologous proteins by

Bacillus brevis. Methods Enzymol

1993;217:23-33

22. Tanio M, Tanaka T, Kohno T. 15N

isotope labeling of a protein secreted by

Brevibacillus choshinensis for NMR

study. Anal Biochem 2008;373(1):164-6

23. Tanio M, Tanaka R, Tanaka T,

Kohno T. Amino acid-selective isotope

labeling of proteins for nuclear magnetic

resonance study: proteins secreted by

Brevibacillus choshinensis. Anal Biochem

2009;386(2):156-60

24. Takahashi H, Shimada I. Production of

isotopically labeled heterologous proteins

in non-E. coli prokaryotic and eukaryotic

cells. J Biomol NMR 2010;46(1):3-10.. A comprehensive review for non-E. coli

expression systems.

25. Kamiya Y, Yamamoto S, Chiba Y, et al.

Overexpression of a homogeneous

oligosaccharide with 13C labeling by

genetically engineered yeast strain.

J Biomol NMR 2011;50(4):397-401

26. Takegawa K, Tohda H, Sasaki M, et al.

Production of heterologous proteins

using the fission-yeast

(Schizosaccharomyces pombe) expression

system. Biotechnol Appl Biochem

2009;53(Pt 4):227-35. An industrial scale expression system

using fission yeast.

27. Antuch W, Guntert P, Wuthrich K.

Ancestral beta gamma-crystallin precursor

structure in a yeast killer toxin.

Nat Struct Biol 1996;3(8):662-5

28. Pickford AR, O’Leary JM. Isotopic

labeling of recombinant proteins from

the methylotrophic yeast Pichia pastoris.

Methods Mol Biol 2004;278:17-33.. Isotopic labeling using Pichia

expression system.

29. Rodriguez E, Krishna NR. An

economical method for (15)N/(13)C

isotopic labeling of proteins expressed in

Pichia pastoris. J Biochem

2001;130(1):19-22

30. Sugiki T, Shimada I, Takahashi H.

Stable isotope labeling of protein by

Kluyveromyces lactis for NMR study.

J Biomol NMR 2008;42(3):159-62.. A key publication for K. lactis

expression system.

31. Sugiki T, Ichikawa O,

Miyazawa-Onami M, et al. Isotopic

labeling of heterologous proteins in the

yeast Pichia pastoris and Kluyveromyces

lactis. Methods Mol Biol

2012;831:19-36

32. Colussi PA, Taron CH. Kluyveromyces

lactis LAC4 promoter variants that lack

function in bacteria but retain full

function in K. lactis.

Recent applications of isotopic labeling for protein NMR in drug discovery

Expert Opin. Drug Discov. (2013) 8(5) 533

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 12: Recent applications of isotopic labeling for protein NMR in drug discovery

Appl Environ Microbiol

2005;71(11):7092-8

33. Strauss A, Bitsch F, Cutting B, et al.

Amino-acid-type selective isotope labeling

of proteins expressed in

Baculovirus-infected insect cells useful for

NMR studies. J Biomol NMR

2003;26(4):367-72.. Isotopic labeling and AATS by

baculovirus expression system.

34. Strauss A, Bitsch F, Fendrich G, et al.

Efficient uniform isotope labeling of Abl

kinase expressed in Baculovirus-infected

insect cells. J Biomol NMR

2005;31(4):343-9

35. Gossert AD, Hinniger A, Gutmann S,

et al. A simple protocol for amino acid

type selective isotope labeling in insect

cells with improved yields and high

reproducibility. J Biomol NMR

2011;51(4):449-56

36. Ohki S, Dohi K, Tamai A, et al.

Stable-isotope labeling using an inducible

viral infection system in

suspension-cultured plant cells.

J Biomol NMR 2008;42(4):271-7.. BY-2 plant cell expression system.

37. Kato K, Yamaguchi Y, Arata Y.

Stable-isotope-assisted NMR approaches

to glycoproteins using immunoglobulin

G as a model system. Prog Nucl Magn

Reson Spectrosc 2010;56(4):346-59.. A comprehensive review for isotopic

labeling in mammalian

expression system.

38. Sastry M, Xu L, Georgiev IS, et al.

Mammalian production of an isotopically

enriched outer domain of the

HIV-1 gp120 glycoprotein for NMR

spectroscopy. J Biomol NMR

2011;50(3):197-207

39. Takeda M, Kainosho M. Cell-free

protein production for NMR studies.

Methods Mol Biol 2012;831:71-84

40. Kainosho M, Torizawa T, Iwashita Y,

et al. Optimal isotope labelling for NMR

protein structure determinations. Nature

2006;440(7080):52-7.. The orginal article of SAIL method.

41. Goto NK, Kay LE. New developments

in isotope labeling strategies for protein

solution NMR spectroscopy. Curr Opin

Struct Biol 2000;10(5):585-92

42. Chen C-Y, Cheng C-H, Chen Y-C, et al.

Preparation of amino-acid-type selective

isotope labeling of protein expressed in

Pichia pastoris. Proteins

2006;62(1):279-87

43. Bruggert M, Rehm T, Shanker S, et al.

A novel medium for expression of

proteins selectively labeled with15N-amino acids in Spodoptera

frugiperda (Sf9) insect cells.

J Biomol NMR 2003;25(4):335-48

44. Lin MT, Sperling LJ,

Frericks Schmidt HL, et al. A rapid and

robust method for selective isotope

labeling of proteins. Methods

2011;55(4):370-8

45. O’Grady C, Rempel BL, Sokaribo A,

et al. One-step amino acid selective

isotope labeling of proteins in

prototrophic Escherichia coli strains.

Anal Biochem 2012;426(2):126-8

46. Yokoyama J, Matsuda T, Koshiba S,

et al. A practical method for cell-free

protein synthesis to avoid stable isotope

scrambling and dilution. Anal Biochem

2011;411(2):223-9

47. Shortle D. Assignment of amino acid

type in 1H-15N correlation spectra by

labeling with 14N-amino acids. J Magn

Reson B 1994;105(1):88-90.. The first paper describing 14N

inverse labeling.

48. Hiroaki H, Umetsu Y, Nabeshima Y,

et al. A simplified recipe for assigning

amide NMR signals using combinatorial14N amino acid inverse-labeling. J Struct

Funct Genomics 2011;12(3):167-74

49. Krishnarjuna B, Jaipuria G, Thakur A,

et al. Amino acid selective unlabeling for

sequence specific resonance assignments

in proteins. J Biomol NMR

2011;49(1):39-51

50. Kohno T. NMR assignment method for

amide signals with cell-free protein

synthesis system. Methods Mol Biol

2010;607:113-26

51. Iwai H, Zuger S. Protein ligation:

applications in NMR studies of proteins.

Biotechnology & genetic

engineering reviews 2007;24:129-45. A review for protein ligation and

segmental isotope labeling.

52. Yamazaki T, Otomo T, Oda N, et al.

Segmental isotope labeling for protein

NMR using peptide splicing. J Am

Chem Soc 1998;120(22):5591-2.. The first paper describing segmental

isotope labeling.

53. Kobayashi H, Swapna GVT, Wu K-P,

et al. Segmental isotope labeling of

proteins for NMR structural study using

a protein S tag for higher expression and

solubility. J Biomol NMR

2012;52(4):303-13

54. Senn H, Werner B, Messerle BA, et al.

Stereospecific assignment of the methyl1H NMR lines of valine and leucine in

polypeptides by nonrandom 13C

labelling. FEBS Lett 1989;249(1):113-18.. The first paper describing metabolic

fractional 13C labeling.

55. Teilum K, Brath U, Lundstr€om P,

Akke M. Biosynthetic 13C labeling of

aromatic side chains in proteins for

NMR relaxation measurements. J Am

Chem Soc 2006;128(8):2506-7

56. Lundstr€om P, Teilum K, Carstensen T,

et al. Fractional 13C enrichment of

isolated carbons using [1-13C]- or [2-13C]-glucose facilitates the accurate

measurement of dynamics at backbone

Calpha and side-chain methyl positions

in proteins. J Biomol NMR

2007;38(3):199-212

57. LeMaster DM, Kushlan DM. Dynamical

mapping of E. coli thioredoxin via 13C

NMR relaxation analysis. J Am

Chem Soc 1996;118(39):9255-64

58. Takeuchi K, Sun Z-YJ, Wagner G.

Alternate 13C-12C labeling for complete

mainchain resonance assignments using

C alpha direct-detection with

applicability toward fast relaxing protein

systems. J Am Chem Soc

2008;130(51):17210-11. Modern application of alternate

13C-12C labeling.

59. Szyperski T. Biosynthetically directed

fractional 13C-labeling of proteinogenic

amino acids. An efficient analytical tool

to investigate intermediary metabolism.

Eur J Biochem FEBS

1995;232(2):433-48.. In this paper, metabolic fractional

13C-labeling in E. coli was

extensively analysed.

60. Szyperski T, Neri D, Leiting B, et al.

Support of 1H NMR assignments in

proteins by biosynthetically directed

fractional 13C-labeling. J Biomol NMR

1992;2(4):323-34

61. Iwai H, Fiaux J. Use of biosynthetic

fractional 13C-labeling for backbone

NMR assignment of proteins.

J Biomol NMR 2007;37(3):187-93

62. Goto NK, Gardner KH, Mueller GA,

et al. A robust and cost-effective method

for the production of Val, Leu, Ile (delta

H. Hiroaki

534 Expert Opin. Drug Discov. (2013) 8(5)

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 13: Recent applications of isotopic labeling for protein NMR in drug discovery

1) methyl-protonated 15N-, 13C-,2H-labeled proteins. J Biomol NMR

1999;13(4):369-74

63. Takeuchi K, Frueh DP, Sun Z-YJ, et al.

CACA-TOCSY with alternate 13C-12C

labeling: a 13Calpha direct detection

experiment for mainchain resonance

assignment, dihedral angle information,

and amino acid type identification.

J Biomol NMR 2010;47(1):55-63

64. Abraham SJ, Hoheisel S, Gaponenko V.

Detection of protein-ligand interactions

by NMR using reductive methylation of

lysine residues. J Biomol NMR

2008;42(2):143-8. Chemical 13C-methyl labeling on

lysin residues.

65. Kobayashi M, Kubota M, Matsuura Y.

Crystallization and improvement of

crystal quality for x-ray diffraction of

maltooligosyl trehalose synthase by

reductive methylation of lysine residues.

Acta crystallogr D Biol Crystallogr

1999;55(Pt 4):931-3

66. Walter TS, Meier C, Assenberg R, et al.

Lysine methylation as a routine rescue

strategy for protein crystallization.

Structure 2006;14(11):1617-22

67. Sledz P, Zheng H, Murzyn K, et al. New

surface contacts formed upon reductive

lysine methylation: improving the

probability of protein crystallization.

Protein sci 2010;19(7):1395-404

68. Oba M, Kobayashi M, Oikawa F, et al.

Synthesis of 13 C/D doubly labeled l -

leucines: probes for conformational

analysis of the leucine side-chain.

J Org Chem 2001;66(17):5919-22

69. Oba M, Miyakawa A, Nishiyama K,

et al. Stereodivergent Synthesis of (2S,

3S, 4R, 5R) - and (2S, 3S, 4R, 5S) -

[3,4,5-D 3]Proline Depending on the

Substituent of the g-Lactam Ring.

J Org Chem 1999;64(25):9275-8

70. Oba M, Terauchi T, Miyakawa A, et al.

Stereoselective deuterium-labelling of

diastereotopic methyl and methylene

protons of L-leucine. Tetrahedron Lett

1998;39(12):1595-8

71. Takeda M, Sugimori N, Torizawa T,

et al. Structure of the putative 32 kDa

myrosinase-binding protein from

Arabidopsis (At3g16450.1) determined

by SAIL-NMR. FEBS J

2008;275(23):5873-84

72. Takeda M, Chang C, Ikeya T, et al.

Solution structure of the c-terminal

dimerization domain of SARS

coronavirus nucleocapsid protein solved

by the SAIL-NMR method. J Mol Biol

2008;380(4):608-22

73. Ito Y, Selenko P. Cellular structural

biology. Curr Opin Struct Biol

2010;20(5):640-8.. A well-written guide for in-cell

NMR spectroscopy.

74. Serber Z, Selenko P, Hansel R, et al.

Investigating macromolecules inside

cultured and injected cells by in-cell

NMR spectroscopy. Nat Protoc

2006;1(6):2701-9.. A well-written guide for in-cell

NMR spectroscopy.

75. Serber Z, Ledwidge R, Miller SM,

D€otsch V. Evaluation of parameters

critical to observing proteins inside living

Escherichia coli by in-cell NMR

spectroscopy. J Am Chem Soc

2001;123(37):8895-901

76. Serber Z, D€otsch V. In-cell NMR

spectroscopy. Biochemistry

2001;40(48):14317-23

77. Gronenborn AM, Clore GM. Rapid

screening for structural integrity of

expressed proteins by heteronuclear

NMR spectroscopy. Protein Sci

1996;5(1):174-7. In situ HSQC experiment of

isotopically labeled

nonpurified proteins.

78. Sakakibara D, Sasaki A, Ikeya T, et al.

Protein structure determination in living

cells by in-cell NMR spectroscopy.

Nature 2009;458(7234):102-5

79. Ikeya T, Sasaki A, Sakakibara D, et al.

NMR protein structure determination in

living E. coli cells using nonlinear

sampling. Nat Protoc 2010;5(6):1051-60

80. Xie J, Thapa R, Reverdatto S, et al.

Screening of small molecule interactor

library by using in-cell NMR

spectroscopy (SMILI-NMR).

J Med Chem 2009;52(11):3516-22. An article addressing applications of

in-cell NMR for drug discovery.

81. Burz DS, Dutta K, Cowburn D,

Shekhtman A. In-cell NMR for

protein-protein interactions (STINT-

NMR). Nat Protoc 2006;1(1):146-52. An article addressing applications of

in-cell NMR for drug discovery.

82. Burz DS, Dutta K, Cowburn D,

Shekhtman A. Mapping structural

interactions using in-cell NMR

spectroscopy (STINT-NMR).

Nat Methods 2006;3(2):91-3. An article addressing applications of

in-cell NMR for drug discovery.

83. Sakai T, Tochio H, Tenno T, et al.

In-cell NMR spectroscopy of proteins

inside Xenopus laevis oocytes.

J Biomol NMR 2006;36(3):179-88.. This is the first article about in-cell

NMR in Xenopus oocyte.

84. Bodart J-F, Wieruszeski J-M, Amniai L,

et al. NMR observation of Tau in

Xenopus oocytes. J Mag Reson

2008;192(2):252-7

85. Hansel R, Foldynova-Trantırkova S,

L€ohr F, et al. Evaluation of parameters

critical for observing nucleic acids inside

living Xenopus laevis oocytes by in-cell

NMR spectroscopy. J Am Chem Soc

2009;131(43):15761-8

86. Dascal N. The use of xenopus oocytes

for the study of ion channel. Crit Rev

Biochem Mol Biol 1987;22(4):317-87

87. Preston GM, Carroll TP, Guggino WB,

Agre P. Appearance of water channels in

xenopus oocytes expressing red cell

CHIP28 protein. Science

1992;256(5055):385-7

88. Pehl U, Leisgen C, Gampe K,

Guenther E. Automated

higher-throughput compound screening

on ion channel targets based on the

xenopus laevis oocyte expression system.

Assay Drug Dev Technol

2004;2(5):515-24

89. Sakai T, Tochio H, Inomata K, et al.

Fluoroscopic assessment of protein

leakage during Xenopus oocytes in-cell

NMR experiment by co-injected EGFP.

Anal Biochem 2007;371(2):247-9

90. Inomata K, Ohno A, Tochio H, et al.

High-resolution multi-dimensional NMR

spectroscopy of proteins in human cells.

Nature 2009;458(7234):106-9.. A key paper for in-cell NMR in

mammalian cells.

91. Ogino S, Kubo S, Umemoto R, et al.

Observation of NMR signals from

proteins introduced into living

mammalian cells by reversible membrane

permeabilization using a pore-forming

toxin, streptolysin O. J Am Chem Soc

2009;131(31):10834-5.. A key paper for in-cell NMR in

mammalian cells.

92. Vives E. A Truncated HIV-1 tat protein

basic domain rapidly translocates through

Recent applications of isotopic labeling for protein NMR in drug discovery

Expert Opin. Drug Discov. (2013) 8(5) 535

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.

Page 14: Recent applications of isotopic labeling for protein NMR in drug discovery

the plasma membrane and accumulates

in the cell nucleus. J Biol Chem

1997;272(25):16010-17

93. Derossi D, Joliot AH, Chassaing G,

Prochiantz A. The third helix of the

Antennapedia homeodomain translocates

through biological membranes.

J Biol Chem 1994;269(14):10444-50

94. Wender PA, Mitchell DJ,

Pattabiraman K, et al. The design,

synthesis, and evaluation of molecules

that enable or enhance cellular uptake:

peptoid molecular transporters. Proc Nat

Acad Sci 2000;97(24):13003-8

95. Takeuchi T, Kosuge M, Tadokoro A,

et al. Direct and rapid cytosolic delivery

using cell-penetrating peptides mediated

by pyrenebutyrate. ACS Chem Biol

2006;1(5):299-303

96. Montelione GT. The protein structure

initiative: achievements and visions for

the future. F1000 Biol Rep 2012;4:7

97. Bonvin AMJJ, Rosato A, Wassenaar TA.

The eNMR platform for structural

biology. J Struct Funct Genomics

2010;11(1):1-8

98. Schanda P, Kupce E, Brutscher B.

SOFAST-HMQC experiments for

recording two-dimensional heteronuclear

correlation spectra of proteins within a

few seconds. J Biomol NMR

2005;33(4):199-211

99. Schanda P, Van Melckebeke H,

Brutscher B. Speeding up

three-dimensional protein NMR

experiments to a few minutes. J Am

Chem Soc 2006;128(28):9042-3

AffiliationHidekazu Hiroaki PhD

Nagoya University,

Graduate School of Pharmaceutical Sciences,

Furocho, Chikusa-kum, Koto-kenkyu-kan,

Nagoya, 464-8601, Japan

E-mail: [email protected]

H. Hiroaki

536 Expert Opin. Drug Discov. (2013) 8(5)

Exp

ert O

pin.

Dru

g D

isco

v. D

ownl

oade

d fr

om in

form

ahea

lthca

re.c

om b

y U

nive

rsita

t de

Gir

ona

on 1

1/10

/14

For

pers

onal

use

onl

y.