10
Identification and structure elucidation by NMR spectroscopy Mikhail Elyashberg * Advanced Chemistry Development, Moscow Department, 6 Akademik Bakulev Street, Moscow 117513, Russian Federation ARTICLE INFO Keywords: CASE Computer-assisted structure elucidation Dereplication Expert system Hydrogen-deficient molecule NMR Nuclear magnetic resonance Structure elucidation Structure identification Structure verification A B ST R AC T The state of the art and recent developments in application of nuclear magnetic resonance (NMR) for structure elucidation and identification of small organic molecules are discussed. The recently sug- gested new two-dimensional (2D)-NMR experiments combined with the advanced instrumentation allow structure elucidation of new organic compounds at a sample amount of less than 10 μg. A pure shift ap- proach that provides 1 H-decoupled proton spectra drastically simplified 1 H and 2D NMR spectra interpretation. The structure elucidation of extremely hydrogen-deficient compounds was dramatically facilitated due to the methodology based on combination of new 2D-NMR experiments providing long- range heteronuclear correlations with computer-assisted structure elucidation (CASE). The capabilities of CASE systems are discussed. The role of NMR-spectrum prediction in structure verification and NMR approaches for qualitative mixture analysis are considered. © 2015 Elsevier B.V. All rights reserved. Contents 1. Introduction ........................................................................................................................................................................................................................................................... 88 2. Common set of 1D- and 2D-NMR experiments ........................................................................................................................................................................................ 89 3. Development of NMR experiments and instrumentation ..................................................................................................................................................................... 89 4. CASE expert systems .......................................................................................................................................................................................................................................... 93 5. NMR chemical-shift prediction ...................................................................................................................................................................................................................... 94 6. Structure verification ......................................................................................................................................................................................................................................... 94 7. Structure identification and dereplication in mixtures .......................................................................................................................................................................... 94 8. Is it possible to avoid an erroneous structure elucidation? ................................................................................................................................................................. 95 9. Conclusions ............................................................................................................................................................................................................................................................ 95 Acknowledgements ............................................................................................................................................................................................................................................. 95 References .............................................................................................................................................................................................................................................................. 95 1. Introduction Molecular structure determination is a central theme of organic and analytical chemistry. Nuclear magnetic resonance (NMR) spec- troscopy in combination with high-resolution mass-spectrometry (HRMS) makes up a basic set of methods to solve this problem. Given the molecular formula of a complex organic molecule that has been determined using HRMS, two-dimensional (2D)-NMR plays a crucial role in structure elucidation. In this review, we consider applica- tion of NMR to determine structures of small organic molecules. The results achieved in this area were discussed in monographs [1,2] and reviews [3–9], including two comprehensive reviews pub- lished recently [8,9]. To make clear the issues being discussed in this review, it is nec- essary to consider some basic concepts. The first step in the structure determination of an unknown is a spectral search against the rel- evant available databases using MS and NMR spectra. If the spectrum of the unknown fully coincides with a reference spectrum, it means that the structural formula of the unknown is identical to that of the reference. This is termed as structure identification. Otherwise, the problem of structure elucidation arises. Given the structure is elucidated, it is necessary to establish if the compound is new. The structural search against corresponding databases {see review [10]} to answer this question is called dereplication. This procedure is also interpreted in literature as a structural identification of a known chemical entity based on previously reported analytical and spec- troscopic information [11]. Structure elucidation is obviously the most complicated task. It is related to the class of inverse problems [12] for which solution am- biguity is a distinctive peculiarity. A single solution is selected by imposing additional constraints. On the whole, the problem of * Tel.: +7 495 438 2153; Fax: +4954382874. E-mail address: [email protected] http://dx.doi.org/10.1016/j.trac.2015.02.014 0165-9936/© 2015 Elsevier B.V. All rights reserved. Trends in Analytical Chemistry 69 (2015) 88–97 Contents lists available at ScienceDirect Trends in Analytical Chemistry journal homepage: www.elsevier.com/locate/trac

Identification and structure elucidation by NMR spectroscopy · structureelucidationfrom2D-NMRdatacanbepresentedasa com-bination of two inverse problems,which should be solved in

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Identification and structure elucidation by NMR spectroscopy · structureelucidationfrom2D-NMRdatacanbepresentedasa com-bination of two inverse problems,which should be solved in

Identification and structure elucidation by NMR spectroscopyMikhail Elyashberg *Advanced Chemistry Development, Moscow Department, 6 Akademik Bakulev Street, Moscow 117513, Russian Federation

A R T I C L E I N F O

Keywords:CASEComputer-assisted structure elucidationDereplicationExpert systemHydrogen-deficient moleculeNMRNuclear magnetic resonanceStructure elucidationStructure identificationStructure verification

A B S T R A C T

The state of the art and recent developments in application of nuclear magnetic resonance (NMR) forstructure elucidation and identification of small organic molecules are discussed. The recently sug-gested new two-dimensional (2D)-NMR experiments combined with the advanced instrumentation allowstructure elucidation of new organic compounds at a sample amount of less than 10 μg. A pure shift ap-proach that provides 1H-decoupled proton spectra drastically simplified 1H and 2D NMR spectrainterpretation. The structure elucidation of extremely hydrogen-deficient compounds was dramaticallyfacilitated due to the methodology based on combination of new 2D-NMR experiments providing long-range heteronuclear correlations with computer-assisted structure elucidation (CASE). The capabilitiesof CASE systems are discussed. The role of NMR-spectrum prediction in structure verification and NMRapproaches for qualitative mixture analysis are considered.

© 2015 Elsevier B.V. All rights reserved.

Contents

1. Introduction ........................................................................................................................................................................................................................................................... 882. Common set of 1D- and 2D-NMR experiments ........................................................................................................................................................................................ 893. Development of NMR experiments and instrumentation ..................................................................................................................................................................... 894. CASE expert systems .......................................................................................................................................................................................................................................... 935. NMR chemical-shift prediction ...................................................................................................................................................................................................................... 946. Structure verification ......................................................................................................................................................................................................................................... 947. Structure identification and dereplication in mixtures .......................................................................................................................................................................... 948. Is it possible to avoid an erroneous structure elucidation? ................................................................................................................................................................. 959. Conclusions ............................................................................................................................................................................................................................................................ 95

Acknowledgements ............................................................................................................................................................................................................................................. 95References .............................................................................................................................................................................................................................................................. 95

1. Introduction

Molecular structure determination is a central theme of organicand analytical chemistry. Nuclear magnetic resonance (NMR) spec-troscopy in combination with high-resolution mass-spectrometry(HRMS) makes up a basic set of methods to solve this problem. Giventhe molecular formula of a complex organic molecule that has beendetermined using HRMS, two-dimensional (2D)-NMR plays a crucialrole in structure elucidation. In this review, we consider applica-tion of NMR to determine structures of small organic molecules. Theresults achieved in this area were discussed in monographs [1,2]and reviews [3–9], including two comprehensive reviews pub-lished recently [8,9].

To make clear the issues being discussed in this review, it is nec-essary to consider some basic concepts. The first step in the structuredetermination of an unknown is a spectral search against the rel-evant available databases using MS and NMR spectra. If the spectrumof the unknown fully coincides with a reference spectrum, it meansthat the structural formula of the unknown is identical to that ofthe reference. This is termed as structure identification. Otherwise,the problem of structure elucidation arises. Given the structure iselucidated, it is necessary to establish if the compound is new. Thestructural search against corresponding databases {see review [10]}to answer this question is called dereplication. This procedure is alsointerpreted in literature as a structural identification of a knownchemical entity based on previously reported analytical and spec-troscopic information [11].

Structure elucidation is obviously the most complicated task. Itis related to the class of inverse problems [12] for which solution am-biguity is a distinctive peculiarity. A single solution is selected byimposing additional constraints. On the whole, the problem of

* Tel.: +7 495 438 2153; Fax: +4954382874.E-mail address: [email protected]

http://dx.doi.org/10.1016/j.trac.2015.02.0140165-9936/© 2015 Elsevier B.V. All rights reserved.

Trends in Analytical Chemistry 69 (2015) 88–97

Contents lists available at ScienceDirect

Trends in Analytical Chemistry

journal homepage: www.elsevier.com/ locate / t rac

Page 2: Identification and structure elucidation by NMR spectroscopy · structureelucidationfrom2D-NMRdatacanbepresentedasa com-bination of two inverse problems,which should be solved in

structure elucidation from 2D-NMR data can be presented as a com-bination of two inverse problems, which should be solved inconsecutive order [2]. The first problem is to determine all (if pos-sible) pairs of atoms (nuclei) in the molecule for which there existcorrelations observed in 1D and available 2D-NMR spectra. Thisgoal is achieved as a result of interpretation of 1D- and 2D-NMRspectra, which may admit alternative solutions due to resonanceoverlap and other reasons. The second problem is to determine allstructures that meet the revealed set of coupled nuclei and then toselect the most probable structure by imposing additional con-straints coming from characteristic spectral features, NMR chemical-shift prediction and chemical knowledge. It is evident that thesolution of the second, main, problem strongly depends on thesolution of the first problem. If erroneous spin couplings leak intothe solution of the first problem, the possibility of a correct struc-ture becomes problematic.

Analysis of spectroscopists’ reasoning during structure elucida-tion led to the conclusion that initial NMR-based information usedfor this goal could be represented as a set of “axioms”, which makeup a partial axiomatic theory formulated specifically for a givenproblem [1,2,13]. Hence, the problem reduces to inferring all plau-sible structures from the set of axioms. The axioms can be readilyformalized, and provide a theoretical basis for creation of algo-rithms for computer-assisted structure elucidation (CASE).

Note that both a human expert and a CASE program commonlyuse the same set of axioms, but the program is not governed by thechemical “prejudices” of the human mind and delivers all (withoutany exception) structures satisfying the given set of axioms adoptedby the chemist. This task, as a rule, is impossible for a human expert.The program finds solution far more quickly and more reliably [13].

Generally speaking, progress in the reviewed area is going onin the following two directions:

• suggestion of new NMR experiments and instrumentation forreliably acquiring as much structural information as possible fromthe smallest amounts of sample [8,14,15] in the shortest time(provides solution to the first inverse problem); and,

• enhancing the performance of the existing CASE programs andcreating new ones (provides solution to the second inverseproblem).

This review discusses the state of progress in solving theseproblems.

2. Common set of 1D- and 2D-NMR experiments

1H- and 13C-NMR spectra carry information about the qualita-tive and quantitative composition of an unknown and they are usedfirst for the determination of the molecular formula. Along with 1D1H- and 13C (15N if available)-NMR spectra, many two-dimensionalNMR experiments were developed for structure elucidation [16,17].For molecules containing nitrogen atoms, resonances of 15N nucleiare determined from 1H-15N heteronuclear single-quantum corre-lation (HSQC) and 1H-15N heteronuclear multi-bond correlation(HMBC) 2D spectra (see below). The most frequently used set of 2D-NMR experiments is presented in Table 1.

The correlation spectroscopy (COSY) and HMBC correlationswhose lengths most frequently do not exceed three bonds are re-ferred to as standard correlations [1]. However, depending on thespatial configuration of a molecule, correlations longer than stan-dard correlations can also be observed. These correlations are referredto as non-standard correlations (NSCs). The presence of NSCs, theirnumber and lengths in HMBC and COSY spectra are difficult to detect,and this issue can make the initial information not only fuzzy butalso contradictory.

3. Development of NMR experiments and instrumentation

The latest advances in development and improvements of 2D-NMR experiments were extensively considered in recently publishedreviews [6,8,19–24]. Significant efforts of researchers were focusedon creating methods for detecting correlations whose lengths could(at least in principle) be unambiguously determined. For example,distinguishing between correlations of coupling constants 2JCH and3JCH in the HMBC spectrum leads to obtaining crisper 2D-NMR struc-tural information. Different methods were suggested to reduce thetime for spectral acquisition, to increase sensitivity and to simpli-fy post-acquisition processing of 2D-NMR data. Some 2D-NMRexperiments that complement those shown in Table 1 are enumer-ated in Table 2.

From the standpoint of a protocol for elucidating chemical struc-ture, a 1H spectrum is almost always recorded first. The next stepis the acquisition of 2D-NMR data, and it is recommended to startwith a multiplicity-edited HSQC (ME-HSQC) spectrum [5]. The HSQCdata may provide some insight into the numbers of possible het-eroatoms, as well as a partial carbon count. The next step is to acquirea COSY spectrum. Having in hand 1H, COSY and ME-HSQC, it is pos-sible to begin to assemble structural fragments comprising thecontiguous protonated carbon resonances.

Having the molecular formula from HRMS and HSQC, it is pos-sible to establish the expected number of signals of quaternarycarbons in 13C-NMR spectrum and the minimum number of het-eroatoms. Then, the HMBC spectrum is acquired, and in principle,allows one to complete assembly of the structure.

If the molecule contains nitrogen atoms, employing the 1H-15NHMBC spectrum gives valuable information that is crucial for the struc-ture elucidation in many cases {see reviews [30–34]}. Low sensitivityof the 1H-15N HMBC experiment is overcome using small-volume

Table 1The most frequently used set of 2D NMR experiments [16]

Experiment Characteristics

HSQC,HeteronuclearSingle QuantumCoherence

The HSQC spectrum shows resonances (heteronuclearcorrelations) which arise as a result of 1JCH couplingsbetween 13C nuclei and protons attached to thecorresponding atoms. This allows one to detect all CH, CH2

and CH3 groups with chemical shift assignment. ME-HSQC(Multiplicity Edited HSQC) alleviates distinguishingresponses from CH3, CH2 and CH groups in HSQC spectrumobviating the acquisition of DEPT data.

COSY, CorrelationSpectroscopy

The COSY spectrum usually reveals homonuclearcorrelations (spin couplings) between vicinal hydrogensseparated by three bonds (3JHH). This makes it possible toidentify the neighbor carbon atoms connected by achemical bond.

TOCSY, TotalCorrelationSpectroscopy

TOCSY allows one, in principle, to obtain sub-spectra fordifferent sequences of coupled protons in a molecule.In practice, investigators usually acquire only one of thespectra.

HMBC,HeteronuclearMultiple-BondCoherence

The HMBC spectrum reveals heteronuclear correlationsbetween 1H and 13C (15N) nuclei separated by two or threechemical bonds, allowing users to detect ”fuzzy”fragments around a given C or N atom. There is no routineapproach that would allow determining which intervening1H-13C pairs are separated by two bonds and which – bythree. Therefore the information carried by HMBC is fuzzyby nature. 1H-13C HMBC data are made even more “fuzzy”by the occasional observation of 4JCH correlations. Incontrast, for conventional 1H -15N HMBC, 4JNH correlationsare almost never observed.

NOESY, NuclearOverhauserEnhancementSpectroscopy

ROESY, Rotatedframe NOESpectroscopy

The NOESY/ROESY [18] reveals couplings betweenhydrogen atoms separated in space by distance <5 Å, ,which is used for determining stereochemistry of anelucidated structure, as well as for clarifying positions ofsome substituents if HMBC and COSY data do not allow todo that. These spectra are usually not used duringstructure assembly.

89M. Elyashberg/Trends in Analytical Chemistry 69 (2015) 88–97

Page 3: Identification and structure elucidation by NMR spectroscopy · structureelucidationfrom2D-NMRdatacanbepresentedasa com-bination of two inverse problems,which should be solved in

inverse probes and/or cryogenic NMR probes, which allow acquisi-tion of spectra on sub-micromole quantities of samples [35]. Thenumber of publications in which 1H-15N HMBC is utilized is con-stantly growing. The very recently reported new experiments on long-range heteronuclear single quantum multiple bond correlation(LR-HSQMBC) optimized for 1H-15N long-range heteronuclear cou-plings [36] and H-C-N multiple-bond correlation (HCNMBC) [37,38]produced data complementary to 1H-15N HMBC-type correlations. Itis expected that such experiments can facilitate the structure eluci-dation of nitrogen-containing molecules, particularly those belongingto heterocyclic compounds and alkaloids.

In comparison with MS and optical spectroscopy, NMR pos-sesses significantly less sensitivity, which becomes especially notableregarding the sample size and the acquisition time of 2D-NMR. Asa result of technical progress, cooled microprobes became available[39,40].

Hilton and Martin [35] investigated experimental performancelimits for an ensemble of 2D-NMR experiments using a 600 MHzspectrometer with 1.7 mm Bruker TCI MicroCryoProbe. A solutioncontaining 870 μg (2.6 μmol) of a model compound – strychninein 30 μL of CDCl3 – was used. The following acquisition times toobtain adequate signal-to-noise ratios were determined: COSY –7 min; rotating-frame Overhauser-effect spectroscopy (ROESY) –1 h 11 min; 13C reference – 25 min; ME-HSQC – 7 min; 1H-13CHMBC – 33 min; 1H-13C heteronuclear 2-bond correlation (H2BC)– 3 h 11 min; 1H-15N HMBC – 1 h 22 min; adequate sensitivitydouble-quantum spectroscopy (1,1-ADEQUATE) – 14 h 40 min.Further dilutions have shown that, with samples of 45 μg (150 nmol)even 1H-15N HMBC remains accessible experimentally over aweekend. The authors [35] concluded that, with a 1 mg sample ofstrychnine (~3 μmol), it is now possible to acquire the full set ofhomonuclear and heteronuclear 2D-NMR experiments in 4 h (in-cluding 1H -15N HMBC but not 1,1-ADEQUATE) that could, in principle,be used to establish the full chemical structure and stereochemis-try. Using the same equipment as in [35], high signal-to-noise pureshift (see below) HSQC data from a 7.4 μg metabolite sample wereacquired in just over 30 min [41].

Structure elucidation becomes especially challenging for mol-ecules for which a severe deficit of protons is inherent. Thesemolecules contain “silent” (deprived of hydrogen) fragments, whichprevent structure assembly using HMBC correlations. If the ratio ofthe number of protons in a molecule to the sum of the heavy atoms(e.g., C, N, O, and S) is <2, it may be difficult and, in some cases, im-possible to elucidate a structure unequivocally based solely on NMRdata and molecular formula information. This statement is knownas the “Crews rule” [42]. To circumvent this issue, several new ap-proaches were suggested.

Gross and co-workers were the first to employ atomic force mi-croscopy (AFM) [43] to determine a challenging structure of a smallmolecule by making an image of its skeleton visible. Then, AFM com-bined with 1D- and 2D-NMR, density functional theory (DFT) andCASE system ACD/Structure Elucidator [44] (see Section 4) was used[45] to determine the structure of natural product breitfussin A,Structure 1(a “silent” fragment is highlighted in bold).

A new, high-sensitivity experiment complementary to HMBC, LR-HSQMBC [46] extends the observation of long-range correlation datato 4-, 5-, and even 6-bond long-range nJCH heteronuclear cou-plings. The correlation lengths can be estimated from nJCH, for whichexperiments were optimized. These correlations can reach quater-nary carbons of “silent” fragments, allowing structure elucidation.Proton-deficient model compound cervinomycin A2, Structure 2,C29H21NO9, was used [27] to assess the benefits of including LR-HSQMBC data as input for ACD/Structure Elucidator. Fig. 1 showsthe observed LR-HSQMBC correlations (optimized at 2 Hz).

Table 2Experiments complementing the most frequently used 2D NMR techniques

Method Characteristics

HSQC-TOCSY The 1H-13C HSQC-TOCSY experiment combines 1H-13C HSQCwith a 1H-TOCSY experiments to give through-bondcorrelations between a 13C-attached 1H to all other coupled1H. The resonances of coupled protons can be seen along aline at the same 13C chemical shift from the carbon atomattached to the primary 1H [16].

CIGAR-CHMBC Makes it much easier to interpret the spectrum, particularlyin the crowded regions. Is the best of the existing HMBCsequences in terms of information content and ease ofinterpretation [20].

2J,3J-HMBC Affords the means of unequivocally differentiating between2JCH from 3JCH correlations. Peaks of 2JCH have skewcharacteristics, while 2JCH signals for quaternary carbons aremissing. It also suffers from a severe lack of sensitivity(almost 10 times lower than for HMBC) [20]

1H-13C H2BC HMBC-type spectrum almost exclusively showing onlytwo-bond correlations and markedly also two-bondcorrelations that are absent in HMBC spectra. As a rule,correlations that are strong in an H2BC spectrum and weakin an HMBC spectrum indicate two-bond correlations. H2BCand HMBC are therefore quite complementary. Drawbacksare that H2BC spectra only show peaks involving protonated13C carbons, they cannot identify adjacent quaternarycarbon resonances. In addition there is no absoluteguarantee that a peak in an H2BC spectrum represents atwo-bond correlation [20].

ADEQUATE 1,1-ADEQUATE exploits 1JCH and 1JCC couplings to allow theidentification of adjacent neighbor carbons. Both protonatedand non-protonated adjacent carbons are observed.Correlations are equivalent to 2JCH correlations inHMBC-type experiments. Method suffers from sensitivityand sample limitations, which are overcome by employingsmall volume high sensitivity and cryogenic NMR probes[19].1,n- ADEQUATE [19,25] provides predominantly 3JCC long-range correlations that are analogous to 4JCH HMBCcorrelations. Though it is less sensitive than 1,1-ADEQUATEand 1JCC correlations “leak” into the spectrum, it has beenshown [26–28] that 1,n-ADEQUATE data acquired using1.7 mm cryoprobe technology are very helpful for structurecharacterization of proton-deficient molecules whosestructures are extremely difficult to elucidate byconventional HMBC. Advantages of ADEQUATE over HMBCwere demonstrated experimentally in combination withquantum-chemical coupling constant computations [28].

INADEQUATE A COSY-like experiment that yields 13C-13C correlations. It iscapable of establishing the identity of adjacent neighborcarbons via 1JCC couplings. Major drawbacks of theINADEQUATE experiment are extreme insensitivity andprodigious sample requirements (>10 mg). To an extent, thelimitations of the experiment have been overcome by thedevelopment of small volume high sensitivity and cryogenicNMR probes [29].

Structure 1. Natural product breitfussin A.

90 M. Elyashberg/Trends in Analytical Chemistry 69 (2015) 88–97

Page 4: Identification and structure elucidation by NMR spectroscopy · structureelucidationfrom2D-NMRdatacanbepresentedasa com-bination of two inverse problems,which should be solved in

The results of computational experiments (Table 3) clearly showthat complementing the standard 2D-NMR set by 1H-13C LR-HSQMBC data dramatically accelerates structure generation andreduces the size of the output file.

Blinov et al. [27] performed analogous computational experi-ments using staurosporine, C28H26N4O3, (a “Screw structure”), as amodel compound. In this case, the set of experiments shown inTable 3 was complemented by IDR-HSQC-total correlation spec-troscopy (TOCSY), 1H-15N HMBC, 1,1-ADEQUATE and 1,-n-ADEQUATE.The results obtained were very interesting and highlighted the con-tribution of the combination LR-HSQMBC and 1,-n-ADEQUATE.

It should be expected that the application of the described com-bination of 2D-NMR experiments within CASE systems would allowthe problem of “silent fragments” to be overcome to a consider-able degree. The methodology of the approach described should befurther developed on the basis of structure elucidation of other mol-ecules subjected to the Crews rule.

Kummerlöwe et al. [47] concluded that 2D-NMR data, includ-ing 1H -15N HMBC and 1,1-ADEQUATE, were insufficient to elucidatethe structure of an unusual proton-deficient molecule (3) manu-ally, due to extremely contradictory 1H-13C HMBC data (presenceof nine NSCs, two of which had five bond lengths and were ratherstrong).

Kummerlöwe et al. [47] were the first to utilize successfully theNMR spectrum of a residual dipolar coupling (RDC) [48] to eluci-date the structure of a small molecule.

Zangger and Sterk [49] developed a “pure shift” approach, pro-viding 1H-decoupled proton spectra (i.e., all multiplets are transformedinto singlets). In recent years, efforts of researchers were directedto overcome the major drawback of the approach – its low sensitivity[50–58]. As a result, very recently, a new and very general pure shiftmethod, Pure Shift Yielded by CHirp Excitation (PSYCHE), was in-troduced by Morris and co-workers [56]. PSYCHE has approximately10-fold better sensitivity than competing “pure shift” methods.Normal and PSYCHE spectra of estradiol in DMSO-d6 are shown inFig. 2as an example.

The same group applied PSYCHE to the TOCSY experiment [57].In combination with covariance processing (see below), the resultis a high-quality, high-resolution TOCSY spectrum with singlets inboth dimensions (see Fig. 3). It is evident that the suggested ap-proach dramatically facilitates interpretation of 1D- and 2D-NMRspectrum and will significantly simplify spectroscopic data inputinto CASE programs.

Analytical applications of the ultra-fast (UF) 2D-NMR tech-nique were extensively discussed in a review [21]. This techniqueacquires a 2D spectrum in a few seconds, but it suffers from lowsensitivity. For example, to use a single-scan HSQC spectra for re-action monitoring, a 0.44 M solution was necessary [59]. Anotherpossibility to accelerate acquiring a spectrum appeared when NMRspectrometers equipped with two or more independent receiversbecame available, which allowed different types of 2D spectra tobe obtained simultaneously [60]. For example, the parallel-acquisitionNMR all-in-one combination of experimental applications (PANACEA)

Structure 2. Proton-deficient model compound cervinomycin A2.

Structure 3. An unusual proton-deficient molecule.

Fig. 1. Long-range heteronuclear single quantum multiple bond correlations (LR-HSQMBCs) observed for cervinomycin A2. Note that numerous long-range correlationsreach carbon atoms of the “silent fragment” [27].

Table 3Results obtained for cervinomycin A2 (2) with various 2D NMR aspects used as input.Data that were provided as program input are denoted by (+), nJCH for which ex-periments were optimized (in Hz), gt – time of structure generation, and k – numberof generated structures

COSY, HSQC 1H-13CHMBC

1H-13CLR-HSQMBC

gt k

8 Hz 4 Hz 4 Hz 2 Hz

+ + + 49 h 314+ + + + 37 h 4+ + + + 2 m 30 s 7+ + + + + 1m 44 s 1

Fig. 2. Spectra of estradiol obtained by (a) normal proton nuclear magnetic reso-nance (1H-NMR) spectroscopy and (b) Pure Shift Yielded by CHirp Excitation (PSYCHE)[56].

91M. Elyashberg/Trends in Analytical Chemistry 69 (2015) 88–97

Page 5: Identification and structure elucidation by NMR spectroscopy · structureelucidationfrom2D-NMRdatacanbepresentedasa com-bination of two inverse problems,which should be solved in

[61] technique can in principle obtain all data [including indirectADEQUATE (INADEQUATE)] needed for small molecule structure elu-cidation in a single experiment, but low sensitivity prevents its wideapplication. More sensitive timeshared (TS) versions of 2D-NMR ex-periments (TS-HMBC, TS-HSQC, TS-HSQC-TOCSY, and TS-HSQMBC)were suggested by Parella and Nollis [62] who demonstrated uti-lization of this approach for simultaneous acquisition of 1H/13C and1H/15N-NMR spectra.

New methods for 2D-NMR data processing, which facilitate spec-trum interpretation, were developed and reviewed [8]. Covarianceprocessing involves reprocessing 2D data sets, singly or in pairs, toimprove resolution or to present the existing information in a dif-ferent, clearer fashion. Unsymmetric Covariance Processing (UCP),developed by Martin and co-workers [63], and the closely-relatedGeneralized Covariance Processing (GCP), suggested by Snyder andBruschweiler [64], are the most useful. These methods allow one

Fig. 3. Spectra of (a) normal Total Correlation Spectroscopy (TOCSY), (b) F1-Pure Shift Yielded by CHirp Excitation (PSYCHE)-TOCSY, and (c) double pure shift TOCSY usingPSYCHE in F1 and covariance processing in F2 of a sample of estradiol in DMSO-d6 [57].

92 M. Elyashberg/Trends in Analytical Chemistry 69 (2015) 88–97

Page 6: Identification and structure elucidation by NMR spectroscopy · structureelucidationfrom2D-NMRdatacanbepresentedasa com-bination of two inverse problems,which should be solved in

to combine data from two kinds of 2D spectra that have a commonaxis (usually 1H). The resultant spectrum does not include any newinformation that was not present in the two original 2D spectra. Nev-ertheless, the data are presented in a clearer manner, which is easierto interpret {see review [65]}. The software for covariance process-ing is available from Bruker, ACD Labs and Mestranova.

4. CASE expert systems

CASE expert systems mimic the reasoning of a hum-an expert during the process of structure elucidation. Thefollowing main advantages of CASE systems should be noted[1]:

1) all statements about interrelation between a spectrum anda structure (“axioms”) are expressed explicitly;

2) all logical consequences (structures) following from the systemof “axioms” are deduced completely, without any exclusions;

3) if the initial data are complete and consistent, the process ofcomputer-based structure elucidation is usually fast, signifi-cantly saving time and labor of the scientist; and,

4) if the chemist has several sets of axioms related to a givenstructural problem, an expert system allows rapid genera-tion of all consequences from each of the sets and identifiesthe most probable structure by comparing the solutionsobtained.

The state of the art in this area was extensively reviewed[1,5,66,67]. The contemporary CASE programs include SESAMI [68],LSD [69], COCON [70], ACD/Structure Elucidator [1,2], and BrukerCMC-se, which are based on 2D-NMR, and CAST\CNMR StructureElucidator [71], based on 13C spectra.

ACD\Structure Elucidator (StrucEluc) [1,2,44,72] is the most ad-vanced expert system. The 1D- and 2D-NMR spectra can be importedto the program from a spectrometer or input manually from a tableprepared by the user. The imported data must be thoroughly checkedand edited by the spectroscopist. The strategy of the system restsupon a series of databases containing factual and axiomatic knowl-edge. All axioms are explicitly presented on the Molecular ConnectivityDiagram (MCD) for visual analysis (e.g., Fig. 4).

This easily allows users to investigate the dependence of thestructural problem solution on any change in the initial set of axioms.The system is capable of inferring all plausible structures from a com-bination of a molecular formula and 1D- and 2D-NMR data (e.g.,HSQC, HMBC, and COSY), even in those cases when the spectrum-structural information is very fuzzy, incomplete and contradictory.Selection of the most probable structure is performed on the basisof the 13C chemical-shift prediction using three algorithms imple-mented into the system – HOSE code based [73], neural networksand additivity rules [1,5]. The program is capable of elucidating astructure of an unknown in the presence of an unknown numberof non-standard correlations of unknown length. The software iscommercially available and was used for solving many complex an-alytical problems. For example, with its aid, the structure of complexalkaloid quindolinocryptotackieine was determined [74] in an in-teractive mode, allowing for step-by-step resolution of manyambiguous correlations, which took a spectroscopist a week of work.It was the first time that the program solved the structure that hadbeen unsolvable by experienced spectroscopists. Structure 3 (seeSection 3) declared as manually unsolvable from the full set of 2D-NMR data was unambiguously elucidated by StrucEluc in a fractionof a second [75]. The software was recently applied for structureelucidation of armeniaspirols A–C [76] . Fig. 5illustrates a typicalrepresentation of the most probable structures suggested by theprogram.

System performance can be illustrated by the following example:with its assistance, 1500 unknown molecules were elucidated bytwo spectroscopists in 6 months [77]. More than 50 examples ofstructure elucidation for natural products with unprecedented orunique skeletons were described [2].

Significant improvements [78] were introduced in the LSDprogram [69], which is now capable of treating 2D-NMR data con-taining NSCs, and selection of the most probable structure isperformed by NMR spectrum prediction. LSD is the first version ofCASE program freely available from the Internet [79].

The SESAMI system [68] was combined with a 13C-NMR inter-pretive library-search system (INFERCNMR) [80], capable of searchingsubstructures in a database containing assigned 13C-NMR spectra.The search resulted in a set of substructures predicted to be presentin the unknown, each of which is assigned an estimated predic-tion accuracy. Involving the best substructures in the structure-generation procedure allows a significant reduction in the generationtime and the size of the output file. The suggested approach is prom-ising for the further development of CASE methodology, as it doesnot require application of the fragment database incorporated intothe program.

As for the Bruker CMC-se program, the absence of any publica-tions utilizing the program does not allow a comparative evaluationto be made.

New expert system CAST/CNMR Structure Elucidator is based on13C-NMR and a database containing structures with chemical-shift assignment [71]. A series of advanced graph-theory algorithms

Fig. 4. The molecular connectivity diagram created from 1H, 13C, heteronuclear single-quantum correlation (HSQC), heteronuclear multi-bond correlation (HMBC) andcorrelation spectroscopy (COSY) spectra of gymnopalyne (upper segment of the figure).Atom properties are adjusted to the structure of gymnopalyne to illustrate differ-ent conventional signs: atom hybridization (sp3 – blue, sp2 – violet, sp – green, notsp – light blue, not defined – black), connectivities (COSY – blue, HMBC – green),and labels (ob – obligatory neighbor heteroatom, fb – forbidden neighbor heteroa-tom). Information about 1H chemical shifts can also be visualized [2].

93M. Elyashberg/Trends in Analytical Chemistry 69 (2015) 88–97

Page 7: Identification and structure elucidation by NMR spectroscopy · structureelucidationfrom2D-NMRdatacanbepresentedasa com-bination of two inverse problems,which should be solved in

are used for selecting appropriate substructures, then the most prob-able structures are formed by merging the substructures that havecommon parts. The program produces a correct structure if all thefragments are included in the database. As only one exampleillustrates the suggested approach, no conclusions regarding its ef-ficacy can be made.

It should be noted that the first open source structure genera-tor, OMG [81], recently became available. It can be used for solvingstructural problems as a stand-alone program and as a block of aCASE system. The CASE program COCON is also freely accessible fromthe WEBCOCON server [70].

As discussed in Section 3, utilization of new 2D-NMR experi-ments within the CASE systems leads to rapid structure elucidationof complex proton-deficient molecules.

5. NMR chemical-shift prediction

As mentioned above, NMR chemical-shift prediction plays an in-valuable role in the estimation of suggested structures [1,2,5].Chemical shifts of 13C, 1H, 15N, 19F and 31P nuclei can be calculatedusing empirical and quantum-mechanical (QM) methods. NMR spec-trum predictors based on HOSE codes [73], neural nets, and anincremental approach are used most frequently and were re-viewed [1,5]. The predictors can be incorporated in expert systems(as described for ACD/Structure Elucidator and LSD) or used sep-arately as commercial programs [82–85] or free [86]. The programsprovide an accuracy of 13C chemical-shift prediction about1.5–1.8 ppm, which almost always allows for selecting the most prob-able structure(s) among candidates. The prediction is very fast. Forexample, the incremental program incorporated into ACD/StructureElucidator calculates ~10, 000 shifts per second on a standard PC.

A new program, NMRscape [87] for 13C chemical-shift calcula-tion based on new principles {OptiChem theory [88]} was recentlydeveloped and tested on several families of structures. The resultsseem rather promising, as the accuracy provided by this approachis expected to be higher than that inherent to existing empiricalmethods.

A common drawback of all empirical methods is that predic-tion accuracy depends on the composition of the databases usedfor program training. In contrast, the DFT GIAO quantum-chemicalmethod does not suffer from this limitation, but it is very time-consuming and there is the problem of selecting appropriatefunctional and basic sets for calculations {see reviews [89,90]}. Thefollowing optimal strategy of jointly utilizing empirical and QMmethods for structure elucidation was suggested [91]: all candidate

structures must be ranked first by empirical 13C spectrum predic-tion, and, if the average deviations between the experimental andpredicted spectra calculated for two to three top-ranked struc-tures are large or very close, only then should QM predictions beperformed for those questionable structures. The possibility wasdemonstrated of successful application of JHH, JHC, and JCC coupling-constant calculations by QM methods for distinguishing isomershaving very similar structures [92].

6. Structure verification

Structure verification based on 13C chemical-shift predictionallows selection of the correct structure among thousands of plau-sible hypotheses produced by expert systems [1,2]. For verificationof structures suggested for large sets of synthesized molecules, 1Hchemical-shift prediction is frequently used due to the possibilityof quickly acquiring a spectrum with a small sample size. Protonchemical-shift dependence on, e.g., solvent, temperature, and pH,limits the utility of this approach. Utilization of PSYCHE 1H-NMRcoupled with pure shift ME-HSQC acquired using the 1.7-mmMicroCryoProbe would make the approach more robust.

Verification and identification of organic molecules from a struc-ture database using both 1H and 13C-NMR spectra were reported [93].Keyes et al. [93] examined a method of structure validation usinga set of 500 compounds supplied with 1H, HSQC, LC-MS and HPLCdata. 1H and HSQC spectra were predicted using the ACD/NMR Pre-dictor [82] and compared with experimental spectra. It wasconcluded that the approach was very practical for application ina pharmaceutical company. As current implementations of auto-mated structure-verification systems allow false-positive results, anapproach based on 1H and HSQC spectra was suggested [94] thatgreatly reduces the probability of an automated validation systempassing incorrect structures (i.e., false positives). The novel methodwas examined by automatic validation of 127 compounds, whichshowed a reduction in the false-positive rate from 20% to 5%.Plainchont and Nuzillard [95] proposed to verify a molecular struc-ture using a combination of 1D and 2D HSQC, COSY and HMBC-NMR spectra. This approach is not suitable for rapid structureverification, but it rather fits analysis of doubtful structures passedfor verification.

7. Structure identification and dereplication in mixtures

Dereplication of organic compounds (identification in a mixture)is a challenging problem for NMR spectroscopy [96]. If the sample

Fig. 5. The three top-ranked structures according to the dA deviation. dA(13C), dI(13C) and dN(13C) – average deviations calculated by Hierarchical Organization of SphericalEnvironments (HOSE) code-based algorithm, incremental approach and neural networks correspondingly. The correct structure #1 (armeniaspirol B) is reliably selected asthe most probable [76] in accordance with the criteria suggested [1].

94 M. Elyashberg/Trends in Analytical Chemistry 69 (2015) 88–97

Page 8: Identification and structure elucidation by NMR spectroscopy · structureelucidationfrom2D-NMRdatacanbepresentedasa com-bination of two inverse problems,which should be solved in

amount is enough for separation of individual components, they canbe determined by a spectral search against an NMR database [97],or structures of unknowns are elucidated using MS and a commonset of 1D- and 2D-NMR spectra. For example, Thummala et al. [98]isolated an unknown impurity of ambroxol using preparative HPLCand established its structure by combined application of 2D-NMR,FTIR and LC-MS/MS.

Codina et al. [99] isolated five impurities from a pharmaceuti-cal matrix (samples of 20–40 μg) and identified them from MS and2D-NMR spectra with the aid of the ACD/Structure Elucidator. Aneffective dereplication strategy for identification of natural me-tabolites directly within mixtures was suggested [100,101]. Amultigram quantity of a crude extract was rapidly fractionated bycentrifugal partition extraction (CPE). The fractions of simplifiedchemical composition were subsequently analyzed by 13C-NMR, andhierarchical clustering analysis (HCA) suggested by the authors forpattern recognition. A locally-built 13C-NMR chemical-shift data-base was used for identification. This methodology resulted in thedirect identification (without time-consuming isolation proce-dures) of seven major compounds of a bark extract [100] and sixout of eight prominent constituents of the crude extract of lichen[101].

Pauli and co-workers [11] comprehensively investigated the pre-cision necessary for measuring spectral parameters of 1H-NMRspectra for their tabulation to be used for structure dereplicationand identification. It was shown that δ and J values of 1H-NMR shouldbe routinely reported with Δδ = 0.1–1 ppb and ΔJ = 10 mHz preci-sion, respectively.

As a manual 1H spectrum assignment is a time consuming,tedious and error-prone procedure, an expert system for the auto-matic atom-to-peak or multiplet assignment of 1H-NMR spectra ofsmall molecules has been developed [102]. It was found with thetest set of 90 compounds that 94% of assignments were correct, soconfirming the efficiency of the program.

Different NMR experimental approaches were developed for qual-itative analysis of mixtures without any preliminary fractionationor separation. The NMR diffusion-ordered spectroscopy (DOSY) ex-periment is based on molecules of different sizes and shapesfrequently having different diffusion coefficients [8]. DOSY is verypowerful for studying complex mixtures, but its applicability is stilllimited, particularly when dynamic or unstable systems are studied.A UF version of DOSY was recently described as overcoming thisdrawback [21]. Morris and co-workers [58] developed a pure shiftDOSY experiment, which greatly extended the range of applicabil-ity of DOSY. Simplified (pure shift) PSYCHE 1H-NMR [56] spectra (seeFig. 2) are obviously very promising for analyzing complex mix-tures by 1H-NMR. A method of mixture analysis based on 13C-NMR [100,101] can probably be extended to PSYCHE 1H-NMR spectraof mixtures.

Although the sensitivity of NMR is significantly lower than MS,LC-NMR is now available as a coupled method for mixture analy-sis [8,103–105]. Only 1H spectra can usually be obtained incontinuous flow mode. The low sensitivity of the approach pre-vents its wide application. However, very recently, Foley and co-workers [106,107] described application of on-line reactionmonitoring using continuous flow NMR. A standard 5 mm NMRprobe enables the researcher to conduct experiments on flowingreaction mixtures using a range of spectrometers of varying mag-netic field strengths [107]. It was reported [59] that UF COSY spectrawere utilized for reaction monitoring using a commercial HPLC-NMR set-up.

A combination of LC with solid-phase extraction (SPE) car-tridges led to the possibility of obtaining MS and NMR spectra ofcomponents from a single experiment [108]. After LC separation,the sample was split, and a small portion sent for MS analysis, whilethe remainder was directed to SPE cartridges for collection.

Application of a cryogenically-cooled probe reduced the time andthe sample amount necessary for structure identification (elucida-tion) [8].

8. Is it possible to avoid an erroneous structure elucidation?

As spectroscopic structure elucidation is a complex logical-combinatorial process, it is not surprising that different scientistsmay come to different structures from the same initial data. A seriesof reviews [13,109–112] discussed many structural misassignments.The following most typical reasons of obtaining erroneous struc-tures can be noted:

a) severe resonance overlap in NMR spectra;b) erroneous user suggestions (”axioms”);c) mistaken logical conclusions inferred from the presence or

the absence of characteristic spectral features in 1D-NMR and2D-NMR spectra;

d) complexity, inconsistency and entanglement of initial infor-mation {see [75]}.

Expert system-based analysis of many cases, when erroneousstructures were inferred, shows [1,13,75,113] that application of CASEallows the determination of the correct structure and causes of thehuman error to be detected. Figuratively speaking, a CASE systemcan be used as a “polygraph detector”. To avoid upsetting errors, itis recommended to predict 13C-NMR and 1H-NMR chemical shiftsfor any structural hypothesis in the process of chemical research.

9. Conclusions

Contemporary NMR spectroscopy is the most powerful routineanalytical tool for molecular structure elucidation and identifica-tion, given that the molecular formula is determined using HRMSand all available spectroscopic data. 2D-NMR plays a crucial role inestablishing structures of new organic molecules, and, to date, aplethora of different kinds of 2D-NMR experiments has been elabo-rated. These methods are extensively developed to provide thepossibility of acquiring as much as possible structural informationfrom a minimum amount of sample (<10 μg [41]) in the shortestinstrument time. NMR is successfully used for structure dereplicationand analysis of mixtures, including usage of LC-NMR. Many studieshave shown that CASE is a powerful amplifier of human intelli-gence. It should be expected that application of pure shift spectraas input to artificial intelligence programs developed ad hoc for dataimport to CASE systems will automate this procedure signifi-cantly, so facilitating and accelerating the stage of data preparation.

The author believes that, in the near future, CASE will becomea routine analytical tool, which will serve as an integral part of anyNMR spectrometer similar to the software used in X-raycrystallography.

Acknowledgements

The author thanks Gary Martin for constructive comments andpieces of valuable advice.

References

[1] M.E. Elyashberg, A.J. Williams, K.A. Blinov, Contemporary Computer-AssistedApproaches to Molecular Structure Elucidation, RSC Publishing, Cambridge,2012.

[2] M.E. Elyashberg, A.J. Williams, Computer-based Structure Elucidation fromSpectral Data. The Art of Solving Problems, Springer, Heidelberg, 2015.

[3] N. Bross-Walch, T. Kuhn, D. Moskau, O. Zerbe, Strategies and tools for structuredetermination of natural products using modern methods of NMRspectroscopy, Chem. Biodivers. 2 (2005) 147–177.

95M. Elyashberg/Trends in Analytical Chemistry 69 (2015) 88–97

Page 9: Identification and structure elucidation by NMR spectroscopy · structureelucidationfrom2D-NMRdatacanbepresentedasa com-bination of two inverse problems,which should be solved in

[4] E. Fukushi, Advanced NMR approaches for a detailed structure analysis ofnatural products, Biosci. Biotechnol. Biochem. 70 (2006) 1803–1812.

[5] M.E. Elyashberg, A.J. Williams, G.E. Martin, Computer-assisted structureverification and elucidation tools in NMR-based structure elucidation, Prog.NMR Spectrosc. 53 (2008) 1–104.

[6] M. Halabalaki, K. Vougogiannopoulou, E. Mikros, A.L. Skaltsounis, Recentadvances and new strategies in the NMR-based identification of naturalproducts, Curr. Opin. Biotechnol. 25 (2014) 1–7.

[7] D.S. Wishart, Characterization of biopharmaceuticals by NMR spectroscopy,Trends Anal. Chem. 48 (2013) 96–111.

[8] R.C. Breton, W.F. Reynolds, Using NMR to identify and characterize naturalproducts, Nat. Prod. Rep. 30 (2013) 501–524.

[9] W.F. Reynolds, E.P. Mazzola, Nuclear magnetic resonance in the structuralelucidation of natural products, in: A.D. Kinghorn, Y. Falk, J. Kobayashi (Editors),Progress in the chemistry of organic natural products. Methodology, Springer,Heidelberg, 2015, pp. 223–310.

[10] L. Borland, M. Brickhouse, T. Thomas III, A.W. Fountain, Review of chemicalsignature databases, Anal. Bioanal. Chem. 397 (2010) 1019–1028.

[11] G.F. Pauli, S.-N. Chen, D.C. Lankin, J. Bisson, R.J. Case, L.R. Chadwick, et al.,Essential parameters for structural analysis and dereplication by 1H NMRspectroscopy, J. Nat. Prod. 77 (6) (2014) 1473–1487.

[12] A. Tarantola, Inverse Problem Theory and Methods for Model ParameterEstimation, SIAM, Philadelphia, 2005.

[13] M. Elyashberg, A. Williams, K. Blinov, Structural revisions of natural productsby Computer Assisted Structure Elucidation (CASE) systems, Nat. Prod. Rep.27 (2010) 1296–1328.

[14] T.F. Molinski, NMR of natural products at the ‘nanomole-scale’, Nat. Prod. Rep.27 (2010) 321–329.

[15] O. Gökay, K. Albert, From single to multiple microcoil flow probe NMR andrelated capillary techniques: a review, Anal. Bioanal. Chem. 402 (2012)647–669.

[16] S. Berger, S. Braun, 200 and More NMR Experiments: A Practical Course, Wiley,New York, 2004.

[17] W.F. Reynolds, R.G. Enríquez, Choosing the best pulse sequences, acquisitionparameters, postacquisition processing strategies, and probes for naturalproduct structure elucidation by NMR spectroscopy, J. Nat. Prod. 65 (2002)221–244.

[18] D. Neuhaus, M. Williamson, The Nuclear Overhauser Effect in Structural andConformational Analysis, second ed., Wiley, New York, 2000.

[19] G.E. Martin, Using 1,1- and 1,n-ADEQUATE 2D NMR data in structureelucidation protocols, Ann. Rep. NMR Spectrosc. 74 (2011) 215–291 V.

[20] J. Furrer, Recent developments in HMBC studies, Ann. Rep. NMR Spectrosc.74 (2011) 293–354.

[21] P. Giraudeau, L. Frydman, Ultrafast 2D NMR: an emerging tool in analyticalspectroscopy, Annu. Rev. Anal. Chem. 7 (2014) 129–161.

[22] J. Furrer, A comprehensive discussion of hmbc pulse sequences, part 1: theclassical HMBC, Concepts Magn. Reson. Part A 40A (2012) 101–127.

[23] J. Furrer, A comprehensive discussion of HMBC pulse sequences. 2. Some usefulvariants, Concepts Magn. Reson. Part A 40A (2012) 146–169.

[24] L. Castañar, T. Parella, Recent advances in small molecule NMR: improved HSQCand HSQMBC experiments, Ann. Rep. NMR Spectrosc. 84 (2015) 163–232,doi:10.1016/bs.arnmr.2014.1010.1004.

[25] G.E. Martin, M. Reibarkh, A.E. Buevich, K.A. Blinov, R.T. Williamson, Applicationof 1,n-ADEQUATE and modified variants to structure elucidation and spectralassignment problems, eMagRes 3 (2014) 215–234.

[26] M.M. Senior, R.T. Williamson, G.E. Martin, Using HMBC and ADEQUATE NMRdata to define and differentiate long-range coupling pathways: is the Crewsrule obsolete?, J. Nat. Prod. 76 (2013) 2088–2093.

[27] K.A. Blinov, A.V. Buevich, R.T. Williamson, G.E. Martin, The impact of LR-HSQMBC very long-range heteronuclear correlation data on computer assistedstructure elucidation, Org. Biomol. Chem. 12 (2014) 9505–9509.

[28] A.V. Buevich, R.T. Williamson, G.E. Martin, MR structure elucidation of smallorganic molecules and natural products: choosing ADEQUATE vs HMBC, J. Nat.Prod. 77 (2014) 1942–1947.

[29] D. Uhrín, Recent developments in liquid-state INADEQUATE studies, Ann. Rep.NMR Spectrosc. 70 (2010) 1–34.

[30] G.E. Martin, A.J. Williams, Long-range 1H-15N heteronuclear chemical shiftcorrelation, Ann. Rep. NMR Spectrosc. 55 (2005) 1–119.

[31] G.E. Martin, C.E. Hadden, Long-Range 1H−15N Heteronuclear Shift Correlationat Natural Abundance, J. Nat. Prod. 63 (2000) 543–585.

[32] R. Marek, A. Lycka, E. Kolehmainen, E. Sievanen, J. Tousek, 15N NMRspectroscopy in structural analysis, Curr. Org. Chem. 11 (2007) 1154–1205.

[33] G.E. Martin, M. Solntseva, A.J. Williams, Applications of 15N NMR Spectroscopyin Alkaloid Chemistry, in: E. Fattorusso, O. Taglialatela-Scafati (Editors), ModernAlkaloids, Wiley-VCH, New York, 2008, pp. 409–472.

[34] G.E. Martin, A.J. Williams, Utilizing long-range 1H -15N 2D NMR spectroscopyfor chemical structure elucidation and confirmation, in: R.K. Harris, R.A.Wasylishen (Editors), Encyclopedia of Magnetic Resonance, Wiley, New York,2010.

[35] B.D. Hilton, G.E. Martin, Investigation of the experimental limits of small-sample heteronuclear 2D NMR, J. Nat. Prod. 73 (2010) 1465–1469.

[36] R.T. Williamson, A.V. Buevich, G.E. Martin, Using LR-HSQMBC to observelong-range 1H-15N correlations, Tetrahedron Lett. 55 (2014) 3365–3366.

[37] S. Cheatham, P. Gierth, W. Bermel, E. Kupce, HCNMBC – a pulse sequence forH-(C)-N multiple bond correlations at natural isotopic abundance, J. Magn.Reson. 247 (2014) 38–41.

[38] S. Cheatham, M. Kline, E. Kupce, Exploiting natural abundance 13C–15N couplingas a method for identification of nitrogen heterocycles: practical use of theHCNMBC sequence, Magn. Reson. Chem. 53 (2015) 363–368.

[39] G.E. Martin, Small volume and high sensitivity NMR probes, Ann. Rep. NMRSpectrosc. 56 (2005) 1–96.

[40] C.J. Jones, C.K. Larive, Could smaller really be better?Current and future trendsin high-resolution microcoil NMR spectroscopy, Anal. Bioanal. Chem. 402(2012) 61–68.

[41] Y. Liu, M.D. Green, R. Marques, T. Pereira, R. Helmy, R.T. Williamson, et al., Usingpure shift HSQC to characterize microgram samples of drug metabolites,Tetrahedron Lett. 55 (2014) 5450–5453.

[42] T.F. Molinski, B.I. Morinaka, Integrated approaches to the configurationalassignment of marine natural products, Tetrahedron 68 (2012) 9307–9343.

[43] L. Gross, F. Mohn, N. Moll, G. Meyer, R. Ebel, W.M. Abdel-Mageed, et al., Organicstructure determination using atomic-resolution scanning probe microscopy,Nat. Chem. 2 (2010) 821–825.

[44] M.E. Elyashberg, K.A. Blinov, S.G. Molodtsov, A.J. Williams, G.E. Martin, Structureelucidator: a versatile expert system for molecular structure elucidation from1D and 2D NMR data and molecular fragments, J. Chem. Inf. Comput. Sci. 44(2004) 771–792.

[45] K.O. Hanssen, B. Schuler, A.J. Williams, T.B. Demissie, E. Hansen, J.H. Andersen,et al., A combined atomic force microscopy and computational approach forthe structural elucidation of breitfussin A and B: highly modified halogenateddipeptides from Thuiaria breitfussi, Angew. Chem. Int. Ed Engl. 51 (2012)12238–12241.

[46] R.T. Williamson, A.V. Buevich, G.E. Martin, T. Parella, LR-HSQMBC: a sensitiveNMR technique to probe very long-range heteronuclear coupling pathways,J. Org. Chem. 79 (9) (2014) 3887–3894.

[47] G. Kummerlowe, B. Crone, M. Kretschmer, S.F. Kirsch, B. Luy, Residual dipolarcouplings as a powerful tool for constitutional analysis: the unexpectedformation of tricyclic compounds, Angew. Chem. Int. Ed Engl. 50 (2011)2643–2645.

[48] G. Kummerlowe, S. Schmidt, B. Luy, Cross-fitting of residual dipolar couplings,Open Spectrosc. J. 4 (2010) 16–27.

[49] K. Zangger, H. Sterk, Homonuclear broadband-decoupled NMR spectra, J. Magn.Reson. 124 (1997) 486–489.

[50] N.H. Meyer, K. Zangger, Simplifying proton NMR spectra by instanthomonuclear broadband decoupling, Angew. Chem. Int. Ed. 52 (2013) 7143–7146.

[51] J.A. Aguilar, M. Nilsson, G.A. Morris, Simple proton spectra from complex spinsystems: pure shift NMR spectroscopy using BIRD, Angew. Chem. Int. Ed. 50(2011) 9716–9971.

[52] L. Paudel, R.W. Adams, P. Király, J.A. Aguilar, M. Foroozandeh, M.J. Cliff, et al.,Simultaneously enhancing spectral resolution and sensitivity in heteronuclearcorrelation NMR spectroscopy, Angew. Chem., Int. Ed. 52 (2013) 11616–11619.

[53] L. Castañar, J. Saurí, P. Nolis, A. Virgili, J. Parella, Implementing homo- andheterodecoupling in region-selective HSQMBC experiments, J. Magn. Reson.238 (2014) 63–69.

[54] I. Timári, L. Kaltschnee, A. Kolmer, R.W. Adams, M. Nilsson, C.M. Thiele, et al.,Accurate determination of one-bond heteronuclear coupling constants with“pure shift” broadband proton-decoupled CLIP/CLAP-HSQC experiments, J.Magn. Reson. 239 (2014) 130–138.

[55] J.R. Ying, J. Joche, A. Bax, Homonuclear decoupling for enhancing resolutionand sensitivity in NOE and RDC measurements of peptides and proteins, J.Magn. Reson. 241 (2014) 97–102.

[56] M. Foroozandeh, R.W. Adams, N.J. Meharry, D. Jeannerat, M. Nilsson, G.A.Morris, Ultrahigh-Resolution NMR Spectroscopy, Angew. Chem. Int. Ed. 53(2014) 6990–6992.

[57] M. Foroozandeh, R.W. Adams, M. Nilsson, G.A. Morris, Ultrahigh-resolutiontotal correlation NMR spectroscopy, J. Am. Chem. Soc. 136 (2014) 11867–11869.

[58] J.A. Aguilar, S. Faulkner, M. Nilsson, G.A. Morris, Pure Shift 1H NMR: a resolutionof the resolution problem?, Angew. Chem. Int. Ed. 49 (2010) 3901–3903.

[59] L.H.K. Queiroz, P. Giraudeau, F.A.B. Dos Santos, K.T. de Oliveira, A.G. Ferreira,Real-time mechanistic monitoring of an acetal hydrolysis using ultrafast 2DNMR, Magn. Reson. Chem. 50 (2012) 496–501.

[60] E. Kupce, R. Freeman, B. John, Parallel acquisition of two-dimensional NMRspectra of several nuclear species, J. Am. Chem. Soc. 128 (2006) 9606–9607.

[61] E. Kupce, R. Freeman, High-resolution NMR correlation experiments in a singlemeasurement (HR-PANACEA), Magn. Reson. Chem. 48 (2010) 333–336.

[62] T. Parella, P. Nolis, Time-shared NMR experiments, Concepts Magn. Reson. PartA 36A (2010) 1–23.

[63] K.A. Blinov, N.I. Larin, A.J. Williams, K.A. Mills, G.E. Martin, Unsymmetricalcovariance processing of COSY or TOCSY and HSQC NMR data to obtain theequivalent of HSQC-COSY or HSQC-TOCSY spectra, J. Heterocycl. Chem. 44(2006) 163–166.

[64] D.A. Snyder, R. Bruschweiler, Generalized indirect covariance NMR formalismfor establishment of multidimensional spin correlations, J. Chem. Phys. 113(2009) 12898–12903.

[65] M. Jaeger, R.L.E.G. Aspers, Covariance NMR and small molecule applications,Ann. Rep. NMR Spectrosc. 83 (2014) 271–349.

[66] R. Stefani, F.B. Da Costa, Computer-assisted structure elucidation of organiccompounds, in: C.A. Taft, C.H.T.P. Silva (Editors), Current Methods in MedicinalChemistry and Biological Physics, Research Signpost, 2007, pp. 101–123.

[67] J.M. Nuzillard, Automated interpretation of NMR spectra of small organicmolecules in solution, eMagRes 3 (2014) 287–293.

96 M. Elyashberg/Trends in Analytical Chemistry 69 (2015) 88–97

Page 10: Identification and structure elucidation by NMR spectroscopy · structureelucidationfrom2D-NMRdatacanbepresentedasa com-bination of two inverse problems,which should be solved in

[68] M.E. Munk, Computer-based structure determination: then and now, J. Chem.Inf. Comput. Sci. 38 (1998) 997–1009.

[69] J.-M. Nuzillard, G. Massiot, Logic for structure determination, Tetrahedron 47(1991) 3655–3664.

[70] COCON, Available from: <http://cocon.nmr.de>.[71] S. Koichi, M. Arisaka, H. Koshino, A. Aoki, S. Iwata, T. Uno, et al., Chemical

structure elucidation from 13C NMR chemical shifts: efficient data processingusing bipartite matching and maximal clique algorithms, J. Chem. Inf. Model.54 (2014) 1027–1035.

[72] M. Elyashberg, K. Blinov, S. Molodtsov, Y. Smurnyy, A.J. Williams, T. Churanova,Computer-assisted methods for molecular structure elucidation: realizing aspectroscopist’s dream, J. Cheminform. 1 (2009) 3.

[73] W. Bremser, HOSE – a novel substructure code, Anal. Chim. Acta 103 (1978)355–365.

[74] K.A. Blinov, M.E. Elyashberg, E.R. Martirosian, S.G. Molodtsov, A.J. Williams,M.M.H. Sharaf, et al., Quindolinocryptotackieine: the elucidation of a novelindoloquinoline alkaloid structure through the use of computer-assistedstructure elucidation and 2D NMR, Magn. Reson. Chem. 41 (2003) 577–584.

[75] M.E. Elyashberg, K.A. Blinov, S.G. Molodtsov, A.J. Williams, Elucidating“undecipherable” chemical structures using computer assisted structureelucidation approaches, Magn. Reson. Chem. 50 (2012) 22–27.

[76] C. Dufour, J. Wink, M. Kurz, H. Kogler, H. Olivan, S. Sabl, et al., Isolation andstructural elucidation of armeniaspirols A-C: potent antibiotics againstgram-positive pathogens, Chem. Eur. J. 18 (2012) 16123–16128.

[77] J. Hollinshead, 6th annual ACD/Labs UK user meeting, Old windsor, UK, 2008.[78] B. Plainchont, V. de Paulo Emerenciano, J.-M. Nuzillard, Recent advances in

the structure elucidation of small organic molecules by the LSD software,Magn. Reson. Chem. 51 (2013) 447–453.

[79] LSD, Available from: <http://www.univ-reims.fr/LSD>.[80] P.N. Penchev, K.-P. Schulz, M.E. Munk, INFERCNMR: a 13C NMR interpretive

library search system, J. Chem. Inf. Model. 52 (2012) 1513–1528.[81] J.E. Peironcely, M. Rojas-Chertó, D. Fichera, T. Reijmers, L. Coulier, J.-L. Faulon,

et al., OMG: open molecule generator, J. Cheminform. 4 (2012) 21.[82] Advanced Chemistry Development, ACD/NMR predictors. Prediction suite

includes 1H, 13C, 15N, 19F, 31P NMR prediction.[83] ChemDraw, Available from: <http://www.cambridgesoft.com/Ensemble_for

_Chemistry/ChemDraw/>.[84] Modgraph, Available from: <http://www.modgraph.co.uk/product_nmr.htm>.[85] Upstream solutions GMBH NMR prediction products, Available from:

<http://www.upstream.ch/products/nmr.html>.[86] Spinus, Available from: <http://www2.ccc.uni-erlangen.de/services/spinus/>.[87] K.W. Moore, R. Li, I. Pelczer, H. Rabitz, NMR landscapes for chemical shift

prediction, J. Phys. Chem. A 116 (2012) 9142–9157.[88] K.W. Moore, A. Pechen, X.J. Feng, J. Dominy, V.J. Beltrani, H. Rabitz, Why is

chemical synthesis and property optimization easier than expected?, Phys.Chem. Chem. Phys. 13 (2011) 10048–10070.

[89] M.W. Lodewyk, M.R. Siebert, D.J. Tantillo, Computational prediction of 1H and13C chemical shifts: a useful tool for natural product, mechanistic, andsynthetic organic chemistry, Chem. Rev. 112 (2012) 1839–1862.

[90] F.V. Toukach, V.P. Ananikov, Recent advances in computational predictions ofNMR parameters for the structure elucidation of carbohydrates: methods andlimitations, Chem. Soc. Rev. 42 (2013) 8376–8415.

[91] M. Elyashberg, K. Blinov, Y. Smurnyy, T. Churanova, A. Williams, Empirical andDFT GIAO quantum-mechanical methods of 13C chemical shifts prediction:competitors or collaborators?, Magn. Reson. Chem. 48 (2010) 219–229.

[92] G.E. Martin, A.V. Buevich, M. Reibarkh, S.B. Singh, J.G. Ondeyka, R.T. Williamson,Coniothyrione: anatomy of a structure revision, Magn. Reson. Chem. 51 (2013)383–390.

[93] P. Keyes, G. Hernandez, G. Cianchetta, J. Robinson, B. Lefebvre, Automatedcompound verification using 2D-NMR HSQC data in an open-accessenvironment, Magn. Reson. Chem. 47 (2009) 38–52.

[94] S.S. Golotvin, R. Pol, R.R. Sasaki, A. Nikitina, P. Keyes, Concurrent combinedverification: reducing false positives in automated NMR structure verificationthrough the evaluation of multiple challenge control structures, Magn. Reson.Chem. 50 (2012) 429–435.

[95] B. Plainchont, J.M. Nuzillard, Structure verification through computer-assistedspectral assignment of NMR spectra, Magn. Reson. Chem. 51 (2013) 54–59.

[96] J.S. McKenzie, J.A. Donarski, J.C. Wilson, A.J. Charlton, Analysis of complexmixtures using high-resolution nuclear magnetic resonance spectroscopy andchemometrics, Prog. NMR Spectrosc. 59 (2011) 336–359.

[97] R. Dunkel, X. Wu, Identification of organic molecules from a structure databaseusing proton and carbon NMR analysis, J. Magn. Reson. 188 (2007) 97–110.

[98] V.R.R. Thummala, M.R. Ivaturi, S.R. Nittala, Isolation, identification, andcharacterization of one degradation product in ambroxol by HPLC-Hyphenatedtechniques, Sci. Pharm. 82 (2014) 247–263.

[99] A. Codina, R.W. Ryan, R. Joyce, D.S. Richards, Identification of multipleimpurities in a pharmaceutical matrix using preparative gas chromatographyand computer-assisted structure elucidation, Anal. Chem. 82 (2010) 9127–9133.

[100] J. Hubert, J.-M. Nuzillard, S. Purson, M. Hamzaoui, N.S. Borie, R. Reynaud, et al.,Identification of natural metabolites in mixture: a pattern recognition strategybased on 13C NMR, Anal. Chem. 86 (2014) 2955–2962.

[101] S.K. Oettl, J. Hubert, J.-M. Nuzillard, H. Stuppner, J.-H. Renault, J.M. Rollinger,Dereplication of depsides from the lichen Pseudevernia furfuracea by centrifugalpartition chromatography combined to 13C nuclear magnetic resonancepattern recognition, Anal. Chim. Acta 846 (2014) 60–67.

[102] C. Cobas, F. Seoane, E. Vaz, M.A. Bernstein, S. Dominguez, M. Péreza, et al.,Automatic assignment of 1H-NMR spectra of small molecules, Magn. Reson.Chem. 51 (2013) 649–654.

[103] M.A. Silva Elipe, LC-NMR and Other Hyphenated NMR Techniques: Overviewand Applications, John Wiley & Sons, Chichester, 2011.

[104] N.C. Gonnella, LC-NMR: Expanding the Limits of Structure Elucidation, CRCPress, Boca Raton, 2013, pp. S157–S162.

[105] S. Sturm, C. Segera, Liquid chromatography-nuclear magnetic resonancecoupling as alternative to liquid chromatography-mass spectrometryhyphenations: curious option or powerful and complementary routine tool?,J. Chromatogr. A 1259 (2012) 50–61.

[106] D.A. Foley, J. Wang, B. Maranzano, M.T. Zell, B.L. Marquez, Y. Xiang, et al., OnlineNMR and HPLC as a reaction monitoring platform for pharmaceutical processdevelopment, Anal. Chem. 85 (2013) 8928–8932.

[107] D.A. Foley, E. Bez, A. Codina, K.L. Colson, M. Fey, R. Krull, et al., An NMRflow-tube for online NMR reaction monitoring, Anal. Chem. 86 (2014)12008–12013.

[108] H. Tang, C. Xiao, Y. Wang, Important roles of the hyphenated HPLC-DAD-MS-SPE-NMR technique in metabonomics, Magn. Reson. Chem. 47 (2009) S157–S162.

[109] K.C. Nicolaou, S.A. Snyder, Chasing molecules that were never there:misassigned natural products and the role of chemical synthesis in modernstructure elucidation, Angew. Chem. Int. Ed. 44 (2005) 1012–1044.

[110] M.E. Maier, Structural revisions of natural products by total synthesis, Nat.Prod. Rep. 26 (2009) 1105–1124.

[111] T. Amagata, Missasigned structures: case examples from the past decade, in:L. Mander, H.-W. Liu (Editors), Comprehensive Natural Products II: Chemistryand Biology, Elsevier, 2010, pp. 581–621.

[112] T.L. Suyama, W.H. Gerwick, K.L. McPhail, Survey of marine natural productstructure revisions: a synergy of spectroscopy and chemical synthesis, Bioorg.Med. Chem. 19 (2011) 6675–6701.

[113] M. Elyashberg, K. Blinov, S. Molodtsov, A.J. Williams, Structure revision ofasperjinone using computer-assisted structure elucidation methods, J. Nat.Prod. 76 (2013) 113–116.

97M. Elyashberg/Trends in Analytical Chemistry 69 (2015) 88–97