1199
i AB INITIO STUDIES OF A PENTACYCLO- UNDECANE CAGE LACTAM by Thishana Singh Submitted in partial fulfillment of the requirements for the degree of Master of Technology in the Department of Chemistry, Faculty of Engineering, Science and the Built Environment, Durban Institute of Technology, Durban, May 2003.

AB INITIO STUDIES OF A PENTACYCLO- UNDECANE CAGE LACTAM · AB INITIO STUDIES OF A PENTACYCLO-UNDECANE CAGE LACTAM by ... STOs Slater-Type Orbitals ... However theoretical chemistry

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

  • i

    AB INITIO STUDIES OF A PENTACYCLO-

    UNDECANE CAGE LACTAM

    by

    Thishana Singh

    Submitted in partial fulfillment of the requirements for the degree of

    Master of Technology

    in the Department of Chemistry,

    Faculty of Engineering, Science and the Built Environment,

    Durban Institute of Technology, Durban, May 2003.

  • ii

    DECLARATION

    I, Thishana Singh, declare that unless indicated, this dissertation is my own work and it

    has not been submitted for a degree at another Technikon or University.

    ________________________

    T. Singh

    _____day of______________2003

    APPROVED FOR FINAL SUBMISSION

    ________________________ ________________________

    Supervisor: Dr. K. Bisetty (Ph.D.) Co-supervisor: Dr. H.G. Kruger (Ph.D.)

    ________________________

    Date

  • iii

    ACKNOWLEDGEMENTS

    I would like to thank my supervisors Dr. K. Bisetty and Dr. H.G. Kruger for their

    assistance, guidance, patience and much appreciated words of encouragement

    throughout the duration of this research project.

    My sincere gratitude also goes to:

    (i) My friends and colleagues at the Durban Institute of Technology,

    Department of Chemistry, for their support, encouragement and

    assistance.

    (ii) The National Research Foundation (NRF), M L Sultan Technikon

    Research Center for financial assistance.

    (iii) De Beers Educational Trust Fund for the purchase of a desk top

    computer.

    (iv) The University of Natal, Durban for the generous use of computer time

    on the DEC Alpha computer workstation.

    A special thanks to my parents for their continuous support and encouragement

    throughout the duration of my studies.

    "Jai Shree Krishna"

  • iv

    ABSTRACT

    The purpose of this study is to utilize computational techniques in the determination of

    the mechanistic pathways for the one-pot conversion of a pentacyclo-undecane (PCU)

    dione 1.1 to a pentacyclo-undecane cage lactam 1.2.

    OO

    NaCN

    H2O

    OH

    C O

    NHHO

    1.1 1.2

    In pursuance of this objective, the ab initio quantum mechanical level of theory was

    employed. The primary goal of this study was to compute the relative difference in

    energies for the reactants, products and transition-states of the proposed mechanistic

    pathways. The energy values obtained were used to predict the thermodynamic and

    kinetic pathways of the mechanism. All calculations were performed using the

    GAUSSIAN 98 series of programs, and GAUSSVIEW was used to visualize the

    transition-state structures.

    Full geometry optimizations were performed at the Hartree-Fock (HF) level of theory

    using the 3-21+G* basis set. In addition, the transition-states were established using a

    SCAN technique to obtain a starting structure. Transitions states were verified by using

    second-derivative analytical vibrational frequency calculations and the visual inspection

    of the movement of atoms associated with the transition.

    Hess's Law was applied to compute the heats of formation. It was found that two

    transition structures in the gas phase had abnormally high energies. However, these

    energies were found to be considerably lower in the presence of a solvent molecule.

    Furthermore, it was observed that the one-step conversion of the dione 1.1 to the lactam

    1.2 proceeded via a single transition-state.

    Previous experimental work found that the reaction proceeds through a cyanohydrin

    intermediate which in all likelihood represents the rate determining step. Sound

  • v

    arguments exist to demonstrate that the computationally determined rate-determining

    step agrees with the experimentally observed rate-determining step.

  • vi

    TABLE OF CONTENTS

    Declaration ...................................................................................................................... ii

    Acknowledgements ........................................................................................................ iii

    Abstract .......................................................................................................................... iv

    Table of contents ............................................................................................................ vi

    List of figures ............................................................................................................... viii

    List of tables ................................................................................................................. viii

    List of Abbreviations ..................................................................................................... ix

    Chapter 1 ......................................................................................................................... 1

    1. INTRODUCTION ................................................................................................ 1

    1.1 Computational Chemistry and Molecular Modeling ....................................... 1

    1.2 Pentacyclo-undecane (PCU) Cage Compounds............................................... 2

    1.3 Lactam formation ............................................................................................. 4

    Chapter 2 ......................................................................................................................... 9

    2. THEORETICAL TOOLS FOR MOLECULAR ORBITAL

    CALCULATIONS ............................................................................................... 9

    2.1 Molecular Orbital Theory ................................................................................ 9

    2.2 Molecular Mechanics ....................................................................................... 9

    2.3 Electronic Structure Methods ........................................................................ 10

    2.3.1 The Ab Initio Method .............................................................................. 11

    2.3.2 Semi-Empirical ........................................................................................ 12

    2.3.3 Density Functional Theory (DFT) Methods ............................................ 12

    2.3.4 Basis Sets ................................................................................................. 13

    2.3.4.1 Minimal Basis Sets ............................................................................... 14

    2.3.4.2 Split Valence Basis Sets ....................................................................... 14

    2.3.4.3 Polarised Basis Sets .............................................................................. 15

    2.3.4.4 Basis Sets Incorporating Diffuse Functions ......................................... 16

    2.3.5 Hartree-Fock Theory ............................................................................... 16

  • vii

    Chapter 3 ....................................................................................................................... 19

    3. TRANSITION STRUCTURE MODELING .................................................. 19

    3.1 Transition-state modeling with empirical force-fields ................................... 20

    3.2 Locating minima on the seams of intersecting semi-empirical PES .............. 22

    3.3 Transition-structure modeling of a PCU Cage Lactam using ab initio methods

    ........................................................................................................................ 23

    Chapter 4 ....................................................................................................................... 26

    4. COMPUTATIONAL DETAILS EMPLOYED ............................................... 26

    4.1 The GAUSSIAN 98 Program ........................................................................ 26

    4.2 The GaussView Program ............................................................................... 27

    4.3 The SCAN Calculation .................................................................................. 27

    4.3.1 Commands used during a SCAN or a TS Search .................................... 28

    4.4 Calculation Details ......................................................................................... 29

    5. RESULTS AND DISCUSSION ......................................................................... 30

    5.1 Introduction .................................................................................................... 30

    5.2 Local minima on the energy profile ............................................................... 33

    5.3 The Transition Structures (TS) ...................................................................... 34

    5.3.1 Transition Structure 5.3.1 ........................................................................ 36

    5.3.2 Transition Structure 5.5 ........................................................................... 39

    5.3.3 Transition Structure 5.7.1 ........................................................................ 41

    5.3.4 Transition Structure 5.10 ......................................................................... 47

    5.3.5 Transition Structure 5.12 and 5.13 .......................................................... 49

    5.4 Transition structure modeling with solvent molecules .................................. 56

    5.5 Calculation of Heats of Formation ................................................................. 61

    Chapter 6 ....................................................................................................................... 62

    6. CONCLUSION ................................................................................................... 62

    APPENDIX 1................................................................................................................. 63

    Scheme of reaction ...................................................................................................... 64

    References ...................................................................................................................... 67

    SUPPLEMENTARY MATERIAL: A CD accompanying this thesis includes the following:

    Text containing Chapters 1-6 (including References).

    Cartesian coordinates of all the 3-D structures from Chapter 5.

    Frequency calculations of the TS's.

  • viii

    LIST OF FIGURES

    Figure 3.1 Points on a simple reaction coordinate ....................................................... 19

    Figure 5. 1 Modified mechanism for the conversion of the dione 5.1 to the lactam 5.2

    .................................................................................................................... 31

    Figure 5. 2 Calculated reaction profile for the proposed mechanism. .......................... 32

    Figure 5.3 Graphical representation of Energy vs Reaction Coordinate for structure

    5.3.1 ............................................................................................................ 37

    Figure 5.4 Graphical representation of Energy vs Reaction Coordinate for structure 5.5

    .................................................................................................................... 40

    Figure 5.5 Graphical representation of Energy vs Reaction Coordinate for structure

    5.7.1 ............................................................................................................ 42

    Figure 5.6 Graphical representation of Energy vs Reaction Coordinate of the zoomed

    in (2.03 Å to 1.78 Å) relaxed scan for structure 5.7.1 ................................ 44

    Figure 5.7 Graphical representation of Energy vs Reaction Coordinate of the zoomed

    in (1.83 Å to 1.35 Å) relaxed scan for structure 5.7.1 ................................ 45

    Figure 5.8 Graphical representation of Energy vs Reaction Coordinate for structure

    5.10 ............................................................................................................. 47

    Figure 5.9 Graphical representation of Energy vs Reaction Coordinate for structure

    5.12 ............................................................................................................. 50

    Figure 5.10 Graphical representation of Energy vs Reaction Coordinate for structures

    5.12 and 5.13. ............................................................................................. 52

    Figure 5.11 Graphical representation of Energy vs Reaction Coordinate for structure

    5.13 ............................................................................................................. 53

    Figure 5.12 Graphical representation of Energy vs Reaction Coordinate for structure

    5.12 ............................................................................................................. 55

    Figure 5.13 Calculated reaction profile for the proposed mechanism. ........................... 60

    LIST OF TABLES

    Table 5.1 Calculated energies of the local minima. ....................................................... 33

    Table 5.2 Calculated energies of the transition structures. ............................................ 35

    Table 1.1 Summary of Energies ..................................................................................... 63

    Table 2.1 Heats of Formation ........................................................................................ 66

  • ix

    LIST OF ABBREVIATIONS

    CPU Central Processing Unit

    DEC Digital Equipment Corporation

    DFT Density Functional Theory

    G98W Gaussian 98 Windows

    GTOs Gaussian-Type Orbitals

    GUI Gaussian User Interface

    HF Hartree Fock

    IR Infra Red

    LCAO Linear Combination Atomic Orbitals

    MM Molecular Mechanics

    NMR Nuclear Magnetic Resonance

    PCU Pentacyclo-Undecane

    PES Potential Energy Surface

    RFO Rational Functional Optimization

    RHF Restricted Hartree Fock

    SCF Self Consistent Field

    SE Semi-Empirical

    SP Single Point

    STOs Slater-Type Orbitals

    STQN Synchronous Transit-Guided Quasi Newton

    TS Transition-State/Structure

    UV Ultra Violet

  • x

  • 1

    CHAPTER 1

    1. INTRODUCTION

    1.1 Computational Chemistry and Molecular Modeling

    The terms “theoretical chemistry”, “computational chemistry” and “molecular

    modeling” are used interchangeably and indeed most molecular modelers use all three

    concepts to describe various aspects of their research.

    “Theoretical chemistry” is often considered synonymous with quantum mechanics,

    whereas computational chemistry encompasses not only quantum mechanics but also

    molecular mechanics, minimizations, simulations, conformational analysis and other

    computer-based methods for the understanding and the prediction of the behaviour of

    molecular systems. However theoretical chemistry is a subfield of chemistry where

    mathematical methods are used in combination with the fundamental laws of physics to

    study chemical processes.1 In particular, this involves the breaking of new ground that

    ultimately leads to the writing of new mathematical codes or software that can model

    certain aspects of a chemical structure.

    Computational chemistry has therefore become one of the mainstays of modern

    industrial and academic chemistry that makes use of the established codes or software to

    study chemical systems. The ever-increasing power of modern computers coupled with

    the development of new theoretical approaches can be used for accurate and precise

    prediction of molecular properties.2 It is interesting to note that computational

    chemistry accounts for roughly a third of the super computer usage worldwide.

    Computer methods are used extensively to solve chemical problems that would be

    intractable or even impossible experimentally.3

    Computer-aided molecular design became a subject worthy of discussion in the media

    around 1981, with the advancement of sophisticated computer graphics hardware.3 The

    general aims of computational chemistry is the characterization and the prediction of

    chemical structures and their stability; the prediction of NMR, IR and UV spectra, the

    prediction of thermodynamic data, and the simulation and modulation of reaction

  • 2

    courses.4 To achieve this computational chemists are using advanced computer

    software that enables them to gain insight into chemical processes and to avoid time-

    consuming and expensive experiments.3 This approach does not replace the traditional

    wet chemistry experiments but it is a powerful aid in the understanding of experimental

    observations and the prediction of new reaction pathways. Some methods can be used

    to model not only stable molecules, but also short-lived, unstable intermediates and

    even transition-states which are required for kinetic information. In this way, they can

    provide information about molecules and reactions that may be impossible to obtain

    experimentally. Computational chemistry is therefore both an independent research

    area and a vital adjunct to experimental studies.

    1.2 Pentacyclo-undecane (PCU) Cage Compounds

    During the past half-century,5,6,7,8

    many research groups focused on the synthesis and

    chemistry of novel polycyclic cage molecules. Davis and co-workers9 were responsible

    for discovering that 1-amino-adamantane 1.3, commonly known as amantadine, exhibits

    antiviral activity thus realizing that polycyclic cage molecules also have the potential as

    biologically active agents.

    An unexpected observation10

    of the biological activity profile of amantadine revealed

    that it could be beneficial to patients with Parkinson‟s disease. The hydrocarbon cage

    of amantadine has the ability to cross the blood-brain barrier and to enter the central

    nervous system10

    due to the hydrophobicity of the "cage" despite the fact that the amino

    group is protonated at physiological pH. Drugs containing hydrocarbon cage moieties

    promote their transport across cell membranes and increase their affinity for lipophilic

    regions in receptor molecules.11

    While the hydrophobicity of the cage facilitates transport of the drug across membranes,

    the size and stability of the cage inculcate the drug with a structural property which

  • 3

    results in controlled release of the active ingredients of the drug, namely, stability

    towards degradation. In practice this translates into slow metabolism of the drug. The

    important implications of this is that the intervals between drug administration is

    increased.12

    Another factor that was found to influence the rate of release of the active

    ingredient and the potency of the drug was the presence of the amantadine substituent in

    the drug. It has been shown that the inclusion of amantadine has given rise to longer

    time over which the drug is effective, greater potency of the drug and faster drug action.

    Furthermore the nature of the substituent influences the specificity of the drug to

    antibacterial13

    , anabolic14

    and analgesic action.15

    A number of cases16,17,18

    have recently demonstrated the potential therapeutic value of

    novel pentacyclic cage compounds. These compounds have promising potential as an

    important new class of medicinal and pharmaceutical agents and might extend the

    existing range19,20,21

    of bio-active pentacyclo-undecane compounds. Further

    investigation is required into the influence of the unique steric distribution of important

    functional groups around a rigid cage structure on the pharmacological activity.

    The Diels-Alder 1.6 adduct of cyclopentadiene 1.4 and p-benzoquinone 1.5 produces

    the PCU dione 1.1. The reaction in which the dione 1.1 is synthesized is carried out by

    intramolecular photocyclisation.22

    h

    OO

    OO

    +

    O

    O

    1.4 1.5 1.6 1.1

    Treatment of the dione 1.1 with Strecker reagents (HCN, NH4OH) unexpectedly

    produced23

    the -lactam compounds 1.2a-1.2c. Strecker reactions normally produce

    cyanohydrins or amino nitriles.24

  • 4

    O

    O

    C

    NH

    OH

    O

    R

    C

    N

    OH

    OH

    R

    Lactam Lactim1.1 1.2 a, R=OH1.2 b, R=CN

    1.2 c, R=NH2

    811811

    The mechanism of this unique one pot conversion23,25,26,27

    is not well understood. The

    mechanism proposed by the authors23,25,26

    was based on basic chemical principles and

    is discussed below.

    1.3 Lactam formation

    In a previous study28

    it was shown that the dione 1.1 is easily hydrated to form the

    hydrates 1.7 and 1.8 in a 4:1 ratio. Thus it can be expected that this phenomenon should

    also play a significant role in the nucleophilic addition reactions on the carbonyl groups

    of the dione 1.1 in aqueous media.

    It can subsequently be assumed that both 1.1 and 1.9 (in Scheme I shown below) can

    participate in the formation of the dihydroxylactam 1.2. Nucleophilic attack is expected

    to take place on the exo face of the carbonyl groups27

    in the dione 1.1 or the hydrate 1.9

    as a result of the proximity of the groups and could lead to the formation of 1.11.

    Transannular cyclisation of 1.11 is expected to form the cyclic ether 1.12. Scheme I

    shown below was one of the two proposed mechanistic pathways, based on basic

    chemical principles and intuition.23,25,26,27

  • 5

    -

    1.121.11

    1.10

    1.9

    H2O

    H2ONaCN

    H2O

    H2ONaCN

    1.1

    OHOH

    O

    CN

    OH

    O

    CN

    OH

    OHOH

    CN

    OH

    O

    H2O

    Scheme I: Conversion of the dione to the cyclic ether

    It was assumed that the cyclic ether 1.12 plays an intermediate role in the conversion of

    the dione 1.1 to the dihydroxylactam27

    1.2, since the “inversion” of a nitrile group can

    be explained as in Scheme II shown below.27

    The authors later showed27

    that the

    cyanohydrin 1.11 and 1.12 can be converted to the corresponding hydroxyl lactam 1.17

    upon treatment with aqueous NaOH, providing experimental proof for their assumption

    above.

    The explanation27

    for the conversion of 1.12 to 1.2 is discussed in Scheme II below.

    Under basic reaction conditions, ring cleavage of the cyclic ether 1.12 forms the

    intermediate 1.13 which is converted to intermediate 1.14. The electron deficient nitrile

    carbon atom of the endo-orientated cyano group in 1.13 is in an extremely favourable

    position to suffer attack from the nearby negatively charged oxygen atom. Intermediate

    1.14 rearranges to form intermediate 1.15. Cyclisation of the intermediate 1.15 results

    in the formation of lactim 1.16 and by implication the lactam 1.2.27

  • 6

    H2O

    1.151.161.2

    C

    OH

    OH

    ON

    1.14

    C

    O

    OH

    O

    N

    H

    OH

    C

    OO

    N

    H

    1.13

    OH

    C

    NH

    OH

    OH

    O

    1.12

    C

    N

    OH

    OH

    OH

    CN

    OH

    O

    H2O

    --

    -

    -

    Scheme II: Conversion of the cyclic ether

    The above explanation does not account for the formation of a compound such as the

    cyano hydroxylactam 1.2b (Scheme III).23,26

    The nucleophilic attack of the hydroxide

    on the cyclic cyanohydrin 1.12 is shown is Scheme III. This is necessary to “invert” the

    cyanide group and is expected to be combined with the loss of the hydroxide group in

    the cyclic cyanohydrin 1.12, whereby the cyanohydrin 1.19 should be formed. The

    endo cyano group in 1.20 has an electron deficient carbon atom which is in an

    extremely favourable position to suffer attack from the nearby negatively charged

    oxygen atom thus producing the intermediate 1.21 and subsequently 1.22. The

    rearrangement of 1.22 to the -cyano cation 1.23 is a postulated rearrangement and is

    not a controversial one. This is so since -cyano cations of the general formula shown

    in 1.24 are significantly stabilized by charge delocalisation through resonance structures

    such as 1.25, even though this requires a portion of the charge to reside on a divalent

    nitrogen.29,30

    When attached to very unstable cations30

    then only is the -donor effect

    of cyano substituents manifested. The rearrangement proposed in Scheme III is

    promoted since the negative charge on the nitrogen atom of 1.23 is sufficiently

    stabilized by the adjacent carbonyl group to facilitate the rearrangement.

  • 7

    +

    +

    +

    -

    --

    -OH

    CN

    -

    HOC O

    CN

    N

    OOHNC

    O

    HOCN

    HOC

    O

    NH C

    N

    CNC O

    HO

    HN

    CNC O

    HO

    N

    C C N

    R

    R

    C C N

    R

    R

    1.12 1.19 1.20

    H2O

    1.22 1.211.23

    HOC

    O

    NHCN

    1.24 1.25

    1.2b

    Scheme III: Nucleophilic attack of hydroxide to form the cyano hydroxy

    lactam23,27

    The formation of 1.2 shown in Scheme IV below is also based on the explanation of the

    cyano group stabilization as postulated in Scheme III. The hydroxy group in 1.26

    should similarly stabilize the cation and result in the formation of 1.2. Attack of

    hydroxy anions instead of cyanide ions on the carbonyl carbon atom of the cyanohydrin

    1.19 also results in the conversion of dione 1.1 to the dihydroxylactam.27

  • 8

    1.2

    HOC

    O

    NHOH

    1.28

    1.261.19

    HOC

    O

    NH OH

    O

    HOC

    N

    HOC O

    OH

    N

    HOC O

    OH

    HN

    H2O

    1.27

    -

    OH-

    -

    +

    Scheme IV: Nucleophilic attack of hydroxide to form the

    dihydroxylactam23,25,26,27

  • 9

    CHAPTER 2

    2. THEORETICAL TOOLS FOR MOLECULAR ORBITAL CALCULATIONS

    2.1 Molecular Orbital Theory

    Molecular orbital theory is an approach to molecular quantum mechanics which uses

    one-electron functions or orbitals to approximate the full wavefunction. A molecular

    orbital, (x, y, z), is a function of the cartesian coordinates of a single electron. Its

    square, 2(or square modulus | |

    2 if is complex) is interpreted as the probability

    distribution of the electron in space. To describe the distribution of an electron

    completely, the dependence on the spin coordinates , also has to be included. This

    coordinate takes on one of two possible values (½) and measures the spin angular

    momentum component along the z-axis in units of h/2 .31

    2.2 Molecular Mechanics

    Molecular mechanics (MM) simulations use the laws of classical physics to predict the

    structures and properties of molecules. There are many different molecular mechanics

    methods. Each one is characterized by its particular force-field. A force-field

    comprises a set of equations defining how the potential energy of a molecule varies with

    the locations of its component atoms and a series of atom types, defining the

    characteristics of an element within a specific chemical context.32,33

    The atom types

    describe different characteristics and behaviour for an element depending upon its

    environment. For example, a carbon atom in a carbonyl is treated differently than one

    bonded to three hydrogens. The atom type depends on hybridization, charge and the

    types of the other atoms to which it is bonded. Molecular mechanics calculations don‟t

    explicitly treat the electrons in a molecular system. Instead, they perform computations

    based upon the interactions among the nuclei. Electronic effects are implicitly included

    in force-fields through parameterization. This approximation makes molecular

    mechanics computations quite inexpensive computationally, and allows them to be used

    for very large systems containing many thousands of atoms. However, it also carries

    several limitations as well. The most important is that each force-field achieves only

    good results for a limited class of molecules, related to those for which it was

  • 10

    parameterized. No force-field can generally be used for all molecular systems of

    interest. Neglect of electrons means that molecular mechanics methods cannot treat

    chemical problems where electronic effects predominate. For example, they cannot

    describe processes which involve bond formation or bond breaking. Molecular

    properties which depend on subtle electronic details are also not reproducible by

    molecular mechanics methods.34

    2.3 Electronic Structure Methods

    These methods use the laws of quantum mechanics rather than classical physics as the

    basis for their computations. According to quantum mechanics, the energy and other

    related properties of a molecule may be obtained by solving the Schrödinger equation:

    H = E (2.1)

    where, H = Hamiltonian, a differential operator which like the energy in classical

    mechanics, is representative of the kinetic and potential energy of the

    molecule,

    E = numerical energy of the state, and

    = corresponding wavefunction for molecular state.

    The Hamiltonian used in the Schrödinger equation is that for nuclear motions,

    describing the vibrational, rotational and translational states of the nuclei.35

    Schrödinger‟s equation for molecular systems can only be solved approximately. Exact

    solutions of the Schrödinger equation may only be obtained for the very simplest

    molecules (e.g., H2) because of the inter-electronic repulsion terms in the equation,

    where the motion of each electron depends on the motion of the other electrons, so

    approximate methods have to be used for larger molecules, for example the variation

    method.36

    The two main classes of electronic structure methods are semi-empirical

    methods and ab initio methods.33

    Semi-empirical methods use parameters derived from

    experimental data or high level ab initio calculations to simplify the computation. An

    approximate form of the Schrödinger equation is solved which depends on having

    appropriate parameters available for the type of chemical system in question.

  • 11

    2.3.1 The Ab Initio Method

    The term ab initio is given to computations which are derived directly from theoretical

    principles with no inclusion of experimental data. The approximations are usually

    mathematical approximations, such as using a simpler functional form for a function or

    getting an approximate solution to a differential equation. The square of the

    wavefunction 2 is interpreted as the probability density for the electrons within the

    system. The first step in simplifying the general molecular problem in quantum mechanics

    is in the separation of the nuclear and electronic motions. This is possible because the

    nuclear masses are much greater than those of the electrons and, therefore, nuclei move

    much more slowly. This separation of the general problem into two parts is called the

    adiabatic or Born-Oppenheimer Approximation.37

    Thus, the electron distribution within a

    molecular system depends on the positions of the nuclei, and not on their velocities.

    The advantage of ab initio methods is that they eventually converge to the exact

    solution, once all of the approximations are made sufficiently small in magnitude.

    However, this convergence is not monotonic. Sometimes, the smallest calculation gives

    the best result for a given property.

    The disadvantage of ab initio methods is that they are expensive. These methods often

    take enormous amounts of computer CPU time, memory and disk space. In practice,

    extremely accurate solutions are only obtainable when the molecule contains about half

    a dozen electrons or less.

    Restricted Hartree Fock (RHF) or Unrestricted Hartree Fock are the two forms of the

    wave function that are used in quantum mechanic calculations. The RHF wave function

    is used for singlet electronic states, for example, the ground states of stable organic

    molecules. The UHF wave function is most often used for multiplicities greater than

    singlets.38

    The Mller-Plesset second order perturbation theory (MP2) specifies the

    calculation of electron correlation energy. The MP2 option can only be applied to

    Single Point calculations.39

    In general, ab initio calculations give very good qualitative results and can give

    increasingly accurate quantitative results as the molecules in question become smaller.31

  • 12

    2.3.2 Semi-Empirical

    Semi-empirical (SE) calculations are set up with the same general structure as a Hartree

    Fock (HF) calculation. Within this framework, certain pieces of information such as

    two electron integrals are approximated or completely omitted. In order to correct for

    the errors introduced by omitting part of the calculation, the method is parameterized,

    by curve fitting a few parameters or numbers in order to give the best possible

    agreement with experimental data.40

    The advantage of semi-empirical calculations is that they are much faster than the ab

    initio calculations. The disadvantage is that the results can be erratic. If the molecule

    being computed is similar to molecules in the database used to parameterize the method,

    then the results may be very good. If the molecule being computed is significantly

    different from anything in the parameterization set, the answers may be very poor.40

    Semi-empirical calculations have been very successful in the description of organic

    chemistry where there are only a few elements used extensively and the molecules are

    of moderate size. However, semi-empirical methods have been devised specifically for

    the description of inorganic chemistry as well.40

    2.3.3 Density Functional Theory (DFT) Methods40

    Density Functional Theory (DFT) is the third class of electronic structure methods that

    have recently come into wide use. These methods are similar to the ab initio methods in

    many ways. DFT calculations require approximately the same amount of resources as

    the Hartree-Fock theory, but they produce results approaching the quality of the MP2

    level of theory.

    DFT methods include the effects of electron correlation, where electrons in a molecular

    system react with each other's motion and attempt to keep out of each other's way. This

    is what makes DFT methods more attractive than the expensive ab initio methods.40

    DFT methods are based on the theory developed by Hohenberg and Kohn41

    in which

    they demonstrated that the ground state energy of any molecule can be described in

    terms of the total electron density, in other words, each molecule has a unique

    functional form which exactly determines the ground state energy and electron density

  • 13

    (i.e. geometry) of the molecule.40,42

    This system is different to the wave function

    approach of ab initio techniques, where the complexity of the wave function increases

    by a factor of 3N for an N-electron system (no spin included). For the DFT system, the

    complexity of the function is less dependent to the system size since the electron density

    has the same number of variables. The aim of DFT methods, therefore, is to design

    functions which connect the electron density with energy.43

    Kohn and Sham44

    were

    responsible for the introduction of orbitals which formed the basis for the use of DFT

    calculations in computational chemistry.

    The advantage of DFT is that only the total density is to be considered and to calculate

    the kinetic energy with accuracy, orbitals need to be re-introduced. The disadvantage of

    DFT is the derivation of suitable formulae for the exchange-correlation term. DFT

    methods, however, have the ability to produce accurate results.42

    2.3.4 Basis Sets45

    A basis set is a mathematical description of the orbitals within a system used to perform

    the theoretical calculation. The wavefunction , can be expanded in terms of a set of

    atomic orbitals, in the linear combination of atomic orbitals (LCAO) method, to

    give45

    :

    = c (2.4)

    where c = molecular orbital expansion coefficient, and

    = basis function of atomic orbital.

    The coefficient c is varied to obtain the wavefunction , which will give the lowest

    energy in the Schrödinger equation. The more vibrational parameters used to describe

    an individual orbital, the lower the energy. However, a situation is reached when the

    energy is no longer decreased when the number of vibrational parameters is increased

    and then the best single determinant wavefunction is obtained. When this occurs,

    changing the wavefunction , by an infinitesimal amount will not alter the energy. The

    number and quality of the atomic orbitals determine the quality of the molecular

    orbital . If there are many electrons in a molecule then the number of atomic integrals

  • 14

    required increases rapidly and can be as many as several million for quite small

    molecules. For this reason a fast computer which has a large storage capacity is

    essential. The two types of atomic basis functions are Slater-type atomic orbitals (STOs)

    and Gaussian-type atomic orbitals (GTOs). The former is not well suited to numerical

    work, and their use in practical molecular orbital calculations has been limited. Almost all

    modern ab initio calculations employ GTO basis sets. These basis sets, in which each

    orbital is made up of a number of Gaussian probability functions, has considerable

    advantages over STOs. The Gaussian series of programs deals, as the name implies,

    almost exclusively with Gaussian-type orbitals.

    2.3.4.1 Minimal Basis Sets45

    These contain the minimum number of basis functions needed for each atom. Minimal

    basis sets are characterized by fixed-size atomic-type orbitals. Each orbital is

    characterized by its coefficient and its exponent. They therefore do not have the

    capability to expand or contract because the exponent is fixed.

    (i) The STO-3G Basis Set45

    The series of minimal basis sets consists of expansions of Slater-type orbitals (STOs).

    The STO-3G basis set yields properties that are reasonably close to limiting values and

    in view of the relative computational times of the various expansions, it is this level that

    has been selected as an optimum compromise for widespread application. Another

    possible exception to the use of the three-gaussian expansion as the standard minimal

    basis level occurs in the consideration of the properties of weakly-bound complexes

    where long-range forces are important.

    2.3.4.2 Split Valence Basis Sets45

    The way that a basis set can be made larger is to increase the number of basis functions

    per atom. Inclusion of two sets of isotropic p-functions in the representation, one tightly

    held to the nucleus and the other relatively diffuse, will allow independent variation of

    the radial parts of the two sets of p-functions, thus producing more contracted or more

    diffuse functions that would be suitable for the descriptions of say, s- and p-systems,

    respectively. A basis set formed by doubling all functions in a minimal representation

  • 15

    is known as a double-zeta basis, while one in which only the basis functions for the

    outer valence shells are doubled, is known as a split-valence basis set.

    (i) The 3-21G Basis Set45

    The 3-21G basis set defined through the fourth row of periodic table of elements, typify

    representations in which two basis functions, instead of one have been allocated to

    describe each valence atomic orbital. Except for hydrogen, the 3-21G basis sets are

    employed as is, that is, without rescaling of the valence functions to account better for

    changes that might occur as a result of molecule formation.

    2.3.4.3 Polarised Basis Sets45

    While split valence basis sets allow orbitals to change their size, but not their shape,

    polarised basis sets remove this limitation by adding orbitals with angular momentum

    beyond that which is required for the ground state description of each atom. For

    example, polarised basis sets add p-functions to hydrogen atoms, d-functions to the

    main groups and f-functions to transition metals.

    (i) The 6-31G* Basis Set45

    The 6-31G* basis set was originally proposed for first-row atoms and later extended to

    second-row elements. The 6-31G* basis set is constructed by the addition of a set of six

    second order (d-type) gaussian primitives to the split valence 6-31G basis set

    description of each heavy (non-hydrogen) atom.

    (ii) The 6-31G** Basis Set45

    The basis set described above does not allow for any polarisation of the s orbitals of

    either hydrogen or helium atoms. The 6-31G** basis set is identical to 6-31G* except

    for the addition of a single set of gaussian p-type functions to each hydrogen and helium

    atom.

    (iii) The 3-21G* Basis Set45

    The 3-21G* basis set for second-row elements are constructed directly from the

    corresponding 3-21G representations by the addition of a complete set of six second-

    order gaussian primitives. Although the resulting representations contain the same

    number of atomic basis functions per second-row atom as the 6-31G* polarisation basis

  • 16

    set previously described, these are made up of significantly fewer gaussians (three

    instead of six for each inner-shell atomic orbital, and two gaussians instead of three for

    the inner part of the valence description).

    2.3.4.4 Basis Sets Incorporating Diffuse Functions45

    The basis sets that have been discussed thus far are more suitable for molecules in

    which electrons are tightly held to the nuclear centers than they are for species with

    significant electron density far removed from those centers. Calculations involving

    anions pose special problems. Since the electron affinities of the corresponding neutral

    molecules are typically quite low, the extra electron in the anion is only weakly bound.

    One way to overcome problems associated with anion calculations is to include in the

    basis representation one or more sets of highly diffuse functions. These are then able to

    describe properly the long-range behavior of molecular orbitals with energies close to

    the ionization limit.

    (i) The 3-21+G and 3-21+G* Basis Set45

    The 3-21+G basis set for the first-row elements and the 3-21+G* basis set for the

    second-row are constructed from the underlying 3-21G and 3-21G* representations by

    the addition of a single set of diffuse gaussian s- and p-type functions. For first-row

    elements with lone pairs, the effects of diffuse and polarisation functions are

    complementary to some extent. Hence, the energies of processes involving changes in

    the number of lone pairs, for example, protonation, hydrogen bonding, or other

    interactions, are improved at diffuse-orbital-augmented levels even when large basis

    sets are used.

    This basis set produces entry level results and is relatively inexpensive in terms of

    computer resources and time. If the system under study involves only C, H, N and O

    atoms, the system (with RHF) is ideal for student training. Upgrading to a better basis

    set is trivial and one does not loose too much valuable time during your learning curve.

    2.3.5 Hartree-Fock Theory46

    The most common type of ab initio calculation is the Hartree Fock (HF) calculation in

    which the primary approximation is the central field approximation. This means that

    the Coulombic electron-electron repulsion is not specifically taken into account.

  • 17

    However, its net effect is included in the calculation. This is a variational calculation,

    meaning that the approximate energies calculated are all equal to or greater than the

    exact energy. Because of the central field approximation, the energies from HF

    calculations are always greater than the exact energy and tend to a limiting value called

    the Hartree Fock limit. This variational principle leads to the following equations

    describing the molecular orbital expansion coefficients, derived by Roothaan and

    Hall46

    :

    01

    vi

    N

    v

    viv cSF (2.2)

    where, = 1, 2, …, N and cvi = molecular orbital expansion coefficient.

    Equation (2.2) can be re-written in matrix form:

    FC = SC (2.3)

    where F = the Fock matrix,

    S = the overlap matrix,

    = the diagonal matrix, and

    N = one-electron function or basis function.

    Equation (2.3) is not linear and therefore must be solved iteratively. The procedure by

    which this is carried out is called the self-consistent field (SCF) method.46

    Slater and

    Gaussian type orbitals are used in these equations. The Hartree Fock equations are

    applicable no matter how many electrons there are in the molecule. However, this

    theory does not include a full treatment of the effects of electron correlation, that is, the

    energy contributions arising from electrons interacting with one another. It is

    reasonably good at computing the structures and vibrational frequencies of stable

    molecules and some transition-states. As such, it is a good base-level theory.46

    The second approximation in HF calculations is that the wave function must be

    described by some functional form, which is only known exactly for a few one electron

    systems. The functions used most often are linear combinations of Slater-type orbitals

  • 18

    (STOs) or Gaussian-type orbitals (GTOs). The wave function is formed from linear

    combinations of atomic orbitals or more often from linear combinations of basis

    functions. Because of this approximation, most HF calculations give computed energy

    greater than the Hartree Fock limit. The exact set of basis functions46

    used is often

    specified by an abbreviation, such as STO-3G or 6-31g**.

  • 19

    CHAPTER 3

    3. TRANSITION STRUCTURE MODELING47

    A transition structure (TS) is the molecular species that is represented by the top of the

    potential energy graph in a simple one dimensional reaction coordinate shown in Figure

    3.1. In order to determine the energy barrier to reaction rate, the energy of this

    transition-state species is needed. The geometry and energy of a transition structure

    include important pieces of information for describing reaction mechanisms.

    En

    erg

    y

    Reaction coordinate

    Reactants

    Products

    Transition state

    Precomplex

    Precomplex

    Figure 3.1 Points on a simple reaction coordinate48

    A transition structure is defined mathematically as the geometry which has a zero

    derivative of energy with respect to moving every one of the nuclear coordinates and

    has a positive second derivative of energy of all but one geometric movement which has

    a negative curvature.47

    This description however, describes many structures other than

    a reaction transition, for example an eclipsed conformation or the intermediate point in a

    ring flip, a simple rotation of a methyl group or any structure with a higher symmetry

    than the ground state of the compound.

    It is difficult to predict what a transition structure will look like without the aid of

    computer simulation. Such a prediction might be made based on a proposed

    mechanism, which may be incorrect. The potential energy surface (PES) around the

    transition structure is often much more flat than the surface around a stable geometry,

    thus there may be large differences in the transition structure geometry between two

  • 20

    seemingly very similar reactions and with very small differences in energy.47

    It has

    however been possible computationally, to determine transition structures, although it is

    not always easy. Experimentally, it has only become possible to examine reaction

    mechanisms directly using femtosecond pulsed laser spectroscopy. It will be some time

    before these techniques can be applied to all of the compounds that are accessible

    computationally. Furthermore, these techniques yield indirect information such as

    vibrational information rather than a likely geometry for the transition structure.47

    Synthetic approaches to obtain information about transition-states are also limited to

    very special cases, such as the static SN2 transition-state 3.1 shown below.49

    OMeMeO C

    OMeMeO

    3.1

    An X-ray structure of the above mentioned molecule was reported.49

    3.1 Transition-state modeling with empirical force-fields50

    The transition-states (TS) involved in a conformational equilibrium can be studied using

    the ground-state parameters developed from geometries and heats of formation of stable

    molecules. One of the earliest applications of empirical force-fields to organic

    chemistry was Westheimer‟s study of rotational barriers in biphenyls, which begun in

    the 1940‟s and was reviewed in the 1950‟s.51,52

    In the subsequent half-century, there

    have been many studies of conformational rate processes in organic systems. Ground-

    state parameters are fully appropriate to such studies; transition-states have more torsion

    and non-bonded strains than energy minima, but have the same type of bonds.

    However, when bonds are being formed or broken, the parameters suitable for ground

    states are no longer appropriate.50

    Consequently, parameters must be developed to

    model partial bonds in a quantitative way when using semi-empirical methods.

    Theoretical studies on transition structures of several class of reactions have shown that

  • 21

    bond lengths and other geometrical parameters in transition structures have a relatively

    narrow range of values. For example, in pericyclic reactions, forming C-C single bonds

    generally have bond lengths from 1.95 to 2.28 Ǻ, even though some of these reactions

    are very exothermic and others are thermoneutral.53

    Radical additions to alkenes have

    been studied for a variety of carbon- and oxygen- centred radicals, with constant angles

    and bond lengths around 105 ± 3 and 2.25 ± 0.01 Ǻ, respectively.54

    Hydroborations of alkenes and alkynes have been studied for a variety of alkylboranes

    and substituted ethylenes. Even in the presence of high steric hindrance, the formation

    and the breaking up of bond lengths are relatively constant.55

    These examples support

    one critical procedure often used in simple force-field transition-state modeling, namely

    the breaking and forming bonds are either fixed at some lengths, or these values are

    treated as energy minima. The energy is actually a maximum for the reaction

    coordinate, but the simple expedient of calculating transition-states as minima has been

    used in many cases. The TS is a saddle point along the free-energy surface. This saddle

    point has a negative curvature in only one direction. The negative force constant

    corresponds to motion along the reaction coordinate. All other vibrational motions have

    positive force constants exactly like energy minima on potential energy surfaces.

    However the negative force constant causes the transition structures to have unique

    properties different from force minima.

    As mentioned at the beginning of this chapter the energy surface at the TS is often very

    flat. This normally causes problems for using SE methods to determine transition-

    states.

    Due to the structural features and characteristics in the highly strained moieties, the

    outer limits of what can be prepared and studied regarding thermodynamic stability and

    kinetic reactivity is investigated. Studies of such systems provide an excellent test for

    existing chemical theory and thus perhaps furnish the best opportunity for advancing the

    frontiers of our chemical knowledge.7 It is therefore likely that SE methods will give

    poor results for transition-states involving the cage structure.

  • 22

    3.2 Locating minima on the seams of intersecting semi-empirical PES56

    Although the word "minima" is used above, it actually implies the lowest maximum on

    the energy surface between the product and the reactant.

    In recent years, the location of TS‟s has become relatively routine due to improvements

    in the optimization algorithms. The TS can be refined to any desired accuracy, if the

    energy-generating function is of the ab initio type. However, the practical consideration

    usually limits both the size and the level of sophistication. Semi-empirical methods can

    be used for somewhat larger systems, but in this case only comparison with experiments

    or accurate ab initio calculations can be used to assess the quality of results.

    The primary concern in many applications is not in the prediction of absolute values of

    activation parameters, but rather on how they vary for closely related systems. There

    are two or more reaction pathways that have activation energies which differ by only a

    few kcal mol-1

    and synthetic sequences are often dependent on these pathways. In such

    cases, the desired reaction can often be favoured by a careful selection of substituents at

    specific sites in the molecule being studied. According to the influence they have on a

    reaction, substituents can be divided into two limiting cases; those of a “structural” or

    “steric” nature and those which mainly exert an “electronic” influence. While the latter

    requires an explicit description of the electrons in the system, the former can be

    modeled by less rigorous theoretical methods, i.e. SE or MM. However, the above

    classification of substituents will depend on the given reaction.56

    The treatment of a TS as a minimum on the PES is a more fundamental problem with

    Houk's approach.56

    These directions can in general be written as a linear combination

    of internal or cartesian coordinates. Three different strategies can be employed in

    transferring the ab initio structure to the force-field model:

    (i) The "fixed atom” procedure, where the atoms directly involved in the

    reaction are frozen by fixing their cartesian coordinates;

    (ii) The “fixed parameter” procedure, where certain internal coordinates are

    constrained by assigning large force constraints to these variables; and

  • 23

    (iii) The “flexible parameter” procedure, where all atoms are allowed to

    move.

    As an alternative, the equivalent of a TS in a force-field environment can be defined as

    the lowest energy structure linking the reactant and product. When different sets of

    parameters are needed for describing the two end points, the TS equivalent is thus the

    lowest energy structure on the seam of the intersecting PES‟s. The advantage of the

    current strategy over Houk‟s TS modeling, is that only information regarding the two

    minima (reactant and product) on the PES is needed, and such data are in principle

    accessible by experiments. The disadvantage is that the functional form of the energy

    must be reasonably accurate over a wider range of geometries than just near the

    "minima".

    SE methods were initially employed in this study for the determination of transition

    structures. It was found that the method indeed produced poor quality results, and in

    many cases the correct transition structures could not be found. It was therefore decided

    to use ab initio techniques for the determination of the required transition-states.

    3.3 Transition-structure modeling of a PCU Cage Lactam using ab initio

    methods

    The TS algorithm uses a combination of Rational Functional Optimization (RFO) and

    linear search step to search for the lowest maximum on the energy surface between the

    reactant and the product. A mathematical algorithm by default uses a crude semi-

    empirical (SE) guess (INDO guess is used for the first-row systems, CNDO for the

    second-row, and Hückel for the third-row and beyond)57

    for the initial start structure to

    the solution of the transition-state wave function. If the starting structure is too far from

    the real maximum on the energy surface, the search algorithm would not find the correct

    transition structure within the multitude of local maxima.

    In order to obtain a better guess for the solution to the wave function, one could use

    either of two methods:

    (i) carry out a Single Point (SP) calculation of the starting structure at the

    same level of theory using the same basis set, followed by an additional

  • 24

    step in the calculation, with the option to read the guess (guess = read)

    from the calculation done (obtained from the checkpoint file) at the

    required level and basis set or

    (ii) use the option “CALCFC” where the force constant is calculated at the

    required level/basis set and used to start the solution to the wave

    function. The second option is considerably more expensive in terms of

    resources and time, since the “CALCFC” option does the same

    calculation as required for a frequency calculation. (see Chapter 4 for a

    discussion on CALCFC).57

    A third method is the QST2 and QST3 methods in Gaussian which has the facility for

    automatically generating a starting structure for a transition-state optimization based

    upon the reactants and products that the transition-state connects,58

    known as the

    Synchronous Transit-Guided Quasi Newton (STQN) method. This method uses a

    quadratic synchronous transit approach to get closer to the quadratic region of the

    transition-state, and uses a quasi-Newton or eigenvector-following algorithm to

    complete the optimization.57

    QST2 requires two molecule specifications, i.e., the

    reactants and products. QST3 requires three molecule specifications, that is, the

    reactants, the products and an initial structure for the transition-state, respectively. The

    QST2 and QST3 methods were utilized in this study, but it did not yield favourable

    results.

    A fourth option is to use the “CALCALL” option in combination with the transition-

    state optimization. During this calculation, the force constant will be calculated for

    each optimization step. This option is very expensive and is only used as a last resort

    when the methods above do not show positive results.

    In this study, the TS for a PCU cage lactam were found by locating maxima on the

    potential energy surfaces using Restricted Hartree-Fock theory and the 3-21+G* basis

    set. For each TS (see Chapter 5 for the mechanistic pathway) a SCAN calculation (see

    Chapter 4 for computational details) was performed to establish the maxima. The final

    structure corresponds to a minimum on the potential energy surface, or saddle point. In

    order to determine the nature of the stationary point found, a frequency calculation was

  • 25

    performed. The frequency output file has information that is critical in characterizing

    the stationary point, namely, the number of imaginary frequencies and the normal mode

    corresponding to the imaginary frequencies. Imaginary frequencies are listed in the

    output file as negative numbers. By definition, a structure which has n imaginary

    frequencies is an nth

    order saddle point. Thus transition structures are usually

    characterized by one imaginary frequency since they are first-order saddle points.59

    The

    movement of atoms associated with the imaginary frequency should follow the atoms

    on the reaction coordinate between the reactant and product. Any reaction profile will

    have only one transition-state. Although many intermediate transition structures on the

    reaction profile might exist, only one of them will be the rate-determining step. That

    transition structure is defined as the transition-state for the reaction.

  • 26

    CHAPTER 4

    4. COMPUTATIONAL DETAILS EMPLOYED

    This study was carried out on a series of related molecules using the GAUSSIAN 9860

    program implemented on a DEC Alpha DS20 workstation with two CPU‟s and a

    Pentium II desktop computer. A true 64 bit operating system was implemented on the

    DEC Alpha workstation with a parallelized version of GAUSSIAN 98 program.

    4.1 The GAUSSIAN 98 Program

    GAUSSIAN 98 (G98) is one of the series of electronic structure computer programs

    which began with GAUSSIAN 70, GAUSSIAN 92 and GAUSSIAN 94. GAUSSIAN

    03 is the latest program which was released in April 2003.

    The GAUSSIAN programs are general-purpose programs capable of performing ab-

    initio Hartree-Fock (HF) molecular orbital calculations based on the linear combination

    of atomic orbitals (LCAO) approach. As the name implies, the program deals mainly

    with Gaussian-type orbitals, which has been described in Chapter 2. However new

    methods have been added to G98 so as to improve optimization procedures for

    transition-state calculations.

    In addition, G98 can compute energies, molecular structures, vibrational frequencies

    and numerous other molecular properties for systems in the gas phase and in solution,

    including the ground state and excited states.

    The input section to the GAUSSIAN programs consists of the molecular charge and the

    multiplicity, the symbols of the constituent atoms and a definition of the molecular

    structure, either in the form of cartesian coordinates or the Z-matrix notation, which

    defines the molecular geometry in terms of bond lengths, bond angles and dihedral

    angles. The task to be performed, i.e. whether a single-point calculation, geometry

    optimization or frequency calculation, must also be specified, together with the

    appropriate basis set and the level of theory.

    The 3-D structures on the CD accompanying this thesis are written in the Gaussian input

    file format (gjf). The geometries of the structures can be viewed by using the Gausview

  • 27

    program or the freeware (Molekel) program. The installation files for Molekel can be

    downloaded from the site: www.cscs.ch/molekel

    4.2 The GaussView Program61

    GaussView is a Graphical User Interface (GUI) program designed to simplify and

    extend the use of the Gaussian 98 program. In this study, Gaussview was used to build

    and edit molecules, set up and submit Gaussian jobs, and to display and use the results

    from the Gaussian jobs. However, GaussView is not directly integrated into the

    Gaussian program system, but acts as a front-end/back-end processor to facilitate its use

    on a desktop computer workstation. GaussView was also used to verify the animation

    of atoms associated with the negative eigenvalue of the different transition-states.

    4.3 The SCAN Calculation

    The scan option was exclusively used as an aid to finding an approximate starting

    structure for a normal transition-state optimization. A relaxed SCAN calculation

    involved changing of bond length from reactants to products, in a step-wise manner.

    The only constraint in this calculation is the required reaction coordinate (i.e. bond

    length, angle or the dihedral angles). The rest of the molecule is then optimized to find

    the lowest possible structure and energy, subject to the imposed constraints, after which

    the reaction coordinate is modified by a prescribed value and in the next step the

    procedure is repeated. In this study only relaxed SCAN calculations were used. The

    energy of each step was plotted against the reaction coordinate. By inspecting the

    different structures at each step of the scan job, one could follow the course of the

    reaction. The approximate starting structure for a full (non-restrained) transition-state

    optimization was obtained by manually extracting the coordinates of the structure

    closest to the maxima on the energy vs. reaction coordinate plot. The TS was verified

    by performing a frequency (FREQ) calculation. A frequency calculation produces only

    one negative eigenvalue, which is usually associated with the movement of atoms

    involved in either bond breaking or bond formation.

    http://www.cscs.ch/molekel

  • 28

    4.3.1 Commands used during a SCAN or a TS Search60

    (i) GDIIS

    Specified the use of the modified GDIIS algorithm, recommended for use with

    larger systems, tight optimizations and molecules with flat potential energy

    surfaces. It is the default for semi-empirical calculations. This command makes

    use of a smaller step-size down the potential energy valley.60

    (ii) MODREDUNDANT or ModRed

    Used in geometry optimization, e.g. the bond length to be scanned. The specific

    coordinates that are to be constraint (Modified Redundant coordinates = ModRed) is

    specified below the Cartesian coordinates.60

    (iii) TS

    Used in a search to request the optimization to use a mathematical search algorithm

    which aims to find a local maxima on the potential energy surface (i.e. a transition-

    state) rather than a local minimum.60

    (iv) NOEIGENTEST

    The default transition-state search in G98 makes use of the EigenTest. If only one

    imaginary frequency is found, the calculation continues to find the transition-state

    associated with this negative eigenvalue. If more than one imaginary frequency

    exists, the default routine is to terminate the calculations. Since it is practically

    almost impossible to find a starting structure for a TS with one and only one

    negative eigenvalue, the default TS calculation terminates very often. In order to

    overcome this oversensitive search criterion, one uses the "NoEigenTest" option

    which overrides the default search criteria in G98.60

    (v) CALCFC

    Specified the force constants be computed at the first point using the current method

    (available for the HF, MP2, CASSCF, DFT, and semi-empirical methods only). By

    default Gaussian uses a MNDO (semi-empirical) guess for the solution to the wave

    function of the specified system. The optimization uses this guess as starting point

  • 29

    where, after the ab initio calculation is “built” on this starting point. Since the

    MNDO guess is based on a rather crude or inaccurate method, the calculation could

    sometimes follow a wrong solution for the wave function. One observes this by

    inspecting the geometry of the structure produced by the optimization, there are

    basically two choices: (a) start with a better structure, i.e., a structure that was

    optimized at a lower level of theory or (b) if a better starting structure was already

    used, use the CalcFc option.3,60

    4.4 Calculation Details

    In this study the procedure for all structures was the structures in HyperChem Version

    5.62

    In order to remove any van der Waals contacts or overlap, an energy minimization

    was performed with ChemOffice,63

    using a molecular mechanics force-field. The final

    structures were saved as input files for GAUSSIAN 9860

    (G98W). A low level (STO-

    3G) ab initio (full geometry optimization) calculation was carried out on the

    unconstrained molecule. The (OPT) keyword requests that a geometry optimization be

    performed. The geometry is adjusted according to a mathematical algorithm to follow

    the energy surface “down hill” until a stationary point on the potential energy surface is

    found. Note that the algorithm by nature would not overcome local minima and special

    techniques such as automatic or manual conformational searches or molecular

    dynamics64,65,66

    should be employed to overcome, mainly, rotational energy barriers.

    This is more problematic in flexible molecules such as peptides.

    For the Hartree-Fock, DFT and semi-empirical methods, the Berny algorithm is the

    default algorithm for minimizations to a local minimum and optimizations to transition-

    states and higher order saddle points.57

    The purpose of a geometry optimization is to

    locate the lowest energy of the molecular structure that is in close proximity to the

    specified starting structure. The output structure of the optimization was submitted on a

    DEC Alpha workstation using the RHF/3-21+G* level of theory. A SCAN calculation

    was performed to locate the maxima on the PES. The starting structure for the

    transition-state (TS) calculation was obtained by manually extracting the coordinates of

    the structure that correspond to the maxima. Once a TS was obtained a frequency

    calculation was carried out to verify that there is only one negative eigenvalue.

  • 30

    CHAPTER 5

    5. RESULTS AND DISCUSSION

    5.1 Introduction

    The purpose of this study is to utilize computational techniques in the determination of

    the proposed mechanistic pathways, for the one-pot conversion of pentacyclo-undecane

    (PCU) dione 5.1 to pentacyclo-undecane cage lactam 5.2.23,25

    OO

    NaCN

    H2O

    OH

    C O

    NHHO

    5.1 5.2

    This chapter focuses on the calculation results obtained for the proposed mechanistic

    pathway as illustrated in Figure 5. 1 (see Page 31). The basis set incorporating diffuse

    functions, that is, 3-21+G* was used in this study as discussed in Chapter 2. The

    mechanism, which was one of two possible pathways that was proposed23,25,26,27

    is

    based on basic chemical principles and intuition and was discussed in Chapter 1. In this

    study, the pathway proposed above was used as basis for the calculation of the reaction

    profile. Based on the observations made during this investigation (i.e., a starting

    structure optimized to a different intermediate or transition structure) the proposed

    pathway was also modified. The energies for the calculated mechanism proposed in this

    study are depicted in the form of a reaction profile (Figure 5.2 Page 32).

    The reaction profile was investigated by first calculating the geometries and energies of

    the minima on the energy surface (Figure 5.2). Since the cage is very rigid, very few

    problems with conformational isomers of higher energies, were experienced.

    The structures and energies of the corresponding TS's were then calculated. The

    procedure for locating the TS's is described in Chapter 3, Section 3.3.

  • 31

    5.1

    NaCN

    H2O

    5.3.1

    H2O

    5.4

    -OH

    + NaOH

    5.9

    5.2

    5.5 5.6 5.7.1

    5.10 5.11

    N

    OH

    OH

    C

    5.13

    O

    O O

    O-

    CN

    O

    CN

    OH

    O

    HO

    CN

    O

    HO

    CN

    O

    HO

    HO

    OH

    CN

    OH

    HO

    HN

    OH

    C

    OHO

    HN

    O

    OH

    C

    HOHN

    O

    OH

    C

    HO

    5.12

    HN

    O

    OH

    C

    HO

    CN

    OH

    Figure 5. 1 Modified23,25,26,27

    mechanism for the conversion of the dione 5.1 to

    the lactam 5.2.

  • 32

    5.1

    5.2

    5.3.1

    5.4

    5.5

    5.7.1

    5.9

    5.12

    5.13

    -100

    -50

    0

    50

    100

    150

    200

    250

    300

    Reaction Co-ordinate

    5.105.65.11

    O

    OO

    CN

    OH

    CN

    O

    HO

    CN

    O

    HO N

    OH

    OH

    C

    HO

    OH

    CN

    OH

    HO

    HN

    OH

    C

    OHO

    HN

    O

    OH

    C

    HO

    HN

    O

    OH

    C

    HO

    HN

    O

    OH

    C

    HO

    CN

    OH

    OHO

    O-

    CN

    O

    Rel

    ati

    ve

    ener

    gy

    (k

    cal

    mo

    l-1)

    Figure 5.2 Calculated reaction profile for the proposed mechanism.

    The cartesian coordinates of all the 3D structures presented in this Chapter are included on the CD accompanying this Thesis.

  • 33

    5.2 Local minima on the energy profile

    Structures 5.1, 5.4, 5.6, 5.9, 5.11, and 5.2 shown below are stationary points classified

    as local minima on the energy profile of the reaction.

    5.1

    O

    O

    5.4

    O

    CN

    OH

    5.6

    CN

    O

    HO

    5.9

    N

    OH

    OH

    C

    HO

    5.11 5.2

    HN

    OH

    C

    OHOHN

    O

    OH

    C

    HO

    An energy minimization was performed for each of the above structures to remove

    any Van der Waals contacts or bond overlap. The calculated energies for the above

    structures are presented in the Table 5.1. Note that the only rotational flexibility in

    these structures are the C-OH bonds illustrated above. The corresponding bonds were

    rotated at angles of 30º intervals and re-optimized to ensure the lowest possible

    isomer was obtained.

    Table 5.1 Calculated energies of the local minima.

    Relative energiesa

    Structure numberb

    STO-3G/Hartrees 3-21+G*/Hartrees 3-21+G*/kcal mol-1

    5.1 -565.0296 -568.9543 0

    5.4 -656.7586 -661.3513 27.66

    5.6 -656.8147 -661.3657 18.66

    5.9 -731.7744 -731.9821 19.79

    5.11 -731.8214 -737.0049 6.19

    5.2 -771.8263 -737.0314 -10.46

    aRelative energies are expressed in Hartrees and kcal mol

    -1, performed at the HF level using the STO-

    3G basis set, followed by a higher 3-21+G* level of theory.

    bStructure number as per proposed reaction mechanism shown in Figure 5.1.

  • 34

    From the results presented in Table 5.1, it is evident that the higher level basis set

    produces a significantly lower energy value. (Note that energies obtained with

    different basis sets cannot be directly compared). In addition, when the reaction

    profile as shown in Figure 5.2, was plotted, it was also evident that the energy values

    (heats of formation) confirm that structures 5.1, 5.4, 5.6, 5.9, 5.11, and 5.2 are indeed

    minima.

    5.3 The Transition Structures (TS)

    Structures 5.3.1, 5.5, 5.7.1, 5.10, 5.12 and 5.13 shown in Figure 5.1 are characterized

    as the following transition-state structures.

    5.3.1

    O-

    CN

    O

    5.5

    CN

    O

    HO

    5.7.1

    CN

    OH

    O

    HO

    5.10

    OH

    CN

    OH

    HO

    5.12

    HN

    O

    OH

    C

    HO

    5.13

    HN

    O

    OH

    C

    HO

    The cartesian coordinates of all the 3-D structures presented in this Chapter are

    contained on the CD accompanying this Thesis. Please refer to Chapter 3 for a

    discussion on the techniques used to determine the transition-states below.

    An explanation of the different types of SCAN calculations used in the route section

    for the location of the various transition structures is given in Chapter 4. The

    calculated energies for the above structures are presented in the Table 5.2.

  • 35

    Table 5.2 Calculated energies of the transition structures.

    Relative energiesa

    Structure numberb

    3-21+G*/Hartrees 3-21+G*/kcal mol-1

    5.3.1 -660.7663 127.39

    5.3.2 -736.9142 63.11

    5.5 -661.2569 86.90

    5.7.1 -736.3609 188.43

    5.7.2 -812.5102 77.74

    5.10 -736.8741 88.27

    5.12 -736.8873 79.95

    5.13 -736.8880 79.55

    aRelative energies are expressed in Hartrees and kcal mol

    -1, performed at the HF level using the 3-

    21+G* level of theory.

    bStructure number as per proposed reaction mechanism shown in Figure 5.1.

    In the discussion of the results for each of the transition-states, the three dimensional

    (3-D) structures have been referred to as Scan start or Scan end. Scan start implies

    the starting structure that was submitted for the calculation. Scan end refers to the

    corresponding structure that was obtained at the end of the scan calculation. By

    inspecting the different structures at each step of the scan job, one could follow the

    reaction profile and by plotting the corresponding energies vs. the reaction coordinate,

    one can obtain an approximate indication of the transition structure involved. This

    approximate starting structure for a full (non-restrained) transition-state search was

    obtained from the SCAN calculation by manual isolation of the coordinates of the

    structure closest to the maxima on the reaction profile (see Figure 5.3).

    The same basic computational methods used for the calculation of the local minima

    energies were utilized in locating the geometries of the transition-state structures. The

    only difference is that the last part of the procedure made use of a transition-state

    optimization during a scan close to the maxima associated with the corresponding TS.

    The transition structure that was obtained is referred to TS end. A summary of the

    results obtained for each of the transition-state structures follows.

    Note that the Gaussian 98 frequency output files of the different transition-states are

    available on the CD attached to this thesis. Molekel (freeware) at the site:

  • 36

    www.cscs.ch/molekel can also be used to view the frequency output files, in

    particular the vibrations associated with the negative eigenvalue.

    5.3.1 Transition Structure 5.3.1

    A constrained optimized structure (C9-C13, fixed at 1.7 Å) was used as input for a

    relaxed SCAN calculation. The three dimensional (3-D) input structure for the

    relaxed SCAN calculation is represented below.

    5.3.1 (Scan start)

    It is clear from the geometry of the structure above that the structure would be close to

    the maxima on the energy profile or perhaps closer to the formation of the

    intermediates 5.4 or 5.6 (see Figure 5.2). Note the carbonyl carbon (C5) is bending

    out of plane, starting to become sp3 hybridised. The reaction coordinate C9 and C13

    was scanned from 1.7 Å to 2.2 Å since the TS involved the formation of the bond

    between atoms C9 and C13. The output of the Scan optimization is shown below as

    5.3.1 (Scan end).

    http://www.cscs.ch/molekel

  • 37

    5.3.1 (Scan end)

    On closer inspection of the Scan end file, it is clear that, as the reaction coordinate

    between atoms C9 and C13 increased, the CN group moved away from the cage, and

    the distance between atoms C5 and O15 increased accordingly. The carbonyl carbon

    (C5) is again sp2 hybridised. The reaction profile of the step-wise increase in bond

    length between atoms C9 and C13 is graphically presented in Figure 5.3.

    0.0

    1.0

    2.0

    3.0

    4.0

    5.0

    6.0

    1.6 1.8 2 2.2

    Reaction Coordinate/Å

    Rel

    ati

    ve

    En

    erg

    y/k

    cal

    mo

    l-1

    Figure 5.3 Graphical representation of energy vs reaction coordinate for

    structure 5.3.1.

    Figure 5.3 shows that a minimum energy value occurs at 1.9 Å and a maximum

    energy at 2.1 Å. Closer examination of the structure at the maxima of 1.8 Å and the

    structure at the minima of 1.9 Å suggests that it is due to the rotation of the cyano CN

    group (C13-N14), that is, the cyano group moving from the front of the cage to the

  • 38

    back, as shown in the 3-D structures below. This phenomenon is therefore not likely

    to be associated with the required transition structure - one would rather expect bond

    formation/dissociation between C9 and C13.

    The coordinates of the structure closest to the second maxima (2.1 Å) was manually

    extracted from the Scan output file. This geometry of the structure was found to be

    close to the expected TS. Thus a TS search was carried out using the structure at 2.1

    Å as approximate starting structure for a full transition-state optimization. The 3-D

    TS optimization is shown below.

    5.3.1 TS end

    The TS calculation was verified by performing a frequency (FREQ) calculation

    resulting in ONE negative eigenvalue only. The FREQ calculation was viewed using

    the GaussView61

    program, enabling the vibration mode, associated with the negative

    eigenvalue, so that the bond formation between the cyano group (C13-N14) and C9 is

    clearly visualized. The vibration associated with the imaginary frequency shows the

  • 39

    cyano group (C13) moving towards the cage to form a bond with C9. It is also clear

    that the carbonyl carbon (C9) is converted to a sp3 carbon while O15 is moving

    towards C5 as the nucleophile C13 is approaching the carbonyl carbon C9.

    5.3.2 Transition Structure 5.5

    Finding transition structure 5.5, (see Figure 5.2), involved monitoring the

    intramolecular transfer of the hydrogen atom (H26) between the two oxygen atoms

    (O12 and O15). Shown below is the 3-D structure (5.5-Scan start) which was used

    as the starting structure for the relaxed scan.

    5.5 (Scan start)

    The bond was scanned from 2.2 Å to 1.4 Å. The output 3-D structure 5.5 (Scan end)

    is shown below.

  • 40

    5.5 (Scan end)

    It is clear that the hydrogen (H26) is transferred during the Scan calculation. The

    structure 5.5 (Scan end), above, when compared to the 5.5 (Scan start) shows that

    the atoms C9-C13-N14 keep a linear bond as expected. Atoms C5 and O15 forms a

    bond as the distance between atoms O12 and H26 decreases as a result of the H-

    transfer. Atom H26 orientates itself between atoms O12 and O15. The scan is

    graphically depicted below.

    0.0

    5.0

    10.0

    15.0

    20.0

    25.0

    30.0

    35.0

    1.2 1.4 1.6 1.8 2 2.2

    Reaction Coordinate/Å

    Rel

    ati

    ve

    En

    ergy/k

    cal

    mol

    -1

    Figure 5.4 Graphical representation of energy vs. reaction coordinate for

    structure 5.5.

    The corresponding structure closest to the maximum in Figure 5.4 is at 1.6 Å. The

    structure at this bond length was manually extracted from the scan output file and

  • 41

    submitted as input structure for a TS optimization. The resultant TS structure that

    was obtained at the end of the TS optimization is represented below.

    5.5 TS end

    The TS structure was verified to be correct since the FREQ calculation produced only

    one negative eigenvalue and the movement of atoms associated with the negative

    eigenvalue correspond to the expected movement of atoms on the reaction profile.

    This movement includes transfer of atom H26 between atoms O15 and O12 and bond

    formation/dissociation between C5-O15 as discussed above.

    5.3.3 Transition Structure 5.7.1

    The transition structure depicted as structure 5.7.1 was complex and difficult to find

    as will be described next.

    In structure 5.7.1 the calculations involved monitoring the progress of the hydroxyl

    group (O27-H28) attaching to the cage and the breaking of the C9-O15 (ether/acetal)

    bond.

  • 42

    5.7.1 (Scan start)

    The graphical presentation of the relaxed scan, that is, the decrease of the reaction

    coordinate as the hydroxyl group (O27-H28) attaches to C9 from 2.4 Å to 1.3 Å is

    shown below.

    -20.0

    -10.0

    0.0

    10.0

    20.0

    30.0

    40.0

    1.2 1.4 1.6 1.8 2 2.2 2.4 2.6

    Reaction Coordinate/Å

    Rel

    ati

    ve

    En

    erg

    y/k

    cal

    mo

    l-1

    Figure 5.5 Graphical representation of energy vs. reaction coordinate for

    structure 5.7.1.

    This transition structure is also an example of a case for which the graph exhibits a

    maximum, but is not necessarily indicative of the required transition-state. The

    explanation of how the "transition-state" was found follows. The maximum in Figure

    5.5 is at 1.8 Å. Thus the transition-state optimization for structure 5.7.1 was carried

    out using the structure closest to this maximum bond length (1.8 Å). The 3-D input

    structure (5.7.1 TS1 start) which was used for the transition structure optimization

  • 43

    and the corresponding 3-D non-restricted output structure (5.7.1 TS1 end) that was

    obtained are shown below.

    The input structure, 5.7.1 TS1 start, had the hydro