Furka Combinatorial Chemistry

Embed Size (px)

DESCRIPTION

Furka Combinatorial ChemistryFurka Combinatorial ChemistryFurka Combinatorial ChemistryFurka Combinatorial ChemistryFurka Combinatorial Chemistry

Citation preview

  • rpd Furka

    Combinatorial Chemistry

    Principles and Techniques

  • rpd Furka

    Combinatorial Chemistry

    Principles and Techniques

  • Published by rpd Furka in electronic form Budapest 2007

    rpd Furka, 2007

  • v

    Preface

    Combinatorial technologies that were invented in 1980s provided a possibility to produce new compounds in practically unlimited number. New strategies and technologies have also been developed that made possible to screen very large number of compounds and to identify useful components of mixtures containing millions of different substances. This dramatically changed the drug discovery process in the pharmaceutical industry and the way the researchers design their experiments. Instead of preparing and examining a single compound, families of new substances are synthesized and screened. In addition, combinatorial thinking and practice proved to be useful in areas outside the pharmaceutical research. Such area are, for example, search for more effective catalysts and materials research. Combinatorial chemistry became an accepted new branch within chemistry. It is the subject of numerous books, journals, international conferences and university courses. This book is written for university students and young researchers. The author feels it important to make it freely available for all potential readers. For this reason the book will be published exclusively in electronic form that can be downloaded from appropriate Web sites free of charge. The author wishes to express his appreciation to Dr. Jzsef L. Margitfalvi of the Central Chemical Research Institute of the Hungarian Academy of Sciences and Dr. Gyrgy Kri of Gedeon Richter Ltd, Budapest for reading parts of the manuscript and for their valuable suggestions. The mother tong of the author is Hungarian. Despite all efforts the text obviously contains grammatical errors. Correction of these errors, of course, would be important. The help of the readers in this respect would be highly appreciated. If you can help please contact the author by e-mail: [email protected]. rpd Furka

  • vi

  • vii

    Table of Contents

    Preface............................................................................ v Table of Contents....................................................................................................... vi 1. Introduction............ 1 1.1. Birth of the combinatorial approach................................................................... 2 1.2. The translated version of the document notarized in 1982................................. 5 1.3. Publication of the split-mix combinatorial synthesis.......................................... 13 References.................................................................................................................. 14 2. The solid phase synthesis....................................................................................... 15 2.1. Solid supports...................................................................................................... 18 2.1.1. Crosslinked polystyrene................................................................................... 18 2.1.2. Polyethylene glycol (PEG) grafted supports.................................................... 19 2.1.3. Inorganic ports................................................................................................. 20 2.1.4. Non-bead form supports.................................................................................. 20 2.2. Linkers, anchors.................................................................................................. 20 2.3. Protecting groups................................................................................................ 22 2.3.1. Protection of amino groups.............................................................................. 23 2.3.2. Protection of carboxyl groups.......................................................................... 24 2.3.3. Protection of other functional groups.............................................................. 24 2.3.4. Coupling reagents for peptide synthesis.......................................................... 25 2.4. Solid phase synthesis of organic molecules........................................................ 26 2.5. Solid phase reagents and scavenger resins in solution phase synthesis.............. 27 References.................................................................................................................. 28 3. Parallel synthesis. Synthesis of compound arrays based on saving reaction time............................................................................................... 29 3.1. The parallel synthesis.......................................................................................... 30 3.1.1. The multipin metod of Geysen........................................................................ 31 3.1.2. The SPOT technique of Frank......................................................................... 32 3.1.3. Other devices for parallel synthesis................................................................. 33 3.1.4. Parallel synthetic methods with reduced number of operations...................... 36 3.1.4.1. Synthesis of oligonucleotides on paper discs................................................ 36 3.1.4.2. The tea-bag synthesis.................................................................................... 37 3.2. The Ugi multicomponent reactions..................................................................... 37 3.3. Solution phase combinatorial synthesis.............................................................. 39 3.3.1. Dendrimer supported synthesis........................................................................ 39 3.3.2. Separations using fluorous tags and fluorous solvents.................................... 40 3.3.3. Application of solid phase reagents................................................................. 40 3.3.4. The use of scavengers in solution phase reactions........................................... 41 3.4. Automation in parallel synthesis......................................................................... 42

  • viii

    3.4.1. Automatic parallel synthesizers....................................................................... 42 3.4.2. Quality control................................................................................................. 45 3.4.3. Parallel purification.......................................................................................... 47 3.4.4. Manufacturers of laboratory robots................................................................. 48 References.................................................................................................................. 52 4. Combinatorial synthetic methods.......................................................................... 55 4.1. Combinatorial synthesis on bead-form resin...................................................... 55 4.1.1. The split-mix synthesis.................................................................................... 55 4.1.1.1. The key features of the split-mix synthesis.................................................. 56 4.1.1.2. Encoding of beads in the synthesis of organic libraries............................... 61 4.1.1.3. Realization of the split-mix synthesis.......................................................... 64 4.1.1.4. Automation of the split-mix synthesis......................................................... 66 4.1.1.5. Preliminary considerations when planning experiments with peptide libraries......................................................................................................... 71 4.1.1.6. Full and partial libraries............................................................................... 75 4.1.1.7. Unusual partial libraries............................................................................... 79 4.1.1.8. Binary synthesis using the split-mix procedure........................................... 80 4.1.2. Combinatorial synthesis using amino acid mixtures....................................... 82 4.2. Combinatorial synthesis using soluble support.................................................. 83 4.3. Combinatorial synthesis on solid surface........................................................... 84 4.4. Combinatorial peptide synthesis by biological methods.................................... 86 4.5. Combinatorial synthesis using macroscopic solid support units........................ 87 4.5.1. Encoding by attached labels. The radiofrequency and optical encoding methods............................................................................................ 88 4.5.2. Units without labels. Encoding by position in space....................................... 92 4.5.2.1. The Encore technique................................................................................... 92 4.5.2.2. The String Synthesis..................................................................................... 93 4.5.2.3. String synthesis of cherry picked libraries................................................... 104 4.6. Examples............................................................................................................ 110 4.6.1. Split-Mix Synthesis of an encoded benzimidazole library............................. 110 4.6.2. Synthesis of a 10,000 member piperazine 2-carboxamide . library by Directed Sorting.............................................................................. 115 4.6.3. Synthesis of two libraries on one support........................................................ 117 References.................................................................................................................. 118 5. Screening methods................................................................................................. 121 5.1. High throughput screening of arrays of individual compounds.......................... 122 5.2. Screening of combinatorial libraries. Deconvolution methods........................... 124 5.2.1. Deconvolution methods for dissolved libraries............................................... 124 5.2.1.1. The iteration method..................................................................................... 124 5.2.1.2. Positional scanning....................................................................................... 129 5.2.1.3. Omission libraries......................................................................................... 133

  • ix

    5.2.1.4. The amino acid tester libraries...................................................................... 135 5.2.1.5. Other methods for identification of the bioactive component of combinatorial libraries.................................................................................. 138 5.2.1.6. Dynamic combinatorial libraries................................................................... 138 5.2.1.7. Examples....................................................................................................... 138 5.2.2. Deconvolution methods of libraries tethered to the solid support................... 145 5.2.2.1. Screening of combinatorial libraries in tethered form.................................. 146 5.2.2.2. Screening of combinatorial libraries by releasing the content of individual beads intosolution........................................................................ 147 5.2.2.3. Examples...................................................................................................... 148 References................................................................................................................. 150 6. Combinatorial methods in materials and catalyst research................................... 151 6.1. Inorganic materials............................................................................................. 151 6.1.1. Preparation of thin film libraries...................................................................... 151 6.1.2. Screening......................................................................................................... 154 6.2. Heterogeneous catalysts...................................................................................... 155 6.2.1. Fabrication and testing of catalyst libraries..................................................... 156 6.2.2. Catalyst library design..................................................................................... 161 6.3. Polymers............................................................................................................. 164 References................................................................................................................. 168 7. Computational aspects of library design and synthesis......................................... 171 7.1. Software companies............................................................................................ 176 References.................................................................................................................. 182 Index.......................................................................................................................... 183

  • 1

    1. Introduction The discovery of new materials played an important role in the history of mankind. Many

    discovered materials had effect on every days life. The impact of some of these materials was so definitive that they gave the name of long historical eras. So bronze gave the name for Bronze Age, for example, and iron for the Iron Age.

    The life today is also largely affected by the materials we use. The standard of life could not be the same without semiconductors, insulators, adhesives, synthetic fibers, drugs, pesticides, paints etc. In order to improve our life, more and more useful materials and compounds need to be discovered. The question is how to do that? When we need a new bridge or want to build a skyscraper, for example, first these objects are designed then they are built according to the plans. Can we follow this route when we wish to make a new super conductor or a new drug? Certainly not. Our theoretical knowledge may be sufficient for designing a bridge or a skyscraper but is definitely not enough for designing a new more effective drug or designing a super conductor working at or near room temperature. We do not know exactly how the super conducting or other important properties of materials depend on their structure. The drugs exert their effects by interactions with proteins or other molecules found in living organisms. The rules governing these interactions, however, are largely unknown. The rational design of drugs had some successes. The drug candidates are designed in computers based on the already known three dimensional structures of target proteins. Both the ligand molecule and the protein itself can take up a practically unlimited number of conformations, and that leads to difficulties. The consequence is that mostly the traditional approach is followed: series of compounds are synthesized then the useful drug candidates are identified by trial and error. In practice, thousands of compounds are needed to be prepared and tested in order to find a drug candidate.

    In the pharmaceutical research one of the bottlenecks was the synthesis of the very large number of compounds needed in the discovery process. Before 1980 the traditional approach was used. The compounds were prepared one at a time, and their testing were also carried out one by one. In the industry, however, sophisticated methods were developed and applied in order to improve productivity in the mass production of goods. It seems worthwhile to compare the production of compounds to that of automobiles. Compounds are mostly prepared step by step from the starting materials. The automobiles are also assembled from parts. The drug candidates are unique substances all differing from each other. The automobiles are also unique products since they can differ, for example, in their color, in their engines, in their transmission etc. They certainly differ from each other in their locks and keys.

    The first car manufacturers in the world were Panhard & Levassor in 1889 and Peugeot in 18911. These French manufacturers did not standardize their car models, each car was different from the other. The first standardized car was the Benz Velo. Benz manufactured 134 identical Velos in 1895.

    Ransome Eli Olds invented the basic concept of the assembly line in 1901 that was improved Henry Ford and installed it in his car factory in 1913. As a result, by 1927, 15 million Ford Model Ts had been manufactured. As a result of further improvements and application of

  • 2

    automation, today the streets are full of cars. This shows the power of organizing the process of production and application of automation. These methods that proved to be very successful in industry were not applied at all in the mass production of compounds. After 1980 the situation began to change. Several innovative papers were published which radically changed our theory and practice in designing and preparing new substances for pharmaceutical research and other areas of application. The new synthetic and screening procedures and, which is also very important, the new way of thinking introduced in these papers founded a rapidly growing new scientific field, Combinatorial Chemistry, revolutionized the pharmaceutical research and are gradually expanding to other areas within and outside chemistry. The new methods were developed in several laboratories. The way of thinking that led to these methods was probably different in all cases. The reasons that lad to the development of the combinatorial synthesis of peptides in the author's laboratory is described below. 1.1. Birth of the combinatorial approach

    In 1964/65 I was a post doctoral fellow at the University of Alberta, Canada. I worked on

    a project led by Professor L. B. Smillie which resulted in determination of the amino acid sequence of a pro-enzyme, chymotrypsinogen-B2. After returning to Budapest, I was wondering from how many sequence possibilities did we choose the right one. Since the protein comprised 245 amino acid residues and any of these positions could be occupied by any of the 20 different natural amino acids, the number of possible sequences, as expressed by a simple formula, amounted to 20245 (=5.65x10318) combinations. This certainly seemed to be a very big number, much-much bigger than, for example, the number of molecules in 1 mole substance (6.02x1023) but in order to really perceive its magnitude it had to be compared to something which was also very big. Finally I found an estimate of the mass of the universe based on an Einstein formula3. I also calculated the mass of a protein mixture in which each sequence variant of the 245 amino acids is represented by only a single molecule.

    Estimated mass of the universe: ~1053 kg

    Mass of the protein mixture: ~10295 kg The comparison showed that the mass of the protein mixture would exceed that of the

    universe by more than two hundred orders of magnitude. The number of possible protein sequences also seems striking if it is compared to the estimated number of elementary particles in the visible universe4.

    Number of sequences in the mixture: 5.65x10318

    Number of elementary particles: ~1088 This was my first meeting with the immense diversity of molecules and the result shocked

    me. Many years later I applied the same simple formula (20n, where n is the number of amino acid residues) to calculate the number of theoretically possible sequences in peptide families built up from the 20 natural amino acids. Such collections of compounds are named libraries. The results are listed in the second column of Table 1.1.

  • 3

    Although the figures expressing the number of components in peptide libraries were far from being as frightening as the number of the possible protein sequences, they seemed still very large if the possibility of their synthesis was considered. I thought that many useful bioactive peptides could - supposedly - be found among the largely unknown components of the libraries. For this reason the nonexistent peptide libraries reminded me of exceptionally rich gold reefs which await exploitation. Gold can be produced by mining out all the gold containing rock then separating the gold from the useless stone.

    Table 1.1. The number of possible peptide sequences.

    Number of residues

    Name Number of sequences

    2 Dipeptides 400 3 Tripeptides 8,0004 Tetrapeptides 160,0005 Pentapeptides 3,200,0006 Hexapeptides 64,000,0007 Heptapeptides 1,280,000,000

    Exploitation of the peptide libraries could be achieved via the synthesis of all possible sequences followed by screening them against all potential targets. At that time, however, even the synthesis of all, say, pentapeptides seemed absolutely impossible. We usually prepared one peptide at a time mostly by solid phase synthesis (see later the details of this synthetic method) with an elongation rate of one amino acid a day.

    Figure 1.1. The optimized synthesis of peptides from three amino acids (A, E and R). The solid

    support is represented by . With this rate, the synthesis of all the 3.2 million pentapeptides would have taken 3.2x5=16 million days, that is, 43.8 thousand years of uninterrupted work.

    A A AR

    E E E R R R

    AA

    A E R

    A E R

    EA RA AE EE RE AR ER RR

    A E R A E R A E R

  • 4

    The synthesis could have been optimized by reducing to an absolute minimum the number of necessary coupling steps. This can be achieved by using the already prepared peptides as starting materials in the synthesis of the longer ones. This is illustrated in Figure 1.1. A peptide library is prepared by solid phase synthesis using three amino acids (A, E and R). First the amino acids are attached to the solid support (resin). Then the resin containing one of the attached amino acid is divided into three portions and the synthesis is continued with the coupling of one of the amino acids to one of the resin portions and so on. In the first step 3 couplings are carried out, exactly the number of the formed products. In the second step 9 couplings are needed and 9 dipeptides are formed on the resin samples. In general, the number of coupling steps in such an optimized synthesis of a peptide library is the same as the total number of products formed in the whole synthetic process. If the 3.2 million pentapeptides are prepared the number of coupling steps is the sum of amino acids + dipeptides + tripeptides + tetrapeptides + pentapeptides.

    20 + 400 + 8,000 + 160,000 + 3,200,000 = 3,368,420

    Supposing again the rate of one coupling per day in order to get the necessary time in

    years, the above figure is divided by 365. The result is 9,228 years. This shows that optimization of the synthesis reduces the time of the synthetic process from 43,800 years to 9,228 years, which is still too long to be realizable. I considered the accessibility of all peptide sequences to be very important, and around 1980 I began to think about potential solutions for their synthesis. It took only a short time to find one, which would work at least in principle. The idea was also based on the method of solid phase synthesis developed by professor Merrifield5. According to this first idea, the amino acids used in the solid phase preparation of peptides would be replaced by an equimolar mixture of 20 different amino acids in every coupling step of the synthesis. This would lead - at least in principle - to formation of a rapidly growing number of sequences and finally a full peptide library could be cleaved from the support in the form of a mixture. It was clear, however, that in such couplings the products are expected to form in unequal molar quantities as a consequence of the differences in the reactivity of the amino acids. The differences in molarities would be amplified in each successive coupling step leading to a mixture with uncertain composition. I felt that a better solution might exist and I was rethinking the problem again and again. In early spring in 1982 I spent a weekend in a little town in South-East of Hungary forgetting this time the whole diversity problem. To my great surprise, however, next morning I awoke with the perfect solution in my mind. The method based on this idea is known nowadays as the split-mix procedure.

    The split-mix method opened the possibility for producing peptide mixtures containing millions of components. Such mixtures, however seemed unacceptable in the conventional drug discovery practice where single compounds were used in pure form. For this reason there was an urgent need to present in addition, an efficient strategy for identification of the bioactive substance that may be present in the complex synthetic mixture. This task, however, looked similar to finding the proverbial needle in a huge haystack. Nevertheless I could develop a theoretical solution in a very short time. I called it synthetic back searching strategy which later proved to be in principle identical with the "iteration strategy", published by others.

    I was fully aware of the importance of the combinatorial approach in the pharmaceutical research but one of the leading Hungarian pharmaceutical companies I contacted showed no interest at all. In addition, the split-mix method was considered by the patent attorneys only as a potential research tool and for this reason it was judged not to be patentable. They suggested me,

  • 5

    however, to describe the method in a document and - in order to give me some support in potential future priority disputes - notarize it. I did so and the document written in Hungarian - in which the principles of combinatorial chemistry including both synthesis and screening were first clearly explained - was notarized in May, 1982. The photo of the first and last pages of the document is demonstrated in Figure 1.2.

    Figure 1.2. The photo of the first and last page of the 1982 document The 1982 document, as shown in the Figure 1.2, was written in Hungarian. This is the first authentic document in which the principles of combinatorial chemistry are described. The translated version can be seen below. 1.2. The translated version of the document notarized in 1982 STUDY ON POSSIBILITIES OF SYSTEMATIC SEARCHING FOR PHARMACEUTICALLY

    USEFUL PEPTIDES

    Written by Dr. rpd Furka, university professor Budapest, May 29, 1982

    As exemplified, among others, by the peptide hormones discovered so far, the shorter-

    lengthier peptides take part in a number of important functions in the living organism. It can be supposed, that only a small fraction is known of these biologically active peptides having

  • 6

    potential therapeutic effect. This fact motivates the intensive international and domestic research activity in this field.

    Two, in principle different, approaches offer themselves for searching for peptides bearing new biological effects:

    1. Isolation of peptides from living organisms based on their previously known biological effects. 2. Preparation of peptides by synthesis with post determination of their biological effects.

    Until now the isolation procedure proved to be more effective in spite of the fact that this method is also very laborious. This may be explained by the fact that the number of possible peptides grows rapidly with the number of residues so even the synthesis of all tetrapeptides (160 thousands) seems to be a hopeless task. If we consider the 20 natural amino acids the dependence of the number (Nn) of possible peptides on the number of residues (n) is expressed by the following formula:

    Nn = 20n If the n-residue peptides are synthesized stepwise and independently, the number of the required synthetic steps (Sn) can be calculated as follows:

    Sn = (n-1) 20n It is noted, that a synthetic step means a complete coupling cycle, that is, in addition to the coupling step itself incorporates the operations connected with the protecting groups, too. With good organization, that is, choosing a systematic synthesis route the number of synthetic steps can be reduced. The minimum number of synthetic steps is:

    n

    i

    inS

    220

    The synthesized peptides are supposed to be submitted to screening tests. Since several tests have to be done on each peptide, the total number of the required screening tests is hopelessly large. If the number of kinds of screening tests is denoted by t, the total number of screening tests is expressed by the following equation:

    Tn = t 20n Table 1.2 shows the possible number of peptides depending on the number of residues, the number of synthetic steps required for their synthesis, and number of the screening tests, calculating with 10 different tests (t=10). The figures - which are rounded - clearly show, that even the synthesis and testing of all tripeptides would be an almost hopeless venture. Because of the very large number of possible peptides, the stepwise synthesis of all peptides - even in the case of small ones - is an unrealizable task. The large number of the screening experiments constitutes a further problem. The proposal to be outlined on the next pages will try to somewhat improve this almost hopeless situation.

  • 7

    Table 1.2. Possible number of peptides (Nn ) containing different number of residues (n), the number of synthetic steps required for their synthesis (Sn ) in an optimized

    process, furthermore the number of screening experiments (Tn ) calculating with 10 different screening tests (t=10)

    (the figures are rounded)

    n Nn Sn Tn 2 4 hundred 4 hundred 4 thousand 3 8 thousand 8 thousand 80 thousand 4 160 thousand 168 thousand 2 million 5 3 million 3 million 30 million 6 64 million 67 million 640 million 7 1 billion 1 billion 13 billion 8 25 billion 26 billion 256 billion 9 512 billion 537 billion 5 trillion

    10 10 trillion 10 trillion 102 trillion Systematic search for biologically active small peptides through synthesis and screening of

    peptide mixtures

    The proposal to be outlined here constitutes a research project which makes possible to search for biologically active peptides with much greater chance than before. When I write down this project I'm fully aware of its potential importance in industry. It is also clear, that it's realization is possible only through cooperation of different institutions. Primarily the participation of the pharmaceutical industry is desirable since the investments can be recovered through pharmaceutical industry.

    The essence of the proposal is that instead of one by one synthesis of peptides, peptide mixtures should be prepared containing several hundred or several thousand peptides in approximately 1 to 1 molar ratio, and these peptide mixtures should be submitted to screening tests. It will be shown that on this way much labor can be saved both in the synthetic work and in the screening experiments. In the first stage one has to determine whether or not the mixture shows any biological effect. If biological effect is observed, of course, it has to be determined which component (or which components) are responsible for the activity.

    Method for synthesis of peptide mixtures

    Since not single peptides but rather mixtures of peptides are synthesized, post synthetic

    purification and removal of by-products are out of question. Because of this, the classical method of synthesis (in solution) can not be used either. In the synthesis of peptide mixtures the solid phase method has to be applied. It is noted here, that in the syntheses not necessarily the 20 amino acids are used. In some cases more than 20 amino acids may be used, for example if - in addition - non-common amino acids are intended to be used as building blocks. Less than 20

  • 8

    amino acids may be used, for example, in decapeptides, since the synthesis of all peptides seems to be unrealistic and have to compromise with the use of fewer kinds of amino acids. Let denote by k the number of the amino acids intended to vary in the i-th position. The numbers of amino acids varied in the C-terminal and N-terminal position are k1 and kn , respectively. Realization of the synthesis

    The resin is divided into k1 equal portions (that is to as many portions as many amino

    acids are intended to vary at the C-terminal of peptides). Then each portion of resin is coupled with one of the k1 kinds of amino acids then the amino-protecting group is removed from every sample. A small quantity is removed from every sample and they are taken aside for later use, then the samples are thoroughly mixed. Then the mixture of aminoacyl resins is divided into k2 equal portions and each of them is coupled with one of the k2 kinds of protected amino acids then the amino-protecting groups are removed from each sample. Before mixing, again small samples are removed and taken aside. The mixture of dipeptides is cleaved from a small portion of the mixed resin to use it in biological tests. The rest of the mixed resin is divided into k3 equal parts and the amino acids intended to occupy the third position are coupled to them. Then the synthesis is likewise continued until the mixture of n-residue peptides is reached.

    It is worthwhile to add some notes. As in an ordinary solid phase synthesis, one has to make an effort to achieve good conversion by applying the reagents in excess. Fortunately, however, conversions lower than 100%, or minor unwanted splitting reactions do not cause so serious problems like in ordinary syntheses. The labour requirement could be significantly reduced by using mixtures of properly protected amino acids in acylation reactions. This, however, does not seem to be an acceptable solution because of the differences in the reactivity of the activated amino acids which would lead to the formation of peptides in significantly different concentrations thus causing problems in the screening experiments. Formation of peptides in equal concentrations can only be assured by mechanical mixing of samples followed by dividing into equal portions. This makes possible a complete conversion for every amino acid component. Possibility of acylations with mixtures of several amino acids of identical reactivity might be a matter of further considerations. Smaller differences in reactivities could be compensated by properly selected molar ratios of the amino acid derivatives of the mixture. In the following calculations, however, the possibility of acylations with the mixtures of amino acid derivatives will be left out of considerations.

    The number of peptides formed in the synthesis, that is, the number of components in the peptide mixtures - in a general case - can be calculated by the following formula:

    Nn = k1.k2 . . . . . . kn-1.kn If the same number (k) of amino acids are varied in every position

    Nn = kn The number of synthetic steps in the synthesis of a peptide mixture containing Nn peptides (considering the attachment of the first amino acid to the resin as separate step) is:

    Sn = k1 + k2 + . . . . + kn-1 + kn

  • 9

    If the same number (k) of amino acids are varied in each position,

    Sn = nk The formulae show the advantage of the synthesis of peptide mixtures: the number of the synthetic steps can be calculated by summing the numbers of the varied amino acids, while the number peptides is given by the product of the numbers of the varied amino acids. One example: the synthesis of the mixture of tetrapeptides prepared by varying the 20 kinds of amino acids, needs only 80 synthetic steps! It is noted, that in the same run all shorter peptides - that is the 400 dipeptides and the 8000 tripeptides - are formed, too. The traditional synthesis of these peptides would need 168 400 synthetic steps. A different comparison: in the traditional method with 80 steps only about 30 tetrapeptides can be synthesized.

    Screening of peptide mixtures

    Peptides mixtures - in the first approximation - are synthesized to determine whether or not they contain biologically active component. It is supposed - although it needs experimental verification - that screening experiments can be made with mixtures, too. This offers great advantage over the traditional method since the number of screening tests is reduced by a factor equal to the number of components of the mixture. For example, the mixture of the 8000 tripeptides can be examined by a single series of tests. If there is active peptide among them, one of the executable t tests gives positive result. If the number of active peptides is more than one, then, of course, more tests may give positive result. In the synthesis of the mixture of n-residue peptides it is wortwhile to test the shorter peptides, too. The synthesis is so designed to allow for this. Taking this requirement into account, and the number of kinds of tests being t, the total number of the executable tests is:

    Tn = t(n-1)

    Although this equation certainly holds, its realization in practice deserves some notes. There is - without any doubt - an upper limit in the number of components of the peptide mixtures to be submitted to screening tests. It is difficult to estimate this number without experiments. The mixtures may probably contain many thousands of components, and as it can be judged today, the method outlined above is rather limited by possibilities of screening tests than by the number of the required synthetic steps. If there are too many components in the mixture, too large samples have to be applied in the screening experiments to achieve observable effect for a single component. The mixture supposedly contains a number of more or less active analogs and their effect is probably summarized. Nevertheless, an unsurpassable limit in the number of components certainly exists. Therefore in certain cases may prove useful to examine the effect of the n-residue mixtures without final mixing. In other cases the synthesis should be designed so not to surpass the optimal number of components. "Back-searching" for the active peptide

    If the peptide mixture is detected to contain active component, that is, if the mixture

    shows a new type biological effect, then the further task is the isolation and structure determination of the active peptide followed by its synthesis. Once the mixture containing the

  • 10

    active component or components is in our hand the isolation can be carried out using the effective separation methods, since these make possible to separate the active compound even from thousands of inactive components. It is possible, however, to follow a different method, too. This will be outlined here. This approach to the identification of the active peptides is supposed to be less tedious then the isolation methods, moreover it supplies additional information concerning the structure-effect relationship. Applicability of the method requires a procedure for quantitative determination of activity. For the sake of simplicity let's suppose that the mixture contains a single effective component (besides analogs having the same kind of effect but smaller activity). Back-searching step No. 1

    The experiments are started with the kn samples taken aside in the synthesis of the n-

    residue peptides before final mixing. The mixtures of n-residue peptides are cleaved from each resin sample. The mixtures of peptides differ from each other only in the n-th (that is the N-terminal) residue of their component peptides. Each peptide mixture is submitted to a quantitative activity determination. This shows how the activity depends on the terminal amino acid residue, that is, this way we can determine the N-terminal residue of the active peptide, and in addition it will show the effect of its replacement by other amino acid residues. Let's suppose, for example, that the N-terminal residue in the sample showing the highest activity (as well as in the active peptide) is Phe (phenylalanine). It is noted here that if there are several samples showing equally high activity it is practical to choose as the N-terminal residue of the active peptide the cheapest or the synthetically less problematical amino acid. This note holds for the subsequent back-searching steps, too.

    Back-searching step No. 2

    The experiment is continued with the kn-1 samples taken aside in the synthetic stage of the

    (n-1)-residue peptides. The amino acid determined before, that is Phe in our example, is coupled to each sample. Cleavage of the peptides from the support gives kn-1 different peptide mixtures. Their common feature is that every peptide has Phe in the N-terminal position. By submitting the peptide mixtures to quantitative screening experiments one can determine the amino acid residue occupying position n-1 (that is, the pre-N-terminal position) in the active peptide. This experiment also shows the effect on activity of substitution of this amino acids with other ones. Let's suppose that the pre-aminoterminal amino acid is Arg (arginine). It should be noted that in this back-searching step Phe is coupled to kn-1 samples and the same number (kn-1) of screening experiments have to be done. Not all of the t kinds of tests are required, only the one proved before to be positive. Consequently the number of the synthetic steps and the number of screening experiments are the same: kn-1. It is also noted that in the previous back-searching step only screening test are done (their number is kn) synthetic steps are not needed.

    Back-searching step No. 3

    This, and the subsequent back-searching steps may be realized using two different

    approaches. The peptides in samples taken aside during the synthesis have to be elongated to contain n residues, in such way, to carry on their N-terminal section the amino acid residues assuring activity. This can be realized on two ways. Either by stepwise coupling with amino acids (in our example with protected Arg then Phe) or by coupling in a single step with a previously

  • 11

    synthesized oligopeptide having the required sequence (in our example Phe.Arg). The required synthetic steps in the two approaches significantly differ. The number of the screening experiments, however, are the same in both cases. Let's turn now to the No. 3. back-searching step.

    Stepwise elongation

    Let's take the kn-2 samples taken aside in the synthesis of (n-2)-residue peptides. Each

    sample is coupled first with protected Arg then with protected Phe. After cleaving the peptides from the support each of the kn-2 peptide mixtures are submitted to activity tests to determine the amino acid residue occupying the third position counting from the N-terminal end. The number of screening tests to be executed is kn-2. The number of the required synthetic steps is: 2kn-2. The multiplying factor preceding k is the bigger the shorter are the peptides to be elongated. The numerical value of the factor is equal to the number of amino acids to be coupled with in the elongation process.

    Elongation with oligopeptide

    A previously synthesized dipeptide (in our example Phe.Arg) is coupled to each of the kn-2

    samples taken aside and the process is continued as described above. The number of screening test is also kn-2. The number of synthetic steps (leaving out of consideration the synthesis of the oligopeptide) is also kn-2. This procedure seems to be more economical. In practice it means that the active peptide is synthesized in parallel with the screening tests using the classical method started from the N-terminus. Small fractions of the growing peptide are sacrificed in the back-searching steps. This back-searching method has the great advantage (in addition to the fact that it needs less synthetic steps) that when the back-searching procedure is finished the active peptide is synthesized, too.

    Back-searching of more than one active peptide

    In the synthetic peptide mixtures several active peptides may be present, showing

    different effects. In these cases the number of back-searching steps will be bigger by a factor equal to the number of the differing active peptides. That is, if the number of the active peptides is "a" the values deduced above are multiplied by a. It is noted that the presence in the mixture of peptides having different effects may complicate the back-searching process especially in the case of peptides with opposing effects. This, however, is not treated in details.

    The back-searching process ends when the sequences of all active peptides are determined by applying either the oligopeptide or the stepwise elongation method.

    Total number of synthetic steps and screening tests summarized

    for the whole synthetic backsearching process Number of synthetic steps using oligopeptide elongation

    In synthesis:

    n

    iin kS

    1

  • 12

    In back-searching:

    1

    1

    n

    iin kaS

    Total in synthesis and back-searching:

    1

    1)1(

    n

    inin kkaS

    If k amino acids are varied in each step: Sn = [n(a + 1) - a]k Number of synthetic steps using stepwise elongation.

    In synthesis:

    n

    iin kS

    1

    In back-searching:

    1

    1)1(

    n

    iin knaS

    Total in synthesis and backsearching:

    n

    i

    n

    iiin knakS

    1

    1

    1)1(

    If k amino acids are varied in each step:

    1

    1

    n

    in iaknkS

    Number of screening tests equally valid using the oligopeptide and stepwise elongation In synthesis: Tn = t(n-1)

    In back-searching:

    n

    iin kaT

    1

    Total in synthesis and back-searching:

    n

    iin kantT

    1)1(

    If k amino acids are varied in each step: Tn = t(n-1) + ank An example: preparation and screening of all pentapeptides N5 = 320000, n=5 k=20 t=10 a=1 Total number of synthetic steps Oligopeptide elongation 180 Stepwise elongation 300 Number of tests 140

  • 13

    Extension of the method to other types of compounds

    Applicability of the method outlined before is not restricted for only the systematic

    searching for active peptides. The same principle applies to all other sequential types of compounds, that is, when the compounds belonging to this type of compounds differ from each other only in their building blocks or the sequences of these building blocks. Among them may occur natural compounds like oligosaccharides or oligonucleotides but synthetic products may be taken into account, too. Among these later ones one may think about sequential copolymers or sequential polycondensates.

    Dr. rpd Furka university professor File number 36237/1982 I certify this stitched document comprising 14, that is, fourteen pages was subscribed in my presence by Dr. Arpad Furka, university professor, with his own hands. Budapest, 1982. Nineteen hundred and eighty two, June 15, (fifteen). Dr. Judit Bokai state notary public 1.3. Publication of the split-mix combinatorial synthesis The split-mix method was published in 1988 as posters on two international congresses. First on the 14th International Congress of Biochemistry in Prague6, then on the 10th International Symposium of Medicinal Chemistry, Budapest, Hungary7. The manuscript for publication in print was submitted in February 1990 to the International Journal of Peptide and Protein Research. The paper appeared in June 19918. It was unusual, however, that within the 16 month passed between submittance of the manuscript and appearance of the paper four patents were filed on the subject of the split-mix synthesis and the author of one of the patents was the Editor in Chief of the journal where the paper was submitted. Also soon after our paper appeared two other papers were published (in September 1991) on the same topic9,10 also with the Editor in Chief among the members of the authors of one of the publications10. More can be read on this subject in a paper published in Periodica Polytechnica Ser. Chem11 http://www.pp.bme.hu/ch/index.html (year 2004) and in the home page of the author of this book12. http://www.szerves.chem.elte.hu/furka

  • 14

    References 1. http://inventors.about.com/library/weekly/aacarsassemblya.htm 2. L. B. Smillie, . Furka, N. Nagabhushan, K. J. Stevenson, C. O. Parkes Nature 1968, 218,

    343. 3. A. Einstein The meaning of relativity, Princeton University Press, 1955, 5th Ed., Princeton,

    NY, p. 107. 4. A. Linde Scientific American 1994, November, p. 48. 5. R. B. Merrifield J. Am. Chem. Soc. 1963, 85, 2149. 6. . Furka, F. Sebestyn, M. Asgedom, G. Dib, In Highlights of Modern Biochemistry,

    Proceedings of the 14th International Congress of Biochemistry, VSP. Utrecht, The Netherland, 1988, Vol. 5, p 47.

    7. . Furka, F. Sebestyn, M. Asgedom, G. Dib Proceedings of the 10th International Symposium of Medicinal Chemistry, Budapest, Hungary, 1988, p 288, Abstract P-168.

    8. . Furka, F. Sebestyn, M. Asgedom, G. Dib Int. J. Peptide Protein Res. 1991, 37, 487. 9. R. A. Houghten, C. Pinilla, S. E. Blondelle, J. R. . Appel, C. T. Dooley, J. H. Cuervo Nature

    1991, 354, 84. 10. K. S. Lam, S. E. Salmon, Hersh E. M, V. J. Hruby, W. M. Kazmierski, R. J. Knapp Nature

    1991, 354, 82. 11. . Furka, I Hargittai PERIODICA POLYTECHNICA SER. CHEM. 2004, 48, No. 1, p. 13.

  • 15

    2. The solid phase synthesis

    The solid phase synthesis is very important in combinatorial chemistry since most of the combinatorial synthetic procedures are based on this method. The solid phase synthesis was developed by Merrifield1 and demonstrated in the synthesis of peptides. Peptides and proteins are built up from -amino acids. The structure of the -amino acids is expressed by the following general formula where R is the side chain by which the -amino acids differ from each other:

    Table 2.1. The natural -amino acids

    Name Side chain -R

    Three letter symbol One letter symbol

    Alanine -CH3 Ala A Arginine -(CH2)3NH(C=NH)NH2 Arg R Asparagine -CH2CONH2 Asn N Aspartic acid -CH2COOH Asp D Cysteine -CH2SH Cys C Glutamine -(CH2)2CONH2 Gln Q Glutamic acid -(CH2)2COOH Glu E Glycine -H Gly G Histidine -CH2(4-imidazolyl) His H Isoleucine -CH(CH3)CH2CH3 Ile I Leucine -CH2CH(CH3)2 Leu L Lysine -(CH2)4NH2 Lys K Methionine -(CH2)2SCH3 Met M Phenylalanine -(benzyl) Phe F Proline Pro P Serine -CH2OH Ser S Threonine -CH(CH3)OH Thr T Tryptophane -CH2(3-indolyl) Trp W Tyrosine -(4-hydroxybenzyl) Tyr Y Valine -CH(CH3)2 Val V

    The only exception is proline in which the side chain and the amino group form a ring.

    H2N-CH-COOH

    R

  • 16

    The twenty -amino acids that are components of proteins are listed in Table 2.1. In the traditional way, peptides are synthesized in solution from properly protected amino acids.

    The carboxyl group of one amino acid is protected (by protecting group B) while the amino group is free. The amino group of the other amino acid is protected (Z) and the carboxyl group is activated (X) in order to make it capable to acylate the other amino acid.

    Figure 2.1. Solid phase synthesis of a dipeptide. 1: Attachment of the first N-protected amino acid to the solid support ( ). 2. Removal of the

    protecting group (Z) from the amino group of the attached amino acid. 3: Coupling the second N-protected amino acid to the attached one. 4: Cleaving the dipeptide from the solid support and

    removing the protecting group.

    Z-NH-CH-COOX + H2N-CH-COOB Z-NH-CH-CO-NH-CH-COOBR1 R2 R1 R2

    Cl-CH2- Z-NH-CH-COOH +

    R

    -CH2- Z-NH-CH-COO R

    -CH2- NH2-CH-COO

    R

    -CH2- NH2-CH-COO

    R

    Z-NH-CH-COX +

    R1

    -CH2- NH-CH-COO

    R

    Z-NH-CH-CO- R1

    NH-CH-COOH +

    R

    NH2-CH-CO-

    R1

    1

    2

    3

    4

    +

    NH

    COOH

    H

  • 17

    In continuation of the synthesis the amino-protecting group of the dipeptide is removed then acylated with another amino-protected and activated amino acid. The product of every synthetic step of the synthesis is usually isolated from the solution then purified. This is mostly a tedious job. In the solid phase synthesis the first amino acid is attached by its carboxyl group to a polymer support then the next amino acid is coupled to the already attached one and the rest of the amino acids are coupled sequentially to the attached peptide chain. The solid support used by Merrifield was a styrene-divinylbenzene co-polymer in fine bead form. The resin was functionalyzed by introduction of chloromethyl groups that made possible to attach the first amino acid to the resin. The solid phase method is demonstrated in Figure 2.1 using the synthesis of a dipeptide as example. In the synthetic step No. 1 the first amino acid is attached to the resin. All functional groups of the amino acid are protected (Z) except the -carboxyl group. In step No. 2 the protecting group is removed from the -amino group of the attached amino acid in order to be able to couple the second amino acid to the first one. The second amino acid is also fully protected except again the -carboxyl group which is properly activated (by group X). The coupling is realized in step No. 3. The steps No. 2 and No. 3 form a cycle that is sequentially repeated until the attachment of the last amino acid is finished. The closing step of the synthesis (step No. 4 in our example) is the cleavage of the product from the resin which usually involves the cleavage of the protecting groups from the functional groups of the peptide. It seems worthwhile to note that the synthesis can be started with a resin which already contains an attached amino acid. Such resins are commercially available.

    Figure 2.2. Reaction vessel for solid phase synthesis The solid phase reactions can be carried out in a reaction vessel demonstrated in Figure 2.2. The glass or plastic tube has a grid at the bottom that keeps the resin in the vessel when the tap is open. The tube is usually mounted on a laboratory shaker. The resin (containing the attached amino acid) is placed in the reaction vessel then all the multi step operations are carried out in the tube. First the resin is swelled by adding solvent, usually a mixture of dimethyl formamide (DMF) and dichloro methane (DCM), then the group protecting the -amino group is removed by adding a proper reagent. Following this operation the protected amino acid and the coupling reagent are added in dissolved form. During the coupling reaction, the reaction vessel is shaked. The solid phase reactions are generally slower than the solution phase counterparts and shaking shortens the reaction time. After all of the described operations the resin is thoroughly washed with solvent. The peptide can be elongated by repeating the above cycle. After finishing

    Frit Resin

  • 18

    the coupling reactions a mixture of reagents is added which cleaves the peptide from the resin and removes the protecting groups. The synthesized peptide can be recovered from the filtrate. In the solid phase synthesis the amino acids and the reagents can be added in excess to drive the reactions to completion. The excess of the amino acids and reagents can easily be removed by filtration. The coupling step can even be repeated to ensure complete conversion. The traces of the reagents are removed by repeated washings and the product of coupling remains on the filter in pure form. As outlined above the elongation of the peptide chain on the support is realized in identical coupling cycles (of course the added protected amino acid may vary from cycle to cycle). This opens the possibility of automation. In fact Professor Merrifield and his colleagues constructed and published an automatic peptide synthesizer2. Today many kinds of solid phase peptide synthesizers are commercially available. In addition, solid phase automatic synthesizers have also been developed for preparation of other kinds of organic compounds, too (see later). 2.1. Solid supports Since the seminal publication of Merrifield in 19631 different types of solid supports have been developed. The solid support, of course, have to be insoluble in the solvents used in the solid phase synthesis and it is also a requirement not to react with the reagents applied in the synthesis. In the case of most solid supports the reactions take place both inside and at the surface of the solid particles. These supports are mostly used in the form of small resin beads that swell in the solvents applied in the synthesis. The reactions in other kinds of supports take place only at the surface. These supports are used as polymer or glass beads, rods, sheets etc. and (except their surface layer) do not swell in the solvents. The solid supports are usually composed of two parts: the core and the linker. The starting compound of the synthesis is attached to the support via the linker. The core ensures the insolubility of the support, determines the swelling properties, while the linker provides the functional group for attachment of the start compound and determines the reaction conditions for the cleavage of the product. The linker itself and the covalent bond formed with the start compound must be stable under the reaction conditions of the synthesis. 2.1.1. Crosslinked polystyrene

    Cross-linked polystyrene resins are the most commonly used supports for solid phase

    synthesis. The polystyrene resins are synthesized from styrene and divinylbenzene by suspension polymerization in the form of small beads. The ratio of divinylbenzene to styrene determines the density of cross links. Higher crosslink density increases the mechanical stability of the beads. Lowering the crosslink density, on the other hand, increases swelling and increases the accessibility of the functional groups buried inside the beads. In practice, mostly 1-2% divinylbenzene is used. Crosslinked polystyrene is very hydrophobic so it swells only in apolar

    Core Linker Start compound

  • 19

    solvents. Table 2.2 shows the swelling factor (ml/g) of 1% crosslinked polystyrene in different solvents.

    Table 2.2. Swelling factor of 1% crosslinked polystyrene

    Solvent Swelling factor Solvent Swelling factor Tetrahydrofurane 5.5 Acetonitrile 4.7 Toluene 5.3 Dimethylformamide 3.5 Dichloromethane 5.2 Methanol 1.8 Dioxane 4.9 Water -

    Functional groups can be introduced into the resin by two approaches: either by post-

    functionalization of the aromatic rings of polystyrene, or by using functionalized styrene in polymerization. The bead size of the resin is an important factor that has to be considered in solid phase synthesis. The reactions are faster when small beads are used, but application of very small beads may cause problems in filtration. The bead size is characterized either by the diameter of the beads or by the inversely proportional mesh size. In practice most often the 200-400 mesh (35-75 micron) or the 100-200 mesh (75-150 micron) bead sizes are used. The bead size distribution also deserves consideration. A narrow bead size distribution is advantageous. The capacity of the polystyrene beads is around 0.5 mmol/g. 2.1.2. Polyethylene glycol (PEG) grafted supports

    PEG-grafted polystyrene has a 1-2% crosslinked polystyrene core and to its aromatic rings, polyethylene glycol chains are covalently attached. Its commercial name is Tentagel3 (Figure 2.3).

    Figure 2.3. Structure of PEG-grafted polystyrene (n ~ 70). X is a functional group (Br, OH, SH or NH2) for attachment of the substrate

    The advantage of the PEG-grafted polystyrene is that the substrate at the end of a flexible

    chain is more accessible to reagents. It behaves like being in a solution-like environment. The PEG-chains gives a hydrophilic character to the resin and swells well in water and methanol but poorly in ether or ethanol.

    O O

    X n

  • 20

    2.1.3. Inorganic supports Glass beads with controlled pore size can be manufactured and are commercially available. The glass beads can be functionalized and can be used as supports in solid phase synthesis. The mechanical stability of such glass beads surpass that of the resin beads but do not swell in solvents. Functionalized ceramics can also be used as supports. 2.1.4. Non-bead form supports Polymers can be used as supports for solid phase synthesis not only as microscopic beads but also in the form of macroscopic objects if their surface can be functionalized with groups that can serve as anchors to hold the substrate in a reasonable quantity. Using appropriate monomers like styrene or others, polyolefin chains can be grafted by radiation into the surface of the objects and the chains can be functionalized.4 Such grafted macroscopic solid support units as SynPhase crowns and SynPhase lanterns are commercially available in different sizes at Mimotopes, Australia.5 2.2. Linkers, anchors The initial building block of the compound to be prepared by solid phase synthesis is covalently attached to the solid support via the linker. The linker is a bifunctional molecule. It has one functional group for irreversible attachment to the core resin and a second functional group for forming a reversible covalent bond with the initial building block of the product. The linker that is bound to the resin is called anchor.

    The anchor can also be considered as a protecting group of one of the functional groups of the final product and, as such, it determines the reaction conditions by which the product can be cleaved from the support. A large variety of the commercially available resins contain the already built in anchor. A series of selected examples are found below.

    Merrifield resin. The Merrifield resin can be used to attach carboxylic acids to the resin. The product can

    be cleaved from the resin in carboxylic acid form using HF. Trityl chloride resin. The trityl chloride resin is much more reactive than the Merrifield resin. It can be used for

    attachment of a vide variety of compounds like carboxylic acids, alcohols, phenols, amines,

    Resin + Linker Resin Anchor

    CH2-Cl

  • 21

    thiols. The products can be cleaved under mild conditions using a solution of trifluoroacetic acid (TFA) in varying concentrations (2-50%).

    Hydroxymethyl resin. The resin can be applied for attachment of activated carboxylic acids and the cleavage

    conditions resemble that of the Merrifield resin. Wang resin. The resin is used to bind carboxylic acids. The ester linkage formed has a good stability

    during the solid phase reactions but its cleavage conditions are milder than that of the Merrifield resin. Usually 95% TFA is applied. It is frequently used in peptide synthesis.

    Aminomethyl resin. Carboxylic acids in their activated form can be attached to the resin. Since the formed

    amide bond is resistant to cleavage, the resin is used when the synthesized products are not cleaved from the support; they are tested in bound form.

    Rink amide resin. The Rink resin is designed to bind carboxylic acids and cleave the product in

    carboxamide form under mild conditions. The amino group in the resin is usually present in protected form. For attachment of the substrate first the protecting group is removed then it is reacted with the activated carboxylic acid compound. Cleavage of the product in carboxamide form can be performed with dilute (~1%) TFA.

    CH2-OH

    CH2-OH O

    O-CH3

    CH2-NH2

    O C l

  • 22

    Photolabile anchors. Photolabile anchors have been developed that allow cleavage of the product from the

    support by irradiation without using any chemical reagents. Such anchors, like the 2-nitro-benzhydrylamine resin below, usually contain nitro group that absorbs UV light.

    Traceless anchors. The initial building block of a multi-step solid phase synthesis needs to have one

    functional group (in addition to others) for its attachment to the solid support. It may happen that in the end product this group is unnecessary and needs to be removed. For this reason anchors have been developed that can be cleaved without leaving any functionality in the end product at the cleavage site. These traceless anchors usually contain silicon based linkers. 2.3. Protecting groups If a chemist wants to carry out a reaction on only one functional group of a multi-functional group compound, the reactivity of the rest of the functional groups needs to be suppressed. This can be achieved by application of protecting groups. A protecting group is reversibly attached to the functional group to convert it to a less reactive form. When the protection is no longer needed, the protecting group is cleaved and the original functionality is restored. A large number of protecting groups were developed for use in peptide synthesis since the amino acids are multi-functional compounds. It is an important requirement for a protecting group to be stable under the expected reaction conditions and to be cleavable - if possible - at mild reaction conditions. The stability/cleavage conditions of a protecting group are considered relative to those of the others. Two protecting groups are said to be orthogonal if either of them can be removed without affecting the stability of the other one. Some of the protecting groups most widely used in peptide synthesis are described below.

    O CH NH2

    OCH3

    OCH3

    CH NH2

    NO2

  • 23

    2.3.1. Protection of amino groups The benzylcarbonyl (Z) group. Bergmann and Zervas suggested the benzyloxycarbonyl group for amino-protection in peptide synthesis in 1932 and this important protection type is still in use. The Z group can be introduced by the reaction of the amino group containing compound with benzylchloroformate under Schotten-Bauman conditions. The Z protection is stable under mildly basic conditions and nucleophilic reagents at ambient temperature. Cleavage can be brought about by HBr/AcOH, HBr/TFA or catalytic hydrogenolysis. The t-butoxycarbonyl (Boc) group. An alternative choice for amino group protection is the Boc group. Its advantage is that can be removed under milder conditions than the Z group. The Boc group is completely stable to catalytic hydrogenolysis and as such is orthogonal to the Z group. Basic and nucleophylic reagents are no effect on the Boc group and its removal can be carried out by TFA at room temperature. The most convenient reagent that can be used in the protection reaction is the Boc anhydride (Boc2O). The 9-fluorenylmethoxycarbonyl (Fmoc) group. The Fmoc group differs from both Z and Boc groups since it is very stable to acidic reagents.

    CH2-O-C-Cl

    O

    H2N~

    CH2-O-C-NH

    O

    ~

    +

    Me3C-O-C-NH~ O

  • 24

    HCH2-O-C-NH-

    O

    The Fmoc group can be removed under basic conditions. Usually 20% piperidine dissolved in DMF is used as reagent. One of the reagents for introducing the Fmoc group is the FmocCl. 2.3.2. Protection of carboxyl groups Carboxyl groups are most often protected by converting them to benzyl esters or t-butyl esters.

    The benzyl esters are cleaved by saponification, HBr/AcOH, HF and catalytic hydrogenation but not by TFA. Their response to acids is similar to that of the Z groups but somewhat less sensitive.

    The t-butyl esters, unlike benzyl esters, are stable to bases or nucleophilic attack. The properties of t-butyl esters are somewhat similar to those of the Boc groups although they are less sensitive to acidolysis. They can be cleaved by TFA.

    2.3.3. Protection of other functional groups The alcoholic and phenolic hydroxyl groups are protected by converting them to benzyl ether or t-butyl ether. The former protecting group can be cleaved by HF, HBr/AcOH or by catalytic hydrogenolysis and the latter one by TFA. Thiol groups can also be protected by benzyl ether formation or by tritylation. The guanidino group (present in arginine) can be protected by nitration or by arylsulphonyl groups. The nitro group resists HBr/AcOH and can be cleaved by liquid HF. Among the arylsulphonyl groups the tosyl (Tos) group can be cleaved by liquid HF or sodium in in liquid ammonia. Two other arylsulfonyl groups are more sensitive to acidic conditions. The 2,2,5,7,8-pentamethylchroman-6-sulphonyl (Pmc) group can be cleaved by TFA under conditions

    ~COO-CH2-~COO C

    CH3

    CH3

    CH3

    benzyl ester t-butyl ester

  • 25

    similar to the removal of the Boc group. The 4-methoxy-2,3,6-trimethylbenzenesulphonyl (Mtr) group is also cleaved by TFA but is less sensitive and requires a few hours for cleavage. The amide groups (in side chains of asparagine and glutamine) can be protected by tritylation. The trityl protecting group is stable to base, catalytic hydrogenolysis, very mild acid but is cleaved with TFA. It is used in conjunction with the Fmoc amino group protection strategy. The NH group of the imidazole ring (in the side chain of histidine) is protected in conjunction with the Fmoc strategy by tritylation. The trityl protecting group can be removed by TFA at room temperature. The indole ring (in tryptophane) can be protected by Boc group that can be removed by TFA. 2.3.4. Coupling reagents for peptide synthesis In the coupling reactions of peptide synthesis the carboxyl group of the acylating amino acid is activated. Care should be taken in selecting the activation method to avoid racemization. One of the choises is 1,3-diisopropylcarbodiimide (DIC) with addition of N-hydroxybenztriazole (HOBt) in order to reduce racemization.

    NH-C-NH~

    O

    O

    Me

    MeMe

    SO2

    MeMe

    NH-C-NH~

    O

    Me

    MeMe

    Me

    SO2

    Pmc protection Mtr protection

    NH-C-NH~

    O

    O2N-

    Protection by nitro group

    N

    N

    CPh3

    Protection by trityl group

    Protection by Boc group

    N

    Boc

  • 26

    1,3-Diisopropylcarbodiimide (DIC) N-Hydroxybenztriazole (HOBt) Another very often used coupling reagent is O-benztriazole-N,N,N,N-tetramethyl-uronium-hexafluoro-phosphate (HBTU) that is known not to cause racemization.

    N

    N

    N

    O

    NCH3H3C

    N

    CH3

    CH3

    PF6_

    +

    O-benztriazole-N,N,N,N-tetramethyl-uronium-hexafluoro-phosphate

    (HBTU) One of the bases applied in the coupling reactions is N,N'-Diisopropylethylamine

    (DIPEA). N,N'-Diisopropylethylamine (DIPEA) 2.4. Solid phase synthesis of organic molecules The vast number of reactions developed for the synthesis of organic molecules were optimized for solution phase. In the decades following its introduction by Merrifield,1 of the solid phase method was mainly used in peptide chemistry. Except the studies of Lezenoff,6-9 Camps,10 Frechet11-13 , Crowley and Rapoport,14 little experience has been accumulated in its application in the synthesis of organic molecules. The advent of combinatorial chemistry, however, induced radical changes and initiated a fast expanding research in the field. The classes of chemical reactions developed for solid phase as a result of this research are showed below:

    Anchoring reactions

    Amide bond forming reactions

    Aromatic substitutions

    Condensation reactions

    N=C=N CHCHH3C

    H3C

    CH3

    CH3

    N

    N

    N

    N

    OH

  • 27

    Cycloaddition reactions

    Organometallic reactions

    Michael additions

    Heterocyclic forming reactions

    Multi-component reactions

    Olefin forming reactions

    Oxidation reactions

    Reduction reactions Substitution reactions

    Protection/deprotection reactions

    Cleavage from supports

    Other types of solid phase reactions

    Excellent compilations of these reactions were prepared by Hermkers et al.15,16, . Furka17 and W. M. Bennett.18

    2.5. Solid phase reagents and scavenger resins in solution phase synthesis Solid phase additives are successfully applied in many solution phase synthetic reactions. In solid phase reactions the substrate is bound to the solid phase carrier and the reagents are in solution. In solution phase reactions both the substrates and the reagents are in solution. In some solution phase reactions, however, the reagent is bound to resin. The advantage of such reagents is that the by products of the reagent remains bound to the resin and can be easily removed from the reaction mixture by filtration. One example is the polymer bound HOBt that is used in amide formatting reactions.

    N

    N

    N

    OH

    Polymer bound HOBt More solid phase reagents and examples of their applications are found in the already mentioned compilations.15-18 Different types of resins can also be used in solution phase reactions for removal of the excess of reagents, substrates or by products. Those resins that can be used for such purposes are named scavenger resins. One example is formylpolystyrene19-20 which is used for removal of primary amines from reaction mixtures.

  • 28

    H

    O

    Formylpolystyrene

    Other examples of scavenger resins and their applications are found in the above mentioned compilations.15-18 References

    1. R. B. Merrifield J. Am. Chem. Soc. 1963, 85, 2149. 2. R. B. Merrifield J. M. Steward,N. Jernberg Anal. Chem. 1966, 38, 1905. 3. W. Rapp In G. Jung (Ed) Combinatorial Peptide and Nonpeptide Libraries 1996, VCH,

    New York, 425. 4. H. M. Geysen, R. H. Meloen, S. Barteling Proc Natl Acad Sci USA 1984, 81, 3998. 5. http://www.mimotopes.com 6. C. C. Lezenoff Acc. Chem. Res. 1978, 11, 327. 7. P.M. Worster, C. R. McArthur, C. C. Lezenoff Angew. Chem. 1979, 91, 255. 8. C. C. Lezenoff, V. Yedidia Can. J. Chem. 58, 1980, 287. 9. V. Yedidia, C. C. Lezenoff Can. J. Chem. 1980, 58, 1144. 10. E. Camps, J. Cartells, J. Pi Anales de Quimica 70, 848 (1974). 11. J. M. J. Frechet Tetrahedron 1981, 37, 663. 12. M. J. Farrall, J. M. J. Frechet J. Org. Chem. 1976, 46, 3877. 13. J. M. J. Frechet, C. Schuerch, J. Am. Chem. Soc. 1971, 93, 492. 14. J. I. Crowley, H. Rapoport Acc. Chem. Res. 1976, 9, 135. 15. P. H. H. Hermkens, H. C. J. Ottenheijm, D. Rees Tetrahedron 1996, 52, 4527. 16. P. H. H. Hermkens, H. C. J. Ottenheijm, D. Rees Tetrahedron 1997, 53, 5643. 17. . Furka In Combinatorial & Solid Phase Organic Chemistry 1998, Advanced

    ChemTech Handbook, Louisville, 35. 18. W. M. Bennett In H. Fenniri (Ed) Combinatorial Chemistry 2000, Oxford University

    Press, Oxford, New York, 139. 19. K. G. Dendrinos, A. G. Kalivretenos J. Chem. Soc. Perkin Trans. 1998, 1, 1463. 20. M.V. Creswell, G. L. Bolton, J. C. Hodges, M. Meppen Tetrahedron 1998, 54, 3983.

  • 29

    3. Parallel synthesis. Synthesis of compound arrays based on saving reaction time

    Execution of the chemical reactions used in the synthesis of compounds or in the analytical methods take time like the food cooking processes applied in the kitchen. The house wives know for a long time how to organize their work economically. A typical house wife, for example, begins cooking the food then starts the washing machine and while the soup is boiling and the washing machine runs the program she occupies herself with vacuum cleaning the carpet. That is, she is doing different activities in parallel in order to be more effective. The idea of making our activities more efficient by doing operations in parallel is really old. The Tibetan monks, for example, mechanized their praying by applying praying mills and they operate these praying machines in parallel that makes praying very efficient.

    Figure 3.1. Praying mills1 In chemistry, the concept of parallel work was introduced relatively late. The Hungarian microbiologist, Gy. Taktsy was the pioneer of this type of activity. Taktsy organized his microbiological analytical work into parallel operations.2 He developed a radically new method for serological titrations in 1955. Among the new tools that made the parallel work very easy he introduced the microtiter plates that gained wide application in both biology and chemistry. The mictotiter plates are plastic plates with holes drilled into them. The holes serve as reaction vessels. The standard plate has 96 reaction vessels and their arrangement is shown in Figure 3.2.

  • 30

    Figure 3.2. Standard microtiter plate: 8 rows and 12 columns 3.1. The parallel synthesis The principle of parallel synthesis is the same as that applied by the house wives in the kitchen and the Tibetan monks in praying. Execution of the chemical reactions takes time and during that time not only one but a series of reactions can be realized. Each synthetic reaction is started in a different reaction vessel and all the necessary operations are executed in parallel. Figure 3.3. shows the principle of the synthesis of five different trimers, for example tripeptides, in parallel.

    Figure 3.3. Parallel synthesis of five trimers in five (numbered) reaction vessels. The Black, gray and white circles represent building blocks, for example amino acids

    The five trimers are synthesized on solid support (P) in reaction vessels 1 to 5. At the end of the synthesis, each trimer is individually cleaved from the support and collected in one of the

    Side view

    Top view

    P P P 4 4 cleavage

    P

    P P P 2 2 cleavage

    P

    P P P 1 1 cleavage

    P

    P P P 5 5 cleavage

    P

    P P P 3 3 cleavage

    P

  • 31

    five vessels designated for storing the end products. The figure demonstrates that in parallel synthesis the number of reaction vessels is the same as the number of compounds to be prepared. The number of operations is practically the same as in the one by one synthesis of the same compounds since the solvents and reagents have to be serially transported into each reaction vessel. The real advantage is that the reaction time for the in synthesizing the 5 compounds is about the same as preparing a single one. The series of compounds prepared by the parallel and the other combinatorial methods are called compound libraries. 3.1.1. The multipin metod of Geysen

    The first example of parallel synthesis was published by Geysen and his colleagues.3 They synthesized series of peptide epitopes in an apparatus developed for this purpose (Figure 3.4.). In the multipin apparatus the authors used the microtiter plate introduced by Taktsy for reaction vessels (Figure 3.4./b) and a cover plate with mounted polyethylene rods fitting into the wells (Figure 3.4/a). The end of polyethylene rods (pins) were coated with derivatized polyacrilic acid (marked by black).

    Figure 3.4. The multipin apparatus

    Figure 3.4. The multipin apparatus The amino acids used in building the peptides and the coupling reagents were dissolved and added to the wells. The coated ends of the pins were immersed into solution and kept there until the coupling reactions ended. Washings and deprotection of the peptides were also executed in the apparatus. The formed peptides were attached to the pins. The peptides prepared in the Geysens experiment were screened - after deprotection - without cleaving them from the pins.

    Figure 3.5. The well row 4/column 9

    Pin

    Solution

    a

    b

    1 2 3 4 5 6 7 8 9 10 11 12

    1 2

    3

    4 5 6

    7

    8

  • 32

    The sequence of the peptide formed on a pin depended on the order of the amino acids added to the particular well. The amino acids or their order (or both) were different for each well so a different peptide formed on each pin. The wells, as well as the pins, were characterized by their coordinates: rows and columns. By recording the order of the added amino acids into each well, the expected sequences of the peptides could be determined from the position occupied in the plate.

    If the order of added amino acids in the well row 4/column 9 is Gly, Gly, Arg, Phe, for example (Figure 3.5), then the sequence of the tetrapeptide formed on the pin row 4/column 9 is Phe.Arg.Gly.Gly (taking into account that numbering of the amino acids in peptides stars at the N-terminus). The multipin method is still used and the multipin apparatus is a commercially available product. In such apparatus, however, not coated plastic rods are used as pins. The coated head of the roads is replaced by SynPhase crowns or SynPhase lanterns (Figure 3.6) mentioned in chapter 2.

    Figure 3.6. Pins with crown (a) and lantern (b)

    The multipin procedure was applied by Ellman and his colleagues4 in pioneering the preparation of organic libraries by parallel synthesis.

    Scheme 3.1.

    Derivatives of 1,4-benzodiazepines were constructed from 2-aminobenzophenones, amino

    acids and alkylating agents (Scheme 3.1). The Fmoc protected 2-aminobenzophenones were first attached to an acid labile linker (L) then through the linker to the pins (P). After removal of the protecting group it was coupled with a protected amino acid (1). This was followed by the removal of the Fmoc protecting group and cyclization (2), then by alkylation of the ring nitrogen to introduce R4 (3). Finally the product was cleaved from the support (4). 3.1.2. The SPOT technique of Frank The SPOT method introduced by Frank5,6 and his group was also developed for preparing

    O

    R 2

    N

    N O

    R 1

    R 3

    O

    R 2 R 4

    N

    N O

    R 1

    R 3

    O

    R 2 R 4

    N

    N O

    R 1

    R 3

    H

    H O

    R 2

    O

    NHFmoc

    R 1

    O

    R 2

    O

    NH

    R 1

    O

    NHFmoc

    R 3

    1 2 3 4

    L L L L P P P P

    a b

  • 33

    peptide arrays. The synthesis is carried out on cellulose paper membranes derivatized to serve as anchors for the first amino acids of the sequences to be prepared. Small droplets of solutions of protected amino acids dissolved in low volatility solvents and coupling reagents are pipetted onto predefined positions of the membrane (Figure 3.6). The spots thus formed can be considered as reaction vessels where the conversion reactions of the solid phase synthesis take place. An array of as many as 2000 peptides can be made on an 8x12 cm paper sheet. The peptides can be screened on the paper after removing the protecting groups. 3.1.3. Other devices for parallel synthesis De Witt and co-workers7 also developed an apparatus for parallel synthesis. It was designed for the synthesis of small organic molecules. The solid support was placed in porous tubes immersed into vials containing solutions of reagents which diffused into tubes. The temperature of the reaction mixtures could be controlled by heating or cooling the reaction block.

    Figure 3.6. The spot synthesis Another inexpensive device was described by Meyers et al.8 Beckman polypropylene deepwell plates were modified by drilling a small hole in the bottom of each well. A porous polyethylene frit was fitted into the bottom of the wells to allow removal of the solutions and solvents by vacuum. At the bottom, a rubber gasket prevented the leakage of the wells during the reactions.

    Figure 3.7. The LabMate manual synthesizer (photo: www.aapptec.com)

  • 34

    A simple commercially available parallel device, offered by the firm aapptec, is demonstrated in Figure 3.7. It is a manual synthesizer allowing the parallel synthesis of 25 different compounds in 0.05 to 2 mmol quantity. The content of the reaction vessels can be mixed and heated in 4 different zones. Cooling is also possible if connected to a circulating chiller. Another parallel synthesizer is offered by BCHI. The Syncore Reactor (Figure 3.8.) can accommodate 24 to 96 reaction vessels. It can be used for both solid phase and liquid phase parallel synthesis. Among the features vortex mixing, heating and cooling and parallel evaporation of the samples can be mentioned.

    Figure 3.8. The Syncore Reactor of BCHI (photo: www.buchi.com)

    Preparation of many organic compounds needs heating. Since the mid-1980s the use of microwave heating began to spread in chemical laboratory practice9. This kind of heating raised considerably the speed of chemical reactions in both solution and solid phase. The reaction times are typically reduced from days or hours to minutes or second often followed by increased yields, too. This type of heating is also applied in the synthesis of combinatorial libraries in order to save time. In microwave heating the energy is not transferred by conduction or convection so the reaction vessel is not heated only the solvent and the reactants. The energy is absorbed by dipolar molecules. Molecules that have larger dipolar momentum absorb better. For this reason solvents with large dielectric constant are preferred. Although examples for application of domestic microwave ovens in parallel synthesis of combinatorial libraries are found in the literature10 in practice rather specially constructed heaters are applied. The experiments carried out in domestic ovens are often difficult to reproduce because of the uneven electromagnetic field distribution, the pulsed irradiation and the unpredictable formation of hot spots. Two kinds of specially constructed commercial reactors are available that work either in multimode or in monomode operation.11,12

  • 35

    Figure 3.9. In the XP-1500 Plus system of CEM up to 12 samples can be heated simultaneously (photo: courtesy of CEM)

    The multimode reactor has a large cavity like the domestic oven but reflection by the walls and a mode stirrer ensures a nearly homogenous distribution of the electromagnetic field. In this reactor the samples can be heated in parallel. A parallel reactor for heating up to 12 samples is demonstrated in Figure 3.9.

    Figure 3.10. The software controlled ExplorerPLS system of CEM

    (photo: courtesy of CEM)

    The ovens operating in monomode, on the other hand, heat only one sample at a time. The vials containing the reactants are delivered serially into the oven. Such system, the ExplorerPLS of CEM is demonstrated in Figure 3.10. The Explorer handles all of the routine tasks necessary to execute a large number of reactions each d