|Home | About | Journals | Submit | Contact Us | Français|
In this account, we describe the results of a research program directed to the proposition that chemical synthesis can play a valuable role in identifying biologic level molecules worthy of pharma level development. We recount our journey towards the chemical synthesis of homogeneous erythropoietin, the challenges we encountered, and our efforts to address deficiencies in the current “state of the art” of glycopeptide synthesis. Here we describe new methods for the synthesis of glycopeptides that have emerged from the erythropoietin adventure, including the development of unique C-terminal acyl donors, novel amide bond forming methods, and new ligation and coupling strategies.
Within the traditional mindset of the pharmaceutical sciences there prevails an unstated but sharp division between the worlds of “biologics” and “small molecules.” The dominant and typical role for chemistry has been limited to the realm of “small molecules,” where its capacities for synthesis have traditionally been well recognized. In contrast, within the “biologics” domain, chemistry has played relatively minor roles. In this paper, we describe the results of a research program directed to the proposition that chemical synthesis can actually play a valuable role in identifying biologic level molecules worthy of further pharma level development.i A few remarks of a historical nature are perhaps in order before dealing with hardcore specifics.
In 1982 we described what was then a new reaction that ushered in a large body of chemistry. The reaction was the Lewis Acid Catalyzed Diene Aldehyde Cyclocondensation reaction that affords dihydropyrones in a remarkably general and straightforward fashion.ii Explorations along these lines carried a special appeal to us since, almost a decade earlier, our laboratory had launched the idea of 1,3-dioxygenated (synergistic) dienes in “all-carbon” Diels-Alder reactions.iii Reduction of the dihydropyrones afforded synthetic glycals. This led to a decade-long fascination with glycals, both synthetic glycals as well as those derivable from naturally available carbohydrates, particularly hexoses.iv In time, we learned how to take advantage of nuances in the chemistry of glycals. Control of the chemistry of glycals can be more discriminating than is the case with traditional pentafunctional hexoses. Indeed, we learned how to exploit glycals not only as donors, but as acceptors – a very new idea for its time.iv There then ensued a long term involvement directed to the chemical synthesis of oligosaccharides by what we called “glycal assembly” (Figure 1).v A particular focus of glycal assembly was that of fully synthetic vaccines, several of which have emerged from successful Phase I clinical trials.vi
In ca. 2002, there began to emerge in our group an interest in structures even more complex than oligosaccharides, i.e., asparagine linked glycopeptides and glycopolypeptides.vii Indeed, we were able to apply our earlier findings and later discoveries to reach such systems including structures mimicking parts of PSAviii and GP120.ix These triumphs, in time, led us to begin fantasizing about the complex, multiply glycosylated protein, erythropoietin (vide infra), in particular, about synthesizing erythropoietin (EPO) in a chemical laboratory as an alternative to the Golgi of CHO cells. In time, the fantasizing gave way to planning, and in due course, some visionary pioneers in our laboratory actually began experimentation directed toward the total synthesis of homogeneous EPO. In particular, we were drawn to EPO by its therapeutic value (vide infra) and its complex structure. It was in this context that we set for ourselves the formidable long-term challenge of preparing fully synthetic erythropoietin in homogeneous form. While our focus was on EPO, we anticipated the likelihood that the program would provide valuable new insights into the synthesis of glycopeptides and even insights into the nature of these fascinating bi-domainal “biologics”.
Glycoproteins arise from protein glycosylation, a complex post-translational process that plays a critical role in a wide range of cellular functions. Protein glycosylation has been implicated in mediating protein folding, in protecting against proteolysis, in cellular differentiation, and in cell-cell communication.x In mammalian systems, glycoproteins are commonly expressed as heterogeneous mixtures of glycoforms.xi Efforts to determine the exact biological role of a defined glycosylation profile have been seriously complicated by difficulties associated with isolating significant quantities of homogeneous glycoforms.xii While advances in genetic engineering have enabled access to glycoproteins, they do not as yet allow for homogeneous expression of glycoproteins. Furthermore, the emergence of their oligosaccharide regimes are structurally limited both by the host glycosylation machinery and by the genetically encoded amino acid template. Consequently, access to homogeneous and structurally definable glycoproteins could be of significant value for the systematic study of the effects of protein glycosylation on biology-level performance.xiii Of particular interest would be issues of biostability and the nature of the crosstalk between the two (polypeptidic and carbohydrate) domains and function.xiv In particular, de novo chemical synthesis might provide a powerful path to designed glycoproteins as it offers, in principle, precise structural control while providing opportunities for systematic variation of both the structures of the glycodomains and their location within the peptide sequence.
Erythropoietin (EPO) is a 30 kDa glycoprotein hormone with a 166 amino acid backbone and features three N-linked glycans (at Asn24, Asn38, and Asn83), one O-linked glycan (at Ser126), and disulfide bonds at Cys7 and Cys161, as well as Cys29 and Cys33.xv,xvi EPO is produced primarily in the kidneys. As a hematopoietic growth factor, erythropoietin regulates and maintains red blood cell, or erythrocyte, levels.xvii The production of erythropoietin is regulated by hypoxia following a classical feedback mechanism. While the major role of EPO was initially thought to be limited to erythrocyte production in the kidney, EPO and its receptor have been found in the nervous system and the EPO receptor is also present in the endothelium.xviii,xix Natural EPO exists as a heterogeneous mixture of glycoforms and is isolated in only low yield and after difficult purification.xx However, subsequent isolation, cloning of the EPO gene, and expression in mammalian cells have enabled the engineering of recombinant human erythropoietin (rhEPO).xxi
Moreover, heterogeneous EPO has found clinically valuable application as an adjuvant used in the treatment of anemia associated with renal failure and cancer chemotherapy.xxii Extensive efforts have been directed to the study and understanding of the structure and function of EPO, including the role of glycosylation. Specifically, Higuchi et al. demonstrated that erythropoietin's (rhuEPO) three N-linked glycosyl groups were not required for in vitro activity in terms of binding to the EPO receptor but were required for in vivo activity.xxiii In contrast, the single O-linked glycophorin apparently was not required for in vitro or in vivo activity.xxiii Furthermore, the in vivo activity of EPO was shown to be more than casually associated with its sialic acid content.xxiv Interestingly, Kent and coworkers have synthesized a polymer-modified EPO that apparently demonstrated incremental in vivo activity compared to the glycosylated rhuEPO.xxv Despite impressive biological and chemical studies that have been conducted to understand the structure and function of EPO, efforts to determine the exact biological roles of defined EPO glycoforms have been thwarted by difficulties associated with isolating significant quantities of homogeneous glycoforms. In fact, this goal has not been accomplished by any biologically enabled means.
Chemical synthesis of glycopeptides or glycopolypeptides bearing complex carbohydrates has only recently been realized.xxvi This situation is in sharp contrast to the plethora of non-glycosylated proteins that have been prepared by chemical synthesis, enabled by the discovery and development of native chemical ligation.xxvii,xxviii Indeed, using native chemical ligation (NCL), Kajihara and Dawson reported the first synthesis of a complex glycoprotein, i.e., a single glycoform of monocyte chemotactic protein-3 containing human complex sialyloligosaccharide.xxix Erythropoietin glycopeptide fragments containing complex sialyloligosaccharides have been prepared by our group using either NCL or direct condensation methods.xxx,xxxi,xxxii,xxxiii,xxxiv Recently, an asialo erythropoietin glycopeptide fragment was reported by Kajihara,xxxv for which serine ligation was effectively employed. The chemical syntheses of ribonuclease B and CTLA-4 glycopeptide fragments using NCL were also reported by the groups of Unverzagat and Kajihara, respectively.xxxvi,xxxvii Hojo and co-workers have also employed direct condensation methods in their chemical synthesis of an N-glycosylated Ig domain of emmprin.xxxviii Notwithstanding these clear advances, the chemical synthesis of complex glycoproteins and glycopeptides remains a significant and daunting challenge.
We were fully aware of the long road before us and of the many obstacles that would inevitably be faced in seeking to complete such a program. While our defining “line in the sand” goal is the synthesis of homogeneous erythropoietin, it seemed likely that the struggle itself would prove valuable, as well as stimulating. Such a journey itself would help us to identify and then address deficiencies in the current “state of the art” of glycopeptide and glycopolypeptide synthesis. As matters transpired, the attempted total synthesis of homogeneous EPO has been, and continues to be, a highly productive undertaking that has resulted in the development of unique C-terminal acyl donors, novel amide bond forming methods, and new ligation and coupling strategies. The scope and applicability of these new methods have been demonstrated in multiple contexts, most recently in our syntheses of erythropoietin glycopeptide fragments.xxxii,xxxiii,xxxiv Here we emphasize new methods for the synthesis of glycopeptides and glycopolypeptides that have already emerged from the EPO adventure.
As we were pondering possible strategies for the chemical synthesis of glycoproteins, we recognized the need for a highly convergent approach to glycopeptides that would enable access to multiply-glycosylated glycoproteins. Ideally, the synthesis would entail the coupling of separate glycopeptide fragments, which would offer not only maximum convergency for reaching a single glycoform of a glycoprotein, but would also provide a modular synthesis from which new glycoforms could be prepared. Furthermore, during the course of our studies on glycopeptide synthesis, we observed that Lansbury aspartylation reactionsxxxix between complex oligosaccharides and peptides were more efficient when the peptidic aspartyl acyl donors were of relatively modest length and presented relatively stable side-chain functional groups. Thus, we required methods that would not only enable the synthesis of glycopeptide domains from their respective glycosyl and peptide fragments, but would also enable the subsequent ligation of these product glycopeptide domains.
In the early phases of the effort, it had been demonstrated that a peptide αthioester and a N-terminal cysteinal glycopeptide fragment could be joined through native chemical ligation.xl An analogous approach could be employed to join two glycopeptide segments. Clearly, implementation of such a format would require, at some stage, the synthesis of a glycopeptide αthioester. Faced with the practical challenges of preparing such a αthioester, we developed a type of construct which would hopefully be more accessible and functionally even more valuable as an acyl donor. This is first presented in the form of a phenolic ester equipped with an unsymmetrical disulfide (see 1, Figure 2).xli It was presumed that the phenolic ester itself would be of relatively modest acyl donor proclivity. However, one could anticipate that reduction of the disulfide moiety would reveal a free ortho benzene thiol function (2) that could enhance the acylating ability of the phenolic ester. Alternatively, a bona fide O→S acyl transfer could serve to generate the corresponding aryl thioester (3). At the outset, it was presumed that the thioester valence tautomer would be present, at best, as a minor component in equilibrium with the O-ester. We anticipated that either of these two isomeric ester forms might act as viable acyl donors, under the appropriate reaction conditions.
Happily, the utility of this latent acyl donor in glycopeptide couplings has been well demonstrated, for instance, in the synthesis of 9 (Scheme 1).xli Two glycopeptide segments, one bearing our ortho-disulfide phenolic ester at the C-terminus (7) and the other a t-butylthio-protected cysteine at the N-terminus (8), were subjected to reductive cleavage of their disulfide bonds. The resulting glycopeptide intermediates underwent ligation to afford the doubly glycosylated polypeptide 9. The initial success in bringing together two glycopeptide segments through the use of our newly crafted ortho-disulfide phenolic ester via chemical ligation was certainly encouraging.
In terms of EPO, we remained apprehensive as to whether the requirement of an N-terminal cysteine for NCL could be a limiting factor. Cysteine is one of the least abundant amino acids found in proteins, second only to tryptophan.xlii In the context of erythropoietin, which has four glycosylation sites, a convergent synthesis would entail the sequential ligation of the four separate glycopeptide domains. However, the cysteine residues in EPO are unevenly distributed, thereby limiting their utility in ligating different glycopeptide domains. To deal with the paucity of cysteine residues in EPO, it was deemed essential to develop cysteine-independent ligation methods that would also complement our cysteine-based ligation method.
An opportunity to develop a cysteine-free ligation presented itself during our mechanistic studies on ligations with our newly crafted ortho-disulfide phenolic ester.xliii Under our reaction conditions, reduction of the ortho-disulfide group results in a free ortho benzene thiol that rearranges in a unidirectional O→S acyl transfer, to the corresponding thioester (Figure 2, 2→3). We recognized that such an intermediate thioester could potentially undergo a second acyl transfer event, with a proximal amine group, culminating in a cysteine-free ligation strategy (Figure 3). By introducing a second peptide substituent (12), two glycopeptide fragments could, in principle, be joined in sequential acyl transfers. As described below, removal of the thiol protecting group from 13 initiated the first acyl transfer, from O→S, to afford thioester 15. In the presence of the proximal amine group, a second acyl transfer, this time from S→N, gives rise to 16. Removal of the organizing auxiliary unit affords 17, in which the fragments are now joined by a new amide bond.
This novel cysteine-free ligation strategy was, in fact, used to prepare glycopeptide 19, which features two protected Tn antigens, from the differentially glycosylated substrate 18 (Scheme 2).xliii Following acidic removal of the thiol protecting group in 18, sequential acyl transfers afforded the desired product 19, wherein the new amidic nitrogen remains connected to the phenolic auxiliary. At this writing, we have not yet dealt with the cleavage of the now-vestigial 1-thiyl meta cresyl residue in 19.
Following the principles of native chemical ligation, combined with what we had learned from our studies on cysteine-free ligation, we developed a new method for utilizing the electron rich auxiliary 21 (Figure 4).xliv The role of the auxiliary, which would be situated at a glycopeptide N-terminus, was to provide a thiol handle that would essentially mimic cysteine.xlv In a manner analogous to native chemical ligation, the thiol auxiliary should join two glycopeptide fragments through an initial transthioesterification, thus enabling the critical S→N acyl transfer to furnish the new amide bond. In this approach, we first dealt with installation of auxiliary 21 onto a glycopeptide N-terminus (20) via reductive amination. The acyl donor would be our ortho-disulfide phenolic ester, situated at the C-terminus of a second glycopeptide fragment (22). Under reducing conditions, the ortho-thiol phenolic ester would undergo an O→S acyl transfer and the resulting thioester would react with the thiol auxiliary to provide a new thioester (24). Subsequent S→N acyl transfer and removal of the auxiliary would provide the desired bidomainal glycopeptide adduct 26.
Our new cysteine-free ligation method was successfully applied to the synthesis of several complex glycopeptide fragments.xliv In the synthesis of the bidomainal glycopeptide 31, glycopeptides 27 and 28 are joined together at a glycine-glutamine junction (Scheme 3). Initial disulfide bond reduction of 27 and 28 resulted in formation of the ligated product 29. The auxiliary was removed through a two-step sequence whereby the free thiol was first methylated to prevent any acid-mediated N→S acyl transfer. Subsequent treatment with TFA resulted in cleavage of the auxiliary to give the desired glycopeptide adduct 31. It is appropriate to point out that during the course of our studies, we observed that as the ligation site becomes more hindered, the S→N acyl transfer becomes increasingly more difficult. Consequently, this method, as currently practiced, is best applied to junctions where at least one of the amino acids is glycine or alanine.
In the context of erythropoietin, the synthesis would require the reiterative coupling of multiple glycopeptide and peptide fragments. To demonstrate the utility of our cysteine-free and cysteine-based ligations in reiterative couplings, we identified a tridomainal glycopeptide (32) containing three glycosyl units for total synthesis (Figure 5).xliv The target glycopeptide (32) would be assembled from the C→N terminal, using our cysteine-free ligation to first form the glycine-glutamine junction. This merger would be followed by a cysteine-based ligation to form the glycine-cysteine junction. The requisite glycopeptide starting materials are 27, 33, and 34, which can be readily prepared using techniques developed in our lab.
The reiterative coupling began with glycopeptides 33 and 34 (Scheme 4). Disulfide bond reduction followed by cysteine-free ligation provided the adduct 35. Subsequent cleavage of the the Fmoc and thiazolidine groups served to expose the N-terminal cysteine (36). The latter could be joined by cysteine-based ligation with glycopeptide 27 to afford the tridomainal glycopeptide 32.
While our cysteine-free ligation method has thus far been focused on the synthesis of multidomainal glycopeptides, it was appreciated that this method could, in principle, offer a potential alternative to the preparation of cyclic peptides. Compared to their linear counterparts, cyclic peptides often exhibit enhanced specificity, activity, and stability, making them an important class of biologically relevant compounds. Indeed, in contrast to the traditional methods used for cyclic peptide synthesis, Tam and co-workers have shown that native chemical ligation can also be used for this purpose.xlvi Considering the rarity of cysteine in proteins, the development of cysteine-free ligation methods to gain access to cyclic peptides is of particular importance. Our cysteine-free ligation format was thus applied to the synthesis of cyclic peptide 39 (Scheme 5).xliv In starting peptide 38, the C-terminus features an ortho-disulfide phenolic ester while the N-terminus was equipped with the auxiliary. Upon exposure to reducing conditions, 38 underwent intramolecular ligation to afford the cyclized product 39.
With the demonstration of our cysteine-free and cysteine-based ligations on model glycopeptide fragments, we were emboldened to apply these methods to fragments from which one could hypothesize about a synthesis of erythropoietin. Two glycopeptide fragments of EPO, Ala22-Glu37 (44) and Ala114-Arg166 (48) were prepared using the cysteine-based ligation and cysteine-free ligation, respectively.xxx,xxxi In the case of Ala22-Glu37 (44), the necessary glycopeptide fragment was assembled via Lansbury aspartylation between dodecasaccharide 40xxx,xlvii and peptide 41 (Scheme 6). Ligation of glycopeptide 42 and peptide 43 proceeded under reducing conditions to afford the Ala22-Glu37 glycopeptide domain (44) featuring the fucosylated biantennary N-glycan.
It is, of course, noted that the Ala114-Arg166 (48) glycopeptide fragment of EPO contains an O-linked glycophorin glycan at Ser126 (Scheme 7). By utilizing the glycophorin-presenting serine as a cassette in combination with Fmoc-based solid phase peptide synthesis, the glycopeptide fragment 45 was assembled.xxxi Applying cysteine-free ligation conditions, glycopeptide 45 and peptide 46 were joined to provide 47. Selective methylation of the free thiol group followed by acidic cleavage of the auxiliary afforded the Ala114-Arg166 (48) glycopeptide fragment, which should be suitable for the EPO target.
At this juncture in our journey, our cysteine-free and cysteine-based ligation methods had enabled the synthesis of multidomainal glycopeptides, including the preparation of two glycopeptide fragments of EPO (44 and 48). Using these methods, the reiterative coupling of (glyco)peptide segments had also been demonstrated, leading to the synthesis of a trifunctional glycopeptide (32). While encouraging, these methods are not without their limitations. As mentioned earlier, during the cysteine-free ligation, the quality of the culminating S→N acyl transfer step is highly dependent on the state of steric hindrance at the ligation site. Moreover, beyond the critical acyl transfer step lies another potential complication: cleavage of the auxiliary under strongly acidic conditions could compromise the vulnerable glycosidic bonds. Furthermore, our glycopeptide/peptide ligations thus far have been limited to a single direction, from the C to the N terminus. The capacity to couple glycopeptide and peptide segments in a reiterative manner could well be of value in attempting to reach EPO by chemical synthesis. A unidirectional synthesis implies that glycopeptide assembly will be limited to a linear and nonconvergent synthesis whereby short (glyco)peptide fragments will be sequentially coupled. By contrast, a convergent synthesis would require the development of methods that enable glycopeptide coupling from the N to the C terminus, thereby requiring accommodation with existing ligation strategies.xlviii
In exploring alternative glycopeptide coupling methods that could circumvent the need for a cysteine or auxiliary based routes (permitting a new coupling direction), we wondered if our ortho-disulfide phenolic ester (i.e., 22) could serve as a viable acyl donor with a free amine acceptor (e.g., 20). The fragment condensation studies of Blake and Aimoto, which are mediated by silver ion, cast new mechanistic light on our ortho-disulfide phenolic ester. We were led to speculate whether the uniquely situated ortho-thiol group could facilitate the generation of an enhanced acyl donor (Figure 6).xlix It was thought that in the presence of an additive, such as HOOBt, the metal-activated ortho-thiol phenolic ester could give rise to a reactive acyl donor (49). The new acyl donor so generated could undergo fragment coupling with the free N-terminus of a glycopeptide (20), wherein the resulting adduct (26) would be free of any auxiliary.
Indeed, it was found that the use of AgCl, in combination with HOOBt and Hunig's base, resulted in the merger of the N-linked glycopeptide 50 and peptide 51 to afford the desired product 52 in 80% yield (Scheme 8).l It will be noted that in the example shown, the only amino acid side chains that are protected are those of lysine.li The other amino acid side chains, in particular those of aspartic acid and glutamic acid, remained free. This method enables an unprecedented direct fragment coupling in the absence of an N-terminal cysteine or an auxiliary by exploiting the latent acyl donor potential of the ortho-disulfide phenolic ester.
Happily, the enhancement of the acyl donor capacity of our ortho-disulfide phenolic ester is not, however, limited to the use of silver salts. The use of tris(2-carboxyethyl)phosphine (TCEP), a water-soluble phosphine reducing agent, was also effective in activating the ortho-disulfide phenolic ester to permit fragment coupling. As shown in the example below, the O-linked glycopeptide 53 and peptide 54 were coupled to afford ligated glycopeptide 55 under TCEP-mediated conditions (Scheme 9). It will be noted that the fragment couplings in Schemes 8 and and99 occur at C-terminal glycine and proline residues, respectively. As these residues are not vulnerable to epimerization problems, they are currently the preferred C-terminal amino acids for such fragment couplings.
Having two different actuation protocols for the assembly of EPO directed fragments, we explored their joint potential application to reiterative glycopeptide ligations. In practice it proved possible to exploit differences in the reactivity of thioesters during the two protocols. Specifically, under TCEP-mediated activation conditions, C-terminal alkyl thioesters remain intact whereas they are activated under the AgCl protocol, thereby undergoing fragment coupling.lii The implications of such functional group orthogonality are not insignificant. By maintaining the C-terminal alkyl thioester during a TCEP-mediated fragment coupling, the resulting product can be used in a second fragment coupling mediated by AgCl to enable glycopeptide assembly from the N to the C terminus.
As shown in Scheme 10, glycopeptide 50 is equipped with our C-terminal ortho-disulfide phenolic ester while peptide 56 has a C-terminal alkyl thioester.l Fragment coupling of the two was accomplished using the TCEP protocol to provide glycopeptide 57, which has an intact C-terminal alkyl thioester. The resulting glycopeptide product was then subjected to a AgCl-mediated fragment coupling with peptide 51 to afford glycopeptide 58. The final product, 58, was prepared from the N to the C terminus using sequential TCEP/AgCl fragment couplings.
A state of the art example of a glycopeptide assembly utilizing the sequential TCEP/AgCl fragment couplings is seen in the synthesis of erythropoietin glycopeptide domain Gln78-Arg166 (63, Scheme 11).xxxii This fragment, prepared in homogeneous form, contains both the N-linked dodecasaccharide at Asn83 and the O-linked glycophorin at Ser126. A TCEP-mediated fragment coupling between glycopeptide 59 and peptide 60 resulted in the formation of erythropoietin glycopeptide domain Gln78-Gly113 (61). In the light of its exploitable linkage, C-terminal alkyl thioester 61 was treated with AgCl in the presence of HOOBt and iPr2NEt to react with glycopeptide 62, thereby furnishing the erythropoietin Gln78-Arg166 glycopeptide domain 63, in 30% yield.
By exploiting differences in the acyl donor capacities of alkyl thioesters and ortho-disulfide phenolic ester thioesters, the TCEP and AgCl protocols were successfully incorporated in reiterative glycopeptide couplings. We recognized that viable acyl donors need not necessarily be limited to thioesters. Accordingly, we explored the use of activated oxo-esters in peptide ligations. It was found that C-terminal para-nitrophenol estersliii are highly effective acyl donors, undergoing ligation with peptides containing N-terminal cysteine in good yield and in relatively short reaction times with minimal racemization (Scheme 12).liv In one study, the para-nitrophenol ester mediated ligation proceeded at a significantly faster rate than the corresponding native chemical ligation, demonstrating the enhanced reactivity of the para-nitrophenol ester in these ligations. However, competing hydrolysis was a complication. Fortunately, such undesired hydrolysis could be suppressed by addition of para-nitrophenol and application to more hindered C-terminal residues. Most notably, studies with the very hindered unnatural amino acid penicillamine (68) and C-terminal para-nitrophenol esters also resulted in successful ligations (Scheme 13). By analogy to chemistry used in alanine ligation as described below, the penicillamine residue could be transformed to valine (69→70) via radical desulfurization (vide infra). For instance, ligation product 69 was exposed to thiol reduction conditions, converting the penicillamine residue to the corresponding valine residue in 70.
During our studies on the synthesis of the erythropoietin Cys29-Gly77 glycopeptide domain (76), it was found that the para-nitrophenyl ester was actually too reactive as an acyl donor. This high reactivity had to be attenuated to avoid unwanted side reactions in the critical aspartylation reaction.xxxiii In the context of efforts to tune the reactivity of the acyl donor, it was found that para-cyanophenyl esters manifested decreased but still sufficient reactivity for our requirements. As shown in Scheme 14, the aspartylation reaction between dodecasaccharide 40 and peptide 71 had afforded the desired glycopeptide fragment 72 in 33% yield. Direct aminolysis of the C-terminal para-cyanophenyl ester in glycopeptide 72 with peptide 73 provided the erythropoietin Cys29-Gly57 glycopeptide domain (74). It was noted that the C-terminal masked thioester within peptide 73 possesses two ortho-functional groups, a disulfide group and a propyl group. The propyl substitution was introduced to suppress undesired hydrolysis during the aminolysis reaction, thus permitting the synthesis of the Cys29-Gly57 glycopeptide (74) while retaining the masked thioester. This example serves to demonstrate how subtle adjustments can modulate the capacity of an acyl donor. Indeed, a TCEP-mediated fragment coupling between glycopeptide 74 and peptide 75 afforded the potential erythropoietin Cys29-Gly77 glycopeptide domain 76. The reader will note that this fragment possesses a C-terminal alkyl thioester that could well serve as a means for future glycopeptide coupling during the assembly of erythropoietin.
The strategies, methods, and tools discussed in the previous sections focused on circumventing cysteine dependence in various assemblies. As described, we investigated the use of auxiliaries as a cysteine mimic and developed a modified Blake-Aimoto direct fragment coupling; both of these methods have been applied to the reiterative synthesis of complex glycopeptide fragments. An alternative to these methods that also addresses the problem of cysteine dependence is the direct reduction of the cysteine sidechain to an alanine sidechain, following native chemical ligation. This idea was first proposed by Yan and Dawson and utilized primarily Raney nickel or Pd/Al2O3 to effect the desired cysteine reduction.lv Essentially, cysteine serves as a surrogate for alanine, which occurs with considerably greater frequency than cysteine in peptides and proteins.xlii The potential opportunities presented by this approach are significant and the groups of Kentlvi and Wonglvii have also made contributions. However, given our interest in the synthesis of complex glycoproteins, which often contain extensive and sensitive functional groups, we were interested in developing a mild and metal-free reduction for the conversion of cysteine to alanine.
After taking note of a 1956 disclosure by Hoffmann describing a desulfurization reaction between mercaptan and trialkylphosphite,lviii in a totally different context, we explored the use of trialkylphosphines, in particular TCEP, to effect a cysteine reduction.lix TCEP was an attractive choice for several reasons. Thus, it can tolerate a range of glycopeptide functionality, it is water soluble, and it is easily manipulated in air. In conjunction with the water soluble 2,2’-azobis[2-(2-imidazolin-2-yl)propane]dihydrochloride (VA-044) and t-butylthiol, TCEP proved to be a very effective reducing agent. As shown in Scheme 15, glycopeptide 77, which has an N-terminal thiazolidine, an Acm-protected cysteine, and our C-terminal ortho-thiophenolic ester, was coupled with peptide 78, which has a methionine, a C-terminal thioester, and an N-terminal cysteine, in a kinetically controlled ligation.xlviii The resulting peptide (79) was subjected to the desulfurization conditions described above, thereby accomplishing clean reduction of the free cysteine to alanine and giving glycopeptide 80. It will be noted that the sulfur atoms within the thiazolidine, Acm-protected cysteine, methionine, and C-terminal thioester are all intact. The reduction protocol is both mild and versatile, tolerating all thiol-containing groups as well as oligosaccharide domains.
Given the versatility of this method, we anticipated that it could accommodate the different functional groups present within the erythropoietin Ala1-Gly28 glycopeptide domain (87).xxxiv This glycopeptide fragment lacks any exploitable cysteine and glycine/proline residues, thus precluding the use of either NCL or direct condensation methods for its assembly. However, the presence of several alanine residues in close proximity to the glycosylation site suggested that native chemical ligation/radical desulfurization could address the synthetic challenges of this fragment. As shown in Scheme 16, peptide 81 and glycopeptide 82 were joined via native chemical ligation to give 83. In situ thiolactone formation during the reaction resulted in formation of 84. Removal of the allyl protecting group followed by opening of the thiolactone with thiopropionic acid provided glycopeptide 86, in which the cysteine side chain is revealed for desulfurization. Treatment of 86 with VA-044, TCEP, and thiopropionic acid in buffered conditions at 37 °C cleanly afforded the reduced product 87 in 67% yield. The use of thiopropionic acid as the radical propagator also served to open any thiolactone produced during the reaction. The erythropoietin Ala1-Gly28 glycopeptide domain, 87, features both the biantennary glycan and a C-terminal αthioester, two critical features necessary for the convergent preparation of synthetic homogeneous EPO.
Following our results in “alanine ligation,” native chemical ligation/metal-based thiol reduction strategies have also been developed to enable ligations at phenylalanine residues.lx In principle, the native chemical ligation strategy could be applied to any amino acid site by introducing a sulfhydryl group within the sidechain of the desired N-terminal residue. Following ligation and desulfurization, the natural amino acid is then revealed at the ligation site. This logic led us to develop a ligation at valine, which is a relatively abundant amino acid.xlii We selected γ-thiol valine to serve as the valine precursor,lxi instead of the hindered penicillamine.lxii As shown in Scheme 17, peptide 89 is equipped with a protected γ-thiol valine residue at the N-terminus and peptide 88 has a C-terminal ortho-thiophenolic ester. Under native chemical ligation conditions, the two are joined through transthioesterification followed by S→N acyl transfer to afford 90. Radical desulfurization using our TCEP/VA-044/tBuSH protocol at 37 °C provided peptide 91 with the natural valine residue at the ligation site.
Another outcome of our interest in amide bond formation driven by our EPO journey was a revisitation of the chemistry of isonitriles. Our isonitrile program has already matured into a field of inquiry of its own. Here we merely recapitulate for the reader some of our introductory findings. In particular we developed what we call the two-component coupling between carboxylic acids and isonitriles to give an N-formyl amide.lxiii It is proposed that under microwave heating, carboxylic acid 92 and isonitrile 93 react to give rise to formimidate carboxylate mixed anhydride 94. The latter then undergoes a 1,3-O→N acyl transfer giving rise to the observed N-formyl amide 95 (Figure 7). The utility of this two-component coupling in building asparagine-linked glycopeptides was explored (Scheme 18). The reactions between glycosyl isonitriles and aspartic acid were anomerically specific, that is, the respective α- and β-isonitriles (99 and 96) produce, correspondingly, the α- and β-N-linked glycosyl amino acids (100 and 98). It is important to note that the C2 functional group must be non-participatory, or, as shown in equation 3 (Scheme 18), a highly reactive β-GlcNAc donor can form that results in ester bond formation (101→102).
The N-formyl amide unit is a highly versatile functional group (Figure 8). It can be cleaved under basic conditions to provide the corresponding secondary amide (98→108). Even more interesting is its reduction giving rise to an N-methyl amide via the hydroxymethyl intermediate (98→107). Alternatively, the N-formyl group undergoes a Lewis-acid induced nucleophilic aminomethylation reaction with allyl trimethylsilane (104→103). These transformations provide novel routes to tertiary amides that are otherwise only difficultly accessible. Morevover, the resulting N-formyl mixed imide can be readily converted to dihydropyridones (105→106). The isonitrile based methods briefly described above have sparked major new opportunities to generate a range of amidic subtypes. This work will be described in a separate treatment.
In this account, we have provided some of the key findings that have brought us to the threshold of realizing our impractical goal of 2003, i.e., a total synthesis of homogeneous erythropoietin. Obviously, there are still significant remaining issues. It may well be necessary to develop new chemistry to deal with additional difficulties as they arise. Nonetheless, it is fair to say that the long term goal of a total synthesis of erythropoietin has encouraged exploration of new ideas in amide-directed acylation. Many of these discoveries are likely to prove applicable to other biologic-level targets. In that sense, it can well be argued that the EPO journey has already been a success.
Cindy Kan began her chemistry training as an undergraduate at Barnard College. While at Barnard, she worked with Professor Christian M. Rojas on the use of acyl azides in amidoglycosylation reactions. She then went on to Stanford University to pursue her doctoral studies under the guidance of Professor Paul A. Wender. Her graduate work focused on the development and synthesis of bryostatin analogs as well as synthetic studies on daphnane diterpene orthoester natural products. After receiving her doctoral degree, she returned to New York City for a postdoctoral position with Professor Samuel J. Danishefsky at Memorial Sloan-Kettering Cancer Center. Her work in the Danishefsky lab has centered on the chemical synthesis of homogeneous glycoproteins.
Samuel J. Danishefsky received his B.S. degree at Yeshiva University. He did his graduate research under the late Professor Peter Yates. He then joined the laboratory of Professor Gilbert Stork at Columbia University as an NIH Postdoctoral Associate. His first academic position was at the University of Pittsburgh, where he joined as Assistant Professor in 1963. He was promoted to Associate Professor, Professor, and University Professor. In January 1980, he moved to Yale University and was named Eugene Higgins Professor in 1981. Appointed by President A. Bartlett Giamatti as Chairman of the Department of Chemistry, he served until 1987. He became Sterling Professor at Yale in 1990. In 1993, Professor Danishefsky moved back to New York as Professor of Chemistry at Columbia University and the Eugene Kettering Chair and Head of the Laboratory of Bioorganic Chemistry at Memorial Sloan-Kettering Cancer Center. In 1996, he shared the Wolf Prize in Chemistry with Professor Gilbert Stork. In 2006, he was the recipient of the Franklin Medal in Chemistry, the Bristol Myers Squibb Lifetime Achievement Award in Chemistry, and the National Academy of Sciences Award in the Chemical Sciences.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.