|Home | About | Journals | Submit | Contact Us | Français|
The Ala1—Gly28 glycopeptide fragment (28) of EPO was prepared by chemical synthesis as a single glycoform. Key steps in the synthesis include attachment of a complex dodecasaccharide (7) to a seven amino acid peptide via Lansbury aspartylation, native chemical ligation to join peptide 19 with the glycopeptide domain 18, and a selective desulfurization at the ligation site to reveal the natural Ala19. This glycopeptide fragment (28) contains both the requisite N-linked dodecasaccharide and a C-terminal αthioester handle, the latter feature permitting direct coupling with a glycopeptide fragment bearing N-terminal Cys29 without further functionalization.
As a part of our continuing effort to bring chemical synthesis to the realm of “biologics”, we are pursuing the preparation of homogeneous glycoproteins.1,2 In contrast to biochemical methods that do not allow for homogenous expression of glycoproteins, de novo chemical synthesis offers precise structural control for the preparation of homogeneous products while potentially deconvoluting key structure—function relationships of such complex structures. Central to our efforts to apply chemical synthesis to the preparation of “biologics” is our proposed total synthesis of erythropoietin.
Erythropoietin (EPO, 1, Figure 1) is a glycoprotein hormone used to treat anemia associated with renal failure and cancer chemotherapy.3 Extensive efforts have been made to study and understand the structure and function of EPO, including the role of glycosylation. Specifically, Higuchi et al. demonstrated that erythropoietin’s (rhuEPO) three N-linked glycosyl groups were not required for in vitro activity but were required for in vivo activity, while the single O-linked glycosyl group did not bear any biological role.4 Furthermore, the in vivo activity of EPO was shown to be directly related to the sialic acid content.5 Interestingly, Kent and coworkers have synthesized a polymer-modified EPO that demonstrated superior in vivo activity compared to the glycosylated rhuEPO.6 Despite the impressive biological and chemical studies that have been made to understand the structure and function of EPO, efforts to determine the exact biological role of defined EPO glycoforms have been hindered by difficulties associated with isolating significant quantities of homogeneous glycoforms.7 In fact, this has not been accomplished by any biologically enabled means. As discussed in the preceeding papers,1 a potentially powerful solution to this problem is chemical synthesis, which would potentially enable access to homogeneous EPO glycoforms and related analogues.
Chemical synthesis of glycoproteins or glycopeptides bearing complex carbohydrates has only recently been realized. This is in sharp contrast to the plethora of non-glycosylated proteins that have been prepared, aided by the power of native chemical ligation8,9 in polypeptide synthesis. Using native chemical ligation (NCL), Kajihara and Dawson reported the first synthesis of a complex glycoprotein, i.e., a single glycoform of monocyte chemotactic protein-3 containing human complex sialyloligosaccharide.10 Erythropoietin glycopeptide fragments containing complex sialyloligosaccharides have been prepared by our group1,2d using either NCL or direct condensation methods.2f,11 Recently, an asialo erythropoietin glycopeptide fragment was reported by Kajihara,12 in which a serine ligation was effectively employed. Clearly, the chemical synthesis of complex glycoproteins and glycopeptides remains a significant and daunting challenge.
Our program directed to reaching EPO by total synthesis has already yielded valuable contributions to the rapidly growing field of (glyco)peptide ligation tools and methods. Several different C-terminal functional groups have been employed in our glycoprotein synthesis studies, including cyanophenolic esters1b and nitrophenolic esters.2h Our ortho-disulfide phenolic ester2i was developed to serve as a stable, latent thioester. These C-terminal groups have enabled orthogonal ligations such that multiple polypeptide and glycopeptide couplings can be accomplished from the N→C terminus.1b,2f Our investigations into cysteine-free ligation methods have included the development of a thiol auxiliary that permits coupling of complex glycopeptide fragments,2c,13 the discovery of a two-component isonitrile/carboxylic acid coupling method to construct amide bonds,14 and alanine,2g,15 valine,16,17 and homocysteine18,19 ligations, which utilize a mild and selective desulfurization method that is also compatible with complex carbohydrates.20 The synthesis of homogeneous glycosylated EPO(1-28) (2, Figure 2), described below, serves to illustrate the value of “alanine” ligations.
Our vision for the assembly of erythropoietin emphasizes maximum convergency. It projects the synthesis of three glycopeptide fragments that will subsequently be merged. Indeed the syntheses of EPO(78-166) and EPO(29-77) have been described earlier in this series.1 In each case we were able to overcome potentially serious complications inherent in those domains. The EPO(1-28) segment is the shortest of the three peptide fragments, containing less than 20% of the EPO sequence. As the smallest fragment and (potentially) the last fragment in the synthesis of EPO (assuming a linear synthesis from the C→N-terminus), it may appear to be the simplest of the segments. However, the inherent challenge of the EPO(1–28) segment lies in the absence of any functional cysteine and glycine/proline residues upon which to base a retrosynthetic analysis. This peculiarity precluded the use of either NCL or direct condensation methods21 for its assembly. This is critical because during our initial studies, and as reported in the preceeding paper,1a we found that the direct attachment of dodecasaccharide 722 to Asp24 of the EPO(1–28) peptide sequence 323 via Lansbury aspartylation24 was unsuccessful, providing only aspartimide by-products. In contrast, the joining of disaccharide 5 and hexasaccharide 6 to the 28-residue peptide 3 by Lansbury aspartylation proceeded in 70% and 30% yields, respectively. Such unacceptably poor reactivity must surely be the consequence of the increased steric bulk presented by dodecasaccharide 7, rendering aspartimide formation not only kinetically competitive (as with 6), but dominant.
Recognizing the need to employ a smaller peptide to permit functional aspartylations of larger glycosylamines, we revised our plan for the synthesis of the EPO(1–28) fragment to include a strategic dipeptide scission between Ala19 and Lys20. We first favored this disconnection with the goal of applying our TCEP-assisted phenolic ester-directed amide coupling method1f to assemble the fragment, despite concerns about the potential for epimerization at Ala19. Under this direction, the protected Lys20-Gly28 sequence (9) was prepared and effectively joined with dodecasaccharide 7 by Lansbury aspartylation to provide, after Fmoc removal, our target fragment 11 (35%). Efforts to ligate the FmocHN-Ala1-Ala19-CO2Ph(o-SSEt) fragment (13) with glycopeptide 11 using our direct condensation method proceeded successfully to give 12, but the stereo-integrity of the alanine ligation site (see asterisk) could not be verified.
One way to determine whether the condensation product (12) suffered from epimerization during the key peptide coupling is to compare the retention times of 12 and the corresponding D-Ala19 diastereomer. Of course, we recognized the possibility that the two diastereomers might co-elute. Following the same sequence of steps as shown in Scheme 2, and using the D-Ala19 peptide 29, the diasteroemer 30 was prepared (Scheme 3). Co-injection of the two glycopeptides yielded a single peak, offering no resolution. Increasing the concentration of the D-Ala19 diastereomer did not change the peak shape. Despite these efforts, the stereo-integrity of the alanine ligation site remained uncertain.25
Given the knowledge that the attachment of dodecasaccharide 7 to a shorter sequence was successful, we turned to an alternative approach to address the issue of stereo-integrity. As stated earlier, the absence of any useful cysteine and glycine/proline residues precluded the use of either NCL or direct condensation methods. However, it was noted that several alanine residues are present in close proximity to Asp24, the site for glycosyl attachment. An established extension of NCL has been the conversion of cysteine to alanine via desulfurization,12a,b which enables ligation at alanine sites with the benefits of NCL (e.g., avoidance of epimerization, chemoselectivity). Recently, our laboratory had developed a desulfurization method that is compatible with oligosaccharides and is both extremely mild and selective.1g We anticipated that the versatility of this method would readily accommodate the different functional groups present within EPO(1–28), thus enabling the use of NCL to address our earlier problems in the synthesis of the EPO(1–28) fragment.
The EPO(1-28) fragment contains two alanine residues (Ala19 and Ala22) in close proximity to the N-glycan, either of which could serve as the ligation site. We elected to implement a ligation between Glu21 and Ala22 to obtain a shorter (glyco)peptide segment that would be more suitable for the essential Lansbury aspartylation as it is (1) smaller in size and (2) free of Lys20 and Glu21, the side chains of which would necessarily be protected during the aspartylation reaction. This disconnection also yields the longer 21-amino acid peptide terminating at Glu21. It should be noted that Botti has demonstrated that NCL at C-terminal glutamates and aspartates requires the side chains to be protected to avoid formation of the unnatural γ-amide bond during ligation.26 While inconvenient, a glutamate protecting group would be necessary regardless of which disconnection was selected.
The requisite glycopeptide segment 18 was prepared, as shown in Scheme 4. Starting from peptide 14 (95% yield via solid phase peptide synthesis), condensation with ethyl thiopropionate followed by removal of the t-butyl groups provided thioester 15. Attachment of dodecasaccharide 7 to peptide 15 via Lansbury aspartylation followed by in situ Fmoc cleavage proceeded in moderate yield to afford glycopeptide 17 in 65% yield over the two-step sequence. Finally, deallylation with Pd(PPh3)4 and PhSiH3 afforded the NCL partner 18 in 90% yield.
Under NCL conditions, glycopeptide 18 was coupled with the longer peptide segment 1927 to yield the desired ligation product 20, as well as the corresponding thiolactone, 21 (Scheme 5). Allowing the reaction to stir longer resulted in what was thought to be exclusive formation of the thiolactone (vide infra), which was isolated in 20% yield. Following ligation, the allyl protecting group on Glu21 was removed to avoid complications during the critical desulfurization.
Before desulfurization could be attempted, it was necessary to free the cysteine sidechain from the thiolactone. Treatment of 22 with thiopropionic acid effectively opened the thiolactone to afford the acyclic 23 (Scheme 6A); however, a by-product (24), identified as EPO(1-21)COS(CH2)2CO2H, was also formed during the reaction. There are several possible explanations for the formation of this product. One possibility is that during the NCL reaction between 19 and 18, a competitive intramolecular NCL within glycopeptide 18 had led to lactam 26,28 which subsequently underwent a transthioesterification with 19 to afford thioester 27 (Scheme 6B). Following removal of the allyl side chain group, the αthioester bond was readily cleaved in the presence of thiopropionic acid to give the observed by-product. Alternatively, it is possible, though less likely, that under the mildly acidic conditions used to open the thiolactone, the Glu21-Cys22 amide bond underwent an N→S acyl tranfer to generate the corresponding αthioester, which was subsequently cleaved to give the observed αthiopropionate ester.
Despite the loss of material, we were ready to test glycopeptide 23 in the key desulfurization reaction (Scheme 7). Treatment of 23 with VA-044 (a water soluble radical initiator), TCEP, and thiopropionic acid in buffered conditions at 37 °C cleanly afforded the reduced product 28 (67% yield). The use of thiopropionic acid as the radical propagator also served to open any thiolactone that formed during the reaction. The final product, 28, features both the biantennary glycan and a C-terminal αthioester, two critical features necessary for the convergent preparation of synthetic homogeneous EPO.
As described above, we have prepared the Ala1—Gly28 glycopeptide fragment (28) of EPO by chemical synthesis. Key steps in the synthesis included attachment of a complex dodecasaccharide (7) to a seven amino acid peptide via Lansbury aspartylation, native chemical ligation to join peptide 19 with the glycopeptide domain 18, and a selective desulfurization at the ligation site to expose the natural Ala19. This fragment presents both the requisite N-linked dodecasaccharide and a C-terminal αthioester handle, the latter feature permitting direct coupling with a glycopeptide fragment bearing N-terminal Cys29 without further functionalization.
In summary the preparation of this Ala1-Gly28, in the context of the accompanying reports, describing the syntheses of EPO(29-77), featuring the N-linked glycan, and EPO(78-166), presenting both the N-linked glycan and the O-linked glycophorin, suggests that the realization of our ultimate goal, i.e., biologically active homogeneous synthetic erythropoietin, is within reach. The assembly of these three fragments, corresponding to a full length homogenous erythropoietin, represents the closest and most convergent approach to the chemical synthesis of a single EPO glycoform. The primary drawback in our synthetic efforts is the limited availability of the complex dodecasaccharide 7. We note that much of the difficulty arises from the lack of commercial availability of the key building blocks. This situation could well change as the role of oligosaccharide chemistry in the synthesis of biologics becomes better understood.
We also think that the new amide bond forming chemistry delineated in this project, and summarized in the background section, could well allow for the combination of these fragments, recognizing that, with all complex target oriented total synthesis, intervention of the unexpected is predictable. That being said, we are hopeful that the convergent nature of our synthesis and its flexibility will enable its adaptation to reach our goals.
This work was supported by the National Institutes of Health (Grant CA28824). Postdoctoral fellowship support is gratefully acknowledged by C.K. (Grant Number T32 CA062948 from the National Cancer Institute), by B.W. (New York State Department of Health, New York State Breast Cancer Research and Education Fund), and by Q.W. (Mr. William H. Goodwin and Mrs. Alice Goodwin and the Commonwealth Foundation for Cancer Research, and the Experimental Therapeutics Center, SKI). We thank Dr. George Sukenick, Ms. Sylvi Rusli, and Ms. Hui Fang of the Sloan-Kettering Institute’s NMR core facility for mass spectral and NMR spectroscopic analysis (SKI core grant no. CA02848). We thank Ms. Rebecca Wilson for editorial counsel.
Supporting Information Available. Experimental procedures and spectroscopic and analytical data for all new compounds. This material is available free of charge via the internet at http://pubs.acs.org.