|Home | About | Journals | Submit | Contact Us | Français|
The gene clusters responsible for the biosynthesis of two anti-tumor antibiotics, ravidomycin and chrysomycin, have been cloned from Streptomyces ravidus and Streptomyces albaduncus, respectively. Sequencing of the 33.28 kb DNA region of the cosmid cosRav32 and the 34.65 kb DNA region of cosChry1-1 and cosChryF2 revealed 36 and 35 open reading frames (ORFs), respectively, harboring tandem sets of type II polyketide synthase (PKS) genes, d-ravidosamine and d-virenose biosynthetic genes, post-PKS tailoring genes, regulatory genes, and genes of unknown function. The isolated ravidomycin gene cluster was confirmed to be involved in ravidomycin biosynthesis through the production of a new analogue of ravidomycin along with anticipated pathway intermediates and biosynthetic shunt products upon heterologous expression of the cosmid, cosRav32, in Streptomyces lividans TK24. The identity of the cluster was further verified through cross complementation of gilvocarcin V (GV) mutants. Similarly, the chrysomycin gene cluster was demonstrated to be indirectly involved in chrysomycin biosynthesis through cross-complementation of gilvocarcin mutants deficient in the oxygenases GilOII, GilOIII, and GilOIV with the respective chrysomycin monooxygenase homologues. The ravidomycin glycosyltransferase (RavGT) appears to be able to transfer both amino- and neutral sugars, exemplified through the structurally distinct 6-membered d-ravidosamine and 5-membered d-fucofuranose, to the coumarin-based polyketide derived backbone. These results expand the library of biosynthetic genes involved in the biosyntheses of gilvocarcin class compounds that can be used to generate novel analogues through combinatorial biosynthesis.
Ravidomycins (RMs; 1, 2) and chrysomycins (CMs; 8, 9) are potent antibacterial and antitumor antibiotics isolated from the culture broths of Streptomyces ravidus and Streptomyces albaduncus, respectively.[1–3] Both antibiotics share a polyketide derived tetracyclic coumarin-based benzo[d]naphtho[1,2-b]pyran-6-one moiety, also found in other analogues such as the gilvocarcins (5–7), FE35 A (3) and B (4), BE 12 406A (15) and B (16),[5,6] Mer 1020 dA-D (10–13), and the recently isolated polycarcin V (14). However, they differ in their C-glycosidically linked sugar moieties: ravidomycin V (RMV, 1, also known as ravidomycin) possesses an amino pyranose, 4′-O-acetyl-d-ravidosamine, whereas chrysomycin V (CMV, 9, also known as chrysomycin A) possesses a branched neutral pyranose, d-virenose.[4,9–11] RMV shows a broad range of biological activities, such as antibacterial and antitumor both in vivo and in vitro. Such activities primarily arise from the inhibitory effect of 1 on bacterial DNA and RNA synthesis. In addition, excellent cytotoxicity of 1 was reported for human colon carcinoma cell lines.[1,11] Furthermore, 1 has been reported to act as potent inhibitor of human topoisomerase II. Similarly, antitumor activities of CMV have been reported against various cancer cell lines.[2,3,13] Despite all these reports of biological activities, the exact mechanism of action of 1 and 9 at the molecular level still remains elusive. The members of this class of antibiotics are also often called gilvocarcin-type anticancer agents; their biological activity is light-dependent. Mechanistic studies of gilvocarcin V (GV, 7) showed its equilibrium DNA binding and light-induced DNA adduct formation through [2+2] cycloaddition of the vinyl side chain with thymidine residues of DNA besides its strong DNA intercalation property. GV-mediated selective cross-linking of DNA with histone H3 in the presence of light represents an unique mode of action. The common coumarin-based structural backbone and light enhanced anti-tumor/antibacterial activities indicate that 1, 9 and 7 share a similar mode of action. (Scheme 1)
Although there are no direct experimental reports on activity comparisons of RMs with the other analogues, the literature indicates 1 to be the most active compound against tumor cell lines. This is a clear indication that the principally more hydrophilic amino sugar moiety plays a crucial role in the enhanced biological activities of RMV compared to its relatives. Such amino compounds also provide better formulation and delivery options, either as salt forms or for the loading into nanoparticles.[18–20] Chemical modifications of 1 and structure–activity relationship (SAR) studies suggested that the 2′- and 4′-hydroxyl groups of d-ravidosamine are essential for its biological activities. Further development of RM was hindered by its inherent cytotoxicity, thereby necessitating the generation of new analogues with better pharmacological profiles. More recently, total syntheses of 1, 5 and 6 have been reported.[21–24] However, complicated synthetic routes and poor yields hindered the generation of new analogues for further SAR studies. Future attempts to generate potentially valuable ravidomycin and chrysomycin analogues will require a detailed understanding of the role and orchestration of their biosynthetic enzymes. Glycodiversification provides a unique means to generate ravidomycin and chrysomycin analogues but it requires flexible glycosyltransferases (GTs) and a library of NDP-sugars.[25,26] Unfortunately, only a handful of C-GTs are known to possess marginal substrate promiscuity,[27–29] but no C-GTs capable of transferring amino or branched sugars have been reported except for Med-ORF8 (TDP-angolosamine). In this context, characterization of the 4′-O-acetyl-NDP-d-ravidosamine (45) and NDP-d-virenose (37) biosynthetic enzymes and the corresponding GTs that establish the C-glycosidic linkage to the coumarin backbone is particularly intriguing. Incorporation experiments showed that the gilvocarcin, ravidomycin, and chrysomycin backbone derived from acetate and propionate, and it was suggested that the pathway involves an oxidative rearrangement of an angucyclinone intermediate.[31–33] We recently isolated the GV gene cluster and characterized various genes through inactivation and cross complementation experiments.[14,34–37] However, our efforts to address the biosynthetic sequence of key post-PKS tailoring events and their relevant intermediates were hampered by the resistant nature of the producer strain for genetic manipulations, and the inability of several mutant constructs to express in a heterologous host. Cloning and characterization of RM's and CM's gene clusters provide additional pathways to further explore these issues, while providing crucial toolkits to generate new gilvocarcin-type natural products through combinatorial biosynthesis.[38–45] In this context, we have cloned and sequenced the RM and CM biosynthetic loci, and also have produced a new ravidomycin derivative through the heterologous expression of its gene cluster. Also reported here is a C-GT that accepts a structurally most distinct NDP-d-fucofuranose in addition to its natural substrate NDP-d-ravidosamine.
A library screening strategy was taken to clone the RMV and CMV biosynthetic gene clusters. Genomic libraries of S. ravidus and S. albaduncus were constructed in the pOJ446 E. coli-Streptomyces shuttle vector. Because it was obvious that the RMV and CMV polyketide cores derive from the activities of type II PKS enzymes[46–48] involving a ketosynthase α (KSα), and that their corresponding deoxysugar biosyntheses involve an NDP-sugar 4,6-dehydratase (4,6-DH), the corresponding genes were used to probe the genomic library. The primers based on the conserved residues of GilA (encodes the KSα in the GV pathway), and its homologues in the database resulted in the expected amplifications from S. ravidus and S. albaduncus genomic DNA. Similarly, nucleotide fragments of 4,6-DH encoding gene were amplified using a previously reported forward primer, but a modified reverse primer (Figure S3). Cosmid DNA was isolated from colonies hybridizing with the KSα and the 4,6-DH probes. Out of approximately 2000 colonies screened from each genomic library, four and three different colonies, respectively, from the S. ravidus and S. albaduncus library, respectively, were hybridized with both probes. One cosmid from each library (cosRav32 and cosChry1-1 from the S. ravidus and S. albaduncus library, respectively) was sequenced through a shotgun approach using the HyperMu™ MuA Transposase kit (Epicentre) following the reported protocol.[50–53] A third probe was constructed for screening S. albaduncus genomic DNA based on conserved residues from type II PKS-associated ketoacyl reductases, which resulted in three additional positively hybridized colonies. Bioinformatic analyses of the cosRav32 and the overlapping cosmids cosChry1-1 and cosChryF2 sequence using Frame plot and the NCBI BLAST database revealed clusters of 36 and 35 open reading frames (ORFs), respectively. Both the ravidomycin and chrysomycin gene clusters were flanked by ORFs with functions irrelevant to RMV and CMV biosynthesis, respectively. The organization of the clusters and the putative function of the genes are summarized in Figure 1 (Tables S1 and S2 in the Supporting Information). The sequence data have been deposited in the EMBL nucleotide database under the accession numbers FN565 485 (rav cluster) and FN565 166 (chry cluster).
Nine genes in the RMV and CMV clusters were identified to be involved in the biosynthesis of the polyketide backbone. The encoded proteins of ravA, ravB and ravC show typical homologies to the various minimal PKS ketoacyl synthases (KSα), chain length determinants (CLF or KSβ), and the acyl carrier protein (ACP), respectively), enzymes typical of a type II PKS system.[46–48] Likewise, chryA, chryB, and chryC represent the CMV counterparts. Interestingly, both clusters possess an additional acyl carrier protein gene (ravC1 and chryC1, respectively) between two cyclase genes (that is, ravK/chryK and ravG/chryG). Despite their close homologies, the corresponding ACPs might be specific for the selection and priming of either the acetate or the propionate starter unit.[54–59] Typically only one ACP has been reported for well-characterized type II PKS clusters, such as the aclacinomycin A, doxorubicin, aranciamycin, jadomycin B, actinorhodin, medermycin, and tetracenomycin C Even the closely related GV biosynthetic gene cluster contains only one ACP, despite the fact that GV as well as the anthracyclines aclacinomycin and doxorubicin also use propionate starter units. However, two ACP encoding genes were found previously, e.g., in the gene clusters of the frenolicin and the R1128 pathways, and it was discussed that the second ACP might serve as an intermediate anchor on the diketide stage. The second ACP in the R1128 pathway was identified as part of an initiation module that is responsible for starter-unit selection and catalysis of the first chain elongation step, and the additional ACP was found to be indispensable for the incorporation of nonacetate primer units. The products of ravP/chryP and ravQ/chryQ exhibit their closest homologies to gilP and gilQ, corresponding to malonyl CoA:ACP transacylases (MAT, also abbreviated as MCAT) and acyltransferases (AT) in the database, respectively. Based on sequence homologies and inactivation experiments of gilP and gilQ (unpublished results), we favor RavQ/ChryQ to be responsible for the selection of propionate as a starter unit which ends up as the C-8-bound vinyl side chain in 1 and 9. Given that type II PKSs often recruit the MAT from fatty acid biosynthesis, a process known as cross-talk, the presence of such genes seemed unique. Although it has been shown that a number of type II polyketide ACPs undergo rapid self-acylation from malonyl-CoA in the absence of an MAT, this observation has been often ascribed to contamination with Escherichia coli MAT (FabD) rather than an intrinsic property of the ACP. However, recent work with a synthetic holo-ACP from the actinorhodin pathway of S. coelicolor showed that this ACP can undergo self-malonylation in the presence of malonyl-CoA, and careful controls ruled out the possibility of contamination with E. coli MAT, proving that self-malonylation is indeed an inherent activity of the ACP of the actinorhodin PKS. The fact that we found MAT encoding genes in all three biosyntheses of gilvocarcin-type compounds suggests that the ACPs of their biosynthetic pathways cannot undergo self-malonylation. Two cyclases (encoded by ravK/chryK and ravG/chryG), and a ketoreductase (encoded by ravF/chryF) are the other PKS-associated enzymes that are hypothesized to catalyze multiple cyclizations and a ketoreduction step en route to homo-UWM6 (17).
As expected, the post-PKS tailoring enzymes identified in the RMV and CMV clusters are similar to those of their GV counterpart. Four different oxygenase encoding genes (ravOI/chryOI, OII, OIII and OIV) were identified in the RMV and CMV clusters. Recent results showed that the homologues of RavOIV/ChryOIV, JadF and GilOIV, both catalyze identical reactions (i.e., 2,3-dehydration of 17 and oxidation at the C-5 position of 19) despite the different end products of their respective pathways (Scheme 2).
Similarly, RavOI/ChryOI homologues, JadH and GilOI, also show identical activities: both catalyze 4a-12b dehydration of 23, and might also be involved in C–C bond cleavage, presumably through Baeyer–Villiger monooxygenation.[71–75] Based on these similarity studies, we propose that RavOIV/ChryOIV and RavOI/ChryOI are functionally identical to GilOIV and GilOI of the GV cluster, respectively. The translated protein sequences of ravH/chryH exhibit similarity to various NADPH-dependent FMN reductases, including the recently characterized GV pathway counterpart GilH. We assume RavH/ChryH generates the reduced coenzyme FADH from NADPH and FAD, which is necessary for RavOI,OIV/ChryOI,OIV during the oxygenation cascade.[19–25] The product sequences of ravOII/chryOII show their closest homologies to GilOII of the GV cluster (68%:77% and 65%:76% identity/similarity, respectively) and other putative anthrone oxygenases. Similar to GilOII of the GV cluster, RavOII/ChryOII might introduce a hydroxyl group at the naphthalene moiety of 26; this hydroxyl could later react with the aldehyde group resulting from the oxidative C–C bond cleavage to generate hemiacetal 29. The protein sequences encoded by ravOIII/chryOIII exhibit similarities to various cytochrome-P450 monooxygenases (CYPs), including the recently characterized GilOIII of the GV cluster. Through inactivation studies, we previously demonstrated the role of GilOIII in installing the C-8 vinyl side chain of GV. Therefore, RavOIII/ChryOIII are most likely involved in C-8 vinyl side chain formation in their respective pathways. A gene encoding a homologue of ferrodoxin protein was identified in both clusters (ravY/chryY). Because most of the cytochrome-P450 monooxygenases require ferredoxin as an electron carrier to carry out their oxidative activities, we assume that the product of ravY/chryY might supply ferredoxin to RavOIII/ChryOIII. However, a homologous gene encoding a ferredoxin protein was not found in the GV cluster. But it is well known that such genes are often located outside the secondary metabolite biosynthetic gene clusters.
Sequence analyses showed that the stop codon of ravOII/chryOII was translationally coupled with the start codon of a relatively bigger ORF (ravRM/chryRM), the translated N-terminal protein sequence of which shows homology to FAD-dependent oxidoreductases, including the recently characterized GilR, whereas the C-terminal region is similar to thiopurine-S-methyltransferase superfamily proteins from various organisms, GilM of the GV pathway being the closest homologue. The fact that GilR installs the lactone of GV at the last step of the biosynthetic pathway was recently confirmed both in vivo and in vitro. Similarly, inactivation of GilM demonstrated its involvement in GV biosynthesis, but its exact role in the biosynthetic pathway remained elusive due to the instability of the accumulated product of the GilM− (unpublished results). It has been proposed that GilM might be involved in C–C bond rotation of 27 and formation of the hemiacetal moiety in 29 via 28 (Scheme 2). Therefore, the fusion of the genes encoding these functionally distinct Gil-homologues in the RM and CM clusters is particularly notable. Based on the homology analyses and comparison to the gilvocarcin biosynthetic genes, we believe ravMT/chryMT to encode O-methyltransferases responsible for possibly both O-methylations of 29.
A total of five genes encoding deoxysugar biosynthetic enzymes were identified in the ravidomycin cluster. The products of ravD and ravE resemble various NDP-glucose synthases and NDP-glucose-4,6-dehydratases from Streptomyces sp., which generate NDP-4-keto-6-deoxy-d-glucose from TMP and glucose-1-phosphate (Scheme 3). The product of ravIM shows its closest homology to NDP-hexose-3,4-ketoisomerase ORF2 (51%:68% identity/similarity) and TyL1a (62%:48%) from the spiramycin and mycaminose biosynthetic pathways, respectively. In addition, RavIM also exhibits remarkable identity (39%) to the TDP-4-keto-6-deoxy-d-glucose-3,4-ketoisomerase (FdtA) from Aneurinibacillus thermoaerophilus. FdtA has been demonstrated to catalyze the reaction of TDP-4-keto-6-deoxy-d-glucose (35) to TDP-3-keto-6-deoxy-d-galactose (38) during the biosynthesis of surface layer 3-acetamido-3,6-dideoxy-α-d-galactose moieties of the bacterial surface glycan chain, whereas Ty L1a acts on the same substrate but alternatively generates the product TDP-3-keto-6-deoxy-d-glucose (44) with an equatorial-facing hydroxyl group at the C-4 position. Despite the closer similarity of RavIM with TyL1a, we propose that RavIM catalyzes an isomerization reaction identical to that of FdtA (Scheme 3) generating an axial hydroxyl group at the 4-position, which is later acetylated to furnish 4′-O-acetyl-d-ravidosamine. If this hypothesis is true, RavIM will be the first enzyme from a secondary metabolite biosynthetic pathway to have such an activity. ORFs ravAMT and ravNMT are homologous to various proposed/characterized deoxysugar amino-transferase encoding genes and to genes of N,N-dimethyltransferases, respectively.[80–85] In context with RM biosynthesis, we assume that RavAMT catalyzes the transamination of 38 to generate 39, which undergoes subsequent methylation of the amino-nitrogen to generate 40 through the activity of RavNMT.
The product of ravW shows modest similarity to GCN-5 related N-acetyltransferase (GNAT)-family enzymes that catalyze the transfer of an acetyl group from acetyl-CoA to the primary amine of a variety of acceptor substrates. This suggests the possibility of RavW-mediated N-acetylation of 39 followed by the migration of the acetyl group to the 4′-hydroxyl group (41→42) during NDP-O-acetyl ravidosamine biosynthesis. However, the presence of significant amounts of deacetylravidomycin (2) in the culture broth of S. ravidus indicates that RavW might also act as an O-acetyltransferase, thereby directly acetylating the axial 4′-hydroxyl group of 40 completing the 4′-O-acetyl-ravidosamine moiety of RMV prior to or after the glycosyltransfer step (Scheme 2). Further experiments are necessary to clarify the details of these late steps in RM biosynthesis.
The GT responsible for the C-glycosidic linkage between the aromatic backbone and amino sugar of RMV is particularly interesting from a biosynthetic point of view as a unique tool for further combinatorial biosynthetic studies. We have identified a GT-encoding gene, ravGT, upstream of ravNMT. The deduced amino acid sequence of RavGT exhibits high identity/similarity to various O- and C-GTs involved in secondary metabolite biosyntheses such as LanGT2 (47%:67%), GilGT (46%:66%), Med-ORF8 (48%:65%) and UrdGT2 (42%:60%) from the landomycin, GV, medermycin and urdamycin biosynthetic pathways, respectively.[30,86,87] Unlike GilN of the GV pathway, there are no additional GT-encoding homologues in the RM cluster, the products of which could potentially be involved or assist C-glycosylation. Therefore, we assume that RavGT catalyzes the attachment of the NDP-ravidosamine to 29, establishing a C–C bond. Here, it is important to note that UrdGT2, GilGT and IroB are the only C-GTs to be characterized at the functional level. The former two enzymes use NDP-deoxysugars as donor substrates whereas the latter uses UDP-d-glucose. There is obvious similarity of the gilvocarcin-type GTs regarding both the NDP-sugar donor substrate and the coumarin-like acceptor substrate to ChaGT1, a GT that is presumably responsible for the O-glycosidic linkage of a d-fucopyranose to the chartreusin aglycone, which resembles the coumarin-like polyketide core of the gilvocarcins. The sequence comparison here yielded the strongest relation of ChaGT1 to RavGT (32%:48% aa identity/similarity), then to ChryGT (32%:46%) and GilGT (27%:42%).
Given the fact that the sugar donor substrate flexibility of a GT determines its potential applicability to generate novel gilvocarcin-type analogues through glycodiversification strategies, we were curious to preliminarily explore the flexibility of RavGT. The mutant strain of the GV pathway (S. lividans TK24/cosG9B3-GilG), in which gilGT was deleted in-frame, was the ideal host to test the glycosylation activity of ravGT. This strain harbors the entire GV biosynthetic genes but lacks the GT (encoded by gilGT), thus providing the yet unidentified sugar acceptor substrate and the NDP-d-fucofuranose. Therefore, we attempted the cross-complementation of the GilGT− mutant with the ravGT expression construct (pRavGT). Interestingly, the complementation of the mutant restored gilvocarcin E (GE) production (Figure S1). Accumulation of GE instead of GV was not surprising as the activity of GilOIII, which is responsible for the vinyl group formation, was suppressed in the GilGT− mutant due to polar effects inherited by the inactivation. UrdGT2 is reportedly the only known C-GT to possess flexibility for both sugar donor and sugar acceptor substrates.[27,28] More recently, the generation of 4′-hydroxygilvocarcins through the inactivation of the ketoreductase GilU has demonstrated the flexibility of GilGT towards its donor substrate in the GV pathway. However, GilGT could not accept a variety of NDP-aminosugar donor substrates tested in our laboratory (unpublished results). Our current report provides the first example of a C-GT that can utilize both a neutral 5-membered NDP-d-fucofuranose and a basic 6-membered NDP-aminopyranose (NDP-d-ravidosamine) as sugar donor substrates.
The C-glycosidically linked d-virenose is a unique structural feature of CMs. d-virenose has an equatorial methyl group at C-3′ position and an axial C-3′ hydroxyl group, otherwise it is identical to d-fucopyranose. Analysis of the CM cluster revealed a set of four genes that could potentially be involved in the biosynthesis of NDP-d-virenose. The products of chryD and chryE catalyze the first two biosynthetic steps to generate the common intermediate 35. ChryCMT is proposed to catalyze the equatorial C-3 methylation of this intermediate. The deduced sequence of ChryCMT shows typical homology to several putative/characterized deoxysugar SAM-dependent C-methyltransferases from various organisms. The product encoded by chryU reduces the 4-keto group of 36 to afford NDP-d-virenose (37). Gene chryGT is the only candidate C-GT gene identified in the CM cluster, the product of which might establish the C–C bond between the PKS-derived coumarin backbone and C-1 of NDP-d-virenose. In fact, this gene encodes the first C-GT that potentially utilizes a branched sugar (NDP-d-virenose) as a natural sugar donor substrate. The substrate specificity of this enzyme needs to be explored to determine its future potential for combinatorial biosynthetic attempts towards new GV analogues.
RavX4 and ravX5 represent two regulatory genes, the corresponding amino acid sequences of which resemble transcriptional regulatory proteins from various Streptomyces sp. in the RMV cluster. The function of the corresponding enzymes of six other ORFs (ravX2, X3, J, X6, X7 and X8) could not be assigned through homology studies. The products of ravX and X1 represent homologues of phenylacetic acid degrading enzymes which might not be related to the RM biosynthesis. Located at the right end of the cluster, the product of ravX8 shows close similarity to several hypothetical proteins of unknown function not involved in biosyntheses of natural product secondary metabolites, thereby indicating the end of the cluster.
Similarly chryX2, X3, X4, X5, X6, and X7 represent the regulatory genes in the CM cluster. The deduced amino acid sequences of genes chryX, X8 and X9 show similarity to spore transformation proteins, hexokinase from glycolysis and acyl-CoA dehydrogenase from β-oxidation, respectively, and thus are likely not involved in CM biosynthesis, therefore providing the boundaries of the CM cluster. Like in case of the gilL product of the GV pathway, the role of the chryL product for CM biosynthesis could not be assigned through data bank searches. The protein encoded by chryJ likely serves as an efflux pump necessary for the self-protection of S. albaduncus from its own product chrysomycin, much like GilJ (aa-identity/similarity between ChryJ/GilJ is 62%:75%).
Analysis of the DNA sequence of cosRav32 indicated that the cosmid harbors all of the necessary genes for the production of RM. This prompted us to attempt a heterologous expression of the entire RM cluster in S. lividans TK24. Such expression would utilize all the naturally occurring promoters, regulatory, resistance and RM biosynthetic genes encoded within cosRav-32. The cosmid was introduced into S. lividans TK24 through a well-established E. coli: Streptomyces inter-generic conjugation protocol. The yellowish green exconjugants obtained on the plate were subjected to liquid culture for metabolite production. HPLC analysis showed that there are at least six new metabolite peaks in the culture broth of the S. lividans TK24-(cosRav32) as compared to the control strain S. lividans TK24/pOJ446. UV spectra and retention time (tR) in HPLC/MS comparisons revealed that four of the six peaks represented 2,3-dehydroUWM6 (20, 2 mg L−1), 2,3-dehydro-homoUWM6 (19, 20 mg L−1), preGM-o-quinone (22, 0.2 mg L−1) and preGE-o-quinone (21, 1.3 mg L−1), see Figure 2.
The other two peaks (eluted at tR = 6.6 and 7.1 min) exhibited UV spectra typical to that of ravidomycin, however with a slight hypsochromic shift at one of the maxima from 392 nm to 380 nm for both compounds. This indicated that the C-8 vinyl side chain of 1 might be replaced with a saturated ethyl or methyl group. Low resolution ESI-MS analyses further showed that the mass of the first compound (m/z 510 [M+H]+ in positive APCI MS and 508 m/z [M–H]− in negative APCI MS) is reduced by 14 amu compared to that of the second (m/z 524 [M+H]+ in positive and m/z 522 [M–H]− in negative APCI MS); this indicates their real molecular weights to be 511 and 523, respectively. This suggested that these compounds differ from one another by one methylene group (-CH2-), preferably at the 8-sidechain, and that the C-4′-O-acetyl group is missing in both compounds. The molecular formula of the latter compound obtained through the high resolution EI-MS (523.2207 gmol−1) matched the predicted structure of deacetylravidomycin E (31, C29H33NO8). However, poor production (<50 μg L−1) of these metabolites did not allow a larger scale isolation necessary for NMR analysis. However, we reasoned that compound 31 could be obtained from S. ravidus by inhibiting its putative P450 oxygenase RavOIII with ancymidol, a known, effective CYP450 inhibitor.[91,92] Indeed, fermentation of S. ravidus in presence of ancymidol revealed a new metabolite in HPLC/MS, the HPLC retention time and mass of which seemed identical to the larger ravidomycin compound from S. lividans TK24(cosRav32).
About 4 mg of the compound was isolated using preparative HPLC and its structure was analyzed by NMR and HRMS. When comparing its 1H NMR with that of 1 (RMV), all signals of the RV protons were observed except for the vinyl and 4′-O-acetyl protons which typically appear at δ 5.5–7.0 and δ 2.0– 2.5, respectively. The presence of an additional triplet at δ 1.32 and quartet at δ 2.83 clearly indicated the presence of an 8-ethyl side chain instead of the 8-vinyl side chain, thereby confirming structure 31 as shown in Scheme 2. The structure of the compound was further confirmed by 13C NMR analysis (see NMR data in the Supporting Information). Having standard 31 in hand, we compared it to the S. lividans TK24(cosRav32) metabolites. Identical HPLC Rt, UV spectrum and MS fingerprints confirmed that the second ravidomycin compound mentioned above is indeed 31. Based on these observations, we also propose the structure of the minor compound to be 32.
The results clearly show that the isolated cluster covers most of the genes required for the biosynthesis of RV. However, the accumulation of a marginal amount of 31 (deacetyl-RME) in instead of 1, and the accumulation of 19–22 as major metabolites is surprising. Gene inactivation, complementation and in vitro assay results have suggested that 19 and 20 are biosynthetic intermediates of the GV and jadomycin pathways, respectively.[34,36,93,94] As in the GV pathway, hydroxylation of 19 followed by lactone formation (19→25) through the activities of RavOI and RavOIV are the suggested RV biosynthetic steps in which 21 and 22 are generated through further shunt oxidation of 23 (Scheme 2). The accumulation of these intermediates indicated that there are some disturbances in the C–C bond cleavage process (particularly 19→25). The lack of the acetyl group at 4-O′-position of the sugar residue and the missing 8-vinyl side chain in 31 suggests that the proposed acetyltransferase RavW and CYP450 RavOIII do not function in the heterologous host for unclear reasons. For example, these enzymes are functionally inactive, and the real functional enzymes are encoded by genes located somewhere outside of the sequenced region. Alternatively, the expression of RavOIII and RavW might be negatively regulated in the heterologous host. To check the first hypothesis, a RavOIII expression construct (pRavOIII) was generated based on plasmid pEM4, in which the inserted ravOIII gene is expressed under the control of the erythromycin resistance gene promoter ermEp*. This construct was used to complement the S. lividans TK24 (cosG9B3-GilOIII− ) mutant that harbors the entire GV cluster except gilOIII, and produces GE (6) along with other biosynthetic shunt products, such as rabelomycin and homorabelomycin. The expression of RavOIII in this GilOIII− mutant resulted in the complete reconstitution of the GV production (Figure S2); this demonstrates that ravOIII encodes a functionally active P450 enzyme, which is responsible for the vinyl side chain formation of RV. Therefore, we still suspect that the expression of RavOIII as well as RavW might have been negatively regulated during heterologous expression of cosRav32, by unknown mechanisms. However, because both P450 enzymes and acetyltransferases are commonly found in many organisms including Streptomycetes, various regulation mechanisms might exist. Further investigation, including the in vitro characterization of these enzymes, might help to resolve these ambiguities, but so far all our attempts to prepare RavW in vitro repeatedly failed due to the insolubility of the enzyme.
Various attempts to heterologously produce CM in S. lividans TK24 through the expression of the cosmid cosChry1-1 were unsuccessful. Unfortunately, the CM producer strain S. albaduncus AD0819 turned out to be recalcitrant to direct manipulations of its CM biosynthetic genes. Therefore, cross-complementations of GV mutants with homologous CM genes were carried out to verify that the aforementioned cluster is responsible for CM biosynthesis. S. lividans TK24 (cosG9B3-GilOII−, cosG9B3-GilOIV− and cosG9B3-GilOIII−) mutants that are blocked in GV biosynthesis at various stages were complemented with overexpression plasmids containing their CM counterparts (pChryOII, pChryOIV and pChryOIII, pEM4-based plasmids containing the chry oxygenase genes under ermEp* promotion; Figure S2). All complementations successfully restored GV production, thus providing indirect evidence for their involvement in CM biosynthesis.
The above-mentioned experiments clearly proved that we cloned the gene clusters of the gilvocarcin-type anticancer antibiotics ravidomycin V and chrysomycin V; this significantly expands the repertoire of genes involved in gilvocarcin class compound biosyntheses that can be used to generate novel analogues through combinatorial biosynthesis.
The assignments of the product function of critical and related oxygenase-encoding genes were not only based on sequence data, but also on heterologous complementation (for example, of ravOIII, ravGT, chryOII, chryOIV, chryOIII). Furthermore, it was shown that P450 enzyme RavOIII, responsible for the generation of the vinyl side chain in RV, can be inhibited by the P450-inhibitor ancymidol, and that the aminosugar transferase RavGT possesses significant flexibility towards its sugar donor substrate. The experiments also yielded new gilvocarcin-type drugs, such as 31 and 32.
Plasmids and bacterial strains used in this study are summarized in Table S3. Routine DNA manipulations such as cloning, restriction analysis, ligation, transformation etc. were carried out following the standard protocols for E. coli and Streptomyces. High fidelity pfu DNA polymerase was used to amplify various DNA fragments throughout the experiments. PCR products were cloned into PCR-Blunt-II-TOPO cloning vectors and sequenced to confirm that no mutation had occurred during PCR amplification. For complementation studies, ravOIII, chryOIV, chryOIII, chryOII were amplified and ligated into the pEM4 vector at the XbaI and EcoRI sites, whereas ravGT was ligated at the NdeI and EcoRI sites of pUWL201PW.
Various strains of E. coli were cultured at 37 °C in Lauria Bertani (LB) broth or on the LB-agar medium supplemented with various antibiotics (kanamycin, 50 μg mL−1; chloramphenicol, 25 μg mL−1; apramycin, 50 μg mL−1 and ampicillin, 50 μg mL−1) whenever necessary. Streptomyces strains were grown on M2-agar or in SG liquid medium for the metabolites analyses as reported previously. MS-agar medium was used for the transfer of the cosmids from E. coli into Streptomyces following the standard protocol. Antibiotics (apramycin, 50 μg mL−1 and thiostrepton 25 μg mL−1) were supplemented in the medium whenever necessary.
Genomic DNA of S. ravidus and S. albaduncus were isolated following the standard protocol. The genomic DNA was partially digested with Sau3AI, dephosphorylated with calf intestinal phosphatase (CIP) and ligated to the pO J446 vector digested with BamHI and HpaI. The ligation sample was transduced into E. coli XL1-Blue MRF using Gigapack III XL packaging extract (Stratagene). More than 2300 colonies were obtained for each library. Two sets of degenerate primers (a standard KS-probe-for and KS-probe-rev, and a newly designed DH-probe-for and DH-probe-rev) were used to amplify the internal nucleotide sequences of ketoacyl synthase (KSα) and NDP-glucose-4,6-dehydratase from the genomic DNA of S. ravidus and S. albaduncus. The amplified KSα fragments were labeled with DIG and used to screen positive colonies. The positive colonies were further screened with the NDP-glucose-4,6-dehydratase probe. Colony hybridization and southern blot analyses revealed four and three different cosmid clones for S. ravidus and S. albaduncus genomic libraries, respectively. A probe suited to detect a type II PKS-associated ketoreductase gene (gilF homologue) was also constructed (CM_cons_F1 and CM_cons_R1) to probe for cosmids that contained chryF. Three positively hybridizing colonies yielded cosmids cosChryF1, cosChryF2, and cosChryF3, cosChryF2 with the most overlap with cosChry1–1 according to the restriction analysis map. Thus, cosmids cosRav32, cosChry1–1, and cosChryF2 were selected for sequencing. All primers used in this study are summarized in Table S4.
Sequencing of the cosmids was carried out involving a standard shotgun approach using the HyperMu™ MuA Transposase kit (Epicentre) following the reported protocol[50–53] used by the sequencing facilities of the Advanced Genetic Technologies Center (AGTC) located in the College of Agricultural Sciences of the University of Kentucky (http://www.uky.edu/Centers/AGTC/). Phred/Phrap/Consed software package (http://www.phrap.org.) was used to process and assemble raw sequence data into larger contigs. The small gaps between contigs were filled by primer walking. Frame plot (http://www0.nih.go.jp/~jun/cgi-bin/frameplot.pl) and ORF finder (http://www.ncbi.nlm.nih.gov/projects/gorf/) were used to assign the open reading frames. Functional assignments of ORFs were performed through database comparison using various BLAST search tools on the server of the National Center for Biotechnology Information, Bethesda, Maryland, USA (http://www.ncbi.nlm.nih.gov).
Streptomyces strains were grown on MS agar for 6 days at 30 °C. An agar chunk full of bacterial spores was inoculated into SG liquid (100 mL) medium in a Erlenmeyer flask (500 mL) and incubated for three days at 30 °C and 200 rpm to prepare seed cultures. For analyses of metabolites, the liquid seed culture (3%) was used to inoculate larger-scale SG liquid medium cultures (up to 4 L) and grown for five days. The cultures were harvested through centrifugation to obtain culture broth and pellet fractions. The broth was twice extracted with equal volumes of ethyl acetate. The organic layer was combined and dried in a rotary evaporator. Such prepared extracts were dissolved in methanol and filtered through syringe filter (0.2 μm) prior to High Performance Liquid Chromatography-Mass Spectrometry (HPLC-MS) analysis. The pellet fraction was re-suspended in acetone and sonicated for 10 min. The acetone extract was collected through centrifugation and dried under reduced pressure. The prepared extract was dissolved in methanol and subjected to HPLC-MS analysis.
The production of chrysomycins, ravidomycins and gilvocarcins by wild/mutant strains were analyzed by the aforementioned protocol. The production scale was increased to 4 L when isolating compounds for structural analysis. The culture was harvested through centrifugation, and the cell pellets were extracted with acetone as described above. The extract was combined with the broth and loaded onto a reverse phase silica (Rp18, 200 g) column (5 × 65 cm) pre-equilibrated with deionized water. The effluent was discarded and the column was washed with acetonitrile (1 L of 10%, in water). Elution was carried out using an increasing gradient of acetonitrile (20%, 40%, 60%, 80% and 100%, 1 L each). The fraction containing target compounds were dried in a lyophilizer. Final purification of the metabolites was carried out in preparative-scale HPLC (columns type and size, see below). The purity and authenticity of the compound was confirmed through 1H and 13C NMR analyses. A linear gradient of acetonitrile and acidified water (solvent A = 0.1% formic acid in H2O; solvent B = acetonitrile; 0–15 min 25% B to 100% B; 16–24 min 100% B; 25–26 min 100% to 25% B; 27–29 min 25% B) was used to separate compounds. Sun-Fire prepC18 column (19×150 mm, 5 μm) and Symmetry C18 (4.6 250 mm, 5 μm) columns were used for preparative and analytical scale separations, respectively. Flow rate was maintained to 10 mL min−1 and 1.0 mL min−1 for preparative and analytical scale separations, respectively. Micromass ZQ 2000 (Waters) equipped with HPLC (Waters alliance 2695 model) and photodiode array detector (Waters 2996) were used to analyze the compounds. Atmospheric pressure chemical ionization (APCI) and Electrospray ionization (ESI) probes were used to detect molecular ions. Electron ionization-High resolution mass (EI-HR-MS) of compounds were acquired using a high resolution electron impact (EI) ionization mass spectrometer. Data were recorded at 25 eV on a JEOL JMS-700T MStation at the University of Kentucky mass spectrometry facility. Samples were introduced through a heatable direct probe inlet. Perfluorokerosene (pfk) was used to produce reference masses.
To isolate deacetylravidomycin E, fully grown S. ravidus seed culture (20 mL) was cultured in SG medium (5 flasks × 100 mL) and of ancymidol solution (100 μL, in DMSO, 2 mm, final concentration) was added into the culture. Only DMSO was used in the control flask. The fermentation was carried out for 6 days and the metabolites were analyzed by HPLC-MS. Isolation and purification of 4′-O-deacetylravidomycin E (31) was carried out according to the aforementioned protocol. The structure of 31 was determined through comparative analysis of its experimental NMR data and reported NMR data of 2 (see the Supporting Information).
This work was supported by the US National Institutes of Health (grant CA 102 102 to J.R.). The University of Kentucky Advanced Genetic Technologies Center (AGTC) and Center for Mass Spectrometry are acknowledged for their services.
Supporting information for this article is available on the WWW under http://dx.doi.org/10.1002/cbic.200900673.