|Home | About | Journals | Submit | Contact Us | Français|
Miniproteins provide a bridge between proteins and small molecules. Here we adapt methods from combinatorial chemistry to optimize CD4M33, a synthetic miniprotein into which we had previously transplanted the HIV-1 gp120 binding surface of the CD4 receptor. Iterative deconvolution of generated libraries produced CD4M47, a derivative of CD4M33 that had been optimized at four positions. Surface-plasmon resonance demonstrated 4-to-6-fold improvement in CD4M47 affinity for gp120 to a level about 3-fold tighter than that of CD4 itself. Assessment of the neutralization properties of CD4M47 against a diverse range of isolates spanning from HIV-1 to SIVcpz showed that CD4M47 retained the extraordinary breadth of the parent CD4M33, but yielded only limited improvements in neutralization potencies. Crystal structures of CD4M47 and a phenylalanine variant ([Phe23]M47) were determined at 2.4 and 2.6 Å resolution, in ternary complexes with HIV-1 gp120 and the 17b antibody. Analysis of these structures revealed a correlation between mimetic affinity for gp120 and overall mimetic-gp120 interactive surface. A correlation was also observed between CD4- and mimetic-induced gp120-structural similarity and CD4- and mimetic-induced gp120 affinity for the CCR5 co-receptor. Despite mimetic substitutions, including a glycine to (D)-proline change, the gp120 conformation induced by CD4M47 was as close or closer to the conformation induced by CD4 as the one induced by the parent CD4M33. Our results demonstrate the ability of combinatorial chemistry to optimize a disulfide-containing miniprotein and of structural biology to decipher the resultant interplay between binding affinity, neutralization breadth, molecular mimicry, and induced affinity for CCR5.
Macromolecular interfaces, such as those between virus and receptor or between transcription factor and promoter, are tantalizing candidates for therapeutic intervention, and the design of small molecule compounds that interfere with these interfaces has been a long sought, though mostly unfulfilled, goal 1-4. We and others have transplanted macromolecular binding surfaces into synthetic miniprotein scaffolds 5-10, where they might be optimized by the tools of modern organic chemistry. These efforts provide a means to address the gap between therapeutically attractive macromolecular targets and current inhibitor-development technology.
With HIV-1, viral entry is initiated by the binding of the gp120 envelope glycoprotein to the CD4 receptor on the host cell 11-13. This binding induces conformational changes in gp120 that lead to the formation of the site for a requisite co-receptor, either CCR5 or CXCR4 14-17. Crystallographic analysis shows that the interface between gp120 and CD4 is a little under 2000 Å2 shared about equally between both binding partners 18-20. The interface itself is non-ideal, with interfacial cavities and captured water molecules. On CD4, it centers about a β-hairpin, which comprises roughly half the CD4 interface with gp120. We used the similarity between this critical β-hairpin and a similarly exposed β-hairpin found in scorpion toxins, such as charybdotoxin and scyllatoxin, to transplant the gp120-binding surface of CD4 into a toxin scaffold 21,22. Iterative optimization of the transplanted β-hairpin resulted in a synthetic mimetic of only 27 residues. This miniprotein mimetic, named CD4M33, displayed nanomolar affinity to gp120 23. The crystal structure of CD4M33 in complex with core gp120 and Fab 17b revealed extensive mimicry, between the binding surfaces on CD4M33 and CD4 for gp120 as well as between the CD4M33- and CD4-induced conformations of gp120 24.
The synthetic nature of the CD4M33 enabled us to adapt chemical methods of combinatorial optimization to enhance its binding affinity to HIV-1 gp120. One of the most elegant and popular combinatorial strategies, the “split and mix” method 25, 26, uses solid phase-peptide synthesis chemistry to obtain a high statistical sampling of all possible combinations of building blocks. Four positions in the β-hairpin of CD4M33 were selected for and submitted to combinatorial optimization. All (L) amino acids were incorporated at two positions, and both (L) and (D) configurations were incorporated at the two others. The “combinatorially optimized” peptide, named CD4M47, was then analyzed biochemically, in terms of its binding parameters to gp120, virologically, in terms of neutralization potency and breadth, crystallographically, in a ternary complex with YU2 core gp120 and the antigen-binding fragment (Fab) of the 17b antibody, and mechanistically, in terms of its ability to unmask cryptic epitopes related to the binding of the CCR5 co-receptor.
The β-hairpin of CD4M33 (residues 18-27) accounts for about 75% of its interface with HIV-1 gp120 (Fig. 1) 24. A “hotspot” of interaction involves the biphenylalanine at residue 23, which inserts into a gp120 pocket at the interface of several gp120 domains. The biphenyl moiety enhances CD4M33 affinity for gp120 about 10-fold over its phenyl derivative ([Phe23]M47) 24. To optimize CD4M33, we examined where variability might be best introduced. We avoided altering a number of residues: the six cysteines corresponding to three disulfides, which are vital to the correct miniprotein folding; and residues in the second β-strand of CD4M33, which hydrogen bond to β15-strand of gp120 in a manner similar to the C″-strand of CD4. Instead, we explored the first β-strand of CD4M33, which engages in peripheral interactions with gp120 (Fig. 1), by introducing L-amino acids into positions Lys18 and Ala20. We also explored the top of the β-hairpin, by introducing L- and D- amino acids into position Gly21 and Ser22 (positions i+1 and i+2 of the turn).
To enhance solubility and synthesis yield, the biphenylalanine residue at position 23 was replaced by a phenylalanine in the template used for combinatorial synthesis. Cysteines (both L-and D-) were omitted as choices for the four optimized residues to reduce potential misfolding during library synthesis. Iterative deconvolution was used to screen the peptide libraries, so that synthesis was required of only four libraries of, respectively, 19, 19, 34 and 34 sublibraries. Together, the generated variation encompassed 417,316 different peptides.
A combinatorial deconvolution strategy was iteratively used to study the four screened positions at residues 18, 20, 21 and 22. The strategy involved a series of four libraries comprising mixtures in which the identity of an amino acid at a given position was known, and combinations of amino acids were incorporated at the other positions. The overall procedure is illustrated in Fig. 2.
The first peptide library involved 19 defined (L)-amino acids at position 18, with 19×34×34 variants at positions 20, 21 and 22. Reverse-phase HPLC showed a time shift in retention of the main synthetic peak suggesting proper oxidization of the disulfides. Analyses by MALDI-TOF confirmed the presence of a mass pattern at the desired size. CD spectra were indicative of globally correct folding similar to the CD spectrum of CD4M33 (supplementary Fig. S1).
Peptide concentrations were normalized by quantitative amino-acid composition, and each sublibrary of the first library was tested at three different concentrations (10-6, 10-7 and 10-8 M) for the inhibition of gp120 binding to CD4 by fluorescence anisotropy competition assay 27 (Fig. 3) and by ELISA (supplementary Fig. S2). In the initial optimization, inhibition at 10-8 M was low for both methods. On the other hand, both methods showed that with the sublibraries for position 18 (B1A to B1Y), Arg had the highest activity at 10-6 and 10-7 M, followed by Ala in fluorescence anisotropy and Lys and His in ELISA (Fig. 3 and supplementary Fig. S2), and position 18 was fixed as an arginine.
The remaining positions were sequentially identified through an iterative procedure of (re)synthesis and screening (Fig. 2). Thus, a second peptide library was generated that comprised 19 sublibraries of 1156 compounds each (B2A to B2Y), with arginine fixed at position 18 and a defined L-amino acid at position 20. After validation of conformity and assessment of gp120 binding by fluorescence anisotropy (Fig. 3) and ELISA (supplementary Fig. S2), position 20 showed a small (10%) preference for Ala (over Arg and Ser), and position 20 was fixed as an alanine.
A third combinatorial library was synthesized and tested (Fig. 3 and supplementary Figs. S1 and S2). This library included 34 sublibraries of 34 different miniproteins each (B3A to B3dY) with a fixed amino acid of either L- or D-configuration at position 21. A pronounced difference was observed between sublibraries containing L-amino acids and those containing D-amino acids. A clear preference was observed for residues of D-configuration and to a lesser extent for glycine (which is achiral). The selection at this position for a whole family of amino acids suggested a structural effect of the mutation, which was confirmed by the small difference in activity between the different sublibraries with a D-amino acid. The type II′ β-turn seemed to be the most favourable turn for the activity of our miniproteins. Based on these considerations, position 21 was fixed as a (D)-proline.
The fourth and last library amounted to a parallel synthesis of 34 different peptides with a defined (L) or (D)-amino acid at position 22 (B4A to B4dY). The preferred amino acid here was not as clear as with previous positions. Several amino acids showed roughly equivalent inhibition at the 10-7 M. At lower concentrations, however, a preference was observed for threonine, which was chosen for position 22. The final selected sequence, shown in Fig. 3, contained Arg, Ala, (D)-Pro and Thr at positions 18, 20, 21 and 22, respectively.
The optimized compound and its homologue with a biphenyl instead of the phenyl moiety at position 23 were individually synthesized and named [Phe23]M47 and CD4M47, respectively. Surface-plasmon resonance was utilized to determine the binding parameters between HIV-1 gp120 and the various CD4-mimetic miniproteins as well as 2-domain CD4 (sCD4) (Table 1).
The mimetics or sCD4 were immobilized onto CM5 sensor chips using standard amine coupling, and the kinetic parameters to both HXBc2 and YU2 gp120 were measured with a Biacore 3000. Parameters for HXBc2 and YU2 were similar, though not exactly identical; changes in rates are discussed in the context of HXBc2, with those for YU2 shown in parenthesis when different. The on-rate of CD4M47 was mostly unchanged from the parent CD4M33, about 6-fold (7-fold) faster than the on-rate of sCD4. Optimization appeared to have its greatest effect on the CD4M47 off-rate, which was reduced almost 4-fold from that of CD4M33, to within 3-fold (2-fold) the rate of sCD4. Taken together, these changes in on- and off-rates resulted in an overall Kd for CD4M47 of 2.7 nM (2.1 nM), about 2-fold (4-fold) tighter than that of sCD4.
Kinetic parameters were also measured for the Phe23 variants of the CD4 mimetics. Both [Phe23]M33 and [Phe23]M47 had on-rates about half as fast as their biphenyl counterparts. Off-rates between phenyl- and biphenyl- containing ligands, meanwhile, were similar to within 1.1-fold (1.7-fold). The results suggest that the biphenyl moiety enhances the speed of complex formation. The biphenyl effect on off-rate, meanwhile, is variable, and dependent on strain-specific features.
To explore how enhancement in affinity translated into virus neutralization, we analyzed the ability of the CD4-mimetic miniproteins to neutralize a diverse panel of envelope clones. We used two assays, a broadly sensitive HIV-entry assay based on JC53BL-13 indicator cells, which is able to give quantified infection within a single cycle of replication 28 and an HIV-1 envelope-mediated fusion assay employing as effectors persistently infected PM1 cells that present the native envelope on their surface 23.
CD4 and miniprotein neutralization potencies were evaluated with a diverse range of viral envelopes, ranging from HIV-1 IIIB (which is extremely similar to the HXBc2 envelope used in the optimization procedure) to SIVcpz (which showed a genetic distance of almost 0.5 changes per residue from HXBc2). The neutralization results, expressed as IC50s, are summarized in Table 2.
CD4M47 was active over the entire range of diverse isolates, though its potency (IC50 of neutralization) varied substantially from 3 to 370 nM. Only minor enhancements in neutralization potency were observed between CD4M33 and CD4M47 (1.3-4.3-fold over the entire range), less than the observed increase in relative affinity.
Overall, results from neutralization potency agreed with the rank order of measured affinities (Kd), with one notable exception, that of sCD4. Thus, the mimetics showed a rank order of gp120 affinity ([Phe23]M33 > CD4M33 > [Phe23]M47 > CD4M47) that matched the rank order of mimetic neutralization potency. sCD4, however, had an intermediate affinity, but the most potent neutralization. Two possibilities might account for this. First, the larger size of sCD4 (7-times the mass of the miniproteins) might boost its neutralization potency, and second, the contribution of on- and off-rate to neutralization potency is unclear, and the low off-rate of sCD4, slower than any of the mimetics, may enhance its neutralization potency relative to its Kd.
Although CD4M47 displayed only minor improvements in neutralization breadth versus CD4M33, this improvement was seen over the entire range of isolates. Thus, despite increased divergence from the CD4 sequence (Gly to (D)-Pro and Ser to Thr), CD4M47 retained the remarkable breadth of the parent CD4M33 (Fig. 4).
Both CD4M47 and [Phe23]M47 were crystallized in ternary complexes with core gp120 from the primary HIV-1 isolate YU2 and the antigen binding fragment (Fab) of the 17b antibody. The structures were isomorphous with the previously determined CD4M33 crystal structure 24 and were refined to nominal resolutions of 2.4 and 2.6 Å for the CD4M47 and the [Phe23]M47-containing complexes, respectively (Table 3). Crystal grew as thin plates, and while lower resolution shells were complete, higher resolution shells were less so. Nonetheless, collected data were sufficient to obtain a detailed atomic-level picture of the mimetic-gp120 interactions. Moreover, the crystal lattice contained two independent ternary complexes in the crystal asymmetric unit, allowing the effects of lattice packing and refinement uncertainty to be gauged.
The structure of the newly determined CD4M47 complex is shown in Fig. 5 (along with the structure of the [Phe23]M47 variant). Superpositions with CD4M33 are shown in Figs. 5D and 5E and with CD4 in Figs 5F and 5G. The mimetic structures, before and after combinatorial optimization, were extremely similar, with root-mean-square (rms) deviations for the entire backbone of 0.27-0.67 Å, and of 0.19-0.26 Å for residues 18-22. Residue 18 was not closely constrained by gp120, with considerable distance between this portion of the mimetic and gp120. The added bulk of Arg 18 in CD4M47 versus the parent CD4M33 Lys allowed for additional contact surface area with gp120, although these contacts were somewhat tenuous. Residues 20 and 21 were more constrained by gp120; optimization added a total of four carbon atoms to these two residues, increasing the fit between mimetic and gp120 (Fig. 5E).
One measure of the degree of fit between two molecules is their surface complementarity 29. The surface complementarity between CD4 and core gp120 ranged from 0.674 for HXBc2 core gp120 and 0.716 for the YU2 core (with the value for an optimal fit being 1.000). With the CD4-mimetic miniproteins, surface complementarities ranged from 0.769 for both lattice molecules of CD4M33 to 0.801 for molecule 1 of [Phe23]M33 (supplementary Table S1). Within this range, differences from lattice variation had a considerable effect, with the two molecules of CD4M47 showing surface complementarities of 0.769 and 0.793. Perhaps because of the highly restricted range, no correlation was observed between surface complementary and overall affinity.
A second way to analyze structural interfaces is to quantify the molecular contact surface. For mimetic residues 18, 20 and 21, in each case, optimization resulted in an increase in the contact surface. For residue 18, the increase was roughly 20Å2, and for residues 21 and 22, the increase was roughly 10Å2 (Table 4). Thus, the total increase in contact surface from the optimized residues was ~40Å2, or about 10% in the total mimetic contact surface.
While these results suggest a correlation between affinity for gp120 and contact surface area between mimetic and gp120, they must be tempered by the caveat that other amino acids that might increase surface area were not observed to enhance affinity in the optimization. Thus, while interactive surface area was observed to correlate with enhanced affinity, other factors such as surface complementarity and the specific chemistry of the interactions need to be taken into account.
Another structural measure of the interaction between two molecules is the similarity in their induced conformational change. In the case of the gp120-CD4 interaction (and also the gp120-CD4-mimetic interaction 24), gp120 undergoes substantial structural rearrangement, with much of the secondary structure of the gp120-inner domain rearranging 30-32.
We used distance-sorted distance matrices 24, 33 to quantify conformational comparisons of gp120 around the interaction hotspot (Fig. 6). In the 10Å shell extending from the Cα of residue 23 on the mimetic, the structures of the gp120 molecules complexed to CD4M47 showed normalized linear difference distances of 0.22-0.23Å, similar to that seen with CD4M33 (0.22-0.24Å). The linear difference distances decreased with [Phe23]M47, with normalized linear difference distances of 0.16-0.20Å, a bit larger than with the [Phe23]M33 analogue (0.13Å). Thus, in terms of induced gp120 structure, [Phe23]M33 remained the closest mimic to CD4, with [Phe23]M47 intermediate to CD4M47 and CD4M33.
Here the two independent copies of the mimetic in the P21 lattice provide a means to verify the relevance of the distance-sorted difference distance metrics. However, the overall linear and square difference distances were small, only a bit larger than coordinate error. Moreover, the structural measurements were actually made in the ternary complex of mimetic, gp120 and 17b, and thus the gp120 conformation was influenced by 17b. We therefore sought to determine whether these small matrix differences were biologically relevant.
Unliganded gp120 binds poorly to the CCR5 coreceptor, with high affinity CCR5-gp120 interaction requiring a CD4-induced conformational change 34. To define the biological effects of the mimetic-induced changes in gp120, we compared the amount of gp120 bound to CCR5+ cell lines in the absence or in the presence of CD4 or of one of the CD4-mimetic miniproteins. The amount of gp120 bound to CCR5+cell lines was evaluated with two CCR5-tropic gp120 (SF162 and YU2, the same as used for co-crystals). Sequential incubations were carried out with the gp120-reactive antibody, D7324, and with a secondary PE-conjugated antibody. As shown in Fig. 7, [Phe23]M33 and [Phe23]M47 were more capable than the biphenylalanine derivatives in inducing a CCR5-binding conformation. Moreover, the induced affinity of gp120 to CCR5 correlated well with induced structural mimicry, as quantified by distance-sorted distance matrices.
Previous applications of combinatorial optimization have generally been with small molecules, with attempts of up to 2000 Da limited to linear peptides, for example, with antigens and peptide ligands of MHC class molecules 35, 36. As none of these prior efforts required disulfide formation for folding, methodological optimizations were required.
The first synthesized sublibrary did not pass quality controls of integrity: HPLC profiles showed large quantities of truncated peptide, a result confirmed by CD spectroscopy. Synthesis with monitored deprotection, moreover, showed that double coupling was required starting from residue 10. The presence of the sequence –GLLG- may have also complicated miniprotein synthesis, as glycine-containing sequences have been reported to favour aggregation 37, 38, thereby hindering the N-terminal accessibility required for chain elongation. To resolve these difficulties, the homogeneity of each sublibrary was monitored by HPLC spectra before and after disulfide bond formation. These spectra served to validate both miniprotein folding and synthesis completion. Results were confirmed by CD- and mass-spectroscopic analysis of each sublibrary.
Incomplete peptides were assumed to have neither inhibitory nor interference properties in terms of the activity of the complete peptide. As the last coupled residue of the mimetic was a thiopropionyl, which was required to form the third disulfide bond, we felt presence of this residue would likely be required for proper folding. However, the first pool of miniproteins comprised more than 20,000 different molecules – and potentially many more incomplete ones – and we could not be certain that all of the incomplete ones were inactive. To compensate for this uncertainty, we used two different biochemical tests to evaluate the biological activity of each sublibrary. The tests were based on different physical properties: one relying on molecular adsorption at a liquid-solid interface (ELISA); and the other on light depolarization and solution equilibrium (fluorescence anisotropy).
For the first peptide library, an arginine was selected at position 18, whereas the parent CD4M33 residue was a lysine. The increased bulk of the arginine appeared to increase contact surface with gp120. For the second peptide library, the residue at position 20 retained its original amino acid, an alanine. However, sublibrary B2A was only slightly more active than B2R. This position at the beginning of the β-hairpin plays a critical structural role: its main-chain atoms hydrogen bond to those of residue 23 to stabilize the β-turn. Potential interactions between residue 20 and gp120 appeared to be counterbalanced by steric clashes, which destabilize the turn, thereby limiting clear selection of new amino acids at this position. On the other hand, the third library presented a striking difference between amino acids of configuration (D) and (L). The pronounced selection of (D)-amino acids may relate to a strong preference in the mimetic for a type II′ β-turns, combined with a strong propensity with these turns for a (D)-amino acids at the i+1 position of the turn 39, 40. Sublibraries of the fourth peptide library were similarly divided in two groups, though in this case, the biologically active amino acids were of (L)-configuration. The negative selection of (D)-residues at the fourth position might correspond to the destabilization of type II′ β-hairpins by consecutive amino acids of the same configuration 41. Moreover, steric hindrance strongly selects for small amino acids, particularly polar ones such as Thr, Ser and Asn, which are capable of hydrogen bonding to the backbone carbonyl oxygen of gp120 Met426.
The combinatorial optimizations carried out here are similar in some respects to phage display methods of combinatorially enhancing protein affinity 42-44. One advantage of combinatorial chemistry is that non-natural amino acids can be incorporated, for example, the (D)-amino acids used to augment the variation in optimization of positions 21 and 22.
Although the non-natural biphenyl moiety at position 23 has a clear advantage in affinity relative to the natural Phe, because of solubility considerations, we optimized [Phe23]M33, not its biphenyl variant. Direct optimization of CD4M33 might have lead to another set of amino acids with increased compatibility for the biphenyl moiety. Nonetheless the current work allows [Phe23]M47 to be used as the basis for further optimizations, especially off the zeta position of the phenyl ring, where it opens into a large interfacial binding pocket that extends into the hydrophobic interior of gp120 19, 20. Extensions into this conserved pocket show potential for increasing mimetic affinity, as has been demonstrated for thio-modifications of a Phe43Cys variant of CD4 45 and more recently with this present scaffold 46. Multivalent gp120 inhibitors, containing several copies of the CD4-mimetic miniprotein, may also exhibit increased neutralization potency 47, 48. Finally, we demonstrate the ability of our mimetics to induce epitopes, in this case, the binding surface for the co-receptor CCR5. This induction raises the possibility that CD4-mimetics of the type described here may synergize in a therapeutic setting with the high titer of antibodies reactive with these epitopes, which are present in most HIV-1 infected individuals 49.
Fmoc-Bip was from Advanced Chemtech (Louisville, KT, USA). Alexa-488 succinimidyl ester was from Molecular Probes (Leiden, The Netherlands). X4-tropic HXB2 and R5-tropic YU2 recombinant gp120s were obtained from Dr F. Veas (Retroviral and Molecular Immunology Laboratory, Montpellier, France). Recombinant gp120 from the HIV-1 strains R5-tropic SF162 was provided by Dr Indresh Srivastava (Chiron Vaccine Co., Emeryville, USA). 4-domain soluble CD4 was obtained from Progenics (Progenics, Tarrytown, USA) and 2-domain soluble CD4 from the NIH AIDS Research and Reference Reagent Program. CHO-CCR5+ cells were obtained from Dr Marc Parmentier (University of Brussels, Belgium). All other reagents and solvents used were from Sigma-Aldrich and Fluka (St Quentin Fallavier, France) or SDS (Solvants Documentation Syntheses, Peypin, France).
All combinatorial libraries and individual peptides were synthesized on an Advanced Chemtech 357 Multisynthesizer by solid phase method using fluorenylmethyloxycarbonyl(Fmoc)-protected amino acids and 2-(1-H-Benzotriazol-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate (HBTU) coupling reagent 50. Each sublibrary was synthesized using 33 μmol Fmoc-PAL-PEG-PS-resin (PerSeptive Biosystems, Warrington, GB). The 10 first residues were introduced by 10 molar equivalent Fmoc-protected amino acids, using 9.1 equivalent HBTU and 22 equivalent N,N-Diisopropylethylamine (DIEA), during 30 minutes. The end of the synthesis was performed using double coupling steps with half the previous quantities. The last residue was incorporated as a 3,3′-dithiodipropionic acid moiety. To introduce variability at the desired positions, the resin was collected before coupling, mixed in N-methylpyrrolidone/Dichloromethane (50/50) under nitrogen flow, and divided equally into n wells (where n is the number of different amino acids we wanted to couple). This process, called “split and mix” method, was repeated for each variable position so that, if we consider a peptide with a variability of n, n′ and n″ amino acids at three different position, we end up with n wells containing n′×n″ peptides each. The resin was finally treated with tributylphosphine for 1.5 h, washed and the sublibraries cleaved by treatment with reagent K′ (81.5% trifluoroacetic acid, 5% water, 5% phenol, 5% thioanisole, 2.5% ethanedithiol, 1% triisopropylsilane) for 2.5 h at room temperature. After one hour precipitation in diethyl oxide, the peptides were washed and lyophilized. Disulfide bridges were formed with the sublibraries dissolved at 0.4 mg/mL in 100 mM TrisHCl buffer (pH 7.8) in the presence of 0.5 mM/5 mM oxidized/reduced glutathione. The sublibraries were purified by reverse-phase HPLC on a C18 Discovery column (10 × 250 mm) (Supelco, Bellefonte, USA) and their composition was verified by quantitative amino acid analysis and MALDI-TOF mass spectrometry. Finally, the peptide libraries were analyzed by circular dichroism (CD) on a Jobin Yvon CD6 dichrograph.
Optimization of the combinatorial protocols and synthesis of individual peptides were performed on an ABI-433 automated peptide synthesizer (Applied Biosystem, Applera Corporation). The fluorescent peptides were labelled by the specific incorporation of Alexa-488 or fluorescein at Lys11, Nε-protected during synthesis with 1-(4,4′ dimethyl-2,6-dioxocyclohex-1-ylidene)-3-methylbutyl (ivDde) 51.
Amino acid compositions were made under standard conditions: samples were vacuum-dried, sealed in glass tubes and hydrolyzed under vapor phase of 6N HCl with a crystal of phenol through 17h at 110°C. Amino acids were derivatized with ninhydrin and quantified on an aminoTac JLC-500/V amino acids analyzer (JEOL, Japan).
Competition binding assays in ELISA were performed in 96-wells plates (Maxisorb, Nunc). Briefly, 50 ng per well of antibody D7324 (Aalto Bio Reagents, Dublin) were coated overnight at 4 °C. Wells were then saturated with PBS 3% BSA buffer, washed 3 times, and 5 ng per well of gp120HXBc2 were added, followed by addition of 250 pg of sCD4 (Progenics, Tarrytown, NY) and different concentrations of soluble competitors. After one night at 4 °C, we successively added anti-CD4 mAb L120 (Centralised Facility for AIDS Reagents, NIBSC, UK), goat anti-mouse peroxidase-conjugated antibody (Jackson ImmunoResearch, West Grove, PA), and the 3,3′,5,5′-tetramethylbenzidine substrate (Sigma). After acidification, optical density was measured at 450 nm and expressed as the mean of duplicate 52.
Binding assays by fluorescence anisotropy were made in a LJL Analyst (LJL Biosystems, Sunnyvale, CA) microplate reader using 384-wells plates. All peptides were diluted in 10 mM sodium phosphate buffer, pH 7.0, with 135 mM NaCl and 0.05% Tween 20.
As previously described 27, competitive binding assays for screening were performed in a final volume of 21 μL by using 1 nM CD4M33-Fl, 12.5 nM gp120HxB2 and different concentrations (10-6, 10-7 and 10-8 M) of each peptide sublibrary. Fluorescence anisotropy was determined after 40 min equilibration time at room temperature, using 485 nm excitation and 530 nm emission filters and an additional 505 nm dichroic filter. All experiments were performed in duplicate.
Adherent CHO-K1 cells expressing CCR5 53, 54 were used to analyze envelope conformational change by FACS. Briefly, 0.06 μg of gp120 (from either strain SF162 or strain YU2) were incubated overnight at 4°C in the absence or the presence of 500 nM CD4 or 500 nM CD4-mimetic miniprotein, and then added to 2 × 105 cells in HAM F12 supplemented with 10% SVF, 1% penicillin/streptomycin. After 1 h incubation at room temperature, cells were washed 3 times with phosphate buffer saline (PBS) containing 5% BSA. Bound gp120 was stained with antibody D7324, washed, and then labeled with goat anti-sheep secondary antibody conjugated to phycoerythrin (PE) (R&D Systems). After a final wash, cells were analyzed for envelope binding by FACS.
Crystals of CD4M47 (or [Phe23]M47) were grown as a ternary complex between YU2 core gp120 and the antigen-binding fragment (Fab) of the 17b antibody. Protein production, crystal growth, and data collection were carried out using procedures described previously for CD4M33 24. Briefly, YU2 core gp120 were produced in Drosophila S2 cells and deglycosylated with Endoglycosidases H and D. The deglycosylated protein was combined with CD4M47 (or [Phe23]M47) and purified by Superdex S200 gel filtration chromatography. The complex peak was combined with Fab 17b, concentrated, and again subjected to Superdex S200 chromatography to obtain the purified ternary complex.
As previously described for CD4M33 24, crystals were grown from mixtures of high molecular weight PEG and isopropanol, cross-linked by vapour diffusion with glutaraldehyde 55, briefly placed in a cryo-solution of 20% ethylene glycol, 10% 2R-3R butanediol, mounted in a cryo-loop, and flashcooled in a nitrogen cryostat set to 100° K). X-ray diffraction data were collected at 1.0 Å wavelength, beamline ID-22, Advanced Photon Source, and processed with HKL2000 56 (Table 1). Refinement was carried out with CNS 57, using O 58 for interactive model building. The spacegroup, P21, and lattice were isomorphous with the previously described CD4M33 ternary complex crystals (pdb accession code 1YYL) 24. With the CD4M33 ternary complex as a starting model, after rigid-body refinement, the CD4M47 and [Phe23]M47 structures were built in accordance with mimetic sequences and inspection of difference electron-density maps. Refinement used torsion-angle simulated annealing with slow cooling, iterative manual fitting, automated water placement, and positional and individual isotropic B-value refinement. Noncrystallographic symmetry restraints were used in the initial stages of refinement and later removed. An Rfree test set consisting of 10% of the data was used as a monitor throughout the refinement. Crystallographic statistics are listed in Table 1.
Surface complementarities were calculated with SC (CCP4) 59, distance-sorted distance matrices with DDMP 24, 33, and intermolecular contact surfaces with MS 60. Structure figures were made with PyMol 61.
Kinetic parameters of sCD4 and CD4-mimetics were determined with a Biacore 3000 surface-plasmon resonance spectrometer (GE Healthcare). CD4M33, [Phe23]M33, CD4M47, [Phe23]M47, and sCD4 were immobilized onto CM5 sensor chips using standard amine coupling to surface densities of 485, 239, 238, 286, and 141 response units (RU), respectively. Full length HXBc2 and YU2 gp120 were injected sequentially over the coupled sensor chips with increasing concentrations of 2.7 nM, 5.4 nM, 10.8 nM, 21.7 nM, 43.5 nM, and 87 nM at a flow rate of 30 μl/min for 300 s at 25 °C. The dissociation rates of the complexes were monitored for 600 s at a flow rate of 30 μl/min. The buffer used for preparation of the gp120 samples and buffer blanks contained 10 mM HEPES, pH7.4, 150 mM NaCl, 3 mM EDTA, 0.005% surfactant P-20, and 0.1% carboxymethyl dextran. The same buffer was used as running buffer during the binding studies. A 1:1 Langmuir binding model with drifting baseline (BiaEvaluation 4.1) was used to fit data globally and to extract kinetic parameters of interaction with gp120.
We dedicate this work to the memory of Dr Claudio Vita and Dr André Ménez. We thank Lawrence Shapiro for comments on the manuscript, Francesca Sironi for assistance with neutralization assays, Indresh Srivastava (Chiron Co., Emeryville, CA) for gp120, and Jonathan Stuckey for assistance with figures. Support was provided by the European Microbicides Project (EMPRO), the Intramural AIDS Targeted Antiviral Program of the NIH, and a grant from the Bill and Melinda Gates Foundation Grand Challenges in Global Health Initiative. We acknowledge Becton Dickinson and the Centre for AIDS Reagents supported by EU programme EVA/MRC (contract QLKZ-CT-1999-00609) and the UK Medical Research Council. Use of SER-CAT at the Advanced Photon Source was supported by the U.S. Department of Energy, Basic Energy Sciences, Office of Science, under contract W-31-109-Eng-38.
Accession Numbers: Coordinates and structure factors have been deposited with the Protein Data Bank under accession numbers 2I5Y (CD4M47 complex) and 2I60 ([Phe23]M47 complex).
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.