At the first step, we studied binding of all six synthesized compounds to a fragment of L3MBTL1 consisting of three MBT domains (residues 200-522) by means of ITC. The results (,
Supplementary Figures) suggest that the probes, with exception of compounds
1 (Nme0) and
4 (Nme3), demonstrate a dose-dependent interaction with affinities in the order pyrrolidine > Nme2 ≈ N(me)et ≈ Nme1 >> Nme3 ≈ Nme0. The relative order and absolute binding free energies of the studied compounds are consistent with the trend observed for histone peptides, which supports their use as biophysical probes to study MBT-Kme recognition. In order to make sure that our probes do actually bind to the second MBT domain of L3MBTL1 we have also measured binding of compounds
2,
3,
5 and
6 to the D355A L3MBTL1 mutant. This mutation has previously been demonstrated to efficiently switch off any binding of mono- or di-methylated histones
6 because of the absence of the main binding anchor Asp355. Our ITC experiments with the mutant did not show any measurable binding (
Supplementary ITC figures). This strongly supports the hypothesis that the studied compounds bind to the lysine binding pocket of the second MBT domain.
Our experimental affinity data confirm the intriguing structure-activity relationships (SAR), where the same chemical modification, i.e. adding or removal of a single methyl group, in a similar context, may result either in no change in affinity or in its full loss. More specifically, it is unclear how adding a single methyl group to Nme0 or removing it from Nme3 results in a huge gain in affinity to a “reader” protein, but does not affect the affinity when adding a methyl to Nme1. Consequently, we performed a series of FEP and MD simulations (see Methods and
Supplementary Scheme 1a) in order to provide an atomic-scale structural rationale for these “atypical” SAR. The computed free energies obtained demonstrate a strong correlation with ITC results, which justifies the use of FEP for the energetic analysis of Kme recognition.
The same computational protocol was also used to determine relative weight of polar and non-polar contributions to the binding affinity and to ascertain “preferred” interaction modes for each compound. To this end, we made use of virtual probes that represent neutral isomorphs of compounds
1-6 obtained by replacing their amino nitrogens with carbon atoms. As previously demonstrated
13-15, the binding free energy difference between an ionizable compound and its respective non-polar isomorph (

) can be exclusively attributed to polar interactions, i.e. hydrogen bonding, cation-π and long range ionic interactions (see Methods and
Supplementary Scheme. 1b). Furthermore, the nonpolar contribution to the difference in affinities of two compounds can be expressed as the difference in affinities of the two respective nonpolar isomorphs. Here, we performed six additional FEP calculations to compute relative binding affinities of neutral isomorphs
1′-
6′ to L3MBTL1 (see ) and used these affinities in the further energetic and structural analyses.
We first compared data obtained for compounds
2 (Nme1) and
3 (Nme2). Their respective affinities to L3MBTL1 were similar in both ITC and FEP experiments. However, binding free energies of their respective neutral isomorphs
2′ and
3′ are significantly different (,
Supplementary Scheme 2). The neutral isomorph
2′ binds tighter than
3′ by 1.10 kcal/mol due to its more efficient non-polar interactions. Additionally, we made use of conventional MD simulations to provide a structural interpretation for the MBT-Nme recognition. Ten thousand structural snapshots from 20 ns simulations were clustered and analyzed in order to determine the most representative bound ligand conformations. As shown in , the bound state of compound
2 (Nme1) may be represented by 5 conformations, varying significantly in positions of both the amino nitrogen and the methyl group. However, despite its high mobility within the binding pocket, the only methyl group of compound
2 manages to keep optimal distances of
ca. 3.5 Å to each of three aromatic side chains, Phe379, Trp382 and Tyr386 (see Methods and ). In contrast, the bound compound
3 (Nme2) is represented by three quite distinct orientations of its methyl groups (). The distance distribution in shows that only one methyl group can keep an optimal distance to the aromatic side chains, while the other can significantly diverge beyond the range of optimal vdW interactions. One of representative conformers of compound
3 is identical to the X-ray conformation of Kme2 with L3MBTL1
11 and two others differ only in methyl group orientations while keeping the amino nitrogen in a position of optimal electrostatic interaction with Asp355.
Therefore, both structurally and energetically, the identical binding affinities of 2 (Nme1) and 3 (Nme2) are achieved through quite different recognition mechanisms (as depicted in ). Most remarkably, both methylation forms are moderate-affinity MBT binders. For instance, compound 2 (Nme1) benefits from more efficient vdW interactions and a higher bound-state mobility, leading to a favorable entropic contribution. However, it is penalized by suboptimal electrostatic interactions because its positive charge is delocalized between two hydrogens of which only one can efficiently interact with the carboxyl group of Asp355. Alternatively, the compound 3 (Nme2) is penalized by less favorable non-polar interactions, but benefits from a much stronger electrostatic contribution due to its highly localized positive charge. Consequently, it appears that the lysine pocket of L3MBTL1 that equally non-optimally binds both cognate ligands (i.e. Kme1 and Kme2) has emerged as an evolutionary “compromise” between two possible pocket designs, each of which would tightly bind either Kme1 or Kme2. Energetically, this “compromise” pocket equally penalizes both methylation states, although through different interaction forces.
The proposed “compromise” pocket hypothesis is also compatible with the extremely weak affinity of compounds
1 (Nme0) and
4 (Nme3). Indeed, compounds
1 (Nme0) lacks all favorable vdW interactions, from which its closest analog
2 (Nme1) benefits. This results in a non-polar free energy loss of 2.90 kcal/mol. Furthermore, compound
1 (Nme0) is also penalized by a weaker electrostatic contribution because its positive charge is delocalized between three hydrogen atoms, resulting in a loss of an additional 1.83 kcal/mol. As to the low affinity of compound
4 (Nme3), it has been previously hypothesized that steric repulsion is the major force preventing MBT domains from binding the trimethylated lysine
4. Our decomposition analysis suggests that while the non-polar term (0.73 kcal/mol) penalizes compound
4, the polar term (1.38 kcal/mol) is even more important for the decline of its affinity (relative to
3). The above is in agreement with the observation that a negatively charged residue is essential for a functional MBT domain, but not mandatory for a Kme3 binding domain
4. For example, the Kme3 binding pocket of BPTF, a PHD finger-containing protein, does not contain any acidic residues; interestingly, a Y17E mutation in the binding pocket led to BPTF preferentially recognizing Kme2 over Kme3
6.
Finally, in line with the finding that the native methylation state binding is imperfect by design, one might also infer that a tighter MBT binder is possible. To test this hypothesis we have synthesized and studied two alkylated lysine analogs
5 (N(me)et) and
6 (pyrrolidine), whose ability to bind MBT domains was established in our previous studies
16,17. ITC data show that compound
6 is the most potent of all alkylation states we have examined and binds tighter than
3 (Nme2) by 0.66 kcal/mol, while the potency of compound
5 (N(me)et) is comparable to that of
3 (Nme2). Our free energy decomposition analysis suggests that the calculated gain in affinity of
6 (pyrrolidine) over
3 (Nme2) (1.51 kcal/mol), is due to a huge electrostatic contribution of 2.75 kcal/mol. Compound
6 is however penalized for its two extra carbons by a non-polar penalty of 1.23 kcal/mol compared to
3. On the atomic level, the significant electrostatics-related advantage of
6 is hard to explain considering a relatively modest change in partial charge as compared to
3 (Nme2). To further examine this interaction, we analyzed the conventional MD trajectory to provide a structural rationale for the apparent electrostatics reward. The analysis demonstrates that the pyrrolidine ring, slightly withdrawn from the pocket, has a very low mobility in its bound state and is constrained to optimally interact with Asp355 (). Moreover, the pyrrolidine ring is parallel to the aromatic side chain of Trp382 and forms an aromatic-aliphatic ring stacking motif, commonly seen in the Cambridge Structure Database
18 and in some proteins featuring an aromatic binding cage
19.
To conclude, in this study we proposed the L3MBTL1 lysine binding pocket has evolved as a “compromise” that features equally non-optimal fits to both mono-and demethylation states. This finding may help in understanding the evolutionary development of the methylation machinery and also provide guidance to the development of more efficient exogenous modulators of the histone code.