|Home | About | Journals | Submit | Contact Us | Français|
The ribosome represents a major target for antibacterial drugs. Being a complex molecular machine, it offers many potential sites for functional interference. The high-resolution structures of ribosome in complex with various antibiotics provide a unique data set for understanding the universal features of drug-binding pockets on the ribosome. In this work, we have analyzed the structural and evolutionary properties of 65 antibiotic binding sites (ABSs) in the ribosome. We compared these sites to similar-size computed pockets extracted from the small and large ribosomal subunits. Based on this analysis, we defined properties of the known drug-binding sites, which constitute the signature of a ‘druggable’ site. The most noticeable properties of the ABSs are prevalence of non-paired bases, a strong bias in favor of unusual syn conformation of the RNA bases and an unusual sugar pucker. We propose that despite the different geometric and chemical properties of diverse antibiotics, their binding sites tend to have common attributes that possibly reflect the potency of the pocket for binding small molecules. Finally, we utilized the ensemble of properties to derive a druggability index, which can be used in conjunction with site functionality information to identify new drug-binding sites on the ribosome.
The ribosome is responsible for protein synthesis in all living cells. The bacterial 70S ribosome is formed by the association of the large (50S) subunit and the small (30S) subunit. The small subunit interprets the genetic information encoded in the messenger RNA (mRNA), and the large subunit catalyzes polymerization of amino acids into a polypeptide. The ribosome is an RNA-based machine; the RNA accounts for about two-thirds of the molecular weight of each subunit. The 23S rRNA, ~2900 nt long, and the 5S rRNA, comprised of ~120 nt, associate with some 30 proteins to form the large ribosomal subunit. The small subunit contains an ~1500-nt-long 16S rRNA plus 20 proteins (1). The high-resolution crystallographic and cryo-EM structures of the ribosome and its subunits support the results of many years of genetic and biochemical studies confirming that the main function of the ribosome is promoted by rRNA (2–4).
The ribosome has been an important target for antibiotics, both natural and synthetic (5). Structural and genetic studies show that most of the drugs inhibiting ribosome functions interact directly with rRNA. Despite the ribosome’s enormous size and the diversity of natural and synthetic inhibitors of translation, the majority of clinically effective antibiotics bind to only a few functional sites in the ribosome. For example, most antibiotics that act on the large ribosomal subunit, including diverse antibiotic classes such as phenicols, lincosamides, streptogramins, macrolides, oxazolidinones and others, bind at or near the peptidyl transferase center (PTC), which is located in a cavity on the interface side of the large subunit. The binding of these compounds interferes with the catalysis of peptide bond formation and/or the progression of the nascent peptide toward the exit tunnel. Similarly, several families of aminoglycoside drugs, including important and diverse antibiotics, such as neamine, ribostamycin, neomycin B, gentamycin, tobramycin, paromomycin, etc., interact with the decoding center in the bacterial 16S rRNA. Notably, some of the antibiotics (such as edeine or pactamycin), which broadly affect protein synthesis in all kingdoms of life, are not clinically relevant due to their limited selectivity (6). In addition, it is important to mention that some antibiotics, e.g. tetracyclines, were demonstrated to bind to both a primary (clinically relevant) site as well as to secondary sites which may not be clinically relevant (7).
High evolutionary conservation of many rRNA regions on the ribosome suggests their functional importance (8). Key ribosomal activities have been assigned to several major functional centers. However, the functionality of many other sites in the ribosome remains an enigma. Although it is clear that complex processes, such as translocation, signal transduction, interaction with multiple cellular ligands and many other ribosome-associated activities that are central to protein synthesis, must involve specific sites on the ribosome. In theory, some of these cryptic functional sites could be good candidates for antibiotic action. Indeed, in recent years, the binding of antibiotics to previously unrecognized sites on the ribosome has been reported (9–14). Furthermore, genetic and biochemical screening methods have been employed to reveal ribosomal sites whose structure is essential for protein synthesis (15–17). It remains unknown, however, which of these sites could be targeted by antibiotics.
A number of high-resolution structures of ribosome–antibiotic complexes have been obtained recently using X-ray crystallography. These structures provide fertile ground for attempts to identify common features of antibiotic binding sites (ABSs) and to deduce what distinguishes a ‘true’ ABS from other ‘pockets’ in the ribosome. Moreover, the observed variability in antibiotic binding justifies the search for new sites and improved compounds. Pursuing the general goal of identifying new antibiotic sites in the ribosome, here, we comprehensively analyzed 65 experimentally confirmed ABSs extracted from high-resolution structures of ribosome–drug complexes and identified common characteristics of the drug-binding pockets.
Structures of Thermus thermophilus and Escheruchia coli 30S, as well as Haloarcula marismortui and Deinococcus radiodurans 50S ribosomal subunits in complex with various ligands (antibiotics) were obtained from the Protein Data Bank (PDB). The complete list of 65 crystal structures of ribosome–antibiotics complexes is presented in Table 1. A binding site was defined to include all atoms within a radius of 6 Å from any atom of the ligand. PDB ID 2HHH with identifier KSG 1524 was excluded from the table due to the low number of nucleotides at its binding site. Furthermore, the atoms of the binding sites were defined in the apo structures of the small ribosomal subunit of T. thermophilus (PDB 1J5E) (18) and E. coli (PDB 2AVY) (19), and the large ribosomal subunit of H. marismortui (PDB 1JJ2) (20) and D. radiodurans (PDB 2ZJR) (21). These structures were then used to calculate the structural properties for centrality measures calculations and for extracting putative pockets in the ribosome, as described below.
The program ‘solvent’ from the 3V package (http://geometry.molmovdb.org/3v/) was applied to identify putative pockets in the ribosome (22). Briefly, the program gets as an input a PDB file and the radius of a small and a large probe sphere, and outputs the void in the ribosome structures that can accommodate a small probe size but not a large probe size. The void itself, which corresponds to a possible binding pocket, is represented by the oxygen atoms of water molecules that define the borders of the void (i.e. the surface of the pocket). The program was applied to the PDB IDs 1J5E, 2AVY, 1JJ2 and 2ZJR files, excluding ions and including all ATOM coordinates (rRNA and ribosomal proteins). Grid spacing was set to 1 Å.
The calibration procedure was applied to define the parameters generating computed pockets that best fit the average dimension of the known binding pockets (Table 1). The calibration procedure was performed individually for each of the 11 selected known binding pockets that have ~30% atoms in common (Table 1, indices; 2, 3, 4, 11, 12, 13, 23, 25, 31, 34 and 35).
Previous studies have shown that the large ribosomal subunit (PDB 1JJ2) contains no cavities big enough to enclose a sphere having a radius greater than 9.5 Å. Thus, in the first round of calibration, the large probe radius was fixed to 9.5 Å, while the small probe radius was gradually reduced (1 Å decline) in the 1.5–8.5 Å range. After each individual run of the ‘solvent’ algorithm with a defined set of parameters, the surface of the pockets was delineated by the coordinates of computed oxygen atoms of water molecules. The relativity between a computed pocket and a known binding site was described by the minimal distance between the computed oxygen atoms and the center of mass of all atoms included in the known binding sites. This procedure was repeated for each known ABS, testing the full range of small probe radii in order to define the optimal parameters for which the shortest distance between known and computed pockets was reached. Finally, a small probe radius of 1.5 Å was chosen as the optimal radius that gave the best results for all known binding pockets tested (Supplementary Table S1). The size of the small probe indicates accessibility of the computed pockets to solvent water molecules.
To decrease the number of computed binding pockets and eliminate pockets that do not match the dimension of known binding sites, a second round of calibration was performed. In the second round, only the size of the large probe radius was decreased, while the small probe radius was fixed to 1.5 Å. Finally, after gradually decreasing the large probe, a radius of 5.5 Å was chosen, keeping the distance from the computed pocket to the known binding sites at a minimum (Supplementary Table S2).
To define the computed pocket boundaries, we examined various radii cutoffs around the computed oxygen atoms (Supplementary Table S3). For each radius, we defined the pocket size as the number of nts it comprises, i.e. a nt was considered in the computed pocket if it had at least one atom within a defined radius. A radius of 9 Å around the computed oxygen atoms was chosen, as it resulted in approximately the same average number of nts as in the known binding sites (Supplementary Table S3). Finally, we retained only pockets that comprise 12 nt and differ in at least 1 nt, resulting in 17 610 (calculated for PDB 1J5E and 1JJ2) pockets that we refer to as the background. The number of pockets calculated for all structures used in this study is listed in Supplementary Table S4.
Specificity and sensitivity were calculated based on the degree of overlap between the computed pockets and the known binding sites (Supplementary Table S5). ‘Overlapping specificity’ equals the fraction of common nts in the putative and the known pockets divided by the number of nts in the known binding site. ‘Overlapping sensitivity’ equals the fraction of common nts divided by the number of nts included in the putative (computed) pocket.
The nitrogen and oxygen atoms in the base were classified as donors or acceptors. Backbone oxygen atoms were labeled as sugar or phosphate.
For the RNA structural properties classification, including base-pairing (23,24) and base-base stacking interactions (25), the MC-Annotate (version 1.6.2) program (26) was used. Base-pairing was defined according to three edges of the corresponding RNA bases available for H-bonding interactions, labeled the Watson–Crick (W) edge, the Hoogsteen edge (H) and the Sugar edge (S). For more details see refs. 25 and 28. Base–base stacking interactions were categorized as stacking interactions between adjacent and non-adjacent nts. For more details see ref. 27.
The features were extracted as described in ref. 28 using an in-house Perl script converting the MC-Annotate output files into binary format, i.e. each nt was given a score of ‘1’ when a specific property was present and a score of ‘0’ when it was absent. To calculate the relative abundance of a specific property, the fraction of nts in the pocket having this property was calculated. In order to generate the signature of the binding sites (Table 1), only properties covering at least 1% of all rRNA nts were included.
Interface residues of the 16S rRNA and 23S rRNA with ribosomal proteins mRNA and tRNA were calculated using the Intervor web server (http://cgal.inria.fr/abs/Intervor) (29) with default parameters. The interface with ribosomal proteins was calculated on the PDB IDs 1JJ2 and 1J5E. The interface with the tRNA and mRNA was calculated on the PDB IDs 2J00 and 2J01 (30). The A-minor motif annotation (PDB ID’s 1VQO, 2J00) was downloaded from http://www.biomath.nyu.edu/motifs/rrna.html.
We modeled the 16S rRNA (PDB IDs 1J5E, 2AVY) and the 23S rRNA (PDB IDs 1JJ2, 2ZJR) as a structure-based network of interacting nts (31). Briefly, each nt was considered as a node in the network, and chemical interactions between nts defined the edges. The degree of a node i is defined as the number of edges connected to i. The betweenness of a node i is defined by the number of shortest paths that cross node i. The closeness of a node i is defined as the inverse average length of the shortest paths to all other nodes in the graph (32). The closeness of node i is given in Equation (1)
where N is the total number of nodes in the network, and dij is the shortest path length to node i. For each known binding site and computed pocket, we calculated the average closeness, betweenness and degree.
Aligned sequences and a guide tree for both the 16S/18S and the 23S/28S were downloaded from the ARB-SILVA database (release 96) (http://www.arb-silva.de). The 16S/18S alignment comprises high-quality sequences, with a minimum length of 1200 bases for bacteria and eukarya and 900 bases for archaea. The 23S/28S alignment comprises high-quality rRNA sequences with a minimum length of 1900 bases. Positional variability in bacteria was calculated with the ARB package using parsimony function (37). The score of the evolutionary rate was multiplied by −1 to create values of evolutionary conservation.
The relative abundance of each property in a known site was evaluated relative to a background of putative pockets. Since we could not assume normality, an individual score (ui) of each property was standardized as given in Equation (2):
MAD denotes the median absolute deviation, given by Equation (3):
The statistical significance of the structural properties and 16 types of base pairing presentation in ABSs was evaluated based on the hyper geometric distribution using the Fisher’s exact test.
To study the distinguishing characteristics of ABSs in the ribosome, we sought to compare the properties of known ABSs (Table 1) to a collection of computed structural pockets having similar dimensions. We defined a ‘known ABS’ as all rRNA atoms within 6 Å from any atom of the ligand (antibiotic) observed in the crystal structure of an antibiotic-ribosome complex. We then selected the coordinates of the corresponding rRNA atoms in the apo-structure, which represented the unbound drug-binding pocket (see ‘Materials and Methods’ section). To generate the collection of structural pockets in both ribosomal subunits, we first calibrated the parameters of the pocket-extraction protocol to correctly identify the known ABSs, and then used these parameters to identify all other similar pockets. Since the majority of antibiotics that bind the ribosome interact mostly with the ribosomal RNA (rRNA) (33), we chose to concentrate on pockets that consist exclusively of rRNA. The location and composition of the pockets were computed using the ‘small and the large probe spheres’ approach, previously applied for identifying ligand-binding sites in proteins (34). Here, we defined a pocket as a volume in the ribosome into which a small probe can enter but a large probe cannot. For automatic extraction of the pockets, we adopted the program ‘solvent’ from the 3V package (http://geometry.molmovdb.org/3v/) on a 1 Å grid space (22).
A plausible way of evaluating the pocket extraction procedure would be to examine the degree of inclusion of the known ABSs in the collection of all computed pockets (Supplementary Tables S1–S4). To achieve a maximum overlap between the extracted pockets and the known binding sites, we applied a calibration procedure that tested different small and large probe radii (see ‘Materials and Methods’ section). After selecting the optimal probe radii (Supplementary Tables S1–S2), we set the size of the computed pocket to be equal to the average number of nts in the known ABS (Supplementary Table S3). Finally, we defined the optimal parameters for the pockets (1.5 Å for the small probe radius, 5.5 Å for the large probe radius, and a pocket size of 12 nts). These parameters yielded a minimal number of computed pockets that provides a maximal overlap with the known ABSs. Overall, the overlapping values between the known and computed pockets were in the range between 50% and 100 % (Supplementary Table S5). The high accuracy of pocket extraction is exemplified in Figure 1, which illustrates the overlap between a computed pocket and the known ABS of tetracycline in the 16S rRNA (Figure 1A), and the erythromycin binding site in the 23S rRNA (Figure 1B). The high overlap between computed and known sites indicates that the computational procedure used in this study is suitable for identifying putative pockets with similar dimensions to the known ABSs.
Having applied the procedure to the apo-structures of the T. thermophilus small ribosomal subunit (PDB ID 1J5E) and the H. marismortui large subunit (PDB ID 1JJ2), we extracted a total of 17 610 overlapping pockets that differ in at least 1 nt (Supplementary Table S4). The large number of pockets is not surprising given the enormous size of the ribosome, and is consistent with the notion that the RNA component of the ribosome presents a large network of interconnected channels that are readily accessible to small ligands (22). Furthermore, the discrepancy between the large number of putative pockets and the number of experimentally defined ABSs implies that the dimension parameter alone is not a sufficient criterion to explain the specific binding of antibiotics to the rRNA.
Despite the high number of potential binding pockets for small molecules in the ribosome, known ribosomal inhibitors act on a limited number of sites. In order to identify the properties distinguishing a real ABS from other random pockets on the ribosome (Table 1), we computed a set of physical, chemical and structural features (listed in Supplementary Table S6), as well as the extent of evolutionary conservation for both the experimentally validated ABSs and for all computed pockets. Clearly, apart from evolutionary conservation, a very important feature that is expected from clinically affective antibiotic binding sites is their specificity to the drug (i.e. the binding site in pathogenic bacteria should differ from the corresponding site in the host ribosome). However, since bacterial specificity usually boils down to a single nucleotide [as in the macrolide binding pockets (35)] which could not be identified in our statistic-based method and is not expected from all ABSs that we have included in our analysis (i.e. the archaeal ABSs), we did not incorporate specificity among the computed parameters. The selected features (Supplementary Table S6) were further compared between known binding sites and all computed pockets. To further test whether the properties that over represented in ABSs in of both the T. thermophilus, and H. marismortui are unique to druggable sites, we examined their abundance in both ribosomal protein-binding interfaces and in sites that consist of ribosome–ligands interactions. Specifically, we included nucleotides that interact with the mRNA and the tRNA, i.e. the A, P and E sites (see ‘Materials and Methods’ section). The properties of the 65 known binding sites, which exhibit statistically significant bias compared to all computed pockets, are presented in Figure 2 and are discussed in more detail in the subsequent paragraphs.
When analyzing the structural composition of the known ABSs in comparison to the computed pockets, we noticed an increased representation of atoms belonging to the nitrogen bases rather than atoms of the sugar-phosphate backbone. This phenomenon was prominent specifically in binding sites of the large ribosomal subunit (Figure 2A, indices 22–63). The overabundance of atoms of the nitrogen bases in the ABSs may reflect the known sequence specific interactions of many antibiotics with the ribosome that form the basis of selectivity of clinically important medicines. Nevertheless, we did not identify a statistically significant preference for a specific type of nitrogen base or an explicit donor/acceptor group in the known ABSs (Figure 2A). Upon examining the occurrence of the 16 possible base pairs (26) in ABSs (see ‘Materials and Methods’ section), we noticed a significant overrepresentation of the non canonical UC base pair and an underrepresentation of the canonical UA pair in the 23S rRNA binding sites. Furthermore, the GC was underrepresented in ABSs in both 16S and 23S rRNA. Notably, such bias was specific to ABSs and was not observed in the ribosomal protein-binding interface, while the GC and UA underrepresentations were found in the natural ligand-binding sites in the large subunit (Supplementary Table S7).
The most apparent bias in the characteristics of the known ABSs was the overrepresentation of non-paired bases (Figure 2C). This phenomenon was more prominent in the large ribosomal subunit (Supplementary Table S7 and S8). Non-paired bases obviously offer a greater variation in the chemical groups available for structure-specific interaction with the drug (or natural ligand) compared to the nitrogen bases engaged in the inter-nucleotide interactions. Therefore, rRNA pockets with the higher representation of non-paired bases could be evolutionary selected as preferred ABSs. We further analyzed various types of interactions between the nts in the pockets. Interactions were annotated according to three edges of H-bonding interactions [Watson–Crick (W)/Hoogsteen (H)/Sugar (S)] (24,27). Further, base–base interactions were categorized in terms of stacking interactions between adjacent and non-adjacent nts (26). We observed a bias to the non-canonical W-H pairing in the known ABSs. Additionally, we noted an overrepresentation of non-adjacent base–base stacking interactions in the large ribosomal subunit (Figure 2C).
The A-minor motif, which involves inserting the edges of adenine bases into the minor groove of RNA helices, was previously found to be the most abundant structural motif in the large ribosomal subunit (36–39). Consistent with previous observations showing the importance of the A-minor motif in functional sites (51), we found that a considerable number (Supplementary Table S9) of ABSs, specifically in the 16S rRNA, comprise diverse elements of the A-minor motif. Overall, the results of structural analysis of RNA bases in the ABSs are consistent with the notion that antibiotic binding, both in the large and small subunits, involves RNA nts belonging to single-stranded RNA regions or distortions in the RNA helices such as in bulges, loops, etc. (40).
Interestingly, we also observed a preference for syn over anti conformation of the RNA bases in the majority of known ABSs (Supplementary Tables S7 and S8). Syn and anti conformations are defined by the glycosidic torsion angle χ. It was previously shown that the syn conformation has higher energies than the anti conformation in pyrimidine nucleosides (41). The sugar conformation is known to have a great effect on the distribution of glycosidic torsion angles. Specifically, the C2 endo sugar pucker provides less steric hindrance than the C3 endo, allowing a wider range of χ values and less energy cost for a syn conformation (19,30). Consistent with the latter, we found that the ribose sugar pucker in C3 endo conformation was underrepresented in the vast majority of ABSs. The relative overabundance of the unusual sugar pucker conformation (both C2 endo and C4 exo) and syn conformation of RNA bases in ligand-binding sites could be associated with the higher flexibility of rRNA in these sites (42,43). The flexibility of the glycosidic bonds of the bases in the antibiotic binding sites of the large ribosomal subunit (specifically at the PTC and the tunnel region) was recently suggested to explain antibiotic selectivity and action (44). Possibly during evolution, antibiotics were selected to target flexible RNA pockets that are capable of the induced fit-binding mode (45). Consistent with the result in the 23S rRNA, a preference for non-paired bases and the syn conformation was also observed in the 16S rRNA (Figure 2).
Interestingly, we observed that stacking interactions (both adjacent and non-adjacent) are underrepresented in the ribosome–ligand-binding interface, but the non-adjacent interactions are overrepresented in both the ABSs and protein binding of the 23S rRNA (Supplementary Table S7). Remarkably, in the protein-binding interface, we could not identify bias to the non-paired bases, Watson–Hoogsteen pairing, syn conformation, or irregular sugar pucker. Comparison of structural properties of known antibiotic sites between the T. thermophilus and Escherichia coli ribosomes (34) showed that the preference for non-paired nts, non-adjacent base-base stacking interactions, and high occurrence of syn conformation appear to be conserved between ribosomes of different bacterial species (Figure S1). The conservation of unique properties in the binding pockets of these species emphasizes the importance of the unique structural properties to the function of these sites.
Recent studies have employed the network (graph) approach to analyze unique properties of macromolecule structures (31,46–48). In such structure-based networks, the amino-acids/bases are represented as nodes, and inter-residue interactions as edges. In a previous study, we showed that the centrality parameters (i.e. closeness, betweenness and degree) were highly correlated with functionality of the nt in the ribosome (31). As demonstrated in Figure 2D, we found that the nts in most known ABSs are characterized with high centrality values, and a specifically high closeness. When we further calculated the evolutionary conservation for the three domains of life (bacteria, archaea and eukarya) using the ARB-SILVA database (49), a high degree of conservation was observed in most ABSs. This observation is in accord with previously noted evolutionary conservation of rRNA in the functional centers of the ribosome (Figure 2E) (50).
Summarizing our findings discussed in the previous paragraphs, we note that the known sites of antibiotic binding in the ribosome are preferentially endowed with specific structural properties. Characteristics such as the prevalence of the A-minor motif (42,51) and high centrality values (31) have been previously shown to be associated with ribosome functional sites. Therefore, we tested whether the other structural properties distinguishing the ABSs could be attributable to the functional sites in general. Upon selecting a subset of top 25% highly conserved nts, we found that they exhibit a property bias similar to that found for the antibiotic sites, such as overabundance of non-paired bases, unusual sugar pucker and prevalence of syn conformation (Figure 3). These results are also consistent with the above analysis of mRNA and the A, P and E sites, in which we observed bias in favor of non-paired bases and the syn conformation, while an underrepresentation was observed in the C3 endo conformation and canonical Watson–Crick base pairing (GC) (Supplementary Table S7). Likely, the bias we observed in ABSs reflects the unique structural properties of functional sites on the ribosome that contribute to the high affinity binding of their natural ligands.
Knowledge of the properties of known ABSs can illuminate which other pockets in rRNA could serve as binding sites for new antibiotics. However, clearly further research on site functionality and specificity is required. In order to estimate whether a computed pocket can be a potential drug-binding site, we assigned a ‘druggability index’ (DI) score [Equation (4)] to each computed pocket. DI weights the properties that were found significantly biased in the known ABS in H. marismortui 50S subunit (Table 1) as follows:
where x is a property presented in Supplementary Table S7, Px is the P-value of property x, and Fx is the fraction of nts in the computed pocket with property x. It is important to note that the DI was derived from the preferences for specific structural properties we observed mainly in the large subunit (Figure 2). As a result, we applied the DI to identify new structural pockets exclusively in the large ribosomal subunit. Figure 4 plots the DI values of computed pockets in the large ribosomal subunit with DI >0 (comprising 63% of all computed pockets) versus their calculated average evolutionary conservation, usually employed to identify functional sites.
As shown, while many computed sites in the 23S rRNA possess a very high conservation (demonstrated by the high density of dots in the right-hand side of Figure 4), we found that the computed pockets that overlap with the known ABSs are both highly conserved and possess a high DI score, which mainly reflects their unique structural properties (Figure 4, red dots). Interestingly, among the known sites of antibiotic action, the computed site that overlapped the macrolides binding pocket in the nascent peptide exit tunnel had the highest DI value. The top four ranked computed pockets (with the highest DI scores) are presented in Figure 5. Three of them overlap experimentally validated ABSs. Pockets 1 and 2 (Figure 5A and B) overlap the binding sites of the macrolides and ketolides antibiotics. The third top-ranking pocket (Figure 5C) is located near the ribosomal E site and overlaps the binding site of girodazole (12). The site ranked in fourth place (Figure 5D) is located about 12 Å from the nascent peptide tunnel close to the loop of ribosomal protein L22. This computed site comprises a methylated residue m6A1618 (52) and is not targeted by any antibiotic currently in use. The overall distribution of the DI score over the 23S rRNA molecular surface is presented in Figure 6. As demonstrated, very high DI values were obtained in the vicinity of the PTC and the E-site region. A zoom to the E-site region showing a view into the exit tunnel is also presented. As shown, the red (high DI) patch decorating the tunnel walls is the region of macrolides-binding sites. Notably, other regions of the large ribosomal subunit demonstrate patches of high DI values, suggesting other potential binding sites.
To test the validity of the DI score in clinically relevant sites, we examined the DI scores of the tetracycline computed pockets in the small subunit that comprise both the clinically relevant primary site and additional sites that may include non specific interactions with tetracycline (53). Interestingly, of all sites the highest DI was calculated for the primary binding site (1HNW, TAC1001, DI = 0.72; 1I97, TAC2001, DI = 0.65); this score was higher (Supplementary Table S10) than the secondary site (1HNW TAC1002, DI = 0.59) and the other observed sites (1I97) (7). To further evaluate the potential of DI to detect ABSs, we concentrated on sites possessing a DI value that fell in the range of the DI of known ABSs (0.44<DI<0.98). These sites accounted for 21% (i.e. 2487 pockets including the 606 known ABSs). Overall, we indentified 1181 overlapping pockets on the 23S rRNA as potential new druggable sites. Interestingly, several of them overlap our previously detected deleterious point mutations (16,17) (Figure 4, cyan and blue dots). As shown in Figure 4, the conservation and DI of these pockets resemble the values of known ABSs. Two 23S rRNA sites outside PTC, the sarcin–ricin loop (centered around nt 2664) and the internal loop in the A-site finger (around nt 864) are characterized by clusters of deleterious mutations that make them attractive candidates for drug development. Of these two sites, the sarcin–ricin loop pocket has a higher DI (DI = 0.68) than the A-site finger pocket (DI = 0.49). Yet, the DI values of both sites are high enough to place both of them in the list of potentially druggable ribosomal sites.
Figure 7 further demonstrates the potential to combine the DI with the existence of structural motifs preferentially found in known binding pockets, such as the A-minor motif. An example of such a putative site was found in the A-site finger. The pocket was computed from the H. marismortui structure (PDB 1JJ2) and has a DI value of 0.49. This pocket consists of 12 nt, six which form non-adjacent base–base stacking interactions. Additionally, the pocket has a UC mismatch base pair (Figure 6; positions 913, 914), and an unpaired adenine in position 910, which participates in an A-minor motif interaction. The nts that were identified as A-minor motif elements refer to positions 862 and 863, which form interactions with adenines in the 5S rRNA. As discussed above, these properties were commonly found in functional sites as well as in known ABSs. The functional role of this highly conserved region is also denoted by the presence of a strong deleterious mutation (position 864) (17). Although the function of this region is unclear due to its high DI value and overall functional characteristics, we suggest that this pocket could be a good candidate for a novel drug.
In an attempt to better understand the atomic and structural features that could elucidate the druggability of ABSs on the ribosome, we studied the properties of 65 antibiotic binding pockets extracted from the available high-resolution structures of ribosome-antibiotic complexes (3,4,7,12,35,53–56). In addition, we applied an automatic method to compute structural pockets from the T. thermophilus and H. marismortui small and large ribosomal subunits. Overall, we extracted ~17 000 overlapping computed pockets on the ribosome, all in the same range of dimensions as the known binding sites. The enormous number of computed binding sites clearly shows that the ribosome, which is comprised of a large network of interconnected channels, is much more accessible to small ligands than traditional protein drug targets. However, to date, only a small number of possible binding pockets in the ribosome has been verified to bind antibiotics.
A comprehensive analysis of the physicochemical and structural (local and global) features of RNA in the known antibiotic binding pockets revealed that while the sites differ in their dimension and atomic details, they share several unique properties, namely an ‘RNA signature’. The most noticeable structural properties found in the majority of known binding sites were the prevalence of non-paired bases, preference for the unusual syn base conformation and unusual ribose sugar pucker. We postulate that the unique base conformation and sugar pucker may reflect higher flexibility of the rRNA bases in the ABSs as in natural ligands. We believe that the properties we identified, specifically the suggested DI, will facilitate the identification of new functional sites suitable for drug design. Clearly, other considerations such as bacterial specificity of the site and experimental validation of site functionality will be required. Furthermore, we believe that a similar approach could be implemented for characterizing and identifying other ligand-binding sites in non-ribosomal RNA targets (such as non-coding RNAs) for which selective ligands are extremely scarce or do not exist.
Supplementary Data are available at NAR Online.
The United States, Israel Binational Science Foundation (Grant Number 2007453 to Y.M.G. and A.S.M.); Israeli Science Foundation (Grant number 1297/09 to Y.M.G.); and by the J. and A. Taub Biological Research (H.D.E.). Funding for open access charge: The United States, Israel Binational Science Foundation (grant Number 2007453).
Conflict of interest statement. None declared.
We would like to thank Ada Yonath for her inspiring ideas and helpful suggestions throughout the course of this study. Many thanks to Anat Bashan and Chen Davidovich for their help in generating the ABSs list and for their many helpful suggestions.