Drug attrition late in preclinical or clinical development is a serious economic problem in the field of drug discovery. These problems can be linked, in part, to the quality of the compound collections used during the hit generation stage and to the selection of compounds undergoing optimization. Here, we present FAF-Drugs3, a web server that can be used for drug discovery and chemical biology projects to help in preparing compound libraries and to assist decision-making during the hit selection/lead optimization phase. Since it was first described in 2006, FAF-Drugs has been significantly modified. The tool now applies an enhanced structure curation procedure, can filter or analyze molecules with user-defined or eight predefined physicochemical filters as well as with several simple ADMET (absorption, distribution, metabolism, excretion and toxicity) rules. In addition, compounds can be filtered using an updated list of 154 hand-curated structural alerts while Pan Assay Interference compounds (PAINS) and other, generally unwanted groups are also investigated. FAF-Drugs3 offers access to user-friendly html result pages and the possibility to download all computed data. The server requires as input an SDF file of the compounds; it is open to all users and can be accessed without registration at http://fafdrugs3.mti.univ-paris-diderot.fr.
Open screening endeavors play, and will continue to play, a key role in facilitating the identification of new bioactive compounds, fostering innovation and improving the effectiveness of chemical biology and drug discovery processes. In this line, we developed the new web server MTiOpenScreen dedicated to small molecule docking and virtual screening. It includes two services, MTiAutoDock and MTiOpenScreen, which allow docking into a user-defined binding site or blind docking using AutoDock 4.2, and automated virtual screening with AutoDock Vina. MTiOpenScreen provides valuable starting collections for screening, two in-house prepared drug-like chemical libraries containing 150 000 PubChem compounds: the Diverse-lib containing diverse molecules and the iPPI-lib enriched in molecules likely to inhibit protein–protein interactions. In addition, MTiOpenScreen offers users the possibility to screen up to 5000 small molecules selected outside our two libraries. The predicted binding poses and energies of up to 1000 top ranked ligands can be downloaded. In this way, MTiOpenScreen enables researchers to apply virtual screening using different chemical libraries on traditional or more challenging protein targets such as protein–protein interactions. The MTiOpenScreen web server is free and open to all users at http://bioserv.rpbs.univ-paris-diderot.fr/services/MTiOpenScreen/.
Fundamental processes in living cells are largely controlled by macromolecular interactions. Among them, protein–protein interactions (PPIs) have a critical role, and their dysregulation can contribute to the pathogenesis of numerous diseases. Although PPIs were already considered attractive pharmaceutical targets some years ago, they have thus far been largely unexploited for therapeutic interventions with low molecular weight compounds. Several limiting factors, from technological hurdles to conceptual barriers, are known, which, taken together, explain why research in this area has been relatively slow. However, over the last decade, the scientific community has challenged the dogma and become more enthusiastic about the modulation of PPIs with small drug-like molecules. In fact, several success stories have been reported at both the preclinical and clinical stages. In this review article, written for the 2014 International Summer School in Chemoinformatics (Strasbourg, France), we discuss in silico tools (essentially post 2012) and databases that can assist the design of low molecular weight PPI modulators (these tools can be found at www.vls3d.com). We first introduce the field of protein–protein interaction research, discuss key challenges and comment on recently reported in silico packages, protocols and databases dedicated to PPIs. Then, we illustrate how in silico methods can be used and combined with experimental work to identify PPI modulators.
Protein–protein interaction modulators; Drug discovery; Drug-like molecules; In silico methods; PPI network
Protein-protein interactions (PPIs) are involved in vital cellular processes and are therefore associated with a growing number of diseases. But working with them as therapeutic targets comes with some major hurdles that require substantial changes in the way we design drugs compared with historical targets such as enzymes and G-protein coupled receptors (GPCRs). Among the numerous ways we could improve our methodologies to maximize the potential of developing new chemical entities on PPI targets, a fundamental question is what type of compounds we should use to identify the first hits and within which chemical space we should navigate to optimize them to the drug candidate stage. In this review article, we cover different aspects of PPIs with the aim of gaining some insights into the specific nature of the chemical space of PPI inhibitors. We describe the work of different groups to highlight such properties and discuss their respective approaches. We finally discuss a case study in which we describe the properties of a set of 115 PPI inhibitors that we compare to a reference set of 1730 enzyme inhibitors. This case study highlights interesting properties such as the unfortunate price that still needs to be paid by PPI inhibitors in terms of molecular weight, hydrophobicity, and aromaticity in order to reach a critical level of activity. But it also shows that not all PPI targets are equivalent, and that some PPI targets can demonstrate better druggability, as illustrated by the better drug likeness of their associated inhibitors.
Binding Sites; Computer Simulation; Drug Discovery; methods; Enzyme Inhibitors; chemistry; pharmacology; Principal Component Analysis; Protein Binding; Protein Interaction Maps; Proteins; chemistry; Small Molecule Libraries; chemistry; pharmacology; Chemical space; protein-protein interactions; compound collection; ADME; chemoinformatics; therapeutic targets; mutations; enzymes; G-Protein Coupled Receptor (GPCR); druggability
Drug metabolizing enzymes play a key role in the metabolism, elimination and detoxification of xenobiotics, drugs and endogenous molecules. While their principal role is to detoxify organisms by modifying compounds, such as pollutants or drugs, for rapid excretion, in some cases they render their substrates more toxic, thereby inducing severe side effects and adverse drug reactions, or their inhibition can lead to drug–drug interactions. We focus on sulfotransferases (SULTs), a family of phase II metabolizing enzymes acting on a large number of drugs and hormones and showing important structural flexibility. Here we report a novel in silico structure-based approach to probe ligand binding to SULTs. We explored the flexibility of SULTs by molecular dynamics (MD) simulations in order to identify the most suitable multiple receptor conformations for ligand binding prediction. Then, we employed a structure-based docking-scoring approach to predict ligand binding and finally we combined the predicted interaction energies by using a QSAR methodology. The results showed that our protocol successfully prioritizes potent binders for the SULT1 isoforms studied here and gives new insights into the specific molecular mechanisms of binding of diverse ligands in relation to the plasticity of their binding sites. Our best QSAR models, which introduce the protein-ligand interaction energy predicted by docking, showed accuracies of 67.28%, 78.00% and 75.46% for the isoforms SULT1A1, SULT1A3 and SULT1E1, respectively. To the best of our knowledge, our protocol is the first in silico structure-based approach consisting of a protein-ligand interaction analysis at the atomic level that considers both ligand and enzyme flexibility, along with a QSAR approach, to identify small molecules that can interact with phase II drug metabolizing enzymes.
Pockets are today a cornerstone of modern drug discovery projects and at the crossroads of several research fields, from structural biology to mathematical modeling. Being able to predict whether a small molecule could bind to one or more protein targets, or whether a protein could bind to some given ligands, is very useful for drug discovery endeavors and for anticipating binding to off- and anti-targets. To date, several studies have explored such questions, from chemogenomic approaches to reverse docking methods. Most of these studies have been performed either from the viewpoint of ligands or from that of targets. However, it seems valuable to use information from both ligands and target binding pockets. Hence, we present a multivariate approach relating ligand properties to protein pocket properties from the analysis of known ligand-protein interactions. We explored and optimized the pocket-ligand pair space by combining pocket and ligand descriptors using Principal Component Analysis and developed a classification engine on this paired space, revealing five main clusters of pocket-ligand pairs sharing specific and similar structural or physico-chemical properties. These pocket-ligand pair clusters highlight correspondences between pocket and ligand topological and physico-chemical properties and capture relevant information with respect to protein-ligand interactions. Based on these pocket-ligand correspondences, a protocol that predicts, for a given pocket-ligand complex, the cluster sharing similar recognition characteristics is developed and achieves high performance. It is then extended to cluster prediction for a given pocket in order to acquire knowledge about its expected ligand profile, or to cluster prediction for a given ligand in order to acquire knowledge about its expected pocket profile.
This prediction approach shows promising results and could contribute to predict some ligand properties critical for binding to a given pocket, and conversely, some key pocket properties for ligand binding.
In the past decade, the spleen tyrosine kinase (Syk) has shown a high potential for the discovery of new treatments for inflammatory and autoimmune disorders. Pharmacological inhibitors of the Syk catalytic site bearing therapeutic potential have been developed, although with limited specificity towards Syk. To address this topic, we opted for the design of drug-like compounds that could impede the interaction of Syk with its cellular partners while maintaining an active kinase protein. To achieve this challenging task, we used the powerful potential of intracellular antibodies for the modulation of cellular functions in vivo, combined with structure-based in silico screening. In our previous studies, we reported the anti-allergic properties of the intracellular antibody G4G11. With the aim of finding functional mimics of G4G11, we developed an Antibody Displacement Assay and we isolated the drug-like compound C-13, with promising in vivo anti-allergic activity. The likely binding cavity of this compound is located in the close vicinity of the G4G11 epitope, far away from the catalytic site of Syk. Here we report the virtual screen of a collection of 500,000 molecules against this new cavity, which led to the isolation of 1000 compounds subsequently evaluated for their in vitro inhibitory effects using the Antibody Displacement Assay. Eighty-five compounds were selected and evaluated for their ability to inhibit the liberation of allergic mediators from mast cells. Among them, 10 compounds inhibited degranulation with IC50 values ≤10 µM. The most bioactive compounds combine biological activity, significant inhibition of antibody binding and strong affinity for Syk. Moreover, these molecules show a good potential for oral bioavailability and are not kinase catalytic site inhibitors.
These bioactive compounds could be used as starting points for the development of new classes of non-enzymatic inhibitors of Syk and for drug discovery endeavour in the field of inflammation related disorders.
In zebrafish, vascular endothelial growth factor-C precursor (proVEGF-C) processing occurs within the dibasic motif HSIIRR214 suggesting the involvement of one or more basic amino acid-specific proprotein convertases (PCs) in this process. In the present study, we examined zebrafish proVEGF-C expression and processing and the effect of unprocessed proVEGF-C on caudal fin regeneration.
Cell transfection assays revealed that the cleavage of proVEGF-C, mainly mediated by the proprotein convertases Furin and PC5 and to a lesser degree by PACE4 and PC7, is abolished by PC inhibitors or by mutation of its cleavage site (HSIIRR214 into HSIISS214). In vitro, unprocessed proVEGF-C failed to activate its signaling proteins Akt and ERK and to induce cell proliferation. In vivo, following caudal fin amputation, the induction of VEGF-C, Furin and PC5 expression occurs as early as 2 days post-amputation (dpa), with maximum levels at 4–7 dpa. Using immunofluorescence staining, we localized high expression of VEGF-C and the convertases Furin and PC5 surrounding the apical growth zone of the regenerating fin. While expression of wild-type proVEGF-C in this area had no effect, unprocessed proVEGF-C inhibited fin regeneration.
Taken together, these data indicate that zebrafish fin regeneration is associated with up-regulation of VEGF-C and the convertases Furin and PC5 and highlight the inhibitory effect of unprocessed proVEGF-C on fin regeneration.
Protein-protein interactions (PPIs) may represent one of the next major classes of therapeutic targets. So far, only a minute fraction of the estimated 650,000 PPIs that comprise the human interactome are known, with a tiny number of complexes being drugged. Such intricate biological systems cannot be cost-efficiently tackled using conventional high-throughput screening methods. Rather, the time has come to design new strategies that will maximize the chance of hit identification through a rationalization of the PPI inhibitor chemical space and the design of PPI-focused compound libraries (global or target-specific). Here, we train machine-learning-based models, mainly decision trees, using a dataset of known PPI inhibitors and of regular drugs in order to determine a global physico-chemical profile for putative PPI inhibitors. This statistical analysis unravels two important molecular descriptors for PPI inhibitors, characterizing specific molecular shapes and the presence of a privileged number of aromatic bonds. The best model has been transposed into a computer program, PPI-HitProfiler, that can output from any drug-like compound collection a focused chemical library enriched in putative PPI inhibitors. Our PPI inhibitor profiler is challenged on the experimental screening results of 11 different PPIs, among which is the p53/MDM2 interaction screened within our own CDithem platform; this screen, in addition to validating our concept, led to the identification of 4 novel p53/MDM2 inhibitors. Collectively, our tool shows a robust behavior on the 11 experimental datasets by correctly profiling 70% of the experimentally identified hits while removing 52% of the inactive compounds from the initial compound collections. We strongly believe that this new tool can be used as a global PPI inhibitor profiler prior to screening assays to reduce the size of the compound collections to be experimentally screened while keeping most of the true PPI inhibitors.
PPI-HitProfiler is freely available on request from our CDithem platform website, www.CDithem.com.
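The two performance figures quoted for the profiler (share of experimental hits retained and share of inactive compounds removed) can be computed from any labeled screening data set. The following is a minimal sketch of that bookkeeping only; the function name `profile_stats` and the toy labels are hypothetical, and the code does not reproduce the PPI-HitProfiler decision tree itself.

```python
def profile_stats(is_hit, is_kept):
    """Return (fraction of hits kept, fraction of inactives removed).

    is_hit  -- list of bools, True when the compound was an experimental hit
    is_kept -- list of bools, True when the profiler kept the compound
    """
    if len(is_hit) != len(is_kept):
        raise ValueError("both lists must describe the same compounds")
    hits_total = sum(is_hit)
    inactives_total = len(is_hit) - hits_total
    if hits_total == 0 or inactives_total == 0:
        raise ValueError("need at least one hit and one inactive")
    hits_kept = sum(1 for h, k in zip(is_hit, is_kept) if h and k)
    inactives_removed = sum(
        1 for h, k in zip(is_hit, is_kept) if not h and not k
    )
    return hits_kept / hits_total, inactives_removed / inactives_total


# Toy data invented for illustration: 4 hits followed by 6 inactives.
is_hit = [True, True, True, True, False, False, False, False, False, False]
is_kept = [True, True, True, False, True, True, True, False, False, False]
hit_recovery, inactive_removal = profile_stats(is_hit, is_kept)
print(hit_recovery, inactive_removal)
```

Reporting both numbers together matters: a profiler that keeps everything recovers all hits but removes no inactives, so neither figure is informative on its own.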
Protein-protein interactions (PPIs) are essential to life and various disease states are associated with aberrant PPIs. Therefore, significant efforts are dedicated to this new class of therapeutic targets. Even though it might not be possible to modulate the estimated 650,000 PPIs that regulate human life with drug-like compounds, a sizeable number of PPIs should be druggable. Only 10-15% of the human genome is thought to be druggable, with around 1000-3000 druggable protein targets. A hypothetical similar ratio for PPIs would bring the number of druggable PPIs to about 65,000, although no data can yet support such a hypothesis. PPIs have historically been difficult to tackle with standard experimental and virtual screening techniques, possibly because of the shift in chemical space between today's chemical libraries and PPI physico-chemical requirements. Therefore, one possible avenue to circumvent this conundrum is to design focused libraries enriched in putative PPI inhibitors. Here, we show how chemoinformatics can assist library design by learning physico-chemical rules from a data set of known PPI inhibitors and their comparison with regular drugs. Our study shows the importance of specific molecular shapes and a privileged number of aromatic bonds.
Discovery of new bioactive molecules that could enter drug discovery programs or that could serve as chemical probes is a very complex and costly endeavor. Structure-based and ligand-based in silico screening approaches are nowadays extensively used to complement experimental screening approaches in order to increase the effectiveness of the process and to facilitate the screening of thousands or millions of small molecules against a biomolecular target. Both in silico screening methods require as input a suitable chemical compound collection, and most often the 3D structure of the small molecules has to be generated since compounds are usually delivered in 1D SMILES, CANSMILES or in 2D SDF formats.
Here, we describe the new open source program DG-AMMOS which allows the generation of the 3D conformation of small molecules using Distance Geometry and their energy minimization via Automated Molecular Mechanics Optimization. The program is validated on the Astex dataset, the ChemBridge Diversity database and on a number of small molecules with known crystal structures extracted from the Cambridge Structural Database. A comparison is carried out with the free program Balloon and the well-known commercial program Omega, both of which generate 3D structures of small molecules. The results show that the new free program DG-AMMOS is a very efficient 3D structure generator engine.
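The estimate of about 65,000 druggable PPIs corresponds to the lower end of the 10-15% range applied to 650,000 PPIs. As a quick check of that arithmetic (the extrapolation itself remains hypothetical, as the text states):

```python
# Figures taken from the text: estimated interactome size and druggable range.
estimated_ppis = 650_000
druggable_percent_low, druggable_percent_high = 10, 15

# Integer arithmetic avoids floating-point rounding.
druggable_low = estimated_ppis * druggable_percent_low // 100
druggable_high = estimated_ppis * druggable_percent_high // 100

print(druggable_low, druggable_high)
```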
DG-AMMOS provides fast, automated and reliable access to the generation of 3D conformation of small molecules and facilitates the preparation of a compound collection prior to high-throughput virtual screening computations. The validation of DG-AMMOS on several different datasets proves that generated structures are generally of equal quality or sometimes better than structures obtained by other tested methods.
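Distance-geometry methods generally start from lower and upper bounds on interatomic distances and tighten them with the triangle inequality before sampling coordinates. The sketch below illustrates only that generic upper-bound smoothing step; the function name and the toy bounds are assumptions, and the abstract does not describe how DG-AMMOS implements it.

```python
def smooth_upper_bounds(upper):
    """Tighten a symmetric matrix of upper distance bounds in place.

    Enforces u[i][j] <= u[i][k] + u[k][j] for every atom triple
    (a Floyd-Warshall style shortest-path update).
    """
    n = len(upper)
    for k in range(n):
        for i in range(n):
            for j in range(n):
                via_k = upper[i][k] + upper[k][j]
                if via_k < upper[i][j]:
                    upper[i][j] = via_k
    return upper


# Toy three-atom chain (values in angstroms, invented for illustration):
# pairs 0-1 and 1-2 have tight bounds, the 0-2 bound starts very loose.
bounds = [
    [0.0, 1.5, 99.0],
    [1.5, 0.0, 1.5],
    [99.0, 1.5, 0.0],
]
smooth_upper_bounds(bounds)
print(bounds[0][2])
```

The cubic cost of this loop is acceptable for drug-like molecules with a few dozen atoms, which is the regime the abstract targets.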
Virtual or in silico ligand screening combined with other computational methods is one of the most promising methods to search for new lead compounds, thereby greatly assisting the drug discovery process. Despite considerable progress made in virtual screening methodologies, available computer programs do not easily address problems such as: structural optimization of compounds in a screening library, receptor flexibility/induced-fit, and accurate prediction of protein-ligand interactions. It has been shown that structural optimization of chemical compounds and post-docking optimization in multi-step structure-based virtual screening approaches help to further improve the overall efficiency of the methods. To address some of these points, we developed the program AMMOS for refining both the 3D structures of the small molecules present in chemical libraries and the predicted receptor-ligand complexes, by allowing partial to full atom flexibility through molecular mechanics optimization.
The program AMMOS carries out an automatic procedure that allows for the structural refinement of compound collections and energy minimization of protein-ligand complexes using the open source program AMMP. The performance of our package was evaluated by comparing the structures of small chemical entities minimized by AMMOS with those minimized with the Tripos and MMFF94s force fields. Next, AMMOS was used for full flexible minimization of protein-ligand complexes obtained from a multi-step virtual screening. Enrichment studies of the selected pre-docked complexes containing 60% of the initially added inhibitors were carried out with or without final AMMOS minimization on two protein targets having different binding pocket properties. AMMOS was able to improve the enrichment after the pre-docking stage with 40 to 60% of the initially added active compounds found in the top 3% to 5% of the entire compound collection.
The open source AMMOS program can be helpful in a broad range of in silico drug design studies such as optimization of small molecules or energy minimization of pre-docked protein-ligand complexes. Our enrichment study suggests that AMMOS, designed to minimize a large number of ligands pre-docked in a protein target, can successfully be applied in a final post-processing step and that it can take into account some receptor flexibility within the binding site area.
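Enrichment figures of the kind reported here (share of the known actives found in the top few percent of a ranked collection) can be computed as in the following minimal sketch. The function name, the lower-is-better score convention and the toy data are assumptions made for illustration and are not taken from the AMMOS package.

```python
import math


def actives_recovered(scores, is_active, top_fraction):
    """Fraction of all actives found in the best-scoring top_fraction.

    scores    -- list of energies; lower is assumed to mean better
    is_active -- list of bools aligned with scores
    """
    total_actives = sum(is_active)
    if total_actives == 0:
        raise ValueError("no actives in the collection")
    n_top = max(1, math.ceil(top_fraction * len(scores)))
    ranked = sorted(zip(scores, is_active), key=lambda pair: pair[0])
    found = sum(1 for _, active in ranked[:n_top] if active)
    return found / total_actives


# Toy collection invented for illustration: 20 compounds, 5 actives.
scores = [-20.0 + i for i in range(20)]           # compound 0 ranks best
is_active = [i in (0, 2, 3, 10, 15) for i in range(20)]
recovered = actives_recovered(scores, is_active, top_fraction=0.25)
print(recovered)
```

Running the same function on scores obtained with and without a final minimization step gives a direct before/after comparison on identical compounds.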
Drug discovery and chemical biology are exceedingly complex and demanding enterprises. In recent years there has been increasing awareness of the importance of predicting/optimizing the absorption, distribution, metabolism, excretion and toxicity (ADMET) properties of small chemical compounds along the search process rather than at the final stages. Fast methods for evaluating ADMET properties of small molecules often involve applying a set of simple empirical rules (educated guesses) and as such, compound collections' property profiling can be performed in silico. Clearly, these rules cannot assess the full complexity of the human body but can provide valuable information and assist decision-making.
This paper presents FAF-Drugs2, a free adaptable tool for ADMET filtering of electronic compound collections. FAF-Drugs2 is a command line utility program (written in Python) based on the open source chemistry toolkit OpenBabel, which performs various physicochemical calculations and identifies key functional groups as well as some toxic and unstable molecules/functional groups. In addition to filtered collections, FAF-Drugs2 can provide, via Gnuplot, several distribution diagrams of major physicochemical properties of the screened compound libraries.
We have developed FAF-Drugs2 to facilitate compound collection preparation, prior to (or after) experimental screening or virtual screening computations. Users can select to apply various filtering thresholds and add rules as needed for a given project. As it stands, FAF-Drugs2 implements numerous filtering rules (23 physicochemical rules and 204 substructure searching rules) that can be easily tuned.
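Tunable threshold rules of this kind can be represented as a simple table of allowed ranges. The sketch below is only an illustration under stated assumptions: descriptor values are supplied precomputed rather than calculated with OpenBabel, the names `RULES`, `violations` and `filter_library` are hypothetical, and the bounds follow the widely used rule-of-five limits rather than any FAF-Drugs2 default.

```python
# Hypothetical rule table: property -> (minimum, maximum), None = no bound.
RULES = {
    "mol_weight": (None, 500.0),
    "logp": (None, 5.0),
    "h_bond_donors": (None, 5),
    "h_bond_acceptors": (None, 10),
}


def violations(descriptors, rules):
    """Return the names of the rules that a compound breaks."""
    broken = []
    for name, (low, high) in rules.items():
        value = descriptors[name]
        too_low = low is not None and value < low
        too_high = high is not None and value > high
        if too_low or too_high:
            broken.append(name)
    return broken


def filter_library(library, rules):
    """Split compound ids into (accepted, rejected) lists."""
    accepted, rejected = [], []
    for compound_id, descriptors in library.items():
        target = rejected if violations(descriptors, rules) else accepted
        target.append(compound_id)
    return accepted, rejected


# Toy descriptor values invented for illustration.
library = {
    "cpd_A": {"mol_weight": 320.4, "logp": 2.1,
              "h_bond_donors": 2, "h_bond_acceptors": 5},
    "cpd_B": {"mol_weight": 612.7, "logp": 6.3,
              "h_bond_donors": 1, "h_bond_acceptors": 8},
}
accepted, rejected = filter_library(library, RULES)
print(accepted, rejected)
```

Keeping the thresholds in a data table rather than in code is what makes per-project tuning cheap: a project can edit or extend `RULES` without touching the filtering logic.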
The number of protein targets with a known or predicted three-dimensional structure and of drug-like chemical compounds is growing rapidly, and so is the need for new therapeutic compounds or chemical probes. Performing flexible structure-based virtual screening computations on thousands of targets with millions of molecules is intractable for most laboratories, nor is it indeed desirable. Since shape complementarity is of primary importance for most protein-ligand interactions, we have developed a tool/protocol based on rigid-body docking to select compounds that fit well into binding sites.
Here we present an efficient multiple conformation rigid-body docking approach, MS-DOCK, which is based on the program DOCK. This approach can be used as the first step of a multi-stage docking/scoring protocol. First, we developed and validated the Multiconf-DOCK tool that generates several conformers per input ligand. Then, each generated conformer (bioactives and 37970 decoys) was docked rigidly using DOCK6 with our optimized protocol into seven different receptor-binding sites. MS-DOCK was able to significantly reduce the size of the initial input library for all seven targets, thereby facilitating subsequent more CPU demanding flexible docking procedures.
MS-DOCK can be easily used for the generation of multi-conformer libraries and for shape-based filtering within a multi-step structure-based screening protocol in order to shorten computation times.
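The library-reduction idea (dock several rigid conformers per ligand, keep each ligand's best pose, then pass only the top-ranked ligands to flexible docking) can be sketched as follows. This is a hedged illustration, not MS-DOCK code: the function name, the lower-is-better score convention, the retained fraction and the toy scores are all assumptions.

```python
import math


def shape_filter(conformer_scores, keep_fraction):
    """Return the ligand ids to keep, best first.

    conformer_scores -- dict: ligand id -> list of rigid-docking scores,
                        one per conformer (lower is assumed to be better)
    keep_fraction    -- share of ligands passed to the next stage
    """
    # A ligand is represented by its best-scoring conformer.
    best = {lig: min(s) for lig, s in conformer_scores.items() if s}
    n_keep = max(1, math.ceil(keep_fraction * len(best)))
    ranked = sorted(best, key=lambda lig: best[lig])
    return ranked[:n_keep]


# Toy scores invented for illustration.
conformer_scores = {
    "lig1": [-20.1, -35.4, -28.0],
    "lig2": [-15.0, -18.2],
    "lig3": [-41.7, -10.3],
    "lig4": [-22.5],
}
kept = shape_filter(conformer_scores, keep_fraction=0.5)
print(kept)
```

Taking the minimum over conformers is what lets a rigid-body search approximate ligand flexibility: a ligand is retained if any one of its precomputed shapes fits the site.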
During the last 15 years, drug discovery strategies have essentially focused on identifying small molecules able to inhibit catalytic sites. However, other mechanisms could be targeted. Protein-protein interactions play crucial roles in a number of biological processes, and, as such, their disruption or stabilization is becoming an area of intense activity. Along the same line, inhibition of protein-membrane interactions could be of major importance in several disease indications. Despite the many challenges associated with the development of such classes of interaction modulators, there has been considerable success in recent years. Importantly, through the existence of protein hot-spots and the presence of druggable pockets at the macromolecular interfaces or in their vicinities, it has been possible to find small molecule effectors using a variety of screening techniques, including combined virtual ligand-in vitro screening strategies. Indeed, such in silico-in vitro protocols are emerging as the method of choice to facilitate our quest for novel drug-like compounds or for mechanistic probes aimed at facilitating the understanding of molecular reactions involved in health and disease processes. In this review, we comment on recent successes of combined in silico-in vitro screening methods applied to modulating macromolecular interactions, with a special emphasis on protein-membrane interactions.
Virtual screening; structure-based drug design; drug discovery; protein-protein interaction; protein-membrane interaction
In silico screening based on the structures of the ligands or of the receptors has become an essential tool to facilitate the drug discovery process but compound collections are needed to carry out such in silico experiments. It has been recognized that absorption, distribution, metabolism, excretion and toxicity (ADME/tox) are key properties that need to be considered early on, even during the database preparation stage. FAF-Drugs is an online service based on Frowns (a chemoinformatics toolkit) that allows users to process their own compound collections via simple ADME/Tox filtering rules such as molecular weight, polar surface area, logP or number of rotatable bonds. SMILES (Simplified Molecular Input Line Entry System), CANSMILES (canonical smiles) or SDF (structure data file) files are required as input and molecules that pass or do not pass the filters are sent back in CANSMILES format. This service should thus help scientists engaging in drug discovery campaigns. Other utilities and several compound collections suitable for in silico screening are available at our site. FAF-Drugs can be accessed at .
PCE (protein continuum electrostatics) is an online service for protein electrostatic computations presently based on the MEAD (macroscopic electrostatics with atomic detail) package initially developed by D. Bashford [(2004) Front Biosci., 9, 1082–1099]. This computer method uses a macroscopic electrostatic model for the calculation of protein electrostatic properties, such as pKa values of titratable groups and electrostatic potentials. The MEAD package generates electrostatic energies via finite difference solution to the Poisson–Boltzmann equation. Users submit a PDB file and PCE returns potentials and pKa values as well as color (static or animated) figures displaying electrostatic potentials mapped on the molecular surface. This service is intended to facilitate electrostatics analyses of proteins and thereby broaden the accessibility to continuum electrostatics to the biological community. PCE can be accessed at .
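A computed pKa value is commonly interpreted through the Henderson-Hasselbalch relation, which gives the protonated fraction of a titratable group at a given pH. The sketch below treats a single group in isolation, which is a simplification compared with the coupled titration behavior that continuum electrostatics calculations address; the function name and the example pKa are assumptions and are not part of the PCE service.

```python
def protonated_fraction(pka, ph):
    """Fraction of an isolated titratable group that is protonated at ph."""
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


# Example pKa invented for illustration.
example_pka = 6.5
at_pka = protonated_fraction(example_pka, ph=6.5)
one_unit_above = protonated_fraction(example_pka, ph=7.5)
print(at_pka, one_unit_above)
```

At a pH equal to the pKa the group is half protonated, and each pH unit above the pKa reduces the ratio of protonated to deprotonated forms tenfold.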