1.  A Docking Server for the Identification of Heparin Binding Sites on Proteins 
Many proteins of widely differing functionality and structure are capable of binding heparin and heparan sulfate. Since crystallizing protein-heparin complexes for structure determination is generally difficult, computational docking can be a useful approach for understanding specific interactions. Previous studies used programs originally developed for docking small molecules to well-defined pockets, rather than for docking polysaccharides to highly charged shallow crevices that usually bind heparin. We have extended the program PIPER and the automated protein-protein docking server ClusPro to heparin docking. Using a molecular mechanics energy function for scoring and the fast Fourier transform correlation approach, the method generates and evaluates close to a billion poses of a heparin tetrasaccharide probe. The docked structures are clustered using pairwise root mean square deviations as the distance measure. It was shown that clustering of heparin molecules close to each other but having different orientations and selecting the clusters with the highest protein-ligand contacts reliably predicts the heparin binding site. In addition, the centers of the five most populated clusters include structures close to the native orientation of the heparin. These structures can provide starting points for further refinement by methods that account for flexibility such as molecular dynamics. The heparin docking method is available as an advanced option of the ClusPro server at http://cluspro.bu.edu/.
doi:10.1021/ci500115j
PMCID: PMC4184157  PMID: 24974889
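The fast Fourier transform correlation approach named in the abstract above can be illustrated with a toy grid calculation. This is a minimal sketch, not PIPER's implementation: the function name `fft_correlation`, the single-channel grids, and the periodic (wrap-around) boundary are assumptions made for illustration, and a real docking run would repeat the calculation for many ligand rotations and several energy terms.

```python
import numpy as np

def fft_correlation(receptor, ligand):
    """Score every circular translation t of the ligand grid at once.

    score[t] = sum over x of receptor[x + t] * ligand[x]
    (indices wrap around; both grids are real-valued and equally shaped).
    """
    r_hat = np.fft.fftn(receptor)
    l_hat = np.fft.fftn(ligand)
    # Correlation theorem: correlation in real space is a product with the
    # complex conjugate in Fourier space.
    return np.real(np.fft.ifftn(r_hat * np.conj(l_hat)))

def best_translation(score):
    """Return the grid shift with the highest correlation score."""
    return tuple(int(i) for i in np.unravel_index(np.argmax(score), score.shape))
```

The point of the transform is cost: one forward and one inverse FFT score all translations for a given rotation, instead of one grid sum per translation.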
2.  Docking Server for the Identification of Heparin Binding Sites on Proteins 
Many proteins of widely differing functionality and structure are capable of binding heparin and heparan sulfate. Since crystallizing protein–heparin complexes for structure determination is generally difficult, computational docking can be a useful approach for understanding specific interactions. Previous studies used programs originally developed for docking small molecules to well-defined pockets, rather than for docking polysaccharides to highly charged shallow crevices that usually bind heparin. We have extended the program PIPER and the automated protein–protein docking server ClusPro to heparin docking. Using a molecular mechanics energy function for scoring and the fast Fourier transform correlation approach, the method generates and evaluates close to a billion poses of a heparin tetrasaccharide probe. The docked structures are clustered using pairwise root-mean-square deviations as the distance measure. It was shown that clustering of heparin molecules close to each other but having different orientations and selecting the clusters with the highest protein–ligand contacts reliably predicts the heparin binding site. In addition, the centers of the five most populated clusters include structures close to the native orientation of the heparin. These structures can provide starting points for further refinement by methods that account for flexibility such as molecular dynamics. The heparin docking method is available as an advanced option of the ClusPro server at http://cluspro.bu.edu/.
doi:10.1021/ci500115j
PMCID: PMC4184157  PMID: 24974889
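Clustering docked poses with pairwise root-mean-square deviation as the distance measure can be sketched as follows. This is a hedged illustration under stated assumptions, not the server's code: the greedy "largest neighborhood first" rule, the `radius` parameter, and the function names are choices made here, and the RMSD is computed without superposition because docked poses already share the receptor's frame.

```python
import numpy as np

def pairwise_rmsd(poses):
    """RMSD between all pairs of poses; poses has shape (n_poses, n_atoms, 3)."""
    diff = poses[:, None, :, :] - poses[None, :, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1).mean(axis=-1))

def greedy_cluster(dist, radius):
    """Repeatedly take the pose with the most unassigned neighbors within radius.

    Returns a list of (center_index, member_indices), most populated first.
    """
    n = dist.shape[0]
    unassigned = np.ones(n, dtype=bool)
    clusters = []
    while unassigned.any():
        # neighbors[i, j] is True only if both poses are still unassigned
        neighbors = (dist <= radius) & unassigned[:, None] & unassigned[None, :]
        center = int(np.argmax(neighbors.sum(axis=1)))
        members = np.flatnonzero(neighbors[center])
        clusters.append((center, members.tolist()))
        unassigned[members] = False
    return clusters
```

Ranking clusters by population, as the abstract describes for the five most populated clusters, then amounts to reading the returned list in order.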
3.  Insights into the Architecture of the eIF2Bα/β/δ Regulatory Subcomplex 
Biochemistry  2014;53(21):3432-3445.
Eukaryotic translation initiation factor 2B (eIF2B), the guanine nucleotide exchange factor for the G-protein eIF2, is one of the main targets for the regulation of protein synthesis. The eIF2B activity is inhibited in response to a wide range of stress factors and diseases, including viral infections, hypoxia, nutrient starvation, and heme deficiency, collectively known as the integrated stress response. eIF2B has five subunits (α–ε). The α, β, and δ subunits are homologous to each other and form the eIF2B regulatory subcomplex, which is believed to be a trimer consisting of monomeric α, β, and δ subunits. Here we use a combination of biophysical methods, site-directed mutagenesis, and bioinformatics to show that the human eIF2Bα subunit is in fact a homodimer, at odds with the current trimeric model for the eIF2Bα/β/δ regulatory complex. eIF2Bα dimerizes using the same interface that is found in the homodimeric archaeal eIF2Bα/β/δ homolog aIF2B and related metabolic enzymes. We also present evidence that the eIF2Bβ/δ binding interface is similar to that in the eIF2Bα2 homodimer. Mutations at the predicted eIF2Bβ/δ dimer interface cause genetic neurological disorders in humans. We propose that the eIF2B regulatory subcomplex is an α2β2δ2 hexamer, composed of one α2 homodimer and two βδ heterodimers. Our results offer novel insights into the architecture of eIF2B and its interactions with the G-protein eIF2.
doi:10.1021/bi500346u
PMCID: PMC4045321  PMID: 24811713
4.  How Proteins Bind Macrocycles 
Nature chemical biology  2014;10(9):723-731.
The potential utility of synthetic macrocycles as drugs, particularly against low druggability targets such as protein-protein interactions, has been widely discussed. There is little information, however, to guide the design of macrocycles for good target protein-binding activity or bioavailability. To address this knowledge gap we analyze the binding modes of a representative set of macrocycle-protein complexes. The results, combined with consideration of the physicochemical properties of approved macrocyclic drugs, allow us to propose specific guidelines for the design of synthetic macrocycle libraries possessing structural and physicochemical features likely to favor strong binding to protein targets and also good bioavailability. We additionally provide evidence that large, natural product derived macrocycles can bind to targets that are not druggable by conventional, drug-like compounds, supporting the notion that natural product inspired synthetic macrocycles can expand the number of proteins that are druggable by synthetic small molecules.
doi:10.1038/nchembio.1584
PMCID: PMC4417626  PMID: 25038790
druglikeness; druggability; ligand efficiency; binding mode; macrocyclic drugs
5.  A Subspace Semi-Definite programming-based Underestimation (SSDU) method for stochastic global optimization in protein docking* 
We propose a new stochastic global optimization method targeting protein docking problems. The method is based on finding a general convex polynomial underestimator to the binding energy function in a permissive subspace that possesses a funnel-like structure. We use Principal Component Analysis (PCA) to determine such permissive subspaces. The problem of finding the general convex polynomial underestimator is reduced into the problem of ensuring that a certain polynomial is a Sum-of-Squares (SOS), which can be done via semi-definite programming. The underestimator is then used to bias sampling of the energy function in order to recover a deep minimum. We show that the proposed method significantly improves the quality of docked conformations compared to existing methods.
doi:10.1109/CDC.2014.7040111
PMCID: PMC4405505  PMID: 25914440
6.  Optimization on the space of rigid and flexible motions: an alternative manifold optimization approach 
In this paper we consider the problem of minimization of a cost function that depends on the location and poses of one or more rigid bodies, or bodies that consist of rigid parts hinged together. We present a unified setting for formulating this problem as an optimization on an appropriately defined manifold for which efficient manifold optimizations can be developed. This setting is based on a Lie group representation of the rigid movements of a body that is different from what is commonly used for this purpose. We illustrate this approach by using the steepest descent algorithm on the manifold of the search space and specify conditions for its convergence.
doi:10.1109/CDC.2014.7040301
PMCID: PMC4357846  PMID: 25774073
7.  Stimulators of Translation Identified During a Small Molecule Screening Campaign 
Analytical biochemistry  2014;447:6-14.
In screening a library of natural and synthetic products for eukaryotic translation modulators, we identified two natural products, isohymenialdisine and hymenialdisine, that exhibit stimulatory effects on translation. The characterization of these compounds led to the insight that mRNA used to program the translation extracts during high throughput assay set-up was leading to phosphorylation of eIF2α, a potent negative regulatory event that is mediated by one of four kinases. We identified double-stranded RNA-dependent protein kinase (PKR) as the eIF2α kinase that was being activated by exogenously added mRNA template. Characterization of the mode of action of isohymenialdisine revealed that it directly acts on PKR by inhibiting autophosphorylation, perturbs the PKR-eIF2α phosphorylation axis, and can be modeled into the PKR ATP binding site. Our results identify a source of false positives for high throughput screening (HTS) campaigns using translation extracts, raising a cautionary note for this type of screen.
doi:10.1016/j.ab.2013.10.026
PMCID: PMC3943918  PMID: 24513115
High Throughput Screens; Translation; PKR; eIF2α; Isohymenialdisine; Hymenialdisine
8.  How Good is Automated Protein Docking? 
Proteins  2013;81(12):2159-2166.
The protein docking server ClusPro has been participating in CAPRI since its introduction in 2004. This paper evaluates the performance of ClusPro 2.0 for targets 46–58 in rounds 22–27 of CAPRI. The analysis leads to a number of important observations. First, ClusPro reliably yields acceptable or medium accuracy models for targets of moderate difficulty that have also been successfully predicted by other groups, and fails only for targets that have few acceptable models submitted. Second, the quality of automated docking by ClusPro is very close to that of the best human predictor groups, including our own submissions. This is very important, because servers have to submit results within 48 hours and the predictions should be reproducible, whereas human predictors have several weeks and can use any type of information. Third, while we refined the ClusPro results for manual submission by running computationally costly Monte Carlo minimization simulations, we observed significant improvement in accuracy only for two of the six complexes correctly predicted by ClusPro. Fourth, new developments, not seen in previous rounds of CAPRI, are that the top ranked model provided by ClusPro was acceptable or better quality for all these six targets, and that the top ranked model was also the highest quality for five of the six, confirming that ranking models based on cluster size can reliably identify the best near-native conformations.
doi:10.1002/prot.24403
PMCID: PMC3934018  PMID: 23996272
protein-protein docking; structure refinement; method development; CAPRI docking experiment; web based server; user community
9.  Community-wide Evaluation of Methods for Predicting the Effect of Mutations on Protein-Protein Interactions 
Proteins  2013;81(11):1980-1987.
Community-wide blind prediction experiments such as CAPRI and CASP provide an objective measure of the current state of predictive methodology. Here we describe a community-wide assessment of methods to predict the effects of mutations on protein-protein interactions. Twenty-two groups predicted the effects of comprehensive saturation mutagenesis for two designed influenza hemagglutinin binders and the results were compared with experimental yeast display enrichment data obtained using deep sequencing. The most successful methods explicitly considered the effects of mutation on monomer stability in addition to binding affinity, carried out explicit side chain sampling and backbone relaxation, and evaluated packing, electrostatic and solvation effects, and correctly identified around a third of the beneficial mutations. Much room for improvement remains for even the best techniques, and large-scale fitness landscapes should continue to provide an excellent test bed for continued evaluation of methodological improvement.
doi:10.1002/prot.24356
PMCID: PMC4143140  PMID: 23843247
CAPRI; hemagglutinin; binding; deep mutational scanning; yeast display
10.  Ligand Binding and Activation of PPARγ by Firemaster® 550: Effects on Adipogenesis and Osteogenesis in Vitro 
Environmental Health Perspectives  2014;122(11):1225-1232.
Background: The use of alternative flame retardants has increased since the phase out of pentabromodiphenyl ethers (pentaBDEs). One alternative, Firemaster® 550 (FM550), induces obesity in rats. Triphenyl phosphate (TPP), a component of FM550, has a structure similar to that of organotins, which are obesogenic in rodents.
Objectives: We tested the hypothesis that components of FM550 are biologically active peroxisome proliferator-activated receptor γ (PPARγ) ligands and estimated indoor exposure to TPP.
Methods: FM550 and its components were assessed for ligand binding to and activation of human PPARγ. Solvent mapping was used to model TPP in the PPARγ binding site. Adipocyte and osteoblast differentiation were assessed in bone marrow multipotent mesenchymal stromal cell models. We estimated exposure of children to TPP using a screening-level indoor exposure model and house dust concentrations determined previously.
Results: FM550 bound human PPARγ, and binding appeared to be driven primarily by TPP. Solvent mapping revealed that TPP interacted with binding hot spots within the PPARγ ligand binding domain. FM550 and its organophosphate components increased human PPARγ1 transcriptional activity in a Cos7 reporter assay and induced lipid accumulation and perilipin protein expression in BMS2 cells. FM550 and TPP diverted osteogenic differentiation toward adipogenesis in primary mouse bone marrow cultures. Our estimates suggest that dust ingestion is the major route of exposure of children to TPP.
Conclusions: Our findings suggest that FM550 components bind and activate PPARγ. In addition, in vitro exposure initiated adipocyte differentiation and antagonized osteogenesis. TPP likely is a major contributor to these biological actions. Given that TPP is ubiquitous in house dust, further studies are warranted to investigate the health effects of FM550.
Citation: Pillai HK, Fang M, Beglov D, Kozakov D, Vajda S, Stapleton HM, Webster TF, Schlezinger JJ. 2014. Ligand binding and activation of PPARγ by Firemaster® 550: effects on adipogenesis and osteogenesis in vitro. Environ Health Perspect 122:1225–1232; http://dx.doi.org/10.1289/ehp.1408111
doi:10.1289/ehp.1408111
PMCID: PMC4216168  PMID: 25062436
11.  Evidence of Conformational Selection Driving the Formation of Ligand Binding Sites in Protein-Protein Interfaces 
PLoS Computational Biology  2014;10(10):e1003872.
Many protein-protein interactions (PPIs) are compelling targets for drug discovery, and in a number of cases can be disrupted by small molecules. The main goal of this study is to examine the mechanism of binding site formation in the interface region of proteins that are PPI targets by comparing ligand-free and ligand-bound structures. To avoid any potential bias, we focus on ensembles of ligand-free protein conformations obtained by nuclear magnetic resonance (NMR) techniques and deposited in the Protein Data Bank, rather than on ensembles specifically generated for this study. The measures used for structure comparison are based on detecting binding hot spots, i.e., protein regions that are major contributors to the binding free energy. The main tool of the analysis is computational solvent mapping, which explores the surface of proteins by docking a large number of small “probe” molecules. Although we consider conformational ensembles obtained by NMR techniques, the analysis is independent of the method used for generating the structures. Finding the energetically most important regions, mapping can identify binding site residues using ligand-free models based on NMR data. In addition, the method selects conformations that are similar to some peptide-bound or ligand-bound structure in terms of the properties of the binding site. This agrees with the conformational selection model of molecular recognition, which assumes such pre-existing conformations. The analysis also shows the maximum level of similarity between unbound and bound states that is achieved without any influence from a ligand. Further shift toward the bound structure assumes protein-peptide or protein-ligand interactions, either selecting higher energy conformations that are not part of the NMR ensemble, or leading to induced fit. Thus, forming the sites in protein-protein interfaces that bind peptides and can be targeted by small ligands always includes conformational selection, although other recognition mechanisms may also be involved.
Author Summary
Many protein-protein interfaces (PPIs) are biologically compelling drug targets. Disrupting the interaction between two large proteins by a small inhibitor requires forming a high affinity binding site in the interface that generally can bind both peptides and drug-like compounds. Here we investigate whether such sites are induced by peptide or ligand binding, or already exist in the unbound state. The analysis requires comparing ligand-free and ligand-bound structures. To avoid any potential bias, we study ensembles of ligand-free protein conformations obtained by nuclear magnetic resonance (NMR) rather than generated by simulations. The analysis is based on computational solvent mapping, which explores the surface of the target protein by docking a large number of small “probe” molecules. Results show that ensembles of ligand-free models always include conformations that are fairly similar to some peptide-bound or ligand-bound structure in terms of the properties of the binding site. The analysis also identifies the models that are the most similar to a bound state, and shows the maximum level of similarity that is achieved without any influence from a ligand. While forming the binding site may require a combination of recognition mechanisms, there is preference for the spontaneous formation of bound-like structures.
doi:10.1371/journal.pcbi.1003872
PMCID: PMC4183424  PMID: 25275445
12.  A New Distributed Algorithm for Side-Chain Positioning in the Process of Protein Docking* 
Side-chain positioning (SCP) is an important component of computational protein docking methods. Existing SCP methods and available software have been designed for protein folding applications where side-chain positioning is also important. As a result they do not take into account significant special structure that SCP for docking exhibits. We propose a new algorithm which poses SCP as a Maximum Weighted Independent Set (MWIS) problem on an appropriately constructed graph. We develop an approximate algorithm which solves a relaxation of the MWIS and then rounds the solution to obtain a high-quality feasible solution to the problem. The algorithm is fully distributed and can be executed on a large network of processing nodes requiring only local information and message-passing between neighboring nodes. Motivated by the special structure in docking, we establish optimality guarantees for a certain class of graphs. Our results on a benchmark set of enzyme-inhibitor protein complexes show that our predictions are close to the native structure and are comparable to the ones obtained by a state-of-the-art method. The results are substantially improved if rotamers from unbound protein structures are included in the search. We also establish that the use of our SCP algorithm substantially improves docking results.
doi:10.1109/CDC.2013.6759970
PMCID: PMC4024309  PMID: 24844567
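Posing side-chain positioning as a Maximum Weighted Independent Set problem can be made concrete with a tiny graph. The sketch below is an assumption-laden illustration, not the distributed relaxation-and-rounding algorithm of the abstract: nodes are hypothetical (residue, rotamer) pairs, the weights are made-up favorability scores where higher is better, edges join rotamers that cannot be chosen together, and the solver is a brute-force search that is only practical for very small graphs.

```python
from itertools import combinations

def max_weight_independent_set(weights, edges):
    """Exhaustively find the independent set with the largest total weight.

    weights: dict mapping node -> positive weight
    edges:   iterable of (node, node) pairs that may not both be selected
    """
    nodes = list(weights)
    edge_set = {frozenset(e) for e in edges}
    best, best_weight = (), 0.0
    for size in range(1, len(nodes) + 1):
        for subset in combinations(nodes, size):
            # Skip subsets that contain both endpoints of any edge
            if any(frozenset(pair) in edge_set for pair in combinations(subset, 2)):
                continue
            total = sum(weights[n] for n in subset)
            if total > best_weight:
                best, best_weight = subset, total
    return set(best), best_weight

# Hypothetical example: two residues, two rotamers each
weights = {("A", 0): 2.0, ("A", 1): 3.0, ("B", 0): 2.5, ("B", 1): 1.0}
edges = [
    (("A", 0), ("A", 1)),  # one rotamer per residue
    (("B", 0), ("B", 1)),  # one rotamer per residue
    (("A", 1), ("B", 0)),  # assumed steric clash between these two rotamers
]
```

Here the individually best rotamer of residue A is not selected, because it clashes with the best rotamer of residue B and the pair ("A", 0), ("B", 0) has the higher combined weight.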
13.  Flexible Refinement of Protein-Ligand Docking on Manifolds* 
Our work is motivated by energy minimization of biological macromolecules, an essential step in computational docking. By allowing some ligand flexibility, we generalize a recently introduced novel representation of rigid body minimization as an optimization on the SO(3)×R3 manifold, rather than on the commonly used Special Euclidean group SE(3). We show that the resulting flexible docking can also be formulated as an optimization on a Lie group that is the direct product of simpler Lie groups for which geodesics and exponential maps can be easily obtained. Our computational results for a local optimization algorithm developed based on this formulation show that it is about an order of magnitude faster than the state-of-the-art local minimization algorithms for computational protein-small molecule docking.
doi:10.1109/CDC.2013.6760077
PMCID: PMC4019453  PMID: 24830567
14.  FTFlex: accounting for binding site flexibility to improve fragment-based identification of druggable hot spots 
Bioinformatics  2013;29(9):1218-1219.
Computational solvent mapping finds binding hot spots, determines their druggability and provides information for drug design. While mapping of a ligand-bound structure yields more accurate results, usually the apo structure serves as the starting point in design. The FTFlex algorithm, implemented as a server, can modify an apo structure to yield mapping results that are similar to those of the respective bound structure. Thus, FTFlex is an extension of our FTMap server, which only considers rigid structures. FTFlex identifies flexible residues within the binding site and determines alternative conformations using a rotamer library. In cases where the mapping results of the apo structure were in poor agreement with those of the bound structure, FTFlex was able to yield a modified apo structure, which led to improved FTMap results. In cases where the mapping results of the apo and bound structures were in good agreement, no new structure was predicted.
Availability: FTFlex is freely available as a web-based server at http://ftflex.bu.edu/.
Contact: vajda@bu.edu or midas@bu.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
doi:10.1093/bioinformatics/btt102
PMCID: PMC3634182  PMID: 23476022
15.  Comprehensive Experimental and Computational Analysis of Binding Energy Hot Spots at the NF-κB Essential Modulator (NEMO)/IKKβ Protein-Protein Interface 
We report a comprehensive analysis of binding energy hot spots at the protein-protein interaction (PPI) interface between NF-κB Essential Modulator (NEMO) and IκB kinase subunit β (IKKβ), an interaction that is critical for NF-κB pathway signaling, using experimental alanine scanning mutagenesis and also the FTMap method for computational fragment screening. The experimental results confirm that the previously identified NBD region of IKKβ contains the highest concentration of hot spot residues, the strongest of which are W739, W741 and L742 (ΔΔG = 4.3, 3.5 and 3.2 kcal/mol, respectively). The region occupied by these residues defines a potentially druggable binding site on NEMO that extends for ~16 Å to additionally include the regions that bind IKKβ L737 and F734. NBD residues D738 and S740 are also important for binding but do not make direct contact with NEMO, instead likely acting to stabilize the active conformation of surrounding residues. We additionally found two previously unknown hot spot regions centered on IKKβ residues L708/V709 and L719/I723. The computational approach successfully identified all three hot spot regions on IKKβ. Moreover, the method was able to accurately quantify the energetic importance of all hot spot residues involving direct contact with NEMO. Our results provide new information to guide the discovery of small molecule inhibitors that target the NEMO/IKKβ interaction. They additionally clarify the structural and energetic complementarity between “pocket-forming” and “pocket-occupying” hot spot residues, and further validate computational fragment mapping as a method for identifying hot spots at PPI interfaces.
doi:10.1021/ja400914z
PMCID: PMC3680600  PMID: 23506214
IKKγ; alanine scanning mutagenesis; protein-protein interactions; fluorescence polarization; fluorescence anisotropy
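To give a feel for the ΔΔG values quoted in the abstract above, they can be converted into approximate fold changes in the dissociation constant. This is a worked sketch under assumptions that the abstract does not state: it uses the standard relation ΔΔG = RT ln(Kd,mutant / Kd,wild type), a temperature of 298.15 K, and a hypothetical helper name `fold_change_in_kd`.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)

def fold_change_in_kd(ddg_kcal_per_mol, temperature_k=298.15):
    """Fold increase in Kd implied by a binding free energy change.

    Assumes ddG = R * T * ln(Kd_mutant / Kd_wild_type); the temperature
    default is an assumption, not a value taken from the abstract.
    """
    return math.exp(ddg_kcal_per_mol / (R_KCAL * temperature_k))

# ddG values (kcal/mol) for the three strongest hot spot residues
hot_spots = {"W739": 4.3, "W741": 3.5, "L742": 3.2}
for residue, ddg in hot_spots.items():
    print(residue, round(fold_change_in_kd(ddg)))
```

Under these assumptions, each of the three alanine substitutions corresponds to a loss of affinity of two to three orders of magnitude, which is why they stand out as the strongest hot spots.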
16.  A New Approach to Rigid Body Minimization with Application to Molecular Docking* 
Our work is motivated by energy minimization in the space of rigid affine transformations of macromolecules, an essential step in computational protein-protein docking. We introduce a novel representation of rigid body motion that leads to a natural formulation of the energy minimization problem as an optimization on the SO(3)×R3 manifold, rather than the commonly used SE(3). The new representation avoids the complications associated with optimization on the SE(3) manifold and provides additional flexibilities for optimization not available in that formulation. The approach is applicable to general rigid body minimization problems. Our computational results for a local optimization algorithm developed based on the new approach show that it is about an order of magnitude faster than state-of-the-art local minimization algorithms for computational protein-protein docking.
doi:10.1109/CDC.2012.6426267
PMCID: PMC3992991  PMID: 24763338
17.  Encounter complexes and dimensionality reduction in protein–protein association 
eLife  2014;3:e01370.
An outstanding challenge has been to understand the mechanism whereby proteins associate. We report here the results of exhaustively sampling the conformational space in protein–protein association using a physics-based energy function. The agreement between experimental intermolecular paramagnetic relaxation enhancement (PRE) data and the PRE profiles calculated from the docked structures shows that the method captures both specific and non-specific encounter complexes. To explore the energy landscape in the vicinity of the native structure, the nonlinear manifold describing the relative orientation of two solid bodies is projected onto a Euclidean space in which the shape of low energy regions is studied by principal component analysis. Results show that the energy surface is canyon-like, with a smooth funnel within a two dimensional subspace capturing over 75% of the total motion. Thus, proteins tend to associate along preferred pathways, similar to sliding of a protein along DNA in the process of protein-DNA recognition.
DOI: http://dx.doi.org/10.7554/eLife.01370.001
eLife digest
Proteins rarely act alone. Instead, they tend to bind to other proteins to form structures known as complexes. When two proteins come together to form a complex, they twist and turn through a series of intermediate states before they form the actual complex. These intermediate states are difficult to study because they don’t last for very long, which means that our knowledge of how complexes are formed remains incomplete.
One promising approach for studying the formation of complexes is called paramagnetic relaxation enhancement. In this technique certain areas in one of the proteins are labelled with magnetic particles, which produce signals when the two proteins are close to each other. Repeating the measurement several times with the magnetic particles in different positions provides information about the overall structure of the complex. Computational modelling can then be used to work out the fine details of the structure, including the shapes of the intermediate structures made by the proteins as they interact.
A computer method called docking can be used to predict the most favourable positions that the proteins can take, relative to one another, in a complex. This involves calculating the energy contained in the system, with the correct structure having the lowest energy. Docking methods also predict protein models with slightly higher energies, but with structures that are radically different. Modellers usually ignore these structures, but comparing the docking results to paramagnetic relaxation enhancement data, Kozakov et al. found that these structures actually represent the intermediate states.
Analysing the structure of the intermediate states revealed that the movement of the two proteins relative to one another is severely restricted as they form the final complex. Kozakov et al. found that proteins associate along preferred pathways, similar to the way a protein slides along DNA in the process of protein-DNA recognition. Knowing that the movement of the proteins is restricted in this way will enable researchers to improve the efficiency of docking calculations.
DOI: http://dx.doi.org/10.7554/eLife.01370.002
doi:10.7554/eLife.01370
PMCID: PMC3978769  PMID: 24714491
encounter landscapes; FFT sampling; protein–protein interactions
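The statement that a two-dimensional subspace captures most of the motion is a statement about principal component analysis. The sketch below shows how such a fraction can be computed for any set of coordinates already projected onto a Euclidean space. It is an illustration only: the function name and the synthetic data in the usage note are assumptions, and the study's projection of the rigid-body manifold is not reproduced here.

```python
import numpy as np

def explained_variance_ratio(samples):
    """Fraction of total variance along each principal component.

    samples: array of shape (n_samples, n_dims), one row per low-energy pose
    in Euclidean coordinates. Components are returned in decreasing order.
    """
    centered = samples - samples.mean(axis=0)
    singular_values = np.linalg.svd(centered, compute_uv=False)
    variances = singular_values ** 2
    return variances / variances.sum()
```

Summing the first two entries of the returned array gives the share of variance within the leading two-dimensional subspace, the quantity the abstract reports as over 75%.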
18.  Rigid Body Energy Minimization on Manifolds for Molecular Docking 
Virtually all docking methods include some local continuous minimization of an energy/scoring function in order to remove steric clashes and obtain more reliable energy values. In this paper, we describe an efficient rigid-body optimization algorithm that, compared to the most widely used algorithms, converges approximately an order of magnitude faster to conformations with equal or slightly lower energy. The space of rigid body transformations is a nonlinear manifold, namely, a space which locally resembles a Euclidean space. We use a canonical parametrization of the manifold, called the exponential parametrization, to map the Euclidean tangent space of the manifold onto the manifold itself. Thus, we locally transform the rigid body optimization to an optimization over a Euclidean space where basic optimization algorithms are applicable. Compared to commonly used methods, this formulation substantially reduces the dimension of the search space. As a result, it requires far fewer costly function and gradient evaluations and leads to a more efficient algorithm. We have selected the LBFGS quasi-Newton method for local optimization since it uses only gradient information to obtain second order information about the energy function and avoids the far more costly direct Hessian evaluations. Two applications, one in protein-protein docking, and the other in protein-small molecule interactions, as part of macromolecular docking protocols are presented. The code is available to the community under an open source license, and with minimal effort can be incorporated into any molecular modeling package.
doi:10.1021/ct300272j
PMCID: PMC3561712  PMID: 23382659
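The exponential parametrization described above maps a vector in the Euclidean tangent space to a rigid-body transformation. The sketch below illustrates the idea with the Rodrigues formula for the rotation part plus a separate translation vector. It is not the authors' released code: the function names, the six-parameter layout (three rotation, three translation), and the choice to rotate about a supplied center are assumptions made for illustration.

```python
import numpy as np

def exp_so3(omega):
    """Map a rotation vector (axis times angle, in radians) to a 3x3 rotation matrix."""
    theta = np.linalg.norm(omega)
    k = np.array([[0.0, -omega[2], omega[1]],
                  [omega[2], 0.0, -omega[0]],
                  [-omega[1], omega[0], 0.0]])
    if theta < 1e-12:
        return np.eye(3) + k  # first-order approximation near the identity
    return (np.eye(3)
            + (np.sin(theta) / theta) * k
            + ((1.0 - np.cos(theta)) / theta ** 2) * (k @ k))

def apply_rigid(params, coords, center):
    """Rotate coords about center, then translate.

    params: six numbers, the rotation vector followed by the translation.
    coords: array of shape (n_atoms, 3).
    """
    params = np.asarray(params, dtype=float)
    rotation = exp_so3(params[:3])
    return (coords - center) @ rotation.T + center + params[3:]
```

Because the six parameters live in an ordinary Euclidean space, an unconstrained optimizer such as a quasi-Newton method can update them directly, and every update still corresponds to a valid rigid-body motion.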
19.  Application of asymmetric statistical potentials to antibody–protein docking 
Bioinformatics  2012;28(20):2608-2614.
Motivation: An effective docking algorithm for antibody–protein antigen complex prediction is an important first step toward design of biologics and vaccines. We have recently developed a new class of knowledge-based interaction potentials called Decoys as the Reference State (DARS) and incorporated DARS into the docking program PIPER based on the fast Fourier transform correlation approach. Although PIPER was the best performer in the latest rounds of the CAPRI protein docking experiment, it is much less accurate for docking antibody–protein antigen pairs than other types of complexes, in spite of incorporating sequence-based information on the location of the paratope. Analysis of antibody–protein antigen complexes has revealed an inherent asymmetry within these interfaces. Specifically, phenylalanine, tryptophan and tyrosine residues highly populate the paratope of the antibody but not the epitope of the antigen.
Results: Since this asymmetry cannot be adequately modeled using a symmetric pairwise potential, we have removed the usual assumption of symmetry. Interaction statistics were extracted from antibody–protein complexes under the assumption that a particular atom on the antibody is different from the same atom on the antigen protein. The use of the new potential significantly improves the performance of docking for antibody–protein antigen complexes, even without any sequence information on the location of the paratope. We note that the asymmetric potential captures the effects of the multi-body interactions inherent to the complex environment in the antibody–protein antigen interface.
Availability: The method is implemented in the ClusPro protein docking server, available at http://cluspro.bu.edu.
Contact: midas@bu.edu or vajda@bu.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
doi:10.1093/bioinformatics/bts493
PMCID: PMC3467743  PMID: 23053206
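To make the idea of dropping the symmetry assumption concrete, the following is a rough sketch rather than the DARS implementation. It assumes contacts are given as ordered pairs (atom type on the antibody, atom type on the antigen) and that the potential takes the usual inverse-Boltzmann log-ratio form against some reference counts. The names `pair_counts` and `log_ratio_potential`, the pseudocount, and the toy atom types in the usage note are all hypothetical.

```python
import math
from collections import Counter

def pair_counts(contacts, symmetric):
    """Count contacts between atom types.

    contacts: iterable of (antibody_type, antigen_type) pairs.
    symmetric=True merges (a, b) with (b, a); symmetric=False keeps the
    antibody side and the antigen side distinct.
    """
    counts = Counter()
    for ab_type, ag_type in contacts:
        if symmetric:
            key = tuple(sorted((ab_type, ag_type)))
        else:
            key = (ab_type, ag_type)
        counts[key] += 1
    return counts

def log_ratio_potential(observed, reference, pseudocount=1.0):
    """Return -ln(p_observed / p_reference) for every pair type (assumed form)."""
    keys = set(observed) | set(reference)
    n_obs = sum(observed.values()) + pseudocount * len(keys)
    n_ref = sum(reference.values()) + pseudocount * len(keys)
    potential = {}
    for key in keys:
        p_obs = (observed[key] + pseudocount) / n_obs
        p_ref = (reference[key] + pseudocount) / n_ref
        potential[key] = -math.log(p_obs / p_ref)
    return potential
```

With toy contacts such as `[("TYR", "ASP"), ("TYR", "ASP"), ("ASP", "TYR")]`, the symmetric count gives a single entry of 3, while the asymmetric count keeps 2 and 1 apart, so a pair type can score differently depending on which partner carries which atom.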
20.  Relationship between Hot Spot Residues and Ligand Binding Hot Spots in Protein-Protein Interfaces 
In the context of protein-protein interactions, the term “hot spot” refers to a residue or cluster of residues that makes a major contribution to the binding free energy, as determined by alanine scanning mutagenesis. In contrast, in pharmaceutical research a hot spot is a site on a target protein that has high propensity for ligand binding and hence is potentially important for drug discovery. Here we examine the relationship between these two hot spot concepts by comparing alanine scanning data for a set of 15 proteins with results from mapping the protein surfaces for sites that can bind fragment-sized small molecules. We find the two types of hot spots are largely complementary; the residues protruding into hot spot regions identified by computational mapping or experimental fragment screening are almost always themselves hot spot residues as defined by alanine scanning experiments. Conversely, a residue that is found by alanine scanning to contribute little to binding rarely interacts with hot spot regions on the partner protein identified by fragment mapping. Despite the strong correlation between the two hot spot concepts, they differ fundamentally. In particular, while identification of a hot spot by alanine scanning establishes the potential to generate substantial interaction energy with a binding partner, there are additional topological requirements to be a hot spot for small molecule binding. Hence, only a minority of hot spots identified by alanine scanning represent sites that are potentially useful for small inhibitor binding, and it is this subset that is identified by experimental or computational fragment screening.
doi:10.1021/ci300175u
PMCID: PMC3623692  PMID: 22770357
21.  Minimal ensembles of side chain conformers for modeling protein-protein interactions 
Proteins  2011;80(2):591-601.
The goal of this paper is to reduce the complexity of the side chain search within docking problems. We apply six methods of generating side chain conformers to unbound protein structures, and determine their ability to recover the bound conformation in small ensembles of conformers. Methods are evaluated in terms of the positions of side chain end groups. Results for 68 protein complexes yield two important observations. First, the end group positions change less than 1 Å upon association for over 60% of interface side chains. Thus, the unbound protein structure carries substantial information about the side chains in the bound state, and the inclusion of the unbound conformation into the ensemble of conformers is very beneficial. Second, considering each surface side chain separately in its protein environment, small ensembles of low energy states include the bound conformation for a large fraction of side chains. In particular, the ensemble consisting of the unbound conformation and the two highest probability predicted conformers includes the bound conformer with an accuracy of 1 Å for 78% of interface side chains. Since more than 60% of the interface side chains have only one conformer and many others only a few, these ensembles of low energy states substantially reduce the complexity of side chain search in docking problems. This approach was already used for finding pockets in protein-protein interfaces that can bind small molecules to potentially disrupt protein-protein interactions. Side chain search with the reduced search space will also be incorporated into protein docking algorithms.
doi:10.1002/prot.23222
PMCID: PMC3297704  PMID: 22105850
rotamer libraries; side chain flexibility; protein binding; structure prediction; preexisting ensemble of conformers
22.  A Message Passing Approach to Side Chain Positioning with Applications in Protein Docking Refinement 
We introduce a message-passing algorithm to solve the Side Chain Positioning (SCP) problem. SCP is a crucial component of protein docking refinement, which is a key step of an important class of problems in computational structural biology called protein docking. We model SCP as a combinatorial optimization problem and formulate it as a Maximum Weighted Independent Set (MWIS) problem. We then employ a modified and convergent belief-propagation algorithm to solve a relaxation of MWIS and develop randomized estimation heuristics that use the relaxed solution to obtain an effective MWIS feasible solution. Using a benchmark set of protein complexes we demonstrate that our approach leads to more accurate docking predictions compared to a baseline algorithm that does not solve the SCP.
PMCID: PMC3600151  PMID: 23515575
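For readers unfamiliar with the Maximum Weighted Independent Set problem named in the abstract above, here is a small sketch of the problem itself. It is not the belief-propagation algorithm of the paper: it solves MWIS exactly by brute-force enumeration, which is only feasible for a handful of nodes. The encoding used in the usage note (one node per residue/rotamer pair, edges between rotamers that cannot be chosen together) and the function name `max_weight_independent_set` are illustrative assumptions.

```python
from itertools import combinations

def max_weight_independent_set(weights, edges):
    """Exact MWIS by enumeration (exponential time; tiny graphs only).

    weights: dict mapping node -> positive weight.
    edges: iterable of (node, node) pairs that may not both be selected.
    Returns (best_set, best_weight).
    """
    nodes = list(weights)
    edge_set = {frozenset(e) for e in edges}
    best, best_weight = frozenset(), 0.0
    for size in range(1, len(nodes) + 1):
        for subset in combinations(nodes, size):
            # Skip subsets containing two nodes joined by an edge.
            if any(frozenset(pair) in edge_set
                   for pair in combinations(subset, 2)):
                continue
            total = sum(weights[n] for n in subset)
            if total > best_weight:
                best, best_weight = frozenset(subset), total
    return best, best_weight
```

As a toy usage, take nodes `("A", 0)`, `("A", 1)`, `("B", 0)`, `("B", 1)` with weights 2.0, 3.0, 2.5 and 1.0, edges between the two rotamers of each residue, and one extra edge between `("A", 1)` and `("B", 0)` standing in for a steric clash. The best independent set is then `{("A", 0), ("B", 0)}` with total weight 4.5, even though `("A", 1)` has the largest individual weight.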
23.  Hot spot analysis for driving the development of hits into leads in fragment based drug discovery 
Fragment based drug design (FBDD) starts with finding fragment-sized compounds that are highly ligand efficient and can serve as a core moiety for developing high affinity leads. Although the core-bound structure of a protein facilitates the construction of leads, effective design is far from straightforward. We show that protein mapping, a computational method developed to find binding hot spots and implemented as the FTMap server, provides information that complements the fragment screening results and can drive the evolution of core fragments into larger leads with a minimal loss or, in some cases, even a gain in ligand efficiency. The method places small molecular probes, the size of organic solvents, on a dense grid around the protein, and identifies the hot spots as consensus clusters formed by clusters of several probes. The hot spots are ranked based on the number of probe clusters, which predicts the binding propensity of the subsites and hence their importance for drug design. Accordingly, with a single exception the main hot spot identified by FTMap binds the core compound found by fragment screening. The most useful information is provided by the neighboring secondary hot spots, indicating the regions where the core can be extended to increase its affinity. To quantify this information, we calculate the density of probes from mapping, which describes the binding propensity at each point, and show that the change in the correlation between a ligand position and the probe density upon extending or repositioning the core moiety predicts the expected change in ligand efficiency.
doi:10.1021/ci200468p
PMCID: PMC3264775  PMID: 22145575
Protein mapping; protein docking; drug design; ligand efficiency; affinity prediction
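The abstract above relies on ligand efficiency without defining it. A minimal sketch, assuming the commonly used definition (binding free energy per non-hydrogen atom, with the free energy estimated from a dissociation constant as ΔG = RT ln Kd), is given below. The function name and the default temperature are assumptions, and the paper may use a different variant of the measure.

```python
import math

GAS_CONSTANT_KCAL = 1.987204e-3  # kcal / (mol K)

def ligand_efficiency(kd_molar, heavy_atoms, temperature_k=298.15):
    """Return ligand efficiency in kcal/mol per heavy atom.

    kd_molar: dissociation constant in mol/L.
    heavy_atoms: number of non-hydrogen atoms in the ligand.
    """
    if kd_molar <= 0 or heavy_atoms <= 0:
        raise ValueError("kd_molar and heavy_atoms must be positive")
    delta_g = GAS_CONSTANT_KCAL * temperature_k * math.log(kd_molar)
    return -delta_g / heavy_atoms
```

Under this definition a fragment and a larger lead can be compared on equal footing: extending a core keeps its efficiency only if the added atoms contribute their share of binding free energy.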
24.  FTSite: high accuracy detection of ligand binding sites on unbound protein structures 
Bioinformatics  2011;28(2):286-287.
Motivation: Binding site identification is a classical problem that is important for a range of applications, including the structure-based prediction of function, the elucidation of functional relationships among proteins, protein engineering and drug design. We describe an accurate method of binding site identification, namely FTSite. This method is based on experimental evidence that ligand binding sites also bind small organic molecules of various shapes and polarity. The FTSite algorithm does not rely on any evolutionary or statistical information, but achieves near experimental accuracy: it is capable of identifying the binding sites in over 94% of apo proteins from established test sets that have been used to evaluate many other binding site prediction methods.
Availability: FTSite is freely available as a web-based server at http://ftsite.bu.edu.
Contact: vajda@bu.edu; midas@bu.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
doi:10.1093/bioinformatics/btr651
PMCID: PMC3259439  PMID: 22113084
25.  Analysis of protein binding sites by computational solvent mapping 
Summary
Computational solvent mapping globally samples the surface of target proteins using molecular probes – small molecules or functional groups – to identify potentially favorable binding positions. The method is based on X-ray and NMR screening studies showing that the binding sites of proteins also bind a large variety of fragment-sized molecules. We have developed the multi-stage mapping algorithm FTMap (available as a server at http://ftmap.bu.edu/) based on the fast Fourier transform (FFT) correlation approach. Identifying regions of low free energy rather than individual low energy conformations, FTMap reproduces the available experimental mapping results. Applications to a variety of proteins show that the probes always cluster in important subsites of the binding site, and the amino acid residues that interact with many probes also bind the specific ligands of the protein. The “consensus” sites at which a number of different probes cluster are likely to be “druggable” sites, capable of binding drug-size ligands with high affinity. Due to its sensitivity to conformational changes the method can also be used for comparing the binding sites in different structures of a protein.
doi:10.1007/978-1-61779-465-0_2
PMCID: PMC3526383  PMID: 22183527
Protein structure; protein-ligand interactions; binding site; binding hot spots; fragment-based ligand design; druggability; binding site comparison; docking
