Human HtrA3 protease, which induces mitochondria-mediated apoptosis, can be a tumor suppressor and a potential therapeutic target in the treatment of cancer. However, there is little information about its structure and biochemical properties. HtrA3 is composed of an N-terminal domain not required for proteolytic activity, a central serine protease domain and a C-terminal PDZ domain. HtrA3S, its short natural isoform, lacks the PDZ domain which is substituted by a stretch of 7 C-terminal amino acid residues, unique for this isoform. This paper presents the crystal structure of the HtrA3 protease domain together with the PDZ domain (ΔN-HtrA3), showing that the protein forms a trimer whose protease domains are similar to those of human HtrA1 and HtrA2. The ΔN-HtrA3 PDZ domains are placed in a position intermediate between that in the flat saucer-like HtrA1 SAXS structure and the compact pyramidal HtrA2 X-ray structure. The PDZ domain interacts closely with the LB loop of the protease domain in a way not found in other human HtrAs. ΔN-HtrA3 with the PDZ removed (ΔN-HtrA3-ΔPDZ) and an N-terminally truncated HtrA3S (ΔN-HtrA3S) were fully active at a wide range of temperatures and their substrate affinity was not impaired. This indicates that the PDZ domain is dispensable for HtrA3 activity. As determined by size exclusion chromatography, ΔN-HtrA3 formed stable trimers while both ΔN-HtrA3-ΔPDZ and ΔN-HtrA3S were monomeric. This suggests that the presence of the PDZ domain, unlike in HtrA1 and HtrA2, influences HtrA3 trimer formation. The unique C-terminal sequence of ΔN-HtrA3S appeared to have little effect on activity and oligomerization. Additionally, we examined the cleavage specificity of ΔN-HtrA3. Results reported in this paper provide new insights into the structure and function of ΔN-HtrA3, which seems to have a unique combination of features among human HtrA proteases.
Compound P131 displays antiparasitic activity in a mouse model of Cryptosporidium infection, validating IMP dehydrogenase (IMPDH) as a drug target. Here, the structure of the enzyme–substrate–P131 complex is reported at 2.05 Å resolution. The structure is an important step to further refine the design of IMPDH inhibitors.
Inosine 5′-monophosphate dehydrogenase (IMPDH) is a promising target for the treatment of Cryptosporidium infections. Here, the structure of C. parvum IMPDH (CpIMPDH) in complex with inosine 5′-monophosphate (IMP) and P131, an inhibitor with in vivo anticryptosporidial activity, is reported. P131 contains two aromatic groups, one of which interacts with the hypoxanthine ring of IMP, while the second interacts with the aromatic ring of a tyrosine in the adjacent subunit. In addition, the amine and NO2 moieties bind in hydrated cavities, forming water-mediated hydrogen bonds to the protein. The design of compounds to replace these water molecules is a new strategy for the further optimization of C. parvum inhibitors for both antiparasitic and antibacterial applications.
Cryptosporidium; inosine 5′-monophosphate dehydrogenase; P131
The structure of the I domain of integrin αLβ2 bound to the Ig superfamily ligand ICAM-1 reveals the open ligand binding conformation and the first example of an integrin-IgSF interface. The I domain Mg2+ directly coordinates Glu-34 of ICAM-1, and a dramatic swing of I domain residue Glu-241 enables a critical salt bridge. Liganded and unliganded structures for both high- and intermediate-affinity mutant I domains reveal that ligand binding can induce conformational change in the αL I domain and that allosteric signals can convert the closed conformation to intermediate or open conformations without ligand binding. Pulling down on the C-terminal α7 helix with introduced disulfide bonds ratchets the β6-α7 loop into three different positions in the closed, intermediate, and open conformations, with a progressive increase in affinity.
Bacteria utilize multiple sigma factors that associate with core RNA polymerase (RNAP) to control transcription in response to changes in environmental conditions. In Escherichia coli and Salmonella enterica, Crl positively regulates the σS regulon by binding to σS to promote its association with core RNAP. We recently characterized the determinants in σS responsible for specific binding to Crl. However, little is known about the determinants in Crl required for this interaction. Here, we present the X-ray crystal structure of a Crl homolog from Proteus mirabilis in conjunction with in vivo and in vitro approaches that probe the Crl-σS interaction in E. coli. We show that the P. mirabilis, Vibrio harveyi, and E. coli Crl homologs function similarly in E. coli, indicating that Crl structure and function are likely conserved throughout gammaproteobacteria. We utilize phylogenetic conservation and bacterial two-hybrid analyses to predict residues in Crl important for the interaction with σS. The results of p-benzoylphenylalanine (BPA)-mediated UV cross-linking studies further support the model in which an evolutionarily conserved central cleft is the surface on Crl that binds to σS. Within this conserved binding surface, we identify a key residue in Crl that is critical for activation of EσS-dependent transcription in vivo and in vitro. Our study provides a physical basis for understanding the σS-Crl interaction.
Seven crystal structures of alanyl aminopeptidase from Neisseria meningitidis (the etiological agent of meningitis, NmAPN) complexed with organophosphorus compounds were resolved to determine the optimal inhibitor–enzyme interactions. The enantiomeric phosphonic acid analogs of Leu and hPhe, which correspond to the P1 amino acid residues of well-processed substrates, were used to assess the impact of the absolute configuration and the stereospecific hydrogen bond network formed between the aminophosphonate polar head and the active site residues on the binding affinity. For the hPhe analog, an imperfect stereochemical complementarity could be overcome by incorporating an appropriate P1 side chain. The constitution of P1′-extended structures was rationally designed and the lead, phosphinic dipeptide hPhePψ[CH2]Phe, was modified in a single position. Introducing a heteroatom/heteroatom-based fragment to either the P1 or P1′ residue required new synthetic pathways. The compounds in the refined structure were low nanomolar and subnanomolar inhibitors of N. meningitidis, porcine and human APNs, and the reference leucine aminopeptidase (LAP). The unnatural phosphinic dipeptide analogs exhibited a high affinity for monozinc APNs associated with a reasonable selectivity versus dizinc LAP. Another set of crystal structures containing the NmAPN dipeptide ligand was used to verify and to confirm the predicted binding modes; furthermore, novel contacts, which were promising for inhibitor development, were identified, including a π–π stacking interaction between a pyridine ring and Tyr372.
Background: IMP dehydrogenase (IMPDH) is an important drug target because of its role in de novo purine nucleotide biosynthesis.
Results: First substrate/cofactor- and substrate/inhibitor-bound complexes of bacterial IMPDHs are determined.
Conclusion: A new distinct binding mode of the cofactor adenosine moiety is revealed.
Significance: This work offers new insights for the design of more potent and selective inhibitors and the evolution of the active site.
The steadily rising frequency of emerging diseases and antibiotic resistance creates an urgent need for new drugs and targets. Inosine 5′-monophosphate dehydrogenase (IMP dehydrogenase or IMPDH) is a promising target for the development of new antimicrobial agents. IMPDH catalyzes the oxidation of IMP to XMP with the concomitant reduction of NAD+, which is the pivotal step in the biosynthesis of guanine nucleotides. Potent inhibitors of bacterial IMPDHs have been identified that bind in a structurally distinct pocket that is absent in eukaryotic IMPDHs. The physiological role of this pocket was not understood. Here, we report the structures of complexes with different classes of inhibitors of Bacillus anthracis, Campylobacter jejuni, and Clostridium perfringens IMPDHs. These structures in combination with inhibition studies provide important insights into the interactions that modulate selectivity and potency. We also present two structures of the Vibrio cholerae IMPDH in complex with IMP/NAD+ and XMP/NAD+. In both structures, the cofactor assumes a dramatically different conformation than reported previously for eukaryotic IMPDHs and other dehydrogenases, with the major change observed for the position of the NAD+ adenosine moiety. More importantly, this new NAD+-binding site involves the same pocket that is utilized by the inhibitors. Thus, the bacterial IMPDH-specific NAD+-binding mode helps to rationalize the conformation adopted by several classes of prokaryotic IMPDH inhibitors. These findings offer a potential strategy for further ligand optimization.
Antibiotic Resistance; Enzyme Inhibitor; Ligand-binding Protein; Microbial Pathogenesis; Nicotinamide Adenine Dinucleotide (NAD); Cryptosporidium parvum-selective Inhibitors; Antibacterial; Cofactor-binding Site; Inosine 5′-Monophosphate Dehydrogenase
In the effort to produce proteins coded by diverse genomes, structural genomics projects often must express genes containing codons that are rare in the production strain. To address this problem, genes expressing tRNAs corresponding to those codons are typically coexpressed from a second plasmid in the host strain, or from genes incorporated into production plasmids. Here we describe the modification of a series of LIC pMCSG vectors currently used in the high-throughput production of proteins to include crucial tRNA genes covering rare codons for Arg (AGG/AGA) and Ile (AUA). We also present variants of these new vectors that allow analysis of ligand binding or co-expression of multiple proteins introduced through two independent LIC steps. Additionally, to accommodate the cloning of multiple large proteins, the size of the plasmids was reduced by approximately one kilobase through the removal of non-essential DNA from the base vector. Production of proteins from core vectors of this series validated the desired enhanced capabilities: higher yields of proteins expressed from genes with rare codons occurred in most cases, biotinylated derivatives enabled detailed automated ligand binding analysis, and multiple proteins introduced by dual LIC cloning were expressed successfully and in near balanced stoichiometry, allowing tandem purification of interacting proteins.
LIC; rare codons; tRNA genes; His-tag; co-expression; biotinylation; ligand binding; high-throughput; structural genomics
Cas4 proteins, a core protein family associated with the microbial system of adaptive immunity CRISPR, are predicted to function in the adaptation step of the CRISPR mechanism. Here we show that the Cas4 protein SSO0001 from the archaeon Sulfolobus solfataricus has metal-dependent endonuclease and 5' to 3' exonuclease activities against single-stranded DNA, as well as ATP-independent DNA unwinding activity toward double-stranded DNA. The crystal structure of SSO0001 revealed a decameric toroid formed by five dimers with each protomer containing one [4Fe-4S] cluster and one Mn2+ ion bound in the active site located inside the internal tunnel. The conserved RecB motif and four Cys residues are important for DNA binding and cleavage activities, whereas DNA unwinding depends on several residues located near the [4Fe-4S]-cluster. Our results suggest that Cas4 proteins might contribute to the addition of novel CRISPR spacers through the formation of 3'-DNA overhangs and to the degradation of foreign DNA.
CRISPR interference; Cas4; exonuclease; RecB motif; [4Fe-4S] cluster
Rumex hastatulus is a North American endemic dioecious plant with heteromorphic sex chromosomes. It is differentiated into two chromosomal races: the Texas (T) race, characterised by a simple XX/XY sex chromosome system, and the North Carolina (NC) race, with a polymorphic XX/XY1Y2 sex chromosome system. The gross karyotype morphology in the NC race resembles the derived type, but the chromosomal changes that occurred during its evolution are poorly understood. Our C-banding/DAPI and fluorescence in situ hybridization (FISH) experiments demonstrated that the Y chromosomes of both races are enriched in DAPI-positive sequences and that the emergence of the polymorphic sex chromosome system was accompanied by the break of the ancestral Y chromosome and a switch in the localization of 5S rDNA, from autosomes to sex chromosomes (X and Y2). Two contrasting domains were detected within the North Carolina Y chromosomes: the older, highly heterochromatinised domain, inherited from the original Y chromosome, and the younger, euchromatic domain, representing translocated autosomal material. The flow-cytometric DNA estimation showed ∼3.5% genome downsizing in the North Carolina race. Our results contradict earlier reports on the lack of heterochromatin within the Y chromosomes of this species and enable unambiguous identification of the autosomes involved in the autosome-heterosome translocation, providing useful chromosome landmarks for further studies on the karyotype and sex chromosome differentiation in this species.
Electronic supplementary material
The online version of this article (doi:10.1007/s10577-014-9446-4) contains supplementary material, which is available to authorized users.
Rumex hastatulus; Sex chromosomes; Karyotype; FISH; rDNA; C-banding/DAPI
Chitin is a fungal microbe-associated molecular pattern recognized in Arabidopsis by a lysin motif receptor kinase (LYK), AtCERK1. Previous research suggested that AtCERK1 is the major chitin receptor and mediates chitin-induced signaling through homodimerization and phosphorylation. However, the reported chitin binding affinity of AtCERK1 is quite low, suggesting another receptor with high chitin binding affinity might be present. Here, we propose that AtLYK5 is the primary chitin receptor in Arabidopsis. Mutations in AtLYK5 resulted in a significant reduction in chitin response. However, AtLYK5 shares overlapping function with AtLYK4 and, therefore, Atlyk4/Atlyk5-2 double mutants show a complete loss of chitin response. AtLYK5 interacts with AtCERK1 in a chitin-dependent manner. Chitin binding to AtLYK5 is indispensable for chitin-induced AtCERK1 phosphorylation. AtLYK5 binds chitin at a much higher affinity than AtCERK1. The data suggest that AtLYK5 is the primary receptor for chitin, forming a chitin inducible complex with AtCERK1 to induce plant immunity.
Invading fungi are responsible for many of the plant diseases that affect global crop production. Plants have to be able to identify these fungi, and activate the right defense strategies if they are to protect themselves. Chitin is a polymer that is found in the cell walls of all fungi, but not in plants, so if the plant detects chitin, it knows that a potentially harmful fungus may be nearby.
The detection of chitin, and the resulting activation of a plant's defenses, requires a receptor protein called CERK1. In rice, CERK1 needs to interact with another receptor protein called CEBiP, which binds to chitin. However, in Arabidopsis thaliana—which is widely studied in plant research—CERK1 can bind to chitin on its own, although this interaction is very weak, so it has been suggested that a second protein may be involved.
Cao et al. have now found that a receptor protein called LYK5, which is very similar to CERK1, is much better at attaching to chitin in A. thaliana. It can also bind to CERK1, but only when chitin is present, and is required for activation of basic plant defenses. The experiments suggest that LYK5 detects chitin on behalf of CERK1, in a similar way to how CEBiP works in rice.
The next step in this research is to work out how CERK1 and LYK5 are able to activate plant defenses.
plant innate immunity; chitin receptor; CERK1; LYK5; Arabidopsis
The current and the attainable coverage by X-ray structures of proteins and their functions on the scale of the ‘protein universe’ are estimated. A detailed analysis of the coverage across nearly 2000 proteomes from all superkingdoms of life and functional annotations is performed, with particular focus on the human proteome and the family of GPCR proteins.
Structural genomics programs have developed and applied structure-determination pipelines to a wide range of protein targets, facilitating the visualization of macromolecular interactions and the understanding of their molecular and biochemical functions. The fundamental question of whether three-dimensional structures of all proteins and all functional annotations can be determined using X-ray crystallography is investigated. A first-of-its-kind large-scale analysis of crystallization propensity for all proteins encoded in 1953 fully sequenced genomes was performed. It is shown that current X-ray crystallographic knowhow combined with homology modeling can provide structures for 25% of modeling families (protein clusters for which structural models can be obtained through homology modeling), with at least one structural model produced for each Gene Ontology functional annotation. The coverage varies between superkingdoms, with 19% for eukaryotes, 35% for bacteria and 49% for archaea, and with those of viruses following the coverage values of their hosts. It is shown that the crystallization propensities of proteomes from the taxonomic superkingdoms are distinct. The use of knowledge-based target selection is shown to substantially increase the ability to produce X-ray structures. It is demonstrated that the human proteome has one of the highest attainable coverage values among eukaryotes, and GPCR membrane proteins suitable for X-ray structure determination were determined.
crystallization propensity; proteome coverage; fDETECT
Lignin comprises 15–25% of plant biomass and represents a major environmental carbon source for utilization by soil microorganisms. Access to this energy resource requires the action of fungal and bacterial enzymes to break down the lignin polymer into a complex assortment of aromatic compounds that can be transported into the cells. To improve our understanding of the utilization of lignin by microorganisms, we characterized the molecular properties of solute binding proteins of ATP-binding cassette transporter proteins that interact with these compounds. A combination of functional screens and structural studies characterized the binding specificity of the solute binding proteins for aromatic compounds derived from lignin such as p-coumarate, 3-phenylpropionic acid and compounds with more complex ring substitutions. A ligand screen based on thermal stabilization identified several binding protein clusters that exhibit preferences based on the size or number of aromatic ring substituents. Multiple X-ray crystal structures of protein-ligand complexes for these clusters identified the molecular basis of the binding specificity for the lignin-derived aromatic compounds. The screens and structural data provide new functional assignments for these solute-binding proteins which can be used to infer their transport specificity. This knowledge of the functional roles and molecular binding specificity of these proteins will support the identification of the specific enzymes and regulatory proteins of peripheral pathways that funnel these compounds to central metabolic pathways and will improve the predictive power of sequence-based functional annotation methods for this family of proteins.
ABC transporter; functional annotation; Rhodopseudomonas palustris; solute-binding protein; p-coumaric acid
In the F family of conjugative plasmids, TraJ is an essential transcriptional activator of the tra operon that encodes most of the proteins required for conjugation. Here we report for the first time the X-ray crystal structures of the TraJ N-terminal domains from the prototypic F plasmid (TraJF11−130) and from the Salmonella virulence plasmid pSLT (TraJpSLT1−128). Both structures contain similar Per-ARNT-Sim (PAS) folds, which further homodimerize through the N-terminal helix and the structurally conserved β-sheet of the PAS fold from each protomer. Mutational analysis reveals that the observed dimeric interface is critical for TraJF transcriptional activation, indicating that dimerization of TraJ is required for its in vivo function. TraJ is specific in activating its cognate tra operon promoter; however, heterologous PAS domains from pSLT and R100 TraJ can functionally replace the TraJF PAS domain, suggesting that the allelic specificity of TraJ is solely mediated by the region C-terminal to the PAS domain.
Cas4 nucleases constitute a core family of CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) associated proteins, but little is known about their structure and activity. Here we report the crystal structure of the Cas4 protein Pcal_0546 from Pyrobaculum calidifontis, which revealed a monomeric protein with a RecB-like fold and one [2Fe-2S] cluster coordinated by four conserved Cys residues. Pcal_0546 exhibits metal-dependent 5′ to 3′ exonuclease activity against ssDNA substrates, whereas the Cas4 protein SSO1391 from Sulfolobus solfataricus can cleave ssDNA in both the 5′ to 3′ and 3′ to 5′ directions. The active site of Pcal_0546 contains a bound metal ion coordinated by the side chains of Asp123, Glu136, His146, and the main chain carbonyl of Ile137. Site-directed mutagenesis of Pcal_0546 and SSO1391 revealed that the residues of RecB motifs II, III and QhXXY are critical for nuclease activity, whereas mutations of the conserved Cys residues resulted in a loss of the iron-sulfur cluster, but had no effect on DNA cleavage. Our results revealed the biochemical diversity of Cas4 nucleases, which can have different oligomeric states, contain [4Fe-4S] or [2Fe-2S] clusters, and cleave single stranded DNA in different directions producing single-stranded DNA overhangs, which are potential intermediates for the synthesis of new CRISPR spacers.
Attempts to express a truncated form of murine Bax in the periplasm by using an expression vector that attached the OmpA signal sequence to the protein failed to alleviate this toxicity. In contrast, attachment of a peptide based on a portion of the E. coli cochaperone GroES reduced Bax’s toxicity significantly and allowed good expression. The peptide, which was attached to the N-terminus, included the amino acid sequence of the mobile loop of GroES that has been demonstrated to interact with the chaperonin, GroEL. Under normal growth conditions, expression of this construct was still toxic, but generated a small amount of detectable recombinant Bax. However, when cells were grown in the presence of 2% ethanol, which stimulated overproduction of the molecular chaperones GroEL and DnaK, toxicity was reduced and good overexpression occurred. Two-dimensional gel electrophoresis analysis showed that approximately 15-fold more GroES-loop-Bax was produced under these conditions than under standard conditions and that GroEL and DnaK were elevated approximately 3-fold.
Escherichia coli; chaperonin; protein expression; expression vector; GroES; GroEL; Bax; Bcl-2
The goal of structural biology is to reveal details of the molecular structure of proteins in order to understand their function and mechanism. X-ray crystallography and NMR are the two best methods for atomic level structure determination. However, these methods require milligram quantities of proteins. In this chapter a reproducible methodology for large-scale protein production applicable to a diverse set of proteins is described. The approach is based on protein expression in E. coli as a fusion with a cleavable affinity tag that was tested on over 20,000 proteins. Specifically, a protocol for fermentation of large quantities of native proteins in disposable culture vessels is presented. A modified protocol that allows for the production of selenium-labeled proteins in defined media is also offered. Finally, a method for the purification of His6-tagged proteins on immobilized metal affinity chromatography columns that generates high-purity material is described in detail.
Protein expression; Protein purification; Disposable vessel fermentation; Selenomethionine-labeling; IMAC; His-tag; High-throughput
Carrier proteins (CPs) play a critical role in the biosynthesis of various natural products, especially in nonribosomal peptide synthetase (NRPS) and polyketide synthase (PKS) enzymology, where the CPs are referred to as peptidyl-carrier proteins (PCPs) or acyl-carrier proteins (ACPs), respectively. CPs can either be a domain in large multifunctional polypeptides or standalone proteins, termed Type I and Type II, respectively. There have been many biochemical studies of the Type I PKS and NRPS CPs, and of Type II ACPs. However, recently a number of Type II PCPs have been found and biochemically characterized. In order to understand the possible interaction surfaces for combinatorial biosynthetic efforts we crystallized the first characterized and representative Type II PCP member, BlmI, from the bleomycin biosynthetic pathway from Streptomyces verticillus ATCC 15003. The structure is similar to CPs in general but most closely resembles PCPs. Comparisons with previously determined PCP structures in complex with catalytic domains reveal a common interaction surface. This surface is highly variable in charge and shape, which likely confers specificity for interactions. Previous nuclear magnetic resonance (NMR) analysis of a prototypical Type I PCP excised from the multimodular context revealed three conformational states. Comparison of the states with the structure of BlmI and other PCPs reveals that only one of the NMR states is found in other studies, suggesting the other two states may not be relevant. The state represented by the BlmI crystal structure can therefore serve as a model for both Type I and Type II PCPs.
protein–protein interaction; natural product; biosynthesis; phylogenetics; structural genomics; reductive methylation
Phleum echinatum Host (2n = 2x = 10) is an annual Mediterranean species which differs from other representatives of the genus Phleum by its reduced chromosome number, asymmetric karyotype and unusually high amount of DNA in the genome. Chromosomes of this plant were studied using conventional acetic-orcein staining and fluorescence in situ hybridization (FISH). FISH showed the major 35S ribosomal DNA (rDNA) site at the secondary constriction of satellite chromosome (3) and the minor 35S rDNA site near the 5S rDNA cluster in the monobrachial chromosome 5. Telomeric repeats were detected at all chromosome ends, within the secondary constriction in satellited chromosome 3 and at the centromeric regions of chromosomes 1 and 2. Intrachromosomally located telomeric repeats are probably traces of chromosomal rearrangements that have shaped the P. echinatum genome; they were prone to breakage, which was manifested in chromosome fragmentation. The most distinct telomeric signals, suggesting massive amplification of interstitial telomeric sequences (ITRs), were observed at the nucleolar organizer region (NOR) of the third chromosome pair. Double FISH confirmed co-localization of telomeric and 35S rDNA repeats in this locus, which is characterized by the greatest fragility in the karyotype. Fragile sites of P. echinatum, composed of amplified telomeric repeats, may bear a resemblance to metazoan rare fragile sites enriched in microsatellite repeats.
Electronic supplementary material
The online version of this article (doi:10.1007/s00709-014-0681-5) contains supplementary material, which is available to authorized users.
Phleum echinatum; Fragile sites; FISH; rDNA; Interstitial telomeric sequences; Chromosome fusions
Bacterial species in the Enterobacteriaceae typically contain multiple paralogues of a small domain of unknown function (DUF1471) from a family of conserved proteins also known as YhcN or BhsA/McbA. Proteins containing DUF1471 may have a single or three copies of this domain. Representatives of this family have been demonstrated to play roles in several cellular processes including stress response, biofilm formation, and pathogenesis. We have conducted NMR and X-ray crystallographic studies of four DUF1471 domains from Salmonella representing three different paralogous DUF1471 subfamilies: SrfN, YahO, and SssB/YdgH (two of its three DUF1471 domains: the N-terminal domain I (residues 21–91), and the C-terminal domain III (residues 244–314)). Notably, SrfN has been shown to have a role in intracellular infection by Salmonella Typhimurium. These domains share less than 35% pairwise sequence identity. Structures of all four domains show a mixed α+β fold that is most similar to that of bacterial lipoprotein RcsF. However, all four DUF1471 sequences lack the redox sensitive cysteine residues essential for RcsF activity in a phospho-relay pathway, suggesting that DUF1471 domains perform a different function(s). SrfN forms a dimer in contrast to YahO and SssB domains I and III, which are monomers in solution. A putative binding site for oxyanions such as phosphate and sulfate was identified in SrfN, and an interaction between the SrfN dimer and sulfated polysaccharides was demonstrated, suggesting a direct role for this DUF1471 domain at the host-pathogen interface.
The growth of diffraction-quality single crystals is of primary importance in protein X-ray crystallography. Chemical modification of proteins can alter their surface properties and crystallization behavior. The Midwest Center for Structural Genomics (MCSG) has previously reported how reductive methylation of lysine residues in proteins can improve crystallization of unique proteins that initially failed to produce diffraction-quality crystals. Recently, this approach has been expanded to include ethylation and isopropylation in the MCSG protein crystallization pipeline. Applying standard methods, 180 unique proteins were alkylated and screened using standard crystallization procedures. Crystal structures of 12 new proteins were determined, including the first ethylated and the first isopropylated protein structures. In a few cases, the structures of native and methylated or ethylated states were obtained and the impact of reductive alkylation of lysine residues was assessed. Reductive methylation tends to be more efficient and produces the most alkylated protein structures. Structures of methylated proteins typically have higher resolution limits. A number of well-ordered alkylated lysine residues have been identified, which make both intermolecular and intramolecular contacts. The previous report is updated and complemented with the following new data: a description of a detailed alkylation protocol with results, structural features, and roles of alkylated lysine residues in protein crystals. These contribute to improved crystallization properties of some proteins.
Chemical modification; Lysine reductive alkylation; Methylation; Ethylation; Isopropylation; Protein crystallization
In Structural Genomics projects, virtual high-throughput ligand screening can be utilized to provide important functional details for newly determined protein structures. Using a variety of publicly available software tools, it is possible to computationally model, predict, and evaluate how different ligands interact with a given protein. At the Center for Structural Genomics of Infectious Diseases (CSGID) a series of protein analysis, docking and molecular dynamics software is scripted into a single hierarchical pipeline allowing for an exhaustive investigation of protein-ligand interactions. The ability to conduct accurate computational predictions of protein-ligand binding is a vital component in improving both the efficiency and economics of drug discovery. Computational simulations can minimize experimental efforts, the slowest and most cost prohibitive aspect of identifying new therapeutics.
Protein; Ligand; High-throughput screening; Docking; Molecular modeling
Although callose occurs during megasporogenesis in most flowering plants, knowledge of its general function and of the mechanisms by which the callose layer is formed in particular places remains insufficient. The results of previous studies suggest a total lack of callose in the ovules of diplosporous plants in which meiosis is omitted or disturbed. This report is the first documentation of callose events in dandelion ovules. We demonstrated the pattern of callose deposition during the formation of megaspores through diplospory of the Taraxacum type and during normal meiotic megasporogenesis in apomictic triploid Taraxacum atricapillum and amphimictic diploid Taraxacum linearisquameum. We found callose in the megasporocyte wall of both diplosporous and sexual dandelions. However, in a diplosporous dandelion, callose predominated at the micropylar pole of the megaspore mother cell (MMC), which may be correlated with abnormal asynaptic meiosis and may indicate diplospory of the Taraxacum type. After meiotic division, callose is mainly deposited in the walls between megaspores in tetrads and in diplodyads. In subsequent stages, callose gradually disappears around the chalazal functional megaspore. However, some variations in the pattern of callose deposition within the tetrad may reflect variable positioning of the functional megaspore (FM) observed in the ovules of T. linearisquameum.
Apomixis; Callose; Chromosome number; Diplospory megasporogenesis; Taraxacum
Cryptosporidium parvum is an enteric protozoan parasite that has emerged as a major cause of diarrhea, malnutrition and gastroenteritis as well as posing a potential bioterrorism threat. C. parvum synthesizes guanine nucleotides from host adenosine in a streamlined pathway that relies on inosine 5′-monophosphate dehydrogenase (IMPDH). We have previously identified several parasite-selective C. parvum IMPDH (CpIMPDH) inhibitors by high-throughput screening. In this paper, we report the structure-activity relationship (SAR) for a series of benzoxazole derivatives, with many compounds demonstrating CpIMPDH IC50 values in the nanomolar range and >500-fold selectivity over human IMPDH (hIMPDH). Unlike previously reported CpIMPDH inhibitors, these compounds are competitive inhibitors versus NAD+. The SAR study reveals that pyridine and other small heteroaromatic substituents are required at the 2-position of the benzoxazole for potent inhibitory activity. In addition, several other SAR conclusions are highlighted with regard to the benzoxazole and the amide portion of the inhibitor, including preferred stereochemistry. An X-ray crystal structure of a representative E•IMP•inhibitor complex is also presented. Overall, the secondary amine derivative 15a (Q67) demonstrated excellent CpIMPDH inhibitory activity (IC50 = 0.5 ± 0.1 nM) and moderate stability (t1/2 = 44 min) in mouse liver microsomes. Compound 73, the racemic version of 15a, also displayed superb antiparasitic activity in a Toxoplasma gondii strain that relies on CpIMPDH (EC50 = 20 ± 20 nM), and selectivity versus a wild-type T. gondii strain (200-fold). No toxicity was observed (LD50 > 50 μM) against a panel of four mammalian cell lines.
The emergence of antibiotic-resistant bacterial strains underscores the importance of identifying new drug targets and developing new antimicrobial compounds. Lysine and meso-diaminopimelic acid are essential for protein production and bacterial peptidoglycan cell wall remodeling and are synthesized in bacteria by enzymes encoded within the dap operon. Therefore, dap enzymes may serve as excellent targets for developing a new class of antimicrobial agents. The dapE-encoded N-succinyl-L,L-diaminopimelic acid desuccinylase (DapE) converts N-succinyl-L,L-diaminopimelic acid to L,L-diaminopimelic acid and succinate. The enzyme is composed of catalytic and dimerization domains, and belongs to the M20 peptidase family. To understand the specific role of each domain of the enzyme, we engineered dimerization domain deletion mutants of DapEs from Haemophilus influenzae and Vibrio cholerae, and characterized these proteins structurally and biochemically. No activity was observed for any of the deletion mutants. Structural comparisons of wild-type and inactive monomeric DapE enzymes with other M20 peptidases suggest that the dimerization domain is essential for DapE enzymatic activity. Structural analysis and molecular dynamics simulations indicate that removal of the dimerization domain increased the flexibility of a conserved active site loop that may provide critical interactions with the substrate.
Phages, viruses that infect prokaryotes, integrate their genomes into the host chromosome; thus, microbial genomes typically contain genetic remnants of both recent and ancient phage infections. Often, phage genes occur in clusters of atypical G+C content that reflect integration of the foreign DNA. However, some phage genes occur in isolation without other phage gene neighbors, probably resulting from horizontal gene transfer. In these cases, the phage gene product is unlikely to function as a component of a mature phage particle, and instead may have been co-opted by the host for its own benefit. One such gene from Salmonella enterica serovar Typhimurium, STM3605, encodes a protein with modest sequence similarity to phage-like lysozyme (N-acetylmuramidase) but appears to lack essential catalytic residues that are strictly conserved in all lysozymes. Close homologs in other bacteria share this characteristic. The structure of the STM3605 protein was characterized by X-ray crystallography, and functional assays showed that it is a stable, folded protein whose structure closely resembles lysozyme. However, this protein is unlikely to hydrolyze peptidoglycan. Instead, STM3605 is presumed to have evolved an alternative function because it shows some lytic activity and partitions to micelles.
Crystal structure; Mutagenesis; Oligomeric state; Phage-like lysozyme; Salmonella