The Store.Synchrotron service, a fully functional, cloud computing-based solution to raw X-ray data archiving and dissemination at the Australian Synchrotron, is described.
The Store.Synchrotron service, a fully functional, cloud computing-based solution to raw X-ray data archiving and dissemination at the Australian Synchrotron, is described. The service automatically receives and archives raw diffraction data, related metadata and preliminary results of automated data-processing workflows. Data are able to be shared with collaborators and opened to the public. In the nine months since its deployment in August 2013, the service has handled over 22.4 TB of raw data (∼1.7 million diffraction images). Several real examples from the Australian crystallographic community are described that illustrate the advantages of the approach, which include real-time online data access and fully redundant, secure storage. Discoveries in biological sciences increasingly require multidisciplinary approaches. With this in mind, Store.Synchrotron has been developed as a component within a greater service that can combine data from other instruments at the Australian Synchrotron, as well as instruments at the Australian neutron source ANSTO. It is therefore envisaged that this will serve as a model implementation of raw data archiving and dissemination within the structural biology research community.
Australian Store.Synchrotron; data archiving
A truncated heterodimer of human PSPC1–NONO protein, a paraspeckle-specific complex, has been crystallized and the diffraction data collected to a resolution of 1.9 Å.
The paraspeckle component 1 (PSPC1) and non-POU-domain-containing octamer-binding protein (NONO) heterodimer is an essential structural component of paraspeckles, ribonucleoprotein bodies found in the interchromatin space of mammalian cell nuclei. PSPC1 and NONO both belong to the Drosophila behaviour and human splicing (DBHS) protein family, which has been implicated in many aspects of RNA processing. A heterodimer of the core DBHS conserved region of PSPC1 and NONO comprising two tandemly arranged RNA-recognition motifs (RRMs), a NONA/paraspeckle (NOPS) domain and part of a predicted coiled-coil domain has been crystallized in space group C2, with unit-cell parameters a = 90.90, b = 67.18, c = 94.08 Å, β = 99.96°. The crystal contained one heterodimer in the asymmetric unit and diffracted to 1.9 Å resolution using synchrotron radiation.
paraspeckles; PSPC1–NONO heterodimer; RNA-recognition motifs; DBHS-family proteins
KARRIKIN INSENSITIVE 2 (KAI2) is an α/β hydrolase involved in seed germination and seedling development. It is essential for plant responses to karrikins, a class of butenolide compounds derived from burnt plant material that are structurally similar to strigolactone plant hormones. The mechanistic basis for the function of KAI2 in plant development remains unclear. We have determined the crystal structure of Arabidopsis thaliana KAI2 in space groups P21 21 21 (a = 63.57 Å, b = 66.26 Å, c = 78.25 Å) and P21 (a = 50.20 Å, b = 56.04 Å, c = 52.43 Å, β = 116.12°) to 1.55 and 2.11 Å respectively. The catalytic residues are positioned within a large hydrophobic pocket similar to that of DAD2, a protein required for strigolactone response in Petunia hybrida. KAI2 possesses a second solvent-accessible pocket, adjacent to the active site cavity, which offers the possibility of allosteric regulation. The structure of KAI2 is consistent with its designation as a serine hydrolase, as well as previous data implicating the protein in karrikin and strigolactone signalling.
Trypanothione reductase is a key enzyme in the trypanothione-based redox metabolism of pathogenic trypanosomes. Since this system is absent in humans, being replaced with glutathione and glutathione reductase, it offers a target for selective inhibition. The rational design of potent inhibitors requires accurate structures of enzyme-inhibitor complexes, but this is lacking for trypanothione reductase. We therefore used quinacrine mustard, an alkylating derivative of the competitive inhibitor quinacrine, to probe the active site of this dimeric flavoprotein. Quinacrine mustard irreversibly inactivates Trypanosoma cruzi trypanothione reductase, but not human glutathione reductase, in a time-dependent manner with a stoichiometry of two inhibitors bound per monomer. The rate of inactivation is dependent upon the oxidation state of trypanothione reductase, with the NADPH-reduced form being inactivated significantly faster than the oxidised form. Inactivation is slowed by clomipramine and a melarsen oxide-trypanothione adduct (both are competitive inhibitors) but accelerated by quinacrine. The structure of the trypanothione reductase-quinacrine mustard adduct was determined to 2.7 Å, revealing two molecules of inhibitor bound in the trypanothione-binding site. The acridine moieties interact with each other through π-stacking effects, and one acridine interacts in a similar fashion with a tryptophan residue. These interactions provide a molecular explanation for the differing effects of clomipramine and quinacrine on inactivation by quinacrine mustard. Synergism with quinacrine occurs as a result of these planar acridines being able to stack together in the active site cleft, thereby gaining an increased number of binding interactions, whereas antagonism occurs with non-planar molecules, such as clomipramine, where stacking is not possible.
enzyme-inhibitor complex; Trypanosoma cruzi; trypanothione reductase; quinacrine mustard; X-ray diffraction
The pentatricopeptide repeat (PPR) is a helical repeat motif found in an exceptionally large family of RNA–binding proteins that functions in mitochondrial and chloroplast gene expression. PPR proteins harbor between 2 and 30 repeats and typically bind single-stranded RNA in a sequence-specific fashion. However, the basis for sequence-specific RNA recognition by PPR tracts has been unknown. We used computational methods to infer a code for nucleotide recognition involving two amino acids in each repeat, and we validated this model by recoding a PPR protein to bind novel RNA sequences in vitro. Our results show that PPR tracts bind RNA via a modular recognition mechanism that differs from previously described RNA–protein recognition modes and that underpins a natural library of specific protein/RNA partners of unprecedented size and diversity. These findings provide a significant step toward the prediction of native binding sites of the enormous number of PPR proteins found in nature. Furthermore, the extraordinary evolutionary plasticity of the PPR family suggests that the PPR scaffold will be particularly amenable to redesign for new sequence specificities and functions.
RNA binding proteins dictate RNA fate and function by modulating RNA processing, localization, translation, and stability. The consequences of RNA/protein interactions are determined, in part, by the position at which the protein binds the RNA. However, it is impossible to predict the target sites of most RNA binding proteins or to design them to bind chosen RNA sequences. In contrast, we show that the pentatricopeptide repeat (PPR) protein family holds exceptional promise for the rational design of specified RNA–binding properties. PPR proteins harbor tandem arrays of a repeating structural unit that form a surface for binding single-stranded RNA. We show that PPR tracts bind specific RNA nucleotides via the combinatorial action of two amino acids in each repeat. This mechanism mimics the simplicity and predictability of the Watson-Crick pairing between nucleic acid strands, but at a protein/RNA interface. Our findings will facilitate the prediction of binding sites for the large number of PPR proteins found in nature. Additionally, our demonstration that a PPR tract can be engineered to bind specified RNA sequences implies that PPR proteins can be designed to bind desired RNA targets for applications in biotechnology, medicine, and basic research.
The majority of peroxisomal matrix proteins destined for translocation into the peroxisomal lumen are recognised via a C-terminal Peroxisomal Target Signal type 1 by the cycling receptor Pex5p. The only structure to date of Pex5p in complex with a cargo protein is that of the C-terminal cargo-binding domain of the receptor with sterol carrier protein 2, a small, model peroxisomal protein. In this study, we have tested the contribution of a second, ancillary receptor-cargo binding site, which was found in addition to the characterised Peroxisomal Target Signal type 1.
To investigate the function of this secondary interface we have mutated two key residues from the ancillary binding site and analyzed the level of binding first by a yeast-two-hybrid assay, followed by quantitative measurement of the binding affinity and kinetics of purified protein components and finally, by in vivo measurements, to determine translocation capability. While a moderate but significant reduction of the interaction was found in binding assays, we were not able to measure any significant defects in vivo.
Our data therefore suggest that at least in the case of sterol carrier protein 2 the contribution of the second binding site is not essential for peroxisomal import. At this stage, however, we cannot rule out that other cargo proteins may require this ancillary binding site.
Paraspeckles are ribonucleoprotein bodies found in the interchromatin space of mammalian cell nuclei. These structures play a role in regulating the expression of certain genes in differentiated cells by nuclear retention of RNA. The core paraspeckle proteins (PSF/SFPQ, P54NRB/NONO, and PSPC1 [paraspeckle protein 1]) are members of the DBHS (Drosophila melanogaster behavior, human splicing) family. These proteins, together with the long nonprotein-coding RNA NEAT1 (MEN-ϵ/β), associate to form paraspeckles and maintain their integrity. Given the large numbers of long noncoding transcripts currently being discovered through whole transcriptome analysis, paraspeckles may be a paradigm for a class of subnuclear bodies formed around long noncoding RNA.
The detoxification enzyme glyoxalase I from L. major has been crystallized. Preliminary molecular-replacement calculations indicate the presence of three glyoxalase I dimers in the asymmetric unit.
Glyoxalase I (GLO1) is a putative drug target for trypanosomatids, which are pathogenic protozoa that include the causative agents of leishmaniasis. Significant sequence and functional differences between Leishmania major and human GLO1 suggest that it may make a suitable template for rational inhibitor design. L. major GLO1 was crystallized in two forms: the first is extremely disordered and does not diffract, while the second, an orthorhombic form, produces diffraction to 2.0 Å. Molecular-replacement calculations indicate that there are three GLO1 dimers in the asymmetric unit, which take up a helical arrangement with their molecular dyads arranged approximately perpendicular to the c axis. Further analysis of these data are under way.
glyoxalase I; leishmaniasis
P54nrb is a protein implicated in multiple nuclear processes whose specific functions may correlate with its presence at different nuclear locations. Here we characterize paraspeckles, a subnuclear domain containing p54nrb and other RNA-binding proteins including PSP1, a protein with sequence similarity to p54nrb that acts as a marker for paraspeckles. We show that PSP1 interacts in vivo with a subset of the total cellular pool of p54nrb. We map the domain within PSP1 that is mediating this interaction and show it is required for the correct localization of PSP1 to paraspeckles. This interaction is necessary but not sufficient for paraspeckle targeting by PSP1, which also requires an RRM capable of RNA binding. Blocking the reinitiation of RNA Pol II transcription at the end of mitosis with DRB prevents paraspeckle formation, which recommences after removal of DRB, indicating that paraspeckle formation is dependent on RNA Polymerase II transcription. Thus paraspeckles are the sites where a subset of the total cellular pool of p54nrb is targeted in a RNA Polymerase II-dependent manner.
Homologous recombinational repair is an essential mechanism for repair of double-strand breaks in DNA. Recombinases of the RecA-fold family play a crucial role in this process, forming filaments that utilize ATP to mediate their interactions with single- and double-stranded DNA. The recombinase molecules present in the archaea (RadA) and eukaryota (Rad51) are more closely related to each other than to their bacterial counterpart (RecA) and, as a result, RadA makes a suitable model for the eukaryotic system. The crystal structure of Sulfolobus solfataricus RadA has been solved to a resolution of 3.2 Å in the absence of nucleotide analogues or DNA, revealing a narrow filamentous assembly with three molecules per helical turn. As observed in other RecA-family recombinases, each RadA molecule in the filament is linked to its neighbour via interactions of a short β-strand with the neighbouring ATPase domain. However, despite apparent flexibility between domains, comparison with other structures indicates conservation of a number of key interactions that introduce rigidity to the system, allowing allosteric control of the filament by interaction with ATP. Additional analysis reveals that the interaction specificity of the five human Rad51 paralogues can be predicted using a simple model based on the RadA structure.
Two archaeal Holliday junction resolving enzymes, Holliday junction cleavage (Hjc) and Holliday junction endonuclease (Hje), have been characterized. Both are members of a nuclease superfamily that includes the type II restriction enzymes, although their DNA cleaving activity is highly specific for four-way junction structure and not nucleic acid sequence. Despite 28% sequence identity, Hje and Hjc cleave junctions with distinct cutting patterns—they cut different strands of a four-way junction, at different distances from the junction centre. We report the high-resolution crystal structure of Hje from Sulfolobus solfataricus. The structure provides a basis to explain the differences in substrate specificity of Hje and Hjc, which result from changes in dimer organization, and suggests a viral origin for the Hje gene. Structural and biochemical data support the modelling of an Hje:DNA junction complex, highlighting a flexible loop that interacts intimately with the junction centre. A highly conserved serine residue on this loop is shown to be essential for the enzyme's activity, suggesting a novel variation of the nuclease active site. The loop may act as a conformational switch, ensuring that the active site is completed only on binding a four-way junction, thus explaining the exquisite specificity of these enzymes.