PMCC PMCC

Search tips
Search criteria

Advanced
Results 1-20 (20)
 

Clipboard (0)
None

Select a Filter Below

Journals
more »
Year of Publication
1.  The HUPO proteomics standards initiative- mass spectrometry controlled vocabulary 
Controlled vocabularies (CVs), i.e. a collection of predefined terms describing a modeling domain, used for the semantic annotation of data, and ontologies are used in structured data formats and databases to avoid inconsistencies in annotation, to have a unique (and preferably short) accession number and to give researchers and computer algorithms the possibility for more expressive semantic annotation of data. The Human Proteome Organization (HUPO)–Proteomics Standards Initiative (PSI) makes extensive use of ontologies/CVs in their data formats. The PSI-Mass Spectrometry (MS) CV contains all the terms used in the PSI MS–related data standards. The CV contains a logical hierarchical structure to ensure ease of maintenance and the development of software that makes use of complex semantics. The CV contains terms required for a complete description of an MS analysis pipeline used in proteomics, including sample labeling, digestion enzymes, instrumentation parts and parameters, software used for identification and quantification of peptides/proteins and the parameters and scores used to determine their significance. Owing to the range of topics covered by the CV, collaborative development across several PSI working groups, including proteomics research groups, instrument manufacturers and software vendors, was necessary. In this article, we describe the overall structure of the CV, the process by which it has been developed and is maintained and the dependencies on other ontologies.
Database URL: http://psidev.cvs.sourceforge.net/viewvc/psidev/psi/psi-ms/mzML/controlledVocabulary/psi-ms.obo
doi:10.1093/database/bat009
PMCID: PMC3594986  PMID: 23482073
2.  Correction: Conserved BK Channel-Protein Interactions Reveal Signals Relevant to Cell Death and Survival 
PLoS ONE  2012;7(1):10.1371/annotation/15e95626-9f5c-4882-ba78-826b80c48028.
doi:10.1371/annotation/15e95626-9f5c-4882-ba78-826b80c48028
PMCID: PMC3268691
4.  Conserved BK Channel-Protein Interactions Reveal Signals Relevant to Cell Death and Survival 
PLoS ONE  2011;6(12):e28532.
The large-conductance Ca2+-activated K+ (BK) channel and its β-subunit underlie tuning in non-mammalian sensory or hair cells, whereas in mammals its function is less clear. To gain insights into species differences and to reveal putative BK functions, we undertook a systems analysis of BK and BK-Associated Proteins (BKAPS) in the chicken cochlea and compared these results to other species. We identified 110 putative partners from cytoplasmic and membrane/cytoskeletal fractions, using a combination of coimmunoprecipitation, 2-D gel, and LC-MS/MS. Partners included 14-3-3γ, valosin-containing protein (VCP), stathmin (STMN), cortactin (CTTN), and prohibitin (PHB), of which 16 partners were verified by reciprocal coimmunoprecipitation. Bioinformatics revealed binary partners, the resultant interactome, subcellular localization, and cellular processes. The interactome contained 193 proteins involved in 190 binary interactions in subcellular compartments such as the ER, mitochondria, and nucleus. Comparisons with mice showed shared hub proteins that included N-methyl-D-aspartate receptor (NMDAR) and ATP-synthase. Ortholog analyses across six species revealed conserved interactions involving apoptosis, Ca2+ binding, and trafficking, in chicks, mice, and humans. Functional studies using recombinant BK and RNAi in a heterologous expression system revealed that proteins important to cell death/survival, such as annexinA5, γ-actin, lamin, superoxide dismutase, and VCP, caused a decrease in BK expression. This revelation led to an examination of specific kinases and their effectors relevant to cell viability. Sequence analyses of the BK C-terminus across 10 species showed putative binding sites for 14-3-3, RAC-α serine/threonine-protein kinase 1 (Akt), glycogen synthase kinase-3β (GSK3β) and phosphoinositide-dependent kinase-1 (PDK1). Knockdown of 14-3-3 and Akt caused an increase in BK expression, whereas silencing of GSK3β and PDK1 had the opposite effect. This comparative systems approach suggests conservation in BK function across different species in addition to novel functions that may include the initiation of signals relevant to cell death/survival.
doi:10.1371/journal.pone.0028532
PMCID: PMC3235137  PMID: 22174833
5.  The IntAct molecular interaction database in 2012 
Nucleic Acids Research  2011;40(D1):D841-D846.
IntAct is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. Two levels of curation are now available within the database, with both IMEx-level annotation and less detailed MIMIx-compatible entries currently supported. As from September 2011, IntAct contains approximately 275 000 curated binary interaction evidences from over 5000 publications. The IntAct website has been improved to enhance the search process and in particular the graphical display of the results. New data download formats are also available, which will facilitate the inclusion of IntAct's data in the Semantic Web. IntAct is an active contributor to the IMEx consortium (http://www.imexconsortium.org). IntAct source code and data are freely available at http://www.ebi.ac.uk/intact.
doi:10.1093/nar/gkr1088
PMCID: PMC3245075  PMID: 22121220
6.  Towards BioDBcore: a community-defined information specification for biological databases 
The present article proposes the adoption of a community-defined, uniform, generic description of the core attributes of biological databases, BioDBCore. The goals of these attributes are to provide a general overview of the database landscape, to encourage consistency and interoperability between resources; and to promote the use of semantic and syntactic standards. BioDBCore will make it easier for users to evaluate the scope and relevance of available resources. This new resource will increase the collective impact of the information present in biological databases.
doi:10.1093/database/baq027
PMCID: PMC3017395  PMID: 21205783
7.  Meeting Report from the Second “Minimum Information for Biological and Biomedical Investigations” (MIBBI) workshop 
Standards in Genomic Sciences  2010;3(3):259-266.
This report summarizes the proceedings of the second workshop of the ‘Minimum Information for Biological and Biomedical Investigations’ (MIBBI) consortium held on Dec 1-2, 2010 in Rüdesheim, Germany through the sponsorship of the Beilstein-Institute. MIBBI is an umbrella organization uniting communities developing Minimum Information (MI) checklists to standardize the description of data sets, the workflows by which they were generated and the scientific context for the work. This workshop brought together representatives of more than twenty communities to present the status of their MI checklists and plans for future development. Shared challenges and solutions were identified and the role of MIBBI in MI checklist development was discussed. The meeting featured some thirty presentations, wide-ranging discussions and breakout groups. The top outcomes of the two-day workshop as defined by the participants were: 1) the chance to share best practices and to identify areas of synergy; 2) defining a series of tasks for updating the MIBBI Portal; 3) reemphasizing the need to maintain independent MI checklists for various communities while leveraging common terms and workflow elements contained in multiple checklists; and 4) revision of the concept of the MIBBI Foundry to focus on the creation of a core set of MIBBI modules intended for reuse by individual MI checklist projects while maintaining the integrity of each MI project. Further information about MIBBI and its range of activities can be found at http://mibbi.org/.
doi:10.4056/sigs.147362
PMCID: PMC3035314  PMID: 21304730
8.  Meeting Report: BioSharing at ISMB 2010 
Standards in Genomic Sciences  2010;3(3):254-258.
This report summarizes the proceedings of the one day BioSharing meeting held at the Intelligent Systems for Molecular Biology (ISMB) 2010 conference in Boston, MA, USA This inaugural BioSharing event was hosted by the Genomic Standards Consortium as part of its M3 & BioSharing special interest group (SIG) workshop. The BioSharing event included invited talks from a range of community leaders and a panel discussion at the end of the day. The panel session led to the formal agreement among community leaders to join together to promote cross-community knowledge exchange and collaborations. A key focus of the newly formed Biosharing community will be linking up resources to promote real-world data sharing (virtuous cycle of data) and supporting compliance with data policies through the creation of a one-stop-portal of information. Further information about the newly established BioSharing effort can be found at http://biosharing.org.
doi:10.4056/sigs/1403501
PMCID: PMC3035313  PMID: 21304729
9.  Towards BioDBcore: a community-defined information specification for biological databases 
Nucleic Acids Research  2010;39(Database issue):D7-D10.
The present article proposes the adoption of a community-defined, uniform, generic description of the core attributes of biological databases, BioDBCore. The goals of these attributes are to provide a general overview of the database landscape, to encourage consistency and interoperability between resources and to promote the use of semantic and syntactic standards. BioDBCore will make it easier for users to evaluate the scope and relevance of available resources. This new resource will increase the collective impact of the information present in biological databases.
doi:10.1093/nar/gkq1173
PMCID: PMC3013734  PMID: 21097465
10.  The SDR (Short-Chain Dehydrogenase/Reductase and Related Enzymes) Nomenclature Initiative 
Chemico-biological interactions  2008;178(1-3):94-98.
Summary
Short-chain dehydrogenases/reductases (SDR) constitute one of the largest enzyme superfamilies with presently over 46 000 members. In phylogenetic comparisons, members of this superfamily show early divergence where the majority have only low pair-wise sequence identity, although sharing common structural properties. The SDR enzymes are present in virtually all genomes investigated, and in humans over 70 SDR genes have been identified. In humans, these enzymes are involved in the metabolism of a large variety of compounds, including steroid hormones, prostaglandins, retinoids, lipids and xenobiotics. It is now clear that SDRs represent one of the oldest protein families and contribute to essential functions and interactions of all forms of life. As this field continues to grow rapidly, a systematic nomenclature is essential for future annotation and reference purposes. A functional subdivision of the SDR superfamily into at least 200 SDR families based upon hidden Markov models forms a suitable foundation for such a nomenclature system, which we present in this paper using human SDRs as examples.
doi:10.1016/j.cbi.2008.10.040
PMCID: PMC2896744  PMID: 19027726
SDR; enzymes; nomenclature; bioinformatics; hidden Markov models
11.  Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project 
Nature biotechnology  2008;26(8):889-896.
The Minimum Information for Biological and Biomedical Investigations (MIBBI) project provides a resource for those exploring the range of extant minimum information checklists and fosters coordinated development of such checklists.
doi:10.1038/nbt.1411
PMCID: PMC2771753  PMID: 18688244
12.  Nucleoside diphosphate kinase (NDPK, NM23, AWD): recent regulatory advances in endocytosis, metastasis, psoriasis, insulin release, fetal erythroid lineage and heart failure; translational medicine exemplified 
The guest editor (AM) provides his perspective on the most recent advances on nucleoside diphosphate kinase (NDPK, otherwise known as AWD or NM23) showcasing phospho-histidine biochemistry and its impact on diverse pathology when disordered. His co-author (SO) provides state-of-the-art analyses from the European institute of Bioinformatics in an appendix to support the most recent advances made by the NDPK community. Unfortunately, to those outside the field, NDPK is often dismissed as a tiny ‘ancient housekeeper’ protein found in marine sponges, social amoebae, worms, fruit flies, rodents and humans but the state-of-the-art papers overviewed here show that NDPK does not act simply in mindless rote, inter-converting cellular ‘energy currencies’. That two NDPK isoforms regulate fetal erythroid lineage is a developmental case in point. Seminal Cancer Research UK support is gratefully acknowledged that generated additional resources to enable the NDPK community to meet in Dundee in 2007 (www.dundee.ac.uk/mchs/ndpk; next meeting is planned: 2010/Mannheim-Heidelberg). The presented papers illustrate the point that when scientists are left alone ‘shut up in the narrow cell of their laboratory’ (as the philosopher Ortega once said, a sentiment echoed by Erwin Schrödinger), then progress will ultimately occur bridging the gap between specialization and translation for human benefit. To aid translation, this overview initially introduces the NDPK family to the non-specialist, who serendipitously finds these proteins in their biology. This is immediately followed by examples of the diverse biology generated by this self-aggregating group of multi-functional proteins and finally capped by an emerging idea explaining how this diversity might arise.
doi:10.1007/s11010-009-0114-5
PMCID: PMC2721137  PMID: 19415463
HAART; Drosophila; Jade Goody; Bioinformatics; Dictyostelium; Ion transport
13.  MINT and IntAct contribute to the Second BioCreative challenge: serving the text-mining community with high quality molecular interaction data 
Genome Biology  2008;9(Suppl 2):S5.
Background
In the absence of consolidated pipelines to archive biological data electronically, information dispersed in the literature must be captured by manual annotation. Unfortunately, manual annotation is time consuming and the coverage of published interaction data is therefore far from complete. The use of text-mining tools to identify relevant publications and to assist in the initial information extraction could help to improve the efficiency of the curation process and, as a consequence, the database coverage of data available in the literature. The 2006 BioCreative competition was aimed at evaluating text-mining procedures in comparison with manual annotation of protein-protein interactions.
Results
To aid the BioCreative protein-protein interaction task, IntAct and MINT (Molecular INTeraction) provided both the training and the test datasets. Data from both databases are comparable because they were curated according to the same standards. During the manual curation process, the major cause of data loss in mining the articles for information was ambiguity in the mapping of the gene names to stable UniProtKB database identifiers. It was also observed that most of the information about interactions was contained only within the full-text of the publication; hence, text mining of protein-protein interaction data will require the analysis of the full-text of the articles and cannot be restricted to the abstract.
Conclusion
The development of text-mining tools to extract protein-protein interaction information may increase the literature coverage achieved by manual curation. To support the text-mining community, databases will highlight those sentences within the articles that describe the interactions. These will supply data-miners with a high quality dataset for algorithm development. Furthermore, the dictionary of terms created by the BioCreative competitors could enrich the synonym list of the PSI-MI (Proteomics Standards Initiative-Molecular Interactions) controlled vocabulary, which is used by both databases to annotate their data content.
doi:10.1186/gb-2008-9-s2-s5
PMCID: PMC2559989  PMID: 18834496
14.  Broadening the horizon – level 2.5 of the HUPO-PSI format for molecular interactions 
BMC Biology  2007;5:44.
Background
Molecular interaction Information is a key resource in modern biomedical research. Publicly available data have previously been provided in a broad array of diverse formats, making access to this very difficult. The publication and wide implementation of the Human Proteome Organisation Proteomics Standards Initiative Molecular Interactions (HUPO PSI-MI) format in 2004 was a major step towards the establishment of a single, unified format by which molecular interactions should be presented, but focused purely on protein-protein interactions.
Results
The HUPO-PSI has further developed the PSI-MI XML schema to enable the description of interactions between a wider range of molecular types, for example nucleic acids, chemical entities, and molecular complexes. Extensive details about each supported molecular interaction can now be captured, including the biological role of each molecule within that interaction, detailed description of interacting domains, and the kinetic parameters of the interaction. The format is supported by data management and analysis tools and has been adopted by major interaction data providers. Additionally, a simpler, tab-delimited format MITAB2.5 has been developed for the benefit of users who require only minimal information in an easy to access configuration.
Conclusion
The PSI-MI XML2.5 and MITAB2.5 formats have been jointly developed by interaction data producers and providers from both the academic and commercial sector, and are already widely implemented and well supported by an active development community. PSI-MI XML2.5 enables the description of highly detailed molecular interaction data and facilitates data exchange between databases and users without loss of information. MITAB2.5 is a simpler format appropriate for fast Perl parsing or loading into Microsoft Excel.
doi:10.1186/1741-7007-5-44
PMCID: PMC2189715  PMID: 17925023
15.  New developments in the InterPro database 
Nucleic Acids Research  2007;35(Database issue):D224-D228.
InterPro is an integrated resource for protein families, domains and functional sites, which integrates the following protein signature databases: PROSITE, PRINTS, ProDom, Pfam, SMART, TIGRFAMs, PIRSF, SUPERFAMILY, Gene3D and PANTHER. The latter two new member databases have been integrated since the last publication in this journal. There have been several new developments in InterPro, including an additional reading field, new database links, extensions to the web interface and additional match XML files. InterPro has always provided matches to UniProtKB proteins on the website and in the match XML file on the FTP site. Additional matches to proteins in UniParc (UniProt archive) are now available for download in the new match XML files only. The latest InterPro release (13.0) contains more than 13 000 entries, covering over 78% of all proteins in UniProtKB. The database is available for text- and sequence-based searches via a webserver (), and for download by anonymous FTP (). The InterProScan search tool is now also available via a web service at .
doi:10.1093/nar/gkl841
PMCID: PMC1899100  PMID: 17202162
16.  InterPro, progress and status in 2005 
Nucleic Acids Research  2004;33(Database Issue):D201-D205.
InterPro, an integrated documentation resource of protein families, domains and functional sites, was created to integrate the major protein signature databases. Currently, it includes PROSITE, Pfam, PRINTS, ProDom, SMART, TIGRFAMs, PIRSF and SUPERFAMILY. Signatures are manually integrated into InterPro entries that are curated to provide biological and functional information. Annotation is provided in an abstract, Gene Ontology mapping and links to specialized databases. New features of InterPro include extended protein match views, taxonomic range information and protein 3D structure data. One of the new match views is the InterPro Domain Architecture view, which shows the domain composition of protein matches. Two new entry types were introduced to better describe InterPro entries: these are active site and binding site. PIRSF and the structure-based SUPERFAMILY are the latest member databases to join InterPro, and CATH and PANTHER are soon to be integrated. InterPro release 8.0 contains 11 007 entries, representing 2573 domains, 8166 families, 201 repeats, 26 active sites, 21 binding sites and 20 post-translational modification sites. InterPro covers over 78% of all proteins in the Swiss-Prot and TrEMBL components of UniProt. The database is available for text- and sequence-based searches via a webserver (http://www.ebi.ac.uk/interpro), and for download by anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro).
doi:10.1093/nar/gki106
PMCID: PMC540060  PMID: 15608177
17.  IntAct: an open source molecular interaction database 
Nucleic Acids Research  2004;32(Database issue):D452-D455.
IntAct provides an open source database and toolkit for the storage, presentation and analysis of protein interactions. The web interface provides both textual and graphical representations of protein interactions, and allows exploring interaction networks in the context of the GO annotations of the interacting proteins. A web service allows direct computational access to retrieve interaction networks in XML format. IntAct currently contains ∼2200 binary and complex interactions imported from the literature and curated in collaboration with the Swiss-Prot team, making intensive use of controlled vocabularies to ensure data consistency. All IntAct software, data and controlled vocabularies are available at http://www.ebi.ac.uk/intact.
doi:10.1093/nar/gkh052
PMCID: PMC308786  PMID: 14681455
18.  Progress in Establishing Common Standards for Exchanging Proteomics Data: The Second Meeting of the HUPO Proteomics Standards Initiative 
The Proteomics Standards Initiative (PSI) aims to define community standards for data representation in proteomics and to facilitate data comparison, exchange and verification. Rapid progress has been made in the development of common standards for data exchange in the fields of both mass spectrometry and protein–protein interactions since the first PSI meeting [1]. Both hardware and software manufacturers have agreed to work to ensure that a proteomics-specific extension is created for the emerging ASTM mass spectrometry standard and the data model for a proteomics experiment has advanced significantly. The Protein–Protein Interactions (PPI) group expects to publish the Level 1 PSI data exchange format for protein–protein interactions by early summer this year, and discussion as to the additional content of Level 2 has been initiated.
doi:10.1002/cfg.279
PMCID: PMC2447408  PMID: 18629121
19.  The HUPO Proteomics Standards Initiative Meeting: Towards Common Standards for Exchanging Proteomics Data 
The Proteomics Standards Initiative (PSI) aims to define community standards for data representation in proteomics and to facilitate data comparison, exchange and verification. Initially the fields of protein–protein interactions (PPI) and mass spectroscopy have been targeted and the inaugural meeting of the PSI addressed the questions of data storage and exchange in both of these areas. The PPI group rapidly reached consensus as to the minimum requirements for a data exchange model; an XML draft is now being produced. The mass spectroscopy group have achieved major advances in the definition of a required data model and working groups are currently taking these discussions further. A further meeting is planned in January 2003 to advance both these projects.
doi:10.1002/cfg.232
PMCID: PMC2447389  PMID: 18629102
20.  The InterPro Database, 2003 brings increased coverage and new features 
Nucleic Acids Research  2003;31(1):315-318.
InterPro, an integrated documentation resource of protein families, domains and functional sites, was created in 1999 as a means of amalgamating the major protein signature databases into one comprehensive resource. PROSITE, Pfam, PRINTS, ProDom, SMART and TIGRFAMs have been manually integrated and curated and are available in InterPro for text- and sequence-based searching. The results are provided in a single format that rationalises the results that would be obtained by searching the member databases individually. The latest release of InterPro contains 5629 entries describing 4280 families, 1239 domains, 95 repeats and 15 post-translational modifications. Currently, the combined signatures in InterPro cover more than 74% of all proteins in SWISS-PROT and TrEMBL, an increase of nearly 15% since the inception of InterPro. New features of the database include improved searching capabilities and enhanced graphical user interfaces for visualisation of the data. The database is available via a webserver (http://www.ebi.ac.uk/interpro) and anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro).
PMCID: PMC165493  PMID: 12520011

Results 1-20 (20)