1.  The EBI RDF platform: linked open data for the life sciences 
Bioinformatics  2014;30(9):1338-1339.
Motivation: Resource description framework (RDF) is an emerging technology for describing, publishing and linking life science data. As a major provider of bioinformatics data and services, the European Bioinformatics Institute (EBI) is committed to making data readily accessible to the community in ways that meet existing demand. The EBI RDF platform has been developed to meet an increasing demand to coordinate RDF activities across the institute and provides a new entry point to querying and exploring integrated resources available at the EBI.
PMCID: PMC3998127  PMID: 24413672
2.  MageComet—web application for harmonizing existing large-scale experiment descriptions 
Bioinformatics  2012;28(10):1402-1403.
Motivation: Meta-analysis of large gene expression datasets obtained from public repositories requires consistently annotated data. Curation of such experiments, however, is an expert activity which involves repetitive manipulation of text. Existing tools for automated curation are few, which creates a bottleneck in the analysis pipeline.
Results: We present MageComet, a web application for biologists and annotators that facilitates the re-annotation of gene expression experiments in MAGE-TAB format. It incorporates data mining, automatic annotation, use of ontologies and data validation to improve the consistency and quality of experimental meta-data from the ArrayExpress Repository.
Availability and implementation: Source and tutorials for MageComet are openly available under the GNU GPL v3 license. A running implementation is also available.
PMCID: PMC3348561  PMID: 22474121
3.  Annotare—a tool for annotating high-throughput biomedical investigations and resulting data 
Bioinformatics  2010;26(19):2470-2471.
Summary: Computational methods in molecular biology will increasingly depend on standards-based annotations that describe biological experiments in an unambiguous manner. Annotare is a software tool that enables biologists to easily annotate their high-throughput experiments, biomaterials and data in a standards-compliant way that facilitates meaningful search and analysis.
Availability and Implementation: Annotare is available under the terms of the open-source MIT License. It has been tested on both Mac and Windows.
PMCID: PMC2944206  PMID: 20733062
4.  Modeling sample variables with an Experimental Factor Ontology 
Bioinformatics  2010;26(8):1112-1118.
Motivation: Describing biological sample variables with ontologies is complex due to the cross-domain nature of experiments. Ontologies provide annotation solutions; however, for cross-domain investigations, multiple ontologies are needed to represent the data. These are subject to rapid change, are often not interoperable and present complexities that are a barrier to biological resource users.
Results: We present the Experimental Factor Ontology, designed to meet cross-domain, application focused use cases for gene expression data. We describe our methodology and open source tools used to create the ontology. These include tools for creating ontology mappings, ontology views, detecting ontology changes and using ontologies in interfaces to enhance querying. The application of reference ontologies to data is a key problem, and this work presents guidelines on how community ontologies can be presented in an application ontology in a data-driven way.
Supplementary information: Supplementary data are available at Bioinformatics online.
PMCID: PMC2853691  PMID: 20200009
5.  Importing ArrayExpress datasets into R/Bioconductor 
Bioinformatics  2009;25(16):2092-2094.
Summary: ArrayExpress is one of the largest public repositories of microarray datasets. R/Bioconductor provides a comprehensive suite of microarray analysis and integrative bioinformatics software. However, easy ways to import datasets from ArrayExpress into R/Bioconductor have been lacking. Here, we present such a tool that is suitable for both interactive and automated use.
Availability: The ArrayExpress package is available from the Bioconductor project. A user's guide and examples are provided with the package.
Supplementary information: Supplementary data are available at Bioinformatics online.
PMCID: PMC2723004  PMID: 19505942
6.  MAGETabulator, a suite of tools to support the microarray data format MAGE-TAB 
Bioinformatics  2008;25(2):279-280.
Summary: The MAGE-TAB format for microarray data representation and exchange has been proposed by the microarray community to replace the more complex MAGE-ML format. We present a suite of tools to support MAGE-TAB generation and validation, conversion between existing formats for data exchange, visualization of the experiment designs encoded by MAGE-TAB documents and the mining of such documents for semantic content.
Availability: Software is available from
PMCID: PMC2638998  PMID: 19038988

