The Illumina HumanMethylation450 BeadChip has become a popular platform for interrogating DNA methylation in epigenome-wide association studies (EWAS) and related projects as well as resource efforts such as the International Cancer Genome Consortium (ICGC) and the International Human Epigenome Consortium (IHEC). This has resulted in an exponential increase of 450k data in recent years and triggered the development of numerous integrated analysis pipelines and stand-alone packages. This review will introduce and discuss the currently most popular pipelines and packages and is particularly aimed at new 450k users.
HumanMethylation450; Epigenetics; Analysis pipelines; EWAS; DNA methylation; 450k BeadChip
The speed and resolution at which we can scour the genome for DNA methylation changes has improved immeasurably in the last 10 years and the advent of the Illumina 450K BeadChip has made epigenome-wide association studies (EWAS) a reality. The resulting datasets are conveniently formatted to allow easy alignment of significant hits to genes and genetic features, however; methods that parse significant hits into discreet differentially methylated regions (DMRs) remain a challenge to implement. In this paper we present details of a novel DMR caller, the Probe Lasso: a flexible window based approach that gathers neighbouring significant-signals to define clear DMR boundaries for subsequent in-depth analysis. The method is implemented in the R package ChAMP (Morris et al., 2014) and returns sets of DMRs according to user-tuned levels of probe filtering (e.g., inclusion of sex chromosomes, polymorphisms) and probe-lasso size distribution. Using a sub-sample of colon cancer- and healthy colon-samples from TCGA we show that Probe Lasso shifts DMR calling away from just probe-dense regions, and calls a range of DMR sizes ranging from tens-of-bases to tens-of-kilobases in scale. Moreover, using TCGA data we show that Probe Lasso leverages more information from the array and highlights a potential role of hypomethylated transcription factor binding motifs not discoverable using a basic, fixed-window approach.
Differentially methylated regions; DNA methylation; Epigenetics; EWAS; Illumina 450K BeadChip
•A method is presented for 5hmC detection and analysis using Infinium 450K BeadChips.•The oxBS-450K method can discriminate between 5mC and 5hmC in human gDNA•5hmC levels were quantified genome-wide in 3 distinct biological samples.•The reported 5hmC signal was validated using mass spectrometry and pyrosequencing.•The effects of differing amounts of input DNA on final 5hmC call rate are discussed.
DNA methylation analysis has become an integral part of biomedical research. For high-throughput applications such as epigenome-wide association studies, the Infinium HumanMethylation450 (450K) BeadChip is currently the platform of choice. However, BeadChip processing relies on traditional bisulfite (BS) based protocols which cannot discriminate between 5-methylcytosine (5mC) and 5-hydroxymethylcytosine (5hmC). Here, we report the adaptation of the recently developed oxidative bisulfite (oxBS) chemistry to specifically detect both 5mC and 5hmC in a single workflow using 450K BeadChips, termed oxBS-450K. Supported by validation using mass spectrometry and pyrosequencing, we demonstrate reproducible (R2 > 0.99) detection of 5hmC in human brain tissue using the optimised oxBS-450K protocol described here.
Hydroxymethylation; DNA methylation; 450K BeadChip; Oxidation; Bisulfite conversion; Epigenetics
Epigenome-wide association studies (EWASs) provide a systematic approach to uncovering epigenetic variants underlying common diseases. Discoveries have shed light on novel molecular mechanisms of disease and enabled the application of epigenetic variants as biomarkers. Here, we highlight the recent advances in this emerging line of research and discuss key challenges for current and future studies.
GWAS; EWAS; DNA methylation; disease mechanism; biomarker
Wilms tumor is the most common pediatric renal malignancy and there is a clinical need for a molecular biomarker to assess treatment response and predict relapse. The known mutated genes in this tumor type show low mutation frequencies, whereas aberrant methylation at 11p15 is by far the most common aberration. We therefore analyzed the epigenome, rather than the genome, to identify ubiquitous tumor-specific biomarkers.
Methylome analysis of matched normal kidney and Wilms tumor identifies 309 preliminary methylation variable positions which we translate into three differentially methylated regions (DMRs) for use as tumor-specific biomarkers. Using two novel algorithms we show that these three DMRs are not confounded by cell type composition. We further show that these DMRs are not methylated in embryonic blastema but are intermediately methylated in Wilms tumor precursor lesions. We validate the biomarker DMRs using two independent sample sets of normal kidney and Wilms tumor and seven Wilms tumor histological subtypes, achieving 100% and 98% correct classification, respectively. As proof-of-principle for clinical utility, we successfully use biomarker DMR-2 in a pilot analysis of cell-free circulating DNA to monitor tumor response during treatment in ten patients.
These findings define the most common methylated regions in Wilms tumor known to date which are not associated with their embryonic origin or precursor stage. We show that this tumor-specific methylated DNA is released into the blood circulation where it can be detected non-invasively showing potential for clinical utility.
Electronic supplementary material
The online version of this article (doi:10.1186/s13059-014-0434-y) contains supplementary material, which is available to authorized users.
Recently it has been observed that cancer tissue is characterised by an increased variability in DNA methylation patterns. However, how the correlative patterns in genome-wide DNA methylation change during the carcinogenic progress has not yet been explored. Here we study genome-wide inter-CpG correlations in DNA methylation, in addition to single site variability, during cervical carcinogenesis. We demonstrate how the study of changes in DNA methylation covariation patterns across normal, intra-epithelial neoplasia and invasive cancer allows the identification of CpG sites that indicate the risk of neoplastic transformation in stages prior to neoplasia. Importantly, we show that the covariation in DNA methylation at these risk CpG loci is maximal immediately prior to the onset of cancer, supporting the view that high epigenetic diversity in normal cells increases the risk of cancer. Consistent with this, we observe that invasive cancers exhibit increased covariation in DNA methylation at the risk CpG sites relative to normal tissue, but lower levels relative to pre-cancerous lesions. We further show that the identified risk CpG sites undergo preferential DNA methylation changes in relation to human papilloma virus infection and age. Results are validated in independent data including prospectively collected samples prior to neoplastic transformation. Our data are consistent with a phase transition model of carcinogenesis, in which epigenetic diversity is maximal prior to the onset of cancer. The model and algorithm proposed here may allow, in future, network biomarkers predicting the risk of neoplastic transformation to be identified.
DNA methylation is a covalent modification of DNA which can regulate how active genes are. DNA methylation is altered at many genomic loci in cancer cells, leading to widespread functional disruption. Importantly, DNA methylation alterations across the genome are seen even in early carcinogenesis. Although the pattern of DNA methylation change during carcinogenesis has been studied at individual genomic loci, no study has yet analysed how these patterns change at a systems-level, specifically how do DNA methylation patterns at pairs of genomic sites change during disease progression. Doing so can shed light on how the epigenetic diversity of cell populations changes during the carcinogenic process. This study performs a systems-level analysis of the dynamic changes in DNA methylation correlation pattern during cervical carcinogenesis, demonstrating that epigenetic diversity is maximal just prior to the onset of cancer. Importantly, this supports the view that the risk of cancer development is closely related to an increase in epigenetic diversity in apparently healthy cells. In addition, the study provides a computational algorithm which successfully identifies the altered genomic sites confering the risk of cervical cancer.
Many genes important in immunity are found as multigene families. The butyrophilin genes are members of the B7 family, playing diverse roles in co-regulation and perhaps in antigen presentation. In humans, a fixed number of butyrophilin genes are found in and around the major histocompatibility complex (MHC), and show striking association with particular autoimmune diseases. In chickens, BG genes encode homologues with somewhat different domain organisation. Only a few BG genes have been characterised, one involved in actin-myosin interaction in the intestinal brush border, and another implicated in resistance to viral diseases. We characterise all BG genes in B12 chickens, finding a multigene family organised as tandem repeats in the BG region outside the MHC, a single gene in the MHC (the BF-BL region), and another single gene on a different chromosome. There is a precise cell and tissue expression for each gene, but overall there are two kinds, those expressed by haemopoietic cells and those expressed in tissues (presumably non-haemopoietic cells), correlating with two different kinds of promoters and 5′ untranslated regions (5′UTR). However, the multigene family in the BG region contains many hybrid genes, suggesting recombination and/or deletion as major evolutionary forces. We identify BG genes in the chicken whole genome shotgun sequence, as well as by comparison to other haplotypes by fibre fluorescence in situ hybridisation, confirming dynamic expansion and contraction within the BG region. Thus, the BG genes in chickens are undergoing much more rapid evolution compared to their homologues in mammals, for reasons yet to be understood.
Many immune genes are multigene families, presumably in response to pathogen variation. Some multigene families undergo expansion and contraction, leading to copy number variation (CNV), presumably due to more intense selection. Recently, the butyrophilin family in humans and other mammals has come under scrutiny, due to genetic associations with autoimmune diseases as well as roles in immune co-regulation and antigen presentation. Butyrophilin genes exhibit allelic polymorphism, but gene number appears stable within a species. We found that the BG homologues in chickens are very different, with great changes between haplotypes. We characterised one haplotype in detail, showing that there are two single BG genes, one on chromosome 2 and the other in the major histocompatibility complex (BF-BL region) on chromosome 16, and a family of BG genes in a tandem array in the BG region nearby. These genes have specific expression in cells and tissues, but overall are expressed in either haemopoietic cells or tissues. The two singletons have relatively stable evolutionary histories, but the BG region undergoes dynamic expansion and contraction, with the production of hybrid genes. Thus, chicken BG genes appear to evolve much more quickly than their closest homologs in mammals, presumably due to increased pressure from pathogens.
Aging is a progressive process that results in the accumulation of intra- and extracellular alterations that in turn contribute to a reduction in health. Age-related changes in DNA methylation have been reported before and may be responsible for aging-induced changes in gene expression, although a causal relationship has yet to be shown. Using genome-wide assays, we analyzed age-induced changes in DNA methylation and their effect on gene expression with and without transient induction with the synthetic transcription modulating agent WY14,643. To demonstrate feasibility of the approach, we isolated peripheral blood mononucleated cells (PBMCs) from five young and five old healthy male volunteers and cultured them with or without WY14,643. Infinium 450K BeadChip and Affymetrix Human Gene 1.1 ST expression array analysis revealed significant differential methylation of at least 5 % (ΔYO > 5 %) at 10,625 CpG sites between young and old subjects, but only a subset of the associated genes were also differentially expressed. Age-related differential methylation of previously reported epigenetic biomarkers of aging including ELOVL2, FHL2, PENK, and KLF14 was confirmed in our study, but these genes did not display an age-related change in gene expression in PBMCs. Bioinformatic analysis revealed that differentially methylated genes that lack an age-related expression change predominantly represent genes involved in carcinogenesis and developmental processes, and expression of most of these genes were silenced in PBMCs. No changes in DNA methylation were found in genes displaying transiently induced changes in gene expression. In conclusion, aging-induced differential methylation often targets developmental genes and occurs mostly without change in gene expression.
Electronic supplementary material
The online version of this article (doi:10.1007/s11357-014-9648-x) contains supplementary material, which is available to authorized users.
Molecular aging; Epigenetics; DNA methylation; Gene expression; PBMCs; Epigenetic biomarkers of aging
Common human diseases are caused by the complex interplay of genetic susceptibility as well as environmental factors. Due to the environment’s influence on the epigenome, and therefore genome function, as well as conversely the genome’s facilitative effect on the epigenome, analysis of this level of regulation may increase our knowledge of disease pathogenesis.
In order to identify human-specific epigenetic influences, we have performed a novel genome-wide DNA methylation analysis comparing human, chimpanzee and rhesus macaque.
We have identified that the immunological Leukotriene B4 receptor (LTB4R, BLT1 receptor) is the most epigenetically divergent human gene in peripheral blood in comparison with other primates. This difference is due to the co-ordinated active state of human-specific hypomethylation in the promoter and human-specific increased gene body methylation. This gene is significant in innate immunity and the LTB4/LTB4R pathway is involved in the pathogenesis of the spectrum of human inflammatory diseases. This finding was confirmed by additional neutrophil-only DNA methylome and lymphoblastoid H3K4me3 chromatin comparative data. Additionally we show through functional analysis that this receptor has increased expression and a higher response to the LTB4 ligand in human versus rhesus macaque peripheral blood mononuclear cells. Genome-wide we also find human species-specific differentially methylated regions (human s-DMRs) are more prevalent in CpG island shores than within the islands themselves, and within the latter are associated with the CTCF motif.
This result further emphasises the exclusive nature of the human immunological system, its divergent adaptation even from very closely related primates, and the power of comparative epigenomics to identify and understand human uniqueness.
The integration of genomic and epigenomic data is an increasingly popular approach for studying the complex mechanisms driving cancer development. We have developed a method for evaluating both methylation and copy number from high-density DNA methylation arrays. Comparing copy number data from Infinium HumanMethylation450 BeadChips and SNP arrays, we demonstrate that Infinium arrays detect copy number alterations with the sensitivity of SNP platforms. These results show that high-density methylation arrays provide a robust and economic platform for detecting copy number and methylation changes in a single experiment. Our method is available in the ChAMP Bioconductor package: http://www.bioconductor.org/packages/2.13/bioc/html/ChAMP.html.
Stem cells have been found in most tissues/organs. These somatic stem cells produce replacements for lost and damaged cells, and it is not completely understood how this regenerative capacity becomes diminished during aging. To study the possible involvement of epigenetic changes in somatic stem cell aging, we used murine hematopoiesis as a model system. Hematopoietic stem cells (HSCs) were enriched for via Hoechst exclusion activity (SP-HSC) from young, medium-aged and old mice and subjected to comprehensive, global methylome (MeDIP-seq) analysis. With age, we observed a global loss of DNA methylation of approximately 5%, but an increase in methylation at some CpG islands. Just over 100 significant (FDR < 0.2) aging-specific differentially methylated regions (aDMRs) were identified, which are surprisingly few considering the profound age-based changes that occur in HSC biology. Interestingly, the polycomb repressive complex -2 (PCRC2) target genes Kiss1r, Nav2 and Hsf4 were hypermethylated with age. The promoter for the Sdpr gene was determined to be progressively hypomethylated with age. This occurred concurrently with an increase in gene expression with age. To explore this relationship further, we cultured isolated SP-HSC in the presence of 5-aza-deoxycytdine and demonstrated a negative correlation between Sdpr promoter methylation and gene expression. We report that DNA methylation patterns are well preserved during hematopoietic stem cell aging, confirm that PCRC2 targets are increasingly methylated with age, and suggest that SDPR expression changes with age in HSCs may be regulated via age-based alterations in DNA methylation.
hematopoietic stem cells; aging; epigenetics; methylomics; methylome; Nano-MeDIP-seq; DNA methylation; Sdpr polycomb repressive complex -2 (PCRC2); Nav2; Kiss1r; Hsf4
It is now well established that the genomic landscape of DNA methylation (DNAm) gets altered as a function of age, a process we here call ‘epigenetic drift’. The biological, functional, clinical and evolutionary significance of this epigenetic drift, however, remains unclear. We here provide a brief review of epigenetic drift, focusing on the potential implications for ageing, stem cell biology and disease risk prediction. It has been demonstrated that epigenetic drift affects most of the genome, suggesting a global deregulation of DNAm patterns with age. A component of this drift is tissue-specific, allowing remarkably accurate age-predictive models to be constructed. Another component is tissue-independent, targeting stem cell differentiation pathways and affecting stem cells, which may explain the observed decline of stem cell function with age. Age-associated increases in DNAm target developmental genes, overlapping those associated with environmental disease risk factors and with disease itself, notably cancer. In particular, cancers and precursor cancer lesions exhibit aggravated age DNAm signatures. Epigenetic drift is also influenced by genetic factors. Thus, drift emerges as a promising biomarker for premature or biological ageing, and could potentially be used in geriatrics for disease risk prediction. Finally, we propose, in the context of human evolution, that epigenetic drift may represent a case of epigenetic thrift, or bet-hedging. In summary, this review demonstrates the growing importance of the ‘ageing epigenome’, with potentially far-reaching implications for understanding the effect of age on stem cell function and differentiation, as well as for disease prevention.
One in six cancers worldwide is caused by infection and human papillomavirus (HPV) is one of the main culprits. To better understand the dynamics of HPV integration and its effect on both the viral and host methylomes, we conducted whole-genome DNA methylation analysis using MeDIP-seq of HPV+ and HPV- head and neck squamous cell carcinoma (HNSCC). We determined the viral subtype to be HPV-16 in all cases and show that HPV-16 integrates into the host genome at multiple random sites and that this process predominantly involves the transcriptional repressor gene (E2) in the viral genome. Comparative analysis identified 453 (FDR ≤ 0.01) differentially methylated regions (DMRs) in the HPV+ host methylome. Bioinformatics characterization of these DMRs confirmed the previously reported cadherin genes to be affected but also revealed new targets for HPV-mediated methylation changes at regions not covered by array-based platforms, including the recently identified super-enhancers.
human papillomavirus (HPV); head and neck squamous cell carcinoma (HNSCC); DNA methylation; methylome; epigenome
Cell type-specific patterns of gene expression reflect epigenetic changes imposed through a particular developmental lineage as well as those triggered by environmental cues within adult tissues. There is great interest in elucidating the molecular basis and functional importance of epigenetic mechanisms in both normal physiology and disease – particularly in cancer, where abnormal ‘-omic’ states are often observed. In this article we review recent progress in studies of epigenetic mechanisms in the most common primary adult brain cancer, glioblastoma multiforme. Three distinct areas are discussed. First, the evidence in support of ongoing ‘normal’ epigenetic processes associated with differentiation – as predicted by ‘cancer stem cell’ models of the disease. Second, identification of site-specific and global epigenetic abnormalities. Third, genetic disruptions directly within the core epigenetic machinery, exemplified by the recently identified mutations within isocitrate dehydrogenase genes IDH1/2 and variant histone genes H3.3/H3F3A. These constitute the ‘good, the bad and the ugly’ of epigenetic mechanisms in cancer.
Epigenetics; Glioma; DNA methylation; Central nervous system (CNS); Differentiation; Cancer stem cells
Human papillomavirus positive (HPV+) head and neck squamous cell carcinoma (HNSCC) is an emerging disease, representing a distinct clinical and epidemiological entity. Understanding the genetic basis of this specific subtype of cancer could allow therapeutic targeting of affected pathways for a stratified medicine approach.
Twenty HPV+ and 20 HPV- laser-capture microdissected oropharyngeal carcinomas were used for paired-end sequencing of hybrid-captured DNA, targeting 3,230 exons in 182 genes often mutated in cancer. Copy number alteration (CNA) profiling, Sequenom MassArray sequencing and immunohistochemistry were used to further validate findings.
HPV+ and HPV- oropharyngeal carcinomas cluster into two distinct subgroups. TP53 mutations are detected in 100% of HPV negative cases and abrogation of the G1/S checkpoint by CDKN2A/B deletion and/or CCND1 amplification occurs in the majority of HPV- tumors.
These findings strongly support a causal role for HPV, acting via p53 and RB pathway inhibition, in the pathogenesis of a subset of oropharyngeal cancers and suggest that studies of CDK inhibitors in HPV- disease may be warranted. Mutation and copy number alteration of PI3 kinase (PI3K) pathway components appears particularly prevalent in HPV+ tumors and assessment of these alterations may aid in the interpretation of current clinical trials of PI3K, AKT, and mTOR inhibitors in HNSCC.
Major histocompatibility complex (MHC) genes play a critical role in vertebrate immune response and because the MHC is linked to a significant number of auto-immune and other diseases it is of great medical interest. Here we describe the clone-based sequencing and subsequent annotation of the MHC region of the gorilla genome. Because the MHC is subject to extensive variation, both structural and sequence-wise, it is not readily amenable to study in whole genome shotgun sequence such as the recently published gorilla genome. The variation of the MHC also makes it of evolutionary interest and therefore we analyse the sequence in the context of human and chimpanzee. In our comparisons with human and re-annotated chimpanzee MHC sequence we find that gorilla has a trimodular RCCX cluster, versus the reference human bimodular cluster, and additional copies of Class I (pseudo)genes between Gogo-K and Gogo-A (the orthologues of HLA-K and -A). We also find that Gogo-H (and Patr-H) is coding versus the HLA-H pseudogene and, conversely, there is a Gogo-DQB2 pseudogene versus the HLA-DQB2 coding gene. Our analysis, which is freely available through the VEGA genome browser, provides the research community with a comprehensive dataset for comparative and evolutionary research of the MHC.
Epigenetic changes have been associated with ageing and cancer. Identifying and interpreting epigenetic changes associated with such phenotypes may benefit from integration with protein interactome models. We here develop and validate a novel integrative epigenome-interactome approach to identify differential methylation interactome hotspots associated with a phenotype of interest. We apply the algorithm to cancer and ageing, demonstrating the existence of hotspots associated with these phenotypes. Importantly, we discover tissue independent age-associated hotspots targeting stem-cell differentiation pathways, which we validate in independent DNA methylation data sets, encompassing over 1000 samples from different tissue types. We further show that these pathways would not have been discovered had we used a non-network based approach and that the use of the protein interaction network improves the overall robustness of the inference procedure. The proposed algorithm will be useful to any study seeking to identify interactome hotspots associated with common phenotypes.
Human papillomavirus-positive (HPV+) head and neck squamous cell carcinoma (HNSCC) represents a distinct clinical and epidemiological condition compared with HPV-negative (HPV-) HNSCC. To test the possible involvement of epigenetic modulation by HPV in HNSCC, we conducted a genome-wide DNA-methylation analysis.
Using laser-capture microdissection of 42 formalin-fixed paraffin wax-embedded (FFPE) HNSCCs, we generated DNA-methylation profiles of 18 HPV+ and 14 HPV- samples, using Infinium 450 k BeadArray technology. Methylation data were validated in two sets of independent HPV+/HPV- HNSCC samples (fresh-frozen samples and cell lines) using two independent methods (Infinium 450 k and whole-genome methylated DNA immunoprecipitation sequencing (MeDIP-seq)). For the functional analysis, an HPV- HNSCC cell line was transduced with lentiviral constructs containing the two HPV oncogenes (E6 and E7), and effects on methylation were assayed using the Infinium 450 k technology.
Results and discussion
Unsupervised clustering over the methylation variable positions (MVPs) with greatest variation showed that samples segregated in accordance with HPV status, but also that HPV+ tumors are heterogeneous. MVPs were significantly enriched at transcriptional start sites, leading to the identification of a candidate CpG island methylator phenotype in a sub-group of the HPV+ tumors. Supervised analysis identified a strong preponderance (87%) of MVPs towards hypermethylation in HPV+ HNSCC. Meta-analysis of our HNSCC and publicly available methylation data in cervical and lung cancers confirmed the observed DNA-methylation signature to be HPV-specific and tissue-independent. Grouping of MVPs into functionally more significant differentially methylated regions identified 43 hypermethylated promoter DMRs, including for three cadherins of the Polycomb group target genes. Integration with independent expression data showed strong negative correlation, especially for the cadherin gene-family members. Combinatorial ectopic expression of the two HPV oncogenes (E6 and E7) in an HPV- HNSCC cell line partially phenocopied the hypermethylation signature seen in HPV+ HNSCC tumors, and established E6 as the main viral effector gene.
Our data establish that archival FFPE tissue is very suitable for this type of methylome analysis, and suggest that HPV modulates the HNSCC epigenome through hypermethylation of Polycomb repressive complex 2 target genes such as cadherins, which are implicated in tumor progression and metastasis.
Despite the success of genome-wide association studies (GWAS) in identifying loci associated with common diseases, a significant proportion of the causality remains unexplained. Recent advances in genomic technologies have placed us in a position to initiate large-scale studies of human disease-associated epigenetic variation, specifically variation in DNA methylation (DNAm). Such Epigenome-Wide Association Studies (EWAS) present novel opportunities but also create new challenges that are not encountered in GWAS. We discuss EWAS study design, cohort and sample selections, statistical significance and power, confounding factors, and follow-up studies. We also discuss how integration of EWAS with GWAS can help to dissect complex GWAS haplotypes for functional analysis.
Epigenomics; Disease Genetics; DNA Methylation; Epigenetics; Quantitative Trait
Motivation: The Illumina Infinium 450 k DNA Methylation Beadchip is a prime candidate technology for Epigenome-Wide Association Studies (EWAS). However, a difficulty associated with these beadarrays is that probes come in two different designs, characterized by widely different DNA methylation distributions and dynamic range, which may bias downstream analyses. A key statistical issue is therefore how best to adjust for the two different probe designs.
Results: Here we propose a novel model-based intra-array normalization strategy for 450 k data, called BMIQ (Beta MIxture Quantile dilation), to adjust the beta-values of type2 design probes into a statistical distribution characteristic of type1 probes. The strategy involves application of a three-state beta-mixture model to assign probes to methylation states, subsequent transformation of probabilities into quantiles and finally a methylation-dependent dilation transformation to preserve the monotonicity and continuity of the data. We validate our method on cell-line data, fresh frozen and paraffin-embedded tumour tissue samples and demonstrate that BMIQ compares favourably with two competing methods. Specifically, we show that BMIQ improves the robustness of the normalization procedure, reduces the technical variation and bias of type2 probe values and successfully eliminates the type1 enrichment bias caused by the lower dynamic range of type2 probes. BMIQ will be useful as a preprocessing step for any study using the Illumina Infinium 450 k platform.
Availability: BMIQ is freely available from http://code.google.com/p/bmiq/.
Supplementary data are available at Bioinformatics online
BMC Research Notes recently published a research article regarding the use of ligated DNA extracted from formalin-fixed paraffin embedded (FFPE) tissue on the Illumina Infinium methylation platform - “Interpretation of genome-wide infinium methylation data from ligated DNA in formalin-fixed, paraffin-embedded paired tumor and normal tissue” Jasmine et al. BMC Research Notes 2012, 5:117. This article repeatedly refers to our previous work and concludes that methylation data obtained from ligated FFPE extracted DNA should be used with great caution. In this Discussion we review the data analysis performed in Jasmine et al’s paper and suggest limitations which subsequently lead the authors to draw what we believe are incorrect conclusions. Moreover, we continue to analyse genome-wide methylation data from DNA extracted from FFPE tissue successfully on both the HumMeth27 and 450 K arrays.
Regulatory change has long been hypothesized to drive the delineation of the human phenotype from other closely related primates. Here we provide evidence that CpG dinucleotides play a special role in this process. CpGs enable epigenome variability via DNA methylation, and this epigenetic mark functions as a regulatory mechanism. Therefore, species-specific CpGs may influence species-specific regulation. We report non-polymorphic species-specific CpG dinucleotides (termed “CpG beacons”) as a distinct genomic feature associated with CpG island (CGI) evolution, human traits and disease. Using an inter-primate comparison, we identified 21 extreme CpG beacon clusters (≥ 20/kb peaks, empirical p < 1.0 × 10−3) in humans, which include associations with four monogenic developmental and neurological disease related genes (Benjamini-Hochberg corrected p = 6.03 × 10−3). We also demonstrate that beacon-mediated CpG density gain in CGIs correlates with reduced methylation in these species in orthologous CGIs over time, via human, chimpanzee and macaque MeDIP-seq. Therefore mapping into both the genomic and epigenomic space the identified CpG beacon clusters define points of intersection where a substantial two-way interaction between genetic sequence and epigenetic state has occurred. Taken together, our data support a model for CpG beacons to contribute to CGI evolution from genesis to tissue-specific to constitutively active CGIs.
epigenetics; epigenomics; CpG islands; gene regulation; evolution; human disease
Methylated DNA immunoprecipitation (MeDIP) is a popular enrichment based method and can be combined with sequencing (termed MeDIP-seq) to interrogate the methylation status of cytosines across entire genomes. However, quality control and analysis of MeDIP-seq data have remained to be a challenge.
We report genome-wide DNA methylation profiles of wild type (wt) and mutant mouse cells, comprising 3 biological replicates of Thymine DNA glycosylase (Tdg) knockout (KO) embryonic stem cells (ESCs), in vitro differentiated neural precursor cells (NPCs) and embryonic fibroblasts (MEFs). The resulting 18 methylomes were analysed with MeDUSA (Methylated DNA Utility for Sequence Analysis), a novel MeDIP-seq computational analysis pipeline for the identification of differentially methylated regions (DMRs). The observed increase of hypermethylation in MEF promoter-associated CpG islands supports a previously proposed role for Tdg in the protection of regulatory regions from epigenetic silencing. Further analysis of genes and regions associated with the DMRs by gene ontology, pathway, and ChIP analyses revealed further insights into Tdg function, including an association of TDG with low-methylated distal regulatory regions.
We demonstrate that MeDUSA is able to detect both large-scale changes between cells from different stages of differentiation and also small but significant changes between the methylomes of cells that only differ in the KO of a single gene. These changes were validated utilising publicly available datasets and confirm TDG's function in the protection of regulatory regions from epigenetic silencing.
Methylome; MeDIP-seq; Epigenetics; Epigenomics; DNA methylation; Computational pipeline; MeDUSA
Regulatory T cells (Tregs) manipulated ex vivo have potential as cellular therapeutics in autoimmunity and transplantation. Although it is possible to expand naturally occurring Tregs, an attractive alternative possibility, particularly suited to solid organ and bone marrow transplantation, is the stimulation of total T cell populations with defined allogeneic antigen presenting cells under conditions that lead to the generation or expansion of donor-reactive, adaptive Tregs. Here we demonstrate that stimulation of mouse CD4+ T cells by immature allogeneic dendritic cells (DCs) combined with pharmacological inhibition of phosphodiesterase 3 (PDEi) results in a functional enrichment of Foxp3+ T cells. Without further manipulation or selection, the resultant population delayed skin allograft rejection mediated by polyclonal CD4+ effectors or donor-reactive CD8+ TCR transgenic T cells and inhibited both effector cell proliferation and T cell priming for IFN-γ production. Notably, PDE inhibition also enhanced the enrichment of human Foxp3+ CD4+ T cells driven by allogeneic APC. These cells inhibited T cell proliferation in a standard in vitro mixed lymphocyte assay and importantly, attenuated the development of vasculopathy mediated by autologous PBMC in a functionally relevant humanized mouse transplant model. These data establish a method for the ex vivo generation of graft-reactive, functional mouse and human Tregs that uses a clinically approved agent, making pharmacological PDE inhibition a potential strategy for Treg-based therapies