Background: Smoking increases the risk of many diseases, and it is also linked to blood DNA methylation changes that may be important in disease etiology.
Objectives: We sought to identify novel CpG sites associated with cigarette smoking.
Methods: We used two epigenome-wide data sets from the Sister Study to identify and confirm CpG sites associated with smoking. One included 908 women with methylation measurements at 27,578 CpG sites using the HumanMethylation27 BeadChip; the other included 200 women with methylation measurements for 473,844 CpG sites using the HumanMethylation450 BeadChip. Significant CpGs from the second data set that were not included in the 27K assay were validated by pyrosequencing in a subset of 476 samples from the first data set.
Results: Our study successfully confirmed smoking associations for 9 previously established CpGs and identified 2 potentially novel CpGs: cg26764244 in GNG12 (p = 9.0 × 10–10) and cg22335340 in PTPN6 (p = 2.9 × 10–05). We also found strong evidence of an association between smoking status and cg02657160 in CPOX (p = 7.3 × 10–7), which has not been previously reported. All 12 CpGs were undermethylated in current smokers and showed an increasing percentage of methylation in former and never-smokers.
Conclusions: We identified 2 potentially novel smoking related CpG sites, and provided independent replication of 10 previously reported CpGs sites related to smoking, one of which is situated in the gene CPOX. The corresponding enzyme is involved in heme biosynthesis, and smoking is known to increase heme production. Our study extends the evidence base for smoking-related changes in DNA methylation.
Citation: Harlid S, Xu Z, Panduri V, Sandler DP, Taylor JA. 2014. CpG sites associated with cigarette smoking: analysis of epigenome-wide data from the Sister Study. Environ Health Perspect 122:673–678; http://dx.doi.org/10.1289/ehp.1307480
Epigenetic mechanisms are implicated in gene regulation and the development of different diseases. The epigenome differs between cell types and has until now only been characterized for a few human tissues. Environmental factors potentially alter the epigenome. Here we describe the genome-wide pattern of DNA methylation in human adipose tissue from 23 healthy men, with a previous low level of physical activity, before and after a six months exercise intervention. We also investigate the differences in adipose tissue DNA methylation between 31 individuals with or without a family history of type 2 diabetes. DNA methylation was analyzed using Infinium HumanMethylation450 BeadChip, an array containing 485,577 probes covering 99% RefSeq genes. Global DNA methylation changed and 17,975 individual CpG sites in 7,663 unique genes showed altered levels of DNA methylation after the exercise intervention (q<0.05). Differential mRNA expression was present in 1/3 of gene regions with altered DNA methylation, including RALBP1, HDAC4 and NCOR2 (q<0.05). Using a luciferase assay, we could show that increased DNA methylation in vitro of the RALBP1 promoter suppressed the transcriptional activity (p = 0.03). Moreover, 18 obesity and 21 type 2 diabetes candidate genes had CpG sites with differences in adipose tissue DNA methylation in response to exercise (q<0.05), including TCF7L2 (6 CpG sites) and KCNQ1 (10 CpG sites). A simultaneous change in mRNA expression was seen for 6 of those genes. To understand if genes that exhibit differential DNA methylation and mRNA expression in human adipose tissue in vivo affect adipocyte metabolism, we silenced Hdac4 and Ncor2 respectively in 3T3-L1 adipocytes, which resulted in increased lipogenesis both in the basal and insulin stimulated state. In conclusion, exercise induces genome-wide changes in DNA methylation in human adipose tissue, potentially affecting adipocyte metabolism.
Given the important role of epigenetics in gene regulation and disease development, we here present the genome-wide DNA methylation pattern of 476,753 CpG sites in adipose tissue obtained from healthy men. Since environmental factors potentially change metabolism through epigenetic modifications, we examined if a six months exercise intervention alters the DNA methylation pattern as well as gene expression in human adipose tissue. Our results show that global DNA methylation changes and 17,975 individual CpG sites alter the levels of DNA methylation in response to exercise. We also found differential DNA methylation of 39 candidate genes for obesity and type 2 diabetes in human adipose tissue after exercise. Additionally, we provide functional proof that genes, which exhibit both differential DNA methylation and gene expression in human adipose tissue in response to exercise, influence adipocyte metabolism. Together, this study provides the first detailed map of the genome-wide DNA methylation pattern in human adipose tissue and links exercise to altered adipose tissue DNA methylation, potentially affecting adipocyte metabolism.
OBJECTIVE— Genetic and environmental factors modulate the susceptibility to diabetic nephropathy, as initiating and/or progression factors. The objective of the European Rational Approach for the Genetics of Diabetic Complications (EURAGEDIC) study is to identify nephropathy susceptibility genes. We report molecular genetic studies for 127 candidate genes for nephropathy.
RESEARCH DESIGN AND METHODS— Polymorphisms were identified through sequencing of promoter, exon, and flanking intron gene regions and a database search. A total of 344 nonredundant SNPs and nonsynonymous variants were tested for association with diabetic nephropathy (persistent albuminuria ≥300 mg/24 h) in a large type 1 diabetes case/control (1,176/1,323) study from three European populations.
RESULTS— Only one SNP, rs2281999, located in the UNC13B gene, was significantly associated with nephropathy after correction for multiple testing. Analyses of 21 additional markers fully characterizing the haplotypic variability of the UNC13B gene showed consistent association of SNP rs13293564 (G/T) located in intron 1 of the gene with nephropathy in the three populations. The odds ratio (OR) for nephropathy associated with the TT genotype was 1.68 (95% CI 1.29–2.19) (P = 1.0 × 10−4). This association was replicated in an independent population of 412 case subjects and 614 control subjects (combined OR of 1.63 [95% CI 1.30–2.05], P = 2.3 × 10−5).
CONCLUSIONS— We identified a polymorphism in the UNC13B gene associated with nephropathy. UNC13B mediates apopotosis in glomerular cells in the presence of hyperglycemia, an event occurring early in the development of nephropathy. We propose that this polymorphism could be a marker for the initiation of nephropathy. However, further studies are needed to clarify the role of UNC13B in nephropathy.
Offspring exposed to gestational diabetes mellitus (GDM) have an increased risk for chronic diseases, and one promising mechanism for fetal metabolic programming is epigenetics. Therefore, we postulated that GDM exposure impacts the offspring’s methylome and used an epigenomic approach to explore this hypothesis. Placenta and cord blood samples were obtained from 44 newborns, including 30 exposed to GDM. Women were recruited at first trimester of pregnancy and followed until delivery. GDM was assessed after a 75-g oral glucose tolerance test at 24–28 weeks of pregnancy. DNA methylation was measured at > 485,000 CpG sites (Infinium HumanMethylation450 BeadChips). Ingenuity Pathway Analysis was conducted to identify metabolic pathways epigenetically affected by GDM. Our results showed that 3,271 and 3,758 genes in placenta and cord blood, respectively, were potentially differentially methylated between samples exposed or not to GDM (p-values down to 1 × 10−06; none reached the genome-wide significance levels), with more than 25% (n = 1,029) being common to both tissues. Mean DNA methylation differences between groups were 5.7 ± 3.2% and 3.4 ± 1.9% for placenta and cord blood, respectively. These genes were likely involved in the metabolic diseases pathway (up to 115 genes (11%), p-values for pathways = 1.9 × 10−13 < p < 4.0 × 10−03; including diabetes mellitus p = 4.3 × 10−11). Among the differentially methylated genes, 326 in placenta and 117 in cord blood were also associated with newborn weight. Our results therefore suggest that GDM has epigenetic effects on genes preferentially involved in the metabolic diseases pathway, with consequences on fetal growth and development, and provide supportive evidence that DNA methylation is involved in fetal metabolic programming.
epigenetics; epigenome-wide; in utero; maternal hyperglycemia; DNA methylation; fetal metabolic programming
Antiepileptic drugs (AEDs) are used to treat a variety of neuropsychiatric illnesses commonly encountered in women during their reproductive years, including epilepsy and bipolar disorder. Despite their widespread use, the impact of prenatal exposure on fetal development remains obscure. To evaluate whether AEDs taken by pregnant mothers influence DNA methylation patterns in their neonates, DNA was extracted from the umbilical cord blood of 201 neonates whose mothers were treated for neuropsychiatric illness during pregnancy and interrogated across 27,578 CpG sites using the Illumina HumanMethylation27 BeadChip. The association of each methylation value with the cumulative duration of prenatal AED exposure was examined using a linear mixed model. The average methylation level across all CpG sites was calculated for each subject, and this global methylation measure was evaluated similarly. Neonates with a longer duration of AED exposure in pregnancy showed a decrease in average global methylation (p = 0.0045). Further, DNA methylation of CpG sites in 14 genes significantly decreased with the duration of prenatal AED exposure even after adjusting for multiple comparisons (FDR < 0.05). For a small subset (n = 19) of these neonates, a second tissue, placenta, was available in addition to cord blood. Methylation of 3 of these 14 CpG sites was also significantly decreased in placental tissue. These novel data suggest decreased DNA methylation in neonates of mothers who took AEDs during pregnancy. The long-term stability and potential impact of these changes warrant further attention, and caution may be warranted before prescribing AEDs to pregnant women.
Antiepileptic drugs (AEDs); DNA Methylation; epigenetic; HumanMethylation27 BeadChip; lamotrigine; neonatal; pregnancy
All tissues of the organism are affected by aging. This process is associated with epigenetic modifications such as methylation changes at specific cytosine residues in the DNA (CpG sites). Here, we have identified an Epigenetic-Aging-Signature which is applicable for many tissues to predict donor age. DNA-methylation profiles of various cell types were retrieved from public data depositories - all using the HumanMethylation27 BeadChip platform which represents 27,578 CpG sites. Five datasets from dermis, epidermis, cervical smear, T-cells and monocytes were used for Pavlidis Template Matching to identify 19 CpG sites that are continuously hypermethylated upon aging (R > 0.6; p-value <10−13). Four of these CpG sites (associated with the genes NPTX2, TRIM58, GRIA2 and KCNQ1DN) and an additional hypomethylated CpG site (BIRC4BP) were implemented in a model to predict donor age. This Epigenetic-Aging-Signature was tested on a validation group of eight independent datasets corresponding to several cell types from different tissues. Overall, the five CpG sites revealed age-associated DNA-methylation changes in all tissues. The average absolute difference between predicted and real chronological age was about 11 years. This method can be used to predict donor age in various cell preparations - for example in forensic analysis.
DNA methylation; epigenetic; predictor; age; tissue
Aberrant DNA methylation (DNAm) is a feature of most types of cancers. Genome-wide DNAm profiling has been performed successfully on tumor tissue DNA samples. However, the invasive procedure limits the utility of tumor tissue for epidemiological studies. While recent data indicate that cell-free circulating DNAm (cfDNAm) profiles reflect DNAm status in corresponding tumor tissues, no studies have examined the association of cfDNAm with cancer or precursors on a genome-wide scale. The objective of this pilot study was to evaluate the putative significance of genome-wide cfDNAm profiles in esophageal adenocarcinoma (EA) and Barrett esophagus (BE, EA precursor). We performed genome-wide DNAm profiling in EA tissue DNA (n = 8) and matched serum DNA (n = 8), in serum DNA of BE (n = 10), and in healthy controls (n = 10) using the Infinium HumanMethylation27 BeadChip that covers 27,578 CpG loci in 14,495 genes. We found that cfDNAm profiles were highly correlated to DNAm profiles in matched tumor tissue DNA (r = 0.92) in patients with EA. We selected the most differentially methylated loci to perform hierarchical clustering analysis. We found that 911 loci can discriminate perfectly between EA and control samples, 554 loci can separate EA from BE samples, and 46 loci can distinguish BE from control samples. These results suggest that genome-wide cfDNAm profiles are highly consistent with DNAm profiles detected in corresponding tumor tissues. Differential cfDNAm profiling may be a useful approach for the noninvasive screening of EA and EA premalignant lesions.
Modification of DNA by methylation of cytosines at CpG dinucleotides is a widespread phenomenon that leads to changes in gene expression, thereby influencing and regulating many biological processes. Recent technical advances in the genome-wide determination of single-base DNA-methylation enabled epigenome-wide association studies (EWASs). Early EWASs established robust associations between age and gender with the degree of CpG methylation at specific sites. Other studies uncovered associations with cigarette smoking. However, so far these studies were mainly conducted in Caucasians, raising the question of whether these findings can also be extrapolated to other populations.
Here, we present an EWAS with age, gender, and smoking status in a family study of 123 individuals of Arab descent. We determined DNA methylation at over 450,000 CpG sites using the Illumina Infinium HumanMethylation450 BeadChip, applied state-of-the-art data processing protocols, including correction for blood cell type heterogeneity and hidden confounders, and eliminated probes containing SNPs at the targeted CpG site using 40× whole-genome sequencing data. Using this approach, we could replicate the leading published EWAS associations with age, gender and smoking, and recovered hallmarks of gender-specific epigenetic changes. Interestingly, we could even replicate the recently reported precise prediction of chronological age based on the methylation of only a few selected CpG sites.
Our study supports the view that when applied with state-of-the art protocols to account for all potential confounders, DNA methylation arrays represent powerful tools for EWAS with more complex phenotypes that can also be successfully applied to non-Caucasian populations.
Electronic supplementary material
The online version of this article (doi:10.1186/s13148-014-0040-6) contains supplementary material, which is available to authorized users.
DNA methylation; Age; Gender; Smoking; Association study; Epigenetics
Hepatocellular carcinoma (HCC) incidence has increased in the US and also has one of the fastest growing death rates of any cancer. The purpose of the current study was to discover novel genome-wide aberrant DNA methylation patterns in HCC tumors that are predominantly HCV-related. Infinium HumanMethylation 450K BeadChip arrays were used to examine genome-wide DNA methylation profiles in 66 pairs of HCC tumor and adjacent non-tumor tissues. After Bonferroni adjustment, a total of 130,512 CpG sites significantly differed in methylation level in tumor compared with non-tumor tissues, with 28,017 CpG sites hypermethylated and 102,495 hypomethylated in tumor tissues. Absolute tumor/non-tumor methylation differences ≥ 20% were found in 24.9% of the hypermethylated and 43.1% of the hypomethylated CpG sites; almost 10,000 CpG sites have ≥ 30% DNA methylation differences. Most (60.1%) significantly hypermethylated CpG sites are located in CpG islands, with 21.6% in CpG shores and 3.6% in shelves. In contrast, only a small proportion (8.2%) of significantly hypomethylated CpG sites are situated in islands, while most are found in open sea (60.2%), shore (17.3%) or shelf (14.3%) regions. A total of 2,568 significant CpG sites (2,441 hypermethylated and 127 hypomethylated) covering 589 genes are located within 684 differentially methylated regions defined as regions with at least two significant CpG sites displaying > 20% methylation differences in the same direction within 250-bp. The top 500 significant CpG sites can significantly distinguish HCC tumor from adjacent tissues with one misclassification. Within adjacent non-tumor tissues, we also identified 75 CpG sites significantly associated with gender, 228 with HCV infection, 17,207 with cirrhosis, and 56 with both HCV infection and cirrhosis after multiple comparisons adjustment. Aberrant DNA methylation profiles across the genome were identified in tumor tissues from US HCC cases that are predominantly related to HCV infection. These results demonstrate the significance of aberrant DNA methylation in HCC tumorigenesis.
genome-wide; DNA methylation; alterations; hepatocellular carcinoma; 450K BeadChips
Single nucleotide polymorphisms (SNPs) in MYH9 and APOL1 on chromosome 22 (c22) are powerfully associated with non-diabetic end-stage renal disease (ESRD) in African Americans (AAs). Many AAs diagnosed with type 2 diabetic nephropathy (T2DN) have non-diabetic kidney disease, potentially masking detection of DN genes. Therefore, genome-wide association analyses were performed using the Affymetrix SNP Array 6.0 in 966 AA with T2DN and 1,032 non-diabetic, non-nephropathy (NDNN) controls, with and without adjustment for c22 nephropathy risk variants. No associations were seen between FRMD3 SNPs and T2DN before adjusting for c22 variants. However, logistic regression analysis revealed seven FRMD3 SNPs significantly interacting with MYH9—a finding replicated in 640 additional AA T2DN cases and 683 NDNN controls. Contrasting all 1,592 T2DN cases with all 1,671 NDNN controls, FRMD3 SNPs appeared to interact with the MYH9 E1 haplotype (e.g., rs942280 interaction p-value = 9.3E−7 additive; odds ratio [OR] 0.67). FRMD3 alleles were associated with increased risk of T2DN only in subjects lacking two MYH9 E1 risk haplotypes (rs942280 OR = 1.28), not in MYH9 E1 risk allele homozygotes (rs942280 OR = 0.80; homogeneity p-value = 4.3E−4). Effects were weaker stratifying on APOL1. FRMD3 SNPS were associated with T2DN, not type 2 diabetes per se, comparing AAs with T2DN to those with diabetes lacking nephropathy. T2DN-associated FRMD3 SNPs were detectable in AAs only after accounting for MYH9, with differential effects for APOL1. These analyses reveal a role for FRMD3 in AA T2DN susceptibility and accounting for c22 nephropathy risk variants can assist in detecting DN susceptibility genes.
African Americans have high rates of kidney disease attributed to type 2 diabetes mellitus. However, approximately 25% of patients are misclassified and have non-diabetic kidney disease on renal biopsy. The APOL1-MYH9 gene region on chromosome 22 is powerfully associated with non-diabetic kidney diseases in African Americans. Therefore, we tested for interactions between single nucleotide polymorphisms across the genome with APOL1 and MYH9 non-diabetic nephropathy risk variants in African Americans with presumed diabetic nephropathy. Markers in FRMD3, a gene associated with type 1 diabetic nephropathy in Caucasians, appeared to interact with MYH9; however, increased nephropathy risk was seen in diabetic cases lacking two MYH9 risk haplotypes, and protective effects were seen in those with two MYH9 risk haplotypes. Stratified analyses based on the chromosome 22 nephropathy risk haplotypes demonstrated that FRMD3 variants were associated with diabetic nephropathy risk in cases without two MYH9 (or APOL1) risk haplotypes. It appears that African Americans with diabetes and kidney disease who are not chromosome 22 nephropathy risk variant homozygotes are enriched for the presence of diabetic nephropathy and FRMD3 risk alleles. This genetic dissection ultimately allowed for detection of the FRMD3 diabetic nephropathy gene association in a subset of cases enriched for this disorder.
The prevalence of asthma in girls increases after puberty. Previous studies have detected associations between sex hormones and asthma, as well as between sex hormones and T helper 2 (Th2) asthma-typical immune responses. Therefore, we hypothesized that exogenous or endogenous sex hormone exposure (represented by oral contraceptive pill (OCP) use and early menarche, respectively) are associated with DNA methylation (DNA-M) of the Th2 transcription factor gene, GATA3, in turn affecting the risk of asthma in girls, possibly in interaction with genetic variants.
Blood samples were collected from 245 female participants aged 18 years randomly selected for methylation analysis from the Isle of Wight birth cohort, UK. Information on use of OCPs, age at menarche, and concurrent asthma were assessed by questionnaire. Genome-wide DNA-M was determined using the Illumina Infinium HumanMethylation450 beadchip. In a first stage, we tested the interaction between sex hormone exposure and genetic variants on DNA-M of specific cytosine-phosphate-guanine (CpG) sites. In a second stage, we determined whether these CpG sites interact with genetic variants in GATA3 to explain the risk of asthma.
Interactions between OCP use and seven single nucleotide polymorphisms (SNPs) of GATA3 were analyzed for 14 CpG sites (stage 1). The interaction between OCP use and SNP rs1269486 was found to be associated with the methylation level of cg17124583 (P = 0.002, false discovery rate (FDR) adjusted P = 0.04). DNA-M of this same CpG site was also influenced by the interaction between age at menarche and rs1269486 (P = 0.0017). In stage 2, we found that cg17124583 modified the association of SNP rs422628 with asthma risk at the age of 18 years (P = 0.006, FDR adjusted P = 0.04). Subjects with genotype AG showed an increase in average risk ratio (RR) from 0.31 (95% CI: 0.10 to 0.8) to 11.65 (95% CI: 1.71 to 79.5) when methylation level increased from 0.02 to 0.12, relative to genotype AA.
A two-stage model consisting of genetic variants in the GATA3 gene, OCP use, age at menarche, and DNA-M may explain how sex hormones in women can increase the asthma prevalence after puberty.
GATA3 gene; DNA methylation; genetic variants; epigenetics; oral contraceptives; age at menarche; asthma; puberty; adolescence; single nucleotide polymorphism; CpG
Both genetic and epigenetic factors influence the development and progression of epithelial ovarian cancer (EOC). However, there is an incomplete understanding of the interrelationship between these factors and the extent to which they interact to impact disease risk. In the present study, we aimed to gain insight into this relationship by identifying DNA methylation marks that are candidate mediators of ovarian cancer genetic risk.
We used 214 cases and 214 age-matched controls from the Mayo Clinic Ovarian Cancer Study. Pretreatment, blood-derived DNA was profiled for genome-wide methylation (Illumina Infinium HumanMethylation27 BeadArray) and single nucleotide polymorphisms (SNPs, Illumina Infinium HD Human610-Quad BeadArray). The Causal Inference Test (CIT) was implemented to distinguish CpG sites that mediate genetic risk, from those that are consequential or independently acted on by genotype.
Controlling for the estimated distribution of immune cells and other key covariates, our initial epigenome-wide association analysis revealed 1,993 significantly differentially methylated CpGs that between cases and controls (FDR, q < 0.05). The relationship between methylation and case-control status for these 1,993 CpGs was found to be highly consistent with the results of previously published, independent study that consisted of peripheral blood DNA methylation signatures in 131 pretreatment cases and 274 controls. Implementation of the CIT test revealed 17 CpG/SNP pairs, comprising 13 unique CpGs and 17 unique SNPs, which represent potential methylation-mediated relationships between genotype and EOC risk. Of these 13 CpGs, several are associated with immune related genes and genes that have been previously shown to exhibit altered expression in the context of cancer.
These findings provide additional insight into EOC etiology and may serve as novel biomarkers for EOC susceptibility.
Integrative genomics; Ovarian cancer; Blood-based DNA methylation
Evidence supports a role for epigenetic mechanisms in the pathogenesis of late-onset Alzheimer’s disease (LOAD), but little has been done on a genome-wide scale to identify potential sites involved in disease. This study investigates human postmortem frontal cortex genome-wide DNA methylation profiles between 12 LOAD and 12 cognitively normal age- and gender-matched subjects. Quantitative DNA methylation is determined at 27,578 CpG sites spanning 14,475 genes via the Illumina Infinium HumanMethylation27 BeadArray. Data are analyzed using parallel linear models adjusting for age and gender with empirical Bayes standard error methods. Gene-specific technical and functional validation is performed on an additional 13 matched pair samples, encompassing a wider age range. Analysis reveals 948 CpG sites representing 918 unique genes as potentially associated with LOAD disease status pending confirmation in additional study populations. Across these 948 sites the subtle mean methylation difference between cases and controls is 2.9%. The CpG site with a minimum false discovery rate located in the promoter of the gene Transmembrane Protein 59 (TMEM59) is 7.3% hypomethylated in cases. Methylation at this site is functionally associated with tissue RNA and protein levels of the TMEM59 gene product. The TMEM59 gene identified from our discovery approach was recently implicated in amyloid-β protein precursor post-translational processing, supporting a role for epigenetic change in LOAD pathology. This study demonstrates widespread, modest discordant DNA methylation in LOAD-diseased tissue independent from DNA methylation changes with age. Identification of epigenetic biomarkers of LOAD risk may allow for the development of novel diagnostic and therapeutic targets.
DNA methylation; epigenetics; late onset Alzheimer’s disease; prefrontal cortex
Background: Schizophrenia is a complex psychiatric disorder with a lifetime morbidity rate of 0.5–1.0%. The pathophysiology of schizophrenia still remains obscure. Accumulating evidence indicates that DNA methylation, which is the addition of a methyl group to the cytosine in a CpG dinucleotide, might play an important role in the pathogenesis of schizophrenia.
Methods: To gain further insight into the molecular mechanisms underlying schizophrenia, a genome-wide DNA methylation profiling (27,578 CpG dinucleotides spanning 14,495 genes) of the human dorsolateral prefrontal cortex (DLPFC) was conducted in a large cohort (n = 216) of well characterized specimens from individuals with schizophrenia and non-psychiatric controls, combined with an analysis of genetic variance at ~880,000 SNPs.
Results: Aberrant DNA methylation in schizophrenia was identified at 107 CpG sites at 5% Bonferroni correction (p < 1.99 × 10−6). Of these significantly altered sites, hyper-DNA methylation was observed at 79 sites (73.8%), mostly in the CpG islands (CGIs) and in the regions flanking CGIs (CGI: 31 sites; CGI shore: 35 sites; CGI shelf: 3 sites). Furthermore, a large number of cis-methylation quantitative trait loci (mQTL) were identified, including associations with risk SNPs implicated in schizophrenia.
Conclusions: These results suggest that altered DNA methylation might be involved in the pathophysiology and/or treatment of schizophrenia, and that a combination of epigenetic and genetic approaches will be useful to understanding the molecular mechanism of this complex disorder.
schizophrenia; DNA methylation; SNP; postmortem; array; expression
Crohn disease (CD) and ulcerative colitis (UC) are common forms of inflammatory bowel diseases (IBD). Monozygotic (MZ) twin discordance rates and epidemiologic data implicate that environmental changes and epigenetic factors may play a pathogenic role in IBD. DNA methylation (the methylation of cytosines within CpG dinucleotides) is an epigenetic modification, which can respond to environmental influences. We investigated whether DNA methylation might be connected with IBD in peripheral blood leukocyte (PBL) DNA by utilizing genome-wide microarrays.
Two different high-throughput microarray based methods for genome wide DNA methylation analysis were employed. First, DNA isolated from MZ twin pairs concordant (CD: 4; UC: 3) and discordant (CD: 4; UC: 7) for IBD was interrogated by a custom made methylation specific amplification microarray (MSAM). Second, the recently developed Illumina Infinium HumanMethylation450 BeadChip arrays were used on 48 samples of PBL DNA from discordant MZ twin pairs (CD:3; UC:3) and treatment naive pediatric cases of IBD (CD:14; UC:8), as well as controls (n=14). The microarrays were validated with bisulfite pyrosequencing.
The Methylation BeadChip approach identified a single DNA methylation association of IBD at TEPP (testis, prostate and placenta-expressed protein) when DNA isolated selectively from peripheral blood mononuclear cells was analyzed (8.6% increase in methylation between CD and control, FDR=0.0065).
Microarray interrogation of IBD dependent DNA methylation from PBLs appears to have limited ability to detect significant disease associations. More detailed and/or selective approaches may be useful for the elucidation of connections between the DNA methylome and IBD in the future.
inflammatory bowel disease; DNA methylation; peripheral blood; twin; TEPP
The objective of this study was to analyze genome-wide differential methylation patterns in maternal leukocyte DNA in early pregnant and non-pregnant states. This is an age and body mass index matched case-control study comparing the methylation patterns of 27,578 cytosine-guanine (CpG) sites in 14,495 genes in maternal leukocyte DNA in early pregnancy (n = 14), in the same women postpartum (n = 14), and in nulligravid women (n = 14) on a BeadChip platform. Transient widespread hypomethylation was found in early pregnancy as compared with the non-pregnant states. Methylation of nine genes was significantly different in early pregnancy compared with both postpartum and nulligravid states (< 10% False Discovery Rate). Early pregnancy may be characterized by widespread hypomethylation compared with non-pregnant states; there is no apparent permanent methylation imprint after a normal term gestation. Nine potential candidate genes were identified as differentially methylated in early pregnancy and may play a role in the maternal adaptation to pregnancy.
DNA Methylation; epigenetics; immune; leukocyte; maternal; postpartum; pregnancy
Pre-eclampsia is a serious complication of pregnancy that can affect both maternal and fetal outcomes. Early-onset pre-eclampsia (EOPET) is a severe form of pre-eclampsia that is associated with altered physiological characteristics and gene expression in the placenta. DNA methylation is a relatively stable epigenetic modification to DNA that can reflect gene expression, and can provide insight into the mechanisms underlying such expression changes. This case–control study focused on DNA methylation and gene expression of whole chorionic villi samples from 20 EOPET placentas and 20 gestational age-matched controls from pre-term births. DNA methylation was also assessed in placentas affected by late-onset pre-eclampsia (LOPET) and normotensive intrauterine growth restriction (nIUGR). The Illumina HumanMethylation450 BeadChip was used to assess DNA methylation at >480 000 cytosine-guanine dinucleotide (CpG) sites. The Illumina HT-12v4 Expression BeadChip was used to assess gene expression of >45 000 transcripts in a subset of cases and controls. DNA methylation analysis by pyrosequencing was used to follow-up the initial findings in four genes with a larger cohort of cases and controls, including nIUGR and LOPET placentas. Bioinformatic analysis was used to identify overrepresentation of gene ontology categories and transcription factor binding motifs. We identified 38 840 CpG sites with significant (false discovery rate <0.01) DNA methylation alterations in EOPET, of which 282 had >12.5% methylation difference compared with the controls. Significant sites were enriched at the enhancers and low CpG density regions of the associated genes and the majority (74.5%) of these sites were hypomethylated in EOPET. EOPET, but not associated clinical features, such as intrauterine growth restriction (IUGR), presented a distinct DNA methylation profile. CpG sites from four genes relevant to pre-eclampsia (INHBA, BHLHE40, SLC2A1 and ADAM12) showed different extent of changes in LOPET and nIUGR. Genome-wide expression in a subset of samples showed that some of the gene expression changes were negatively correlated with DNA methylation changes, particularly for genes that are responsible for angiogenesis (such as EPAS1 and FLT1). Results could be confounded by altered cell populations in abnormal placentas. Larger sample sizes are needed to fully address the possibility of sub-profiles of methylation within the EOPET cohort. Based on DNA methylation profiling, we conclude that there are widespread DNA methylation alterations in EOPET that may be associated with changes in placental function. This property may provide a useful tool for early screening of such placentas. This study identifies DNA methylation changes at many loci previously reported to have altered gene expression in EOPET placentas, as well as in novel biologically relevant genes we confirmed to be differentially expressed. These results may be useful for DNA- methylation-based non-invasive prenatal diagnosis of at-risk pregnancies.
pre-eclampsia; DNA methylation; placenta; 450 K array
To test the hypothesis that changes in DNA methylation are involved in vitamin D deficiency–related immune cell regulation using an unbiased genome-wide approach combined with a genomic and epigenomic integrative approach.
We performed a genome-wide methylation scan using the Illumina HumanMethylation 27 BeadChip on leukocytes DNAs of 11 cases of vitamin D deficiency (serum 25(OH)D≤ 25 nmol/L) and 11 age-matched controls (serum 25(OH)D>75 nmol/L); the subjects were African American normal-weight (BMI<85th percentile) males aged 14-19 years. The Limma package was used to analyze each CpG site for differential methylation between cases and controls. To correct for multiple testing, the set of raw p values were converted to false discovery rates (FDR). We also compared our findings with the recent data from GWAS of circulating 25(OH) D levels and then performed a permutation test to examine whether the “double hit” genes were randomly enriched.
A total of 79 CpG sites achieved raw p<0.001. Of the 79 CpG sites, 2 CpG sites survived multiple testing: cg16317961 (raw p=3.5 × 10-6, FDR=0.078, in MAPRE2) and cg04623955 (raw p=5.9 × 10-6, FDR=0.078, in DIO3). Furthermore, 3 out of the 4 genes previously identified in the two GWAS studies were also significant at the methylation level (DHCR7: cg07487535, p=0.015 & cg10763288, p=0.017; CYP2R1: cg25454890, p=0.040; CYP24A1: cg18956481, p=0.022), reflecting significant enrichment (p=0.0098).
Severe vitamin D deficiency is associated with methylation changes in leukocyte DNA. The genomic and epigenomic approach reinforce the crucial roles played by the DHCR7, CYP2R1 and CYP24A1 genes in vitamin D metabolism.
Vitamin D deficiency; DNA methylation; genome-wide association study; African Americans; extreme phenotypes
Previously, we reported strong influences of genetic variants on metabolic phenotypes, some of them with clinical relevance. Here, we hypothesize that DNA methylation may have an important and potentially independent effect on human metabolism. To test this hypothesis, we conducted what is to the best of our knowledge the first epigenome-wide association study (EWAS) between DNA methylation and metabolic traits (metabotypes) in human blood. We assess 649 blood metabolic traits from 1814 participants of the Kooperative Gesundheitsforschung in der Region Augsburg (KORA) population study for association with methylation of 457 004 CpG sites, determined on the Infinium HumanMethylation450 BeadChip platform. Using the EWAS approach, we identified two types of methylome–metabotype associations. One type is driven by an underlying genetic effect; the other type is independent of genetic variation and potentially driven by common environmental and life-style-dependent factors. We report eight CpG loci at genome-wide significance that have a genetic variant as confounder (P = 3.9 × 10−20 to 2.0 × 10−108, r2 = 0.036 to 0.221). Seven loci display CpG site-specific associations to metabotypes, but do not exhibit any underlying genetic signals (P = 9.2 × 10−14 to 2.7 × 10−27, r2 = 0.008 to 0.107). We further identify several groups of CpG loci that associate with a same metabotype, such as 4-vinylphenol sulfate and 4-androsten-3-beta,17-beta-diol disulfate. In these cases, the association between CpG-methylation and metabotype is likely the result of a common external environmental factor, including smoking. Our study shows that analysis of EWAS with large numbers of metabolic traits in large population cohorts are, in principle, feasible. Taken together, our data suggest that DNA methylation plays an important role in regulating human metabolism.
It has been suggested that genetic susceptibility plays an important role in the pathogenesis of diabetic nephropathy. A large-scale genotyping analysis of gene-based single nucleotide polymorphisms (SNPs) in Japanese patients with type 2 diabetes identified the gene encoding acetyl-coenzyme A carboxylase beta (ACACB) as a candidate for a susceptibility to diabetic nephropathy; the landmark SNP was found in the intron 18 of ACACB (rs2268388: intron 18 +4139 C > T, p = 1.4×10−6, odds ratio = 1.61, 95% confidence interval [CI]: 1.33–1.96). The association of this SNP with diabetic nephropathy was examined in 9 independent studies (4 from Japan including the original study, one Singaporean, one Korean, and two European) with type 2 diabetes. One case-control study involving European patients with type 1 diabetes was included. The frequency of the T allele for SNP rs2268388 was consistently higher among patients with type 2 diabetes and proteinuria. A meta-analysis revealed that rs2268388 was significantly associated with proteinuria in Japanese patients with type 2 diabetes (p = 5.35×10−8, odds ratio = 1.61, 95% Cl: 1.35–1.91). Rs2268388 was also associated with type 2 diabetes–associated end-stage renal disease (ESRD) in European Americans (p = 6×10−4, odds ratio = 1.61, 95% Cl: 1.22–2.13). Significant association was not detected between this SNP and nephropathy in those with type 1 diabetes. A subsequent in vitro functional analysis revealed that a 29-bp DNA fragment, including rs2268388, had significant enhancer activity in cultured human renal proximal tubular epithelial cells. Fragments corresponding to the disease susceptibility allele (T) had higher enhancer activity than those of the major allele. These results suggest that ACACB is a strong candidate for conferring susceptibility for proteinuria in patients with type 2 diabetes.
Although cumulative epidemiological findings have suggested that genetic susceptibility plays an important role in the pathogenesis of diabetic nephropathy, no gene conferring susceptibility to diabetic nephropathy has been definitively identified. In a large-scale association study of 1,312 Japanese subjects with type 2 diabetes using SNPs from a Japanese SNP database, we show that the T-allele of ACACB rs2268388 is associated with diabetic nephropathy. We also show that the association is consistently observed in patients with type 2 diabetes and proteinuria across different ethnic groups, including populations of European descent. Because a DNA fragment corresponding to the disease susceptibility allele is shown to have higher enhancer activity, we hypothesize that the increase in the expression and/or activity of the encoded acetyl-coenzyme A carboxylase beta contributes to the development and progression of diabetic nephropathy. Our present analysis provides novel insight into the pathogenesis of diabetic nephropathy. This finding is important because diabetic nephropathy is a leading cause of end-stage renal disease and affects life expectancy in subjects with type 2 diabetes.
Studies have found an association between aberrant DNA methylation and arsenic-induced skin lesions. Yet, little is known about DNA methylation changes over time in people who develop arsenic-induced skin lesions. We sought to investigate epigenome-wide changes of DNA methylation in people who developed arsenic-induced skin lesions in a ten year period. In 2009–2011, we conducted a follow-up study of 900 skin lesion cases and 900 controls and identified 10 people who developed skin lesions since a baseline survey in 2001–2003. The 10 cases (“New Cases”) were matched with 10 controls who did not have skin lesions at baseline or follow-up (“Persistent Controls”). Drinking water and blood samples were collected and skin lesion was diagnosed by the same physician at both time points. We measured DNA methylation in blood using Infinium HumanMethylation450K BeadChip, followed by quantitative validation using pyrosequencing. Two-sample t-tests were used to compare changes in percent methylation between New Cases and Persistent Controls. Six CpG sites with greatest changes of DNA methylation over time among New Cases were further validated with a correlation of 93% using pyrosequencing. One of the validated CpG site (cg03333116; change of %methylation was 13.2 in New Cases versus −0.09 in Persistent Controls; P <0.001) belonged to the RHBDF1 gene, which was previously reported to be hypermethylated in arsenic-exposed cases. We examined DNA methylation changes with the development of arsenic-induced skin lesions over time but nothing was statistically significant given the small sample size of this exploratory study and the high dimensionality of data.
Arsenic; DNA methylation; Illumina 450K; longitudinal; skin lesion
Background: Prenatal exposure to maternal cigarette smoking (prenatal smoke exposure) had been associated with altered DNA methylation (DNAm) at birth.
Objective: We examined whether such alterations are present from birth through adolescence.
Methods: We used the Infinium HumanMethylation450K BeadChip to search across 473,395 CpGs for differential DNAm associated with prenatal smoke exposure during adolescence in a discovery cohort (n = 132) and at birth, during childhood, and during adolescence in a replication cohort (n = 447).
Results: In the discovery cohort, we found five CpGs in MYO1G (top-ranking CpG: cg12803068, p = 3.3 × 10–11) and CNTNAP2 (cg25949550, p = 4.0 × 10–9) to be differentially methylated between exposed and nonexposed individuals during adolescence. The CpGs in MYO1G and CNTNAP2 were associated, respectively, with higher and lower DNAm in exposed versus nonexposed adolescents. The same CpGs were differentially methylated at birth, during childhood, and during adolescence in the replication cohort. In both cohorts and at all developmental time points, the differential DNAm was in the same direction and of a similar magnitude, and was not altered appreciably by adjustment for current smoking by the participants or their parents. In addition, four of the five EWAS (epigenome-wide association study)–significant CpGs in the adolescent discovery cohort were also among the top sites of differential methylation in a previous birth cohort, and differential methylation of CpGs in CYP1A1, AHRR, and GFI1 observed in that study was also evident in our discovery cohort.
Conclusions: Our findings suggest that modifications of DNAm associated with prenatal maternal smoking may persist in exposed offspring for many years—at least until adolescence.
Citation: Lee KW, Richmond R, Hu P, French L, Shin J, Bourdon C, Reischl E, Waldenberger M, Zeilinger S, Gaunt T, McArdle W, Ring S, Woodward G, Bouchard L, Gaudet D, Davey Smith G, Relton C, Paus T, Pausova Z. 2015. Prenatal exposure to maternal cigarette smoking and DNA methylation: epigenome-wide association in a discovery sample of adolescents and replication in an independent cohort at birth through 17 years of age. Environ Health Perspect 123:193–199; http://dx.doi.org/10.1289/ehp.1408614
DNA methylation is a widely studied epigenetic phenomenon; alterations in methylation patterns influence human phenotypes and risk of disease. As part of the Atherosclerosis Risk in Communities (ARIC) study, the Illumina Infinium HumanMethylation450 (HM450) BeadChip was used to measure DNA methylation in peripheral blood obtained from ~3000 African American study participants. Over 480,000 cytosine-guanine (CpG) dinucleotide sites were surveyed on the HM450 BeadChip. To evaluate the impact of technical variation, 265 technical replicates from 130 participants were included in the study.
For each CpG site, we calculated the intraclass correlation coefficient (ICC) to compare variation of methylation levels within- and between-replicate pairs, ranging between 0 and 1. We modeled the distribution of ICC as a mixture of censored or truncated normal and normal distributions using an EM algorithm. The CpG sites were clustered into low- and high-reliability groups, according to the calculated posterior probabilities. We also demonstrated the performance of this clustering when applied to a study of association between methylation levels and smoking status of individuals. For the CpG sites showing genome-wide significant association with smoking status, most (~96%) were seen from sites in the high reliability cluster.
We suggest that CpG sites with low ICC may be excluded from subsequent association analyses, or extra caution needs to be taken for associations at such sites.
Electronic supplementary material
The online version of this article (doi:10.1186/1471-2105-15-312) contains supplementary material, which is available to authorized users.
DNA methylation; Infinium 450 K chip; Technical error; Intraclass correlation; Normal mixture models
The occurrence of asthma is weakly explained by known genetic variants. Epigenetic marks, DNA methylation (DNA-M) in particular, are considered to add to the explanation of asthma. However, no etiological model has yet been developed that integrates genetic variants and DNA-M. To explore a new model, we focused on one asthma candidate gene, the IL-4 receptor (IL4R). We hypothesized that genetic variants of IL4R in interaction with DNA-M at cytosine-phosphate-guanine (CpG) sites jointly alter the risk of asthma during adolescence. Blood samples were collected at age 18 years from 245 female cohort participants randomly selected for methylation analysis from a birth cohort (n = 1,456, Isle of Wight, UK). Genome-wide DNA-M was assessed using the Illumina Infinium HumanMethylation450 BeadChip.
Thirteen single nucleotide polymorphisms (SNPs) and twelve CpG sites of IL4R gene were analyzed. Based on linkage disequilibrium and association with asthma, eight SNPs and one CpG site were selected for further analyses. Of the twelve CpG sites in the IL4R gene, only methylation levels of cg09791102 showed an association with asthma at age 18 years (Wilcoxon test: P = 0.01). Log-linear models were used to estimate risk ratios (RRs) for asthma adjusting for uncorrelated SNPs within the IL4R gene and covariates. Testing for interaction between the eight SNPs and the methylation levels of cg09791102 on the risk for asthma at age 18 years, we identified the statistically significant interaction term of SNP rs3024685 × methylation levels of cg09791102 (P = 0.002; after adjusting for false discovery rate). A total of 84 participants had methylation levels ≤0.88, 112 participants between 0.89 and 0.90, and 35 between 0.91 and 0.92. For the SNP rs3024685 (‘CC’ vs. ‘TT’) at methylation levels of ≤0.85, 0.86, 0.90, 0.91, and 0.92, the RRs were 0.01, 0.04, 4.65, 14.76, 14.90, respectively (interaction effect, P = 0.0003).
Adjusting for multiple testing, our results suggest that DNA-M modulates the risk of asthma related to genetic variants in the IL4R gene. The strong interaction of one SNP and DNA-M is encouraging and provides a novel model of how a joint effect of genetic variants and DNA-M can explain occurrence of asthma.
Interleukin-4 receptor gene; DNA methylation; Genetic variants; Asthma; Epigenetics
Motivation: DNA methylation is a molecular modification of DNA that plays crucial roles in regulation of gene expression. Particularly, CpG rich regions are frequently hypermethylated in cancer tissues, but not methylated in normal tissues. However, there are not many methodological literatures of case-control association studies for high-dimensional DNA methylation data, compared with those of microarray gene expression. One key feature of DNA methylation data is a grouped structure among CpG sites from a gene that are possibly highly correlated. In this article, we proposed a penalized logistic regression model for correlated DNA methylation CpG sites within genes from high-dimensional array data. Our regularization procedure is based on a combination of the l1 penalty and squared l2 penalty on degree-scaled differences of coefficients of CpG sites within one gene, so it induces both sparsity and smoothness with respect to the correlated regression coefficients. We combined the penalized procedure with a stability selection procedure such that a selection probability of each regression coefficient was provided which helps us make a stable and confident selection of methylation CpG sites that are possibly truly associated with the outcome.
Results: Using simulation studies we demonstrated that the proposed procedure outperforms existing main-stream regularization methods such as lasso and elastic-net when data is correlated within a group. We also applied our method to identify important CpG sites and corresponding genes for ovarian cancer from over 20 000 CpGs generated from Illumina Infinium HumanMethylation27K Beadchip. Some genes identified are potentially associated with cancers.
Supplementary data are available at Bioinformatics online.