|Home | About | Journals | Submit | Contact Us | Français|
Conceived and designed the experiments: FE NB JMG. Performed the experiments: RFT TDB. Analyzed the data: RFT MF JMG. Contributed reagents/materials/analysis tools: FE RFT TDB AV. Wrote the paper: FE RFT MF NB JMG.
Perturbations of the intrauterine environment can affect fetal development during critical periods of plasticity, and can increase susceptibility to a number of age-related diseases (e.g., type 2 diabetes mellitus; T2DM), manifesting as late as decades later. We hypothesized that this biological memory is mediated by permanent alterations of the epigenome in stem cell populations, and focused our studies specifically on DNA methylation in CD34+ hematopoietic stem and progenitor cells from cord blood from neonates with intrauterine growth restriction (IUGR) and control subjects.
Our epigenomic assays utilized a two-stage design involving genome-wide discovery followed by quantitative, single-locus validation. We found that changes in cytosine methylation occur in response to IUGR of moderate degree and involving a restricted number of loci. We also identify specific loci that are targeted for dysregulation of DNA methylation, in particular the hepatocyte nuclear factor 4α (HNF4A) gene, a well-known diabetes candidate gene not previously associated with growth restriction in utero, and other loci encoding HNF4A-interacting proteins.
Our results give insights into the potential contribution of epigenomic dysregulation in mediating the long-term consequences of IUGR, and demonstrate the value of this approach to studies of the fetal origin of adult disease.
The concept of fetal origin of adult disease suggests that early life conditions can “program” the fetus for a spectrum of adverse health outcomes as an adult . The link between intrauterine growth restriction (IUGR) and adult diseases such as type 2 diabetes and cardiovascular disease has been extensively supported by epidemiological , , , , ,  and animal , , ,  studies. To date, reports have focused on characterizing the pathophysiological consequences of an adverse intrauterine environment, and have revealed dysregulation of gene expression and a variety of functional impairments in individual tissues (including liver , , skeletal muscle , , pancreas , , kidney , and bone ) that precede and are potentially contributory to a range of later adult diseases. Taken together, these studies define a precocious aging phenotype, culminating in an increased risk of premature death , .
Permanent and sometimes progressive changes in gene expression have been observed in multiple tissues as a consequence of IUGR , , , . Dysregulation of the epigenome may explain changes that are propagated from parent to daughter cells in IUGR offspring throughout life. For instance, offspring of rats fed a restricted protein diet throughout pregnancy showed changes in DNA methylation at multiple genes, with corresponding changes in gene expression, both of which were prevented by maternal supplementation of folic acid (an important methyl donor), further implicating an epigenetic mechanism , , . Additionally, human studies also demonstrated epigenetic differences in response to an adverse intrauterine environment, as periconceptional exposure to famine was associated with altered DNA methylation at multiple sites within the known IGF2 differentially methylated region (DMR) . Similar epigenetic dysregulation has been observed in a variety of tissues , , , , consistent with IUGR-induced susceptibility to age-related diseases affecting multiple organ systems.
As the life-long progenitors of many differentiated lineages, stem cells must act as custodians of epigenomic regulatory patterns in order that these changes persist in most tissues throughout the lifetime of an organism. Hematopoietic stem (CD34+) cells are one of the best-characterized stem cell types, and represent a neonatally accessible population, present as ~1% of mononucleated umbilical cord blood cells . Following tissue injury, this stem cell population is able to mobilize, proliferate, and home to target microenvironments at the site of damage , and may coordinate injury repair by stimulating tissue-specific progenitors , . Moreover, these stem cells are multipotent progenitors of the immune system, which itself is likely to mediate inflammation, development, and progression of T2DM and cardiovascular disease .
To determine whether an adverse intrauterine environment leads to epigenetic changes that have functional pathological consequences, we studied DNA methylation in multipotent hematopoietic (CD34+) stem cells of IUGR neonates and matched controls. We employed our new, high-resolution version of the microarray-based HELP assay  to study ~1.32 million loci throughout the human genome. Rather than focus on known candidate loci, we expanded the search genome-wide to identify novel loci involved and also tested for global, non-specific effects of IUGR on cytosine methylation. We followed this screening approach with a second stage of analysis, validating methylation status using quantitative, nucleotide-resolution bisulphite MassArray  to define the dysregulatory effects of IUGR on the epigenome.
This study was approved by the institutional review board (IRB) of the Montefiore Medical Center and the Committee on Clinical Investigation at the Albert Einstein College of Medicine and is in accordance with Health Insurance Portability and Accountability Act (HIPAA) regulations. Written informed consent was obtained from all subjects prior to participation.
Biological samples and clinical information were collected from consenting women who delivered IUGR or infants with appropriate growth (matched for gestational age at delivery, ethnicity and gender). Both birth weight and ponderal index (a measurement of neonatal weight relative to length) were used to identify cases and controls. IUGR had birth weight and ponderal index <10th percentile for gestational age and gender and controls had normal percentiles (>10th and <90th) for both parameters. Pertinent data on maternal medical history, pregnancy complications and neonatal hospital stay were recorded.
CD34+ cells, which constitute approximately 1% of nucleated blood cells in umbilical cord blood , were isolated from the cord blood specimen using an immunomagnetic separation technique, as previously described . Mononuclear cells are separated by Ficoll-Paque density gradient, following which CD34+ cells are obtained by positive immunomagnetic bead selection, using Macs columns (Miltenyi Biotech). The isolated cells having >95% purity ,  are cryopreserved in 10% DMSO by controlled rate freezing.
High-resolution HELP assays were performed according to recent advances in the technology . Genomic DNA was isolated, digested to completion by either HpaII or MspI separately, and then ligated to a mixture of two oligonucleotide pair adapters, each complementary to the cohesive ends generated by the restriction digest. The adapters then served as a priming site for a PCR reaction (ligation-mediated PCR, LM-PCR) that we have described to generate a product predominantly in the 50–2,000 bp size range . Following PCR, the HpaII and MspI representations were labeled with different fluorophores using random priming and were then cohybridized on a customized genomic microarray representing ~1.32 million HpaII/MspI fragments of 50–2,000 bp in unique sequence .
Microarray data were pre-processed and subject to quality control and quantile normalization as previously described . HpaII/MspI ratio values were compared between groups for all loci throughout the genome using a paired t-test with cases (n=5) and controls (n=5) matched by gender and gestational age, as well as ethnicity where possible (see Table S1). Changes in methylation state were defined using a HpaII/MspI ratio threshold of zero, where methylated loci and hypomethylated loci had ratio values less than zero and greater than zero, respectively. Ordered lists of differences were generated, the first relying purely on paired t-test computed p-values. Our alternative approach identified differential methylation, ranked according to the formula: This method, similar to the modified T test , places more weight on the fold-change relative to the within group variability. However, unlike p-values based on the modified T test, computed values are not interpreted in these studies as a probabilistic quantity, and instead are used exclusively to rank and isolate important loci.
Target regions were amplified by PCR using the primers and cycling conditions described in Table S2. Primers were selected with MethPrimer (http://www.urogene.org/methprimer/) using parameters as follows: 200–400 bp amplicon size, 56–60°C Tm, 24–30 bp length, and ≥1 CG in product. 50 µl PCR reactions were performed using the Roche FastStart High Fidelity Kit. In cases where products showed primer-dimer or other contaminants, bands of appropriate size were excised from 2% agarose gels, purified by Qiagen Gel Extraction Kit, and eluted with 1X Roche FastStart High Fidelity Reaction Buffer (+MgCl2). All PCR products (5 µl) were aliquotted onto 384-well microtiter plates and were treated with 2 µl of Shrimp Alkaline Phosphatase (SAP) mix for 20 minutes at 37°C to dephosphorylate unincorporated dNTPs. Microtiter plates were processed by the MassArray Matrix Liquid Handler. A 2 µl volume of each SAP-treated sample was then heat-inactivated at 85°C for 5 minutes and subsequently incubated for 3 hours at 37°C with 5 µl of Transcleave mix (T or C Cleavage Mix) for concurrent in vitro transcription and base-specific cleavage. Samples were transferred onto the spectroCHIP array by nanodispensation calibrated to ambient temperature and humidity, and analysis with the Sequenom MALDI-TOF MS Compact Unit following 4-point calibration with oligonuculeotides of different mass provided in the Sequenom kit. Matched peak data was exported using EpiTYPER software and analyzed for quality and single nucleotide polymorphisms according to analytical tools that we have recently developed .
The most significantly differentially methylated loci were mapped to RefSeq gene identifiers by chromosomal position within 10 kb upstream of the transcription start site or overlapping the gene body. The list of RefSeq identifiers was then uploaded to the Ingenuity Pathway Analysis program (Redwood City, CA), enabling exploration of ontology and molecular interaction networks. Each uploaded gene identifier was mapped to its corresponding gene object (focus genes) in the Ingenuity Pathways Knowledge Base. Core networks were constructed for both direct and indirect interactions using default parameters, and the focus genes with the highest connectivity to other focus genes were selected as seed elements for network generation. New focus genes with high specific connectivity (overlap between the initialized network and gene's immediate connections) were added to the growing network until the network reached a default size of 35 nodes. Non-focus genes (those that were not among our differentially methylated input list) that contained a maximum number of links to the growing network were also incorporated.
The ranking score for each network was then computed by a right-tailed Fisher's exact test as the negative log of the probability that the number of focus genes in the network is not due to random chance. Similarly, significances for functional enrichment of specific genes were also determined by the right-tailed Fisher's exact test, using all input genes as a reference set.
Umbilical cord blood samples were collected from 10 well-matched, consenting individuals: IUGR (n=5), and appropriate for gestational age controls (n=5), matched for gestational age at delivery, gender, and ethnicity). We controlled for variable cellular composition of mixed leukocyte populations in the cord blood by purifying for hematopoietic stem (CD34+) cells, which served as a reference cell type for comparison in all subjects, thus reducing this potential source of variability which could have otherwise influenced our subsequent methylation assays . We also note that while CD34+ cells are functionally heterogeneous themselves , purification reduces cell-specific variability in cytosine methylation that could otherwise mask (or artificially create) IUGR-related changes.
By design, IUGR weighed less and had a lower ponderal index than their matched controls, which were required to have normal percentiles for birth weight and ponderal index (Table 1). Both groups had similar gestational age at delivery and proportions of each sex and ethnic composition, and comparable maternal age, BMI, pregnancy weight gain, and 1-hour maternal plasma glucose screen values after a 50 g glucose load (Table 1). All of the women and their infants were healthy. None of the mothers had chronic hypertension, pre-eclampsia or gestational hypertension, pre-gestational or gestational diabetes, renal or autoimmune disease. The neonates had similar 1-minute and 5-minute Apgar scores in both groups (data not shown). One infant in the IUGR group had unilateral mild pyelectasis, which did not require treatment. There were no other neonatal complications.
To detect global patterns of epigenetic changes that could distinguish between IUGR and controls, we performed the HELP assay on the CD34+ cells. These genome-wide cytosine methylation profiles are available as a public resource through http://greallylab.aecom.yu.edu/~greally/humanIUGR/ and through the GEO repository (accession number GSE17727). We compared the full dataset of HELP data from IUGR and normal birthweight subjects using unsupervised clustering. We observed consistent patterns of methylation across all samples, without any apparent global changes in methylation status (e.g. the hypomethylation commonly seen in cancer ) between IUGR and control groups (Figure S1). Global Pearson correlation coefficients were calculated for the methylation patterns for each pair of samples, confirming a high degree of inter-sample consistency (R=0.90−0.95).
We tested to see whether cytosine methylation changes were present but undetected in global comparisons because they occur at only a subset of loci and/or to a limited degree at each locus. Paired T tests were applied to detect group differences for all individual loci represented on our arrays. This analysis yielded a subset of potentially informative loci with statistically significant differences in methylation. Figure 1A shows the distribution of p-values observed for IUGR compared to controls. Figure 1B shows the distribution of p-values when class labels are randomly permuted once across all loci and are representative of what we would see under the null distribution of no group effect. This illustrates how the observed p-values for the IUGR/control comparison are associated with a distribution consistent with lower than expected p-values for a small subset of loci, with 56 candidate loci (Table 2) having significant group differences (p<0.00001, chosen because no test statistics derived under any other permutation obtained a p-value below 0.000015), and an additional 646 loci with moderate group differences (p<0.0001).
For comparison, p-value distributions calculated for grouping by gender and ethnicity are shown in Figures S2A and S2B, respectively. Gender comparison shows ~2,500 significant loci (p<0.00001 threshold), an expected outcome due to the vast majority of these discriminatory loci (99.4%) being located on the X and Y chromosomes. We found no evidence for ethnicity (Latino versus non-Latino) influencing cytosine methylation (Figure S2B).
When we looked in detail at the 56 differentially-methylated loci, we found that the magnitude of DNA methylation changes observed in IUGR compared to controls is markedly less than many tissue-specific differences in methylation that we have previously observed (; and data not shown). Furthermore, we analyzed methylation status at four known imprinted regions on chromosome 11, some of which have been shown to harbor small changes in methylation in subjects with IUGR (the IGF2 differentially-methylated region (DMR), the H19 DMR and promoter region, and the KCNQ1 DMR) ,  but found no detectable differences in the IUGR group compared to controls in this particular cell type (data not shown).
The next component of our two-stage experimental design was to validate the HELP data with single-locus, nucleotide-resolution quantitative studies, for which we used bisulphite MassArray . To choose the loci for validation, we assessed the candidacy of the top 56 loci in terms of related biological function, reasoning that epigenetic dysregulation of the small degrees we were observing would have to affect multiple components of a biological pathway to cause functionally-significant changes. We used an Ingenuity Pathway Analysis (IPA) for 33 of the top 56 candidate loci, chosen because of physical proximity to 35 RefSeq-annotated genes (22 promoters and 13 gene bodies). While the remaining 23/56 loci in intergenic regions could not be linked to genes using these proximity criteria, almost half of these loci occur at CG clusters  or phastCons conserved DNA elements , both characteristics having potential relevance as cis-regulatory sites  and therefore may have unrecognized roles regulating transcription.
Of the two highest-scoring molecular interaction networks generated from IPA analysis, one was associated with cell signaling, nucleic acid metabolism, and molecular transport (Figure S3), while the other network contained 12 out of 35 input nodes and was functionally associated with the cell cycle, cellular maintenance, and connective tissue development (Figure 2). Moreover, this network was centered primarily on the hepatic nuclear factor 4α (HNF4A), a transcription factor that has been strongly implicated in an early-onset form of type II diabetes, maturity onset diabetes of the young . Although HNF4A did not meet our strict threshold for significance in the primary analysis, it was ranked first in our secondary analysis using a test statistic that was more heavily weighted towards the mean shift between groups (see Methods).
An additional IPA that was applied specifically to candidate loci overlapping RefSeq gene promoters revealed a third molecular interaction network including 17 of 23 input nodes. This network is functionally associated with cancer, cellular development, cellular growth and proliferation (Figure S4), and is centered on a number of transcription factors and growth hormones (e.g. NCOR2, GH1, TGFβ1, HNF4A), and TP53 which shows altered DNA methylation in renal tissue of IUGR rats in adult life .
We therefore proceeded to bisulphite MassArray experiments , initially testing the technical performance of the HELP assays by choosing four loci, two each representing constitutively hypomethylated and constitutively methylated sites identified by the HELP assay. Figure S5 demonstrates the agreement between methylation values determined independently for these loci by HELP and MassArray (with a between-assay correlation in this case of R=−0.96353).
The technical validation results confirmed that the HELP data were accurate and allowed us to proceed to the analysis of the specific loci at which dysregulation of cytosine methylation was suspected. We focused on the HNF4A locus, where HELP data showed evidence for changes in cytosine methylation at an internal promoter of this complex gene (Figure 3). Furthermore, the identified locus represents highly conserved sequence and contains multiple known transcription factor binding sites for HNF1α and β, SP1, HNF6, and GATA6 . Differences in DNA methylation at the HNF4A promoter were confirmed by MassArray, with the informative HpaII site showing hypermethylation in IUGR compared to controls (65.7% and 59.6% methylation, respectively; p<0.01), consistent with the HELP data (log2(HpaII/MspI)=1.89±0.42 and 3.46±0.60, respectively; p=0.00006). This increased methylation level is of a magnitude comparable with that observed in samples from the Dutch famine cohort . Functionally, the acquisition of cytosine methylation at this promoter may reduce HNF4A expression in IUGR offspring, consistent with the inherited loss-of-function mutations in HNF4A that lead to an autosomal dominant form of maturity onset diabetes of the young (MODY) .
One of our goals was to use our data post hoc to define how best to design this kind of study to test the dysregulation of cytosine methylation in a human disease. We therefore calculated an estimation of effect size for the top 1,000 loci with differences in methylation between IUGR and controls. Similar to the distribution of methylation differences noted for the top 56 loci, the 1,000 most informative loci had a mean difference of 0.60 with a standard deviation of 0.20 log2 units. The difference in average methylation between control and IUGR at all loci measured tended to be subtle (between 0.20 and 1.00) and less likely to represent broad methylation state changes (going from a completely methylated state to a hypomethylated state and vice versa). We simulated informative genes with this information as a guide for true group differences in this population (Table 3).
We present preliminary evidence for a unifying new hypothesis in which epigenetic modification induced in early development may contribute to an increased susceptibility to age-related diseases by prematurely advancing the normal aging process. Using a genome-wide, high-resolution cytosine methylation assay (testing ~1.32 million loci throughout the genome), we avoid the necessity for any a priori assumptions about candidate loci and reveal potential new candidates in mediating the fetal origin of adult disease. We identified dysregulation of DNA methylation in purified populations of cord blood-derived CD34+ stem cells from IUGR neonates compared with matched controls, and find that these changes occur throughout the genome, although they are of a more modest degree and relatively limited extent compared with the epigenomic dysregulation observed in conditions such as cancer. As our study was based on a limited number of subjects, such subtle differences in cytosine methylation challenge the ability of genome-wide approaches to identify genuinely significant loci even at standard significance levels (0.05–0.001) which do not reflect multiple testing. In large-scale studies, when thousands to millions of loci are measured, the threshold to declare any locus significant is typically based on a more stringent level of significance (e.g. Bonferroni correction or similar) in order to minimize the possibility of false positives, therefore we have displayed the power at two such thresholds. Based on these simulations, a more expansive approach, with sample size of at least 25 subjects per group is recommended for studies of methylation changes in conditions where such subtle effects are expected.
Nonetheless, we found that the IUGR subjects were distinctive for having a number of consistent differences in methylation near genes involved in processes critical for stem cell function, including cell cycle and cellular maintenance. Quantitative bisulphite validation studies confirmed our ability to discriminate differences in methylation in these samples, and the biological coherence of results in terms of functional pathway relatedness is suggestive of underlying changes in epigenetic regulation as a response to IUGR.
A locus that emerged consistently in this pathway analysis was the HNF4A gene, already implicated in T2DM , but not previously demonstrated to undergo epigenetic dysregulation as a response to IUGR. Best known for its implications as a monogenic, autosomal dominant form of maturity onset diabetes of the young (MODY) , HNF4A is involved in development and function of both the liver and the pancreas  and actively coordinates gene expression of many important metabolic pathways in both tissues , , . We find differences in DNA methylation targeted to only one of the HNF4A promoters, supporting a model of isoform variation of the gene being related to susceptibility to T2DM, a major age-related disease.
Other loci identified in this study were found to be related functionally to HNF4A and also include ATG5 and TADA3L, which may have roles in mediating susceptibility to later disease. ATG5 is an essential component of autophagy that, when depleted, renders cells more susceptible to starvation and starvation-induced cell death . We found that ATG5 is relatively methylated in IUGR at a CpG island-containing site just downstream of the transcription start site, potentially reducing expression in IUGR compared with controls, in parallel with increased sensitivity to starvation-induced cell death. Transcriptional adaptor 3 (TADA3L) is associated with and is required for full p53 activity, causing growth arrest, senescence, and p53-mediated apoptosis . TADA3L isoforms are highly expressed in CD34+ stem cells , but we find that TADA3L is relatively methylated in IUGR at a CpG island-containing bidirectional promoter, potentially downregulating its expression and altering CD34+ stem cell population dynamics.
However, the modest level of differences in methylation that we and others have observed  raises an important question: what is the biological significance of changes in methylation on the order of ~6%? We note that such a change must represent a difference in a proportion of cells and/or alleles undergoing methylation within the broader population of CD34+ cells. Because CD34+ stem cells are multipotent progenitors, the presence of an epigenetically dysregulated subpopulation may go on to mediate susceptibility to chronic disease, with potentially greater effects over time should this subpopulation expand. Alternatively, this stem cell population may serve to define loci susceptible to constitutive “epimutations”  that are likely to exist in descendent cell types or unrelated lineages (e.g. liver or pancreatic progenitors) where they may have the chance to induce functional changes in critical cell types or tissues. Adding further information about epigenetic and transcriptional regulators other than cytosine methylation plus transcriptional profiling studies will be very valuable in gaining a greater understanding of the epigenetic dysregulation and its functional consequences in IUGR.
We hypothesize that the changes we observe by focused studies of hematopoietic (CD34+) stem cells are representative of the influence of the intrauterine environment on epigenetic regulation and independent cellular programs throughout the developing fetus. While all cells are believed to accumulate epimutations over time , periods of rapid cell division (e.g. during fetal development) represent the most vulnerable windows for cellular injury and potential dysregulation of the epigenome, with associated decreases in cellular fitness, function, and, of particular note for multipotent stem cells, replicative capacity. We therefore propose that adverse intrauterine conditions are more likely to contribute to replicative senescence and early exhaustion of regenerative pools of stem cell precursors throughout the body, increasing susceptibility to and speeding onset of age-related diseases like T2DM.
Although the exact mechanism remains unclear, the results of this study are indicative of epigenetic dysregulation associated with intrauterine growth restriction. Moreover, these findings suggest that epigenetic changes serve as a steward of cellular memory of aberrant intrauterine environments, and that site-specific changes in DNA methylation may mediate the increased susceptibility to age-related diseases observed later in life.
GEO database, accession number GSE17727.
Global correlation using the entire dataset demonstrates a high degree of epigenetic correlation among all samples, with no apparent global difference between IUGR and controls. A tree-pair plot was generated using a visualization tool that we have developed as part of an R package (Thompson, 2008). Pairwise correlations, calculated from >1.32 million independent loci, are shown in the upper right portion of the figure, where R values indicate the Pearson correlation for each pair of samples (labeled along the diagonal) and blue dotplots show a visual representation of the similarity between samples. Thin red lines within each of these sub-panels correspond to a lowess-fit of the pairwise data. In the lower left portion of the figure, a tree determined by Ward's minimum variance clustering shows an alternative unsupervised clustering approach. Branching order is shown in solid lines, colored by group. The diagonal dotted lines are numbered and indicate the Euclidean distance scale. The dotted red line indicates the Euclidean distance cutoff used to separate the individual groups of samples. Thompson, R.F., Reimers, M., Khulan, B., Gissot, M., Richmond, T.A., Chen, Q., Zheng, X., Kim, K. and Greally, J.M. (2008) An analytical pipeline for genomic representations used for cytosine methylation studies, Bioinformatics, 24, 1161–1167.
(0.70 MB TIF)
Supervised group comparisons reveal differences in methylation based on gender but not ethnicity. Panel (A) shows a histogram distribution of p-values calculated from an unpaired T test of males (n=4) in one group and females (n=6) in another. Along the x-axis, leftmost data represent low p-values (i.e. highly significant differences in methylation between groups), while data towards the right represent high p-values and, thus, loci that are uniform across groups. P-value frequency is shown along the y-axis, with larger values indicating increasing numbers of differentially methylated loci corresponding to the indicated p-value level. Note that the y-axis in this panel is adjusted to a different scale in order to account for the higher frequency of highly significant p-values. Panel (B) shows an analogous histogram with data obtained from samples grouped by ethnicity (Latin, n=7, compared to non-Latin, n=3).
(0.04 MB TIF)
The top-scoring molecular interaction network suggested by Ingenuity Pathway Analysis (IPA) consists of 13 genes among the top 56 differentially methylated loci. RefSeq IDs for 33 of the top 56 sites that mapped to genes were uploaded onto the “Core Analysis” tool of IPA. The top-scoring molecular interaction network was constructed by 35 nodes, 13 of which were located on the input list (shaded nodes), and is associated with cell signaling, nucleic acid metabolism, and molecular transport. The nodal relationships are indicated by solid lines (direct interaction) and dashed lines (indirect interactions), with or without filled arrows indicating functional interaction or merely physical association, respectively. The shape of each node indicates the class of molecule: horizontal ovals are transcription factors, squares are cytokines, vertical rectangles are G-protein coupled receptors, triangles are phosphatases, diamonds are enzymes, trapezoids are transporters, small ovals are chemicals, and circles correspond to “other” molecules. In alphabetical order, this network consists of ADCY, ADCY5, ADCY9, ARPC4, ATG5, β-estradiol, CAP2, CRYM, CUX2, EMILIN3, FSH, GALR1, GALR3, GH1, GJA1, GNAI2, GNAL, GNG7, GSTM3, HTR1F, LPAR4, MAMLD1, 5-methoxytryptamine, MllRN181C, NCOR2, noladin ether, PALM, PCDH19, PPP3CA, RXFP4, SH3BP4, SYP, TADA3L, and TP53.
(0.32 MB TIF)
A molecular interaction network suggested by Ingenuity Pathway Analysis (IPA) “Core Analysis”, with HNF4A as a central node among other transcription and growth factors. This network, associated with cancer, cellular development, cellular growth and proliferation, consists of 17 of 23 input nodes, each corresponding to the promoter of a RefSeq gene showing differential methylation by HELP. Nodes are shaded in green for relative hypermethylation in IUGR compared to controls, while red-shaded nodes are hypomethylated in IUGR. The nodal relationships are indicated by solid lines (direct interaction) and dashed lines (indirect interactions), with or without filled arrows indicating functional interaction or physical association, respectively. The shape of each node indicates the class of molecule: horizontal ovals are transcription factors, squares are growth factors, vertical rectangles are ion channels, triangles are phosphatases, diamonds are enzymes, trapezoids are transporters, and circles correspond to “other” molecules with concentric circles indicated complexes. In alphabetical order, this network consists of ADCY5, ARPC4, ATG5, β-estradiol, BUD31, C11ORF10, C9ORF5, Ca2+, CDKN2A, CHCHD8, CPA2, CPN1, CRY1, CTNNBL1, CUX2, FSH, GATA2, GH1, GINS3, GRIK4, HNF4A, INO80D, NCOR2, PPP3CA, PRSS3, RUVBL2, SH3BP4, SLC35A1, SLC35A5, SPP1, SYP, TADA3L, TGFB1, TP53, and TSKU.
(0.46 MB TIF)
Technical validation studies using MassArray confirm HELP data. Four loci were identified, two each representing constitutively hypo- and hyper-methylated sites. The hypomethylated sites were located at chr12:63986847-63987489 and chr20:45414330-45414885, with hypermethylated sites at chr4:188493650-188494271 and chr7:69549255-69549831 (hg18, human genome, March 2006, UCSC Genome Browser). HELP data as log2(HpaII/MspI) ratios are shown along the x-axis, with methylation towards the left and hypomethylation towards the right. MassArray data for the same loci are plotted along the y-axis, from 0% (hypomethylated) to 100% (methylated).
(0.03 MB TIF)
Neonatal and Maternal Characteristics for IUGR and Controls
(0.03 MB DOC)
Characteristics of top 56 candidate loci identified by HELP. All positions correspond to coordinates in the human genome, hg18 March 2006 UCSC Genome Browser; IUGR and Control data given as group averages of log2(HpaII/MspI); difference is IUGR minus Control; P-values are all x10−6; CpG, overlap with CpG islands; CGc overlap CG clusters; phC overlap with mammalian or vertebrate phastCons conserved elements; Rep overlap with repetitive elements (RT for retrotransposable elements including LINEs and SINEs, LT for long terminal repeats); Loc overlap with promoters (PRO), bidirectional promoters (PRO2), or gene bodies (GB) of RefSeq genes; Gene names and corresponding RefSeq identifiers are also shown.
(0.04 MB DOC)
The authors acknowledge the support of Einstein's Center for Epigenomics.
Competing Interests: The authors have declared that no competing interests exist.
Funding: This work is supported by a research training scholarship from American Association of Obstetricians & Gynecologists Foundation with the Society for Maternal-Fetal Medicine to F.E., grants from the National Institutes of Health to J.M.G. (HG004401, HD044078) and N.B. (AG21654 and AG18381) and by the Core laboratories of the Albert Einstein Diabetes Research and Training Center (DK 20541). R.F.T. was supported by a NIA T32 training grant, and by National Institutes of Health (NIH) MSTP Training Grant GM007288. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.