To define gene expression changes peculiar to the extremelongevity phenotype, transcript profiles were assessed on fullgenome microarrays comprising > 22 000 synthetic oligomers (60-mers) printed on each epoxy slide (Genome Sequencing Center, Washington University, St Louis, MO, USA). Eight arrays compared eight biological expansions of very long-lived
age-1 alleles
mg44 and
m333 (four arrays each) to four distinct expansions of the weaker
age-1(hx546) allele, which in the N2DRM background confers approximately twofold life extension typical of long-lived IIS mutants. Another six arrays were used to compare the two
age-1 nonsense alleles (three expansions each) to the wild-type N2DRM strain. All mutants utilized here had been outcrossed to N2DRM for at least six generations, to create a near-isogenic set of strains, and all cultures were maintained at 20 °C. Second-generation homozygotes of
age-1(mg44) and
age-1(m333) are infertile (
Ayyadevara et al., 2008), in marked contrast to all other strains tested. Although ‘pregravid’ worms can be collected within the first 12 h after the L4/adult molt, this does not ensure complete exclusion of eggs and embryos; moreover, chemical or genetic methods to block reproduction may themselves perturb survival phenotypes and transcription profiles. We therefore harvested the fertile strains when postgravid (as assessed by microscopic examination of at least 30 worms) but prior to senescence (typically at 6 days of adult age, ~38% of the median adult lifespan), and F2 worms of the strong
age-1 mutants at 9–12 days of adult age. With developmental time included, N2 worms were 8.5 days posteclosion,
age-1(mg44) worms were at 18–21 days and
age-1(m333) worms 27–30 days. Because strains could not be matched with respect to both chronological and physiological age, and in any case there is no consensus as to whether total or adult ages should be considered, we chose these conditions for initial comparisons and left relative-age issues to be addressed in subsequent experiments (see below).
SAM Analysis These two data sets [eight arrays using
age-1(hx546) as the reference and six arrays with N2DRM as the reference] were analyzed separately by significance analysis of microarrays (SAM) [(
Tusher et al., 2001) ver. 3.02, Stanford University], using the ‘two-class paired
t-test’ procedure to identify genes preferentially or uniquely altered in the two longest-lived
age-1 strains.
Table S1 (Supporting information) lists 386 differentially expressed loci for the comparison between the very long-lived strains and
age-1(hx546); 276 of these were significant at a nominal false-discovery rate (FDR or
q) of < 5% (with true FDR estimated to be < 2.3%).
Table S2 (Supporting information) is a parallel listing of 339 genes with significantly altered expression relative to N2DRM wild-type adults, at FDR < 5%. Each list comprises ~2% of all genes surveyed. These tables include brief descriptions of the encoded proteins, statistical parameters from SAM and notes of other relevant data [confirmation or contradiction by RT-qPCR assays, and life-span effects of RNA interference (RNAi)]. Numbers of significant genes found, and their FDR values determined empirically by permutation, differ somewhat (as expected) between the two comparisons.
Significance analysis of microarray (SAM) ‘outlier plots’ for these two data analyses are shown in . The striking asymmetry of reflects the greater number of differentially expressed genes that are downregulated, in strong
age-1 mutants, than are upregulated. A ‘heat map’ of gene expression, by gene and array [;
Fig. S1 (Supporting information)], better illustrates this imbalance. Of genes significantly altered in transcript levels (at FDR < 5%), 92% were attenuated by the stronger
age-1 alleles, while only 8% were upregulated (;
Fig. S1;
Table S1). It was this unexpected bias in significant transcript changes that prompted the second set of microarrays, in which the same strong
age-1 mutants were contrasted to wild-type (N2DRM) near-isogenic controls. As depicted in , a far less pronounced imbalance was seen in this comparison: at FDR< 5%, 39% of significant genes increased, while 61% decreased, in
age-1 F2 adults (see also
Table S2). The probability of such disparate proportions arising merely by chance, as in two random samplings from the same distribution of transcript levels, is
P < 10
−17 by chi-squared test.
The intersection of these two lists comprises genes that changed significantly in both comparisons: just 20 that increased and 104 that decreased, in strong age-1 mutants relative to each control. Together, these make up 37% of the genes that were differential with respect to N2DRM and 32% of genes in the age-1(hx546) comparison, whereas > 60% of differential genes were unique to each comparison.
The 23 genes significantly upregulated in strong
age-1 alleles, vs.
hx546 (
Table S1a), include two of the three
C. elegans catalase genes,
ctl-2 and
ctl-3; two genes involved in nucleoside diphosphate metabolism (
mig-23,
ndx-1); the
ace-3 gene encoding an acetylcholinesterase; and two genes of lipid metabolism (
dhs-3,
lips-2). Among these 23 genes, RNAi had been reported to shorten lifespan only for the catalase gene
ctl-2 (
Murphy et al., 2003).
Of 253 genes significantly downregulated in strong
age-1 mutants relative to the weaker
age-1(hx546) allele (
Table S1b), longevity has been reported as increased by RNAi targeting five genes, but reduced for two. The genes (function of encoded protein) for which RNAi extends life are Y71G12B.4 (a mono-oxygenase),
akt-2 (part of the insulin/IGF-1 signaling pathway),
erm-1 (a cytoskeletal linker protein),
dyf-2 (involved in sensory cilia) and Y105C5B.12b (S-adenosylmethionine synthetase 2). A sixth gene reported to extend life by RNAi,
coq-1 (required for ubiquinone/coQ9 biosynthesis), was underexpressed at marginal significance (FDR = 5.4%). Two further genes were reported to shorten life upon RNA interference:
lin-40 (part of a complex mediating histone deacetylation and transcriptional repression) and
gdi-1 (a Rab-GDP dissociation inhibitor). Taken together, six results agreed with expectation (lifespan up for five genes downregulated with increased lifespan, but down for one upregulated gene), based on allele-specific transcript levels. These data imply a modest 1.6- to 1.8-fold enrichment of RNAi life-span effects (exact Binomial
P = 0.12–0.19 for
X = 6,
n = 276), over the 1.2–1.4% frequency found in unbiased screens for RNAi effects on nematode longevity (
Lee et al., 2003b;
Hamilton et al., 2005;
Hansen et al., 2005).
Among the genes significantly downregulated by strong age-1 alleles relative to hx546 (exceptions are shown as [upregulated]), we find genes that encode 16 predicted ubiquitin-complex components (ubl-5, ufd-1, uba-2, ubc-22, nhl-3, tag-214, zer-1, rnf-113, C06C3.5, T20F5.6, T12E12.1, C44E4.7, C27A12.7b, F16B4.6, Y71F9AL.10, K09F6.7); 22 membrane receptors (aph-1, npr-9, sra-1, srab-21, srab-25, srbc-62, srh-140, srh-184, sri-43, srx-90, srz-28, [str-18], str-74, str-116, str-129, str-151, str-172, gar-2, C35A11.1, F21C10.12, F39B3.2, Y38F2AR.3); 11 transcription factors (hif-1, ngn-1, btf-1, athp-1, ztf-3, ztf-9, egl-5, C23H3.3, C27D6.4, Y43F8B.13, C55A6.9); two RNA polymerase subunits (Y105E8A.23, rbp-6); three protein/histone deacetylases (mys-3, mys-4, lin-40); four DNA repair enzymes (pms-2, him-6, rad54, Y43F8B.13); two cytochromes P450 (cyp-35A5, cyp-35C1); three K+-channel and one Na+-channel protein (twk-11, twk-21, C27F2.6, T28D9.7); eight other transporter proteins (chtl-1, dyf-2, cft-1, imb-2, cho-1, K08C7.1, F16H11.3, C13C4.5, H11E01.2); two FMRF-like peptides (flp-4, flp-11); six RNA splicing factors (lsm-1, prpf-4, rsr-1, [rsp-5], Y76B12C.4, F28C1.1); three chaperones (pfd-6, [uri-1], T24H7.2); four lipid dehydrogenases (sdz-1, alh-9, alh-11, [dhs-3]); seven other genes implicated in lipid metabolism (lact-3, W05E10.2, Y105C5B.12, C56E10.3, K02E10.5, C06A5.10, [lips-2]); three thioredoxin-related proteins (txl-1, ZK973.11, R06B9.1); seven F-box proteins (fbxa-24, -43, -73, -90, fbxb-60, C10E2.2, ZK262.8) and two UMP synthetases (ZK228.7, R12E2.11).
Over-representation of upstream motifs The bias toward downregulation of genes in strong
age-1 mutants suggested the possibility that these changes may be mediated by one or more transcription factors that are subject to transcriptional regulation by
age-1, acting through DAF-16 and/or other factors affected post-translationally by PIP
3 or a PIP
3-dependant kinase. We screened for oligomeric sequences (six, seven or eight nucleotides) over-represented within 1 kilobase (kb) upstream of the 184 most significantly downregulated genes from the comparison of strong
age-1 alleles to
age-1(hx546) (at FDR< 0.7%,
Table S1b). Oligo-analysis in Regulatory Sequence Analysis Tools (RSAT) (
http://rsat.ulb.ac.be/rsat/) identified 12 octamers as the most significantly enriched upstream motifs. Overlap among these patterns implicated three highly over-represented consensus sequences of 9–11 nucleotides each: TTC(
A/
T)GAAAAT(T) (
P < 10
−21, binomial ‘E-value’ after Bonferroni adjustment); CTACAGTAA(C)(C) (
P < 10
−19); and AGAGA(
A/
C)G(
A/
C)AGA (
P < 10
−14). Within 1-kb downstream of the same genes, only one significant motif was found: CTGAAAAT(
T/
G) at
P < 10
−16, specifying a subset of TTC(
A/
T)GAAAAT(T). None of the above sequences is associated with specific binding of any known transcription factor. Near-identical results were obtained (with slightly reduced significance) on expanding upstream regions to 2 kb or on increasing the input gene set to include all significant genes. For purposes of comparison, we also analyzed 1-kb regions preceding all genes significantly downregulated in
age-1 alleles relative to N2DRM controls. The same three upstream motifs defined in the first comparison were here observed at greatly reduced frequency and significance, contributing only six of the 16 octamers enriched at 10
−7 <
P < 0.01. These consensus motifs thus appear to be rather specific to genes whose expression is downregulated by strong but not weak
age-1 alleles.
The same set of 184 upstream sequences, preceding genes downregulated in strong age-1 mutants (relative to the weaker hx546 allele), was also screened for transcription-factor binding sites from the Genomatix™ library. These consensus sites comprise over 750 nucleotide-frequency matrices, each summarizing many defined binding sites (on average, 36 per matrix) from diverse taxa. As a more conservative significance criterion than the P-values reported by Genomatix, we calculated the ‘P ratio’ for each site: the ratio of its binomial P-value from the set of 184 downregulated genes, to that obtained for 200 randomly selected upstream sequences. We observed 19 vertebrate sites that are significantly over-represented in 1-kb regions preceding downregulate genes, with P ratios ranging from 10−4 to 10−16. Twelve of these motifs are present upstream of 95–100% of the 184 most downregulated genes ().
| Table 1Transcription-factor binding motifs (vertebrate consensus matrices) enriched in 1-kb regions immediately preceding start sites of 184 age-1-downregulated C. elegans genes. These sites are defined as nucleotide-frequency matrices describing vertebrate (more ...) |
The consensus matrices were compiled from vertebrate databases, but are presumed to include related DNA-binding sites recognized by nematode proteins. A ForkHead binding motif, FKHD, might be expected to adjoin many of these genes, because most of them are regulated by DAF-16/FOXO. Although over 99% of the implicated genes (183/184,
P < 7 × 10
−3) are preceded within a kilobase by a FKHD site (actually an amalgamation of 16 distinct ForkHead binding sites defined by probability matrices), only 28% (51/184) have one or more exact matches with either the
C. elegans consensus DAF-16 binding element (DBE), GTAAA(
C/
T)AA or a second DAF-16-associated element (DAE), CTTATCA (
Kenyon & Murphy, 2006;
Oh et al., 2006).
Why should there by less enrichment of two DAF-16-associated consensus sequences than of other regulatory motifs (either the 11-mers discussed above or transcription-factor binding sites)? This simply reflects the experimental design, in which strong age-1 mutants were contrasted to a weaker age-1 allele. Because these strains to some extent share the same DAF-16-mediated gene set, whereas the SAM gene-selection procedures are designed to detect strain differences, SAM results will necessarily tend to be depleted in DAF-16 consensus binding sites.
Gene ontology analysis Gene sets implicated by microarray assays often show enrichment of genes with similar biological functions. Of the genes identified by SAM to have differential expression (FDR ≤ 0.1) between strong
age-1 alleles (
mg44 and
m333) and the weaker
hx546 allele, 382 are annotated in WormBase (
http://www.wormbase.org). These genes were evaluated for enrichment of functional annotation terms, via penalty-modified Fisher’s exact tests (‘EASE scores’) as implemented in DAVID version 6 (2008;
http://david.abcc.ncifcrf.gov) (
Dennis et al., 2003;
Huang et al., 2009).
Within this set of genes, 15 nonredundant gene ontology (GO) terms were significantly enriched, at < 5% FDR for this assessment (). These terms designate 30 metal-binding proteins, 25 of which bind zinc; 24 hydrolases; 21 ATP-binding proteins, including 14 kinases and seven Ser/Thr protein kinases; 17 genes involved in alternative splicing of RNA; 20 transferases; 20 nucleotide-binding proteins; 17 zinc finger proteins, of which 13 bind DNA; 19 transmembrane proteins; 17 nuclear proteins; 107 genes involved in cellular metabolic processes and 11 oxidoreductases. Each of these terms was chiefly enriched in the downregulated group, as indicated by arrows (↓) in the second column, with some further contribution from upregulated genes.
| Table 2Analysis of enrichment of functional annotation terms. Over-representation of gene ontology (GO) terms was assessed among 382 annotated genes differentially expressed in F2 adults homozygous for strong age-1 alleles (mg44 and m333), relative to the weaker (more ...) |
We also analyzed GO terms for the second gene set, 976 annotated genes with expression altered in strong age-1 alleles relative to N2DRM controls (at FDR ≤ 0.1). This set duplicates only 28% of the genes in the first set, but shares ten of the 15 significant functional-annotation groups listed above and approaches significance for another three. A further eight GO terms distinguish the second set, being largely absent from the first: 48 receptor designations, 25 transport, eight SH2-domain terms, ten instances of WD repeats, six of ribonucleoprotein, 53 membrane (overlapping but not identical to the transmembrane group), 16 glycoprotein and six cytoplasm labels. As noted in the preceding section, the comparison of strong age-1 alleles to age-1(hx546) is expected to cancel out much of the effect on genes regulated via DAF-16 and that may account for the absence (or reduced presence) of these eight GO categories.
Conversely, several terms were more prominent when strong age-1 alleles were contrasted to hx546 than when they were compared with N2DRM. Of the 15 terms significantly enriched in the first comparison, ‘alternative splicing’ and ‘kinase’ were less enriched and less significant in the N2DRM comparison, while ‘cellular metabolic process’ and ‘DNA binding’ were absent from those results (i.e., they did not meet the criteria of an EASE score of ≥ 0.1 and 2 or more genes per term), despite the inclusion of over 2.5 times as many genes in the latter case. Such a shift implies the loss of positive transcriptional regulators in the strong age-1 mutants or the recruitment of negative regulators that are not invoked by the weaker allele. This is entirely consistent with the conclusions from the preceding section, from analyses of upstream regulatory motifs.
These
age-1 GO patterns differ considerably from those reported for dauer larvae or long-lived
daf-2 adults (
Murphy et al., 2003;
McElwee et al., 2004,
2006;
Murphy, 2006) or for normal aging (
Lund et al., 2002;
Kim, 2007). As an example, 246 genes identified in an aging-profile study (
Lund et al., 2002) included 12 insulin-related genes and six nuclear hormone receptors, but relatively sparse representation of the genes most enriched here, e.g. those encoding hydrolases, zinc-finger proteins or other metal-binding proteins. To provide a more direct comparison, DAVID was used to assess the annotation of 514 genes differentially expressed between worms defective in
daf-2 expression (chiefly through RNAi) and controls (exposed to RNAi targeting both
daf-2 and
daf-16) in a quite extensive microarray study (
Murphy et al., 2003), and to ~2000 dauer-specific genes specified by a careful reanalysis (
McElwee et al., 2004,
2006) of microarray raw data generated previously (
Wang & Kim, 2003), comparing N2DRM as dauer larvae to 12 h postrecovery. As noted above, arrows are used in to indicate enrichment primarily in strong-
age-1- or dauer-upregulated gene sets (↑), enrichment in strong-
age-1- or dauer-downregulated gene sets (↓) or both (↑↓). The
daf-2 gene sets were not separated by direction of change.
These results, summarized in , illustrate both similarities and differences in expression profiles among these groups. Genes annotated as impacting adult lifespan are especially prominent in the
daf-2 set [in some measure as a result of the prior study (
Murphy et al., 2003)], but failed to attain significance in
age-1 and dauer comparisons. Nine significant terms are shared between the
daf-2 and
age-1 studies; of these, metal binding, hydrolase and transmembrane are comparably enriched, whereas zinc, alternative splicing, transferase and nucleotide binding are more enriched within one or both
age-1 gene sets, and 12 additional terms are enriched only in those sets but not in the
daf-2 comparison. Several terms (e.g. storage protein and trehalose metabolism), are unique to the
daf-2 study, but many more are shared only with dauer larvae.
Pearson correlation coefficients were calculated for the ‘fold-enrichment’ factors reported in these four DAVID analyses. Significant correlations were seen between the two
age-1 gene-annotation enrichments (
R = 0.59,
n = 19,
P < 0.02), and between the
daf-2 and dauer enrichments (
R = 0.47,
n = 23,
P< 0.04). However, enrichments in the dauer gene list have essentially no correlation to those for strong-
age-1 alleles vs.
hx546, and an inverse correlation (
R = −0.32) to those for strong-
age-1 alleles vs. N2 [see
Table S3 (Supporting information)].
This side-by-side comparison is intended primarily to make the point that the present transcript analysis is not redundant with previous studies of disruptions to the same pathway. Specific changes must be interpreted with caution, however, because enrichment of gene-ontology categories can depend strongly on the choice of control strains and on many subtler differences between the experimental protocols employed.