This study constitutes the first systematic analysis of the influence of nitrogen on the yeast transcriptome. We find that the levels of expression of almost 10% of all protein-encoding yeast genes (506 out of 5,690) vary significantly according to the nature of the unique nitrogen source available in the medium. This list of 506 genes is not significantly enriched in genes responding to stress. Even in cells grown on tryptophan, a nitrogen source supporting very slow growth, the expression levels of genes normally up- or down-regulated under stress conditions is not significantly different from that observed on nitrogen sources supporting fast growth. We believe that the lack of a stress response in the cells examined in this study is due to harvesting during steady-state growth. The variation in expression of the many genes subject to the ESR or CER (
20,
47) is most visible in cells subjected to various perturbations of their environment, e.g., a shift to a different nutrient supply source, temperature, pH, or osmolarity. Also in support of this view, the transcriptome of yeast was recently examined in cells growing under steady-state conditions on proline or glutamine as the sole nitrogen source and in cells shifted for 2 hours from glutamine to proline (
112). This analysis revealed a highly significant variation in the levels of expression of ESR/CER target genes in cells shifted from glutamine to proline but no significant difference in the expression levels of these genes between cells engaged in steady-state growth on one or the other medium.
We have used the expression data of nitrogen-regulated genes (i) to classify the nitrogen sources, (ii) to inventory the nitrogen-sensitive transcriptional controls and the corresponding target genes, and (iii) to evaluate the degree of activation of these transcriptional circuits on all tested nitrogen sources (Fig. ). We have also tried to infer a function for some nitrogen-regulated orphan genes.
Most nitrogen sources fall into two main groups according to the supported growth rate and to the wide-spectrum transcriptional controls occurring in yeast cells growing on these sources (Fig. and ). Group A comprises asparagine, glutamine, serine, ammonium, aspartate, alanine, arginine, and glutamate. On these nitrogen sources, growth is rapid and NCR occurs. This result is essentially in agreement with the concept that NCR is active on good (preferential) and inactive on poor (nonpreferential) nitrogen sources (
26,
85,
136). Yet we have also observed that the level of NCR does not correlate exactly with the generation time. For instance, NCR is partially relieved on glutamate and aspartate, despite rapid growth. Our study, furthermore, has revealed new genes potentially subject to NCR. Starting from a list of 41 established NCR target genes and using our expression data combined with data generated by other genome-wide studies or provided by the analysis of upstream gene sequences, we have extended this list with 44 additional genes, and experiments carried out on 18 of them confirmed that they are sensitive to NCR (see Table S2 in the supplemental material). Among the new NCR target genes are
GDH1 and
GLT1. The enzymes Gdh1 (anabolic glutamate dehydrogenase), Gdh2 (catabolic glutamate dehydrogenase), Glt1 (glutamate synthase), and Gln1 (glutamine synthetase) constitute the hub of yeast nitrogen metabolism (Fig. ) (
85,
136). Among them,
GLN1 and
GDH2 were previously established as being sensitive to NCR (
84), but
GDH1 and
GLT1 were not, although observations consistent with this view have been reported (
65,
129). It thus seems that the complete set of enzymes forming the heart of nitrogen metabolism in yeast is subject to NCR. Many of the new probable NCR target genes have known functions, and some code for proteins directly involved in nitrogen transport or metabolism. The functions of about 30 others, however, are still unknown. For 14 of these genes, clues to their roles in cell metabolism have been derived from in silico analyses of their protein sequences (see Table S2 in the supplemental material). Furthermore, we have found that two loci defined as pseudogenes in strain S288C are true protein-encoding genes in the Σ1278b strain (used in this study) and respond to NCR. One (
YIL167-168W) encodes a homolog of serine/threonine deaminase and the other (
YIL164-165C) a homolog of nitrilases. We also note that group A nitrogen sources are characterized by activation of the UPR pathway, a transcriptional control circuit typically activated when incorrectly folded proteins accumulate in the secretory pathway (
83,
99,
120). Conversely, the expression of UPR target genes is reported to be reduced when cells are shifted to nitrogen starvation conditions (
113). This suggests that the higher protein synthesis rate of cells growing on a nitrogen-rich medium is associated with a greater number of poorly folded proteins, resulting in UPR activation (
113). Furthermore, other observations suggest that the UPR and signaling pathways involved in pseudohyphal growth or meiosis of diploid cells might be more connected than anticipated (
113,
114).
The group B nitrogen sources comprise leucine, isoleucine, methionine, threonine, tryptophan, and tyrosine. On these nitrogen sources, growth is particularly slow and NCR does not occur. Furthermore, GAAC is activated on these nitrogen media (Fig. and ). The importance of GAAC in cells growing on group B amino acids is also illustrated by the specific growth defect of a
gcn2Δ mutant on the corresponding media. In agreement with GAAC being particularly active under these conditions, the increased activities of two amino acid-biosynthetic enzymes, namely, indole-3-glycerol-phosphate synthase (
TRP3) and arginosuccinate lyase (
ARG4), have been observed on media containing isoleucine, leucine, methionine, threonine, or tyrosine as the sole nitrogen source (
92). Furthermore, this activation depends on Ndr1/Gcn1 (
92), a positive regulator of the function of Gcn2 (
58). It has been proposed that the activation of GAAC during growth on these amino acids is due to amino acid imbalances caused, for instance, by feedback inhibition of enzymes shared by branched amino acid-biosynthetic pathways (
92). In support of this view, the presence of an additional amino acid relieving the amino acid imbalance also leads to a reduction of GAAC (
92).
Another difference between the group A and group B nitrogen sources lies in the fate of the carbon derivatives resulting from the catabolism of these compounds. Whereas the transamination or deamination of group A nitrogen sources yields derivatives directly assimilable by the cell metabolism (Fig. ), the transamination of group B compounds leads to keto acids undergoing decarboxylation to aldehydes which are in turn converted by dehydrogenases into long-chain or complex alcohols. The amino acid derivatives generated by this so-called Ehrlich pathway (
100,
115,
135) are toxic and thus excreted by the cell, and this contributes to the formation of fusel oils. The mechanisms involved in the excretion of aldehydes and higher alcohols remain unknown. In this respect it is noteworthy that growth on tryptophan (and to a lesser extent on tyrosine) leads to the up-regulation of several genes known to be sensitive to the PDR transcriptional control. These genes encode plasma membrane transporters known to promote the excretion of drugs: Snq2 and Yor1 are ATP binding cassette transporters (
107), and Flr1 and Qdr3 are probable proton antiporters (
91). Our observation that these genes are up-regulated on tryptophan medium suggests that their expression products may play a direct role in the excretion of certain derivatives of tryptophan catabolism, such as tryptophol.
The enzymes involved in the catabolism of the group B nitrogen sources are generally not nearly as well known as those contributing to the degradation of the other nitrogen sources. The main transaminase and the decarboxylase involved in degrading aromatic amino acids (phenylalanine, tryptophan, and tyrosine) have been identified (
64,
133); the corresponding genes (
ARO9 and
ARO10) are controlled by the Aro80 transcription factor (
62). Our results reveal that the
ARO regulon is induced not only during growth on aromatic amino acids but also in cells grown on any other group B nitrogen source. This strongly suggests a possible involvement of the Aro9 and Aro10 enzymes in the catabolism of all these compounds. In agreement with this view, it was recently shown that Aro10 is a broad-spectrum decarboxylase (
100,
132), and it is also known that Aro9 exhibits specificity for a wide range of substrates (
76,
127). We have also observed that the
ALT1/YLR089C gene, encoding an alanine transaminase homolog, shows higher expression on methionine, isoleucine, citrulline, valine, and alanine than on the other tested nitrogen sources. Hence, the protein encoded by this gene might be involved in the catabolism of these amino acids as well. The involvement of multiple, partially redundant enzymes displaying overlapping substrate specificities very likely explains why classical genetics-based approaches have not allowed effective dissection of the catabolic pathways of group B amino acids, in marked contrast to their successful use in dissecting amino acid anabolic pathways (
69). We have also extended the
ARO regulon involved in amino acid catabolism to two other genes, one of which (
ESBP6/MCH3) encodes a putative transporter that localizes to the inner mitochondrial membrane (
87). As Aro9 and Aro10 are cytoplasmic enzymes (
61), Mch3 might be responsible for the transport of aldehydes into mitochondria, where several alcohol dehydrogenase activities have been detected (
42).
A major difference between the two groups of nitrogen sources defined above is that NCR is active and GAAC is inactive on group A compounds, whereas the opposite is true on group B compounds. This raises the question of whether links might exist between the GAAC and NCR regulations. Our data reveal that the expression of
GCN4, encoding the principal transcription factor of GAAC, depends on the nitrogen source supplied and is subject to NCR. On the other hand, previous work has highlighted a contribution of Gcn4 to NCR (
118). Together these data suggest a possible general cross-regulation between the two major transcriptional controls of nitrogenous anabolism and catabolism in yeast, in which NCR (most active on good nitrogen sources) would down-regulate GAAC, which in turn would up-regulate NCR. Yet on several other nitrogen sources (valine, phenylalanine, ornithine, proline, citrulline, and urea), neither of these two major transcriptional control circuits appears to be very active (Fig. ). Further experiments will be required to investigate possible molecular links between NCR and GAAC.
Some regulons are active on both group A and group B nitrogen sources (Fig. ). This is the case, for instance, for the regulon formed by several amino acid permease genes induced by various external amino acids via the SPS sensor system and the Stp1, Stp2, and Uga35/Dal81 transcription factors (
15,
46). We have observed that the target genes of the SPS regulon display quite divergent expression profiles when tested on the different nitrogen sources used here. This was not unexpected, since a similar result was obtained with a
lacZ reporter fused to several of these permease genes (our unpublished data). It is likely that the different induction profiles of the genes of this regulon reflect their responsiveness to additional transcriptional controls, whose nature and importance vary from one gene to another. Many other nitrogen-responsive genes share this property of being under a combination of transcriptional controls rather than a single one. This likely explains why hierarchical clustering based on their expression profiles often show these genes to be scattered among distinct clusters of coexpressed genes (see below). Another regulon is less active on various nitrogen sources (especially glutamate and proline); it comprises the genes inhibited by glutamate via the RTG control (
22,
79) (Fig. ). These genes can be expected to be poorly expressed on proline medium, because the internal pool of glutamate is high on this medium. The average expression of genes subject to RTG control is also lower on group A nitrogen sources. This may again reflect a high glutamate pool, to be expected in the presence of a good nitrogen source.
Finally, several regulons are most active in cells grown on a more limited number of nitrogen sources (Fig. ). Among them is the one inducible by GABA (
121) and for which we identified two new target genes,
MAE1 and
AMD2. The
MAE1 gene encodes a mitochondrial malic enzyme (
16) that might contribute to optimizing the catabolism of succinate deriving from GABA catabolism (Fig. ). We have also identified
MMF1 as a new potential gene of the
CHA regulon inducible by serine and threonine. A significant fraction of the 506 genes show weak though significant differential expression according to the nitrogen source code for transcription factors (22 genes) or for other proteins having a regulatory function (15 genes). Some of these regulatory genes encode proteins known to be directly involved in the regulation of nitrogen metabolism genes. For instance, we mention above a role of NCR in the transcriptional control of the
GCN4 gene, while this gene appears to be required for optimal NCR (
118). We have also identified two probable NCR target genes encoding potentially interesting regulatory factors, including one (
LEE1/
YPL054W) which may be involved in controlling mRNA stability. Experiments are in progress to test whether this protein plays such a role. Many other regulatory genes apparently controlled by nitrogen are involved in other metabolic pathways or cell processes. Further experiments will be needed to determine whether the apparent regulation by nitrogen of these regulatory genes contributes to the overall influence of nitrogen on other cellular pathways.
We have thus explored in a systematic manner the influence of different nitrogen sources on the yeast transcriptome. The same type of analysis could be used to acquire a general view of the transcriptional regulations involved in carbon, sulfur, and phosphorous metabolism. So far, genomic studies have focused on the mechanisms of adaptation to environmental change and especially to starvation (
14,
47,
94,
138). Yet just as yeast is able to utilize many different nitrogen sources, it can also utilize different sources of carbon, sulfur, and/or phosphorous. A systematic study of the yeast transcriptome during growth on all these alternative nutrient sources should provide a more comprehensive view of the metabolic potentialities of yeast and of the associated transcriptional control mechanisms. It should also help to attribute potential functions to the still large number of orphan genes of yeast.