The
STA1,
STA2, and
STA3 genes encode extracellular glucoamylase isozymes which enable
Saccharomyces cerevisiae cells to utilize starch as a carbon source (reviewed in references
38,
40, and
50). The three genes have nearly identical sequences, and are located on chromosomes II (
STA2), IV (
STA1), and XIV (
STA3). All three members of the
STA gene family are located in subtelomeric positions, similar to the
FLO (reviewed in reference
48),
SUC (reviewed in reference
15), and
MAL (reviewed in reference
32) gene families, which probably evolved through genomic duplications and chromosomal rearrangements. The 5′ upstream region of
STA1 and
STA2 (the nucleotide sequence of
STA3 has not previously been determined) is almost identical to that of
MUC1, which encodes a large membrane-bound, mucin-like protein that plays an important role in the processes of invasive growth, pseudohyphal development, and flocculation (
8,
23,
27,
28). The homology extends over more than 3,500 bp upstream of the ATG start codon and includes the first 60 bp of the open reading frame (ORF) encoding a secretion signal sequence (
52). With the exception of a few single nucleotide dissimilarities, the only significant differences between the promoters of
STA2 and
MUC1 are two inserts of 20 and 64 bp in the
MUC1 promoter, which are absent from the
STA2 promoter (
23). These inserts stretch from nucleotides −1333 to −1313 and −933 to −869, respectively. This very limited sequence divergence between the
STA and
MUC1 promoter regions suggests a recent origin of the
STA genes. The
STA genes probably evolved through a recombination and sequence duplication process between the promoter and signal sequence of
MUC1 and the ORF of the
SGA1 gene that encodes a sporulation-specific intracellular glucoamylase.
MUC1 and
SGA1 are located on the right and left arms of chromosome IX, respectively (
59,
60). Besides the strong sequence conservation between these genes, other arguments in favor of a recent origin of the
STA genes and of the proposed molecular mechanism are (i) the subtelomeric position of the
STA genes compared to the more central position of both
MUC1 and
SGA1; (ii) the presence of
STA genes in only some
S. cerevisiae strains, compared to the general presence of
MUC1 (
4,
59,
60) and
SGA1 (
59,
60) in all
S. cerevisiae strains investigated so far; and (iii) the existence of homologous repeated sequences on either side of the proposed junctions (
59,
60).
Analyses of the upstream areas of
STA1 (
1,
46),
STA2 (
22), and
MUC1 (
44) demonstrated that elements at distances of up to 2,800 bp from the translation start codon (ATG) are involved in the transcriptional control of the respective genes, therefore representing the largest
S. cerevisiae promoters identified to date (
7,
44). The
STA and
MUC1 upstream regions are therefore of particular interest from both an evolutionary and a functional point of view.
The extent of the promoter homology would suggest that genes involved in starch metabolism and pseudohyphal differentiation or invasive growth are coregulated to a large extent, and experimental data so far have supported this hypothesis. Lambrechts et al. (
23,
24) and Gagiano et al. (
8) showed that two transcriptional regulators, Msn1p and Mss11p, strongly induce transcription of both the
STA2 and
MUC1 genes when present on multiple-copy plasmids. Conversely, Δ
msn1 or Δ
mss11 strains show strongly reduced transcription of these genes. Furthermore, Lo and Dranginis (
28) demonstrated that
MUC1 is regulated by Ste12p, a transcription factor responsible for both pheromone-specific (reviewed in reference
21) and, in combination with the TEA (or ATTS) family transcription factor Tec1p, filamentation-specific (
9,
30) gene regulation. Gagiano et al. (
8) presented evidence that the same factor regulates the
STA genes in a similar way.
Other regulatory factors have so far only been associated with regulation of
MUC1 or
STA1,
STA2, and
STA3 independently. Recent data suggest that the transcription of
MUC1 might be specifically regulated by a network of signal transduction pathways which control invasive growth and pseudohyphal differentiation (
8,
28,
44). This network combines inputs from at least three interacting signal transduction modules, including (i) the filamentation-specific mitogen-activated protein (MAP) kinase cascade (
25,
31,
42), (ii) the cyclic AMP (cAMP) and cAMP-dependent kinase (
29,
43), and (iii) the cyclin-dependent kinase Cdc28p (
6). In addition to the result mentioned above that
MUC1 was subjected to MAP kinase-dependent regulation by Ste12p and Tec1p, the gene was shown to be regulated by cAMP levels, a regulation that occurs via Flo8p (
44), a transcription factor initially identified for its role in flocculation (
18). The gene was also shown to be negatively regulated by a suppressor of flocculation, Sfl1p, which interacts specifically with the yeast A kinase, Tpk2p, to repress
MUC1 transcription in the absence of a cAMP signal (
43).
Numerous data concerning the regulation of the
STA genes have been published. Expression of
STA1 to -
3 is negatively regulated at several levels. Transcription is repressed on most readily metabolized carbon sources, including glucose, sucrose, maltose, and galactose (
5,
17,
20,
41,
47). Carbon catabolite repression was reported to involve two separate pathways, of which one requires
HXK2 and the other
HAP2 (
17). It was also reported that repression of
STA2 does not require Mig1p, the common repressor of genes under carbon catabolite control.
MUC1 was also shown to be repressed in medium containing glucose as a carbon source (
8,
27), probably via the same mechanisms as
STA1 and
STA2. Transcription of
STA1 to -
3 is repressed in most, but not all, diploid strains of
S. cerevisiae (
5,
41). The mechanism through which repression occurs is not defined, since the removal of the putative
a1-α2 repressor binding sites from the
STA2 promoter does not relieve the repressive effect observed in diploid strains (
22). In rich medium,
MUC1 is also repressed in diploid strains, but under nitrogen starvation conditions it seems to be repressed more in haploid than diploid strains (
28,
44).
Most laboratory strains of
S. cerevisiae contain an undefined repressor,
STA10, which reduces transcription of the
STA1 to -
3 genes at least 20-fold (
37,
41). It was reported that the repressive effect of
STA10 results from interaction between two unlinked genes,
IST1 and
IST2 (
34), but this was not confirmed. The negative effects of several other genes (i.e.,
INH1 [
58],
SGL1 [
36], and
SNS1 and
MSS1 [
1]) on the transcription of the
STA genes have also been reported, but the relationships between these negatively acting genes and the repressive effect of
STA10 remain to be determined.
Transcription of
STA1 to -
3 is subject to the repressive effect of chromatin on promoters, since
SUD1, a component of a global chromatin-associated repressor of promoter activity, was shown to act on the
STA1 promoter (
57). Furthermore, transcription of
STA1 to -
3 also requires the presence of components of the SWI-SNF global activation complex (
13,
20,
33,
61–
63), which associates with the RNA polymerase holoenzyme at specific promoters and relieves the repressive effect of chromatin on transcription (
19,
55).
cis-acting promoter elements in several regions within the
STA1 (
1,
46),
STA2 (
22), and
MUC1 (
44) promoters were shown to be required for transcriptional regulation. Two areas hosting upstream activating sequences (UASs) (UAS1 between nucleotides −1390 and −1074 and UAS2 between nucleotides −1940 and −1815), as well as three upstream repression sequences (URSs), were identified in the
STA2 promoter (
22). URS1 was found to reside in the area between nucleotides −1390 and −1074, which also hosts UAS1. URS2 was identified between nucleotides −1650 and −1390 and URS3 upstream of position −2457. Similar regions were defined for the
STA1 promoter (
1,
46). A recent, more systematic, analysis of the
MUC1 promoter (
44) revealed a vast array of regulatory elements which confer the regulation of several nutritional and cell-type signals on
MUC1 expression levels. In good agreement with the previous studies of the highly homologous
STA1 and
STA2 promoters, four areas required for the activation of
MUC1 and nine areas required for the repression thereof were identified. The transcriptional activator encoded by
FLO8 was found to exert its activating effect through a 200-bp sequence stretching from nucleotides −1200 to −1000 in the upstream region of
MUC1 (
44).
The 5′ upstream areas of
MUC1,
STA1, and
STA2 are predicted to contain a single small ORF, YIR020c, of unknown function, situated from nucleotides −1285 to −882 in the upstream region of
MUC1. YIR020c lies in an area identified and experimentally defined as a regulatory region for
STA1,
STA2, and
MUC1, and other regulatory regions were shown to exist upstream of this ORF (
1,
22,
44,
46). Its occurrence therefore does not affect conclusions regarding the transcriptional regulation of
STA1,
STA2, or
MUC1, independently of whether this ORF encodes a functional protein or not.
The homologous sequences from nucleotides −1390 to −1074 of the
STA2 promoter and from nucleotides −1479 to −1136 of the
MUC1 promoter are of particular interest since they (i) have previously been identified as areas hosting a UAS as well as a URS (
22), (ii) confer increased levels of activity from a far-upstream position, and (iii) include one of the two significant differences between the upstream areas of
MUC1 and
STA1 to -
3 (a sequence of 20 bp that is deleted in the
STA2 promoter). The region might therefore contain an evolutionarily significant molecular change explaining differences in the regulation of
STA1 to -
3 and
MUC1.
In this paper, we compare expression levels conferred by the full MUC1 and STA2 promoters on reporter gene expression. We furthermore present a detailed analysis of the promoter region from nucleotides −1390 to −1074 of STA2 and the corresponding area of MUC1, from nucleotides −1479 to −1136. We show that these regions of MUC1 and STA2 confer both similar and divergent regulation and contain sequences involved in general repression as well as areas for (i) activation by the transcriptional activators encoded by MSN1 and MSS11, (ii) activation by the transcriptional activator encoded by FLO8, (iii) carbon catabolite repression, and (iv) diploid repression. Our data indicate that differences in expression levels observed between MUC1 and STA2 are largely due to the two deletions of 20 and 64 bp that have occurred in the STA promoters. We also show that the repressive effect identified as STA10 in most laboratory S. cerevisiae strains is due to the absence of the FLO8-encoded transcriptional activator. Epistatic analysis furthermore suggests that FLO8 requires or is situated upstream of MSS11, but acts independently of MSN1.