We report here the identification of human, mouse and
Drosophila orthologs
of the
S.cerevisiae MMS19 gene.
Alignments of these as well as several other apparent orthologs
with the yeast Mms19 protein show extensive amino acid sequence conservation
as well as similarity in size. In particular, we have identified
highly conserved HEAT repeat domains in the C-terminus
of this family of proteins. HEAT repeats have been identified in
a variety of cytoplasmic and nuclear eukaryotic proteins. The typical
repeat unit consists of 30–34 amino acid residues and forms
two α-helices separated by a short loop,
as inferred from the crystal structure of human β-importin
(
21). Arrays of HEAT repeats
form tandemly arranged bi-helical structures that appear to function
as scaffolds for assembly of other subunits of the corresponding
complexes (
21–
23). A common denominator for many of
these proteins is their association with and involvement in the
assembly of large, multisubunit protein complexes such as coated
vesicles and microtubules. The recent demonstration of the presence
of HEAT repeats in several chromatin-associated proteins, including
subunits of cohesins and condensins and several transcription regulators,
is particularly interesting (
20).
Functional complementation of mutant phenotypes by NER proteins
across species is the exception rather than the rule (
24,
25), presumably reflecting specific structural
requirements for efficient assembly of multiprotein complexes. Hence,
the observed complementation of thermosensitive growth and UV radiation
sensitivity of the yeast
mms19Δ mutant
with the
MMS19 genes from
Drosophila and
humans provides suggestive evidence for both structural and functional
conservation of Mms19 protein.
While this manuscript was in preparation cloning and characterization
of the h
MMS19 gene was independently published (
9). However, while the putative human polypeptide
protein identified in our study is precisely the same size, it differs
from the previously published sequence over a stretch of 39 amino acids
in the central part of the protein. This difference does not appear
to result from alternative splicing since we have characterized
the genomic structure corresponding to the coding region. Furthermore,
the mouse cDNA is 90% identical to our human cDNA clone
and this identity includes the 39 amino acids that differ from the
previously published translated sequence. We suggest that the previously
published sequence (
9) may have
one or more mutations in the ORF and that the failure to observe
correction of the yeast
mms19Δ mutant phenotype
may derive from such mutations. No GenBank accession number was
provided to verify this. Alternatively, the gene previously reported
might represent a slightly altered duplication of the y
MMS19 gene.
There is a precedent for duplication of a gene with an essential
role in NER and RNAP II transcription. Specifically, p44 protein
is a subunit of TFIIH that regulates the activity of XPD DNA helicase,
another TFIIH subunit. In humans two closely linked
p44 genes
have been identified (
p44t and
p44c),
which can encode proteins that differ by just three amino acids
(
26).
Using RT–PCR, molecular cloning and DNA sequencing
we have demonstrated that the h
MMS19 primary transcript
undergoes extensive processing and uses two different polyadenylation
sites. Extensive alternative splicing is not uncommon in genes that
encode transcription factors and may play a critical role(s) in
the regulation of developmental pathways and in cancer (
27,
28).
Differences in the 5′-UTR have been
implicated in regulating transcription efficiency (
29)
and different 3′-UTRs may be associated
with differential stability or translatability of various mRNA products
(reviewed in
30). Furthermore,
different protein isoforms may regulate or interact with distinct
sets of targets (
31,
32). For example, it was recently reported
that splice variants of the Wilms’ tumor 1 gene (
WT1) encode
proteins that have opposite effects on tumorigenicity (
33). It is possible that the h
MMS19 gene
evolved different protein isoforms with different functions.
The functional significance of the different
MMS19 cDNAs cloned
is supported by the identification of two major mRNAs in both human
and mouse northern blots. We suggest that the larger transcript
results from use of the second polyadenylation site and that this
choice correlates with a different promoter. However, we were unable
to precisely correlate the splice variants of
MMS19 detected
by RT–PCR and cloning with the transcripts detected by
northern analysis. Previous studies (
9) detected
a single transcript at ~4 kb in a mouse
northern blot. It is likely that this corresponds to our transcript
of ~3.8 kb.
The h
MMS19 gene maps to region 10q24 of the
human genome, a region frequently rearranged in human cancer (
34). However, no genetic disorders obviously
associated with NER or transcription defects have been mapped to
this region of the human genome. The precise function(s) of Mms19
protein remains elusive. It was recently demonstrated that cells
from the hereditary human disease trichothiodystrophy (TTD), belonging
to genetic complementation group A (TTD-A), have reduced levels
of TFIIH activity (
35). This
observation led to the suggestion that the putative TTD-A protein
normally determines the stability of the TFIIH complex.
MMS19 is
an attractive candidate for such a function. As mentioned earlier, RNAP
II transcription in yeast
mms19Δ extracts
is inactivated by heat treatment (
5).
This thermolabile defect can be fully complemented by purified TFIIH,
but not by purified yeast Mms19 protein, suggesting that Mms19 protein
indeed influences the functional integrity of TFIIH (
5). Consistent with this notion, the previously
published form of human Mms19 protein was shown to physically interact
with TFIIH through its XPD and XPB subunits (
9).
The observation of conserved HEAT repeats in the Mms19 proteins
is consistent with a role of this protein in the assembly of TFIIH-containing
complexes for NER and RNAP II transcription.
The genetic complementation group TTD-A is presently represented
by a single patient (TTD1BR) which complemented the phenotypes of
TTD cells with defects in TFIIH subunit XPB or XPD (
36).
We were unable to correct the NER defect of TTD1BR cells following
microinjection of a construct containing full-length h
MMS19 cDNA.
However, we cannot exclude the possibility of a mutation in a regulatory region
of
MMS19 in TTD-A cells. Likewise, we cannot exclude
differences in the relative abundance of specific
MMS19 splice
variants in the TTD-A disorder or in the several TTD-like
syndromes previously described (
37 and
references therein). Experiments are in progress to evaluate these
hypotheses.