Specific proteolytic cleavage plays a critical role in many biological pathways, including the liberation of SREBP from the endoplasmic reticulum, the release of Notch from the plasma membrane, the maturation of hedgehog signaling molecules, and the activation of caspases and their subsequent cleavage of death substrates during apoptosis. Here, we demonstrate that mammalian MLL is proteolytically processed at cleavage sites which follow P1 aspartic acid residues located within two highly conserved D/GVDD and D/GADD motifs. The precise protease which mediates this cleavage remains to be determined but is likely to be the same for the similar CS1 and CS2 motifs. Of note, we have tested inhibitors of calpain, the 26S proteasome, and caspases and demonstrated no substantial effect on the cleavage. We also noted a D/GVDD motif deleted in a
Drosophila TRX that is located within a 281-aa region deleted in a described mutant
trx allele,
trxE3 (
32). Unlike null mutations of
trx that fail to maintain the expression of homeotic genes of the
bithorax (BX-C) and
antennapedia (ANT-C) complexes (
3),
trxE3 displays selectively reduced expression of
ANT-C but not
BX-C genes (
28). Subsequent studies using antibodies recognizing different regions of trx are consistent with the notion that this recognized motif serves as a proteolytic site (
19). The homeotic defects in
trxE3 argue that cleavage of trx has a developmental consequence.
The precise mechanisms whereby MLL exerts its principal function to maintain the expression of target genes, including
Hox genes, remain uncertain. However, the presence of conserved motifs with MLL provides some insight. MLL possesses domains with ascribed biochemical properties: the three AT hook motifs provide minor-groove DNA binding (
31,
36), the DNMT homology domain binds methylated DNA differentially and can repress transcription (
8,
11,
22,
36), a conserved subnuclear localization signal directs punctate nuclear distribution (
32), the PHD fingers mediate protein-protein interaction (
14), the transactivation domain interacts with CREB-binding protein (
13), and the highly conserved SET domain binds to histone H3, Sbf1, and ASH1, as well as itself (
2,
9,
17,
25). Here we identify the D/GVDD and D/GADD motifs and assign a role for them in MLL processing as well as identifying roles for the FYRN and FYRC domains. The FYRN and the FYRC plus SET domains mediate the interaction between the N320 and C180 subunits of MLL. Of note,
Drosophila TRX also possesses FYRN and FYRC motifs, suggesting that it may also form a complex of processed fragments. Two distant MLL family members, ALR and MLL3, are present in the human genome (
23,
29) but lack the D/GVDD cleavage motifs. Interestingly, they have a varied organization of domains compared to MLL, possessing a PHD domain, a FYRN domain, a FYRC domain, and a SET domain all clustered within the C-terminal 450 aa of ALR and MLL3. This highly conserved architecture is also present in
Caenorhabditis elegans and
Drosophila ALR homologues (
27). The interaction between the FYRN and the FYRC plus SET domains noted here for MLL raises a testable possibility that ALR and MLL3 may have clustered these domains to create a similar functional moiety. Perhaps one role for the cleavage of MLL is to physically juxtapose these previously separated domains.
Chromosomal translocations in human leukemia which disrupt MLL produce a chimeric protein in which the N-terminal portion of MLL is fused to 1 of more than 20 partners. These fusion partners do not have any recognizable motif in common, and they range widely from nuclear factors to cytoplasmic proteins and cell junction proteins. Gene expression profiles of human leukemias bearing an
MLL translocation identified a pattern of upregulated genes. Among these genes were some of the well-recognized targets of wt MLL, including
HOXA4,
HOXA5, and
HOXA9 (
1,
33). This argues that the N-terminal 1,300-aa portion of MLL is sufficient to confer at least some target gene specificity to MLL fusion proteins. Mice expressing an MLL-AF9 fusion product progressed to acute myeloid leukemia, establishing the oncogenic potential of MLL chimeric proteins (
7). In contrast,
Mll-exon8Myc mice in which a small myc epitope tag was fused with MLL failed to develop leukemia, arguing that a truncated MLL fragment was not in itself sufficient to cause transformation. However,
Mll-exon8LacZ mice in which MLL was fused with the β-galactosidase protein did progress to acute leukemia (
10). β-Galactosidase has been extensively used in transgenic mouse models and plays no independent, demonstrated role in tumor formation. The compilation of these findings indicates that individual fusion partners play an indispensable role but that they need not be directly oncogenic. The observations here concerning protein instability may provide an explanation. The FYRN domain, required for interaction of the processed MLL fragments, is located downstream of the chromosomal breakpoint, indicating that MLL fusion proteins would lose interaction with the stabilizing C180 component. Consequently, one testable characteristic that might prove to be shared by all the partner proteins is the capacity to confer protein stability to the MLL N terminus.