In higher eukaryotes, pre-mRNA alternative splicing is a powerful and versatile regulatory mechanism by which single genes can produce multiple mRNAs. The final draft of the human genome contains only 20

000–25

000 genes (
1), which encode several fold this number of proteins. This fact, plus estimates that >60% of human genes are alternatively spliced (
2,
3), underscores the importance of alternative splicing for the diversity of the human proteome. Alternative splicing events are regulated by
trans-acting factors that bind to regulatory
cis-acting elements within pre-mRNAs (
4,
5). These
trans-acting factors can be expressed either in a tissue- or in a developmental stage-specific manner or ubiquitously, making the ratio between
trans-acting factors important in the regulation of alternative splicing (
6,
7).
One family of splicing regulators that controls developmentally regulated and tissue-specific splicing events is the CUG-BP and ETR-3-like factor (CELF) protein family [also known as Bruno-like (BRUNOL) proteins] (
8,
9). There are six CELF paralogs in human and mouse. CELF proteins are well-characterized positive regulators of human and chicken cardiac troponin T (cTNT) exon 5 alternative splicing and bind intronic elements called muscle specific elements (MSEs) downstream of the alternative exon (
8,
10–
12). CELF proteins have also been shown to regulate splicing of human insulin receptor (IR) exon 11 (
13), human muscle-specific chloride channel (ClC-1) intron 2 (
14), rat
N-methyl-
d-aspartate (NMDA) receptor R1 exon 5 and exon 21 (
15), alpha actinin NM and SM exons (
16), human cystic fibrosis transmembrane conductance regulator (CFTR) exon 9, human myotubularin-related 1 (MTMR 1) exon 2.1 (
17) and human polypyrimidine tract binding protein (PTB) exon 11 (
18). Interestingly, CELF proteins can have either positive (e.g. cTNT, NMDA R1 exon 21 and PTB) or negative (e.g. IR, NMDA R1 exon 5 and ClC-1) effects on splicing of different target pre-mRNAs. Systematic evolution of ligands by exponential enrichment (SELEX) analysis of one CELF protein, ETR-3 (CUG-BP2, BRUNOL3, NAPOR), identified preferred U/G-rich binding sites that are found within introns near several of the exons listed above. Insertion of U/G-rich motifs, representative of the selected sequences into a cTNT minigene, made non-responsive to ETR-3 restored responsiveness not only to ETR-3 but also to two CELF paralogs, CUG-BP1 (BRUNOL2) and CELF4 (BRUNOL4) (
17).
CELF proteins contain two N-terminal and one C-terminal RNA recognition motif (RRM) domains separated by a 160–230 residue divergent domain. The divergent domain shows limited homology between CELF protein family members and no overt homology to other proteins. To define regions required for splicing regulation, we have performed a functional analysis of two CELF protein family members, CELF4 and ETR-3, which represent the two different CELF family subgroups distinguished based on phylogenetic analysis and functional differences (
8).
In this study, we identified the regions of two CELF proteins required for splicing activation and repression. Our goals are to identify residues required for positive and negative regulation, determine whether the same residues are required for positive regulation of different exons activated by CELF proteins and to determine whether residues required for activation are separate from or overlap with residues required for repression. The CELF protein family consists of two subgroups: one contains CUG-BP1 and ETR-3, which share 78% amino acids sequence identity, and the other contains CELF3 (BRUNOL1), CELF4, CELF5 (BRUNOL5) and CELF6 (BRUNOL6), which share 62–66% amino acid sequence identity to each other and ~44% identity to CUG-BP1 (
8). We chose to use ETR-3 and CELF4 as representatives of each subgroup in these analyses.
For CELF4, we had previously demonstrated that RRM-1 and -2 plus 66 amino acids within the divergent domain were sufficient to function as a splicing activator for cTNT exon 5 (
19). In this study, we demonstrate that this CELF4 activation domain (66 amino acids within divergent domain) retains the ability to activate cTNT alternative splicing when fused to the similar but heterologous RNA-binding domain of ETR-3. In addition, we identified a minimal 20 residues (amino acids 239–258) within the divergent domain of CELF4 that are required to activate cTNT exon 5. In contrast, a different 20 residues within the CELF4 activation domain (amino acids 279–298) were required for the activation of PTB exon 11. Meanwhile, repression of NMDA R1 exon 5 required overlapping residues within both the regions required for the activation of cTNT and PTB exons.
While the regions required for CELF4 activity are relatively well defined, deletion analysis of the two putative activation domains previously identified within ETR-3 exhibited incremental effects. These results suggest that the residues required for activation by ETR-3 are spread within two separate 40- and 90-residue regions of the divergent domain. In addition, we found significant differences in the regions of ETR-3 required for the activation of cTNT exon 5 and NMDA R1 exon 21 and repression of IR exon 11. Our results suggest a surprising degree of diversity between the regions of CELF proteins that are required to perform the same function on different pre-mRNAs and a surprising similarity in the residues required for opposite functions on different pre-mRNAs.