In seeking to explain the phenotypic disparity between mice lacking
Ocrl, which are normal, and humans lacking
OCRL, who have Lowe syndrome, we eliminated interspecies differences in the
OCRL/
Ocrl genes themselves. First, the human and mouse
OCRL/
Ocrl orthologs are 91% identical and 95% similar at the amino acid level. Both are expressed highly in nearly all tissues except lymphocytes in both species (Janne et al.
1998; Olivos-Glander et al.
1995). The only alternative splicing is conserved in the two species, resulting in inclusion or exclusion of a highly homologous 24-base exon (NM_000276.3 and NM_0001587.3, respectively, in human, NM_177215.3 and numerous ESTs, respectively, in mouse) located at the same position in the mRNA. There is also a highly unusual splice acceptor site, an AT, at the 3′ end of intron 2 in both human and mouse in
OCRL and
Ocrl that serves as acceptor for a canonical GT 5′ splice donor. Otherwise,
OCRL and
Ocrl splicing is constitutive, uses canonical sites, and generates exons of identical size in both species.
In contrast, the autosomal paralogs, mouse
Inpp5b and human
INPP5B, which are the closest paralogs to
Ocrl in the genome with many structural motifs in common with it, differ in a number of ways that might explain why the two species compensate for an Ocrl enzyme deficiency differently.
INPP5B and
Inpp5b are the only PtdIns 5-phosphatases to have a Rho-GAP domain along with the shared inositol phosphate 5-phosphatase domain. The rho-GAP domain is noncatalytically active but has affinity for the Rac (Faucherre et al.
2003) and ARF (Lichter-Konecki et al.
2006) small G proteins. The paralogs also share a pleckstrin-homology domain (Mao et al.
2009) and an ASH (ASPM, SPD-2, Hydin) domain (Ponting
2006). The ASH and Rho-GAP domains together interact with APPL1, a Rab5 effector protein (Erdmann et al.
2007; McCrea et al.
2008), and the endosomal proteins Ses1 and Ses2 (Swan et al.
2010). Ocrl, however, has a clathrin-binding domain (Choudhury et al.
2005,
2009) that is absent from Inpp5b (Erdmann et al.
2007). Inpp5b has similar intracellular localization to Ocrl (Erdmann et al.
2007) as well as similar, but not identical, substrate specificity for PtdIns(4,5)P
2 (Janne et al.
1998; Schmid et al.
2004). Finally, although cellular assays
ex vivo suggest that Inpp5b is not capable of complementing all of Ocrl’s functionality (Coon et al.
2009; Williams et al.
2007), genetic analysis indicates that
Ocrl and
Inpp5b have overlapping function
in vivo (Bernard and Nussbaum
2010; Janne et al.
1998).
We did find differences in transcription, splicing, and sequence between INPP5B and Inpp5b: (1) The full-length human Inpp5b enzyme lacks 80 amino acids because a GC splice site is used that causes 240 bp of sequence that is evolutionarily conserved with mouse exon 7 to be treated as intron and spliced out; (2) relative to the respective full-length transcripts, there is a reduced amount of a shorter alternative transcript originating from an internal promoter in humans compared to mouse; and (3) exon 8 is more divergent than the rest of the enzyme in the two species.
The reasons for the difference in 5′ splice site utilization between mouse
Inpp5b and human
INPP5B are unknown. Among mammalian splice sites, 98.71% contain canonical GT-AG junctions while only 0.56% has a noncanonical GC-AG splice-site pair (Burset et al.
2001). A GC 5′ splice site is preceded by a G, as it is in the human, 97.6% of the time, and by an A, as it is in C57Bl/6 and most inbred mouse strains, only 1.6% of the time. Of interest, rat (
R. norvegicus) and bovine (
B. taurus)
Inpp5b also splice predominantly at the GC splice site rather than the GT site the mouse uses and both of these mammals have a G immediately preceding the GC splice donor. However, this difference is not sufficient to explain why mouse does not use the internal GC 5′ donor because sequencing of 12 inbred strains of mice revealed that AKR/J mice and their descendants have a G preceding the internal GC splice site and yet still do not use this splice site in liver, brain, and kidney mRNA (data not shown). Thus, the exclusive use of the downstream GT splice site by mouse seems unique to the mouse and not present in human, bovine, and rat species.
Conversely, at the GT 5′ splice site utilized by mice, both humans and mice have the typical CAG immediately preceding the 5′ splice site, both have the canonical GT at positions +1 and +2, both have the highly preferred (82%) G at the +5 position, and neither has the preferred A or G at +3 or the preferred A at +4 (Cartegni et al.
2002). The Analyzer Splice Tool, which calculates scores for 5′ and 3′ splice sites (Shapiro and Senapathy
1987), gives very comparable scores for the mouse GT 5′ splice site and the analogous human site of 79.78 and 73.91, respectively. The human GT 5′ splice site can clearly be functional since it is used to generate the shorter alternative transcript in humans. Conversely, the human GC 5′ splice site and the analogous mouse GC dinucleotide from the AKR/J mice have comparable scores of 69.73 and 68.92, although the score when the GC is preceded by an A, as it is in most inbred mouse strains, is much less. Thus, the immediate contexts around the GT and GC 5′ splice sites are not significantly different between the two species and do not explain the difference in splice-site usage. Furthermore, the 3′ splice sites at the end of intron 7 also have comparable scores of 83.32 and 88.37, respectively, in human and mouse. Finally, intron sizes are large in both species (25,649 in mouse and 40,459 in human) and therefore provide no restraint on the utilization of the GT splice site in humans. Despite the lack of clear sequence differences that would explain the preference for the GC versus the GT 5′ splice site in human
INPP5B, the phenomenon is a
cis and not a
trans effect since human
INPP5B integrated into the mouse genome on a bacterial artificial chromosome in a transgenic mouse (R. L. Nussbaum, unpublished) shows the same overwhelming preference for the GC splice site (Supplementary Fig. 1).
The biochemical and cellular functional consequences of the interspecies differences between the human and mouse orthologs of Inpp5b remain to be elucidated. Nonetheless, they represent a significant interspecies difference in a potential modifying paralog for Ocrl that might explain why Inpp5b and INPP5B differ in their ability to compensate for loss of Ocrl function. These results set the stage for investigating which of the differences are responsible for the disparate phenotype in Ocrl-deficient humans and mice, which may, in turn, provide additional insights into the pathogenesis of Lowe syndrome in humans.