|Home | About | Journals | Submit | Contact Us | Français|
Hereditary fructose intolerance (HFI) is a potentially fatal inherited metabolic disease caused by a deficiency of aldolase B activity in the liver and kidney. Over 40 disease-causing mutations are known in the protein-coding region of ALDOB. Mutations upstream of the protein-coding portion of ALDOB are reported here for the first time. DNA sequence analysis of 61 HFI patients revealed single base mutations in the promoter, intronic enhancer, and the first exon, which is entirely untranslated. One mutation, g.–132G>A, is located within the promoter at an evolutionarily conserved nucleotide within a transcription factor-binding site. A second mutation, IVS1+1G>C, is at the donor splice site of the first exon. In vitro electrophoretic mobility shift assays show a decrease in nuclear extract-protein binding at the g.–132G>A mutant site. The promoter mutation results in decreased transcription using luciferase reporter plasmids. Analysis of cDNA from cells transfected with plasmids harboring the IVS1+1G>C mutation results in aberrant splicing leading to complete retention of the first intron (~ 5 kb). The IVS1+1G>C splicing mutation results in loss of luciferase activity from a reporter plasmid. These novel mutations in ALDOB represent 2% of alleles in American HFI patients, with IVS1+1G>C representing a significantly higher allele frequency (6%) among HFI patients of Hispanic and African-American ethnicity.
Hereditary fructose intolerance (HFI) [MIM 229600] is a rare and potentially fatal inherited metabolic disease resulting from a deficiency of aldolase B (EC 126.96.36.199) activity in the liver and kidneys (Hers and Joassin 1961). Ingested fructose is metabolized in these tissues where aldolase B is required to cleave fructose-1-phosphate (Fru-1-P). HFI is an autosomal recessive disorder (Froesch et al 1963) that affects approximately 1 in 20,000 individuals (Gitzelmann and Baerlocher 1973; James et al 1996), with a carrier frequency estimated at 1 in 50. It is likely that HFI could be more common as many adults are believed to be living undiagnosed in the population (Cox 1988). Symptoms appear in the newborn following weaning when fructose-containing foods are first introduced. Poor feeding, vomiting, and an overall failure to thrive are common presentations, although many ambiguous symptoms have been reported, all of which make diagnosis a challenge. If undiagnosed, the persistent intake of fructose can lead to liver and kidney failure, coma, and death (Morris 1968; Baerlocher et al 1978; Laméire et al 1978; Odièvre et al 1978; Cox 1993; Steinmann et al 2001). An accurate early diagnosis and the exclusion of fructose from the diet are key in alleviating symptoms and preventing morbidity. It is estimated that Americans consume 100 g of this sugar daily (Yudkin et al 1980; Anderson 1982) (http://www.ers.usda.gov/Briefing/Sugar/data.htm#yearbook). For this reason, a diet free from fructose can remain difficult.
Clinical diagnosis of HFI can be made by two methods: either the direct assay of aldolase B activity taken from a liver biopsy of a suspected patient, or by an intravenous (I.V.) fructose challenge (Laméire et al 1978; Steinmann and Gitzelmann 1981). Both of these tests are invasive, carry significant risk, and can cause uncomfortable side effects. Less invasive methods to diagnose HFI include whole-gene sequencing or allele-specific oligonucleotide (ASO) hybridization analysis that requires only a small blood sample (Tolan and Brooks 1992; Coffee et al 2009), and requires knowledge of HFI mutations. Of the known HFI-causing mutations, seven are common and make up 82% of HFI alleles worldwide. Private mutations account for 7%, and over 10% of HFI alleles still remain unknown (Coffee et al 2009). However, in the American HFI population, the percentage of unknown alleles was reported at over 33% (Coffee et al 2009). Furthermore, the percentage of unknown mutations is even higher in non-Caucasian populations, such as Hispanic and African-American populations (nearly 60%). The implementation of a more accurate diagnostic DNA-based genetic screen would benefit from the identification of additional common HFI alleles.
Over 40 mutations have been identified in the aldolase B gene (start of transcription defined by UCSC genome browser as location: chr9:104,198,062) that result in loss of enzyme activity and and HFI (http://www.bu.edu/aldolase/HFI/hfidb/hfidb.html). These include missense mutations, nonsense mutations, splicing mutations, insertions and deletions (ranging from 1–1600 bp). All mutations identified thus far are located in the protein-coding sequence of ALDOB (exons 2–9). It is well documented that mutations in non-coding regions of a gene can result in disease (Crossley and Brownlee 1990; Sakai et al 1991; Koivisto et al 1994; Perez-Tur et al 1995). Stretches of DNA that contain conserved transcription-factor binding sites, such as within the promoter and other regulatory regions, and splice junctions, while not present in the final protein, can nevertheless adversely affect gene transcription and protein biosynthesis.
Recent enumeration of HFI alleles in the American HFI population found that the seven most common mutations in ALDOB account for 65% of HFI alleles, with over a third of the mutations in ALDOB responsible for causing HFI in American patients remain unknown (Coffee et al 2009). The identification of novel mutations in unique regions would help to further define the mutational spectrum of HFI. It was recognized that the non-coding regions of ALDOB, including the promoter, intronic enhancer, and untranslated first exon, had not been investigated as a potential site for HFI-causing mutations. Therefore, DNA sequence analysis of the aforementioned regions was performed in a cohort of clinically diagnosed American HFI patients to uncover novel mutations in 5′-region of ALDOB that underlie the HFI phenotype. Identification and characterization of two novel mutations in these regions that impaired transcription or splicing are reported, with the latter being common among Hispanic and African-American HFI patients.
DNA was isolated from peripheral leukocytes in 38 HFI patients who were diagnosed by low aldolase B activity in liver biopsy samples, diagnostic response to intravenous (I.V.) fructose challenge, the identification of one HFI allele combined with clinical symptoms, and/or patients of Hispanic descent with typical dietary and clinical histories consistent with HFI. A follow-up analysis by ASO hybridization analysis (see below) was performed on leukocyte DNA from an additional group of suspected patients with one or more symptoms suggestive of HFI.
The specific genotypes of six probands are reported. Patient-34 was diagnosed with HFI by liver biopsy. Patient-50 presented at 1.5 years of age and was diagnosed with HFI by liver biopsy. Patient-249 was a female who presented at six months of age with failure to thrive and fructosemia. Patient-278 presented at three years of age with neonatal giant cell hepatitis of unknown etiology, and was diagnosed with HFI by liver biopsy. Patient-284 was a male who presented at five months of age with lactic acidosis, and was diagnosed with HFI by identification of one HFI allele. Patient-295 was a female who presented at seven months with liver failure following Pedialyte treatment for diarrhea and dehydration, and was diagnosed by identification of one HFI allele and successful exclusion of dietary fructose. This study was approved by the Institutional Review Board and informed consent was obtained from all subjects or their guardians.
Plasmids containing DNA inserts with either wild-type or mutant alleles were constructed from genotyped patient genomic DNA as controls for ASO screening. Each DNA insert was amplified by PCR and cloned into TOPO TA (Invitrogen). All control clones were confirmed by DNA sequencing. Amplified genomic DNA from patient samples and plasmid DNA from control clones (100 ng) were denatured and applied to a Zeta-Probe® GT nylon membrane (Bio-Rad). Blots were used for ASO hybridization analysis for A149P, A174D, N334K, R59Op, Δ4E4, A337V, and L256P as described previously (Coffee et al 2009). For new alleles, ASO probes for wild-type (g.–132wt 5′-ATTTTAAGGACTGGTTG-3′ and IVS1+1wt 5′-CCCAAACTGTAAGTAAA-3′) and mutant alleles (g.–132G>A 5′-ATTTTAAGAACTGGTTG-3′ and IVS1+1G>C 5′-CCCAAACTCTAAGTAAA-3′) were used. High stringency washes were performed at the following discriminatory temperatures: g.–132wt at 54 °C, g.–132G>A at 52 °C, IVS1+1wt at 58 °C, and IVS1+1G>C at 56 °C.
Genomic DNA was extracted from leukocytes isolated from whole blood as described previously (Orkin et al 1978). A 982 bp fragment containing the ALDOB proximal promoter and exon 1 was amplified by PCR (Saiki et al 1985) using sense and antisense primers (50 μM) 5′-ACTGCGTACAGACACTATACAAC-3′ and 5′-CATAAGGCAGTAGATATGTA-3′, respectively. PCR was carried out in a buffer containing 16 mM (NH4)2SO4, 67 mM Tris-HCl, pH 8.8, 200 μM dNTPs, 2 mM MgCl2, and 5 U/ml of Taq polymerase. Cycling conditions were as follows: 94 °C for 1.5 min; 60 °C for 1 min; 72 °C for 1 min, repeated for five cycles; 94 °C for 1.5 min, 60 °C for 1 min; 72 °C for 3 min, repeated for 25 cycles; 72 °C for 10 min; hold at 4 °C. A 1.2 kb fragment containing the ALDOB intronic enhancer was amplified similarly by PCR using sense and antisense primers 5′-TAGGATGTAACTTGCAATCC-3′ and 5′-CTGCTCATTGTAGTTGCTCA-3′, respectively. ALDOB exons 2–9 were PCR-amplified as previously described (Coffee et al 2009).
Each PCR product was purified using the NucleoSpin® Extract II silica-membrane columns (Macherey-Nagel) as per manufacturer’s instructions. DNA (0.5 μg) was submitted for sequence determination (≥3 independent times) (GENEWIZ, Inc., South Plainfield, NJ) using several internal primers to ensure overlapping reads along both strands. Sequence results were analyzed using Sequencher® 4.9 software (Gene Codes Corporation).
The pGL3-basic luciferase reporter plasmid (Promega) was used for generation of reporter plasmids for in vivo studies of expression, and the pcDNA3.1(−) mammalian expression plasmid (Invitrogen) was used for studies of splicing. Genomic DNA from non-HFI subjects was used as a template for PCR amplification of a 2,798 bp region from position −264 to +2534. This included the promoter (264 bp), the first exon (72 bp), and part of the first intron (2,462 bp). Primers were designed with 5′ extensions containing Xma I restriction sites for cloning into pGL3-basic at an Xma I site upstream of the luciferase open reading frame. Additionally, a 96-bp fragment from position +4795 to +4890 containing 87 bp upstream of exon 2 and the first nine bases of exon 2 was PCR-amplified from wild type genomic DNA. These primers were designed with 5′ extensions containing Bgl II restriction sites for cloning into pGL3-basic at a Bgl II site between the Xma I site and the luciferase open reading frame. This luciferase reporter plasmid, pProm2, contained a total of 2,893 bp of wild type ALDOB sequence.
Wild type ALDOB sequence was subcloned into the EcoR I, EcoR V, and BamH I sites of pcDNA3.1(−) mammalian expression vector (Invitrogen) for analysis of the splicing mutation. A 1.4 kb EcoR V/EcoR I fragment was isolated from pEE313 (Tolan and Penhoet 1986). A 3.5 kb EcoR I/BamH I fragment was isolated from pBE313 (Tolan and Penhoet 1986). Both fragments were cloned adjacently into pcDNA3.1(−) resulting in the wild type plasmid, pSplice_wt. The point mutations at g. 132 and IVS1+1 were introduced into pProm2 and pSplice_wt, respectively, using the QuikChange® Site-Directed Mutagenesis Kit (Stratagene) with primers containing 30–34 complementary bases producing p–132G>A and pSplice_mut, respectively.
The luciferase gene was PCR-amplified from pGL3-basic using a forward primer with a 5′ extension containing a BamH I restriction site enabling cloning in frame with the ALDOB translation start codon and a reverse primer complementary to pGL3-basic sequence downstream of a BamH I restriction site in the vector. The 1.8 kb BamH I fragment was cloned into pSplice_mut creating the plasmid pMut_Luc. Sub-cloning a fragment containing the wild-type splice sequence of exon 1 from pSplice_wt into pMut_Luc created the plasmid pWt_Luc. The correct construction of all plasmids was confirmed by DNA sequencing.
Human kidney-derived A293 cells were grown in DMEM (Invitrogen) supplemented with 10% fetal bovine serum (FBS) (Invitrogen), 100 U (60 μg) of penicillin, and 100 μg of streptomycin (Invitrogen) at 37 °C with 5% CO2. Cells were seeded in 6-well tissue culture plates and transfected in triplicate when they reached approximately 30% confluence. The media was replaced and DNA was mixed with polyethylenimine (PEI) (1 μg/μl, pH 7.2) (Polysciences Inc, Warrington, PA) at a ratio of 1μg DNA:3μl PEI and added to an aliquot of serum-free media. This DNA mixture was incubated at room temperature for 15 min and then added to the cells. Cells were harvested after 42–48 h.
Luciferase activity in transfected A293 cells was measured using the Luciferase Assay System (Promega). Briefly, media was removed and cells were rinsed with phosphate-buffered saline. Reporter Lysis Buffer (Promega) was added to each well and cells were gently scraped and collected. Cells were lysed by freeze thaw, vortexed briefly, and centrifuged for 3 min at 20,800 × g at 4 °C. Supernatant fractions were collected and transferred to a new tube. Luciferase activity was determined by adding 100 μl of Luciferase Reagent (Promega) to 20 μl of cell extract and luminescence was measured for 10 s in a 20/20n Luminometer (Turner BioSystems). Transfection efficiency was measured by adding 20 μl of cell extract to a final volume of 135 μl in a reaction containing 122.5 mM sodium phosphate, pH 7.5, 3 mM O-nitrophenyl-β-D-galactopyranoside, 1.1 mM MgCl2, and 53 mM β-mercaptoethanol and incubated at 37 °C until reactions became yellow. The reactions were stopped by the addition of Na2CO3 to 644 mM and absorbance values were measured at 415 nm. Luciferase values were normalized to transfection levels determined by β galactosidase expression. Error calculated by either standard error of the mean (SEM) from two to four trials as noted. Unpaired Student’s t-test was performed to determine p values.
Human liver-derived HepG2 cells were grown to confluency and cells were collected and resuspended in 0.4 ml of Buffer E (10 mM HEPES, pH 7.9, 10 mM KCl, 0.1 mM EDTA, 0.1 mM EGTA, 1 mM DTT, 1 mM PMSF, and 1 μg/ml pepstatin A). After 10 min at 4 °C, 5% (v/v) NP-40 was added to 0.6%, cells were vortexed briefly, incubated at 4 °C for 15 min, and centrifuged for 30 s at 20,800 × g. The cytoplasm-containing supernatant fraction was removed and the nuclear pellet was resuspended in 0.1 ml of Buffer E and centrifugation was repeated. The pellet was resuspended in 0.1 ml of 20 mM HEPES, pH 7.9, 400 mM NaCl, 1 mM EDTA, 1 mM EGTA, 1 mM DTT, 1 mM PMSF, and 1 μg/ml pepstatin A. Nuclei were vortexed and incubated for 45 min with gentle rocking at 4 °C. The sample was centrifuged for 10 min at 20,800 × g at 4 °C. The supernatant fraction was transferred to a new tube, supplemented with glycerol to a final concentration of 20% (v/v), aliquoted, and stored at −80 °C.
An expression plasmid that contained human C/EBPα cloned into pcDNA3 (Invitrogen) was used for in vitro translation (kind gift from Dr. Geoffrey Cooper). The TNT® Coupled Wheat Germ Extract System (Promega) was used with T7 RNA polymerase to transcribe and translate C/EBP in vitro.
DNA fragments of 24 bp corresponding to the ALDOB promoter from position −144 to −121 relative to the start of transcription were synthesized (Eurofins MWG Operon, Huntsville, AL) with either wild-type sequence or the point mutant at position −132. Complementary oligonucleotides (1 nmol) were annealed by incubating at 95 °C for 10 min and allowed to cool overnight in 10 mM Tris, pH 7.6, 10 mM NaCl, 2 mM MgCl2, and 0.1 mM EDTA. Annealed probes were radioactively labeled at the 5′-end by incubation with γ-[32P ]-ATP and T4 polynucleotide kinase (New England Biolabs) at 37 °C and purified from unincorporated ATP by a Bio-Gel® P-6 DG (Bio-Rad) column. DNA-binding reactions contained 5 μg of nuclear extract, 1 μg poly dI:dC (Sigma), 1 μg BSA (NEB), and 2 × 106 cpm of radiolabelled ALDOB promoter probes in 50 mM Tris, pH 7.4, 250 mM NaCl, 5 mM DTT, 5 mM EDTA, and 20% (w/v) glycerol. Samples were incubated at room temperature for 40 min. DNA-protein complexes were resolved on 5% polyacrylamide gels in 25 mM Tris, 0.19 M glycine, 1 mM EDTA, and 2.5% (v/v) glycerol run at 25 mA at room temperature. Gels were dried for 60 min at 80 °C prior to autoradiography. Relative binding was quantified using ImageJ software (Abramoff et al 2004).
Total RNA was extracted using TRIzol® Reagent (Invitrogen). RNA (5 μg) was incubated with 2 nM poly(T) primer at 70 °C for 5 min and immediately cooled on ice. M-MLV reverse transcriptase (200 U) (Promega), dNTPs (10 mM), and RNasin® (25 U) (Promega) were incubated in 5x M-MLV reaction buffer (Promega) for 2 h at 42 °C. PCR was performed with 5 μl of this cDNA reaction using the T7 primer (5′-TAATACGACTCACTATAGGG-3′), a reverse primer (5′-TCAGCGGTTTAAACTTAAGC-3′), and 2 mM MgCl2 under the following cycling conditions: 94 °C for 4 min; (94 °C for 30 sec; 60 °C for 30 sec; 72 °C for 2 min) repeat for thirty cycles; hold at 4 °C. PCR products were visualized on agarose gels stained with ethidium bromide.
DNA was separated on 1.5% TAE agarose gel and transferred to a nylon membrane under conditions previously described (Tolan and Penhoet 1986). The membrane was incubated in a buffer containing 5x SSC, 20 mM NaH2PO4, pH 7.2, 1 mM EDTA, 7% (w/v) SDS, and 10X Denhardt’s (Ficoll:polyvinylpyrrolidone:BSA (2g each/L)) for 1 h at 37 °C in a rotating hybridization oven. A radiolabeled probe (splice junction 5′-TCCCAAACTATGGCCCAC-3′, exon 1 5′-ACTCTTCTCTCCCAAACT -3′, or exon 2 5′-GGCAATTTCTGAAGAGCT-3′) was added and hybridization proceeded overnight at 37 °C. Blots probed with oligonucleotides for the splice junction, exon 1, and exon 2 were washed with 5x SSC and 0.1% (w/v) SDS at 37 °C, 52°C, and 45 °C, respectively, and autoradiography was performed.
Recent enumeration of HFI alleles in 153 American patients with 268 independent alleles using ASO hybridization showed that seven known mutations (A149P, A174D, N334K, A337V, Δ4E4, R59Op, and L256P) found in exons 5, 9, 4, 3, and 7, respectively, accounted for 65% of the HFI alleles in this group (Coffee et al 2009). Following analysis for these seven mutations, 57 patients remained incompletely genotyped. A 38-patient subset harboring 58 unknown alleles were investigated for mutations at the 5′ end of ALDOB. These patients were diagnosed with HFI by low aldolase B activity in liver biopsy samples, diagnostic response to I.V. fructose challenge, the identification of one HFI allele combined with clinical symptoms, and/or patients of Hispanic descent with dietary histories suggestive of HFI.
Identification of novel and perhaps common mutations in this cohort of patients was accomplished by DNA sequence analysis of the promoter, the first exon, and the intronic enhancer of ALDOB (all non protein-coding regions). Sequence determination of a 981 bp fragment containing both the ALDOB proximal promoter and first exon revealed three mutations (Figure 1). In the promoter, one mutation was identified in a Patient-50 at an evolutionarily conserved site (Berardini et al 1999). The mutation, a G>A transition at position −132 relative to the start of transcription (Berardini et al 1999) was termed g.–132G>A. The location of this mutation was within a protein-binding site identified by DNase footprinting analysis (Tsutsumi et al 1989; Raymondjean et al 1991). A second mutation, a T>A transversion at position − 129 in the promoter was identified in six patients (not shown in Figure 1). This mutation was the same as a known single nucleotide polymorphism (SNP) found in 18% of Caucasian alleles (SNP rs12337537) (http://www.ncbi.nlm.nih.gov/SNP/). A third mutation was identified in two patients (Patients-278 and -295) at the first nucleotide of intron 1 (the donor splice site of exon 1). This mutation was termed IVS1+1G>C. The relative locations of the two potential HFI mutations are shown in Figure 2. A second PCR-amplified product from each of the three patients was cloned and DNA sequencing confirmed the variant.
Analysis of the enhancer sequence in the first intron revealed four mutations at positions IVS1+214, IVS1+1873, IVS1+1887, and IVS1+2379. Three of these were among known SNPs in this region; IVS1+214 (SNP id: rs515313) found in 20–50% of alleles, IVS1+1887 (rs970385) found in 40% of alleles, and IVS1+2379 (rs285470) found in 40–50% of alleles. These nucleotide changes were not characterized further.
Having no definitive clinical diagnosis, but strong indications provided by dietary history, including improved health after the successful exclusion of dietary fructose, 23 additional suspected HFI patients were screened by ASO hybridization analysis for the two novel promoter-region mutations, g.–132G>A and IVS1+1G>C. A 411 bp DNA fragment containing the promoter and first exon was amplified from genomic DNA. Control clones were generated containing fragments of both wild type (g.–132 and IVS1+1) and mutant (g.–132G>A and IVS1+1G>C) sequence amplified from genomic DNA from Patients-50 and -295. The ASO screen identified a third patient with the g.-132G>A promoter mutation (Patient-34) and two additional patients with the IVS1+1G>C splice-site mutation (Patients-249 and -284) (data not shown). The six HFI probands identified with these mutations in the promoter and first exon including method of diagnosis, ethnic background, and the genotypes are listed in Table 1. Further sequence analysis of all protein-coding regions in the six patients did not uncover any additional mutations. Therefore, the classification of g.–132G>A and IVS1+1G>C as HFI-causing alleles was supported by the genetic evidence. To determine whether either of the mutations were common polymorphisms found in the general population, an ASO screen was performed on non-HFI affected samples from various ethnic populations (15 Caucasians, 12 African-Americans, 12 Asians, and 12 Hispanics). The results showed that none of these 51 non-HFI affected individuals were positive for these mutations, indicating they are not common polymorphisms found in the general population and further supporting their classification as HFI mutations (data not shown).
DNase footprinting analysis using rat liver nuclear extracts identified a protein binding site at −162 to −142 (Tsutsumi et al 1989; Raymondjean et al 1991). Based on alignment with rat DNA sequence (Berardini et al 1999), the human nucleotide at position −132 lies within this binding site. The location of the g.–132G>A mutation at a conserved site in the ALDOB promoter suggests a role for this sequence in transcription factor binding (Figure 2). A SNP has been identified at position −129, at an evolutionarily conserved site three nucleotides downstream of −132 within the same protein-binding site(s). This SNP has no known clinical significance. It was necessary, therefore, to establish what clinical significance, if any, a single nucleotide change at position −132 would have on transcription. The effect of the G>A transition at position −132 on protein binding ability was measured by electrophoretic mobility shift assay (EMSA) using HepG2 and A293 nuclear extracts. Wild-type and mutant probes were radiolableled for EMSA analysis. The mutant probe showed a decrease in protein binding as compared to wild type using nuclear extracts from both HepG2 and A293 cells (Figure 3). The g.–132G>A mutation resulted in an approximate 3-fold decrease (2.8±1.1). This result was confirmed by competition experiments (Figure 3) where approximately 2 to 3-fold more cold mutant competitor was needed to show similar binding. While it has been reported that position −162 to −142 in the rat ALDOB promoter (which shares 72% sequence identity to human ALDOB promoter −146 to −124 (Berardini et al 1999)) is a C/EBP binding site (Raymondjean et al 1991), in vitro translated C/EBP failed to produce binding to either wild type or mutant probes (data not shown).
Given the modest 2-fold effect on nuclear factor binding, the promoter mutant, g.–132G>A, was further assessed for its effect on transcription using a luciferase reporter assay. A reporter plasmid, pProm2, was made that contained 2,798 bp of ALDOB sequence from −264 to +2534. The G>A point mutant at position −132 was introduced by site-directed mutagenesis. Human kidney-derived A293 cells, which are more efficiently transfected than HepG2 cells (data not shown) were transfected with 1 μg of either wild type ALDOB promoter plasmid (pProm2) or mutant ALDOB promoter plasmid (p-132G>A), plus 1 μg of a β-galactosidase transfection control plasmid (pRSV-β-gal). Kidney cells express aldolase B at the same levels as liver (Funari et al 2010). Luciferase activity was measured and normalized to the transfection efficiencies determined by a β-galactosidase assay. As seen in Table 2, wild type ALDOB promoter sequence showed an approximate 3-fold increase in expression of luciferase as compared to a negative control, which is consistent with previous studies (Gregori et al 1991). The g.–132G>A mutation caused a significant (p<0.0001) decrease in luciferase expression, which was not significantly different from the negative control (p=0.3). These studies confirmed that g.–132G>A leads to loss of aldolase B activity due to a loss of transcription from the ALDOB promoter.
Mutations occurring at both donor and acceptor splice sites in ALDOB have been found in HFI patients (Cross and Cox 1990; Brooks et al 1991; Ali et al 1994; Ali et al 1996; Esposito et al 2004; Santer et al 2005). While these studies hypothesize that these mutations affect ALDOB expression at the mRNA level, demonstration of the splicing defect is rarely reported. However, because exon 1 is entirely 5′-UTR, knowing the effect of splicing mutations is particularly important. While exon 1 is not included in the final protein, correct splicing at this site is nevertheless required to remove the large first intron (4.8 kb) such that the correct translational start codon in exon 2 is used. In the absence of a wild type consensus donor splice site at the end of exon 1, two consequences could result. One outcome is the use of a downstream cryptic splice site in intron 1 that splices to exon 2, inserting a sequence from intron 1 into the 5′-UTR. If the insert did not contain an ATG sequence, the translational start codon in exon 2 would be used with no effect on ALDOB expression. If the insert contained a start codon that falls in frame with the aldolase B open reading frame (ORF), and in the absence of a premature stop codon, extra amino acids would be added at the amino-terminal end of the protein, likely not disrupting function (Doyle and Tolan 1995; Malay et al 2002). However, if the insert had a start codon that fell out of frame with the ORF, a nonfunctional protein would result. The second possibility is in lieu of splicing at a cryptic donor site, complete retention of the entire first intron (adding a 4,800 bp insertion in the 5′-UTR) could occur. This would result in no aldolase B expression unless there were use of an internal ribosome binding site. Multiple potential ATG start codons lie within this 4.8 kb sequence, and all fall in frame with downstream premature stop codons. Translation from the first ATG in this mRNA would result in a short polypeptide, and would likely induce nonsense-mediated mRNA decay (McGlincy and Smith 2008; Silva and Romao 2009). With multiple possible effects on the protein, ranging from negligible to deleterious, the determination of the effect the IVS1+1G>C mutation has on the transcript was imperative.
An expression plasmid that contained 4.9 kb of ALDOB DNA sequence from position +42 to +4940 was constructed, pSplice_wt, by subcloning from plasmids containing wild type DNA sequence. The IVS1+1G>C point mutant was introduced by site-directed mutagenesis generating the plasmid pSplice_mut. The effect of the IVS1+1G>C mutation on splicing was investigated by transient transfection of A293 cells with wild type and mutant plasmids, followed by extraction of total RNA, and cDNA synthesis. PCR amplification of cDNA from cells transfected with pSplice_wt produced a transcript indicative of correct splicing (248 bp), while cells transfected with pSplice_mut produced a transcript of greater than 5 kb, indicating complete retention of the first intron (Figure 4). This mutant product was similar in size to a non-transfected unspliced control plasmid.
Further investigation of the transcripts made from these ALDOB plasmids utilized Southern blot hybridization using radiolabeled oligonucleotides that spanned the splice junction between exon 1 and exon 2. Only PCR-amplified cDNA from cells transfected with pSplice_wt annealed to this probe, indicating normal splicing took place. Moreover, normal splicing did not take place from the pSplice_mut (Figure 4). However, radiolabeled oligonucleotides specific to either exon 1 or exon 2 showed that the large PCR-amplified cDNA of pSplice_mut transfected cells was specific to ALDOB (Figure 4). Both probes hybridized specifically to this >5.0 kb fragment, consistent with the retention of the entire first intron. Additional smaller PCR-amplified products from cells transfected with pSplice_wt hybridized to both the exon 1 and exon 2 probes indicating the presence of smaller alternatively spliced transcripts. These transcripts have been observed in various rare ESTs (eg., Accession number BJ994276).
While RT-PCR and Southern hybridization indicated that the IVS1+1G>C splice site mutation causes retention of the first intron, it was necessary to discern whether it was still possible that this transcript could be used for translation resulting in production of aldolase B via an internal ribosome entry site (Lopez-Lastra et al 2005). The luciferase gene was cloned downstream of ALDOB and in frame with the ALDOB ATG translation initiation codon in both pSplice_wt and pSplice_mut, generating the plasmids pWt_Luc and pMut_Luc, respectively. Human kidney-derived A293 cells were transfected with 1 μg of pWt_Luc or pMut_Luc and 1 μg of pRSV-β-gal, and luciferase activity was measured and normalized as described above. Unlike the previous wild type ALDOB plasmid (pProm2), pWt_Luc is under the control of a CMV promoter to maximize expression. As seen in Table 3, luciferase expression from pWt_Luc indicated normal splicing followed by successful translation of the protein occurred. On the contrary, pMut_Luc containing the IVS1+1G>C mutant splice site showed luciferase expression that was not significantly different than the negative control. These studies confirmed that the IVS1+1G>C mutation at the exon 1 donor splice site leads to loss of ALDOB expression due to intron retention.
Two novel mutations in the non-coding sequence of ALDOB in HFI patients were identified and characterized to show that each leads to loss of ALDOB expression. While numerous ALDOB mutations have been identified throughout the protein-coding sequence, this is the first description of mutations occurring in the promoter region that ultimately lead to a loss of aldolase B activity.
Significant work has been performed towards uncovering the transcriptional regulation of the aldolase B gene. The ALDOB proximal promoter is approximately 200 bp in length and contains binding sites for a number of ubiquitous and tissue-specific transcription factors that are required for transcription in hepatocytes and renal cells. In addition to the promoter, transcription of ALDOB requires an enhancer located within the first intron (Gregori et al 1998). Studies using the ALDOB-promoter constructs have shown that it is tissue-specific, but a relatively weak promoter (Gregori et al 1991). The cis-elements and their trans-factors involved in ALDOB regulation have been studied in rat, but remain ill-defined. Binding sites for NFY, HNF1, and HNF3 (see Figure 2) were identified by EMSA analysis and transient transfection assays (Raymondjean et al 1991; Gregori et al 1993; 1994; Vallet et al 1995). The least well-defined transcription factor binding site is within the promoter at position −144 to −127; the region containing the HFI mutation. While DNase footprinting analysis clearly indicated protein(s) binding to this sequence, those factor(s) binding have not been clearly identified. Previous reports have suggested that this region from −144 to −127 could encompass the binding site for C/EBP, DBP, or A1F-C (Gregori et al 1991; Yabuki et al 1993; Berardini et al 1999), but clear demonstration that any of these transcription factors binding to this site is lacking. Sequence alignment of the ALDOB promoter between rat and human showed that this region (−144 to −127) had less conservation (72% identity) compared to other cis-elements (−120 to −111;89% identity and −110 to −85;92% identity), which bind to NFY and HNF1/3, respectively (Berardini et al 1999). Nevertheless, this new promoter mutation at g.–132 occurred at a conserved and clearly important position.
While mutations in promoters of other genes at known transcription-factor binding sites have been identified that result in a disease phenotype (Crossley and Brownlee 1990; Sakai et al 1991; Koivisto et al 1994; Park et al 2009), the ALDOB promoter has not previously been investigated. The novel g.–132G>A mutation causes a decrease in expression from a reporter plasmid likely due to a disrupted transcription factor binding site as determined by EMSA analysis.
While the first exon of ALDOB is not translated, it still plays a role in proper transcript formation. Splice sites at both the donor and acceptor position are highly conserved, and mutations in these consensus sequences can alter splicing and cause disease (Faustino and Cooper 2003). One report estimates that of all point mutants responsible for causing disease, 15% of them are splicing mutations (Krawczak et al 1992). When a consensus splice site is mutated, one possible consequence is that an intron is not appropriately excised and the sequence can remain in the final mRNA transcript causing insertion of amino acids. Besides the addition of unwanted amino acids, another result of intron retention is the induction of nonsense-mediated mRNA decay (McGlincy and Smith, 2008; Silva and Romao, 2009). The machinery can detect when shorter than normal transcripts are being produced, recognize the insertion of a premature stop codon, and can target the mRNA for degradation ensuring no protein is synthesized. It is possible that retention of the first intron caused by IVS1+1G>C induces nonsense-mediated mRNA decay in hepatocytes and renal cells, but without analyzing these tissues directly from patients, this can not be confirmed. Cells transfected with the mutant plasmid retained the >5.0 kb transcript and did not show signs of degradation, although the plasmid was overexpressed.
One important observation arising from this study was the correlation of the ethnic background of the families carrying g.–132G>A or IVS1+1G>C. While g.–132G>A was found in families of Northern European descent, IVS1+1G>C was more common in patients of Hispanic descent. Table 4 compares the frequency of the IVS1+1G>C allele relative to other HFI alleles among Hispanics with HFI and relative to those in all American HFI patients. While this mutation represents 1.5% of all American alleles, there was a significantly (p<0.01) increased frequency among Hispanics of nearly 6%, making it one of the top HFI causing alleles in this population. This would be the first allele identified that could be common to the African-American racial and Hispanic ethnic groups and will be useful for screening in this population.
The authors would like to thank Marilyn McCorn-St. Fleur, Michelle Garcia, Rachel Gibbons, and Taliesin Lenhart for their contributions in performing DNA sequence analysis. We’d like to thank Dr. Harvey Levy for critical reading of the manuscript. This work was supported in part by National Institutes of Health grant DK-065089 (to DRT).