|Home | About | Journals | Submit | Contact Us | Français|
We have followed-up on the recent genome-wide association (GWA) of the clusterin gene (CLU) with increased risk for Alzheimer disease (AD), by performing an unbiased resequencing of all CLU coding exons and regulatory regions in an extended Flanders-Belgian cohort of Caucasian AD patients and control individuals (n = 1930). Moreover, we have replicated genetic findings by targeted resequencing in independent Caucasian cohorts of French (n = 2182) and Canadian (n = 573) origin and by performing meta-analysis combining our data with previous genetic CLU screenings.
In the Flanders-Belgian cohort, we identified significant clustering in exons 5-8 of rare genetic variations leading to non-synonymous substitutions and a 9-bp insertion/deletion affecting the CLU β-chain (p = 0.02). Replicating this observation by targeted resequencing of CLU exons 5-8 in 2 independent Caucasian cohorts of French and Canadian origin identified identical as well as novel non-synonymous substitutions and small insertion/deletions. A meta-analysis, combining the datasets of the 3 cohorts with published CLU sequencing data, confirmed that rare coding variations in the CLU β-chain were significantly enriched in AD patients (ORMH = 1.96 [95% CI = 1.18-3.25]; p = 0.009). Single nucleotide polymorphisms (SNPs) association analysis indicated the common AD risk association (GWA SNP rs11136000, p = 0.013) in the 3 combined datasets could not be explained by the presence of the rare coding variations we identified. Further, high-density SNP mapping in the CLU locus mapped the common association signal to a more 5' CLU region.
We identified a new genetic risk association of AD with rare coding CLU variations that is independent of the 5' common association signal identified in the GWA studies. At this stage the role of these coding variations and their likely effect on the β-chain domain and CLU protein functioning remains unclear and requires further studies.
Genome-wide association (GWA) studies lead to long-awaited breakthroughs in the genetics of late-onset Alzheimer disease (AD) [MIM 104300]  by providing conclusive genetic association evidence for novel AD risk genes [2-5]. Notably, GWA significance with similar effect sizes was reached for the top single nucleotide polymorphism (SNP) rs11136000 in the clusterin gene (CLU) [MIM 185430] . The CLU protein (also known as apolipoprotein J) is a multifunctional protein showing functional similarities with the major apolipoprotein of the brain, apolipoprotein E (APOE) . In relation to AD, CLU expression is increased in pyramidal neurons and astrocytes of the hippocampus and entorhinal cortex, the most severely affected brain regions in AD . CLU is present in senile plaques , binds to Aβ and is involved in Aβ42 clearance across the blood brain barrier . Moreover, CLU enhances endocytosis of Aβ aggregates to brain phagocytes . Taking its variety of physiological functions, CLU could be a guardian or enemy in AD .
The CLU transcriptional unit is located in the chromosomal region 8p21-p12 and comprises 9 exons in the longest transcript that translates in the main CLU protein isoform of a 449 amino-acid residues. The CLU precursor peptide is internally cleaved to produce an α- and β-subunit, held together by disulphide bridges and is subsequently secreted from the cell (Figure (Figure11).
In this follow-up study aiming at identifying the genetic variant underlying the CLU association with increased AD risk, we examined the CLU genetic variability using a resequencing approach including all coding exons and regulatory regions in a well-documented Flanders-Belgian patient/control cohort (n = 1930 subjects) [2,3]. Significant genetic findings were verified by targeted resequencing in two independently ascertained French  and Canadian  AD cohorts and by meta-analyses of the genetic data sets generated in this study with published data sets obtained of previous genetic screenings of CLU in AD cohorts [14,15].
The stage I resequencing of the coding exons and the regulatory regions of CLU in patients and control individuals of the Flanders-Belgian AD cohort (n = 1930) (Table (Table1),1), identified in total 19 rare to intermediate rare non-synonymous single nucleotide variations predicting an amino acid substitution in the CLU protein of which only 5 had been reported earlier [14,15] (Table (Table2).2). Further, we detected an in-frame 9-bp deletion predicting a 3 amino acid deletion p.T445_D447del. Fourteen of the 19 non-synonymous substitutions occurred in 31 AD patients (n = 849, 3.6%), of which 8 appeared only in patients (n = 11), and 6 in patients (n = 20) and control individuals (n = 20). One AD patient carried 2 non-synonymous substitutions p.R338W and p.T345M, and one healthy individual carried both p.S16R and p.R234H. All 3 AD patients with p.T445_D447del carried also p.A309T. The remaining 5 non-synonymous substitutions occurred only in control individuals (n = 5), adding up to 25 control carriers (n = 659, 3.8%) (Table (Table22 Figure Figure11).
Seven of the 14 non-synonymous substitutions observed in patients were predicted to have a possible or probable harmful effect on CLU protein structure and function using physical and comparative considerations embedded in the PolyPhen software (Table ](Table2).2). SIFT software [17,18] assessing whether the amino acid substitutions had an effect on CLU protein function based on evolutionary conservation identified 3 non-tolerated substitutions (p.R338W, p.S16R and p.H37Q) (Table (Table2)2) (Figure (Figure1).1). For 4 patients carrying a rare non-synonymous CLU variant, we obtained positive disease history data. In their pedigree, we identified only one other affected first degree relative which suggested that these rare variants have an intermediate disease penetrance.
In contrast, all 5 non-synonymous substitutions detected only in control individuals were labeled by PolyPhen software as benign and were tolerated according to SIFT software (Table (Table22 Figure Figure11).
Genetic variations in regulatory elements were also detected in AD patients but none of these genetic variants were located within conserved transcription factor binding sites (See table part D in Additional file 1). In addition, no CLU variants were found affecting splicing using FSPLICE, Netgene2 and SPL prediction programs (See table part E in Additional file 1).
Though overall the percentages of CLU rare variant carriers among AD patients and control individuals was similar, visible inspection of the distribution of the non-synonymous substitutions in AD patients suggested clustering in exons 5-8 which for the most part are coding for the CLU β-chain domain (Table (Table2)2) (Figure (Figure1).1). Statistical analysis confirmed significant clustering in exons 5-8 with 3.8 times more carriers of non-synonymous substitutions or a deletion mutation among patients than control individuals (p = 0.02). To confirm this observation, we used targeted resequencing of exons 5-8 in two independent replication AD cohorts of French and Canadian origin (n = 2755). Again we identified non-synonymous CLU substitutions of which p.R338W, p.N369H and insertion/deletion p.T445_D447del that were already observed in Flanders-Belgian patients. Also, in the Lille AD cohort one 1 bp-insertion predicted a frameshift and premature stop codon - p.I303NfsX13 was found in one AD patient (Table (Table3)3) (Figure (Figure11).
Allele sharing among p.T445_D447del carriers i.e. 3 Flanders-Belgian, 3 French, 1 Canadian patients and offspring of two Belgian patients was examined by genotyping short tandem repeat markers located in a 1.3 Mb region of the CLU locus (See table in Additional file 2). DNA available of two children (non-affected, inclusion ages < 50 years) allowed to reconstruct phased haplotypes. No other family members were available. The 3 Flanders-Belgian patients shared alleles at 10 neighboring markers constituting a haplotype of 1.3 Mb around CLU. Seven of these consecutive markers were shared by 2 French patients. A third French patient and a Canadian patient carrier shared 3 markers of this haplotype near CLU. Of note, only Flanders-Belgian patients also carried non-synonymous variant p.A309T.
All non-synonymous substitutions and the deletion/insertion mutations observed in the 3 AD cohorts affecting the CLU β-chain domain were carried forward in a meta-analysis (Table (Table4).4). Similarly, we included in the meta-analysis previously reported non-synonymous substitutions that were observed in Portuguese, UK and US Caucasian AD cohorts [14,15]. Combining the information of in total 6724 patient and 4820 control chromosomes, the meta-analysis confirmed that non-synonymous substitutions and insertion/deletions in exons 5-8 were significantly more enriched in AD patients compared to healthy control individuals (ORMH 1.96 [95% CI 1.18-3.25]; pMH = 0.009, pwoolf = 0.551) (Table (Table44).
To assess whether the mutation frequencies of rare coding variants differed by ethnicity, we also resequenced exons 5-8 in a Caribbean Hispanic AD patient/control cohort (Table (Table5).5). This identified 2 novel benign non-synonymous substitutions affecting the CLU β-chain domain - p.E431Q and p.D331N. In addition, 3 non-synonymous substitutions found in Belgian, French and Canadian individuals were more frequently observed in Caribbean Hispanics - p.N369H, p.D380N and p.S448L (Table (Table5)5) (Figure (Figure11).
We analyzed association in the Flanders-Belgian AD cohort with the GWA top SNP rs11136000 and 14 other SNPs in the CLU locus, aiming at replicating the common AD risk association with CLU as well as fine-mapping the common association signal. We observed significant allelic association with rs11136000 (p = 0.004) and 4 other SNPs (p < 0.05) (Table (Table6a).6a). Stratification by APOE ε4 genotype indicated that allelic associations were only significant in the APOE ε4 + stratum (See table in Additional file 3). All five SNPs are positioned within the same LD block (D' between 0.12 and 1.00, r2 between 0.00 and 0.96) (See figure in Additional file 4 and table in Additional file 5). Conditional logistic regression analysis showed that the 4 associated SNPs were not independent from GWA top SNP rs11136000 (data not shown).
Next, we genotyped the 5 associated SNPs in the Lille and Toronto AD cohorts and calculated allelic association with AD risk (Table (Table6a).6a). Nominal significance was observed in the Lille AD cohort with 4 out of 5 SNPs and in the APOE ε4+ stratum of the Lille AD cohort with rs9331908 (p = 0.024) (See table in Additional file 6). In the Toronto cohort association was only significant in the APOE ε4+ stratum with rs867230 (p = 0.024) and rs7982 (p = 0.023) (See table in Additional file 6).
Meta-analysis combining data from all 3 AD cohorts, confirmed allelic association with GWA SNP rs11136000 (p = 0.013), but showed the strongest association after Bonferroni correction with rs1532278 (p = 0.005) (Table (Table6b,6b, figure in Additional file 7).
Next, we investigated whether the significantly increased presence of rare non-synonymous substitutions and deletion/insertions in AD patients could have driven the common association of CLU with AD risk. Hereto, we recalculated the allelic association in the meta-analysis after excluding the 58 carriers from the Flanders-Belgian (26), the Lille (24) and the Toronto (8) AD cohorts. Allelic associations with the 5 common SNPs remained significant (p-values < 0.05), indicating that the associations with the common SNPs and the rare coding variations represent two independent observations (data not shown).
We used an extensive resequencing approach to follow-up on the significant association of common SNPs in the CLU locus with increased risk for AD in 2 independent GWA studies. In the discovery stage, we used unbiased resequencing of all coding and regulatory regions of CLU in a Flanders-Belgian AD cohort followed by targeted sequencing of 2 independent replication cohorts of French and Canadian origin. In the Flanders-Belgian AD cohort, we obtained significant evidence for clustering of rare coding variants in exons 5 to 8, predicting 3 possible (p.T345M, p.N369H, p.T440M) and one probable (p.R338W) damaging non-synonymous substitutions and a 9-bp insertion/deletion p.T445_D447del affecting the CLU β-chain domain. Interestingly, 4 AD patients carried multiple CLU variations: a patient with a positive family history and onset age of 68 years harbored 2 predicted pathogenic substitutions (p.R338W, p.T345M) and three patients carried the deletion p.T445_D447del together with the benign variant p.A309T. Targeted resequencing in the Lille and Toronto AD cohorts identified predicted pathogenic substitutions (p.R338W and p.N369H) in French AD patients and the p.T445_D447del deletion in French and Canadian AD patients. Noteworthy, we observed haplotype sharing with 7 markers around the insertion/deletion site in 3 Belgian and 2 French AD patient carriers which is suggestive for a common ancestor. Partial haplotype sharing was further observed with another French and Canadian patient.
Moreover, we identified a 1-bp insertion/deletion variation p.I303NfsX13, predicting a premature stop codon after 315 amino acid residues (24 of which in β-chain), in a French patient (onset age 68 years). This frame-shift mutation would most likely produce an unstable transcript that is degraded by the nonsense mediated mRNA decay control system or an unstable C-truncated protein that is degraded within the cell, however, no patient material was available for further testing. Substitutions p.R338W and p.T345M are positioned inside the disulphide rich region which is involved in the folding of the α and β-domains into a heterodimer complex. Variant p.N369H deletes a CLU N-glycosylation signal , and since CLU is highly glycosylated (20-25% of its total mass comprises carbohydrates), this substitution might affect CLU processing or functioning in ligand interaction. Although absent from all Belgian and French control individuals (> 2500 control chromosomes), p.N369H was present in one Canadian control person aged 58 years, which is suggestive of an intermediate penetrance for this substitution. We also observed 3 non-synonymous substitutions with possible damaging effects (α-chain and β-chain) in both Flanders-Belgian patients and control individuals. As these variants were found in 1 to 4 control individuals only and the control group had on average a younger inclusion age than the patient group, these substitutions might represent risk factors of intermediate disease penetrance and might still contribute to AD risk. In contrast, all non-synonymous substitutions observed in control individuals were predicted to be benign and to unlikely affect CLU protein functioning, strengthening our observation that rare non-synonymous substitutions might contribute to AD risk.
Meta-analysis including all rare coding variants affecting the CLU β-chain domain observed in the discovery and replication cohorts plus those reported in 3 additional cohorts [14,15], totaling 5772 patients and controls, lent further support to an increased occurrence of rare coding variants predicted to affect the CLU β-chain in AD patients compared to healthy individuals (ORMH 1.96 [95% CI 1.18-3.25]; p = 0.009). Assessing the prevalence of variations affecting the β-chain domain by targeted resequencing of CLU exons 5 to 8 in a Caribbean Hispanic AD cohort, detected novel benign variants while predicted pathogenic variants found in stage I and II AD cohorts were absent. Of note, predicted pathogenic variant p.N369H was more frequently observed in Hispanic patient and control individuals then in stage I and II Caucasian cohorts, suggesting this variant might be a polymorphism which is in agreement with previous reports in African-Americans . Excluding p.N369H from our meta-analysis on rare variants did not alter the significance of our observation that rare coding variants affecting the CLU β-chain are more frequently observed in AD patients than control individuals (p = 0.001). It is possible that rare CLU variants contribute in a different way to disease risk in diverse populations and alternatively, that part of the genetic CLU variability is population-specific (e.g. Hispanic origin). Screening other and larger AD cohorts of different ethnicity and conducting functional analyses on putative pathogenic variants will help unraveling the role of these rare coding variants to AD risk. In this context it is relevant to mention that we have primarily used protein prediction algorithms that have their limitations as well. While our meta-analysis suggested that the CLU β-chain is involved in AD risk, to adequately discriminate between rare or low frequent pathogenic variants and benign polymorphisms, their pathogenicity needs to be experimentally evaluated. These experiments will have to be performed along with comparable efforts aiming at understanding the role of CLU in AD pathogenesis.
Although we identified an association of AD risk with clustering of rare coding variants affecting the CLU β chain, this association was independent from the association with rs11136000, the top GWA SNP [2,3]. Genotyping rs11136000 in the stage I and II cohorts followed by a meta- analysis showed significant association (p = 0.013). This association remained significant after exclusion of carriers with predicted pathogenic β-chain variants. This is in line with recent LRRK2 observations in which independent common risk associations were found after exclusion of carriers with known pathogenic mutations . Further, high-density association fine-mapping by genotyping common SNPs throughout the CLU locus in the stage I cohort, identified 4 additional SNP associations, that all except one (rs9331908) were in high LD with rs11136000 (r2 > 0.89) and located in the 5' CLU region. This association region ranges from CLU intron 1 to exon 5, which is only marginally overlapping with the rare variant association region in the β-chain (exon 5 to 8). Combined with the two independent association signals for common and rare CLU variants, this could imply two distinct mechanisms underlying AD risk.
Further genotyping of the 4 significant SNPs and combined meta-analyses on all 3 AD cohorts showed, after Bonferroni correction, the strongest evidence of association with rs1532278 (p = 0.005). This variant is located in an intron 3 sequence that strongly resembles a regulatory element based on sequence alignment of seven species. Also, association evidence was found for rs867230 in CLU intron 1, which potentially affects a MEF2 transcription factor binding site. These two novel associated common variants may represent variants of functional relevance underlying the common association signal with rs11136000 in various populations [21-23]. In agreement with other reports [14,21], we did not observe any association with common coding variants. Though we did not find association with the recently described splice site variant rs9331888  in our Flanders-Belgian population (OR 1.11 [95% CI 0.94-1.31]; p = 0.2), our association findings do point to common variants with possible regulatory effects (rs1532278, rs867230). It is likely that a number of common and rare CLU variants contribute to an increased risk for AD. In addition, different mechanisms may contribute to disease: our common CLU associations might regulate CLU transcription while our rare CLU association signal results from non-synonymous variants affecting protein functioning. Whether these rare CLU variants represent loss or gain of function variants remains at present unclear and requires further studies.
In conclusion, in-depth CLU resequencing showed significant clustering and association of rare coding variants with AD risk in a combined meta-analysis of 3 Caucasian AD cohorts analyzed in this study and cohorts of previous published studies. The rare coding variants are non-synonymous substitutions and insertion/deletion mutations that affect the CLU β-chain domain. While the physiological properties of the β-chain domain remain unclear, our data suggests that this protein subunit may be of interest in AD pathogenesis, and merits follow-up with detailed functional analyses. Also, in our combined AD data set, the rare coding variant association was independent of the common AD risk association signal that we fine-mapped to a more 5' region of CLU. The strongest association was not obtained with GWA SNP rs11136000 but with an intron 3 SNP (rs1532278) located in a sequence with high regulatory potential. Altogether, our data suggest that CLU may be a risk gene in which multiple rare and common variants have independent effects on AD disease susceptibility.
In the stage I analysis, we used as discovery sample an extended Flanders-Belgian AD cohort of 1930 subjects. A fraction of this AD cohort (n = 1750, 91%) contributed to the replication phase of the 2 AD GWA studies confirming AD risk association at the CLU locus [2,3]. Since then we extended the AD cohort dataset with newly ascertained subjects and additional genotypes. The AD patients have been ascertained at memory clinics and neurology divisions in a prospective study of neurodegenerative and vascular dementia in Flanders, the Dutch-speaking region of Belgium [25,26], and in a comparable prospective study on molecular genetics of cognitive impairment  using the same clinical assessments and biosampling schemes (Table (Table1).1). Each AD patient underwent a neuropsychological examination, including Mini-Mental State Examination (MMSE) and structural and/or functional neuroimaging . Consensus diagnosis of possible or probable AD was obtained by minimal two neurologists based on the National Institute of Neurological and Communication Disorders and Stroke-Alzheimer's Disease and Related Disorders Association (NINCDS-ADRDA) criteria . Unrelated control individuals had no neurological or psychiatric antecedents or had neurological complaints or organic disease involving the central nervous system. Additional community control individuals were included after interview concerning medical and family history and a Mini Mental State Examination (MMSE) > 24 .
In the stage II analysis, we used two independent Caucasian AD patient/control cohorts for replication of rare coding variants in CLU exons 5-8 by targeted resequencing. A first French AD cohort consisted of patients ascertained in the North of France (Lille AD cohort) that were diagnosed with probable AD according to DSM-III-R and NINCDS/ADRDA criteria. Healthy control individuals were from the same geographical area and were cognitively intact, had no family history of AD or DSM-III-R criteria for dementia and a MMSE ≥ 25 (Table (Table1)1) [30,31]. In a second Canadian AD cohort (Toronto AD cohort ), diagnoses of probable or possible AD were defined according to the NINCDS-ADRDA criteria at clinics specialized in memory disorders . Subjects were classified as control individuals when they were without cognitive impairment or dementia at last visit (Table (Table1)1) .
In the stage III analysis, we initiated an assessment of the contribution of rare coding CLU variants in populations of different ethnicity by analyzing an AD cohort of Caribbean Hispanics ancestry for rare coding mutations by resequencing CLU exons 5-8 (Table (Table1).1). The diagnosis of AD was based on the NINCDS/ADRDA criteria [29,33].
All clinical and genetic studies described in this manuscript were approved by the medical ethical committees of the respective hospital divisions and university genetic laboratories at the respective cohort sampling sites in Flanders-Belgium, France, Canada and USA. Informed consent was obtained from all participants using procedures approved by institutional review boards at each of the clinical research centers enrolling subjects.
Unbiased resequencing was performed in the Flanders-Belgian AD cohort by PCR sequencing of all 9 CLU exons and their flanking intron-exon boundaries [NM_001831.2], the 5' and 3'UTR regions of longest transcript 1 [NM_001831.2], the 5'UTR region of transcript 2 [NM_20339.1] and regulatory elements (Table (Table22 and see table in Additional file 1). Primers were designed using ExonPrimer software and primer3. In stages II and III, we performed targeted resequencing of CLU exons 5-8 in the Lille, Toronto and Caribbean Hispanics AD cohorts (Table (Table33 Table Table5).5). All sequences were analyzed by two independent researchers using Seqman, the NovoSNP software package  or Vector NTI software (Invitrogen).
In stage I, an exon-by-exon approach was used to examine significance of putative clustering in AD patients of rare CLU non-synonymous variants with a MAF of < 0.01-0.05 (Table (Table4).4). Alleles were collapsed and overall frequency differences between AD patients and control individuals were compared using χ2 statistics. Additionally, we performed a literature search identifying in PubMed all published studies that were reporting a genetic screening of CLU in AD patients and healthy individuals of Caucasian origin, yielding two publications describing data in three AD cohorts from UK, Portugal and US [14,15]. For the meta-analysis, we included all rare coding variants predicting non-synonymous and insertion/deletion mutations in the CLU β-chain domain. Rare variant counts in patients and control individuals were obtained for each dataset. To obtain summary ORs and 95% CI, a fixed-effect (Mantel-Haenszel) meta-analysis was performed in R using the library rmeta-version 2.16.
The stage I Flanders-Belgian AD cohort (n = 1930, Table Table1)1) had an estimated power of 97% to detect a modest association with an odds ratio (OR) of ~1.5 and a common genetic variation with a minor allele frequency (MAF) of 0.2 . Tagging SNPs were selected throughout the CLU locus (chr8:27.538.244 - 27.500.368; including 10 kb up- and downstream regions), using the CEPH (Centre d'Etude du Polymorphism Humain) population of HapMap (Data Rel24/phase II Nov08). SNPs with a MAF < 0.05, with a Hardy-Weinberg equilibrium (HWE) p-value < 0.05 or within repeat sequences (> 50%) were removed from the selection. A total of 15 SNPs including the GWA top SNP rs11136000 [2,3], were used in the stage I Flanders-Belgian AD cohort for association analysis and for fine-mapping of the association signal in the CLU locus (See figure in Additional file 4).
Four SNPs were genotyped by sequencing (rs7982, rs867230, rs3216167, rs11136000), the others SNPs by Sequenom MassArray® assay, followed by MALDI-TOF mass spectrometry (Sequenom, Inc., Hamburg, Germany). PCR and extension primers were designed using Assay Design 3.1 Software. Genotypes were scored both automatically (MassArray Typer version 4.0) and by two researchers blind to disease status. Genotyping success rate for SNPs genotyped with Sequenom was > 95%. Inter-plate controls showed 100% concordance for all genotyped SNPs. Deviations from HWE were determined using an exact HWE test .
For common SNPs, differences in allele frequencies between AD patients and control individuals were tested using χ2 statistics. P-values and odds ratios (OR) with 95% confidence intervals for the minor allele were calculated relative to the common allele and corrected for gender, onset age or age at inclusion using binary logistic regression models (Table (Table6a).6a). Allelic associations were further stratified for APOE status i.e. presence or absence of APOE ε4 alleles (See table in additional file 3 and figure in Additional file 4). Statistical analyses were performed using SPSS 16.0 (SPSS, Inc., Chicago, IL). SNPs showing significant association at stage I, were genotyped in the Lille and Toronto AD cohorts and associations calculated (See table in Additional file 6). Fixed-effect (Mantel-Haenszel) meta-analysis was performed based on effect estimates of the separate cohorts adjusted for age, gender and APOE ε4 status in R using the library rmeta-version 2.16. Woolf's test for heterogeneity was calculated. A Bonferroni p-value < 0.05, corrected for the number of SNPs included in the meta-analysis (n = 5), was considered significant (Table (Table6b6b and see forest plots figure in Additional file 7).
AA: Amino Acids; AD: Alzheimer's disease; AAO: age at onset; AAI: age at inclusion; CI: confidence interval; CLU: clusterin; GWA: genome-wide association; HWE: Hardy-Weinberg equilibrium; MAF: minor allele frequency; nt: nucleotides; OR: odds ratio; SD: standard deviation; SNP: single nucleotide polymorphism; UTR: untranslated region.
The authors declare that they have no competing interests.
KB, KS, NB and CVB were involved in the conception and design of the study, and in the analysis and interpretation of the data. SE, RV, NLB, MM, KP, PDD, J-CL, ER, RM, PSH collected the patient materials included in this study. KB, SV and JVD participated in the sequencing studies, KB performed the statistical analyses, KB and KS drafted the manuscript. J-CL, ER, RV, FP, RM, PSH, PA, SE, RV, NLB, PDD have helped to revise the manuscript for intellectual content. CVB is the principal investigator. All authors have read and approved the final manuscript.
Synonymous CLU variants and CLU variants in 3' UTR, 5' UTR, regulatory regions, and splice sites in Flanders-Belgian AD cohort. aGene location position according to the longest CLU transcript with 9 coding exons [NM_001831.2], bNumbering according to build GRCh37/hg19 (Feb.2009), nucleotide changes given for complementary negative strand; MAF = Minor Allele Frequency, calculated upon the minimum number of successful sequences (1698 AD alleles, 1318 control alleles) (B) The miRanda algorithm (microRNA resource) was used to predict whether 3' UTR variants are involved in miRNA-binding . TargetScan  and Pictar  were applied for prediction of miRNA binding site positions. Polymorphisms in microRNA Target Site (PolymiRTS) and Patrocles databases were used  to estimate miRNA binding effects of variants. (D) Regulatory elements (proximal promoter, OREG0018671) predicted using FirstEF and OregAnno from UCSC browser Human Mar. 2006) were sequenced, encompassing 458 nt before the translation initiation codon (transcript 1) and 457 nt downstream of coding exon 1 [NM_001831.2]. No variants were positions in conserved transcription factor binding sites (according to UCSC). (E) Variants near splice sites. FSPLICE, Netgene2 and SPL were used to predict possible splicing effects.
Haplotype sharing of p.T445_D447del carriers. Allele sharing of p.T445_D447del carriers (3 Belgian patients, 3 French and 1 Canadian AD patient) was examined by genotyping 10 short tandem repeat (STR) markers located in a 1.3 Mb region covering the CLU locus. aPhysical location of STR markers is relative to the NCBI genome build 36, DNA of two unaffected children (with inclusion age < 50 years) of Belgian Flanders-Belgian carriers allowed reconstructing haplotypes; Flanders-Belgian carriers of the insertion/deletion shared a 10-marker haplotype covering 1.3 Mb around CLU (shared alleles are indicated in bold), two French patient carriers shared alleles at 7 consecutive markers of this haplotype, while French patient 2 and the Canadian patient 1 only shared 3 consecutive markers with the Belgian patients. Of note, Belgian insertion/deletion carriers also carried p.A309T (Table (Table2),2), which was not detected in the French or Canadian individuals.
Common CLU allelic associations in Flanders-Belgian APOE ε4 strata. aGene location position according to the longest CLU transcript with 9 coding exons [NM_001831.2], bNumbering according to build GRCh37/hg19 (Feb.2009), minor alleles given for complementary negative strand. Allele frequencies are shown with absolute numbers in brackets. Calculations of odds ratios, presented with 95% confidence intervals (CI), were performed using the common allele as reference allele. Nominal p-values were adjusted for age (onset age for patients, inclusion age for control individuals) and gender in the APOE subgroups. Nominally significant p-values are marked in bold.
Schematic overview of CLU allelic associations and LD pattern in Flanders-Belgian AD cohort. -log10 p-values for allelic associations of 15 common SNPs (MAF > 0.05) encompassing the CLU locus are given for the total cohort adjusted for age, gender and APOE, and for APOE ε4 genotype strata adjusted for age and gender (Additional file 3). Based upon 15 common SNP genotypes of stage I, the overall linkage disequilibrium (LD) plot was reconstructed with D' as LD measure drawn using the LDheatmap v0.2-8 package. The LD pattern consisted of a major LD block (12 consecutive SNPs starting from intron 3 to 3' intergenic region) and a minor LD block of 3 SNPs upstream from CLU.
Linkage disequilibrium measures in stage I and II AD cohorts. Pairwise linkage measures (D' and r2) are given for the associated SNPs in stage I and stage II cohorts.
Common CLU allelic associations in replication AD cohorts APOE ε4 strata. Allele frequencies are shown with absolute numbers in brackets, minor alleles given for complementary negative strand. Calculations of odds ratios, presented with 95% confidence intervals (CI), were performed using the common allele as reference allele. Nominal p-values were adjusted for age (onset age for patients, inclusion age for control individuals) and gender in the APOE subgroups. Nominally significant p-values are marked in bold.
Forest plots of common CLU association in stage I and II AD cohorts. Odds ratio's and 95% confidence intervals are given for stage I (Flanders-Belgian) and stage II cohorts (Lille, Toronto) separately as well as overall combining stage I and II.
The authors acknowledge the important participation of patients and their relatives as well as control individuals. The authors are grateful to the personnel of the VIB Genetic Service Facility, the Biobank of the Institute Born-Bunge, the Departments of Neurology and Memory Clinics Hospital Network Antwerp Middelheim and Hoge Beuken and the University Hospitals Leuven. The research at the Antwerp site was in part supported by the Interuniversity Attraction Poles program P6/43 of the Belgian Science Policy Office (BELSPO, http://www.belspo.be/), the Foundation for Alzheimer Research (SAO/FRMA, http://alzh.org/), a Methusalem Excellence Grant of the Flemish Government (EWI, http://www.ewi-vlaanderen.be/), the Research Foundation Flanders (FWO, http://www.fwo.be/), the Special Research Fund of the University of Antwerp (UA, http://www.ua.ac.be/), the Antwerp Medical Research Foundation and Neurosearch, Belgium. KB, NB and KS are postdoctoral fellows, and RV a senior clinical investigator of the FWO. The Lille site was funded in part by the National Foundation for Alzheimer's disease and related disorders, the Institute Pasteur de Lille and INSERM. The Toronto site was supported by grants from the Canadian Institutes of Health Research, Ontario Research Fund, Weston foundation (PSH), the Howard Hughes Medical Institute, The Wellcome Trust, the Alzheimer Society of Ontario (PSH).