Data summaries are provided in Table . Differences in means and standard deviations between countries are largely attributable to differing measurement protocols. All four traits are inter-correlated. The cup area has a much higher coefficient of variation than the rim area. Thus, the majority of variation in CDR is driven by the observed variation in the cup area. CDR, the trait commonly used clinically, is highly correlated with the cup area (Pearson correlation 0.89). Since the disc area is the sum of the cup and rim areas, both the disc rim and disc cup correlations are also high (>0.6). Given these high correlations, the reported P-values are not corrected for the testing of multiple traits.
| Table 1.Descriptive statistics for optic disc areas in the two cohorts |
Genotyping quality appeared excellent with an average discordance rate of 49 per million across the seven Australian individuals genotyped at both the Center for Inherited Disease Research (CIDR, The Johns Hopkins University, Baltimore, MD, USA) and deCODE Genetics (Reykjavik, Iceland). In the Australian data, after the removal of ancestry outliers, there was no evidence for population stratification.
Supplementary Material Figures S1–S4 show that the Q–Q plots for each trait approximate what would be expected in a homogeneous population. The only strong deviation observed was for the disc area, a trait for which we have found clear evidence for genome-wide significant associations. The genomic inflation factor (λ) in the Australian cohort was 1.01 for all four optic nerve traits here. The λ values in the UK data were 0.99, 1.04, 0.98 and 1.02 for the disc area, cup area, rim area and CDR, respectively. Results are presented uncorrected for λ.
The results for association testing across the genome in the Australian cohort are provided in Figure . There is a clear genome-wide significant signal at rs3858145 for the disc area (P = 6.2 × 10−10), <20 kb from the nearest gene, atonal homolog 7 (ATOH7) on chromosome 10q21. SNP rs3858145 was also associated with the cup area (P = 1.4 × 10−5), CDR (P = 1.1 × 10−3) and the rim area (P = 2.0 × 10−3). Imputing SNPs in HapMap slightly increased the evidence for association in the ATOH7 region, with the most associated (P = 2.0 × 10−10) SNP for the disc area being rs10762201, at 69 710 117 bp. No other region revealed genome-wide significant signals (P < 5 × 10−8) in the Australian cohort alone, and no SNPs reached genome-wide significance in the UK cohort alone.
To confirm the
ATOH7 locus and identify further SNPs of smaller effect, we performed a meta-analysis of the Australian and UK cohorts (Table ). The
ATOH7 region replicated in the UK cohort, with
P = 1.3 × 10
−2 for rs3858145 for the disc area. After imputation, the meta-analysis of both cohorts indicated maximum evidence for association at SNP rs1900004 for the disc area (
P = 4.2 × 10
−11). The effect size estimates (standard errors) at rs1900004 (C allele) were 0.298 (0.049),
P = 1.7 × 10
−9 and 0.179 (0.06),
P = 2.9 × 10
−3 in the Australian and UK data, respectively. All SNPs in the
ATOH7 region with minor allele frequency (MAF) >0.1 in HapMap Centre d'Etude du Polymorphisme Humain (CEU) samples could be successfully imputed in both the Australian and UK cohorts; meta-analysis results for both imputed and genotyped SNPs are displayed in Figure . There is little recombination across the region from the 5′ (centromeric) end of
ATOH7 at 69.660 mb to the 5′ (centromeric) end of the
phenazine biosynthesis-like protein domain-containing gene (
PBLD) at 69.712 mb. SNP s1900004 is <5 kb from the telomeric end of
ATOH7, whereas several other SNPs, from rs7916697 (
P = 1.3 × 10
−10) at 69661859 to rs10762201 (
P = 6.9 × 10
−11) at 69710117, have
P-values around 10
−10. SNP rs7916697 lies within
ATOH7. Two additional SNPs in the region typed in the HapMap CEU samples, rs34888891 and rs7896916, have MAF in the range 0.05–0.1 but could not be imputed in the Australian or UK data sets. All of the most associated SNPs in the region had MAFs close to 0.25. We examined publicly available data on the effects of SNPs in the
ATOH7 region on gene expression in transformed lymphocytes (
15,
16). None of the SNPs in or around
ATOH7, that were typed by Dixon
et al. (
15), were significantly associated (all
P > 0.001) with
ATOH7 expression. Similarly, rs3858145 had no significant influence on
ATOH7 expression in the data described by Stranger
et al. (
16).
| Table 2.Directly genotyped variants identified on meta-analysis associated with optic nerve endophenotypes (P < 5 × 10−7) |
Atoh7/
Ath5 embryonic expression and function have been studied in multiple vertebrate model organisms (reviewed in
17). In the prenatal mouse retina,
Atoh7/
Math5 is activated just prior to the initiation of RGC histogenesis (
18) and expressed by exiting/postmitotic retinal progenitor cells (
19,
20). As displayed in Figure , these cells coexpress
Atoh7, and the transcription factor Pax6, which was recently shown to bind directly to a highly conserved 5′ retinal enhancer (
21–
23).
Sequence results in our optic nerve hypoplasia cohort revealed distinct cases with two SNPs in the 5′-UTR (untranslated gene region) (rs7916697, rs61854782), one novel coding synonymous SNP (g.487C>T; Gly22Gly) and two novel coding non-synonymous SNPs: a single nucleotide substitution with an amino acid change of arginine to glycine (g.614A>G; Arg65Gly) within helix 1 of the encoded ATOH7 protein, and a single base pair substitution resulting in an amino acid sequence change of an alanine to a threonine (g.560G>A; Ala47Thr) located N terminal to the basic domain. Neither of these non-synonymous coding SNPs were identified in 90 unrelated control individuals (combined Fisher's exact test P = 0.0136).
The predicted effects of the novel coding non-synonymous variants (g.560G>A; Ala47Thr and g.614A>G; Arg65Gly) were examined using SIFT analysis (
http://blocks.fhcrc.org/sift/SIFT.html) and PolyPhen prediction (
http://genetics.bwh.harvard.edu/pph). Both variants were predicted to be damaging based on SIFT analysis, although PolyPhen analysis determined Ala47Thr to be benign. The Ala47Thr variant is highly conserved across eutherian mammals (
Supplementary Material, Fig. S5) in which a hydrophobic-aliphatic alanine is converted to a neutral threonine that consists of polar side chains. The Arg65Gly variant is highly conserved across eutherian mammals (
Supplementary Material, Fig. S5) and is predicted to alter the hydrophobicity of the amino acid from a hydrophilic arginine to a neutral glycine.
Based on these results, we decided to pursue determination of variant frequency of the Arg65Gly variant only. As mentioned earlier, none of the 90 bidirectionally sequenced controls had the variant. A TaqMan assay of the SNP was designed and an additional 672 genomic DNA samples from a Duke myopia cohort with ophthalmic examination information were genotyped. In this cohort, eight DNA samples had the SNP by TaqMan genotyping, but only one was confirmed with additional bidirectional sequencing of these specific DNA samples. Therefore, out of 672 Duke DNA samples with full eye examinations and no optic nerve hypoplasia noted clinically, 1 sample had the Arg to Gly SNP variant by TaqMan genotyping and confirmed with bidirectional sequencing. In the total set of 762 individuals typed at this SNP, the allele frequency was estimated as 1/(2 × 762) = 0.00066. Comparing the frequency of this SNP in 12 hypoplasia cases (1/24 = 0.042) compared with 762 controls yielded a Fisher's exact test P = 0.035.
An SNP in the
raftlin lipid raft linker 1 gene (
RFTN1) on chromosome 3p24, rs690037, was associated with the cup area in both cohorts, with the meta-analysis providing suggestive evidence for association (
P = 1.5 × 10
−7, Table and
Supplementary Material, Table S3). Imputation for this region of chromosome 3 did not improve the evidence for association, with the genotyped SNP rs690037 remaining the most associated SNP after imputation.
In the meta-analysis for the disc area, there was also suggestive evidence (
P = 3.1 × 10
−7) for association at rs1192415 near
heat shock protein 90 kDa beta (Grp94) member 3 (
HSP90B3P), a pseudogene on chromosome 1p22 (Table and
Supplementary Material, Table S2). Similarly, there were signals with
P < 5 × 10
−7 for the disc area and SNPs near the genes
low density lipoprotein-related protein 1B (
LRP1B, chromosome 2q21) and
zinc finger protein 157 (
ZNF157, chromosome Xp11). The meta-analysis results for disc, cup, rim and CDR for the top genotyped SNPs are shown in
Supplementary Material, Tables S2–S5.