The QT interval, an electrocardiographic measure reflecting myocardial repolarization, is a heritable trait. QT prolongation is a risk factor for ventricular arrhythmias and sudden cardiac death (SCD) and could indicate the presence of the potentially lethal Mendelian Long QT Syndrome (LQTS). Using a genome-wide association and replication study in up to 100,000 individuals we identified 35 common variant QT interval loci, that collectively explain ∼8-10% of QT variation and highlight the importance of calcium regulation in myocardial repolarization. Rare variant analysis of 6 novel QT loci in 298 unrelated LQTS probands identified coding variants not found in controls but of uncertain causality and therefore requiring validation. Several newly identified loci encode for proteins that physically interact with other recognized repolarization proteins. Our integration of common variant association, expression and orthogonal protein-protein interaction screens provides new insights into cardiac electrophysiology and identifies novel candidate genes for ventricular arrhythmias, LQTS,and SCD.
genome-wide association study; QT interval; Long QT Syndrome; sudden cardiac death; myocardial repolarization; arrhythmias
Schizophrenia is a highly heritable disorder. Genetic risk is conferred by a large number of alleles, including common alleles of small effect that might be detected by genome-wide association studies. Here, we report a multi-stage schizophrenia genome-wide association study of up to 36,989 cases and 113,075 controls. We identify 128 independent associations spanning 108 conservatively defined loci that meet genome-wide significance, 83 of which have not been previously reported. Associations were enriched among genes expressed in brain providing biological plausibility for the findings. Many findings have the potential to provide entirely novel insights into aetiology, but associations at DRD2 and multiple genes involved in glutamatergic neurotransmission highlight molecules of known and potential therapeutic relevance to schizophrenia, and are consistent with leading pathophysiological hypotheses. Independent of genes expressed in brain, associations were enriched among genes expressed in tissues that play important roles in immunity, providing support for the hypothesized link between the immune system and schizophrenia.
We conducted imputation to the 1000 Genomes Project of four genome-wide association studies of lung cancer in populations of European ancestry (11,348 cases and 15,861 controls) and genotyped an additional 10,246 cases and 38,295 controls for follow-up. We identified large-effect genome-wide associations for squamous lung cancer with the rare variants of BRCA2-K3326X (rs11571833; odds ratio [OR]=2.47, P=4.74×10−20) and of CHEK2-I157T (rs17879961; OR=0.38 P=1.27×10−13). We also showed an association between common variation at 3q28 (TP63; rs13314271; OR=1.13, P=7.22×10−10) and lung adenocarcinoma previously only reported in Asians. These findings provide further evidence for inherited genetic susceptibility to lung cancer and its biological basis. Additionally, our analysis demonstrates that imputation can identify rare disease-causing variants having substantive effects on cancer risk from pre-existing GWAS data.
Background and Purpose
Ischemic stroke (IS) and coronary artery disease (CAD) share several risk factors and each have a substantial heritability. We conducted a genome-wide analysis to evaluate the extent of shared genetic determination of the two diseases.
Genome-wide association data were obtained from the METASTROKE, CARDIoGRAM, and C4D consortia. We first analyzed common variants reaching a nominal threshold of significance (p<0.01) for CAD for their association with IS and vice versa. We then examined specific overlap across phenotypes for variants that reached a high threshold of significance. Finally, we conducted a joint meta-analysis on the combined phenotype of IS or CAD. Corresponding analyses were performed restricted to the 2,167 individuals with the ischemic large artery stroke (LAS) subtype.
Common variants associated with CAD at p<0.01 were associated with a significant excess risk for IS and for LAS and vice versa. Among the 42 known genome-wide significant loci for CAD, three and five loci were significantly associated with IS and LAS, respectively. In the joint meta-analyses, 15 loci passed genome-wide significance (p<5×10-8) for the combined phenotype of IS or CAD and 17 loci passed genome-wide significance for LAS or CAD. Since these loci had prior evidence for genome-wide significance for CAD we specifically analyzed the respective signals for IS and LAS and found evidence for association at chr12q24/SH2B3 (pIS=1.62×10-07) and ABO (pIS =2.6×10-4) as well as at HDAC9 (pLAS=2.32×10-12), 9p21 (pLAS =3.70×10-6), RAI1-PEMT-RASD1 (pLAS =2.69×10-5), EDNRA (pLAS =7.29×10-4), and CYP17A1-CNNM2-NT5C2 (pLAS =4.9×10-4).
Our results demonstrate substantial overlap in the genetic risk of ischemic stroke and particularly the large artery stroke subtype with coronary artery disease.
Sequence polymorphisms linked to human diseases and phenotypes in genome-wide association studies often affect non-coding regions. A single nucleotide polymorphism (SNP) within an intron of the gene encoding Interferon Regulatory Factor 4 (IRF4), a transcription factor with no known role in melanocyte biology, is strongly associated with sensitivity of skin to sun exposure, freckles, blue eyes and brown hair color. Here we demonstrate that this SNP lies within an enhancer of IRF4 transcription in melanocytes. The allele associated with this pigmentation phenotype impairs binding of the TFAP2A transcription factor which together with the melanocyte master regulator MITF, regulates activity of the enhancer. Assays in zebrafish and mice reveal that IRF4 cooperates with MITF to activate expression of Tyrosinase (TYR), an essential enzyme in melanin synthesis. Our findings provide a clear example of a non-coding polymorphism that affects a phenotype by modulating a developmental gene regulatory network.
We performed a genome-wide association study on 1,292 individuals with abdominal aortic aneurysms (AAAs) and 30,503 controls from Iceland and The Netherlands, with a follow-up of top markers in up to 3,267 individuals with AAAs and 7,451 controls. The A allele of rs7025486 on 9q33 was found to associate with AAA, with an odds ratio (OR) of 1.21 and P = 4.6 × 10−10. In tests for association with other vascular diseases, we found that rs7025486[A] is associated with early onset myocardial infarction (OR = 1.18, P = 3.1 × 10−5), peripheral arterial disease (OR = 1.14, P = 3.9 × 10−5) and pulmonary embolism (OR = 1.20, P = 0.00030), but not with intracranial aneurysm or ischemic stroke. No association was observed between rs7025486[A] and common risk factors for arterial and venous diseases—that is, smoking, lipid levels, obesity, type 2 diabetes and hypertension. Rs7025486 is located within DAB2IP, which encodes an inhibitor of cell growth and survival.
Abdominal aortic aneurysm (AAA) is a common human disease with a high estimated heritability (0.7); however, only a small number of associated genetic loci have been reported to date. In contrast, over 100 loci have now been reproducibly associated with either blood lipid profile and/or coronary artery disease (CAD) (both risk factors for AAA) in large-scale meta-analyses. This study employed a staged design to investigate whether the loci for these two phenotypes are also associated with AAA. Validated CAD and dyslipidaemia loci underwent screening using the Otago AAA genome-wide association data set. Putative associations underwent staged secondary validation in 10 additional cohorts. A novel association between the SORT1 (1p13.3) locus and AAA was identified. The rs599839 G allele, which has been previously associated with both dyslipidaemia and CAD, reached genome-wide significance in 11 combined independent cohorts (meta-analysis with 7048 AAA cases and 75 976 controls: G allele OR 0.81, 95% CI 0.76–0.85, P = 7.2 × 10−14). Modelling for confounding interactions of concurrent dyslipidaemia, heart disease and other risk factors suggested that this marker is an independent predictor of AAA susceptibility. In conclusion, a genetic marker associated with cardiovascular risk factors, and in particular concurrent vascular disease, appeared to independently contribute to susceptibility for AAA. Given the potential genetic overlap between risk factor and disease phenotypes, the use of well-characterized case–control cohorts allowing for modelling of cardiovascular disease risk confounders will be an important component in the future discovery of genetic markers for conditions such as AAA.
Long non-coding ribonucleic acids (lncRNAs) have been proposed as biomarkers in prostate cancer. This paper proposes a selection method which uses data from tiled microarrays to identify relatively long regions of moderate expression independent of the microarray platform and probe design. The method is used to search for candidate long non-coding ribonucleic acids (lncRNAs) at locus 8q24 and is run on three independent experiments which all use samples from prostate cancer patients. The robustness of the method is tested by utilizing repeated copies of tiled probes. The method shows high consistency between experiments that used the same samples, but different probe layout. There also is statistically significant consistency when comparing experiments with different samples. The method selected the long non-coding ribonucleic acid PCNCR1 in all three experiments.
Loss-of-function mutations protective against human disease provide in vivo validation of therapeutic targets1,2,3, yet none are described for type 2 diabetes (T2D). Through sequencing or genotyping ~150,000 individuals across five ethnicities, we identified 12 rare protein-truncating variants in SLC30A8, which encodes an islet zinc transporter (ZnT8)4 and harbors a common variant (p.Trp325Arg) associated with T2D risk, glucose, and proinsulin levels5–7. Collectively, protein-truncating variant carriers had 65% reduced T2D risk (p=1.7×10−6), and non-diabetic Icelandic carriers of a frameshift variant (p.Lys34SerfsX50) demonstrated reduced glucose levels (−0.17 s.d., p=4.6×10−4). The two most common protein-truncating variants (p.Arg138X and p.Lys34SerfsX50) individually associate with T2D protection and encode unstable ZnT8 proteins. Previous functional study of SLC30A8 suggested reduced zinc transport increases T2D risk8,9, yet phenotypic heterogeneity was observed in rodent Slc30a8 knockouts10–15. Contrastingly, loss-of-function mutations in humans provide strong evidence that SLC30A8 haploinsufficiency protects against T2D, proposing ZnT8 inhibition as a therapeutic strategy in T2D prevention.
Elevated resting heart rate is associated with greater risk of cardiovascular disease and mortality. In a 2-stage meta-analysis of genome-wide association studies in up to 181,171 individuals, we identified 14 new loci associated with heart rate and confirmed associations with all 7 previously established loci. Experimental downregulation of gene expression in Drosophila melanogaster and Danio rerio identified 20 genes at 11 loci that are relevant for heart rate regulation and highlight a role for genes involved in signal transmission, embryonic cardiac development and the pathophysiology of dilated cardiomyopathy, congenital heart failure and/or sudden cardiac death. In addition, genetic susceptibility to increased heart rate is associated with altered cardiac conduction and reduced risk of sick sinus syndrome, and both heart rate–increasing and heart rate–decreasing variants associate with risk of atrial fibrillation. Our findings provide fresh insights into the mechanisms regulating heart rate and identify new therapeutic targets.
Low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol, triglycerides, and total cholesterol are heritable, modifiable, risk factors for coronary artery disease. To identify new loci and refine known loci influencing these lipids, we examined 188,578 individuals using genome-wide and custom genotyping arrays. We identify and annotate 157 loci associated with lipid levels at P < 5×10−8, including 62 loci not previously associated with lipid levels in humans. Using dense genotyping in individuals of European, East Asian, South Asian, and African ancestry, we narrow association signals in 12 loci. We find that loci associated with blood lipids are often associated with cardiovascular and metabolic traits including coronary artery disease, type 2 diabetes, blood pressure, waist-hip ratio, and body mass index. Our results illustrate the value of genetic data from individuals of diverse ancestries and provide insights into biological mechanisms regulating blood lipids to guide future genetic, biological, and therapeutic research.
Triglycerides are transported in plasma by specific triglyceride-rich lipoproteins; in epidemiologic studies, increased triglyceride levels correlate with higher risk for coronary artery disease (CAD). However, it is unclear whether this association reflects causal processes. We used 185 common variants recently mapped for plasma lipids (P<5×10−8 for each) to examine the role of triglycerides on risk for CAD. First, we highlight loci associated with both low-density lipoprotein cholesterol (LDL-C) and triglycerides, and show that the direction and magnitude of both are factors in determining CAD risk. Second, we consider loci with only a strong magnitude of association with triglycerides and show that these loci are also associated with CAD. Finally, in a model accounting for effects on LDL-C and/or high-density lipoprotein cholesterol, a polymorphism's strength of effect on triglycerides is correlated with the magnitude of its effect on CAD risk. These results suggest that triglyceride-rich lipoproteins causally influence risk for CAD.
Approaches exploiting extremes of the trait distribution may reveal novel loci for common traits, but it is unknown whether such loci are generalizable to the general population. In a genome-wide search for loci associated with upper vs. lower 5th percentiles of body mass index, height and waist-hip ratio, as well as clinical classes of obesity including up to 263,407 European individuals, we identified four new loci (IGFBP4, H6PD, RSRC1, PPP2R2A) influencing height detected in the tails and seven new loci (HNF4G, RPTOR, GNAT2, MRPS33P4, ADCY9, HS6ST3, ZZZ3) for clinical classes of obesity. Further, we show that there is large overlap in terms of genetic structure and distribution of variants between traits based on extremes and the general population and little etiologic heterogeneity between obesity subgroups.
Schizophrenia is a heritable disorder with substantial public health
impact. We conducted a multi-stage genome-wide association study (GWAS) for
schizophrenia beginning with a Swedish national sample (5,001 cases, 6,243
controls) followed by meta-analysis with prior schizophrenia GWAS (8,832 cases,
12,067 controls) and finally by replication of SNPs in 168 genomic regions in
independent samples (7,413 cases, 19,762 controls, and 581 trios). In total, 22
regions met genome-wide significance (14 novel and one previously implicated in
bipolar disorder). The results strongly implicate calcium signaling in the
etiology of schizophrenia, and include genome-wide significant results for
CACNA1C and CACNB2 whose protein products
interact. We estimate that ∼8,300 independent and predominantly common
SNPs contribute to risk for schizophrenia and that these collectively account
for most of its heritability. Common genetic variation plays an important role
in the etiology of schizophrenia, and larger studies will allow more detailed
understanding of this devastating disorder.
schizophrenia; genetics; genome-wide association; meta-analysis
Central corneal thickness (CCT) is associated with eye conditions including keratoconus and glaucoma. We performed a meta-analysis on >20,000 individuals in European and Asian populations that identified 16 new loci associated with CCT at genome-wide significance (P < 5 × 10−8). We further showed that 2 CCT-associated loci, FOXO1 and FNDC3B, conferred relatively large risks for keratoconus in 2 cohorts with 874 cases and 6,085 controls (rs2721051 near FOXO1 had odds ratio (OR) = 1.62, 95% confidence interval (CI) = 1.4–1.88, P = 2.7 × 10−10, and rs4894535 in FNDC3B had OR = 1.47, 95% CI = 1.29–1.68, P = 4.9 × 10−9). FNDC3B was also associated with primary open-angle glaucoma (P = 5.6 × 10−4; tested in 3 cohorts with 2,979 cases and 7,399 controls). Further analyses implicate the collagen and extracellular matrix pathways in the regulation of CCT.
To search for new sequence variants that confer risk of cutaneous basal cell carcinoma (BCC), we conducted a genome-wide association study of 38.5 million single nucleotide polymorphisms (SNPs) and small indels identified through whole-genome sequencing of 2230 Icelanders. We imputed genotypes for 4208 BCC patients and 109 408 controls using Illumina SNP chip typing data, carried out association tests and replicated the findings in independent population samples. We found new BCC susceptibility loci at TGM3 (rs214782[G], P = 5.5 × 10−17, OR = 1.29) and RGS22 (rs7006527[C], P = 8.7 × 10−13, OR = 0.77). TGM3 encodes transglutaminase type 3, which plays a key role in production of the cornified envelope during epidermal differentiation.
End-stage coagulation and the structure/function of fibrin are implicated in the pathogenesis of ischaemic stroke. We explored whether genetic variants associated with end-stage coagulation in healthy volunteers account for the genetic predisposition to ischemic stroke and examined their influence on stroke subtype.
Common genetic variants identified through genome-wide association studies of coagulation factors and fibrin structure/function in healthy twins (n=2,100 Stage 1) were examined in ischemic stroke (n=4,200 cases) using 2 independent samples of European ancestry (Stage 2). A third clinical collection having stroke subtyping (total 8,900 cases 55,000 controls) was used for replication (Stage 3).
Stage 1 identified 524 SNPs from 23 LD blocks having significant association (p<5 ×10-8) with one or more coagulation/fibrin phenotypes. Most striking associations included SNP rs5985 with factor XIII activity (p=2.6×10-186), rs10665 with FVII (p = 2.4×10-47) and rs505922 in the ABO gene with both von Willebrand Factor (vWF p=4.7×10-57) and factor VIII (p=1.2×10-36). In Stage 2, the 23 independent SNPs were examined in stroke cases/non-cases using MORGAM and WTCCC2 collections. SNP rs505922 was nominally associated with ischaemic stroke, odds ratio = 0.94 (95% confidence intervals, 0.88-0.99), p=0.023. Independent replication in Meta-Stroke confirmed the rs505922 association with stroke, beta=0.066 (0.02) p = 0.001, a finding specific to large vessel and cardioembolic stroke (p = 0.001 and p = <0.001 respectively) but not seen with small vessel stroke (p=0.811).
ABO gene variants are associated with large vessel and cardioembolic stroke but not small vessel disease. This work sheds light on the different pathogenic mechanisms underpinning stroke subtype.
GWAS; thrombosis; stroke; coagulation factor; stroke subtype
Epidemiological and genetic data support the notion that schizophrenia and bipolar disorder share genetic risk factors. In our previous genome-wide association (GWA) study, meta-analysis and follow-up (totaling as many as 18,206 cases and 42,536 controls), we identified four loci showing genome-wide significant association with schizophrenia. Here we consider a mixed schizophrenia and bipolar disorder (psychosis) phenotype (addition of 7,469 bipolar disorder cases, 1,535 schizophrenia cases, 333 other psychosis cases, 808 unaffected family members and 46,160 controls). Combined analysis reveals a novel variant at 16p11.2 showing genome-wide significant association (rs4583255[T], OR = 1.08, P = 6.6 × 10−11). The new variant is located within a 593 kb region that substantially increases risk of psychosis when duplicated. In line with the association of the duplication with reduced body mass index (BMI), rs4583255[T] is also associated with lower BMI (P = 0.0039 in the public GIANT consortium dataset; P = 0.00047 in 22,651 additional Icelanders).
schizophrenia; bipolar disorder; association; 16p11.2; cross-disorder
Sexual dimorphism in various bone phenotypes, including bone mineral density (BMD), is widely observed; however the extent to which genes explain these sex differences is unclear. To identify variants with different effects by sex, we examined gene-by-sex autosomal interactions genome-wide, and performed eQTL analysis and bioinformatics network analysis.
We conducted an autosomal genome-wide meta-analysis of gene-by-sex interaction on lumbar spine (LS-) and femoral neck (FN-) BMD, in 25,353 individuals from eight cohorts. In a second stage, we followed up the 12 top SNPs (P<1×10−5) in an additional set of 24,763 individuals. Gene-by-sex interaction and sex-specific effects were examined in these 12 SNPs.
We detected one novel genome-wide significant interaction associated with LS-BMD at the Chr3p26.1-p25.1 locus, near the GRM7 gene (male effect = 0.02 & p-value = 3.0×10−5; female effect = −0.007 & p-value=3.3×10−2) and eleven suggestive loci associated with either FN- or LS-BMD in discovery cohorts. However, there was no evidence for genome-wide significant (P<5×10−8) gene-by-sex interaction in the joint analysis of discovery and replication cohorts.
Despite the large collaborative effort, no genome-wide significant evidence for gene-by-sex interaction was found influencing BMD variation in this screen of autosomal markers. If they exist, gene-by-sex interactions for BMD probably have weak effects, accounting for less than 0.08% of the variation in these traits per implicated SNP.
gene-by-sex; interaction; BMD; association; aging
We report the results of an association study of melanoma based on the genome-wide imputation of the genotypes of 1,353 cases and 3,566 controls of European origin conducted by the GenoMEL consortium. This revealed a novel association between several single nucleotide polymorphisms (SNPs) in intron 8 of the FTO gene, including rs16953002, which replicated using 12,313 cases and 55,667 controls of European ancestry from Europe, the USA and Australia (combined p=3.6×10−12, per-allele OR for A=1.16). As well as identifying a novel melanoma susceptibility locus, this is the first study to identify and replicate an association with SNPs in FTO not related to body mass index (BMI). These SNPs are not in intron 1 (the BMI-related region) and show no association with BMI. This suggests FTO’s function may be broader than the existing paradigm that FTO variants influence multiple traits only through their associations with BMI and obesity.
Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many eQTL studies typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis-effect on expression cannot be accounted for by common cis-variants, a finding which exposes the contribution of low frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene and identify several replicating trans-variants which act predominantly in a tissue-restricted manner and may regulate the transcription of many genes.
Schizophrenia is a severe mental disorder affecting ~1% of the world population, with heritability of up to 80%. To identify new common genetic risk factors, we performed a genome-wide association study (GWAS) in the Han Chinese population. The discovery sample set consisted of 3,750 patients and 6,468 healthy controls (1,578 cases and 1,592 controls from the Northern Han; 1,238 cases and 2,856 controls from the Central Han; 934 cases and 2,020 controls from the Southern Han); and we followed up the top association signals in an additional independent cohort of 4,383 cases and 4,539 controls from the Han Chinese. Meta-analysis identified genome-wide significant association of common SNPs with schizophrenia on chromosome 8p12 (rs16887244, P=1.27×10−10) and 1q24.2 (rs10489202, P=9.50×10−9). Our findings provide new insights into the pathogenesis of schizophrenia.
Effects of susceptibility variants may depend on from which parent they are inherited. While many associations between sequence variants and human traits have been discovered through genome-wide associations, the impact of parental origin has largely been ignored. Combining genealogy with long range phasing, we demonstrate that for 38,167 Icelanders genotyped using SNP chips, the parental origin of most alleles can be determined. We then focused on SNPs that associate with diseases and are within 500kb of known imprinted genes. Seven independent SNP associations were examined. Five, one each with breast cancer and basal cell carcinoma, and three with type 2 diabetes (T2D), exhibit parental-origin specific associations. These variants are located in two genomic regions, 11p15 and 7q32, each harbouring a cluster of imprinted genes. Furthermore, a novel variant rs2334499 at 11p15 was seen to associate with T2D where the allele that confers risk when paternally inherited is protective when maternally transmitted. We identified a differentially methylated CTCF binding site at 11p15 and demonstrated correlation of rs2334499 with decreased methylation of that site.
We identified a marker in LINGO1 showing genome-wide significant association (P = 1.2 × 10−9, odds ratio = 1.55) with essential tremor. LINGO1 has potent, negative regulatory influences on neuronal survival and is also important in regulating both central-nervous-system axon regeneration and oligodendrocyte maturation. An increase in the number of fusiform swellings of Purkinje cell axons in LINGO1 knockout models highlights the potential role of LINGO1 in essential tremor pathophysiology.