We report novel defects of mitochondrial translation elongation factor Ts (EFTs), with high carrier frequency in Finland and expand the manifestations of this disease group from infantile cardiomyopathy to juvenile neuropathy/encephalopathy disorders.
DNA analysis, whole-exome analysis, protein biochemistry, and protein modeling.
We used whole-exome sequencing to find the genetic cause of infantile-onset mitochondrial cardiomyopathy, progressing to juvenile-onset Leigh syndrome, neuropathy, and optic atrophy in 2 siblings. We found novel compound heterozygous mutations, c.944G>A [p.C315Y] and c.856C>T [p.Q286X], in the TSFM gene encoding mitochondrial EFTs. The same p.Q286X variant was found as compound heterozygous with a splice site change in a patient from a second family, with juvenile-onset optic atrophy, peripheral neuropathy, and ataxia. Our molecular modeling predicted the coding-region mutations to cause protein instability, which was experimentally confirmed in cultured patient cells, with mitochondrial translation defect and lacking EFTs. Only a single TSFM mutation has been previously described in different populations, leading to an infantile fatal multisystem disorder with cardiomyopathy. Sequence data from 35,000 Finnish population controls indicated that the heterozygous carrier frequency of p.Q286X change was exceptionally high in Finland, 1:80, but no homozygotes were found in the population, in our mitochondrial disease patient collection, or in an intrauterine fetal death material, suggesting early developmental lethality of the homozygotes.
We show that in addition to early-onset cardiomyopathy, TSFM mutations should be considered in childhood and juvenile encephalopathies with optic and/or peripheral neuropathy, ataxia, or Leigh disease.
Genome-wide association analysis on monozygotic twin pairs offers a route to discovery of gene–environment interactions through testing for variability loci associated with sensitivity to individual environment/lifestyle. We present a genome-wide scan of loci associated with intra-pair differences in serum lipid and apolipoprotein levels. We report data for 1,720 monozygotic female twin pairs from GenomEUtwin project with 2.5 million SNPs, imputed or genotyped, and measured serum lipid fractions for both twins. We found one locus associated with intra-pair differences in high density lipoprotein (HDL) cholesterol, rs2483058 in an intron of SRGAP2, where twins carrying the C allele are more sensitive to environmental factors (p = 3.98 × 10−8). We followed up the association in further genotyped monozygotic twins (N = 1 261) which showed a moderate association for the variant (p = .002, same direction of an effect). In addition, we report a new association on the level of apolipoprotein A-II (p = 4.03 × 10−8).
twins; association; lipids; apolipoproteins; interaction
Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10-7) in SOX2OT and rs17030795 (P=5.84×10-6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10-6) between CUL3 and FAM124B and rs1886797 (P=8.05×10-6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4×10-6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field.
anorexia nervosa; eating disorders; GWAS; genome-wide association study; body mass index; metabolic
Schizophrenia is a highly heritable disorder. Genetic risk is conferred by a large number of alleles, including common alleles of small effect that might be detected by genome-wide association studies. Here, we report a multi-stage schizophrenia genome-wide association study of up to 36,989 cases and 113,075 controls. We identify 128 independent associations spanning 108 conservatively defined loci that meet genome-wide significance, 83 of which have not been previously reported. Associations were enriched among genes expressed in brain providing biological plausibility for the findings. Many findings have the potential to provide entirely novel insights into aetiology, but associations at DRD2 and multiple genes involved in glutamatergic neurotransmission highlight molecules of known and potential therapeutic relevance to schizophrenia, and are consistent with leading pathophysiological hypotheses. Independent of genes expressed in brain, associations were enriched among genes expressed in tissues that play important roles in immunity, providing support for the hypothesized link between the immune system and schizophrenia.
We present the analysis of a prospective multicentre study to investigate genetic effects on the prognosis of newly treated epilepsy. Patients with a new clinical diagnosis of epilepsy requiring medication were recruited and followed up prospectively. The clinical outcome was defined as freedom from seizures for a minimum of 12 months in accordance with the consensus statement from the International League Against Epilepsy (ILAE). Genetic effects on remission of seizures after starting treatment were analysed with and without adjustment for significant clinical prognostic factors, and the results from each cohort were combined using a fixed-effects meta-analysis. After quality control (QC), we analysed 889 newly treated epilepsy patients using 472 450 genotyped and 6.9 × 106 imputed single-nucleotide polymorphisms. Suggestive evidence for association (defined as Pmeta < 5.0 × 10−7) with remission of seizures after starting treatment was observed at three loci: 6p12.2 (rs492146, Pmeta = 2.1 × 10−7, OR[G] = 0.57), 9p23 (rs72700966, Pmeta = 3.1 × 10−7, OR[C] = 2.70) and 15q13.2 (rs143536437, Pmeta = 3.2 × 10−7, OR[C] = 1.92). Genes of biological interest at these loci include PTPRD and ARHGAP11B (encoding functions implicated in neuronal development) and GSTA4 (a phase II biotransformation enzyme). Pathway analysis using two independent methods implicated a number of pathways in the prognosis of epilepsy, including KEGG categories ‘calcium signaling pathway’ and ‘phosphatidylinositol signaling pathway’. Through a series of power curves, we conclude that it is unlikely any single common variant explains >4.4% of the variation in the outcome of newly treated epilepsy.
Plasma fibrinogen is an acute phase protein playing an important role in the blood coagulation cascade having strong associations with smoking, alcohol consumption and body mass index (BMI). Genome-wide association studies (GWAS) have identified a variety of gene regions associated with elevated plasma fibrinogen concentrations. However, little is yet known about how associations between environmental factors and fibrinogen might be modified by genetic variation. Therefore, we conducted large-scale meta-analyses of genome-wide interaction studies to identify possible interactions of genetic variants and smoking status, alcohol consumption or BMI on fibrinogen concentration. The present study included 80,607 subjects of European ancestry from 22 studies. Genome-wide interaction analyses were performed separately in each study for about 2.6 million single nucleotide polymorphisms (SNPs) across the 22 autosomal chromosomes. For each SNP and risk factor, we performed a linear regression under an additive genetic model including an interaction term between SNP and risk factor. Interaction estimates were meta-analysed using a fixed-effects model. No genome-wide significant interaction with smoking status, alcohol consumption or BMI was observed in the meta-analyses. The most suggestive interaction was found for smoking and rs10519203, located in the LOC123688 region on chromosome 15, with a p value of 6.2×10−8. This large genome-wide interaction study including 80,607 participants found no strong evidence of interaction between genetic variants and smoking status, alcohol consumption or BMI on fibrinogen concentrations. Further studies are needed to yield deeper insight in the interplay between environmental factors and gene variants on the regulation of fibrinogen concentrations.
We previously used a single nucleotide polymorphism (SNP) in the CHRNA5-A3-B4 gene cluster associated with heaviness of smoking within smokers to confirm the causal effect of smoking in reducing body mass index (BMI) in a Mendelian randomisation analysis. While seeking to extend these findings in a larger sample we found that this SNP is associated with 0.74% lower body mass index (BMI) per minor allele in current smokers (95% CI -0.97 to -0.51, P = 2.00×10−10), but also unexpectedly found that it was associated with 0.35% higher BMI in never smokers (95% CI +0.18 to +0.52, P = 6.38×10−5). An interaction test confirmed that these estimates differed from each other (P = 4.95×10−13). This difference in effects suggests the variant influences BMI both via pathways unrelated to smoking, and via the weight-reducing effects of smoking. It would therefore be essentially undetectable in an unstratified genome-wide association study of BMI, given the opposite association with BMI in never and current smokers. This demonstrates that novel associations may be obscured by hidden population sub-structure. Stratification on well-characterized environmental factors known to impact on health outcomes may therefore reveal novel genetic associations.
We found that a single nucleotide polymorphism in the CHRNA5-A3-B4 gene cluster, which is known to influence smoking heaviness, is associated with lower body mass index (BMI) in current smokers, but higher BMI in never smokers. This difference in effects suggests that the variant influences BMI both via pathways other than smoking, and via the weight-reducing effects of smoking, in opposite directions. The overall effect on BMI would therefore be undetectable in an unstratified genome-wide association study, indicating that novel associations may be obscured by hidden population sub-structure.
Inherited alleles account for most of the genetic risk for schizophrenia. However, new (de novo) mutations, in the form of large chromosomal copy number changes, occur in a small fraction of cases and disproportionally disrupt genes encoding postsynaptic proteins. Here, we show that small de novo mutations, affecting one or a few nucleotides, are overrepresented among glutamatergic postsynaptic proteins comprising activity-regulated cytoskeleton-associated protein (ARC) and N-methyl-D-aspartate receptor (NMDAR) complexes. Mutations are additionally enriched in proteins that interact with these complexes to modulate synaptic strength, namely proteins regulating actin filament dynamics and those whose mRNAs are targets of fragile X mental retardation protein (FMRP). Genes affected by mutations in schizophrenia overlap those mutated in autism and intellectual disability, as do mutation-enriched synaptic pathways. Aligning our findings with a parallel case-control study, we demonstrate reproducible insights into aetiological mechanisms for schizophrenia and reveal pathophysiology shared with other neurodevelopmental disorders.
Genome-wide association studies have identified several genetic variants associated with coronary heart disease (CHD). The aim of this study was to evaluate the genetic risk discrimination and reclassification and apply the results for a two-stage population risk screening strategy for CHD.
Approach and Results
We genotyped 28 genetic variants in 24 124 participants in four Finnish population-based, prospective cohorts (recruitment years 1992-2002). We constructed a multi-locus genetic risk score and evaluated its association with incident cardiovascular disease events. During the median follow-up time of 12 years (IQR 8.75–15.25 years), we observed 1093 CHD, 1552 cardiovascular disease (CVD) and 731 acute coronary syndrome (ACS) events. Adding genetic information to conventional risk factors and family history improved risk discrimination of CHD (C-index 0.856 vs. 0.851, P=0.0002) and other end points (CVD: C-index 0.840 vs. 0.837, P=0.0004; ACS: C-index 0.859 vs. 0.855, P=0.001). In a standard population of 100 000 individuals, additional genetic screening of subjects at intermediate risk for CHD would reclassify 2144 (12%) subjects into high risk category. Statin allocation for these subjects is estimated to prevent 135 CHD cases over 14 years. Similar results were obtained by external validation, where the effects were estimated from a training dataset and applied for a test dataset.
Genetic risk score improves risk prediction of CHD and helps to identify individuals at high risk for the first CHD event. Genetic screening for individuals at intermediate cardiovascular risk could help to prevent future cases through better targeting of statins.
Cardiovascular genomics; Genetic association; Genetic epidemiology; Risk factor; Risk prediction
To investigate whether associations of smoking with depression and anxiety are likely to be causal, using a Mendelian randomisation approach.
Mendelian randomisation meta-analyses using a genetic variant (rs16969968/rs1051730) as a proxy for smoking heaviness, and observational meta-analyses of the associations of smoking status and smoking heaviness with depression, anxiety and psychological distress.
Current, former and never smokers of European ancestry aged ≥16 years from 25 studies in the Consortium for Causal Analysis Research in Tobacco and Alcohol (CARTA).
Primary outcome measures
Binary definitions of depression, anxiety and psychological distress assessed by clinical interview, symptom scales or self-reported recall of clinician diagnosis.
The analytic sample included up to 58 176 never smokers, 37 428 former smokers and 32 028 current smokers (total N=127 632). In observational analyses, current smokers had 1.85 times greater odds of depression (95% CI 1.65 to 2.07), 1.71 times greater odds of anxiety (95% CI 1.54 to 1.90) and 1.69 times greater odds of psychological distress (95% CI 1.56 to 1.83) than never smokers. Former smokers also had greater odds of depression, anxiety and psychological distress than never smokers. There was evidence for positive associations of smoking heaviness with depression, anxiety and psychological distress (ORs per cigarette per day: 1.03 (95% CI 1.02 to 1.04), 1.03 (95% CI 1.02 to 1.04) and 1.02 (95% CI 1.02 to 1.03) respectively). In Mendelian randomisation analyses, there was no strong evidence that the minor allele of rs16969968/rs1051730 was associated with depression (OR=1.00, 95% CI 0.95 to 1.05), anxiety (OR=1.02, 95% CI 0.97 to 1.07) or psychological distress (OR=1.02, 95% CI 0.98 to 1.06) in current smokers. Results were similar for former smokers.
Findings from Mendelian randomisation analyses do not support a causal role of smoking heaviness in the development of depression and anxiety.
Mendelian randomisation; Smoking; Depression; Anxiety
Estimates of the heritability of plasma fibrinogen concentration, an established predictor of cardiovascular disease (CVD), range from 34 to 50%. Genetic variants so far identified by genome-wide association (GWA) studies only explain a small proportion (< 2%) of its variation.
Methods and Results
We conducted a meta-analysis of 28 GWA studies, including more than 90,000 subjects of European ancestry, the first GWA meta-analysis of fibrinogen levels in 7 African Americans studies totaling 8,289 samples, and a GWA study in Hispanic-Americans totaling 1,366 samples. Evaluation for association of SNPs with clinical outcomes included a total of 40,695 cases and 85,582 controls for coronary artery disease (CAD), 4,752 cases and 24,030 controls for stroke, and 3,208 cases and 46,167 controls for venous thromboembolism (VTE). Overall, we identified 24 genome-wide significant (P<5×10−8) independent signals in 23 loci, including 15 novel associations, together accounting for 3.7% of plasma fibrinogen variation. Gene-set enrichment analysis highlighted key roles in fibrinogen regulation for the three structural fibrinogen genes and pathways related to inflammation, adipocytokines and thyrotrophin-releasing hormone signaling. Whereas lead SNPs in a few loci were significantly associated with CAD, the combined effect of all 24 fibrinogen-associated lead SNPs was not significant for CAD, stroke or VTE.
We identify 23 robustly associated fibrinogen loci, 15 of which are new. Clinical outcome analysis of these loci does not support a causal relationship between circulating levels of fibrinogen and CAD, stroke or VTE.
Fibrinogen; cardiovascular disease; genome-wide association study
It has long been recognized that generalized deficits in cognitive ability represent a core component of schizophrenia, evident prior to full illness onset and independent of medication. The possibility of genetic overlap between risk for schizophrenia and cognitive phenotypes has been suggested by the presence of cognitive deficits in first-degree relatives of patients with schizophrenia; however, until recently, molecular genetic approaches to test this overlap have been lacking. Within the last few years, large-scale genome-wide association studies (GWAS) of schizophrenia have demonstrated that a substantial proportion of the heritability of the disorder is explained by a polygenic component consisting of many common SNPs of extremely small effect. Similar results have been reported in GWAS of general cognitive ability. The primary aim of the present study is to provide the first molecular genetic test of the classic endophenotype hypothesis, which states that alleles associated with reduced cognitive ability should also serve to increase risk for schizophrenia. We tested the endophenotype hypothesis by applying polygenic SNP scores derived from a large-scale cognitive GWAS meta-analysis (~5000 individuals from 9 non-clinical cohorts comprising the COGENT consortium) to four schizophrenia case-control cohorts. As predicted, cases had significantly lower cognitive polygenic scores compared to controls. In parallel, polygenic risk scores for schizophrenia were associated with lower general cognitive ability. Additionally, using our large cognitive meta-analytic dataset, we identified nominally significant cognitive associations for several SNPs that have previously been robustly associated with schizophrenia susceptibility. Results provide molecular confirmation of the genetic overlap between schizophrenia and general cognitive ability, and may provide additional insight into pathophysiology of the disorder.
GWAS; schizophrenia; general cognitive ability; polygenic; endophenotype
Exome sequencing studies in complex diseases are challenged by the allelic heterogeneity, large number and modest effect sizes of associated variants on disease risk and the presence of large numbers of neutral variants, even in phenotypically relevant genes. Isolated populations with recent bottlenecks offer advantages for studying rare variants in complex diseases as they have deleterious variants that are present at higher frequencies as well as a substantial reduction in rare neutral variation. To explore the potential of the Finnish founder population for studying low-frequency (0.5–5%) variants in complex diseases, we compared exome sequence data on 3,000 Finns to the same number of non-Finnish Europeans and discovered that, despite having fewer variable sites overall, the average Finn has more low-frequency loss-of-function variants and complete gene knockouts. We then used several well-characterized Finnish population cohorts to study the phenotypic effects of 83 enriched loss-of-function variants across 60 phenotypes in 36,262 Finns. Using a deep set of quantitative traits collected on these cohorts, we show 5 associations (p<5×10−8) including splice variants in LPA that lowered plasma lipoprotein(a) levels (P = 1.5×10−117). Through accessing the national medical records of these participants, we evaluate the LPA finding via Mendelian randomization and confirm that these splice variants confer protection from cardiovascular disease (OR = 0.84, P = 3×10−4), demonstrating for the first time the correlation between very low levels of LPA in humans with potential therapeutic implications for cardiovascular diseases. More generally, this study articulates substantial advantages for studying the role of rare variation in complex phenotypes in founder populations like the Finns and by combining a unique population genetic history with data from large population cohorts and centralized research access to National Health Registers.
We explored the coding regions of 3,000 Finnish individuals with 3,000 non-Finnish Europeans (NFEs) using whole-exome sequence data, in order to understand how an individual from a bottlenecked population might differ from an individual from an out-bred population. We provide empirical evidence that there are more rare and low-frequency deleterious alleles in Finns compared to NFEs, such that an average Finn has almost twice as many low-frequency complete knockouts of a gene. As such, we hypothesized that some of these low-frequency loss-of-function variants might have important medical consequences in humans and genotyped 83 of these variants in 36,000 Finns. In doing so, we discovered that completely knocking out the TSFM gene might result in inviability or a very severe phenotype in humans and that knocking out the LPA gene might confer protection against coronary heart diseases, suggesting that LPA is likely to be a good potential therapeutic target.
Epilepsy is highly heritable, but its genetic architecture is poorly understood. Speed et al. estimate the number of susceptibility loci, show that common variants account for the majority of heritability, and demonstrate that epilepsy consists of genetically distinct subtypes. They conclude that gene-based prediction models may have clinical utility in first-seizure settings.
Epilepsy is a disease with substantial missing heritability; despite its high genetic component, genetic association studies have had limited success detecting common variants which influence susceptibility. In this paper, we reassess the role of common variants on epilepsy using extensions of heritability analysis. Our data set consists of 1258 UK patients with epilepsy, of which 958 have focal epilepsy, and 5129 population control subjects, with genotypes recorded for over 4 million common single nucleotide polymorphisms. Firstly, we show that on the liability scale, common variants collectively explain at least 26% (standard deviation 5%) of phenotypic variation for all epilepsy and 27% (standard deviation 5%) for focal epilepsy. Secondly we provide a new method for estimating the number of causal variants for complex traits; when applied to epilepsy, our most optimistic estimate suggests that at least 400 variants influence disease susceptibility, with potentially many thousands. Thirdly, we use bivariate analysis to assess how similar the genetic architecture of focal epilepsy is to that of non-focal epilepsy; we demonstrate both significant differences (P = 0.004) and significant similarities (P = 0.01) between the two subtypes, indicating that although the clinical definition of focal epilepsy does identify a genetically distinct epilepsy subtype, there is also scope to improve the classification of epilepsy by incorporating genotypic information. Lastly, we investigate the potential value in using genetic data to diagnose epilepsy following a single epileptic seizure; we find that a prediction model explaining 10% of phenotypic variation could have clinical utility for deciding which single-seizure individuals are likely to benefit from immediate anti-epileptic drug therapy.
epilepsy; association studies; heritability analysis; complex trait prediction
Variation in plasma levels of cortisol, an essential hormone in the stress response, is associated in population-based studies with cardio-metabolic, inflammatory and neuro-cognitive traits and diseases. Heritability of plasma cortisol is estimated at 30–60% but no common genetic contribution has been identified. The CORtisol NETwork (CORNET) consortium undertook genome wide association meta-analysis for plasma cortisol in 12,597 Caucasian participants, replicated in 2,795 participants. The results indicate that <1% of variance in plasma cortisol is accounted for by genetic variation in a single region of chromosome 14. This locus spans SERPINA6, encoding corticosteroid binding globulin (CBG, the major cortisol-binding protein in plasma), and SERPINA1, encoding α1-antitrypsin (which inhibits cleavage of the reactive centre loop that releases cortisol from CBG). Three partially independent signals were identified within the region, represented by common SNPs; detailed biochemical investigation in a nested sub-cohort showed all these SNPs were associated with variation in total cortisol binding activity in plasma, but some variants influenced total CBG concentrations while the top hit (rs12589136) influenced the immunoreactivity of the reactive centre loop of CBG. Exome chip and 1000 Genomes imputation analysis of this locus in the CROATIA-Korcula cohort identified missense mutations in SERPINA6 and SERPINA1 that did not account for the effects of common variants. These findings reveal a novel common genetic source of variation in binding of cortisol by CBG, and reinforce the key role of CBG in determining plasma cortisol levels. In turn this genetic variation may contribute to cortisol-associated degenerative diseases.
Cortisol is a steroid hormone from the adrenal glands that is essential in the response to stress. Most cortisol in blood is bound to corticosteroid binding globulin (CBG). Diseases causing cortisol deficiency (Addison's disease) or excess (Cushing's syndrome) are life-threatening. Variations in plasma cortisol have been associated with cardiovascular and psychiatric diseases and their risk factors. To dissect the genetic contribution to variation in plasma cortisol, we formed the CORtisol NETwork (CORNET) consortium and recruited collaborators with suitable samples from more than 15,000 people. The results reveal that the major genetic influence on plasma cortisol is mediated by variations in the binding capacity of CBG. This is determined by differences in the circulating concentrations of CBG and also in the immunoreactivity of its ‘reactive centre loop’, potentially influencing not only binding affinity for cortisol but also the stability of CBG and hence the tissue delivery of cortisol. These findings provide the first evidence for a common genetic effect on levels of this clinically important hormone, suggest that differences in CBG between individuals are biologically important, and pave the way for further research to dissect causality in the associations of plasma cortisol with common diseases.
The pubertal height growth spurt is a distinctive feature of childhood growth reflecting both the central onset of puberty and local growth factors. Although little is known about the underlying genetics, growth variability during puberty correlates with adult risks for hormone-dependent cancer and adverse cardiometabolic health. The only gene so far associated with pubertal height growth, LIN28B, pleiotropically influences childhood growth, puberty and cancer progression, pointing to shared underlying mechanisms. To discover genetic loci influencing pubertal height and growth and to place them in context of overall growth and maturation, we performed genome-wide association meta-analyses in 18 737 European samples utilizing longitudinally collected height measurements. We found significant associations (P < 1.67 × 10−8) at 10 loci, including LIN28B. Five loci associated with pubertal timing, all impacting multiple aspects of growth. In particular, a novel variant correlated with expression of MAPK3, and associated both with increased prepubertal growth and earlier menarche. Another variant near ADCY3-POMC associated with increased body mass index, reduced pubertal growth and earlier puberty. Whereas epidemiological correlations suggest that early puberty marks a pathway from rapid prepubertal growth to reduced final height and adult obesity, our study shows that individual loci associating with pubertal growth have variable longitudinal growth patterns that may differ from epidemiological observations. Overall, this study uncovers part of the complex genetic architecture linking pubertal height growth, the timing of puberty and childhood obesity and provides new information to pinpoint processes linking these traits.
Balanced chromosomal rearrangements occasionally have strong phenotypic effects, which may be useful in understanding pathobiology. However, conventional strategies for characterizing breakpoints are laborious and inaccurate. We present here a proband with a thoracic aortic aneurysm and a balanced translocation t(10;11)(q23.2;q24.2). Our purpose was to sequence the chromosomal breaks in this family to reveal a novel candidate gene for aneurysm.
Methods and results
Intracranial and thoracic aortic aneurysms appear to run in the family in an autosomal dominant manner: After exploring the family history, we observed that the proband’s two siblings both died from cerebral hemorrhage, and the proband’s parent and parent’s sibling died from aortic rupture. After application of a genome-wide paired-end DNA sequencing method for breakpoint mapping, we demonstrate that this translocation breaks intron 1 of a splicing isoform of Neurotrimin (NTM) at 11q25 in a previously implicated candidate region for intracranial (IAs) and aortic aneurysms (AAs) (OMIM 612161).
Our results demonstrate the feasibility of genome-wide paired-end sequencing for the characterization of balanced rearrangements and identification of candidate genes in patients with potentially disease-associated chromosome rearrangements. The family samples were gathered as a part of our recently launched National Registry of Reciprocal Balanced Translocations and Inversions in Finland (n=2575), and we believe that such a registry will be a powerful resource for the localization of chromosomal aberrations, which can bring insight into the etiology of related phenotypes.
Molecular genetics; Cardiovascular Medicine; Chromosomal; Clinical Genetics; Genome-wide
Mega- or meta-analytic studies (e.g. genome-wide association studies) are increasingly used in behavior genetics. An issue in such studies is that phenotypes are often measured by different instruments across study cohorts, requiring harmonization of measures so that more powerful fixed effect meta-analyses can be employed. Within the Genetics of Personality Consortium, we demonstrate for two clinically relevant personality traits, Neuroticism and Extraversion, how Item-Response Theory (IRT) can be applied to map item data from different inventories to the same underlying constructs. Personality item data were analyzed in >160,000 individuals from 23 cohorts across Europe, USA and Australia in which Neuroticism and Extraversion were assessed by nine different personality inventories. Results showed that harmonization was very successful for most personality inventories and moderately successful for some. Neuroticism and Extraversion inventories were largely measurement invariant across cohorts, in particular when comparing cohorts from countries where the same language is spoken. The IRT-based scores for Neuroticism and Extraversion were heritable (48 and 49 %, respectively, based on a meta-analysis of six twin cohorts, total N = 29,496 and 29,501 twin pairs, respectively) with a significant part of the heritability due to non-additive genetic factors. For Extraversion, these genetic factors qualitatively differ across sexes. We showed that our IRT method can lead to a large increase in sample size and therefore statistical power. The IRT approach may be applied to any mega- or meta-analytic study in which item-based behavioral measures need to be harmonized.
Electronic supplementary material
The online version of this article (doi:10.1007/s10519-014-9654-x) contains supplementary material, which is available to authorized users.
Personality; Item-Response Theory; Measurement; Genome-wide association studies; Consortium; Meta-analysis
Previous studies have failed to identify mutations in the Wilson’s disease gene ATP7B in a significant number of clinically diagnosed cases. This has led to concerns about genetic heterogeneity for this condition but also suggested the presence of unusual mutational mechanisms. We now present our findings in 181 patients from the United Kingdom with clinically and biochemically confirmed Wilson’s disease. A total of 116 different ATP7B mutations were detected, 32 of which are novel. The overall mutation detection frequency was 98%. The likelihood of mutations in genes other than ATP7B causing a Wilson’s disease phenotype is therefore very low. We report the first cases with Wilson’s disease due to segmental uniparental isodisomy as well as three patients with three ATP7B mutations and three families with Wilson’s disease in two consecutive generations. We determined the genetic prevalence of Wilson’s disease in the United Kingdom by sequencing the entire coding region and adjacent splice sites of ATP7B in 1000 control subjects. The frequency of all single nucleotide variants with in silico evidence of pathogenicity (Class 1 variant) was 0.056 or 0.040 if only those single nucleotide variants that had previously been reported as mutations in patients with Wilson’s disease were included in the analysis (Class 2 variant). The frequency of heterozygote, putative or definite disease-associated ATP7B mutations was therefore considerably higher than the previously reported occurrence of 1:90 (or 0.011) for heterozygote ATP7B mutation carriers in the general population (P < 2.2 × 10-16 for Class 1 variants or P < 5 × 10-11 for Class 2 variants only). Subsequent exclusion of four Class 2 variants without additional in silico evidence of pathogenicity led to a further reduction of the mutation frequency to 0.024. Using this most conservative approach, the calculated frequency of individuals predicted to carry two mutant pathogenic ATP7B alleles is 1:7026 and thus still considerably higher than the typically reported prevalence of Wilson’s disease of 1:30 000 (P = 0.00093). Our study provides strong evidence for monogenic inheritance of Wilson’s disease. It also has major implications for ATP7B analysis in clinical practice, namely the need to consider unusual genetic mechanisms such as uniparental disomy or the possible presence of three ATP7B mutations. The marked discrepancy between the genetic prevalence and the number of clinically diagnosed cases of Wilson’s disease may be due to both reduced penetrance of ATP7B mutations and failure to diagnose patients with this eminently treatable disorder.
Wilson’s disease; ATP7B; genetic prevalence
Low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol, triglycerides, and total cholesterol are heritable, modifiable, risk factors for coronary artery disease. To identify new loci and refine known loci influencing these lipids, we examined 188,578 individuals using genome-wide and custom genotyping arrays. We identify and annotate 157 loci associated with lipid levels at P < 5×10−8, including 62 loci not previously associated with lipid levels in humans. Using dense genotyping in individuals of European, East Asian, South Asian, and African ancestry, we narrow association signals in 12 loci. We find that loci associated with blood lipids are often associated with cardiovascular and metabolic traits including coronary artery disease, type 2 diabetes, blood pressure, waist-hip ratio, and body mass index. Our results illustrate the value of genetic data from individuals of diverse ancestries and provide insights into biological mechanisms regulating blood lipids to guide future genetic, biological, and therapeutic research.
Triglycerides are transported in plasma by specific triglyceride-rich lipoproteins; in epidemiologic studies, increased triglyceride levels correlate with higher risk for coronary artery disease (CAD). However, it is unclear whether this association reflects causal processes. We used 185 common variants recently mapped for plasma lipids (P<5×10−8 for each) to examine the role of triglycerides on risk for CAD. First, we highlight loci associated with both low-density lipoprotein cholesterol (LDL-C) and triglycerides, and show that the direction and magnitude of both are factors in determining CAD risk. Second, we consider loci with only a strong magnitude of association with triglycerides and show that these loci are also associated with CAD. Finally, in a model accounting for effects on LDL-C and/or high-density lipoprotein cholesterol, a polymorphism's strength of effect on triglycerides is correlated with the magnitude of its effect on CAD risk. These results suggest that triglyceride-rich lipoproteins causally influence risk for CAD.
Inter-individual variation in mean leukocyte telomere length (LTL) is associated with cancer and several age-associated diseases. Here, in a genome-wide meta-analysis of 37,684 individuals with replication of selected variants in a further 10,739 individuals, we identified seven loci, including five novel loci, associated with mean LTL (P<5x10−8). Five of the loci contain genes (TERC, TERT, NAF1, OBFC1, RTEL1) that are known to be involved in telomere biology. Lead SNPs at two loci (TERC and TERT) associate with several cancers and other diseases, including idiopathic pulmonary fibrosis. Moreover, a genetic risk score analysis combining lead variants at all seven loci in 22,233 coronary artery disease cases and 64,762 controls showed an association of the alleles associated with shorter LTL with increased risk of CAD (21% (95% CI: 5–35%) per standard deviation in LTL, p=0.014). Our findings support a causal role of telomere length variation in some age-related diseases.
Implicating particular genes in the generation of complex brain and behavior phenotypes requires multiple lines of evidence. The rarity of most high impact genetic variants typically precludes the possibility of accruing statistical evidence that they are associated with a given trait. We show here that the enrichment of a rare Chromosome 22q11.22 deletion in a recently expanded Northern Finnish sub-isolate enables the detection of association between TOP3β and both schizophrenia and cognitive impairment. Biochemical analysis of TOP3β revealed that this topoisomerase is a component of cytosolic messenger ribonucleoproteins (mRNPs) and is catalytically active on RNA. The recruitment of TOP3β to mRNPs was independent of RNA cis-elements and was coupled to the co-recruitment of FMRP, the disease gene product in fragile X mental retardation syndrome (FXS). Thus, we uncover a novel role for TOP3β in mRNA metabolism and provide several lines of evidence implicating it in neurodevelopmental disorders.
Variants in the growth factor receptor-bound protein 10 (GRB10) gene were in a GWAS meta-analysis associated with reduced glucose-stimulated insulin secretion and increased risk of type 2 diabetes (T2D) if inherited from the father, but inexplicably reduced fasting glucose when inherited from the mother. GRB10 is a negative regulator of insulin signaling and imprinted in a parent-of-origin fashion in different tissues. GRB10 knock-down in human pancreatic islets showed reduced insulin and glucagon secretion, which together with changes in insulin sensitivity may explain the paradoxical reduction of glucose despite a decrease in insulin secretion. Together, these findings suggest that tissue-specific methylation and possibly imprinting of GRB10 can influence glucose metabolism and contribute to T2D pathogenesis. The data also emphasize the need in genetic studies to consider whether risk alleles are inherited from the mother or the father.
In this paper, we report the first large genome-wide association study in man for glucose-stimulated insulin secretion (GSIS) indices during an oral glucose tolerance test. We identify seven genetic loci and provide effects on GSIS for all previously reported glycemic traits and obesity genetic loci in a large-scale sample. We observe paradoxical effects of genetic variants in the growth factor receptor-bound protein 10 (GRB10) gene yielding both reduced GSIS and reduced fasting plasma glucose concentrations, specifically showing a parent-of-origin effect of GRB10 on lower fasting plasma glucose and enhanced insulin sensitivity for maternal and elevated glucose and decreased insulin sensitivity for paternal transmissions of the risk allele. We also observe tissue-specific differences in DNA methylation and allelic imbalance in expression of GRB10 in human pancreatic islets. We further disrupt GRB10 by shRNA in human islets, showing reduction of both insulin and glucagon expression and secretion. In conclusion, we provide evidence for complex regulation of GRB10 in human islets. Our data suggest that tissue-specific methylation and imprinting of GRB10 can influence glucose metabolism and contribute to T2D pathogenesis. The data also emphasize the need in genetic studies to consider whether risk alleles are inherited from the mother or the father.