We summarize the work done by the contributors to Group 13 at Genetic Analysis Workshop 17 (GAW17) and provide a synthesis of their data analyses. The Group 13 contributors used a variety of approaches to test associations of both rare variants and common single-nucleotide polymorphisms (SNPs) with the GAW17 simulated traits, implementing analytic methods that incorporate multiallelic genotypes and haplotypes. In addition to using a wide variety of statistical methods and approaches, the contributors exhibited a remarkable amount of flexibility and creativity in coding the variants and their genes and in evaluating their proposed approaches and methods. We describe and contrast their methods along three dimensions: (1) selection and coding of genetic entities for analysis, (2) method of analysis, and (3) evaluation of the results. The contributors consistently presented a strong rationale for using multiallelic analytic approaches. They indicated that power was likely to be increased by capturing the signals of multiple markers within genetic entities defined by sliding windows, haplotypes, genes, functional pathways, and the entire set of SNPs and rare variants taken in aggregate. Despite this variability, the methods were fairly consistent in their ability to identify two associated genes for each simulated trait. The first gene was selected for the largest number of causal alleles and the second for a high-frequency causal SNP. The presumed model of inheritance and choice of genetic entities are likely to have a strong effect on the outcomes of the analyses.
rare variants; sequence data; multiallelic data; Bayesian regression; penalized regression; tree-based clustering; pathway analysis; haplotypes
Non-human primates provide genetic model systems biologically intermediate between humans and other mammalian model organisms. Populations of Caribbean vervet monkeys (Chlorocebus aethiops sabaeus) are genetically homogeneous and large enough to permit well-powered genetic mapping studies of quantitative traits relevant to human health, including expression quantitative trait loci (eQTL). Previous transcriptome-wide investigation in an extended vervet pedigree identified 29 heritable transcripts for which levels of expression in peripheral blood correlate strongly with expression levels in the brain. Quantitative trait linkage analysis using 261 microsatellite markers identified significant (n = 8) and suggestive (n = 4) linkages for 12 of these transcripts, including both cis- and trans-eQTL. Seven transcripts, located on different chromosomes, showed maximum linkage to markers in a single region of vervet chromosome 9; this observation suggests the possibility of a master trans-regulator locus in this region. For one cis-eQTL (at B3GALTL, beta-1,3-glucosyltransferase), we conducted follow-up single nucleotide polymorphism genotyping and fine-scale association analysis in a sample of unrelated Caribbean vervets, localizing this eQTL to a region of <200 kb. These results suggest the value of pedigree and population samples of the Caribbean vervet for linkage and association mapping studies of quantitative traits. The imminent whole genome sequencing of many of these vervet samples will enhance the power of such investigations by providing a comprehensive catalog of genetic variation.
The identification of autism susceptibility genes has been hampered by phenotypic heterogeneity of autism, among other factors. However, the use of endophenotypes has shown preliminary success in reducing heterogeneity and identifying potential autism-related susceptibility regions. To further explore the utility of using language related endophenotypes, we performed linkage analysis on multiplex autism families stratified according to delayed expressive speech and also assessed the extent to which parental phenotype information would aid in identifying regions of linkage. A whole genome scan using a multipoint nonparametric linkage approach was performed in 133 families, stratifying the sample by phrase speech delay and word delay. None of the regions reached suggested genome-wide or replication significance thresholds. However, several loci on chromosomes 1, 2, 4, 6, 7, 8, 9, 10, 12, 15, and 19 yielded nominally higher linkage signals in the delayed groups. The results did not support reported linkage findings for loci on chromosomes 7 or 13 that were a result of stratification based on the language delay endophenotype. In addition, inclusion of information on parental history of language delay did not appreciably affect the linkage results. The nominal increase in NPL scores across several regions using language delay endophenotypes for stratification suggests that this strategy may be useful in attenuating heterogeneity. However, the inconsistencies in regions identified across studies highlight the importance of increasing sample sizes to provide adequate power to test replications in independent samples.
Autism; linkage; endophenotypes; language; AGRE
The Xq28 region containing IRAK1 and MECP2 has been identified as a risk locus for systemic lupus erythematosus (SLE) in previous genetic association studies. However, due to the strong linkage disequilibrium between IRAK1 and MECP2, it remains unclear which gene is affected by the underlying causal variant(s) conferring risk of SLE.
We fine-mapped ≥136 SNPs in a ~227kb region on Xq28, containing IRAK1, MECP2 and 7 adjacent genes (L1CAM, AVPR2, ARHGAP4, NAA10, RENBP, HCFC1 and TMEM187), for association with SLE in 15,783 case-control subjects derived from 4 different ancestral groups.
Multiple SNPs showed strong association with SLE in European Americans, Asians and Hispanics at P<5×10−8 with consistent association in subjects with African ancestry. Of these, 6 SNPs located in the TMEM187-IRAK1-MECP2 region captured the underlying causal variant(s) residing in a common risk haplotype shared by all 4 ancestral groups. Among them, rs1059702 best explained the Xq28 association signals in conditional testings and exhibited the strongest P value in trans-ancestral meta-analysis (Pmeta=1.3×10−27, OR=1.43), and thus was considered to be the most-likely causal variant. The risk allele of rs1059702 results in the amino acid substitution S196F in IRAK1 and had previously been shown to increase NF-κB activity in vitro. We also found that the homozygous risk genotype of rs1059702 was associated with lower mRNA levels of MECP2, but not IRAK1, in SLE patients (P=0.0012) and healthy controls (P=0.0064).
These data suggest contributions of both IRAK1 and MECP2 to SLE susceptibility.
Systemic Lupus Erythematosus; Gene Polymorphism; Xq28; IRAK1; MECP2
Alternation of synaptic homeostasis is a biological process whose disruption might predispose children to autism spectrum disorders (ASD). Calcium channel genes (CCG) contribute to modulating neuronal function and evidence implicating CCG in ASD has been accumulating. We conducted a targeted association analysis of CCG using existing genome-wide association study (GWAS) data and imputation methods in a combined sample of parent/affected child trios from two ASD family collections to explore this hypothesis.
A total of 2,176 single-nucleotide polymorphisms (SNP) (703 genotyped and 1,473 imputed) covering the genes that encode the α1 subunit proteins of 10 calcium channels were tested for association with ASD in a combined sample of 2,781 parent/affected child trios from 543 multiplex Caucasian ASD families from the Autism Genetics Resource Exchange (AGRE) and 1,651 multiplex and simplex Caucasian ASD families from the Autism Genome Project (AGP). SNP imputation using IMPUTE2 and a combined reference panel from the HapMap3 and the 1,000 Genomes Project increased coverage density of the CCG. Family-based association was tested using the FBAT software which controls for population stratification and accounts for the non-independence of siblings within multiplex families. The level of significance for association was set at 2.3E-05, providing a Bonferroni correction for this targeted 10-gene panel.
Four SNPs in three CCGs were associated with ASD. One, rs10848653, is located in CACNA1C, a gene in which rare de novo mutations are responsible for Timothy syndrome, a Mendelian disorder that features ASD. Two others, rs198538 and rs198545, located in CACN1G, and a fourth, rs5750860, located in CACNA1I, are in CCGs that encode T-type calcium channels, genes with previous ASD associations.
These associations support a role for common CCG SNPs in ASD.
Autism spectrum disorders; Calcium channel genes; Common variants; Imputed SNPs; Association studies
High serum triglyceride (TG) levels is an established risk factor for coronary heart disease (CHD). Fat is stored in the form of TGs in human adipose tissue. We hypothesized that gene co-expression networks in human adipose tissue may be correlated with serum TG levels and help reveal novel genes involved in TG regulation.
Gene co-expression networks were constructed from two Finnish and one Mexican study sample using the blockwiseModules R function in Weighted Gene Co-expression Network Analysis (WGCNA). Overlap between TG-associated networks from each of the three study samples were calculated using a Fisher’s Exact test. Gene ontology was used to determine known pathways enriched in each TG-associated network.
We measured gene expression in adipose samples from two Finnish and one Mexican study sample. In each study sample, we observed a gene co-expression network that was significantly associated with serum TG levels. The TG modules observed in Finns and Mexicans significantly overlapped and shared 34 genes. Seven of the 34 genes (ARHGAP30, CCR1, CXCL16, FERMT3, HCST, RNASET2, SELPG) were identified as the key hub genes of all three TG modules. Furthermore, two of the 34 genes (ARHGAP9, LST1) reside in previous TG GWAS regions, suggesting them as the regional candidates underlying the GWAS signals.
This study presents a novel adipose gene co-expression network with 34 genes significantly correlated with serum TG across populations.
Mexicans; Finns; RNA sequencing; Triglycerides; Adipose tissue; Weighted gene co-expression network analysis
The molecular mechanisms underlying the changes in the nigrostriatal pathway in Parkinson’s disease (PD) are not completely understood. Here, we use mass spectrometry and microarrays to study the proteomic and transcriptomic changes in the striatum of two mouse models of PD, induced by the distinct neurotoxins 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine (MPTP) and methamphetamine (METH). Proteomic analyses resulted in the identification and relative quantification of 912 proteins with two or more unique peptides and 86 proteins with significant abundance changes following neurotoxin treatment. Similarly, microarray analyses revealed 181 genes with significant changes in mRNA, following neurotoxin treatment. The combined protein and gene list provides a clearer picture of the potential mechanisms underlying neurodegeneration observed in PD. Functional analysis of this combined list revealed a number of significant categories, including mitochondrial dysfunction, oxidative stress response, and apoptosis. These results constitute one of the largest descriptive data sets integrating protein and transcript changes for these neurotoxin models with many similar end point phenotypes but distinct mechanisms.
Parkinson’s disease; transcriptomics; proteomics; codon usage; miRNA; mouse model
This study was undertaken to determine whether there is familial aggregation of Hyperemesis Gravidarum making it a disease amenable to genetic study.
Cases with severe nausea and vomiting in a singleton pregnancy treated with intravenous hydration and unaffected friend controls completed a survey regarding family history.
Sisters of women with Hyperemesis Gravidarum have a significantly increased risk of having Hyperemesis Gravidarum themselves (OR=17.3, p=0.005). Cases have a significantly increased risk of having a mother with severe nausea and vomiting; 33% of cases reported an affected mother compared to 7.7% of controls (p<.0001). Cases reported a similar frequency of affected second-degree maternal and paternal relatives (18% maternal lineage, 23% paternal lineage).
There is familial aggregation of Hyperemesis Gravidarum. This study provides strong evidence for a genetic component to hyperemesis gravidarum. Identification of the predisposing gene(s) may determine the cause of this poorly understood disease of pregnancy.
Familial Aggregation; Genetic; Hyperemesis Gravidarum; Nausea; Pregnancy
The genetic association of interferon regulatory factor 5 (IRF5) with systemic lupus erythematosus (SLE) susceptibility has been convincingly established. To gain understanding of the effect of IRF5 variation in individuals without SLE, a study was undertaken to examine whether such genetic variation predisposes to activation of the interferon α (IFNα) pathway.
Using a computer simulated approach, 14 single nucleotide polymorphisms (SNPs) and haplotypes of IRF5 were tested for association with mRNA expression levels of IRF5, IFNα and IFN-inducible genes and chemokines in lymphoblastoid cell lines (LCLs) from individuals of European (CEU), Han Chinese (CHB), Japanese (JPT) and Yoruba Nigerian (YRI) backgrounds. IFN-inducible gene expression was assessed in LCLs from children with SLE in the presence and absence of IFNα stimulation.
The major alleles of IRF5 rs13242262 and rs2280714 were associated with increased IRF5 mRNA expression levels in the CEU, CHB+JPT and YRI samples. The minor allele of IRF5 rs10488631 was associated with increased IRF5, IFNα and IFN-inducible chemokine expression in CEU (pc=0.0005, 0.01 and 0.04, respectively). A haplotype containing these risk alleles of rs13242262, rs10488631 and rs2280714 was associated with increased IRF5, IFNα and IFN-inducible chemokine expression in CEU LCLs. In vitro studies showed specific activation of IFN-inducible genes in LCLs by IFNα.
SNPs of IRF5 in healthy individuals of a number of ethnic groups were associated with increased mRNA expression of IRF5. In European-derived individuals, an IRF5 haplotype was associated with increased IRF5, IFNα and IFN-inducible chemokine expression. Identifying individuals genetically predisposed to increased IFN-inducible gene and chemokine expression may allow early detection of risk for SLE.
Chromosome 17q11-q21 is a region of the genome likely to harbor susceptibility to autism (MIM) based on prior evidence of linkage to the disorder. This linkage is specific to multiplex pedigrees containing only male probands (MO) within the Autism Genetic Resource Exchange (AGRE). Previously, Stone et al.1 completed a high-density SNP association study of 13.7Mb within this interval, but common variant association was not sufficient to account for the linkage signal. Here we extend this SNP-based association study to complete the coverage of the 2 LOD support interval around the chromosome 17q linkage peak by testing the majority of common alleles in 284 MO trios.
Markers within an interval containing the gene CACNA1G were found to be associated with Autism Spectrum Disorder at a locally significant level (p = 1.9 × 10-5). While establishing CACNA1G as a novel candidate for autism, these alleles do not contribute sufficient genetic effect to explain the observed linkage, indicating there is substantial genetic heterogeneity despite the clear linkage signal. The region thus likely harbors a combination of multiple common and rare alleles contributing to the genetic risk. These data, along with previous studies of Chromosomes 5 and 7q3, suggest few if any major common risk alleles account for ASD risk under major linkage peaks in the AGRE sample. This provides important evidence for strategies to identify ASD genes, suggesting they should focus on identifying rare variants and common variants of small effect.
Autism; Autism Spectrum Disorder; Association; Chromosome 17q; CACNA1G
Although the Mexican population has a high predisposition to dyslipidemias and premature coronary artery disease, this population is underinvestigated for the genetic factors conferring the high susceptibility.
Methods and Results
First, we investigated apolipoprotein B (apoB) levels in Mexican extended families with familial combined hyperlipidemia (FCHL) using a two-step testing strategy. In the screening step, we screened 5,721 SNPs for linkage signals with apoB. In the test step, we analyzed the 130 SNPs residing in regions of suggestive linkage signals for association with apoB. We identified significant associations with two SNPs, rs1424032 (P=6.07×10−6) and rs1349411 (P=2.72×10−4), that surpassed the significance level for the number of tests performed in the test step (P<3.84×10−4). Second, these SNPs were tested for replication in Mexican hyperlipidemic cases-control samples. The same risk alleles as in the FCHL families were significantly associated (P<0.05) with apoB in the case-control samples. The rs1349411 resides near the apoB mRNA editing enzyme (APOBEC1) involved in the processing of APOB mRNA in the small intestine. The rs1424032 resides in a highly conserved non-coding region predicted to function as a regulatory element.
We identified two novel variants, rs1349411 and rs1424032, for serum apoB levels in Mexicans.
association; lipids; apolipoproteins; cardiovascular disease and Mexican population
Although epidemiological studies have demonstrated an increased predisposition to low HDL cholesterol (HDL-C) and high triglyceride (TG) levels in the Mexican population, Mexicans have not been included in any of the previously reported genome-wide association studies (GWAS) for lipids.
We investigated six SNPs associated with TGs, seven with HDL-C and one with both TGs and HDL-C in recent Caucasian GWAS in Mexican familial combined hyperlipidemia families and hypertriglyceridemia case-control study samples. These variants were within or near the genes ABCA1, ANGPTL3, APOA5, APOB, CETP, GALNT2, GCKR, LCAT, LIPC, LPL (2), MMAB-MVK, TRIB1 and XKR6-AMAC1L2. We performed a combined analysis of the family-based and case-control studies (n=2,298) using the Z-method to combine statistics. Ten of the SNPs were nominally significant and five were significant after Bonferroni correction (P = 2.20 × 10-3 – 2.6 × 10-11) for the number of tests performed (APOA5, CETP, GCKR and GALNT2). Interestingly, our strongest signal was obtained for TGs with the minor allele of rs964184 (P=2.6 × 10-11) in the APOA1/C3/A4/A5 gene cluster region that is significantly more common in Mexicans (27%) than in Caucasians (12%).
It is important to confirm whether known loci have a consistent effect across ethnic groups. We show replication of five Caucasian GWAS lipid associations in Mexicans. The remaining loci will require a comprehensive investigation to exclude or verify their significance in Mexicans. We also demonstrate that rs964184 has a large effect (OR=1.74) and is more frequent in the Mexican population, and thus it may contribute to the high predisposition to dyslipidemias in Mexicans.
Lipids; High-density lipoprotein cholesterol; triglycerides; genome-wide association studies; replication; diverse populations; Mexicans; Single-nucleotide polymorphism
Deposits of abnormally hyperphosphorylated tau are a hallmark of several dementias, including Alzheimer disease (AD), and about 10% of familial frontotemporal dementia (FTD) cases are caused by mutations in the tau gene. As a known tau kinase, GSK3B is a promising candidate gene in the remaining cases of FTD and in AD, for which tau mutations have not been found.
To examine the promoter of GSK3B and all 12 exons, including the surrounding intronic sequence, in patients with FTD, patients with AD, and aged healthy subjects to identify single-nucleotide polymorphisms associated with disease.
Design, Setting, and Participants
Single-nucleotide polymorphism frequency was examined in a case-control cohort of 48 patients with probable AD, 102 patients with FTD, 38 patients with primary progressive aphasia, and 85 aged healthy subjects. Results were followed up in 2 independent AD family samples consisting of 437 multiplex families with AD (National Institute of Mental Health Genetics Initiative AD Study) or 150 sibships discordant for AD (Consortium on Alzheimer’s Genetics Study).
Several rare sequence variants in GSK3B were identified in the case-control study. An intronic polymorphism (IVS2−68G>A) occurred at more than twice the frequency among patients with FTD (10.8%) and patients with AD (14.6%) than in aged healthy subjects (4.1%). The polymorphism showed association with disease in both follow-up samples independently, although only the Consortium on Alzheimer’s Genetics sample showed the same direction of association as the case-control sample.
To our knowledge, this is the first evidence that a gene known to be involved in tau phosphorylation, GSK3B, is associated with risk for primary neurodegenerative dementias. This supports previous work in animal models suggesting that such genes are therapeutic targets.
It has been suggested that autism, like other complex genetic disorders, may benefit from the study of rare or Mendelian variants associated with syndromic or non-syndromic forms of the disease. However, there are few examples in which common variation in genes causing a Mendelian neuropsychiatric disorder has been shown to contribute to disease susceptibility in an allied common condition. Joubert syndrome (JS) is a rare recessively inherited disorder, with mutations reported at several loci including the gene Abelson’s Helper Integration 1 (AHI1). A significant proportion of patients with JS, in some studies up to 40%, have been diagnosed with autism spectrum disorder (ASD) and several linkage studies in ASD have nominally implicated the region on 6q where AHI1 resides. To evaluate AHI1 in ASD, we performed a three-stage analysis of AHI1 as an a priori candidate gene for autism. Re-sequencing was first used to screen AHI1, followed by two subsequent association studies, one limited and one covering the gene more completely, in Autism Genetic Resource Exchange (AGRE) families. In stage 3, we found evidence of an associated haplotype in AHI1 with ASD after correction for multiple comparisons, in a region of the gene that had been previously associated with schizophrenia. These data suggest a role for AHI1 in common disorders affecting human cognition and behavior.
Stearoyl-coA desaturase 1 (SCD1) is the rate-limiting enzyme involved in the synthesis of monounsaturated fatty acids, and in mice SCD1 activity is associated with plasma triglyceride levels.
We used the fatty acid desaturation index (the plasma ratio of 18:1/18:0), as a marker of SCD1 activity to investigate the relationship of SCD1 to familial combined hyperlipidemia (FCHL).
Methods and Results
The fatty acid desaturation index was measured in 400 individuals from 18 extended FCHL pedigrees. FCHL-affected individuals exhibited increased SCD1 activity when compared to unrelated controls (P<0.0001). The fatty acid desaturation index was found to be highly heritable (h2 = 0.48, p= 2.2 × 10−11) in this study sample. QTL analysis in 346 sibling pairs from 18 FCHL families revealed suggestive linkage of the desaturation index to chromosomes 3p26.1-3p13 (z=2.7, P=0.003), containing the peroxisome proliferator-activated receptor gamma (PPARγ) gene, and 20p11.21-20q13.32 (z=1.7, P=0.04), containing the hepatocyte nuclear factor 4, alpha (HNF4α) gene. A specific haplotype of HNF4α was found to be associated with the desaturation index in these FCHL families (P=0.002).
Our results demonstrate that the fatty acid desaturation index is a highly heritable trait that is associated with the dyslipidemia observed in FCHL.
familial combined hyperlipidemia; genetics; Stearoyl-coA desaturase 1; peroxisome proliferator-activated receptor gamma; hepatocyte nuclear factor 4 alpha
The hypothesis that brain-derived neurotrophic factor (BDNF) is involved in the pathogenesis of major depression is supported by several research findings; however, genetic studies assessing the relationship between BDNF and psychiatric disorders have produced conflicting results.We examined the effect of a BDNF polymorphism on depression susceptibility in Mexican-Americans.The single nucleotide polymorphism (Val66Met), which has been shown to have functional and behavioral effects, was genotyped in 284 depressed participants and 331 controls, showing association with depression (P=0.005). Individuals homozygous for the major allele (GG) had an increased chance of being depressed (OR=1.7 95% CI 1.17-2.47).Our findings support the association of BDNF single nucleotide polymorphism rs6265 and depression, suggesting that this polymorphism may increase susceptibility to major depression in Mexican-Americans.
BDNF (brain derived neurotrophic factor); major depressive disorder; Mexican-Americans; pharmacogenetics; polymorphism; psychiatry; single nucleotide polymorphism; rs6265; Val66Met
The regulation of gene expression is an emerging area of investigation. Increased knowledge can deepen our understanding of the genetic contributions to variations in complex traits. The purpose of this study is to explore the feasibility of detecting regulatory elements of gene expression with multivariate analyses.
Peripheral blood lymphocyte expression levels of 30 genes on chromosome 5 and a single gene, DEAD, on chromosome 22 were analyzed in single-point variance-component linkage analyses in multiplex families to identify putative regulatory regions. To explore the possibility of regulatory regions having individual relationships with the expression levels of a single gene, we utilized stepwise regression. To explore the possibility of pleiotropy of a single regulatory locus for multiple genes, bivariate linkage analysis was applied.
Twenty-one loci were linked to five expression levels. The two most significant were for the known region on chromosome 22 (LOD = 4.62). On chromosome 5 a LOD of 4.57 was found for the gene leukocyte-derived arginine aminopeptidase (LRAP) with a single-nucleotide polymorphism (SNP) within 5 Mb. Both genes showed evidence of linkage to multiple SNPs. When 194 family members were treated as independent, stepwise regression identified fewer single-nucleotide polymorphisms with significant predictive values (p < 0.05), providing evidence for multiple regulatory regions of unequal effect. However, when corrections for non-independence were applied these results could no longer be detected.
The complex nature of gene regulation can be explored by linkage analysis with single-nucleotide polymorphisms followed by multivariate methods to explore co-regulation.
In our previous genome-wide scan of Finnish nuclear families, obesity was linked to chromosome Xq24. Here we analyzed this 15-Mb region by genotyping 9 microsatellite markers and 36 single nucleotide polymorphisms (SNPs) for 11 positional and functional candidate genes in an extended sample of 218 obese Finnish sibling pairs (sibpairs) (BMI > 30 kg/m2). Evidence of linkage emerged mainly from the obese male sibpairs, suggesting a gender-specific effect for the underlying gene. By constructing haplotypes among the obese male sibpairs, we restricted the region from 15 Mb to 4 Mb, between markers DXS8088 and DXS8067. Regional functional candidate genes were tested for association in an initial sample of 117 cases and 182 controls. Significant evidence was observed for association for an SNP in the 3′-untranslated region of the solute carrier family 6 member 14 (SLC6A14) gene (P = 0.0002) and for SNP haplotypes of the SLC6A14 gene (P = 0.0007–0.006). Furthermore, an independent replication study sample of 837 cases and 968 controls from Finland and Sweden also showed significant differences in allele frequencies between obese and non-obese individuals (P = 0.003). The SLC6A14 gene is an interesting novel candidate for obesity because it encodes an amino acid transporter, which potentially regulates tryptophan availability for serotonin synthesis and thus possibly affects appetite control.
Systemic lupus erythematosus (SLE) is an autoimmune disease characterized by various autoantibodies that recognize autoantigens displayed on the surface of cells undergoing apoptosis. The genetic contribution to SLE susceptibility has been widely recognized. We previously reported evidence for linkage to SLE of the human chromosome 1q41–q42 region and have now narrowed it from 15 to 5 cM in an extended sample using multipoint linkage analysis. Candidate genes within this region include (a) PARP, poly(ADP-ribose) polymerase, encoding a zinc-finger DNA-binding protein that is involved in DNA repair and apoptosis; (b) TGFB2, encoding a transforming growth factor that regulates cellular interactions and responses; and (c) HLX1, encoding a homeobox protein that may regulate T-cell development. Using a multiallelic, transmission-disequilibrium test (TDT), we found overall skewing of transmission of PARP alleles to affected offspring in 124 families (P = 0.00008), preferential transmission of a PARP allele to affected offspring (P = 0.0003), and lack of transmission to unaffected offspring (P = 0.004). Similar TDT analyses of TGFB2 and HLX1 polymorphisms yielded no evidence for association with SLE. These results suggest that PARP may be (or is close to) the susceptibility gene within the chromosome 1q41–q42 region linked to SLE.
We previously reported that the G allele of rs3853839 at 3′untranslated region (UTR) of Toll-like receptor 7 (TLR7) was associated with elevated transcript expression and increased risk for systemic lupus erythematosus (SLE) in 9,274 Eastern Asians [P = 6.5×10−10, odds ratio (OR) (95%CI) = 1.27 (1.17–1.36)]. Here, we conducted trans-ancestral fine-mapping in 13,339 subjects including European Americans, African Americans, and Amerindian/Hispanics and confirmed rs3853839 as the only variant within the TLR7-TLR8 region exhibiting consistent and independent association with SLE (Pmeta = 7.5×10−11, OR = 1.24 [1.18–1.34]). The risk G allele was associated with significantly increased levels of TLR7 mRNA and protein in peripheral blood mononuclear cells (PBMCs) and elevated luciferase activity of reporter gene in transfected cells. TLR7 3′UTR sequence bearing the non-risk C allele of rs3853839 matches a predicted binding site of microRNA-3148 (miR-3148), suggesting that this microRNA may regulate TLR7 expression. Indeed, miR-3148 levels were inversely correlated with TLR7 transcript levels in PBMCs from SLE patients and controls (R2 = 0.255, P = 0.001). Overexpression of miR-3148 in HEK-293 cells led to significant dose-dependent decrease in luciferase activity for construct driven by TLR7 3′UTR segment bearing the C allele (P = 0.0003). Compared with the G-allele construct, the C-allele construct showed greater than two-fold reduction of luciferase activity in the presence of miR-3148. Reduced modulation by miR-3148 conferred slower degradation of the risk G-allele containing TLR7 transcripts, resulting in elevated levels of gene products. These data establish rs3853839 of TLR7 as a shared risk variant of SLE in 22,613 subjects of Asian, EA, AA, and Amerindian/Hispanic ancestries (Pmeta = 2.0×10−19, OR = 1.25 [1.20–1.32]), which confers allelic effect on transcript turnover via differential binding to the epigenetic factor miR-3148.
Systemic lupus erythematosus (SLE) is a debilitating autoimmune disease contributed to by excessive innate immune activation involving toll-like receptors (TLRs, particularly TLR7/8/9) and type I interferon (IFN) signaling pathways. TLR7 responds against RNA–containing nuclear antigens and activates IFN-α pathway, playing a pivotal role in the development of SLE. While a genomic duplication of Tlr7 promotes lupus-like disease in the Y-linked autoimmune accelerator (Yaa) murine model, the lack of common copy number variations at TLR7 in humans led us to identify a functional single nucleotide polymorphism (SNP), rs3853839 at 3′ UTR of the TLR7 gene, associated with SLE susceptibility in Eastern Asians. In this study, we fine-mapped the TLR7-TLR8 region and confirmed rs3853839 exhibiting the strongest association with SLE in European Americans, African Americans, and Amerindian/Hispanics. Individuals carrying the risk G allele of rs3853839 exhibited increased TLR7 expression at the both mRNA and protein level and decreased transcript degradation. MicroRNA-3148 (miR-3148) downregulated the expression of non-risk allele (C) containing transcripts preferentially, suggesting a likely mechanism for increased TLR7 levels in risk-allele carriers. This trans-ancestral mapping provides evidence for the global association with SLE risk at rs3853839, which resides in a microRNA–gene regulatory site affecting TLR7 expression.
Asymmetry is a prominent feature of human brains with important functional consequences. Many asymmetric traits show population bias, but little is known about the genetic and environmental sources contributing to inter-individual variance. Anatomic asymmetry has been observed in Old World monkeys, but the evidence for the direction and extent of asymmetry is equivocal and only one study has estimated the genetic contributions to inter-individual variance. In this study we characterize a range of qualitative and quantitative asymmetry measures in structural brain MRIs acquired from an extended pedigree of Old World vervet monkeys (n = 357), and implement variance component methods to estimate the proportion of trait variance attributable to genetic and environmental sources. Four of six asymmetry measures show pedigree-level bias and one of the traits has a significant heritability estimate of about 30%. We also found that environmental variables more significantly influence the width of the right compared to the left prefrontal lobe.
Data sharing in autism neuroimaging presents scientific, technical, and social obstacles. We outline the desiderata for a data-sharing scheme that combines imaging with other measures of phenotype and with genetics, defines requirements for comparability of derived data and recommendations for raw data, outlines a core protocol including multispectral structural and diffusion-tensor imaging and optional extensions, provides for the collection of prospective, confound-free normative data, and extends sharing and collaborative development not only to data but to the analytical tools and methods applied to these data. A theme in these requirements is the need to preserve creative approaches and risk-taking within individual laboratories at the same time as common standards are provided for these laboratories to build on.
Imaging; MRI; PET; Morphometry; Segmentation; Data sharing
The Abelson helper integration site 1 (AHI1) gene locus on chromosome 6q23 is among a group of candidate loci for schizophrenia susceptibility that were initially identified by linkage followed by linkage disequilibrium mapping, and subsequent replication of the association in an independent sample. Here, we present results of a replication study of AHI1 locus markers, previously implicated in schizophrenia, in a large European sample (in total 3907 affected and 7429 controls). Furthermore, we perform a meta-analysis of the implicated markers in 4496 affected and 18 920 controls. Both the replication study of new samples and the meta-analysis show evidence for significant overrepresentation of all tested alleles in patients compared with controls (meta-analysis; P = 8.2 × 10−5–1.7 × 10−3, common OR = 1.09–1.11). The region contains two genes, AHI1and C6orf217, and both genes—as well as the neighbouring phosphodiesterase 7B (PDE7B)—may be considered candidates for involvement in the genetic aetiology of schizophrenia.
Autism spectrum disorders (ASDs) represent a group of childhood neurodevelopmental and neuropsychiatric disorders characterized by deficits in verbal communication, impairment of social interaction, and restricted and repetitive patterns of interests and behaviour. To identify common genetic risk factors underlying ASDs, here we present the results of genome-wide association studies on a cohort of 780 families (3,101 subjects) with affected children, and a second cohort of 1,204 affected subjects and 6,491 control subjects, all of whom were of European ancestry. Six single nucleotide polymorphisms between cadherin 10 (CDH10) and cadherin 9 (CDH9)—two genes encoding neuronal cell-adhesion molecules—revealed strong association signals, with the most significant SNP being rs4307059 (P = 3.4 × 10−8, odds ratio = 1.19). These signals were replicated in two independent cohorts, with combined P values ranging from 7.4 × 10−8 to 2.1 × 10−10. Our results implicate neuronal cell-adhesion molecules in the pathogenesis of ASDs, and represent, to our knowledge, the first demonstration of genome-wide significant association of common variants with susceptibility to ASDs.