Rationale: Genetic variation plays a significant role in the etiology of sarcoidosis. However, only a small fraction of its heritability has been explained so far.
Objectives: To define further genetic risk loci for sarcoidosis, we used the Immunochip for a candidate gene association study of immune-associated loci.
Methods: Altogether the study population comprised over 19,000 individuals. In a two-stage design, 1,726 German sarcoidosis cases and 5,482 control subjects were genotyped for 128,705 single-nucleotide polymorphisms using the Illumina Immunochip for the screening step. The remaining 3,955 cases, 7,514 control subjects, and 684 parents of affected offspring were used for validation and replication of 44 candidate and two established risk single-nucleotide polymorphisms.
Measurements and Main Results: Four novel susceptibility loci were identified with genome-wide significance in the European case-control populations, located on chromosomes 12q24.12 (rs653178; ATXN2/SH2B3), 5q33.3 (rs4921492; IL12B), 4q24 (rs223498; MANBA/NFKB1), and 2q33.2 (rs6748088; FAM117B). We further defined three independent association signals in the HLA region with genome-wide significance, peaking in the BTNL2 promoter region (rs5007259), at HLA-B (rs4143332/HLA-B*0801) and at HLA-DPB1 (rs9277542), and found another novel independent signal near IL23R (rs12069782) on chromosome 1p31.3.
Conclusions: Functional predictions and protein network analyses suggest a prominent role of the drug-targetable IL23/Th17 signaling pathway in the genetic etiology of sarcoidosis. Our findings reveal a substantial genetic overlap of sarcoidosis with diverse immune-mediated inflammatory disorders, which could be of relevance for the clinical application of modern therapeutics
Immunochip; HLA; IL23; BTNL2; association
Neurosarcoidosis is a clinical subtype of sarcoidosis characterized by the presence of granulomas in the nervous system. Here, we report a highly significant association with a variant (rs75652600, P = 3.12 × 10−8, odds ratios = 4.34) within a zinc finger gene, ZNF592, from an imputation-based fine-mapping study of the chromosomal region 15q25 in African-Americans with neurosarcoidosis. We validate the association with ZNF592, a gene previously shown to cause cerebellar ataxia, in a cohort of European-Americans with neurosarcoidosis by uncovering low-frequency variants with a similar risk effect size (chr15:85309284, P = 0.0021, odds ratios = 5.36).
HLA-DRB1 is a sarcoidosis risk gene, and the *03:01 allele is strongly associated with disease resolution in European sarcoidosis cases. Whereas the HLA-DRB1 variation is associated with sarcoidosis susceptibility in African Americans, DRB1 risk alleles are not as well defined, and associations with disease resolution have not been studied. Associations between genotyped and imputed HLA-DRB1 alleles and disease susceptibility/resolution were evaluated in a sample of 1,277 African-American patients with sarcoidosis and 1,467 control subjects. In silico binding assays were performed to assess the functional significance of the associated alleles. Increased disease susceptibility was associated with the HLA-DRB1 alleles *12:01 (odds ratio [OR], 2.11; 95% confidence interval [CI], 1.65–2.69; P = 3.2 × 10−9) and *11:01 (OR, 1.69; 95% CI, 1.42–2.01; P = 3.0 × 10−9). The strongest protective association was found with *03:01 (OR, 0.56; 95% CI, 0.44–0.73; P = 1.0 × 10−5). The African-derived allele *03:02 was associated with decreased risk of persistent radiographic disease (OR, 0.52; 95% CI, 0.37–0.72; P = 1.3 × 10−4), a finding consistent across the three component studies comprising the analytic sample. The DRB1*03:01 association with disease persistence was dependent upon local ancestry, with carriers of at least one European allele at DRB1 at a decreased risk of persistent disease (OR, 0.36; 95% CI, 0.14–0.94; P = 0.037). Results of in silico binding analyses showed that DRB1*03:01 consistently demonstrated the highest binding affinities for six bacterial peptides previously found in sarcoidosis granulomas, whereas *12:01 displayed the lowest binding affinities. This study has identified DRB1*03:01 and *03:02 as novel alleles associated with disease susceptibility and course in African Americans. Further investigation of DRB1*03 alleles may uncover immunologic factors that favor sarcoidosis protection and resolution among African Americans.
HLA-DRB1 chains; molecular docking simulation; pulmonary sarcoidosis; SNPs; association studies; genetic
CD4+ T cells predominate in salivary gland (SG) inflammatory lesions in Sjögren’s syndrome (SS). However, their antigen specificity, degree of clonal expansion, and relationship to clinical disease features remain unknown. We used multiplex reverse-transcriptase PCR to amplify paired T cell receptor α (TCRα) and β transcripts of single CD4+CD45RA− T cells from SG and peripheral blood (PB) of 10 individuals with primary SS, 9 of whom shared the HLA DR3/DQ2 risk haplotype. TCRα and β sequences were obtained from a median of 91 SG and 107 PB cells per subject. The degree of clonal expansion and frequency of cells expressing two productively rearranged α genes were increased in SG versus PB. Expanded clones from SG exhibited complementary-determining region 3 (CDR3) sequence similarity both within and among subjects, suggesting antigenic selection and shared antigen recognition. CDR3 similarities were shared among expanded clones from individuals discordant for canonical Ro and La autoantibodies, suggesting recognition of alternative SG antigen(s). The extent of SG clonal expansion correlated with reduced saliva production and increased SG fibrosis, linking expanded SG T cells with glandular dysfunction. Knowledge of paired TCRα and β sequences enables further work toward identification of target antigens and development of novel therapies.
CD4+ T cells predominate in salivary gland (SG) inflammatory lesions in Sjögren’s syndrome (SS). However, their antigen specificity, degree of clonal expansion, and relationship to clinical disease features remain unknown. We used multiplex reverse-transcriptase PCR to amplify paired T cell receptor α (TCRα) and β transcripts of single CD4+CD45RA– T cells from SG and peripheral blood (PB) of 10 individuals with primary SS, 9 of whom shared the HLA DR3/DQ2 risk haplotype. TCRα and β sequences were obtained from a median of 91 SG and 107 PB cells per subject. The degree of clonal expansion and frequency of cells expressing two productively rearranged α genes were increased in SG versus PB. Expanded clones from SG exhibited complementary-determining region 3 (CDR3) sequence similarity both within and among subjects, suggesting antigenic selection and shared antigen recognition. CDR3 similarities were shared among expanded clones from individuals discordant for canonical Ro and La autoantibodies, suggesting recognition of alternative SG antigen(s). The extent of SG clonal expansion correlated with reduced saliva production and increased SG fibrosis, linking expanded SG T cells with glandular dysfunction. Knowledge of paired TCRα and β sequences enables further work toward identification of target antigens and development of novel therapies.
CD4+ T cell clonal expansions in Sjögren’s syndrome salivary glands correlate with impaired glandular function and share TCR sequence similarity indicative of antigen-driven selection.
To develop a simple systemic lupus erythematosus (SLE) severity index that requires knowledge of only American College of Rheumatology (ACR) criteria and subcriteria.
This study used demographic, mortality and medical records data of 1915 patients with lupus from the Lupus Family Registry and Repository. The data were randomly split (2:1 ratio) into independent training and validation sets. A logistic regression with ridge penalty was used to model the probability of being prescribed major immunosuppressive drugs—a surrogate indicator of lupus severity. ACR criteria and subcriteria were used as predictor variables in this model, and the resulting regression coefficient estimates obtained from the training data were used as item weightings to construct the severity index.
The resulting index was tested on the independent validation dataset and was found to have high predictive accuracy for immunosuppressive use and early mortality. The index was also found to be strongly correlated with a previously existing severity score for lupus. In addition, demographic factors known to influence lupus severity (eg, age of onset, gender and ethnicity) all showed robust associations with our severity index that were consistent with observed clinical trends.
This new index can be easily computed using ACR criteria, which may be among the most readily available data elements from patient medical records. This tool may be useful in lupus research, especially large dataset analyses to stratify patients by disease severity, an important prognostic indicator in SLE.
Lupus Severity; Severity Index; ACR criteria
Exploiting genotyping, DNA sequencing, imputation and trans-ancestral mapping, we used Bayesian and frequentist approaches to model the IRF5–TNPO3 locus association, now implicated in two immunotherapies and seven autoimmune diseases. Specifically, in systemic lupus erythematosus (SLE), we resolved separate associations in the IRF5 promoter (all ancestries) and with an extended European haplotype. We captured 3230 IRF5–TNPO3 high-quality, common variants across 5 ethnicities in 8395 SLE cases and 7367 controls. The genetic effect from the IRF5 promoter can be explained by any one of four variants in 5.7 kb (P-valuemeta = 6 × 10−49; OR = 1.38–1.97). The second genetic effect spanned an 85.5-kb, 24-variant haplotype that included the genes IRF5 and TNPO3 (P-valuesEU = 10−27–10−32, OR = 1.7–1.81). Many variants at the IRF5 locus with previously assigned biological function are not members of either final credible set of potential causal variants identified herein. In addition to the known biologically functional variants, we demonstrated that the risk allele of rs4728142, a variant in the promoter among the lowest frequentist probability and highest Bayesian posterior probability, was correlated with IRF5 expression and differentially binds the transcription factor ZBTB3. Our analytical strategy provides a novel framework for future studies aimed at dissecting etiological genetic effects. Finally, both SLE elements of the statistical model appear to operate in Sjögren's syndrome and systemic sclerosis whereas only the IRF5–TNPO3 gene-spanning haplotype is associated with primary biliary cirrhosis, demonstrating the nuance of similarity and difference in autoimmune disease risk mechanisms at IRF5–TNPO3.
Recent studies have implicated the role of differential co-expression or correlation structure in gene expression data to help explain phenotypic differences. However, few attempts have been made to characterize the function of variants based on their role in regulating differential co-expression. Here, we describe a statistical methodology that identifies pairs of transcripts that display differential correlation structure conditioned on genotypes of variants that regulate co-expression. Additionally, we present a user-friendly, computationally efficient tool, dcVar, that can be applied to expression quantitative trait loci (eQTL) or RNA-Seq datasets to infer differential co-expression variants (dcVars). We apply dcVar to the HapMap3 eQTL dataset and demonstrate the utility of this methodology at uncovering novel function of variants of interest with examples from a height genome-wide association and cancer drug resistance. We provide evidence that differential correlation structure is a valuable intermediate molecular phenotype for further characterizing the function of variants identified in GWAS and related studies.
eQTL; molecular phenotype; genome-wide association study; microarray gene expression; common variant; RNA-Seq
HPGD and SLCO2A1 genes encode components of the prostaglandin catabolic pathway, HPGD encoding the degradative enzyme 15-PGDH, and SLCO2A1 encoding the prostaglandin transporter PGT that brings substrate to 15-PGDH. HPGD null mice show increased PGE2, marked susceptibility to developing colon tumors, and resistance to colon tumor prevention by NSAIDs. But in humans, HPGD and SLCO2A1 mutations have only been associated with familial digital clubbing. We here characterize a family with digital clubbing and early onset colon neoplasia. Whole exome sequencing identified a heterozygous nonsense mutation (G104X) in the SLCO2A1 gene segregating in three males with digital clubbing. Two of these males further demonstrated notably early onset colon neoplasia, one with an early onset colon cancer and another with an early onset sessile serrated colon adenoma. Two females also carried the mutation, and both these women developed sessile serrated colon adenomas without any digital clubbing. Males with clubbing also showed marked elevations in the levels of urinary prostaglandin E2 metabolite, PGE-M; whereas, female mutation carriers were in the normal range. Furthermore, in the male proband urinary PGE-M remained markedly elevated during NSAID treatment with either celecoxib or sulindac. Thus, in this human kindred, a null SLCO2A1 allele mimics the phenotype of the related HPGD null mouse, with increased prostaglandin levels that cannot be normalized by NSAID therapy, plus with increased colon neoplasia. The development of early onset colon neoplasia in male and female human SLCO2A1 mutation carriers suggests that disordered prostaglandin catabolism can mediate inherited susceptibility to colon neoplasia in man.
Genome-wide association studies (GWAS) that draw samples from multiple studies with a mixture of relationship structures are becoming more common. Analytical methods exist for using mixed-sample data, but few methods have been proposed for the analysis of genotype-by-environment (G×E) interactions. Using GWAS data from a study of sarcoidosis susceptibility genes in related and unrelated African Americans, we explored the current analytic options for genotype association testing in studies using both unrelated and family-based designs. We propose a novel method—generalized least squares (GLX)—to estimate both SNP and G×E interaction effects for categorical environmental covariates and compared this method to generalized estimating equations (GEE), logistic regression, the Cochran–Armitage trend test, and the WQLS and MQLS methods. We used simulation to demonstrate that the GLX method reduces type I error under a variety of pedigree structures. We also demonstrate its superior power to detect SNP effects while offering computational advantages and comparable power to detect G×E interactions versus GEE. Using this method, we found two novel SNPs that demonstrate a significant genome-wide interaction with insecticide exposure—rs10499003 and rs7745248, located in the intronic and 3′ UTR regions of the FUT9 gene on chromosome 6q16.1.
GWAS; G×E; gene-by-environment; generalized least squares; mixed samples; sarcoidosis
Open source tools are needed to facilitate the construction, analysis, and visualization of gene-gene interaction networks for sequencing data. To address this need, we present Encore, an open source network analysis pipeline for GWAS and rare variant data. Encore constructs Genetic Association Interaction Networks or Epistasis Networks using two optional approaches: our previous information-theory method or a generalized linear model approach. Additionally, Encore includes multiple data filtering options, including Random Forest/Random Jungle for main effect enrichment and Evaporative Cooling and Relief-F filters for enrichment of interaction effects. Encore implements SNPrank network centrality for identifying susceptibility hubs (nodes containing a large amount of disease susceptibility information through the combination of multivariate main effects and multiple gene-gene interactions in the network), and it provides appropriate files for interactive visualization of a network using tools from our online Galaxy instance. We implemented these algorithms in C++ using OpenMP for shared-memory parallel analysis on a server or desktop. To demonstrate Encore’s utility in analysis of genetic sequencing data, we present an analysis of exome resequencing data from healthy individuals and those with Systemic Lupus Erythematous (SLE). Our results verify the importance of the previously associated SLE genes HLA-DRB and NCF2, and these two genes had the highest gene-gene interaction degrees among the susceptibility hubs. An additional 14 genes previously associated with SLE emerged in our epistasis network model of the exome data, and three novel candidate genes, ST8SIA4, CMTM4, and C2CD4B, were implicated in the model. In summary, we present a comprehensive tool for epistasis network analysis and the first such analysis of exome data from a genetic study of SLE.
epistasis network; machine learning; network analysis; network centrality; Systemic Lupus Erythematous
The expense of human leukocyte antigen (HLA) allele genotyping has motivated the development of imputation methods that use dense single nucleotide polymorphism (SNP) genotype data and the region’s haplotype structure, but the performance of these methods in admixed populations (such as African Americans) has not been adequately evaluated. We compared genotype-based—derived from both genome-wide genotyping and targeted sequencing—imputation results to existing allele data for HLA–DRB1, −DQB1, and –DPB1.
In European Americans, the newly-developed HLA Genotype Imputation with Attribute Bagging (HIBAG) method outperformed HLA*IMP:02. In African Americans, HLA*IMP:02 performed marginally better than HIBAG pre-built models, but HIBAG models constructed using a portion of our African American sample with both SNP genotyping and four-digit HLA class II allele typing had consistently higher accuracy than HLA*IMP:02. However, HIBAG was significantly less accurate in individuals heterozygous for local ancestry (p ≤0.04). Accuracy improved in models with equal numbers of African and European chromosomes. Variants added by targeted sequencing and SNP imputation further improved both imputation accuracy and the proportion of high quality calls.
Combining the HIBAG approach with local ancestry and dense variant data can produce highly-accurate HLA class II allele imputation in African Americans.
HLA; African American; Single nucleotide polymorphisms; Imputation; Admixture
Sjögren’s syndrome is a common autoimmune disease (~0.7% of European Americans) typically presenting as keratoconjunctivitis sicca and xerostomia. In addition to strong association within the HLA region at 6p21 (Pmeta=7.65×10−114), we establish associations with IRF5-TNPO3 (Pmeta=2.73×10−19), STAT4 (Pmeta=6.80×10−15), IL12A (Pmeta =1.17×10−10), FAM167A-BLK (Pmeta=4.97×10−10), DDX6-CXCR5 (Pmeta=1.10×10−8), and TNIP1 (Pmeta=3.30×10−8). Suggestive associations with Pmeta<5×10−5 were observed with 29 regions including TNFAIP3, PTTG1, PRDM1, DGKQ, FCGR2A, IRAK1BP1, ITSN2, and PHIP amongst others. These results highlight the importance of genes involved in both innate and adaptive immunity in Sjögren’s syndrome.
Sarcoidosis is a complex, multi-organ granulomatous disease with a likely genetic component. West African ancestry confers a higher risk for sarcoidosis than European ancestry. Admixture mapping provides the most direct method to locate genes that underlie such ethnic variation in disease risk. We sought to identify genetic risk variants within four previously-identified ancestry-associated regions—6p24.3–p12.1, 17p13.3–13.1, 2p13.3–q12.1, and 6q23.3–q25.2—in a sample of 2,727 African Americans. We used logistic regression fit by generalized estimating equations and the MIX score statistic to determine which variants within ancestry-associated regions were associated with risk and responsible for the admixture signal. Fine mapping was performed by imputation, based on a previous genome-wide association study; significant variants were validated by direct genotyping. Within the 6p24.3–p12.1 locus, the most significant ancestry-adjusted SNP was rs74318745 (p = 9.4*10−11), an intronic SNP within the HLA-DRA gene that did not solely explain the admixture signal, indicating the presence of more than a single risk variant within this well-established sarcoidosis risk region. The locus on chromosome 17p13.3–13.1 revealed a novel sarcoidosis risk SNP, rs6502976 (p = 9.5*10−6), within intron 5 of the gene X-linked Inhibitor of Apoptosis Associated Factor 1 (XAF1) that accounted for the majority of the admixture linkage signal. Immunohistochemical expression studies demonstrated lack of expression of XAF1 and a corresponding high level of expression of its downstream target, X-linked Inhibitor of Apoptosis (XIAP) in sarcoidosis granulomas. In conclusion, ancestry and association fine mapping revealed a novel sarcoidosis susceptibility gene, XAF1, which has not been identified by previous genome-wide association studies. Based on the known biology of the XIAP/XAF1 apoptosis pathway and the differential expression patterns of XAF1 and XIAP in sarcoidosis granulomas, we suggest that this pathway may play a role in the maintenance of sarcoidosis granulomas.
To compare the performance of the American-European Consensus Group (AECG) and the newly proposed American College of Rheumatology (ACR) classification criteria for Sjögren's syndrome in a well-characterized sicca cohort, given ongoing efforts to resolve discrepancies and weaknesses in the systems.
In a multidisciplinary clinic for the evaluation of sicca, we assessed features of salivary and lacrimal gland dysfunction and autoimmunity as defined by tests of both AECG and ACR criteria in 646 participants. Global gene expression profiles were compared in a subset of 180 participants.
Application of the AECG and ACR criteria resulted in classification of 279 and 268 participants with SS, respectively. Both criteria were met by 244 participants (81%). In 26 of the 35 AECG+/ACR- participants, the minor salivary gland biopsy focal score was ≥1 (74%), while 9 had positive anti-Ro/La (26%). There were 24 AECG-/ACR+ who met ACR criteria mainly due to differences in the scoring of corneal staining. All patients with SS, regardless of classification, had similar gene expression profiles, which were distinct from the healthy controls.
The two sets of classification criteria yield concordant results in the majority of cases and gene expression profiling suggests that patients meeting either set of criteria are more similar to other SS participants than to healthy controls. Thus, there is no clear evidence for increased value of the new ACR criteria over the old AECG criteria from the clinical or biological perspective. It is our contention, supported by this report, that improvements in diagnostic acumen will require a more fundamental understanding of the pathogenic mechanisms than is at present available.
Sjögren's syndrome; Classification; Diagnosis
Systemic lupus erythematosus (SLE) is an autoimmune disease characterized by autoantibody production and altered type I interferon expression. Genetic surveys and genome-wide association studies have identified more than 30 SLE susceptibility genes. One of these genes, TNIP1, encodes the ABIN1 protein. ABIN1 functions in the immune system by restricting the NF-κB signaling. In order to better understand the genetic factors that influence association with SLE in genes that regulate the NF-κB pathway, we analyzed a dense set of genetic markers spanning TNIP1 and TAX1BP1, as well as the TNIP1 homolog, TNIP2, in case-control sets of diverse ethnic origins.
We fine-mapped TNIP1, TNIP2, and TAX1BP1 in a total of 8372 SLE cases and 7492 healthy controls from European-ancestry, African-American, Hispanic, East Asian, and African-American Gullah populations. Levels of TNIP1 mRNA and ABIN1 protein were analyzed using quantitative RT-PCR and Western blotting, respectively, in EBV-transformed human B cell lines.
We found significant associations between genetic variants within TNIP1 and SLE but not in TNIP2 or TAX1BP1. After resequencing and imputation, we identified two independent risk haplotypes within TNIP1 in individuals of European-ancestry that were also present in African-American and Hispanic populations. These risk haplotypes produced lower levels of TNIP1 mRNA and ABIN1 protein suggesting they harbor hypomorphic functional variants that influence susceptibility to SLE by restricting ABIN1 expression.
Our results confirmed the association signals between SLE and TNIP1 variants in multiple populations and provide new insight into the mechanism by which TNIP1 variants may contribute to SLE pathogenesis.
While Sjögren’s syndrome (SS) is more common than related autoimmune disorders, such as systemic lupus erythematosus (SLE) and rheumatoid arthritis (RA), scientific and medical research in SS has lagged behind significantly. This is especially true in the field of SS genetics, where efforts to date have relied heavily on candidate gene approaches. Within the last decade, the advent of the genome-wide association (GWA) scan has altered our understanding of disease pathogenesis in hundreds of disorders through the successful identification of novel risk loci. With strong evidence for a genetic component in SS as evidenced by familial aggregation of SS as well as similarities between SS and SLE and RA, the application of GWA approaches would likely yield numerous novel risk loci in SS. Here we review the fundamental scientific principles employed in GWA scans as well as the limitations of this tool, and we discuss the application of GWA scans in determining genetic variants at play in complex disease. We also examine the successful application of GWA scans in SLE, which now has more than 40 confirmed risk loci, and consider the possibility for a similar trajectory of SS genetic discovery in the era of GWA scans. Ultimately, the GWA studies that will be performed in SS have the potential to identify a myriad of novel genetic loci that will allow scientists to begin filling in the gaps in our understanding of the SS pathogenesis.
genetics; Sjögren’s syndrome; genome-wide association
Recent advances in the field of genetics have dramatically changed our understanding of autoimmune disease. Candidate gene and, more recently, genome-wide association (GWA) studies have led to an explosion in the number of loci and pathways known to contribute to autoimmune phenotypes. Since the 1970s, researchers have known that several alleles in the MHC region play a role in the pathogenesis of many autoimmune diseases. More recent work has identified numerous risk loci involving both the innate and adaptive immune responses. However, much remains to be learned about the heritability of autoimmune conditions. Most regions found through GWA scans have yet to isolate the association to the causal allele(s) responsible for conferring disease risk. A role for rare variants (allele frequencies of <1%) has begun to emerge. Future research will use next generation sequencing (NGS) technology to comprehensively evaluate the human genome for risk variants. Whole transcriptome sequencing is now possible, which will provide much more detailed gene expression data. The dramatic drop in the cost and time required to sequence the entire human genome will ultimately make it possible for this technology to be used as a clinical diagnostic tool.
Genetics; Genomics; Genome-wide association study; Autoimmune disease
Systemic lupus erythematosus (SLE) is an autoimmune disease with diverse clinical manifestations characterized by the development of pathogenic autoantibodies manifesting in inflammation of target organs such as the kidneys, skin and joints. Genome-wide association studies have identified genetic variants in the UBE2L3 region that are associated with SLE in subjects of European and Asian ancestry. UBE2L3 encodes an ubiquitin-conjugating enzyme, UBCH7, involved in cell proliferation and immune function. In this study, we sought to further characterize the genetic association in the region of UBE2L3 and use molecular methods to determine the functional effect of the risk haplotype. We identified significant associations between variants in the region of UBE2L3 and SLE in individuals of European and Asian ancestry that exceeded a Bonferroni corrected threshold (P < 1 × 10−4). A single risk haplotype was observed in all associated populations. Individuals harboring the risk haplotype display a significant increase in both UBE2L3 mRNA expression (P = 0.0004) and UBCH7 protein expression (P = 0.0068). The results suggest that variants carried on the SLE associated UBE2L3 risk haplotype influence autoimmunity by modulating UBCH7 expression.
Systemic Lupus Erythematosus; UBE2L3; Multi Ethnic Association Study; UBCH7 Expression
Sarcoidosis is a systemic inflammatory disease characterized by the formation of granulomas in affected organs. Genome-wide association studies (GWASs) of this disease have been conducted only in European population. We present the first sarcoidosis GWAS in African Americans (AAs, 818 cases and 1,088 related controls) followed by replication in independent sets of AAs (455 cases and 557 controls) and European Americans (EAs, 442 cases and 2,284 controls). We evaluated >6 million SNPs either genotyped using the Illumina Omni1-Quad array or imputed from the 1000 Genomes Project data. We identified a novel sarcoidosis-associated locus, NOTCH4, that reached genome-wide significance in the combined AA samples (rs715299, PAA-meta = 6.51×10−10) and demonstrated the independence of this locus from others in the MHC region in the same sample. We replicated previous European GWAS associations within HLA-DRA, HLA-DRB5, HLA-DRB1, BTNL2, and ANXA11 in both our AA and EA datasets. We also confirmed significant associations to the previously reported HLA-C and HLA-B regions in the EA but not AA samples. We further identified suggestive associations with several other genes previously reported in lung or inflammatory diseases.
Systemic lupus erythematosus (SLE) is a complex autoimmune disease characterized by autoantibody production and organ damage. Lupus nephritis (LN) is one of the most severe manifestations of SLE. Multiple studies reported associations between renal diseases and variants in the non-muscle myosin heavy chain 9 (MYH9) and the neighboring apolipoprotein L 1 (APOL1) genes. We evaluated 167 variants spanning MYH9 for association with LN in a multiethnic sample. The two previously identified risk variants in APOL1 were also tested for association with LN in European-Americans (EAs) (N = 579) and African-Americans (AAs) (N = 407). Multiple peaks of association exceeding a Bonferroni corrected p-value of p < 2.03 × 10−3 were observed between LN and MYH9 in EAs (N=4620), with the most pronounced association at rs2157257 (p = 4.7 × 10−4; odds ratio [OR]=1.205). A modest effect with MYH9 was also detected in Gullah (rs8136069, p = 0.0019, OR = 2.304). No association between LN and MYH9 was found in AAs, Asians, Amerindians or Hispanics. This study provides the first investigation of MYH9 in LN in non-Africans and of APOL1 in LN in any population and presents novel insight into the potential role of MYH9 in LN in EAs.
MYH9; APOL1; lupus nephritis; systemic lupus erythematosus; multiethnic association study