In studies of case-parent trios, we define copy number variants (CNVs) in the offspring that differ from the parental copy numbers as de novo and of interest for their potential functional role in disease. Among the leading array-based methods for discovery of de novo CNVs in case-parent trios is the joint hidden Markov model (HMM) implemented in the PennCNV software. However, the computational demands of the joint HMM are substantial and the extent to which false positive identifications occur in case-parent trios has not been well described. We evaluate these issues in a study of oral cleft case-parent trios.
Our analysis of the oral cleft trios reveals that genomic waves represent a substantial source of false positive identifications in the joint HMM, despite a wave-correction implementation in PennCNV. In addition, the noise of low-level summaries of relative copy number (log R ratios) is strongly associated with batch and correlated with the frequency of de novo CNV calls. Exploiting the trio design, we propose a univariate statistic for relative copy number referred to as the minimum distance that can reduce technical variation from probe effects and genomic waves. We use circular binary segmentation to segment the minimum distance and maximum a posteriori estimation to infer de novo CNVs from the segmented genome. Compared to PennCNV on simulated data, MinimumDistance identifies fewer false positives on average and is comparable to PennCNV with respect to false negatives. Genomic waves contribute to discordance of PennCNV and MinimumDistance for high coverage de novo calls, while highly concordant calls on chromosome 22 were validated by quantitative PCR. Computationally, MinimumDistance provides a nearly 8-fold increase in speed relative to the joint HMM in a study of oral cleft trios.
Our results indicate that batch effects and genomic waves are important considerations for case-parent studies of de novo CNV, and that the minimum distance is an effective statistic for reducing technical variation contributing to false de novo discoveries. Coupled with segmentation and maximum a posteriori estimation, our algorithm compares favorably to the joint HMM with MinimumDistance being much faster.
Trios; Oral cleft; Copy number variants; de novo; High-throughput arrays; Segmentation; batch effects; Genomic waves
Clonal mosaicism for large chromosomal anomalies (duplications, deletions and uniparental disomy) was detected using SNP microarray data from over 50,000 subjects recruited for genome-wide association studies. This detection method requires a relatively high frequency of cells (>5–10%) with the same abnormal karyotype (presumably of clonal origin) in the presence of normal cells. The frequency of detectable clonal mosaicism in peripheral blood is low (<0.5%) from birth until 50 years of age, after which it rises rapidly to 2–3% in the elderly. Many of the mosaic anomalies are characteristic of those found in hematological cancers and identify common deleted regions that pinpoint the locations of genes previously associated with hematological cancers. Although only 3% of subjects with detectable clonal mosaicism had any record of hematological cancer prior to DNA sampling, those without a prior diagnosis have an estimated 10-fold higher risk of a subsequent hematological cancer (95% confidence interval = 6–18).
Two recent metaanalyses of genome-wide association studies conducted by the CHARGE and SpiroMeta consortia identified novel loci yielding evidence of association at or near genome-wide significance (GWS) with FEV1 and FEV1/FVC. We hypothesized that a subset of these markers would also be associated with chronic obstructive pulmonary disease (COPD) susceptibility. Thirty-two single-nucleotide polymorphisms (SNPs) in or near 17 genes in 11 previously identified GWS spirometric genomic regions were tested for association with COPD status in four COPD case-control study samples (NETT/NAS, the Norway case-control study, ECLIPSE, and the first 1,000 subjects in COPDGene; total sample size, 3,456 cases and 1,906 controls). In addition to testing the 32 spirometric GWS SNPs, we tested a dense panel of imputed HapMap2 SNP markers from the 17 genes located near the 32 GWS SNPs and in a set of 21 well studied COPD candidate genes. Of the previously identified GWS spirometric genomic regions, three loci harbored SNPs associated with COPD susceptibility at a 5% false discovery rate: the 4q24 locus including FLJ20184/INTS12/GSTCD/NPNT, the 6p21 locus including AGER and PPT2, and the 5q33 locus including ADAM19. In conclusion, markers previously associated at or near GWS with spirometric measures were tested for association with COPD status in data from four COPD case-control studies, and three loci showed evidence of association with COPD susceptibility at a 5% false discovery rate.
Cigarette smoking is a major risk factor for COPD and COPD severity. Previous genome-wide association studies (GWAS) have identified numerous single nucleotide polymorphisms (SNPs) associated with the number of cigarettes smoked per day (CPD) and a Dopamine Beta-Hydroxylase (DBH) locus associated with smoking cessation in multiple populations.
To identify SNPs associated with lifetime average and current CPD, age at smoking initiation, and smoking cessation in COPD subjects.
GWAS were conducted in 4 independent cohorts encompassing 3,441 ever-smoking COPD subjects (GOLD stage II or higher). Untyped SNPs were imputed using HapMap (phase II) panel. Results from all cohorts were meta-analyzed.
Several SNPs near the HLA region on chromosome 6p21 and in an intergenic region on chromosome 2q21 showed associations with age at smoking initiation, both with the lowest p=2×10−7. No SNPs were associated with lifetime average CPD, current CPD or smoking cessation with p<10−6. Nominally significant associations with candidate SNPs within alpha-nicotinic acetylcholine receptors 3/5 (CHRNA3/CHRNA5; e.g. p=0.00011 for SNP rs1051730) and Cytochrome P450 2A6 (CYP2A6; e.g. p=2.78×10−5 for a nonsynonymous SNP rs1801272) regions were observed for lifetime average CPD, however only CYP2A6 showed evidence of significant association with current CPD. A candidate SNP (rs3025343) in the DBH was significantly (p=0.015) associated with smoking cessation.
We identified two candidate regions associated with age at smoking initiation in COPD subjects. Associations of CHRNA3/CHRNA5 and CYP2A6 loci with CPD and DBH with smoking cessation are also likely of importance in the smoking behaviors of COPD patients.
Chronic Obstructive Pulmonary Disease (COPD); Genome Wide Association study (GWAS); smoking behaviors; Single Nucleotide Polymorphism (SNP)
Case–parent trio studies concerned with children affected by a disease and their parents aim to detect single nucleotide polymorphisms (SNPs) showing a preferential transmission of alleles from the parents to their affected offspring. A popular statistical test for detecting such SNPs associated with disease in this study design is the genotypic transmission/disequilibrium test (gTDT) based on a conditional logistic regression model, which usually needs to be fitted by an iterative procedure. In this article, we derive exact closed-form solutions for the parameter estimates of the conditional logistic regression models when testing for an additive, a dominant, or a recessive effect of a SNP, and show that such analytic parameter estimates also exist when considering gene–environment interactions with binary environmental variables. Because the genetic model underlying the association between a SNP and a disease is typically unknown, it might further be beneficial to use the maximum over the gTDT statistics for the possible effects of a SNP as test statistic. We therefore propose a procedure enabling a fast computation of the test statistic and the permutation-based p-value of this MAX gTDT. All these methods are applied to whole-genome scans of the case–parent trios from the International Cleft Consortium. These applications show our procedures dramatically reduce the required computing time compared to the conventional iterative methods allowing, for example, the analysis of hundreds of thousands of SNPs in a few minutes instead of several hours.
Conditional logistic regression; Family-based design; Genome-wide association studies; Genotypic transmission/disequilibrium test; International Cleft Consortium; MAX test
The FaceBase Consortium consists of ten interlinked research and technology projects whose goal is to generate craniofacial research data and technology for use by the research community through a central data management and integrated bioinformatics hub. Funded by the National Institute of Dental and Craniofacial Research (NIDCR) and currently focused on studying the development of the middle region of the face, the Consortium will produce comprehensive datasets of global gene expression patterns, regulatory elements and sequencing; will generate anatomical and molecular atlases; will provide human normative facial data and other phenotypes; conduct follow up studies of a completed genome-wide association study; generate independent data on the genetics of craniofacial development, build repositories of animal models and of human samples and data for community access and analysis; and will develop software tools and animal models for analyzing and functionally testing and integrating these data. The FaceBase website (http://www.facebase.org) will serve as a web home for these efforts, providing interactive tools for exploring these datasets, together with discussion forums and other services to support and foster collaboration within the craniofacial research community.
Craniofacial development; Cleft lip and palate; Human genetics; Animal models; Database; Morphometrics
Non-syndromic cleft palate (CP) is a common birth defect with a complex and heterogeneous etiology involving both genetic and environmental risk factors. We conducted a genome wide association study (GWAS) using 550 case-parent trios, ascertained through a CP case collected in an international consortium. Family based association tests of single nucleotide polymorphisms (SNP) and three common maternal exposures (maternal smoking, alcohol consumption and multivitamin supplementation) were used in a combined 2 df test for gene (G) and gene-environment (G×E) interaction simultaneously, plus a separate 1 df test for G×E interaction alone. Conditional logistic regression models were used to estimate effects on risk to exposed and unexposed children. While no SNP achieved genome wide significance when considered alone, markers in several genes attained or approached genome wide significance when G×E interaction was included. Among these, MLLT3 and SMC2 on chromosome 9 showed multiple SNPs resulting in increased risk if the mother consumed alcohol during the peri-conceptual period (3 months prior to conception through the first trimester). TBK1 on chr. 12 and ZNF236 on chr. 18 showed multiple SNPs associated with higher risk of CP in the presence of maternal smoking. Additional evidence of reduced risk due to G×E interaction in the presence of multivitamin supplementation was observed for SNPs in BAALC on chr. 8. These results emphasize the need to consider G×E interaction when searching for genes influencing risk to complex and heterogeneous disorders, such as non-syndromic CP.
Interferon regulatory factor 2 (IRF2) is a member of a family of transcriptional factors involved in the modulation of interferon induced immune responses to viral infection. To test whether genetic variants in IRF2 predict risk of AD and ADEH, we genotyped 78 IRF2 tagging single nucleotide polymorphisms (SNPs) in both European American (n=435) and African American (n = 339) populations. Significant associations were observed between AD and two SNPs (rs793814, P = 0.007, odds ratio (OR) = 0.52; rs3756094, P = 0.037, OR = 0.66) among European Americans and one SNP (rs3775572, P = 0.016, OR = 0.46) among African Americans. Significant associations were also observed between ADEH and five SNPs (P = 0.049-0.022) among European Americans. The association with ADEH was further strengthened by haplotype analyses, wherein a 5-SNP (CAGGA) haplotype showed the strongest association with ADEH (P = 0.0008). Eight IRF2 SNPs were significantly associated with IFNγ production post-herpes simplex virus (HSV) stimulation (P = 0.048-0.0008), including an AD-associated SNP (rs13139310, P = 0.008). Our findings suggest distinct markers in IRF2 may be associated with AD and ADEH, which may depend upon ethnic ancestry, and genetic variants in IRF2 may contribute to an abnormal immune response to HSV.
The receptor tyrosine kinase-like orphan receptor 2 (ROR2) gene has been recently shown to play important roles in palatal development in animal models and resides in the chromosomal region linked to non syndromic cleft lip with or without cleft palate in humans. The aim of this study was to investigate the possible association between ROR2 gene and non-syndromic oral clefts.
Here we tested 38 eligible single-nucleotide polymorphisms (SNPs) in ROR2 gene in 297 non-syndromic cleft lip with or without cleft palate and in 82 non-syndromic cleft palate case parent trios recruited from Asia and Maryland. Family Based Association Test was used to test for deviation from Mendelian inheritance. Plink software was used to test potential parent of origin effect. Possible maternally mediated in utero effects were assessed using the TRIad Multi-Marker approach under an assumption of mating symmetry in the population.
Significant evidence of linkage and association was shown for 3 SNPs (rs7858435, rs10820914 and rs3905385) among 57 Asian non-syndromic cleft palate trios in Family Based Association Tests. P values for these 3 SNPs equaled to 0.000068, 0.000115 and 0.000464 respectively which were all less than the significance level (0.05/38=0.0013) adjusted by strict Bonferroni correction. Relevant odds ratios for the risk allele were 3.42 (1.80–6.50), 3.45 (1.75–6.67) and 2.94 (1.56–5.56), respectively. Statistical evidence of linkage and association was not shown for study groups other than non-syndromic cleft palate. Neither evidence for parent-of-origin nor maternal genotypic effect was shown for any of the ROR2 markers in our analysis for all study groups.
Our results provided evidence of linkage and association between the ROR2 gene and a gene controlling risk to non-syndromic cleft palate.
receptor tyrosine kinase-like orphan receptor 2; cleft lip; cleft palate; association; transmission disequilibrium test
Sensitization to cockroach allergen is one of the strongest predictors of asthma morbidity, especially among African Americans.
Our aims were to determine the genomic basis of cockroach sensitization and the specific response to cockroach antigen.
We investigated the Th1/Th2 cytokine profile of co-cultured plasmacytoid DCs (pDCs) and CD4+ T cells and the “transcript signature” of the immune response to cockroach antigen using high-throughput expression profiling of co-cultured cells.
We observed significantly elevated levels of IL-13, IL-10 and TNF-α, but undetectable levels of IL-12p70 and IFN-α, when cultures were exposed to crude cockroach antigen. A significant difference was observed for IL-13 between cockroach allergic and non-allergic individuals (p = 0.039). Microarray analyses demonstrated a greater response at 48 hours compared to 4 hours, with 50 genes being uniquely expressed in cockroach antigen-treated cells, including CD14, S100A8, CCL8, and IFI44L. The increased CD14 expression was further observed in purified pDCs, human monocytic THP-1 cells, and supernatant of co-cultured pDCs and CD4+ T cells in exposure to cockroach extract. Furthermore, the most differential expression of CD14 between cockroach allergy and non-cockroach allergy was only observed among individuals with the CC “high-risk” genotype of the CD14 -260C/T. Ingenuity Pathways Analysis (IPA) analyses suggested the interferon-signaling as the most significant canonical pathway.
Our results suggest these differentially expressed genes, particularly CD14, and genes in the interferon-signaling pathway may be important candidates for further investigation of their role in the immune response to cockroach allergen.
asthma; CD4+ T cells; CD14; cockroach sensitization; Dendritic cells (DCs); high-throughput expression profiling
The potential for genome-wide association studies to relate phenotypes to specific genetic variation is greatly increased when data can be combined or compared across multiple studies. To facilitate replication and validation across studies, RTI International (Research Triangle Park, North Carolina) and the National Human Genome Research Institute (Bethesda, Maryland) are collaborating on the consensus measures for Phenotypes and eXposures (PhenX) project. The goal of PhenX is to identify 15 high-priority, well-established, and broadly applicable measures for each of 21 research domains. PhenX measures are selected by working groups of domain experts using a consensus process that includes input from the scientific community. The selected measures are then made freely available to the scientific community via the PhenX Toolkit. Thus, the PhenX Toolkit provides the research community with a core set of high-quality, well-established, low-burden measures intended for use in large-scale genomic studies. PhenX measures will have the most impact when included at the experimental design stage. The PhenX Toolkit also includes links to standards and resources in an effort to facilitate data harmonization to legacy data. Broad acceptance and use of PhenX measures will promote cross-study comparisons to increase statistical power for identifying and replicating variants associated with complex diseases and with gene-gene and gene-environment interactions.
environmental exposure; epidemiologic methods; genetic research; genetics; genome-wide association study; meta-analysis as topic; phenotype; research design
Atopic dermatitis; eczema herpeticum; thymic stromal lymphopoietin (TSLP); interleukin (IL) 7-like cytokine (IL7R); thymic stromal lymphopoietin receptor (TSLPR); single nucleotide polymorphism (SNP); gene-gene interaction
Clefts of the lip and/or palate (CLP) are common birth defects of complex etiology. CLP can occur in isolation or as part of a broad range of chromosomal, Mendelian, or teratogenic syndromes. Although there has been marked progress in identifying genetic and environmental triggers for syndromic CLP, the etiology of the more common non-syndromic (isolated) forms remains poorly characterized. Recently, using a combination of epidemiology, careful phenotyping, genome-wide association studies and analysis of animal models, several distinct genetic and environmental risk factors have been identified and confirmed for non-syndromic CLP. These findings have advanced our understanding of developmental biology and created new opportunities for clinical translation research.
COPDGeneis a multicenter observational study designed to identify genetic factors associated with COPD. It will also characterize chest CT phenotypes in COPD subjects, including assessment of emphysema, gas trapping, and airway wall thickening. Finally, subtypes of COPD based on these phenotypes will be used in a comprehensive genome-wide study to identify COPD susceptibility genes.
COPDGene will enroll 10,000 smokers with and without COPD across the GOLD stages. Both Non-Hispanic white and African-American subjects are included in the cohort. Inspiratory and expiratory chest CT scans will be obtained on all participants. In addition to the cross-sectional enrollment process, these subjects will be followed regularly for longitudinal studies. A genome-wide association study (GWAS) will be done on an initial group of 4000 subjects to identify genetic variants associated with case-control status and several quantitative phenotypes related to COPD. The initial findings will be verified in an additional 2000 COPD cases and 2000 smoking control subjects, and further validation association studies will be carried out.
COPDGene will provide important new information about genetic factors in COPD, and will characterize the disease process using high resolution CT scans. Understanding genetic factors and CT phenotypes that define COPD will potentially permit earlier diagnosis of this disease and may lead to the development of treatments to modify progression.
Aquaporin-5 (AQP5) can cause mucus overproduction and lower lung function. Genetic variants in the AQP5 gene might be associated with rate of lung function decline in chronic obstructive pulmonary disease (COPD).
Five single nucleotide polymorphisms (SNPs) in AQP5 were genotyped in 429 European American individuals with COPD randomly selected from the NHLBI Lung Health Study. Mean annual decline in FEV1 % predicted, assessed over five years, was calculated as a linear regression slope, adjusting for potential covariates and stratified by smoking status. Constructs containing the wildtype allele and risk allele of the coding SNP N228K were generated using site-directed mutagenesis, and transfected into HBE-16 (human bronchial epithelial cell line). AQP5 abundance and localization were assessed by immunoblots and confocal immunofluoresence under control, shear stress and cigarette smoke extract (CSE 10%) exposed conditions to test for differential expression or localization.
Among continuous smokers, three of the five SNPs tested showed significant associations (0.02>P>0.004) with rate of lung function decline; no associations were observed among the group of intermittent or former smokers. Haplotype tests revealed multiple association signals (0.012>P>0.0008) consistent with the single-SNP results. In HBE16 cells, shear stress and CSE led to a decrease in AQP5 abundance in the wild-type, but not in the N228K AQP5 plasmid.
Polymorphisms in AQP5 were associated with rate of lung function decline in continuous smokers with COPD. A missense mutation modulates AQP-5 expression in response to cigarette smoke extract and shear stress. These results suggest that AQP5 may be an important candidate gene for COPD.
Case-parent trios were used in a genome wide association study of cleft lip with/without cleft palate (CL/P). SNPs near two genes not previously associated with CL/P [MAFB: most significant SNP rs13041247, with odds ratio per minor allele OR=0.704; 95%CI=0.635,0.778; p=2.05*10−11; and ABCA4: most significant SNP rs560426, with OR=1.432; 95%CI=1.292,1.587; p=5.70*10−12] and two previously identified regions (chr. 8q24 and IRF6) attained genome wide significance. Stratifying trios into European and Asian ancestry groups revealed differences in statistical significance, although estimated effect sizes were similar. Replication studies from several populations showed confirming evidence, with families of European ancestry giving stronger evidence for markers in 8q24 while Asian families showed stronger evidence for MAFB and ABCA4. Expression studies support a role for MAFB in palate development.
Isolated cleft lip with or without cleft palate (CL/P) is among the most common human birth defects, with a prevalence of approximately 1 in 700 live births. The B-Cell Leukemia/lymphoma 3 (BCL3) gene has been suggested as a candidate gene for CL/P based on association and linkage studies in some populations. This study tests for an association between markers in BCL3 and isolated, non-syndromic CL/P using a case-parent trio design, while considering parent-of-origin effects.
Forty case-parent trios were genotyped for two single nucleotide polymorphisms (SNPs) in the BCL3 gene. We performed a transmission disequilibrium test (TDT) on individual SNPs, and the FAMHAP package was used to estimate haplotype frequencies and to test for excess transmission of multi-SNP haplotypes.
The odds ratio for transmission of the minor allele, OR (transmission), was significant for SNP rs8100239 (OR=3.50, p=0.004) and rs2965169 (OR=2.08, p=0.027) when parent-of-origin was not considered. Parent-specific TDT revealed that SNP rs8100239 showed excess maternal transmission. Analysis of haplotypes of rs2965169 and rs8100239 also suggested excess maternal transmission.
BCL3 appears to influence risk of CL/P through a parent-of-origin effect with excess maternal transmission.
BCL3; Oral cleft; Maternal transmission effects; Parent-of-origin
Genome-wide association studies (GWAS) have emerged as powerful means for identifying genetic loci related to complex diseases. However, the role of environment and its potential to interact with key loci has not been adequately addressed in most GWAS. Networks of collaborative studies involving different study populations and multiple phenotypes provide a powerful approach for addressing the challenges in analysis and interpretation shared across studies. The Gene, Environment Association Studies (GENEVA) consortium was initiated to: identify genetic variants related to complex diseases; identify variations in gene-trait associations related to environmental exposures; and ensure rapid sharing of data through the database of Genotypes and Phenotypes. GENEVA consists of several academic institutions, including a coordinating center, two genotyping centers and 14 independently designed studies of various phenotypes, as well as several Institutes and Centers of the National Institutes of Health led by the National Human Genome Research Institute. Minimum detectable effect sizes include relative risks ranging from 1.24 to 1.57 and proportions of variance explained ranging from 0.0097 to 0.02. Given the large number of research participants (N > 80,000), an important feature of GENEVA is harmonization of common variables, which allow analyses of additional traits. Environmental exposure information available from most studies also enables testing of gene-environment interactions. Facilitated by its sizeable infrastructure for promoting collaboration, GENEVA has established a unified framework for genotyping, data quality control, analysis and interpretation. By maximizing knowledge obtained through collaborative GWAS incorporating environmental exposure information, GENEVA aims to enhance our understanding of disease etiology, potentially identifying opportunities for intervention.
genome-wide association; complex disease; quantitative traits; gene-environment interaction; phenotype harmonization
Rationale: Asthma prevalence and severity are high among underserved minorities, including those of African descent. The Duffy antigen/receptor for chemokines is the receptor for Plasmodium vivax on erythrocytes and functions as a chemokine-clearing receptor. Unlike European populations, decreased expression of the receptor on erythrocytes is common among populations of African descent, and results from a functional T-46C polymorphism (rs2814778) in the promoter. This variant provides an evolutionary advantage in malaria-endemic regions, because Duffy antigen/receptor for chemokines-negative erythrocytes are more resistant to infection by P. vivax.
Objectives: To determine the role of the rs2814778 polymorphism in asthma and atopy as measured by total serum IgE levels among four populations of African descent (African Caribbean, African American, Brazilian, and Colombian) and a European American population.
Methods: Family-based association tests were performed in each of the five populations to test for association between the rs2814778 polymorphism and asthma or total IgE concentration.
Measurements and Main Results: Asthma was significantly associated with the rs2814778 polymorphism in the African Caribbean, Colombian, and Brazilian families (P < 0.05). High total IgE levels were associated with this variant in African Caribbean and Colombian families (P < 0.05). The variant allele was not polymorphic among European Americans.
Conclusions: Susceptibility to asthma and atopy among certain populations of African descent is influenced by a functional polymorphism in the gene encoding Duffy antigen/receptor for chemokines. This genetic variant, which confers resistance to malarial parasitic infection, may also partially explain ethnic differences in morbidity of asthma.
Duffy antigen/receptor for chemokines; continental population groups; lung diseases; hypersensitivity
Identifying the ancestry of chromosomal segments of distinct ancestry has a wide range of applications from disease mapping to learning about history. Most methods require the use of unlinked markers; but, using all markers from genome-wide scanning arrays, it should in principle be possible to infer the ancestry of even very small segments with exquisite accuracy. We describe a method, HAPMIX, which employs an explicit population genetic model to perform such local ancestry inference based on fine-scale variation data. We show that HAPMIX outperforms other methods, and we explore its utility for inferring ancestry, learning about ancestral populations, and inferring dates of admixture. We validate the method empirically by applying it to populations that have experienced recent and ancient admixture: 935 African Americans from the United States and 29 Mozabites from North Africa. HAPMIX will be of particular utility for mapping disease genes in recently admixed populations, as its accurate estimates of local ancestry permit admixture and case-control association signals to be combined, enabling more powerful tests of association than with either signal alone.
The genomes of individuals from admixed populations consist of chromosomal segments of distinct ancestry. For example, the genomes of African American individuals contain segments of both African and European ancestry, so that a specific location in the genome may inherit 0, 1, or 2 copies of European ancestry. Inferring an individual's local ancestry, their number of copies of each ancestry at each location in the genome, has important applications in disease mapping and in understanding human history. Here we describe HAPMIX, a method that analyzes data from dense genotyping chips to infer local ancestry with very high precision. An important feature of HAPMIX is that it makes use of data from haplotypes (blocks of nearby markers), which are more informative for ancestry than individual markers. Our simulations demonstrate the utility of HAPMIX for local ancestry inference, and empirical applications to African American and Mozabite data sets uncover important aspects of the history of these populations.
Visceral leishmaniasis (VL) caused by Leishmania chagasi is endemic to northeast Brazil. A positive delayed-type hypersensitivity skin test response (DTH+) is a marker for acquired resistance to disease, clusters in families, and may be genetically controlled. Twenty-three single nucleotide polymorphisms (SNPs) were genotyped in the cytokine 5q23.3-q31.1 region IRF1-IL5-IL13-IL4-IL9-LECT2-TGFBI in 102 families (323 DTH+; 190 DTH−; 123 VL individuals) from a VL endemic region in northeast Brazil. Data from 20 SNPs were analysed for association with DTH+/− status and VL using family-based, stepwise conditional logistic regression analysis. Independent associations were observed between the DTH+ phenotype and markers in separate linkage disequilibrium blocks in LECT2 (OR 2.25; P=0.005; 95% CI=1.28-3.97) and TGFBI (OR 1.94; P=0.003; 95% CI=1.24-3.03). VL child/parent trios gave no evidence of linkage and association, but the DTH− phenotype was associated with SNP rs2070874 at IL4 (OR 3.14; P=0.006; 95% CI=1.38-7.14), and SNP rs30740 between LECT2 and TGFBI (OR 3.00; P=0.042; 95% CI=1.04-8.65). These results indicate several genes in the immune response gene cluster at 5q23.3-q31.1 influence outcomes of L. chagasi infection in this region of Brazil.
The protozoan Leishmania chagasi can cause disseminated, fatal visceral leishmaniasis (VL) or asymptomatic human infection. We hypothesized that genetic factors contribute to this variable response to infection. A family study was performed in endemic neighborhoods near Natal, northeast Brazil. Subjects were assessed for VL or asymptomatic infection, defined as a positive delayed type hypersensitivity (DTH) skin test response to Leishmania antigen without disease symptoms. A genome scan of 405 microsatellite markers in 1254 subjects was analyzed for regions of linkage. The results indicated loci of potential linkage to DTH response on chromosomes 2, 13, 15 and 19, and a novel region of potential interest for VL on chromosome 9. An understanding of the genetic factors determining whether an individual will develop symptomatic or asymptomatic infection with L. chagasi may illuminate proteins essential for immune protection against this parasitic disease; findings could reveal strategies for immunotherapy or prevention.
Visceral leishmaniasis; Delayed Type Hypersensitivity; Linkage Analysis; genetic susceptibility
The genetic basis of acute lung injury (ALI) is poorly understood. The myosin light chain kinase (MYLK) gene encodes the nonmuscle myosin light chain kinase isoform, a multifunctional protein involved in the inflammatory response (apoptosis, vascular permeability, leukocyte diapedesis). To examine MYLK as a novel candidate gene in sepsis-associated ALI, we sequenced exons, exon–intron boundaries, and 2 kb of 5′ UTR of the MYLK, which revealed 51 single-nucleotide polymorphisms (SNPs). Potential association of 28 MYLK SNPs with sepsis-associated ALI were evaluated in a case-control sample of 288 European American subjects (EAs) with sepsis alone, subjects with sepsis-associated ALI, or healthy control subjects, and a sample population of 158 African American subjects (AAs) with sepsis and ALI. Significant single locus associations in EAs were observed between four MYLK SNPs and the sepsis phenotype (P < 0.001), with an additional SNP associated with the ALI phenotype (P = 0.03). A significant association of a single SNP (identical to the SNP identified in EAs) was observed in AAs with sepsis (P = 0.002) and with ALI (P = 0.01). Three sepsis risk-conferring haplotypes in EAs were defined downstream of start codon of smooth muscle MYLK isoform, a region containing putative regulatory elements (P < 0.001). In contrast, multiple haplotypic analyses revealed an ALI-specific, risk-conferring haplotype at 5′ of the MYLK gene in both European and African Americans and an additional 3′ region haplotype only in African Americans. These data strongly implicate MYLK genetic variants to confer increased risk of sepsis and sepsis-associated ALI.
MYLK/MLCK; genetic association; SNP; ALI; sepsis
Orofacial clefts, including cleft lip with or without palate (CL/P) and cleft palate (CP), are one of the most common congenital malformations in Asian populations, where the rate of incidence is higher than in European or other racial groups. A number of candidate genes have been identified for orofacial clefts, although no single candidate has been consistently identified in all studies. We performed case-parent trio and case-control studies on 6 single nucleotide polymorphisms (SNPs) in the MSX1 gene using a sample of 52 CL/P and CP probands from Korea. In the case-control study, the allele frequencies of 6 MSX1 SNPs were compared between 52 oral cleft cases and 96 unmatched controls. For the case-parent trio study, single-marker and haplotype-based tests of transmission disequilibrium using allelic and genotypic tests revealed significant evidence of linkage in the presence of disequilibrium for 1170 G/A of exon 2. With the GG genotype as a reference group among GG, GA, and AA genotypes at 1170G/A, the disease risk decreased with the presence of the A allele (AA genotype: OR = 0.26, 95% CI = 0.10-0.99). These results are consistent with evidence from other studies in the US and Chile and confirm the importance of the MSX1 genotype in determining the risk of CL/P and CP in Koreans.
Cleft; MSX1; gene; case-control study; case trio study