Search tips
Search criteria

Results 1-20 (20)

Clipboard (0)

Select a Filter Below

more »
Year of Publication
Document Types
1.  Genome-wide copy number variation study associates metabotropic glutamate receptor gene networks with attention deficit hyperactivity disorder 
Nature genetics  2011;44(1):78-84.
Attention deficit hyperactivity disorder (ADHD) is a common, heritable neuropsychiatric disorder of unknown etiology. We performed a whole-genome copy number variation (CNV) study on 1,013 cases with ADHD and 4,105 healthy children of European ancestry using 550,000 SNPs. We evaluated statistically significant findings in multiple independent cohorts, with a total of 2,493 cases with ADHD and 9,222 controls of European ancestry, using matched platforms. CNVs affecting metabotropic glutamate receptor genes were enriched across all cohorts (P = 2.1 × 10−9). We saw GRM5 (encoding glutamate receptor, metabotropic 5) deletions in ten cases and one control (P = 1.36 × 10−6). We saw GRM7 deletions in six cases, and we saw GRM8 deletions in eight cases and no controls. GRM1 was duplicated in eight cases. We experimentally validated the observed variants using quantitative RT-PCR. A gene network analysis showed that genes interacting with the genes in the GRM family are enriched for CNVs in ~10% of the cases (P = 4.38 × 10−10) after correction for occurrence in the controls. We identified rare recurrent CNVs affecting glutamatergic neurotransmission genes that were overrepresented in multiple ADHD cohorts.
PMCID: PMC4310555  PMID: 22138692
2.  Oncogenic and sorafenib-sensitive ARAF mutations in lung adenocarcinoma  
The Journal of Clinical Investigation  2014;124(4):1582-1586.
Targeted cancer therapies often induce “outlier” responses in molecularly defined patient subsets. One patient with advanced-stage lung adenocarcinoma, who was treated with oral sorafenib, demonstrated a near-complete clinical and radiographic remission for 5 years. Whole-genome sequencing and RNA sequencing of primary tumor and normal samples from this patient identified a somatic mutation, ARAF S214C, present in the cancer genome and expressed at high levels. Additional mutations affecting this residue of ARAF and a nearby residue in the related kinase RAF1 were demonstrated across 1% of an independent cohort of lung adenocarcinoma cases. The ARAF mutations were shown to transform immortalized human airway epithelial cells in a sorafenib-sensitive manner. These results suggest that mutant ARAF is an oncogenic driver in lung adenocarcinoma and an indicator of sorafenib response.
PMCID: PMC3973082  PMID: 24569458
3.  Mutational heterogeneity in cancer and the search for new cancer genes 
Nature  2013;499(7457):214-218.
Major international projects are now underway aimed at creating a comprehensive catalog of all genes responsible for the initiation and progression of cancer. These studies involve sequencing of matched tumor–normal samples followed by mathematical analysis to identify those genes in which mutations occur more frequently than expected by random chance. Here, we describe a fundamental problem with cancer genome studies: as the sample size increases, the list of putatively significant genes produced by current analytical methods burgeons into the hundreds. The list includes many implausible genes (such as those encoding olfactory receptors and the muscle protein titin), suggesting extensive false positive findings that overshadow true driver events. Here, we show that this problem stems largely from mutational heterogeneity and provide a novel analytical methodology, MutSigCV, for resolving the problem. We apply MutSigCV to exome sequences from 3,083 tumor-normal pairs and discover extraordinary variation in (i) mutation frequency and spectrum within cancer types, which shed light on mutational processes and disease etiology, and (ii) mutation frequency across the genome, which is strongly correlated with DNA replication timing and also with transcriptional activity. By incorporating mutational heterogeneity into the analyses, MutSigCV is able to eliminate most of the apparent artefactual findings and allow true cancer genes to rise to attention.
PMCID: PMC3919509  PMID: 23770567
4.  A Pan-Cancer Analysis of Transcriptome Changes Associated with Somatic Mutations in U2AF1 Reveals Commonly Altered Splicing Events 
PLoS ONE  2014;9(1):e87361.
Although recurrent somatic mutations in the splicing factor U2AF1 (also known as U2AF35) have been identified in multiple cancer types, the effects of these mutations on the cancer transcriptome have yet to be fully elucidated. Here, we identified splicing alterations associated with U2AF1 mutations across distinct cancers using DNA and RNA sequencing data from The Cancer Genome Atlas (TCGA). Using RNA-Seq data from 182 lung adenocarcinomas and 167 acute myeloid leukemias (AML), in which U2AF1 is somatically mutated in 3–4% of cases, we identified 131 and 369 splicing alterations, respectively, that were significantly associated with U2AF1 mutation. Of these, 30 splicing alterations were statistically significant in both lung adenocarcinoma and AML, including three genes in the Cancer Gene Census, CTNNB1, CHCHD7, and PICALM. Cell line experiments expressing U2AF1 S34F in HeLa cells and in 293T cells provide further support that these altered splicing events are caused by U2AF1 mutation. Consistent with the function of U2AF1 in 3′ splice site recognition, we found that S34F/Y mutations cause preferences for CAG over UAG 3′ splice site sequences. This report demonstrates consistent effects of U2AF1 mutation on splicing in distinct cancer cell types.
PMCID: PMC3909098  PMID: 24498085
5.  Mapping the hallmarks of lung adenocarcinoma with massively parallel sequencing 
Cell  2012;150(6):1107-1120.
Lung adenocarcinoma, the most common subtype of non-small cell lung cancer, is responsible for over 500,000 deaths per year worldwide. Here, we report exome and genome sequences of 183 lung adenocarcinoma tumor/normal DNA pairs. These analyses revealed a mean exonic somatic mutation rate of 12.0 events/megabase and identified the majority of genes previously reported as significantly mutated in lung adenocarcinoma. In addition, we identified statistically recurrent somatic mutations in the splicing factor gene U2AF1 and truncating mutations affecting RBM10 and ARID1A. Analysis of nucleotide context-specific mutation signatures grouped the sample set into distinct clusters that correlated with smoking history and alterations of reported lung adenocarcinoma genes. Whole genome sequence analysis revealed frequent structural re-arrangements, including in-frame exonic alterations within EGFR and SIK2 kinases. The candidate genes identified in this study are attractive targets for biological characterization and therapeutic targeting of lung adenocarcinoma.
PMCID: PMC3557932  PMID: 22980975
6.  A Landscape of Driver Mutations in Melanoma 
Cell  2012;150(2):251-263.
Despite recent insights into melanoma genetics, systematic surveys for driver mutations are challenged by an abundance of passenger mutations caused by carcinogenic ultraviolet (UV) light exposure. We developed a permutation-based framework to address this challenge, employing mutation data from intronic sequences to control for passenger mutational load on a per gene basis. Analysis of large-scale melanoma exome data by this approach discovered six novel melanoma genes (PPP6C, RAC1, SNX31, TACC1, STK19 and ARID2), three of which - RAC1, PPP6C and STK19 - harbored recurrent and potentially targetable mutations. Integration with chromosomal copy number data contextualized the landscape of driver mutations, providing oncogenic insights in BRAF- and NRAS-driven melanoma as well as those without known NRAS/BRAF mutations. The landscape also clarified a mutational basis for RB and p53 pathway deregulation in this malignancy. Finally, the spectrum of driver mutations provided unequivocal genomic evidence for a direct mutagenic role of UV light in melanoma pathogenesis.
PMCID: PMC3600117  PMID: 22817889
7.  Common variants at five new loci associated with early-onset inflammatory bowel disease 
Nature Genetics  2009;41(12):1335-1340.
The inflammatory bowel diseases (IBD) Crohn’s disease and ulcerative colitis are common causes of morbidity in children and young adults in the western world. Here we report the results of a genome-wide association study in early-onset IBD involving 3,426 affected individuals and 11,963 genetically matched controls recruited through international collaborations in Europe and North America, thereby extending the results from a previous study of 1,011 individuals with early-onset IBD1. We have identified five new regions associated with early-onset IBD susceptibility, including 16p11 near the cytokine gene IL27 (rs8049439, P = 2.41 × 10−9), 22q12 (rs2412973, P = 1.55 × 10−9), 10q22 (rs1250550, P = 5.63 × 10−9), 2q37 (rs4676410, P = 3.64 × 10−8) and 19q13.11 (rs10500264, P = 4.26 × 10−10). Our scan also detected associations at 23 of 32 loci previously implicated in adult-onset Crohn’s disease and at 8 of 17 loci implicated in adult-onset ulcerative colitis, highlighting the close pathogenetic relationship between early- and adult-onset IBD.
PMCID: PMC3267927  PMID: 19915574
8.  A Genome-Wide Meta-Analysis of Six Type 1 Diabetes Cohorts Identifies Multiple Associated Loci 
PLoS Genetics  2011;7(9):e1002293.
Diabetes impacts approximately 200 million people worldwide, of whom approximately 10% are affected by type 1 diabetes (T1D). The application of genome-wide association studies (GWAS) has robustly revealed dozens of genetic contributors to the pathogenesis of T1D, with the most recent meta-analysis identifying in excess of 40 loci. To identify additional genetic loci for T1D susceptibility, we examined associations in the largest meta-analysis to date between the disease and ∼2.54 million SNPs in a combined cohort of 9,934 cases and 16,956 controls. Targeted follow-up of 53 SNPs in 1,120 affected trios uncovered three new loci associated with T1D that reached genome-wide significance. The most significantly associated SNP (rs539514, P = 5.66×10−11) resides in an intronic region of the LMO7 (LIM domain only 7) gene on 13q22. The second most significantly associated SNP (rs478222, P = 3.50×10−9) resides in an intronic region of the EFR3B (protein EFR3 homolog B) gene on 2p23; however, the region of linkage disequilibrium is approximately 800 kb and harbors additional multiple genes, including NCOA1, C2orf79, CENPO, ADCY3, DNAJC27, POMC, and DNMT3A. The third most significantly associated SNP (rs924043, P = 8.06×10−9) lies in an intergenic region on 6q27, where the region of association is approximately 900 kb and harbors multiple genes including WDR27, C6orf120, PHF10, TCTE3, C6orf208, LOC154449, DLL1, FAM120B, PSMB1, TBP, and PCD2. These latest associated regions add to the growing repertoire of gene networks predisposing to T1D.
Author Summary
Despite the fact that there is clearly a large genetic component to type 1 diabetes (T1D), uncovering the genes contributing to this disease has proven challenging. However, in the past three years there has been relatively major progress in this regard, with advances in genetic screening technologies allowing investigators to scan the genome for variants conferring risk for disease without prior hypotheses. Such genome-wide association studies have revealed multiple regions of the genome to be robustly and consistently associated with T1D. More recent findings have been a consequence of combining of multiple datasets from independent investigators in meta-analyses, which have more power to pick up additional variants contributing to the trait. In the current study, we describe the largest meta-analysis of T1D genome-wide genotyped datasets to date, which combines six large studies. As a consequence, we have uncovered three new signals residing at the chromosomal locations 13q22, 2p23, and 6q27, which went on to be replicated in independent sample sets. These latest associated regions add to the growing repertoire of gene networks predisposing to T1D.
PMCID: PMC3183083  PMID: 21980299
9.  Meta-analysis identifies 29 additional ulcerative colitis risk loci, increasing the number of confirmed associations to 47 
Anderson, Carl A. | Boucher, Gabrielle | Lees, Charlie W. | Franke, Andre | D’Amato, Mauro | Taylor, Kent D. | Lee, James C. | Goyette, Philippe | Imielinski, Marcin | Latiano, Anna | Lagacé, Caroline | Scott, Regan | Amininejad, Leila | Bumpstead, Suzannah | Baidoo, Leonard | Baldassano, Robert N. | Barclay, Murray | Bayless, Theodore M. | Brand, Stephan | Büning, Carsten | Colombel, Jean-Frédéric | Denson, Lee A. | De Vos, Martine | Dubinsky, Marla | Edwards, Cathryn | Ellinghaus, David | Fehrmann, Rudolf S.N. | Floyd, James A.B. | Florin, Tim | Franchimont, Denis | Franke, Lude | Georges, Michel | Glas, Jürgen | Glazer, Nicole L. | Guthery, Stephen L. | Haritunians, Talin | Hayward, Nicholas K. | Hugot, Jean-Pierre | Jobin, Gilles | Laukens, Debby | Lawrance, Ian | Lémann, Marc | Levine, Arie | Libioulle, Cecile | Louis, Edouard | McGovern, Dermot P. | Milla, Monica | Montgomery, Grant W. | Morley, Katherine I. | Mowat, Craig | Ng, Aylwin | Newman, William | Ophoff, Roel A | Papi, Laura | Palmieri, Orazio | Peyrin-Biroulet, Laurent | Panés, Julián | Phillips, Anne | Prescott, Natalie J. | Proctor, Deborah D. | Roberts, Rebecca | Russell, Richard | Rutgeerts, Paul | Sanderson, Jeremy | Sans, Miquel | Schumm, Philip | Seibold, Frank | Sharma, Yashoda | Simms, Lisa | Seielstad, Mark | Steinhart, A. Hillary | Targan, Stephan R. | van den Berg, Leonard H. | Vatn, Morten | Verspaget, Hein | Walters, Thomas | Wijmenga, Cisca | Wilson, David C. | Westra, Harm-Jan | Xavier, Ramnik J. | Zhao, Zhen Z. | Ponsioen, Cyriel Y. | Andersen, Vibeke | Torkvist, Leif | Gazouli, Maria | Anagnou, Nicholas P. | Karlsen, Tom H. | Kupcinskas, Limas | Sventoraityte, Jurgita | Mansfield, John C. | Kugathasan, Subra | Silverberg, Mark S. | Halfvarson, Jonas | Rotter, Jerome I. | Mathew, Christopher G. | Griffiths, Anne M. | Gearry, Richard | Ahmad, Tariq | Brant, Steven R. | Chamaillard, Mathias | Satsangi, Jack | Cho, Judy H. | Schreiber, Stefan | Daly, Mark J. | Barrett, Jeffrey C. | Parkes, Miles | Annese, Vito | Hakonarson, Hakon | Radford-Smith, Graham | Duerr, Richard H. | Vermeire, Séverine | Weersma, Rinse K. | Rioux, John D.
Nature genetics  2011;43(3):246-252.
Genome-wide association studies (GWAS) and candidate gene studies in ulcerative colitis (UC) have identified 18 susceptibility loci. We conducted a meta-analysis of 6 UC GWAS, comprising 6,687 cases and 19,718 controls, and followed-up the top association signals in 9,628 cases and 12,917 controls. We identified 29 additional risk loci (P<5×10-8), increasing the number of UC associated loci to 47. After annotating associated regions using GRAIL, eQTL data and correlations with non-synonymous SNPs, we identified many candidate genes providing potentially important insights into disease pathogenesis, including IL1R2, IL8RA/B, IL7R, IL12B, DAP, PRDM1, JAK2, IRF5, GNA12 and LSP1. The total number of confirmed inflammatory bowel disease (IBD) risk loci is now 99, including a minimum of 28 shared association signals between Crohn’s disease (CD) and UC.
PMCID: PMC3084597  PMID: 21297633
10.  Comparative genetic analysis of inflammatory bowel disease and type 1 diabetes implicates multiple loci with opposite effects 
Human Molecular Genetics  2010;19(10):2059-2067.
Inflammatory bowel disease, including Crohn's disease (CD) and ulcerative colitis (UC), and type 1 diabetes (T1D) are autoimmune diseases that may share common susceptibility pathways. We examined known susceptibility loci for these diseases in a cohort of 1689 CD cases, 777 UC cases, 989 T1D cases and 6197 shared control subjects of European ancestry, who were genotyped by the Illumina HumanHap550 SNP arrays. We identified multiple previously unreported or unconfirmed disease associations, including known CD loci (ICOSLG and TNFSF15) and T1D loci (TNFAIP3) that confer UC risk, known UC loci (HERC2 and IL26) that confer T1D risk and known UC loci (IL10 and CCNY) that confer CD risk. Additionally, we show that T1D risk alleles residing at the PTPN22, IL27, IL18RAP and IL10 loci protect against CD. Furthermore, the strongest risk alleles for T1D within the major histocompatibility complex (MHC) confer strong protection against CD and UC; however, given the multi-allelic nature of the MHC haplotypes, sequencing of the MHC locus will be required to interpret this observation. These results extend our current knowledge on genetic variants that predispose to autoimmunity, and suggest that many loci involved in autoimmunity may be under a balancing selection due to antagonistic pleiotropic effect. Our analysis implies that variants with opposite effects on different diseases may facilitate the maintenance of common susceptibility alleles in human populations, making autoimmune diseases especially amenable to genetic dissection by genome-wide association studies.
PMCID: PMC2860894  PMID: 20176734
11.  Duplication of the SLIT3 Locus on 5q35.1 Predisposes to Major Depressive Disorder 
PLoS ONE  2010;5(12):e15463.
Major depressive disorder (MDD) is a common psychiatric and behavioral disorder. To discover novel variants conferring risk to MDD, we conducted a whole-genome scan of copy number variation (CNV), including 1,693 MDD cases and 4,506 controls genotyped on the Perlegen 600K platform. The most significant locus was observed on 5q35.1, harboring the SLIT3 gene (P = 2×10−3). Extending the controls with 30,000 subjects typed on the Illumina 550 k array, we found the CNV to remain exclusive to MDD cases (P = 3.2×10−9). Duplication was observed in 5 unrelated MDD cases encompassing 646 kb with highly similar breakpoints. SLIT3 is integral to repulsive axon guidance based on binding to Roundabout receptors. Duplication of 5q35.1 is a highly penetrant variation accounting for 0.7% of the subset of 647 cases harboring large CNVs, using a threshold of a minimum of 10 SNPs and 100 kb. This study leverages a large dataset of MDD cases and controls for the analysis of CNVs with matched platform and ethnicity. SLIT3 duplication is a novel association which explains a definitive proportion of the largely unknown etiology of MDD.
PMCID: PMC2995745  PMID: 21152026
12.  Common genetic variants on 5p14.1 associate with autism spectrum disorders 
Nature  2009;459(7246):528-533.
Autism spectrum disorders (ASDs) represent a group of childhood neurodevelopmental and neuropsychiatric disorders characterized by deficits in verbal communication, impairment of social interaction, and restricted and repetitive patterns of interests and behaviour. To identify common genetic risk factors underlying ASDs, here we present the results of genome-wide association studies on a cohort of 780 families (3,101 subjects) with affected children, and a second cohort of 1,204 affected subjects and 6,491 control subjects, all of whom were of European ancestry. Six single nucleotide polymorphisms between cadherin 10 (CDH10) and cadherin 9 (CDH9)—two genes encoding neuronal cell-adhesion molecules—revealed strong association signals, with the most significant SNP being rs4307059 (P = 3.4 × 10−8, odds ratio = 1.19). These signals were replicated in two independent cohorts, with combined P values ranging from 7.4 × 10−8 to 2.1 × 10−10. Our results implicate neuronal cell-adhesion molecules in the pathogenesis of ASDs, and represent, to our knowledge, the first demonstration of genome-wide significant association of common variants with susceptibility to ASDs.
PMCID: PMC2943511  PMID: 19404256
13.  Autism genome-wide copy number variation reveals ubiquitin and neuronal genes 
Nature  2009;459(7246):569-573.
Autism spectrum disorders (ASDs) are childhood neurodevelopmental disorders with complex genetic origins1–4. Previous studies focusing on candidate genes or genomic regions have identified several copy number variations (CNVs) that are associated with an increased risk of ASDs5–9. Here we present the results from a whole-genome CNV study on a cohort of 859 ASD cases and 1,409 healthy children of European ancestry who were genotyped with ~550,000 single nucleotide polymorphism markers, in an attempt to comprehensively identify CNVs conferring susceptibility to ASDs. Positive findings were evaluated in an independent cohort of 1,336 ASD cases and 1,110 controls of European ancestry. Besides previously reported ASD candidate genes, such as NRXN1 (ref. 10) and CNTN4 (refs 11, 12), several new susceptibility genes encoding neuronal cell-adhesion molecules, including NLGN1 and ASTN2, were enriched with CNVs in ASD cases compared to controls (P = 9.5 × 10−3). Furthermore, CNVs within or surrounding genes involved in the ubiquitin pathways, including UBE3A, PARK2, RFWD2 and FBXO40, were affected by CNVs not observed in controls (P = 3.3 × 10−3). We also identified duplications 55 kilobases upstream of complementary DNA AK123120 (P = 3.6 × 10−6). Although these variants may be individually rare, they target genes involved in neuronal cell-adhesion or ubiquitin degradation, indicating that these two important gene networks expressed within the central nervous system may contribute to the genetic susceptibility of ASD.
PMCID: PMC2925224  PMID: 19404257
14.  Investigation of the locus near MC4R with childhood obesity in Americans of European and African ancestry 
Obesity (Silver Spring, Md.)  2009;17(7):1461-1465.
Recently a modest, but consistently, replicated association was demonstrated between obesity and the single nucleotide polymorphism (SNP), rs17782313, 3’ of the MC4R locus as a consequence of a meta-analysis of genome wide association (GWA) studies of the disease in Caucasian populations. We investigated the association in the context of the childhood form of the disease utilizing data from our ongoing GWA study in a cohort of 728 European American (EA) obese children (BMI ≥ 95th percentile) and 3,960 EA controls (BMI < 95th percentile), as well as 1,008 African American (AA) obese children and 2,715 AA controls. rs571312, rs10871777 and rs476828 (perfect surrogates for rs17782313) yielded odds ratios in the EA cohort of 1.142 (P = 0.045), 1.137 (P = 0.054) and 1.145 (P = 0.042); however, there was no significant association with these SNPs in the AA cohort. When investigating all thirty SNPs present on the Illumina BeadChip at this locus, again there was no evidence for association in AA cases when correcting for the number of tests employed. As such, variants 3’ to the MC4R locus present on the genotyping platform utilized confer a similar magnitude of risk of obesity in Caucasian children as to their adult Caucasian counterparts but this observation did not extend to African Americans.
PMCID: PMC2860794  PMID: 19265794
15.  Follow-Up Analysis of Genome-Wide Association Data Identifies Novel Loci for Type 1 Diabetes 
Diabetes  2009;58(1):290-295.
OBJECTIVE—Two recent genome-wide association (GWA) studies have revealed novel loci for type 1 diabetes, a common multifactorial disease with a strong genetic component. To fully utilize the GWA data that we had obtained by genotyping 563 type 1 diabetes probands and 1,146 control subjects, as well as 483 case subject–parent trios, using the Illumina HumanHap550 BeadChip, we designed a full stage 2 study to capture other possible association signals.
RESEARCH DESIGN AND METHODS—From our existing datasets, we selected 982 markers with P < 0.05 in both GWA cohorts. Genotyping these in an independent set of 636 nuclear families with 974 affected offspring revealed 75 markers that also had P < 0.05 in this third cohort. Among these, six single nucleotide polymorphisms in five novel loci also had P < 0.05 in the Wellcome Trust Case-Control Consortium dataset and were further tested in 1,303 type 1 diabetes probands from the Diabetes Control and Complications Trial/Epidemiology of Diabetes Interventions and Complications (DCCT/EDIC) plus 1,673 control subjects.
RESULTS—Two markers (rs9976767 and rs3757247) remained significant after adjusting for the number of tests in this last cohort; they reside in UBASH3A (OR 1.16; combined P = 2.33 × 10−8) and BACH2 (1.13; combined P = 1.25 × 10−6).
CONCLUSIONS—Evaluation of a large number of statistical GWA candidates in several independent cohorts has revealed additional loci that are associated with type 1 diabetes. The two genes at these respective loci, UBASH3A and BACH2, are both biologically relevant to autoimmunity.
PMCID: PMC2606889  PMID: 18840781
16.  Association of the BANK 1 R61H variant with systemic lupus erythematosus in Americans of European and African ancestry 
Recently an association was demonstrated between the single nucleotide polymorphism (SNP), rs10516487, within the B-cell gene BANK1 and systemic lupus erythematosus (SLE) as a consequence of a genome wide association study of this disease in European and Argentinean populations. In a bid for replication, we examined the effects of the R61H non-synonymous variant with respect to SLE in our genotyped American cohorts of European and African ancestry. Utilizing data from our ongoing genome-wide association study in our cohort of 178 Caucasian SLE cases and 1808 Caucasian population-based controls plus 148 African American (AA) SLE cases and 1894 AA population-based controls we investigated the association of the previously described non-synonymous SNP at the BANK1 locus with the disease in the two ethnicities separately. Using a Fisher’s exact test, the minor allele frequency (MAF) of rs10516487 in the Caucasian cases was 22.6% while it was 31.2% in Caucasian controls, yielding a protective odds ratio (OR) of 0.64 (95% CI 0.49-0.85; one-sided p = 7.07 × 10−4). Furthermore, the MAF of rs10516487 in the AA cases was 18.7% while it was 23.3% in AA controls, yielding a protective OR of 0.75 (95% CI 0.55–1.034; one-sided p = 0.039). The OR of the BANK1 variant in our study cohorts is highly comparable with that reported previously in a South American/European SLE case-control cohort (OR = 0.72). As such, R61H in the BANK1 gene confers a similar magnitude of SLE protection, not only in European Americans, but also in African Americans.
PMCID: PMC3681036  PMID: 23776345
systemic lupus erythematosus; African Americans; European Americans; BANK1 gene
17.  Loci on 20q13 and 21q22 are associated with pediatric-onset inflammatory bowel disease 
Nature genetics  2008;40(10):1211-1215.
Inflammatory bowel disease (IBD) is a common inflammatory disorder with complex etiology that involves both genetic and environmental triggers, including but not limited to defects in bacterial clearance, defective mucosal barrier and persistent dysregulation of the immune response to commensal intestinal bacteria. IBD is characterized by two distinct phenotypes: Crohn’s disease (CD) and ulcerative colitis (UC). Previously reported GWA studies have identified genetic variation accounting for a small portion of the overall genetic susceptibility to CD and an even smaller contribution to UC pathogenesis. We hypothesized that stratification of IBD by age of onset might identify additional genes associated with IBD. To that end, we carried out a GWA analysis in a cohort of 1,011 individuals with pediatric-onset IBD and 4,250 matched controls. We identified and replicated significantly associated, previously unreported loci on chromosomes 20q13 (rs2315008[T] and rs4809330[A]; P = 6.30 × 10−8 and 6.95 × 10−8, respectively; odds ratio (OR) = 0.74 for both) and 21q22 (rs2836878[A]; P = 6.01 × 10−8; OR = 0.73), located close to the TNFRSF6B and PSMG1 genes, respectively.
PMCID: PMC2770437  PMID: 18758464
18.  Genome-Wide Analyses of Exonic Copy Number Variants in a Family-Based Study Point to Novel Autism Susceptibility Genes 
PLoS Genetics  2009;5(6):e1000536.
The genetics underlying the autism spectrum disorders (ASDs) is complex and remains poorly understood. Previous work has demonstrated an important role for structural variation in a subset of cases, but has lacked the resolution necessary to move beyond detection of large regions of potential interest to identification of individual genes. To pinpoint genes likely to contribute to ASD etiology, we performed high density genotyping in 912 multiplex families from the Autism Genetics Resource Exchange (AGRE) collection and contrasted results to those obtained for 1,488 healthy controls. Through prioritization of exonic deletions (eDels), exonic duplications (eDups), and whole gene duplication events (gDups), we identified more than 150 loci harboring rare variants in multiple unrelated probands, but no controls. Importantly, 27 of these were confirmed on examination of an independent replication cohort comprised of 859 cases and an additional 1,051 controls. Rare variants at known loci, including exonic deletions at NRXN1 and whole gene duplications encompassing UBE3A and several other genes in the 15q11–q13 region, were observed in the course of these analyses. Strong support was likewise observed for previously unreported genes such as BZRAP1, an adaptor molecule known to regulate synaptic transmission, with eDels or eDups observed in twelve unrelated cases but no controls (p = 2.3×10−5). Less is known about MDGA2, likewise observed to be case-specific (p = 1.3×10−4). But, it is notable that the encoded protein shows an unexpectedly high similarity to Contactin 4 (BLAST E-value = 3×10−39), which has also been linked to disease. That hundreds of distinct rare variants were each seen only once further highlights complexity in the ASDs and points to the continued need for larger cohorts.
Author Summary
Autism spectrum disorders (ASDs) are common neurodevelopmental syndromes with a strong genetic component. ASDs are characterized by disturbances in social behavior, impaired verbal and nonverbal communication, as well as repetitive behaviors and/or a restricted range of interests. To identify genes likely to contribute to ASD etiology, we performed high density genotyping in 912 multiplex families from the Autism Genetics Resource Exchange (AGRE) collection and contrasted results to those obtained for 1,488 healthy controls. To enrich for variants most likely to interfere with gene function, we restricted our analyses to deletions and gains encompassing exons. Of the many genomic regions highlighted, 27 were seen to harbor rare variants in cases and not controls, both in the first phase of our analysis, and also in an independent replication cohort comprised of 859 cases and 1,051 controls. More work in a larger number of individuals will be required to determine which of the rare alleles highlighted here are indeed related to the ASDs and how they act to shape risk.
PMCID: PMC2695001  PMID: 19557195
19.  Exploiting the pathway structure of metabolism to reveal high-order epistasis 
BMC Systems Biology  2008;2:40.
Biological robustness results from redundant pathways that achieve an essential objective, e.g. the production of biomass. As a consequence, the biological roles of many genes can only be revealed through multiple knockouts that identify a set of genes as essential for a given function. The identification of such "epistatic" essential relationships between network components is critical for the understanding and eventual manipulation of robust systems-level phenotypes.
We introduce and apply a network-based approach for genome-scale metabolic knockout design. We apply this method to uncover over 11,000 minimal knockouts for biomass production in an in silico genome-scale model of E. coli. A large majority of these "essential sets" contain 5 or more reactions, and thus represent complex epistatic relationships between components of the E. coli metabolic network.
The complex minimal biomass knockouts discovered with our approach illuminate robust essential systems-level roles for reactions in the E. coli metabolic network. Unlike previous approaches, our method yields results regarding high-order epistatic relationships and is applicable at the genome-scale.
PMCID: PMC2390508  PMID: 18447928
20.  Association Analysis of the FTO Gene with Obesity in Children of Caucasian and African Ancestry Reveals a Common Tagging SNP 
PLoS ONE  2008;3(3):e1746.
Recently an association was demonstrated between the single nucleotide polymorphism (SNP), rs9939609, within the FTO locus and obesity as a consequence of a genome wide association (GWA) study of type 2 diabetes in adults. We examined the effects of two perfect surrogates for this SNP plus 11 other SNPs at this locus with respect to our childhood obesity cohort, consisting of both Caucasians and African Americans (AA). Utilizing data from our ongoing GWA study in our cohort of 418 Caucasian obese children (BMI≥95th percentile), 2,270 Caucasian controls (BMI<95th percentile), 578 AA obese children and 1,424 AA controls, we investigated the association of the previously reported variation at the FTO locus with the childhood form of this disease in both ethnicities. The minor allele frequencies (MAF) of rs8050136 and rs3751812 (perfect surrogates for rs9939609 i.e. both r2 = 1) in the Caucasian cases were 0.448 and 0.443 respectively while they were 0.391 and 0.386 in Caucasian controls respectively, yielding for both an odds ratio (OR) of 1.27 (95% CI 1.08–1.47; P = 0.0022). Furthermore, the MAFs of rs8050136 and rs3751812 in the AA cases were 0.449 and 0.115 respectively while they were 0.436 and 0.090 in AA controls respectively, yielding an OR of 1.05 (95% CI 0.91–1.21; P = 0.49) and of 1.31 (95% CI 1.050–1.643; P = 0.017) respectively. Investigating all 13 SNPs present on the Illumina HumanHap550 BeadChip in this region of linkage disequilibrium, rs3751812 was the only SNP conferring significant risk in AA. We have therefore replicated and refined the association in an AA cohort and distilled a tag-SNP, rs3751812, which captures the ancestral origin of the actual mutation. As such, variants in the FTO gene confer a similar magnitude of risk of obesity to children as to their adult counterparts and appear to have a global impact.
PMCID: PMC2262153  PMID: 18335027

Results 1-20 (20)