1.  A genome wide association study of plasma uric acid levels in obese cases and never-overweight controls 
Obesity (Silver Spring, Md.)  2013;21(9):E490-E494.
To identify plasma uric acid related genes in extremely obese and normal weight individuals using genome wide association studies (GWAS).
Design and Methods
Using genotypes from a GWAS focusing on obesity and thinness, we performed quantitative trait association analyses (PLINK) for plasma uric acid levels in 1,060 extremely obese individuals [body mass index (BMI) >35 kg/m2] and normal-weight controls (BMI <25 kg/m2). Of the 961 samples with uric acid data, 924 were from females.
Significant associations were found between SLC2A9 gene SNPs and plasma uric acid levels (rs6449213, P=3.15×10−12). DIP2C gene SNP rs877282 also reached genome wide significance (P=4.56×10−8). Weaker associations (P<1×10−5) were found in the F5, PXDNL, FRAS1, LCORL, and MICAL2 genes. Besides SLC2A9, 3 previously identified uric acid related genes, ABCG2 (rs2622605, P=0.0026), SLC17A1 (rs3799344, P=0.0017), and RREB1 (rs1615495, P=0.00055), received marginal support in our study.
Two genes/chromosome regions reached genome wide association significance (P<1×10−7, 550K SNPs) in our GWAS: SLC2A9, the chromosome 2 60.1 Mb region (rs6723995), and the DIP2C gene region. Five other genes (F5, PXDNL, FRAS1, LCORL, and MICAL2) yielded P<1×10−5. Four previously reported associations were replicated in our study, including SLC2A9, ABCG2, RREB1, and SLC17A1.
PMCID: PMC3762924  PMID: 23703922
uric acid; genome wide association study; obesity
2.  The missense variation landscape of FTO, MC4R and TMEM18 in obese children of African ancestry 
Obesity (Silver Spring, Md.)  2013;21(1):159-163.
Common variation at the loci harboring FTO, MC4R and TMEM18 is consistently reported as being statistically the most strongly associated with obesity. We investigated whether these loci also harbor rarer missense variants that confer substantially higher risk of common childhood obesity in African American (AA) children. We sequenced the exons of FTO, MC4R and TMEM18 in an initial subset of our cohort, i.e. 200 obese (BMI≥95th percentile) and 200 lean AA children (BMI≤5th percentile). Any missense exonic variants that were uncovered were then genotyped in a further 768 obese and 768 lean (BMI≤50th percentile) children of the same ethnicity. A number of exonic variants were observed from our sequencing effort: seven in FTO, of which four were non-synonymous (A163T, G182A, M400V and A405V), thirteen in MC4R, of which six were non-synonymous (V103I, N123S, S136A, F202L, N240S and I251L) and four in TMEM18, of which two were non-synonymous (P2S and V113L). Follow-up genotyping of these missense variants revealed only one significant difference in allele frequency between cases and controls, namely with N240S in MC4R (Fisher's Exact P = 0.0001). In summary, moderately rare missense variants within the FTO, MC4R and TMEM18 genes observed in our study did not confer risk of common childhood obesity in African Americans except for a degree of evidence for one known loss-of-function variant in MC4R.
PMCID: PMC3605748  PMID: 23505181
Obesity; Pediatrics; Genomics
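The allele-frequency comparison in record 2 is reported with a Fisher's exact test. The abstract does not give the underlying allele counts, so the following is only a minimal sketch of a two-sided Fisher's exact test on a 2x2 table, written with the Python standard library. The function name and the example counts are hypothetical and are not taken from the study.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact P for the 2x2 table [[a, b], [c, d]].

    Rows could be cases/controls and columns minor/major allele counts.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(k):
        # Hypergeometric probability of seeing k in the top-left cell
        return comb(row1, k) * comb(row2, col1 - k) / denom

    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    p_observed = prob(a)
    # Sum the probabilities of all tables no more likely than the observed one
    # (small tolerance guards against floating-point ties)
    return sum(
        p
        for p in (prob(k) for k in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)
    )


# Hypothetical counts, for illustration only
p_value = fisher_exact_two_sided(3, 1, 1, 3)
print(p_value)
```

With real data, the four arguments would be the minor and major allele counts in cases and in controls.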
3.  Examination of All Type 2 Diabetes GWAS Loci Reveals HHEX-IDE as a Locus Influencing Pediatric BMI 
Diabetes  2009;59(3):751-755.
A number of studies have found that BMI in early life influences the risk of developing type 2 diabetes later in life. Our goal was to investigate if any type 2 diabetes variants uncovered through genome-wide association studies (GWAS) impact BMI in childhood.
Using data from an ongoing GWAS of pediatric BMI in our cohort, we investigated the association of pediatric BMI with 20 single nucleotide polymorphisms at 18 type 2 diabetes loci uncovered through GWAS, consisting of ADAMTS9, CDC123-CAMK1D, CDKAL1, CDKN2A/B, EXT2, FTO, HHEX-IDE, IGF2BP2, the intragenic region on 11p12, JAZF1, KCNQ1, LOC387761, MTNR1B, NOTCH2, SLC30A8, TCF7L2, THADA, and TSPAN8-LGR5. We randomly partitioned our cohort exactly in half in order to have a discovery cohort (n = 3,592) and a replication cohort (n = 3,592).
Our data show that the major type 2 diabetes risk–conferring G allele of rs7923837 at the HHEX-IDE locus was associated with higher pediatric BMI in both the discovery (P = 0.0013 and survived correction for 20 tests) and replication (P = 0.023) sets (combined P = 1.01 × 10−4). Association was not detected with any other known type 2 diabetes loci uncovered to date through GWAS except for the well-established FTO.
Our data show that the same genetic HHEX-IDE variant, which is associated with type 2 diabetes from previous studies, also influences pediatric BMI.
PMCID: PMC2828649  PMID: 19933996
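Record 3 states that the discovery P value of 0.0013 survived correction for 20 tests. The abstract does not name the correction method, so the sketch below assumes a Bonferroni adjustment and a significance level of 0.05; both are assumptions made for illustration.

```python
def bonferroni_adjust(p_value, n_tests):
    """Bonferroni-adjusted P value, capped at 1."""
    return min(1.0, p_value * n_tests)


discovery_p = 0.0013  # discovery-set P value quoted in the abstract
n_tests = 20          # number of SNPs tested, per the abstract
alpha = 0.05          # assumed significance level (not stated in the abstract)

adjusted = bonferroni_adjust(discovery_p, n_tests)
survives = adjusted < alpha
print(f"adjusted P = {adjusted:.3f}, survives correction: {survives}")
```

Under these assumptions the adjusted value is 0.026, which is below 0.05 and consistent with the statement in the abstract.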
4.  Examination of Type 2 Diabetes Loci Implicates CDKAL1 as a Birth Weight Gene 
Diabetes  2009;58(10):2414-2418.
A number of studies have found that reduced birth weight is associated with type 2 diabetes later in life; however, the underlying mechanism for this correlation remains unresolved. Recently, association has been demonstrated between low birth weight and single nucleotide polymorphisms (SNPs) at the CDKAL1 and HHEX-IDE loci, regions that were previously implicated in the pathogenesis of type 2 diabetes. In order to investigate whether type 2 diabetes risk–conferring alleles associate with low birth weight in our Caucasian childhood cohort, we examined the effects of 20 such loci on this trait.
Using data from an ongoing genome-wide association study in our cohort of 5,465 Caucasian children with recorded birth weights, we investigated the association of the previously reported type 2 diabetes–associated variation at 20 loci including TCF7L2, HHEX-IDE, PPARG, KCNJ11, SLC30A8, IGF2BP2, CDKAL1, CDKN2A/2B, and JAZF1 with birth weight.
Our data show that the minor allele of rs7756992 (P = 8 × 10−5) at the CDKAL1 locus is strongly associated with lower birth weight, whereas a perfect surrogate for variation previously implicated for the trait at the same locus only yielded nominally significant association (P = 0.01; r2 with rs7756992 = 0.677). However, association was not detected with any of the other type 2 diabetes loci studied.
We observe association between lower birth weight and type 2 diabetes risk–conferring alleles at the CDKAL1 locus. Our data show that the same genetic locus that has been identified as a marker for type 2 diabetes in previous studies also influences birth weight.
PMCID: PMC2750235  PMID: 19592620
5.  Contribution of common non-synonymous variants in PCSK1 to body mass index variation and risk of obesity: a systematic review and meta-analysis with evidence from up to 331 175 individuals 
Human Molecular Genetics  2015;24(12):3582-3594.
Polymorphisms rs6232 and rs6234/rs6235 in PCSK1 have been associated with extreme obesity [e.g. body mass index (BMI) ≥ 40 kg/m2], but their contribution to common obesity (BMI ≥ 30 kg/m2) and BMI variation in a multi-ethnic context is unclear. To fill this gap, we collected phenotypic and genetic data in up to 331 175 individuals from diverse ethnic groups. This process involved a systematic review of the literature in PubMed, Web of Science, Embase and the NIH GWAS catalog complemented by data extraction from pre-existing GWAS or custom-arrays in consortia and single studies. We employed recently developed global meta-analytic random-effects methods to calculate summary odds ratios (OR) and 95% confidence intervals (CIs) or beta estimates and standard errors (SE) for the obesity status and BMI analyses, respectively. Significant associations were found with binary obesity status for rs6232 (OR = 1.15, 95% CI 1.06–1.24, P = 6.08 × 10−6) and rs6234/rs6235 (OR = 1.07, 95% CI 1.04–1.10, P = 3.00 × 10−7). Similarly, significant associations were found with continuous BMI for rs6232 (β = 0.03, 95% CI 0.00–0.07; P = 0.047) and rs6234/rs6235 (β = 0.02, 95% CI 0.00–0.03; P = 5.57 × 10−4). Ethnicity, age and study ascertainment significantly modulated the association of PCSK1 polymorphisms with obesity. In summary, we demonstrate evidence that common gene variation in PCSK1 contributes to BMI variation and susceptibility to common obesity in the largest known meta-analysis published to date in genetic epidemiology.
PMCID: PMC4498155  PMID: 25784503
6.  Pathway-Based Genome-Wide Association Studies for Plasma Triglycerides in Obese Females and Normal-Weight Controls 
PLoS ONE  2015;10(8):e0134923.
Pathway-based analysis can provide information complementary to single-marker genome-wide association studies (GWASs), which ignore epistasis and lack sufficient power to find rare variants. In this study, using genotypes from a genome-wide association study (GWAS), pathway-based association studies were carried out with a modified Gene Set Enrichment Algorithm (GSEA) method (GenGen) for triglyceride in 1028 unrelated European-American extremely obese females (BMI≥35kg/m2) and normal-weight controls (BMI<25kg/m2); a second pathway association analysis (ICSNPathway) was used to verify the GenGen result in the same data. The GO0009110 pathway (vitamin anabolism) was among the strongest associations with triglyceride (empirical P<0.001); the result remained significant after FDR correction (P = 0.022). MMAB, an obesity-related locus, is included in this pathway. The ABCG1 and BCL6 genes were found in several triglyceride-related pathways (empirical P<0.05), which were also replicated by ICSNPathway (empirical P<0.05, FDR<0.05). We also performed single-marker GWAS using PLINK for TG levels (log-transformed). Significant associations were found between ASTN2 gene SNPs and plasma triglyceride levels (rs7035794, P = 2.24×10−10). Our study suggests that the vitamin anabolism pathway, BCL6 gene pathways and the ASTN2 gene may contribute to the genetic variation of plasma triglyceride concentrations.
PMCID: PMC4550433  PMID: 26308950
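Record 6 relies on a modified Gene Set Enrichment Algorithm. The abstract does not describe the modification used by GenGen, so the sketch below shows only the basic running-sum enrichment score from the GSEA family of methods. The function name, the `weight` parameter, and the toy gene names and scores are hypothetical. In practice significance is usually assessed by permutation, which is not shown here.

```python
def enrichment_score(ranked_genes, scores, gene_set, weight=1.0):
    """Signed maximum deviation of a GSEA-style running sum.

    ranked_genes: genes ordered from strongest to weakest association
    scores:       association statistic for each gene, in the same order
    gene_set:     set of genes belonging to the pathway being tested
    """
    hits = [gene in gene_set for gene in ranked_genes]
    n_miss = hits.count(False)
    hit_weights = [abs(s) ** weight if h else 0.0 for s, h in zip(scores, hits)]
    total_hit_weight = sum(hit_weights)
    if n_miss == 0 or total_hit_weight == 0:
        raise ValueError("pathway must cover some, but not all, of the ranked genes")

    running, best = 0.0, 0.0
    for is_hit, w in zip(hits, hit_weights):
        if is_hit:
            running += w / total_hit_weight  # step up, weighted by the gene's score
        else:
            running -= 1.0 / n_miss          # step down by a constant amount
        if abs(running) > abs(best):
            best = running
    return best


# Toy ranking with made-up gene names and scores
ranked_genes = ["G1", "G2", "G3", "G4"]
scores = [4.0, 3.0, 2.0, 1.0]
es = enrichment_score(ranked_genes, scores, {"G1", "G2"})
print(es)
```

A pathway whose genes cluster at the top of the ranking gives a score near 1, and one whose genes cluster at the bottom gives a score near -1.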
7.  Pathway-Based Genome-wide Association Studies Reveal That the Rac1 Pathway Is Associated with Plasma Adiponectin Levels 
Scientific Reports  2015;5:13422.
Pathway-based analysis has been verified as an alternative and effective approach to identifying disease-related genes or loci. To decipher the genetic background of plasma adiponectin levels, we performed genome wide pathway-based association studies in extremely obese individuals and normal-weight controls. The modified Gene Set Enrichment Algorithm (GSEA) was used to perform the pathway-based analyses (the GenGen Program) in 746 European American females, who were drawn from our previous GWAS of extremely obese (BMI > 35 kg/m2) individuals and never-overweight (BMI<25 kg/m2) controls. The Rac1 cell motility signaling pathway was associated with plasma adiponectin after false-discovery rate (FDR) correction (empirical P < 0.001, FDR = 0.008, family-wise error rate = 0.008). Several other Rac1-centered pathways, such as cdc42racPathway (empirical P < 0.001) and hsa00603 (empirical P = 0.003), were among the top associations. The RAC1 pathway association was replicated by the ICSNPathway method, which yielded an FDR of 0.002. Quantitative pathway analyses yielded similar results (empirical P = 0.001) for the Rac1 pathway, although it failed to pass the multiple test correction (FDR = 0.11). We further replicated our pathway associations in the ADIPOGen Consortium data by the GSA-SNP method. Our results suggest that Rac1 and related cell motility pathways might be associated with plasma adiponectin levels and biological functions of adiponectin.
PMCID: PMC4642532  PMID: 26299439
8.  Genome-wide copy number variation study associates metabotropic glutamate receptor gene networks with attention deficit hyperactivity disorder 
Nature Genetics  2011;44(1):78-84.
Attention deficit hyperactivity disorder (ADHD) is a common, heritable neuropsychiatric disorder of unknown etiology. We performed a whole-genome copy number variation (CNV) study on 1,013 cases with ADHD and 4,105 healthy children of European ancestry using 550,000 SNPs. We evaluated statistically significant findings in multiple independent cohorts, with a total of 2,493 cases with ADHD and 9,222 controls of European ancestry, using matched platforms. CNVs affecting metabotropic glutamate receptor genes were enriched across all cohorts (P = 2.1 × 10−9). We saw GRM5 (encoding glutamate receptor, metabotropic 5) deletions in ten cases and one control (P = 1.36 × 10−6). We saw GRM7 deletions in six cases, and we saw GRM8 deletions in eight cases and no controls. GRM1 was duplicated in eight cases. We experimentally validated the observed variants using quantitative RT-PCR. A gene network analysis showed that genes interacting with the genes in the GRM family are enriched for CNVs in ~10% of the cases (P = 4.38 × 10−10) after correction for occurrence in the controls. We identified rare recurrent CNVs affecting glutamatergic neurotransmission genes that were overrepresented in multiple ADHD cohorts.
PMCID: PMC4310555  PMID: 22138692
9.  Integrative genomics identifies LMO1 as a neuroblastoma oncogene 
Nature  2010;469(7329):216-220.
Neuroblastoma is a childhood cancer of the sympathetic nervous system that accounts for approximately 10% of all paediatric oncology deaths1,2. To identify genetic risk factors for neuroblastoma, we performed a genome-wide association study (GWAS) on 2,251 patients and 6,097 control subjects of European ancestry from four case series. Here we report a significant association within LIM domain only 1 (LMO1) at 11p15.4 (rs110419, combined P = 5.2 × 10−16, odds ratio of risk allele = 1.34 (95% confidence interval 1.25–1.44)). The signal was enriched in the subset of patients with the most aggressive form of the disease. LMO1 encodes a cysteine-rich transcriptional regulator, and its paralogues (LMO2, LMO3 and LMO4) have each been previously implicated in cancer. In parallel, we analysed genome-wide DNA copy number alterations in 701 primary tumours. We found that the LMO1 locus was aberrant in 12.4% through a duplication event, and that this event was associated with more advanced disease (P < 0.0001) and survival (P = 0.041). The germline single nucleotide polymorphism (SNP) risk alleles and somatic copy number gains were associated with increased LMO1 expression in neuroblastoma cell lines and primary tumours, consistent with a gain-of-function role in tumorigenesis. Short hairpin RNA (shRNA)-mediated depletion of LMO1 inhibited growth of neuroblastoma cells with high LMO1 expression, whereas forced expression of LMO1 in neuroblastoma cells with low LMO1 expression enhanced proliferation. These data show that common polymorphisms at the LMO1 locus are strongly associated with susceptibility to developing neuroblastoma, but also may influence the likelihood of further somatic alterations at this locus, leading to malignant progression.
PMCID: PMC3320515  PMID: 21124317
10.  Correction: A Genome-Wide Association Study on Obesity and Obesity-Related Traits 
PLoS ONE  2012;7(2):10.1371/annotation/a34ee94e-3e6a-48bd-a19e-398a4bb88580.
PMCID: PMC3293772
11.  Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis 
Sawcer, Stephen | Hellenthal, Garrett | Pirinen, Matti | Spencer, Chris C.A. | Patsopoulos, Nikolaos A. | Moutsianas, Loukas | Dilthey, Alexander | Su, Zhan | Freeman, Colin | Hunt, Sarah E. | Edkins, Sarah | Gray, Emma | Booth, David R. | Potter, Simon C. | Goris, An | Band, Gavin | Oturai, Annette Bang | Strange, Amy | Saarela, Janna | Bellenguez, Céline | Fontaine, Bertrand | Gillman, Matthew | Hemmer, Bernhard | Gwilliam, Rhian | Zipp, Frauke | Jayakumar, Alagurevathi | Martin, Roland | Leslie, Stephen | Hawkins, Stanley | Giannoulatou, Eleni | D’alfonso, Sandra | Blackburn, Hannah | Boneschi, Filippo Martinelli | Liddle, Jennifer | Harbo, Hanne F. | Perez, Marc L. | Spurkland, Anne | Waller, Matthew J | Mycko, Marcin P. | Ricketts, Michelle | Comabella, Manuel | Hammond, Naomi | Kockum, Ingrid | McCann, Owen T. | Ban, Maria | Whittaker, Pamela | Kemppinen, Anu | Weston, Paul | Hawkins, Clive | Widaa, Sara | Zajicek, John | Dronov, Serge | Robertson, Neil | Bumpstead, Suzannah J. | Barcellos, Lisa F. | Ravindrarajah, Rathi | Abraham, Roby | Alfredsson, Lars | Ardlie, Kristin | Aubin, Cristin | Baker, Amie | Baker, Katharine | Baranzini, Sergio E. | Bergamaschi, Laura | Bergamaschi, Roberto | Bernstein, Allan | Berthele, Achim | Boggild, Mike | Bradfield, Jonathan P. | Brassat, David | Broadley, Simon A. | Buck, Dorothea | Butzkueven, Helmut | Capra, Ruggero | Carroll, William M. | Cavalla, Paola | Celius, Elisabeth G. | Cepok, Sabine | Chiavacci, Rosetta | Clerget-Darpoux, Françoise | Clysters, Katleen | Comi, Giancarlo | Cossburn, Mark | Cournu-Rebeix, Isabelle | Cox, Mathew B. | Cozen, Wendy | Cree, Bruce A.C. | Cross, Anne H. | Cusi, Daniele | Daly, Mark J. | Davis, Emma | de Bakker, Paul I.W. 
| Debouverie, Marc | D’hooghe, Marie Beatrice | Dixon, Katherine | Dobosi, Rita | Dubois, Bénédicte | Ellinghaus, David | Elovaara, Irina | Esposito, Federica | Fontenille, Claire | Foote, Simon | Franke, Andre | Galimberti, Daniela | Ghezzi, Angelo | Glessner, Joseph | Gomez, Refujia | Gout, Olivier | Graham, Colin | Grant, Struan F.A. | Guerini, Franca Rosa | Hakonarson, Hakon | Hall, Per | Hamsten, Anders | Hartung, Hans-Peter | Heard, Rob N. | Heath, Simon | Hobart, Jeremy | Hoshi, Muna | Infante-Duarte, Carmen | Ingram, Gillian | Ingram, Wendy | Islam, Talat | Jagodic, Maja | Kabesch, Michael | Kermode, Allan G. | Kilpatrick, Trevor J. | Kim, Cecilia | Klopp, Norman | Koivisto, Keijo | Larsson, Malin | Lathrop, Mark | Lechner-Scott, Jeannette S. | Leone, Maurizio A. | Leppä, Virpi | Liljedahl, Ulrika | Bomfim, Izaura Lima | Lincoln, Robin R. | Link, Jenny | Liu, Jianjun | Lorentzen, Åslaug R. | Lupoli, Sara | Macciardi, Fabio | Mack, Thomas | Marriott, Mark | Martinelli, Vittorio | Mason, Deborah | McCauley, Jacob L. | Mentch, Frank | Mero, Inger-Lise | Mihalova, Tania | Montalban, Xavier | Mottershead, John | Myhr, Kjell-Morten | Naldi, Paola | Ollier, William | Page, Alison | Palotie, Aarno | Pelletier, Jean | Piccio, Laura | Pickersgill, Trevor | Piehl, Fredrik | Pobywajlo, Susan | Quach, Hong L. | Ramsay, Patricia P. | Reunanen, Mauri | Reynolds, Richard | Rioux, John D. | Rodegher, Mariaemma | Roesner, Sabine | Rubio, Justin P. | Rückert, Ina-Maria | Salvetti, Marco | Salvi, Erika | Santaniello, Adam | Schaefer, Catherine A. | Schreiber, Stefan | Schulze, Christian | Scott, Rodney J. | Sellebjerg, Finn | Selmaj, Krzysztof W. | Sexton, David | Shen, Ling | Simms-Acuna, Brigid | Skidmore, Sheila | Sleiman, Patrick M.A. | Smestad, Cathrine | Sørensen, Per Soelberg | Søndergaard, Helle Bach | Stankovich, Jim | Strange, Richard C. | Sulonen, Anna-Maija | Sundqvist, Emilie | Syvänen, Ann-Christine | Taddeo, Francesca | Taylor, Bruce | Blackwell, Jenefer M. 
| Tienari, Pentti | Bramon, Elvira | Tourbah, Ayman | Brown, Matthew A. | Tronczynska, Ewa | Casas, Juan P. | Tubridy, Niall | Corvin, Aiden | Vickery, Jane | Jankowski, Janusz | Villoslada, Pablo | Markus, Hugh S. | Wang, Kai | Mathew, Christopher G. | Wason, James | Palmer, Colin N.A. | Wichmann, H-Erich | Plomin, Robert | Willoughby, Ernest | Rautanen, Anna | Winkelmann, Juliane | Wittig, Michael | Trembath, Richard C. | Yaouanq, Jacqueline | Viswanathan, Ananth C. | Zhang, Haitao | Wood, Nicholas W. | Zuvich, Rebecca | Deloukas, Panos | Langford, Cordelia | Duncanson, Audrey | Oksenberg, Jorge R. | Pericak-Vance, Margaret A. | Haines, Jonathan L. | Olsson, Tomas | Hillert, Jan | Ivinson, Adrian J. | De Jager, Philip L. | Peltonen, Leena | Stewart, Graeme J. | Hafler, David A. | Hauser, Stephen L. | McVean, Gil | Donnelly, Peter | Compston, Alastair
Nature  2011;476(7359):214-219.
Multiple sclerosis (OMIM 126200) is a common disease of the central nervous system in which the interplay between inflammatory and neurodegenerative processes typically results in intermittent neurological disturbance followed by progressive accumulation of disability.1 Epidemiological studies have shown that genetic factors are primarily responsible for the substantially increased frequency of the disease seen in the relatives of affected individuals;2,3 and systematic attempts to identify linkage in multiplex families have confirmed that variation within the Major Histocompatibility Complex (MHC) exerts the greatest individual effect on risk.4 Modestly powered Genome-Wide Association Studies (GWAS)5-10 have enabled more than 20 additional risk loci to be identified and have shown that multiple variants exerting modest individual effects play a key role in disease susceptibility.11 Most of the genetic architecture underlying susceptibility to the disease remains to be defined and is anticipated to require the analysis of sample sizes that are beyond the numbers currently available to individual research groups. In a collaborative GWAS involving 9772 cases of European descent collected by 23 research groups working in 15 different countries, we have replicated almost all of the previously suggested associations and identified at least a further 29 novel susceptibility loci. Within the MHC we have refined the identity of the DRB1 risk alleles and confirmed that variation in the HLA-A gene underlies the independent protective effect attributable to the Class I region. Immunologically relevant genes are significantly over-represented amongst those mapping close to the identified loci and particularly implicate T helper cell differentiation in the pathogenesis of multiple sclerosis.
PMCID: PMC3182531  PMID: 21833088
multiple sclerosis; GWAS; genetics
12.  Common variants at five new loci associated with early-onset inflammatory bowel disease 
Nature Genetics  2009;41(12):1335-1340.
The inflammatory bowel diseases (IBD) Crohn’s disease and ulcerative colitis are common causes of morbidity in children and young adults in the western world. Here we report the results of a genome-wide association study in early-onset IBD involving 3,426 affected individuals and 11,963 genetically matched controls recruited through international collaborations in Europe and North America, thereby extending the results from a previous study of 1,011 individuals with early-onset IBD1. We have identified five new regions associated with early-onset IBD susceptibility, including 16p11 near the cytokine gene IL27 (rs8049439, P = 2.41 × 10−9), 22q12 (rs2412973, P = 1.55 × 10−9), 10q22 (rs1250550, P = 5.63 × 10−9), 2q37 (rs4676410, P = 3.64 × 10−8) and 19q13.11 (rs10500264, P = 4.26 × 10−10). Our scan also detected associations at 23 of 32 loci previously implicated in adult-onset Crohn’s disease and at 8 of 17 loci implicated in adult-onset ulcerative colitis, highlighting the close pathogenetic relationship between early- and adult-onset IBD.
PMCID: PMC3267927  PMID: 19915574
13.  Large Copy-Number Variations Are Enriched in Cases With Moderate to Extreme Obesity 
Diabetes  2010;59(10):2690-2694.
Obesity is an increasingly common disorder that predisposes to several medical conditions, including type 2 diabetes. We investigated whether large and rare copy-number variations (CNVs) differentiate moderate to extreme obesity from never-overweight control subjects.
Using single nucleotide polymorphism (SNP) arrays, we performed a genome-wide CNV survey on 430 obese case subjects (BMI >35 kg/m2) and 379 never-overweight control subjects (BMI <25 kg/m2). All subjects were of European ancestry and were genotyped on the Illumina HumanHap550 arrays with ∼550,000 SNP markers. The CNV calls were generated by PennCNV software.
CNVs >1 Mb were found to be overrepresented in case versus control subjects (odds ratio [OR] = 1.5 [95% CI 0.5–5]), and CNVs >2 Mb were present in 1.3% of the case subjects but were absent in control subjects (OR = infinity [95% CI 1.2–infinity]). When focusing on rare deletions that disrupt genes, even more pronounced effect sizes are observed (OR = 2.7 [95% CI 0.5–27.1] for CNVs >1 Mb). Interestingly, obese case subjects who carry these large CNVs have moderately high BMI and do not appear to be extreme cases. Several CNVs disrupt known candidate genes for obesity, such as a 3.3-Mb deletion disrupting NAP1L5 and a 2.1-Mb deletion disrupting UCP1 and IL15.
Our results suggest that large CNVs, especially rare deletions, confer risk of obesity in patients with moderate obesity and that genes impacted by large CNVs represent intriguing candidates for obesity that warrant further study.
PMCID: PMC3279563  PMID: 20622171
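Record 13 reports an odds ratio of infinity for CNVs >2 Mb because no control subject carried one. The sketch below shows how a 2x2 odds ratio behaves when a cell is zero. The group sizes (430 case and 379 control subjects) come from the abstract, but the carrier counts are hypothetical, since the abstract gives only a percentage. The confidence intervals quoted in the abstract are not reproduced here.

```python
import math


def odds_ratio(case_carriers, case_noncarriers, control_carriers, control_noncarriers):
    """Odds ratio for a 2x2 carrier table; infinite when no control carries the variant."""
    numerator = case_carriers * control_noncarriers
    denominator = case_noncarriers * control_carriers
    if denominator == 0:
        # Undefined (nan) if the numerator is also zero
        return math.inf if numerator > 0 else math.nan
    return numerator / denominator


n_cases, n_controls = 430, 379   # group sizes from the abstract
hypothetical_case_carriers = 6   # illustrative count, not from the study
hypothetical_control_carriers = 0

result = odds_ratio(
    hypothetical_case_carriers,
    n_cases - hypothetical_case_carriers,
    hypothetical_control_carriers,
    n_controls - hypothetical_control_carriers,
)
print(result)
```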
14.  A Genome-Wide Meta-Analysis of Six Type 1 Diabetes Cohorts Identifies Multiple Associated Loci 
PLoS Genetics  2011;7(9):e1002293.
Diabetes impacts approximately 200 million people worldwide, of whom approximately 10% are affected by type 1 diabetes (T1D). The application of genome-wide association studies (GWAS) has robustly revealed dozens of genetic contributors to the pathogenesis of T1D, with the most recent meta-analysis identifying in excess of 40 loci. To identify additional genetic loci for T1D susceptibility, we examined associations in the largest meta-analysis to date between the disease and ∼2.54 million SNPs in a combined cohort of 9,934 cases and 16,956 controls. Targeted follow-up of 53 SNPs in 1,120 affected trios uncovered three new loci associated with T1D that reached genome-wide significance. The most significantly associated SNP (rs539514, P = 5.66×10−11) resides in an intronic region of the LMO7 (LIM domain only 7) gene on 13q22. The second most significantly associated SNP (rs478222, P = 3.50×10−9) resides in an intronic region of the EFR3B (protein EFR3 homolog B) gene on 2p23; however, the region of linkage disequilibrium is approximately 800 kb and harbors multiple additional genes, including NCOA1, C2orf79, CENPO, ADCY3, DNAJC27, POMC, and DNMT3A. The third most significantly associated SNP (rs924043, P = 8.06×10−9) lies in an intergenic region on 6q27, where the region of association is approximately 900 kb and harbors multiple genes including WDR27, C6orf120, PHF10, TCTE3, C6orf208, LOC154449, DLL1, FAM120B, PSMB1, TBP, and PCD2. These latest associated regions add to the growing repertoire of gene networks predisposing to T1D.
Author Summary
Despite the fact that there is clearly a large genetic component to type 1 diabetes (T1D), uncovering the genes contributing to this disease has proven challenging. However, in the past three years there has been relatively major progress in this regard, with advances in genetic screening technologies allowing investigators to scan the genome for variants conferring risk for disease without prior hypotheses. Such genome-wide association studies have revealed multiple regions of the genome to be robustly and consistently associated with T1D. More recent findings have been a consequence of combining multiple datasets from independent investigators in meta-analyses, which have more power to pick up additional variants contributing to the trait. In the current study, we describe the largest meta-analysis of T1D genome-wide genotyped datasets to date, which combines six large studies. As a consequence, we have uncovered three new signals residing at the chromosomal locations 13q22, 2p23, and 6q27, which went on to be replicated in independent sample sets. These latest associated regions add to the growing repertoire of gene networks predisposing to T1D.
PMCID: PMC3183083  PMID: 21980299
15.  Pathway-Wide Association Study Implicates Multiple Sterol Transport and Metabolism Genes in HDL Cholesterol Regulation 
Pathway-based association methods have been proposed as an effective approach to identifying disease genes when single-marker association tests do not have sufficient power. The analysis of quantitative traits may benefit from these approaches, by sampling from the two extreme tails of the distribution. Here we tested a pathway association approach on a small genome-wide association study (GWAS) on 653 subjects with extremely high high-density lipoprotein cholesterol (HDL-C) levels and 784 subjects with low HDL-C levels. We identified 102 genes in the sterol transport and metabolism pathways that collectively associate with HDL-C levels, and replicated these association signals in an independent GWAS. Interestingly, the pathways include 18 genes implicated in previous GWAS on lipid traits, suggesting that genuine HDL-C genes are highly enriched in these pathways. Additionally, multiple biologically relevant loci in the pathways were not detected by previous GWAS, including genes implicated in previous candidate gene association studies (such as LEPR, APOA2, HDLBP, SOAT2), genes that cause Mendelian forms of lipid disorders (such as DHCR24), and genes expressing dyslipidemia phenotypes in knockout mice (such as SOAT1, PON1). Our study suggests that sampling from two extreme tails of a quantitative trait and examining genetic pathways may yield biological insights from smaller samples than are generally required using single-marker analysis in large-scale GWAS. Our results also imply that functionally related genes work together to regulate complex quantitative traits, and that future large-scale studies may benefit from pathway-association approaches to identify novel pathways regulating HDL-C levels.
PMCID: PMC3268595  PMID: 22303337
GWAS; lipid; HDL-C; pathway analysis; cholesterol; sterol transport; sterol metabolism; genetic association
16.  Comparative genetic analysis of inflammatory bowel disease and type 1 diabetes implicates multiple loci with opposite effects 
Human Molecular Genetics  2010;19(10):2059-2067.
Inflammatory bowel disease, including Crohn's disease (CD) and ulcerative colitis (UC), and type 1 diabetes (T1D) are autoimmune diseases that may share common susceptibility pathways. We examined known susceptibility loci for these diseases in a cohort of 1689 CD cases, 777 UC cases, 989 T1D cases and 6197 shared control subjects of European ancestry, who were genotyped by the Illumina HumanHap550 SNP arrays. We identified multiple previously unreported or unconfirmed disease associations, including known CD loci (ICOSLG and TNFSF15) and T1D loci (TNFAIP3) that confer UC risk, known UC loci (HERC2 and IL26) that confer T1D risk and known UC loci (IL10 and CCNY) that confer CD risk. Additionally, we show that T1D risk alleles residing at the PTPN22, IL27, IL18RAP and IL10 loci protect against CD. Furthermore, the strongest risk alleles for T1D within the major histocompatibility complex (MHC) confer strong protection against CD and UC; however, given the multi-allelic nature of the MHC haplotypes, sequencing of the MHC locus will be required to interpret this observation. These results extend our current knowledge on genetic variants that predispose to autoimmunity, and suggest that many loci involved in autoimmunity may be under a balancing selection due to antagonistic pleiotropic effect. Our analysis implies that variants with opposite effects on different diseases may facilitate the maintenance of common susceptibility alleles in human populations, making autoimmune diseases especially amenable to genetic dissection by genome-wide association studies.
PMCID: PMC2860894  PMID: 20176734
17.  A Genome-Wide Association Study on Obesity and Obesity-Related Traits 
PLoS ONE  2011;6(4):e18939.
Large-scale genome-wide association studies (GWAS) have identified many loci associated with body mass index (BMI), but few studies focused on obesity as a binary trait. Here we report the results of a GWAS and candidate SNP genotyping study of obesity, including extremely obese cases and never overweight controls as well as families segregating extreme obesity and thinness. We first performed a GWAS on 520 cases (BMI>35 kg/m2) and 540 control subjects (BMI<25 kg/m2), on measures of obesity and obesity-related traits. We subsequently followed up obesity-associated signals by genotyping the top ∼500 SNPs from GWAS in the combined sample of cases, controls and family members totaling 2,256 individuals. For the binary trait of obesity, we found 16 genome-wide significant signals within the FTO gene (strongest signal at rs17817449, P = 2.5×10−12). We next examined obesity-related quantitative traits (such as total body weight, waist circumference and waist to hip ratio), and detected genome-wide significant signals between waist to hip ratio and NRXN3 (rs11624704, P = 2.67×10−9), previously associated with body weight and fat distribution. Our study demonstrated how a relatively small sample ascertained through extreme phenotypes can detect genuine associations in a GWAS.
PMCID: PMC3084240  PMID: 21552555
18.  Association Between a High-Risk Autism Locus on 5p14 and Social Communication Spectrum Phenotypes in the General Population 
The American journal of psychiatry  2010;167(11):1364-1372.
Recent genome-wide analysis identified a genetic variant on 5p14.1 (rs4307059), which is associated with risk for autism spectrum disorder. This study investigated whether rs4307059 also operates as a quantitative trait locus underlying a broader autism phenotype in the general population, focusing specifically on the social communication aspect of the spectrum.
Study participants were 7,313 children from the Avon Longitudinal Study of Parents and Children. Single-trait and joint-trait genotype associations were investigated for 29 measures related to language and communication, verbal intelligence, social interaction, and behavioral adjustment, assessed between ages 3 and 12 years. Analyses were performed in one-sided or directed mode and adjusted for multiple testing, trait interrelatedness, and random genotype dropout.
Single phenotype analyses showed that an increased load of the rs4307059 risk allele is associated with stereotyped conversation and lower pragmatic communication skills, as measured by the Children's Communication Checklist (at a mean age of 9.7 years). In addition, a trend toward a higher frequency of identification of special educational needs (at a mean age of 11.8 years) was observed. Variation at rs4307059 was also associated with the phenotypic profile of studied traits. This joint signal was fully explained neither by single-trait associations nor by overall behavioral adjustment problems but suggested a combined effect, which manifested through multiple subthreshold social, communicative, and cognitive impairments.
Our results suggest that common variation at 5p14.1 is associated with social communication spectrum phenotypes in the general population and support the role of rs4307059 as a quantitative trait locus for autism spectrum disorder.
PMCID: PMC3008767  PMID: 20634369
19.  Duplication of the SLIT3 Locus on 5q35.1 Predisposes to Major Depressive Disorder 
PLoS ONE  2010;5(12):e15463.
Major depressive disorder (MDD) is a common psychiatric and behavioral disorder. To discover novel variants conferring risk to MDD, we conducted a whole-genome scan of copy number variation (CNV), including 1,693 MDD cases and 4,506 controls genotyped on the Perlegen 600K platform. The most significant locus was observed on 5q35.1, harboring the SLIT3 gene (P = 2×10−3). Extending the controls with 30,000 subjects typed on the Illumina 550 k array, we found the CNV to remain exclusive to MDD cases (P = 3.2×10−9). Duplication was observed in 5 unrelated MDD cases encompassing 646 kb with highly similar breakpoints. SLIT3 is integral to repulsive axon guidance based on binding to Roundabout receptors. Duplication of 5q35.1 is a highly penetrant variation accounting for 0.7% of the subset of 647 cases harboring large CNVs, using a threshold of a minimum of 10 SNPs and 100 kb. This study leverages a large dataset of MDD cases and controls for the analysis of CNVs with matched platform and ethnicity. SLIT3 duplication is a novel association which explains a definitive proportion of the largely unknown etiology of MDD.
PMCID: PMC2995745  PMID: 21152026
20.  Common genetic variants on 5p14.1 associate with autism spectrum disorders 
Nature  2009;459(7246):528-533.
Autism spectrum disorders (ASDs) represent a group of childhood neurodevelopmental and neuropsychiatric disorders characterized by deficits in verbal communication, impairment of social interaction, and restricted and repetitive patterns of interests and behaviour. To identify common genetic risk factors underlying ASDs, here we present the results of genome-wide association studies on a cohort of 780 families (3,101 subjects) with affected children, and a second cohort of 1,204 affected subjects and 6,491 control subjects, all of whom were of European ancestry. Six single nucleotide polymorphisms between cadherin 10 (CDH10) and cadherin 9 (CDH9)—two genes encoding neuronal cell-adhesion molecules—revealed strong association signals, with the most significant SNP being rs4307059 (P = 3.4 × 10−8, odds ratio = 1.19). These signals were replicated in two independent cohorts, with combined P values ranging from 7.4 × 10−8 to 2.1 × 10−10. Our results implicate neuronal cell-adhesion molecules in the pathogenesis of ASDs, and represent, to our knowledge, the first demonstration of genome-wide significant association of common variants with susceptibility to ASDs.
PMCID: PMC2943511  PMID: 19404256
21.  Autism genome-wide copy number variation reveals ubiquitin and neuronal genes 
Nature  2009;459(7246):569-573.
Autism spectrum disorders (ASDs) are childhood neurodevelopmental disorders with complex genetic origins (refs 1–4). Previous studies focusing on candidate genes or genomic regions have identified several copy number variations (CNVs) that are associated with an increased risk of ASDs (refs 5–9). Here we present the results from a whole-genome CNV study on a cohort of 859 ASD cases and 1,409 healthy children of European ancestry who were genotyped with ~550,000 single nucleotide polymorphism markers, in an attempt to comprehensively identify CNVs conferring susceptibility to ASDs. Positive findings were evaluated in an independent cohort of 1,336 ASD cases and 1,110 controls of European ancestry. Besides previously reported ASD candidate genes, such as NRXN1 (ref. 10) and CNTN4 (refs 11, 12), several new susceptibility genes encoding neuronal cell-adhesion molecules, including NLGN1 and ASTN2, were enriched with CNVs in ASD cases compared to controls (P = 9.5 × 10−3). Furthermore, genes involved in the ubiquitin pathways, including UBE3A, PARK2, RFWD2 and FBXO40, were affected by CNVs, within or surrounding the genes, that were not observed in controls (P = 3.3 × 10−3). We also identified duplications 55 kilobases upstream of complementary DNA AK123120 (P = 3.6 × 10−6). Although these variants may be individually rare, they target genes involved in neuronal cell-adhesion or ubiquitin degradation, indicating that these two important gene networks expressed within the central nervous system may contribute to the genetic susceptibility of ASD.
PMCID: PMC2925224  PMID: 19404257
22.  The role of obesity-associated loci identified in genome wide association studies in the determination of pediatric BMI 
Obesity (Silver Spring, Md.)  2009;17(12):2254-2257.
The prevalence of obesity in children and adults in the United States has increased dramatically over the past decade. Besides environmental factors, genetic factors are known to play an important role in the pathogenesis of obesity. A number of genetic determinants of adult BMI have already been established through genome wide association studies. In this study, we examined 25 single nucleotide polymorphisms (SNPs) corresponding to thirteen previously reported genomic loci in 6,078 children with measures of BMI. Fifteen of these SNPs yielded at least nominally significant association to BMI, representing nine different loci including INSIG2, FTO, MC4R, TMEM18, GNPDA2, NEGR1, BDNF, KCTD15 and 1q25. Other loci revealed no evidence for association, namely at MTCH2, SH2B1, 12q13 and 3q27. For the 15 associated variants, the genotype score explained 1.12% of the total variation for BMI z-score. We conclude that among thirteen loci that have been reported to associate with adult BMI, at least nine also contribute to the determination of BMI in childhood as demonstrated by their associations in our pediatric cohort.
PMCID: PMC2860782  PMID: 19478790
23.  Investigation of the locus near MC4R with childhood obesity in Americans of European and African ancestry 
Obesity (Silver Spring, Md.)  2009;17(7):1461-1465.
Recently, a modest but consistently replicated association was demonstrated between obesity and the single nucleotide polymorphism (SNP) rs17782313, 3’ of the MC4R locus, as a consequence of a meta-analysis of genome wide association (GWA) studies of the disease in Caucasian populations. We investigated the association in the context of the childhood form of the disease utilizing data from our ongoing GWA study in a cohort of 728 European American (EA) obese children (BMI ≥ 95th percentile) and 3,960 EA controls (BMI < 95th percentile), as well as 1,008 African American (AA) obese children and 2,715 AA controls. rs571312, rs10871777 and rs476828 (perfect surrogates for rs17782313) yielded odds ratios in the EA cohort of 1.142 (P = 0.045), 1.137 (P = 0.054) and 1.145 (P = 0.042); however, there was no significant association with these SNPs in the AA cohort. When investigating all thirty SNPs present on the Illumina BeadChip at this locus, again there was no evidence for association in AA cases when correcting for the number of tests employed. As such, variants 3’ to the MC4R locus present on the genotyping platform utilized confer a similar magnitude of risk of obesity in Caucasian children as in their adult Caucasian counterparts, but this observation did not extend to African Americans.
PMCID: PMC2860794  PMID: 19265794
24.  ATOM: a powerful gene-based association test by combining optimally weighted markers 
Bioinformatics  2008;25(4):497-503.
Background: Large-scale candidate-gene and genome-wide association studies genotype multiple SNPs within or surrounding a gene, including both tag and functional SNPs. The immense amount of data generated in these studies poses new challenges to analysis. One particularly challenging yet important question is how to best use all genetic information to test whether a gene or a region is associated with the trait of interest.
Methods: Here we propose a powerful gene-based Association Test by combining Optimally Weighted Markers (ATOM) within a genomic region. Due to variation in linkage disequilibrium, different markers often associate with the trait of interest at different levels. To appropriately apportion their contributions, we assign a weight to each marker that is proportional to the amount of information it captures about the trait locus. We analytically derive the optimal weights for both quantitative and binary traits, and describe a procedure for estimating the weights from a reference database such as the HapMap. Compared with existing approaches, our method has several distinct advantages, including (i) the ability to borrow information from an external database to increase power, (ii) the theoretical derivation of optimal marker weights and (iii) the scalability to simultaneous analysis of all SNPs in candidate genes and pathways.
Results: Through extensive simulations and analysis of the FTO gene in our ongoing genome-wide association study on childhood obesity, we demonstrate that ATOM increases the power to detect genetic association as compared with several commonly used multi-marker association tests.
Supplementary information: Supplementary data are available at Bioinformatics online.
PMCID: PMC2642636  PMID: 19074959
25.  From Disease Association to Risk Assessment: An Optimistic View from Genome-Wide Association Studies on Type 1 Diabetes 
PLoS Genetics  2009;5(10):e1000678.
Genome-wide association studies (GWAS) have been fruitful in identifying disease susceptibility loci for common and complex diseases. A remaining question is whether we can quantify individual disease risk based on genotype data, in order to facilitate personalized prevention and treatment for complex diseases. Previous studies have typically failed to achieve satisfactory performance, primarily due to the use of only a limited number of confirmed susceptibility loci. Here we propose that sophisticated machine-learning approaches with a large ensemble of markers may improve the performance of disease risk assessment. We applied a Support Vector Machine (SVM) algorithm on a GWAS dataset generated on the Affymetrix genotyping platform for type 1 diabetes (T1D) and optimized a risk assessment model with hundreds of markers. We subsequently tested this model on an independent Illumina-genotyped dataset with imputed genotypes (1,008 cases and 1,000 controls), as well as a separate Affymetrix-genotyped dataset (1,529 cases and 1,458 controls), resulting in area under ROC curve (AUC) of ∼0.84 in both datasets. In contrast, poor performance was achieved when limited to dozens of known susceptibility loci in the SVM model or logistic regression model. Our study suggests that improved disease risk assessment can be achieved by using algorithms that take into account interactions between a large ensemble of markers. We are optimistic that genotype-based disease risk assessment may be feasible for diseases where a notable proportion of the risk has already been captured by SNP arrays.
Author Summary
An often-touted utility of genome-wide association studies (GWAS) is that the resulting discoveries can facilitate implementation of personalized medicine, in which preventive and therapeutic interventions for complex diseases can be tailored to individual genetic profiles. However, recent studies using whole-genome SNP genotype data for disease risk assessment have generally failed to achieve satisfactory results, leading to a pessimistic view of the utility of genotype data for such purposes. Here we propose that sophisticated machine-learning approaches on a large ensemble of markers, which contain both confirmed and as yet unconfirmed disease susceptibility variants, may improve the performance of disease risk assessment. We tested an algorithm called Support Vector Machine (SVM) on three large-scale datasets for type 1 diabetes and demonstrated that risk assessment can be highly accurate for the disease. Our results suggest that individualized disease risk assessment using whole-genome data may be more successful for some diseases (such as T1D) than other diseases. However, the predictive accuracy will depend on the heritability of the disease under study, the proportion of the genetic risk that is known, and whether the right set of markers and the right algorithms are used.
PMCID: PMC2748686  PMID: 19816555
