1.  Phenotype-specific association of the TGFBR3 locus with nonsyndromic cryptorchidism 
The Journal of urology  2014;193(5):1637-1645.
Based on a genome-wide association study (GWAS) of testicular dysgenesis syndrome (TDS) reporting possible association with TGFBR3, we analyzed GWAS data from a larger, phenotypically restricted cryptorchidism population for potential replication of this signal.
Materials and Methods
We excluded samples based on strict quality control criteria, leaving 844 cases and 2718 controls of European ancestry that were analyzed in 2 separate groups based on genotyping platform. Analyses included genotype imputation at the TGFBR3 locus, association analysis of imputed data with correction for population substructure, subsequent meta-analysis of Group 1 and 2 data and selective genotyping of independent cases (n=330) and controls (n=324) for replication. We also measured Tgfbr3 mRNA levels and performed TGFBR3/betaglycan immunostaining in rat fetal gubernaculum.
We identified suggestive (p≤1×10−4) association of markers in/near TGFBR3 including rs9661103 (OR 1.40, 95% CI 1.20,1.64, p=2.71×10−5) and rs10782968 (OR 1.58, CI 1.26,1.98, p=9.36×10−5) in Groups 1 and 2, respectively. In subgroup analyses, we observed strongest association of rs17576372 (OR 1.42, CI 1.24,1.60; p=1.67×10−4) with proximal and rs11165059 (OR 1.32, CI 1.15,1.38; p=9.42×10−4) with distal testis position, signals in strong linkage disequilibrium with rs9661103 and rs10782968, respectively. Association of the prior GWAS signal (rs12082710) was marginal (OR 1.13, CI 0.99,1.28, p=0.09 for Group 1) and we were unable to replicate signals in our independent cohort. Tgfbr3/betaglycan was differentially expressed in wild type and cryptorchid rat fetal gubernaculum.
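As a rough consistency check on the figures above, a Wald z statistic and two-sided p-value can be recovered from a reported odds ratio and its 95% confidence interval. The sketch below assumes the interval is symmetric on the log-odds scale and was built with the normal quantile 1.96; neither assumption is stated in the abstract, and the function name `p_from_or_ci` is purely illustrative.

```python
import math

def p_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.96):
    """Approximate Wald z and two-sided p-value from an OR and its 95% CI.

    Assumes the CI is symmetric on the log scale (an assumption made
    for illustration, not something the abstract states).
    """
    log_or = math.log(odds_ratio)
    # Standard error recovered from the width of the log-scale interval
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = log_or / se
    # Two-sided tail probability of the standard normal distribution
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p

# rs9661103 in Group 1, as reported: OR 1.40, 95% CI 1.20-1.64
z, p = p_from_or_ci(1.40, 1.20, 1.64)
print(f"z = {z:.2f}, p = {p:.2e}")
```

For these inputs the recovered p-value is on the order of 10−5, in line with the reported value; small differences are expected because the published OR and CI are rounded.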
These data suggest complex or phenotype-specific association of cryptorchidism with TGFBR3 and the gubernaculum as a potential target of TGFβ signaling.
PMCID: PMC4406821  PMID: 25390077
Cryptorchidism; genetic association studies; TGFBR3; gubernaculum
2.  Mutations in SPECC1L, encoding sperm antigen with calponin homology and coiled-coil domains 1-like, are found in some cases of autosomal dominant Opitz G/BBB syndrome 
Journal of medical genetics  2014;52(2):104-110.
Opitz G/BBB syndrome is a heterogeneous disorder characterised by variable expression of midline defects including cleft lip and palate, hypertelorism, laryngotracheoesophageal anomalies, congenital heart defects, and hypospadias. The X-linked form of the condition has been associated with mutations in the MID1 gene on Xp22. The autosomal dominant form has been linked to chromosome 22q11.2, although the causative gene has yet to be elucidated.
Methods and results
In this study, we performed whole exome sequencing on DNA samples from a three-generation family with characteristics of Opitz G/BBB syndrome with negative MID1 sequencing. We identified a heterozygous missense mutation c.1189A>C (p.Thr397Pro) in SPECC1L, located at chromosome 22q11.23. Mutation screening of an additional 19 patients with features of autosomal dominant Opitz G/BBB syndrome identified a c.3247G>A (p.Gly1083Ser) mutation segregating with the phenotype in another three-generation family.
Previously, SPECC1L was shown to be required for proper facial morphogenesis with disruptions identified in two patients with oblique facial clefts. Collectively, these data demonstrate that SPECC1L mutations can cause syndromic forms of facial clefting including some cases of autosomal dominant Opitz G/BBB syndrome and support the original linkage to chromosome 22q11.2.
PMCID: PMC4393015  PMID: 25412741
3.  Genome-wide copy number variation study associates metabotropic glutamate receptor gene networks with attention deficit hyperactivity disorder 
Nature genetics  2011;44(1):78-84.
Attention deficit hyperactivity disorder (ADHD) is a common, heritable neuropsychiatric disorder of unknown etiology. We performed a whole-genome copy number variation (CNV) study on 1,013 cases with ADHD and 4,105 healthy children of European ancestry using 550,000 SNPs. We evaluated statistically significant findings in multiple independent cohorts, with a total of 2,493 cases with ADHD and 9,222 controls of European ancestry, using matched platforms. CNVs affecting metabotropic glutamate receptor genes were enriched across all cohorts (P = 2.1 × 10−9). We saw GRM5 (encoding glutamate receptor, metabotropic 5) deletions in ten cases and one control (P = 1.36 × 10−6). We saw GRM7 deletions in six cases, and we saw GRM8 deletions in eight cases and no controls. GRM1 was duplicated in eight cases. We experimentally validated the observed variants using quantitative RT-PCR. A gene network analysis showed that genes interacting with the genes in the GRM family are enriched for CNVs in ~10% of the cases (P = 4.38 × 10−10) after correction for occurrence in the controls. We identified rare recurrent CNVs affecting glutamatergic neurotransmission genes that were overrepresented in multiple ADHD cohorts.
PMCID: PMC4310555  PMID: 22138692
4.  Association of variants of the interleukin-23 receptor (IL23R) gene with susceptibility to pediatric Crohn’s disease 
Background & Aims
Recently an association was demonstrated between the single nucleotide polymorphism (SNP), rs11209026, within the interleukin-23 receptor (IL23R) locus and Crohn’s disease (CD) as a consequence of a genome wide association study of this disease in adults. We examined the effects of this and other previously reported SNPs at this locus with respect to CD in children.
Utilizing data from our ongoing genome-wide association study in our cohort of 142 pediatric CD cases and 281 matched controls, we investigated the association of the previously reported SNPs at the IL23R locus with the childhood form of this disease.
Using a Fisher’s exact test, the minor allele frequency (MAF) of rs11209026 in the cases was 1.75% while it was 6.61% in controls, yielding a protective odds ratio (OR) of 0.25 (95% CI 0.10 – 0.65; one-sided P = 9.2×10−4). Furthermore, of all the SNPs previously reported, rs11209026 was the most strongly associated. A subsequent family-based association test (which is more resistant to population stratification) with 65 sets of trios derived from our initial patient cohort yielded significant association with rs11209026 in a transmission disequilibrium test (one-sided P=0.0017). In contrast, no association was detected with the CARD15 gene for the IBD phenotype.
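The case-control comparison above can be illustrated with a small allele-count table. The sketch below reconstructs approximate allele counts from the reported minor allele frequencies and cohort sizes, assuming two alleles per subject and no missing genotypes (assumptions made here for illustration; the abstract does not give the counts). The helper names are hypothetical, and the confidence interval uses the Woolf log-scale method, which may differ from the authors' approach.

```python
import math

def odds_ratio_ci(a, b, c, d, z_crit=1.96):
    """OR and Woolf (log-scale) 95% CI for the 2x2 table [[a, b], [c, d]]."""
    or_ = (a * d) / (b * c)
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lo = math.exp(math.log(or_) - z_crit * se)
    hi = math.exp(math.log(or_) + z_crit * se)
    return or_, lo, hi

def fisher_one_sided_less(a, b, c, d):
    """One-sided Fisher p-value P(X <= a) with all table margins fixed."""
    row1, col1, total = a + b, a + c, a + b + c + d
    k_min = max(0, row1 + col1 - total)
    # Hypergeometric probabilities summed over tables at least as extreme
    numer = sum(math.comb(col1, k) * math.comb(total - col1, row1 - k)
                for k in range(k_min, a + 1))
    return numer / math.comb(total, row1)

# Reconstructed allele counts (assumption: 2 alleles per subject, no missing data)
n_case_alleles = 2 * 142
n_ctrl_alleles = 2 * 281
a = round(0.0175 * n_case_alleles)   # minor alleles in cases
c = round(0.0661 * n_ctrl_alleles)   # minor alleles in controls
b = n_case_alleles - a               # major alleles in cases
d = n_ctrl_alleles - c               # major alleles in controls

or_, lo, hi = odds_ratio_ci(a, b, c, d)
p = fisher_one_sided_less(a, b, c, d)
print(f"OR = {or_:.2f} (95% CI {lo:.2f}-{hi:.2f}), one-sided p = {p:.1e}")
```

With these reconstructed counts the odds ratio and interval come out close to the reported 0.25 (0.10 – 0.65); the p-value is only indicative because the true genotype counts are not given.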
The OR of the IL23R variant in our pediatric study is highly comparable with that reported previously in a non-Jewish adult IBD case-control cohort (OR=0.26). As such, variants in the IL23R gene confer a similar magnitude of risk of CD to children as for their adult counterparts.
PMCID: PMC4287202  PMID: 17618837
IL23R; gene; association; Crohn’s Disease
6.  GWAS of blood cell traits identifies novel associated loci and epistatic interactions in Caucasian and African-American children 
Human Molecular Genetics  2012;22(7):1457-1464.
Hematological traits are important clinical indicators, the genetic determinants of which have not been fully investigated. Common measures of hematological traits include red blood cell (RBC) count, hemoglobin concentration (HGB), hematocrit (HCT), mean corpuscular hemoglobin (MCH), MCH concentration (MCHC), mean corpuscular volume (MCV), platelet count (PLT) and white blood cell (WBC) count. We carried out a genome-wide association study of the eight common hematological traits among 7943 African-American children and 6234 Caucasian children. In African Americans, we report five novel associations of HBE1 variants with HCT and MCHC, the alpha-globin gene cluster variants with RBC and MCHC, and a variant at the ARHGEF3 locus with PLT, as well as replication of four previously reported loci at genome-wide significance. In Caucasians, we report a novel association of variants at the COPZ1 locus with PLT as well as replication of four previously reported loci at genome-wide significance. Extended analysis of an association observed between MCH and the alpha-globin gene cluster variants demonstrated independent effects and epistatic interaction at the locus, impacting the risk of iron deficiency anemia in African Americans with specific genotype states. In summary, we extend the understanding of genetic variants underlying hematological traits based on analyses in African-American children.
PMCID: PMC3657475  PMID: 23263863
7.  AGC1 Deficiency Causes Infantile Epilepsy, Abnormal Myelination, and Reduced N-Acetylaspartate 
JIMD Reports  2014;14:77-85.
Background: Whole exome sequencing (WES) offers a powerful diagnostic tool to rapidly and efficiently sequence all coding genes in individuals presenting for consideration of phenotypically and genetically heterogeneous disorders such as suspected mitochondrial disease. Here, we report results of WES and functional validation in a consanguineous Indian kindred where two siblings presented with profound developmental delay, congenital hypotonia, refractory epilepsy, abnormal myelination, fluctuating basal ganglia changes, cerebral atrophy, and reduced N-acetylaspartate (NAA).
Methods: Whole blood DNA from one affected and one unaffected sibling was captured by Agilent SureSelect Human All Exon kit and sequenced on the Illumina HiSeq2000. Mutations were validated by Sanger sequencing in all family members. Protein from wild-type and mutant fibroblasts was isolated to assess mutation effects on protein expression and enzyme activity.
Results: A novel SLC25A12 homozygous missense mutation, c.1058G>A; p.Arg353Gln, segregated with disease in this kindred. SLC25A12 encodes the neuronal aspartate-glutamate carrier 1 (AGC1) protein, an essential component of the neuronal malate/aspartate shuttle that transfers NADH and H+ reducing equivalents from the cytosol to mitochondria. AGC1 activity enables neuronal export of aspartate, the glial substrate necessary for proper neuronal myelination. Recombinant mutant p.Arg353Gln AGC1 activity was reduced to 15% of wild type. One prior reported SLC25A12 mutation caused complete loss of AGC1 activity in a child with epilepsy, hypotonia, hypomyelination, and reduced brain NAA.
Conclusions: These data strongly suggest that SLC25A12 disease impairs neuronal AGC1 activity. SLC25A12 sequencing should be considered in children with infantile epilepsy, congenital hypotonia, global delay, abnormal myelination, and reduced brain NAA.
Electronic supplementary material
The online version of this chapter (doi:10.1007/8904_2013_287) contains supplementary material, which is available to authorized users.
PMCID: PMC4213337  PMID: 24515575
8.  Common variants at 12q15 and 12q24 are associated with infant head circumference 
Taal, H Rob | Pourcain, Beate St | Thiering, Elisabeth | Das, Shikta | Mook-Kanamori, Dennis O | Warrington, Nicole M | Kaakinen, Marika | Kreiner-Møller, Eskil | Bradfield, Jonathan P | Freathy, Rachel M | Geller, Frank | Guxens, Mònica | Cousminer, Diana L | Kerkhof, Marjan | Timpson, Nicholas J | Ikram, M Arfan | Beilin, Lawrence J | Bønnelykke, Klaus | Buxton, Jessica L | Charoen, Pimphen | Chawes, Bo Lund Krogsgaard | Eriksson, Johan | Evans, David M | Hofman, Albert | Kemp, John P | Kim, Cecilia E | Klopp, Norman | Lahti, Jari | Lye, Stephen J | McMahon, George | Mentch, Frank D | Müller, Martina | O’Reilly, Paul F | Prokopenko, Inga | Rivadeneira, Fernando | Steegers, Eric A P | Sunyer, Jordi | Tiesler, Carla | Yaghootkar, Hanieh | Breteler, Monique M B | Debette, Stephanie | Fornage, Myriam | Gudnason, Vilmundur | Launer, Lenore J | van der Lugt, Aad | Mosley, Thomas H | Seshadri, Sudha | Smith, Albert V | Vernooij, Meike W | Blakemore, Alexandra IF | Chiavacci, Rosetta M | Feenstra, Bjarke | Fernandez-Benet, Julio | Grant, Struan F A | Hartikainen, Anna-Liisa | van der Heijden, Albert J | Iñiguez, Carmen | Lathrop, Mark | McArdle, Wendy L | Mølgaard, Anne | Newnham, John P | Palmer, Lyle J | Palotie, Aarno | Pouta, Annneli | Ring, Susan M | Sovio, Ulla | Standl, Marie | Uitterlinden, Andre G | Wichmann, H-Erich | Vissing, Nadja Hawwa | DeCarli, Charles | van Duijn, Cornelia M | McCarthy, Mark I | Koppelman, Gerard H. | Estivill, Xavier | Hattersley, Andrew T | Melbye, Mads | Bisgaard, Hans | Pennell, Craig E | Widen, Elisabeth | Hakonarson, Hakon | Smith, George Davey | Heinrich, Joachim | Jarvelin, Marjo-Riitta | Jaddoe, Vincent W V
Nature genetics  2012;44(5):532-538.
To identify genetic variants associated with head circumference in infancy, we performed a meta-analysis of seven genome-wide association (GWA) studies (N=10,768 individuals of European ancestry enrolled in pregnancy/birth cohorts) and followed up three lead signals in six replication studies (combined N=19,089). Rs7980687 on chromosome 12q24 (P=8.1×10−9), and rs1042725 on chromosome 12q15 (P=2.8×10−10) were robustly associated with head circumference in infancy. Although these loci have previously been associated with adult height1, their effects on infant head circumference were largely independent of height (P=3.8×10−7 for rs7980687, P=1.3×10−7 for rs1042725 after adjustment for infant height). A third signal, rs11655470 on chromosome 17q21, showed suggestive evidence of association with head circumference (P=3.9×10−6). SNPs correlated to the 17q21 signal show genome-wide association with adult intracranial volume2, Parkinson’s disease and other neurodegenerative diseases3-5, indicating that a common genetic variant in this region might link early brain growth with neurological disease in later life.
PMCID: PMC3773913  PMID: 22504419
9.  Gene Network Analysis in a Pediatric Cohort Identifies Novel Lung Function Genes 
PLoS ONE  2013;8(9):e72899.
Lung function is a heritable trait and serves as an important clinical predictor of morbidity and mortality for pulmonary conditions in adults; however, despite its importance, no studies have focused on uncovering pediatric-specific loci influencing lung function. To identify novel genetic determinants of pediatric lung function, we conducted a genome-wide association study (GWAS) of four pulmonary function traits, namely FVC, FEV1, FEV1/FVC and FEF25–75%, in 1556 children. Further, we carried out gene network analyses for each trait including all SNPs with a P-value of <1.0×10−3 from the individual GWAS. The GWAS identified SNPs with notable trends towards association with the pulmonary function measures, including the previously described INTS12 locus association with FEV1 (pmeta = 1.41×10−7). The gene network analyses identified 34 networks of genes associated with pulmonary function variables in Caucasians. Of those, the glycoprotein gene network reached genome-wide significance for all four variables (P-value range on meta-analysis: pmeta = 6.29×10−4 to 2.80×10−8). In this study, we report on specific pathways that are significantly associated with pediatric lung function at genome-wide significance. In addition, we report the first loci associated with lung function in both pediatric Caucasian and African American populations.
PMCID: PMC3759429  PMID: 24023788
10.  Common variants at 5q22 associate with pediatric eosinophilic esophagitis 
Nature genetics  2010;42(4):289-291.
Eosinophilic esophagitis (EoE) is a polygenic disorder characterized by the accumulation of eosinophils in the esophagus. We carried out a genome-wide association study on clinically and biopsy-confirmed EoE patients to identify common variants associated with the disease risk. One hundred and eighty-one EoE samples from Cincinnati Children’s Hospital (CCHMC) and 170 EoE samples and ~3100 controls from Children’s Hospital of Philadelphia (CHOP) were genotyped on the Illumina 550K BeadChip. All patients and controls were of European ancestry. Following standard quality control filtering of the genotype data, we carried out Cochran-Armitage trend tests at each SNP using the CCHMC samples as a discovery cohort. We detected genome-wide association with variants on chr5q22 that mapped to a single LD block encompassing the TSLP and WDR36 genes. The most significantly associated SNP at that locus, which maps upstream of the TSLP gene, remained genome-wide significant after Bonferroni correction (rs3806932, uncorrected P-value = 7.18×10−8, OR = 0.54). Eleven other SNPs in LD with rs3806932 were also significantly associated with EoE and mapped to the same LD block on 5q22. We subsequently replicated the association in the independent CHOP cohort (170 cases, 1130 controls; rs3806932 P-value = 8×10−3, OR = 0.73; combined P-value for rs3806932 across CCHMC and CHOP cohorts = 3.19×10−9). In addition, TSLP was overexpressed in the esophagus of EoE patients compared with control individuals with no differences observed in the expression of WDR36. In conclusion, we have identified the first genetic association with EoE predisposition at 5q22 implicating TSLP and/or WDR36 as genes potentially involved in the pathogenesis of EoE.
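The Bonferroni statement above can be made concrete with a short calculation. The sketch below assumes a family-wise alpha of 0.05 and roughly 550,000 tested SNPs (the nominal content of the 550K array); the abstract does not report the number of SNPs remaining after quality control, so both values are assumptions, and the function name is illustrative.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under Bonferroni correction."""
    return alpha / n_tests

n_snps = 550_000   # assumption: nominal array content; post-QC count not reported
alpha = 0.05       # assumption: conventional family-wise error rate

threshold = bonferroni_threshold(alpha, n_snps)
p_uncorrected = 7.18e-8                      # rs3806932, as reported
passes = p_uncorrected < threshold
# Equivalent view: multiply the p-value by the number of tests, capped at 1
p_adjusted = min(1.0, p_uncorrected * n_snps)

print(f"threshold = {threshold:.2e}, passes = {passes}, adjusted p = {p_adjusted:.3f}")
```

Under these assumptions the per-test threshold is about 9.1×10−8, so the reported uncorrected P-value would pass; a different SNP count after filtering would shift the threshold slightly.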
PMCID: PMC3740732  PMID: 20208534
11.  The missense variation landscape of FTO, MC4R and TMEM18 in obese children of African ancestry 
Obesity (Silver Spring, Md.)  2013;21(1):159-163.
Common variation at the loci harboring FTO, MC4R and TMEM18 is consistently reported as being statistically the most strongly associated with obesity. We investigated if these loci also harbor rarer missense variants that confer substantially higher risk of common childhood obesity in African American (AA) children. We sequenced the exons of FTO, MC4R and TMEM18 in an initial subset of our cohort, i.e. 200 obese (BMI≥95th percentile) and 200 lean AA children (BMI≤5th percentile). Any missense exonic variants uncovered were then genotyped in a further 768 obese and 768 lean (BMI≤50th percentile) children of the same ethnicity. A number of exonic variants were observed from our sequencing effort: seven in FTO, of which four were non-synonymous (A163T, G182A, M400V and A405V), thirteen in MC4R, of which six were non-synonymous (V103I, N123S, S136A, F202L, N240S and I251L) and four in TMEM18, of which two were non-synonymous (P2S and V113L). Follow-up genotyping of these missense variants revealed only one significant difference in allele frequency between cases and controls, namely with N240S in MC4R (Fisher's Exact P = 0.0001). In summary, moderately rare missense variants within the FTO, MC4R and TMEM18 genes observed in our study did not confer risk of common childhood obesity in African Americans except for a degree of evidence for one known loss-of-function variant in MC4R.
PMCID: PMC3605748  PMID: 23505181
Obesity; Pediatrics; Genomics
12.  Age group and sex differences in performance on a computerized neurocognitive battery in children age 8–21 
Neuropsychology  2012;26(2):251-265.
Examine age group effects and sex differences by applying a comprehensive computerized battery of identical behavioral measures linked to brain systems in youths that were already genotyped. Such information is needed to incorporate behavioral data as neuropsychological “biomarkers” in large-scale genomic studies.
We developed and applied a brief computerized neurocognitive battery that provides measures of performance accuracy and response time for executive-control, episodic memory, complex cognition, social cognition and sensorimotor speed domains. We tested a population-based sample of 3500 genotyped youths ages 8–21 years.
Substantial improvement with age occurred for both accuracy and speed, but the rates varied by domain. The most pronounced improvement was noted in executive control functions, specifically attention, and in motor speed, with some effect sizes exceeding 1.8 standard deviation units. The least pronounced age group effect was in memory, where only face memory showed a large effect size on improved accuracy. Sex differences had much smaller effect sizes but were evident, with females outperforming males on attention, word and face memory, reasoning speed and all social cognition tests and males outperforming females in spatial processing and sensorimotor and motor speed. These sex differences in most domains were seen already at the youngest age groups, and age group × sex interactions indicated divergence at the oldest groups with females becoming faster but less accurate than males.
The results indicate that cognitive performance improves substantially in this age span, with large effect sizes that differ by domain. The more pronounced improvement for executive and reasoning domains than for memory suggests that memory capacities have reached their apex before age 8. Performance was sexually modulated and most sex differences were apparent by early adolescence.
PMCID: PMC3295891  PMID: 22251308
cognitive development; neurocognitive endophenotypes; adolescence; social cognition; computerized testing
13.  Integrative genomics identifies LMO1 as a neuroblastoma oncogene 
Nature  2010;469(7329):216-220.
Neuroblastoma is a childhood cancer of the sympathetic nervous system that accounts for approximately 10% of all paediatric oncology deaths1,2. To identify genetic risk factors for neuroblastoma, we performed a genome-wide association study (GWAS) on 2,251 patients and 6,097 control subjects of European ancestry from four case series. Here we report a significant association within LIM domain only 1 (LMO1) at 11p15.4 (rs110419, combined P = 5.2 × 10−16, odds ratio of risk allele = 1.34 (95% confidence interval 1.25–1.44)). The signal was enriched in the subset of patients with the most aggressive form of the disease. LMO1 encodes a cysteine-rich transcriptional regulator, and its paralogues (LMO2, LMO3 and LMO4) have each been previously implicated in cancer. In parallel, we analysed genome-wide DNA copy number alterations in 701 primary tumours. We found that the LMO1 locus was aberrant in 12.4% through a duplication event, and that this event was associated with more advanced disease (P < 0.0001) and survival (P = 0.041). The germline single nucleotide polymorphism (SNP) risk alleles and somatic copy number gains were associated with increased LMO1 expression in neuroblastoma cell lines and primary tumours, consistent with a gain-of-function role in tumorigenesis. Short hairpin RNA (shRNA)-mediated depletion of LMO1 inhibited growth of neuroblastoma cells with high LMO1 expression, whereas forced expression of LMO1 in neuroblastoma cells with low LMO1 expression enhanced proliferation. These data show that common polymorphisms at the LMO1 locus are strongly associated with susceptibility to developing neuroblastoma, but also may influence the likelihood of further somatic alterations at this locus, leading to malignant progression.
PMCID: PMC3320515  PMID: 21124317
14.  Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis 
Sawcer, Stephen | Hellenthal, Garrett | Pirinen, Matti | Spencer, Chris C.A. | Patsopoulos, Nikolaos A. | Moutsianas, Loukas | Dilthey, Alexander | Su, Zhan | Freeman, Colin | Hunt, Sarah E. | Edkins, Sarah | Gray, Emma | Booth, David R. | Potter, Simon C. | Goris, An | Band, Gavin | Oturai, Annette Bang | Strange, Amy | Saarela, Janna | Bellenguez, Céline | Fontaine, Bertrand | Gillman, Matthew | Hemmer, Bernhard | Gwilliam, Rhian | Zipp, Frauke | Jayakumar, Alagurevathi | Martin, Roland | Leslie, Stephen | Hawkins, Stanley | Giannoulatou, Eleni | D’alfonso, Sandra | Blackburn, Hannah | Boneschi, Filippo Martinelli | Liddle, Jennifer | Harbo, Hanne F. | Perez, Marc L. | Spurkland, Anne | Waller, Matthew J | Mycko, Marcin P. | Ricketts, Michelle | Comabella, Manuel | Hammond, Naomi | Kockum, Ingrid | McCann, Owen T. | Ban, Maria | Whittaker, Pamela | Kemppinen, Anu | Weston, Paul | Hawkins, Clive | Widaa, Sara | Zajicek, John | Dronov, Serge | Robertson, Neil | Bumpstead, Suzannah J. | Barcellos, Lisa F. | Ravindrarajah, Rathi | Abraham, Roby | Alfredsson, Lars | Ardlie, Kristin | Aubin, Cristin | Baker, Amie | Baker, Katharine | Baranzini, Sergio E. | Bergamaschi, Laura | Bergamaschi, Roberto | Bernstein, Allan | Berthele, Achim | Boggild, Mike | Bradfield, Jonathan P. | Brassat, David | Broadley, Simon A. | Buck, Dorothea | Butzkueven, Helmut | Capra, Ruggero | Carroll, William M. | Cavalla, Paola | Celius, Elisabeth G. | Cepok, Sabine | Chiavacci, Rosetta | Clerget-Darpoux, Françoise | Clysters, Katleen | Comi, Giancarlo | Cossburn, Mark | Cournu-Rebeix, Isabelle | Cox, Mathew B. | Cozen, Wendy | Cree, Bruce A.C. | Cross, Anne H. | Cusi, Daniele | Daly, Mark J. | Davis, Emma | de Bakker, Paul I.W. 
| Debouverie, Marc | D’hooghe, Marie Beatrice | Dixon, Katherine | Dobosi, Rita | Dubois, Bénédicte | Ellinghaus, David | Elovaara, Irina | Esposito, Federica | Fontenille, Claire | Foote, Simon | Franke, Andre | Galimberti, Daniela | Ghezzi, Angelo | Glessner, Joseph | Gomez, Refujia | Gout, Olivier | Graham, Colin | Grant, Struan F.A. | Guerini, Franca Rosa | Hakonarson, Hakon | Hall, Per | Hamsten, Anders | Hartung, Hans-Peter | Heard, Rob N. | Heath, Simon | Hobart, Jeremy | Hoshi, Muna | Infante-Duarte, Carmen | Ingram, Gillian | Ingram, Wendy | Islam, Talat | Jagodic, Maja | Kabesch, Michael | Kermode, Allan G. | Kilpatrick, Trevor J. | Kim, Cecilia | Klopp, Norman | Koivisto, Keijo | Larsson, Malin | Lathrop, Mark | Lechner-Scott, Jeannette S. | Leone, Maurizio A. | Leppä, Virpi | Liljedahl, Ulrika | Bomfim, Izaura Lima | Lincoln, Robin R. | Link, Jenny | Liu, Jianjun | Lorentzen, Åslaug R. | Lupoli, Sara | Macciardi, Fabio | Mack, Thomas | Marriott, Mark | Martinelli, Vittorio | Mason, Deborah | McCauley, Jacob L. | Mentch, Frank | Mero, Inger-Lise | Mihalova, Tania | Montalban, Xavier | Mottershead, John | Myhr, Kjell-Morten | Naldi, Paola | Ollier, William | Page, Alison | Palotie, Aarno | Pelletier, Jean | Piccio, Laura | Pickersgill, Trevor | Piehl, Fredrik | Pobywajlo, Susan | Quach, Hong L. | Ramsay, Patricia P. | Reunanen, Mauri | Reynolds, Richard | Rioux, John D. | Rodegher, Mariaemma | Roesner, Sabine | Rubio, Justin P. | Rückert, Ina-Maria | Salvetti, Marco | Salvi, Erika | Santaniello, Adam | Schaefer, Catherine A. | Schreiber, Stefan | Schulze, Christian | Scott, Rodney J. | Sellebjerg, Finn | Selmaj, Krzysztof W. | Sexton, David | Shen, Ling | Simms-Acuna, Brigid | Skidmore, Sheila | Sleiman, Patrick M.A. | Smestad, Cathrine | Sørensen, Per Soelberg | Søndergaard, Helle Bach | Stankovich, Jim | Strange, Richard C. | Sulonen, Anna-Maija | Sundqvist, Emilie | Syvänen, Ann-Christine | Taddeo, Francesca | Taylor, Bruce | Blackwell, Jenefer M. 
| Tienari, Pentti | Bramon, Elvira | Tourbah, Ayman | Brown, Matthew A. | Tronczynska, Ewa | Casas, Juan P. | Tubridy, Niall | Corvin, Aiden | Vickery, Jane | Jankowski, Janusz | Villoslada, Pablo | Markus, Hugh S. | Wang, Kai | Mathew, Christopher G. | Wason, James | Palmer, Colin N.A. | Wichmann, H-Erich | Plomin, Robert | Willoughby, Ernest | Rautanen, Anna | Winkelmann, Juliane | Wittig, Michael | Trembath, Richard C. | Yaouanq, Jacqueline | Viswanathan, Ananth C. | Zhang, Haitao | Wood, Nicholas W. | Zuvich, Rebecca | Deloukas, Panos | Langford, Cordelia | Duncanson, Audrey | Oksenberg, Jorge R. | Pericak-Vance, Margaret A. | Haines, Jonathan L. | Olsson, Tomas | Hillert, Jan | Ivinson, Adrian J. | De Jager, Philip L. | Peltonen, Leena | Stewart, Graeme J. | Hafler, David A. | Hauser, Stephen L. | McVean, Gil | Donnelly, Peter | Compston, Alastair
Nature  2011;476(7359):214-219.
Multiple sclerosis (OMIM 126200) is a common disease of the central nervous system in which the interplay between inflammatory and neurodegenerative processes typically results in intermittent neurological disturbance followed by progressive accumulation of disability.1 Epidemiological studies have shown that genetic factors are primarily responsible for the substantially increased frequency of the disease seen in the relatives of affected individuals;2,3 and systematic attempts to identify linkage in multiplex families have confirmed that variation within the Major Histocompatibility Complex (MHC) exerts the greatest individual effect on risk.4 Modestly powered Genome-Wide Association Studies (GWAS)5-10 have enabled more than 20 additional risk loci to be identified and have shown that multiple variants exerting modest individual effects play a key role in disease susceptibility.11 Most of the genetic architecture underlying susceptibility to the disease remains to be defined and is anticipated to require the analysis of sample sizes that are beyond the numbers currently available to individual research groups. In a collaborative GWAS involving 9772 cases of European descent collected by 23 research groups working in 15 different countries, we have replicated almost all of the previously suggested associations and identified at least a further 29 novel susceptibility loci. Within the MHC we have refined the identity of the DRB1 risk alleles and confirmed that variation in the HLA-A gene underlies the independent protective effect attributable to the Class I region. Immunologically relevant genes are significantly over-represented amongst those mapping close to the identified loci and particularly implicate T helper cell differentiation in the pathogenesis of multiple sclerosis.
PMCID: PMC3182531  PMID: 21833088
multiple sclerosis; GWAS; genetics
15.  Common variants at five new loci associated with early-onset inflammatory bowel disease 
Nature Genetics  2009;41(12):1335-1340.
The inflammatory bowel diseases (IBD) Crohn’s disease and ulcerative colitis are common causes of morbidity in children and young adults in the western world. Here we report the results of a genome-wide association study in early-onset IBD involving 3,426 affected individuals and 11,963 genetically matched controls recruited through international collaborations in Europe and North America, thereby extending the results from a previous study of 1,011 individuals with early-onset IBD1. We have identified five new regions associated with early-onset IBD susceptibility, including 16p11 near the cytokine gene IL27 (rs8049439, P = 2.41 × 10−9), 22q12 (rs2412973, P = 1.55 × 10−9), 10q22 (rs1250550, P = 5.63 × 10−9), 2q37 (rs4676410, P = 3.64 × 10−8) and 19q13.11 (rs10500264, P = 4.26 × 10−10). Our scan also detected associations at 23 of 32 loci previously implicated in adult-onset Crohn’s disease and at 8 of 17 loci implicated in adult-onset ulcerative colitis, highlighting the close pathogenetic relationship between early- and adult-onset IBD.
PMCID: PMC3267927  PMID: 19915574
16.  A Genome-Wide Meta-Analysis of Six Type 1 Diabetes Cohorts Identifies Multiple Associated Loci 
PLoS Genetics  2011;7(9):e1002293.
Diabetes impacts approximately 200 million people worldwide, of whom approximately 10% are affected by type 1 diabetes (T1D). The application of genome-wide association studies (GWAS) has robustly revealed dozens of genetic contributors to the pathogenesis of T1D, with the most recent meta-analysis identifying in excess of 40 loci. To identify additional genetic loci for T1D susceptibility, we examined associations in the largest meta-analysis to date between the disease and ∼2.54 million SNPs in a combined cohort of 9,934 cases and 16,956 controls. Targeted follow-up of 53 SNPs in 1,120 affected trios uncovered three new loci associated with T1D that reached genome-wide significance. The most significantly associated SNP (rs539514, P = 5.66×10−11) resides in an intronic region of the LMO7 (LIM domain only 7) gene on 13q22. The second most significantly associated SNP (rs478222, P = 3.50×10−9) resides in an intronic region of the EFR3B (protein EFR3 homolog B) gene on 2p23; however, the region of linkage disequilibrium is approximately 800 kb and harbors multiple additional genes, including NCOA1, C2orf79, CENPO, ADCY3, DNAJC27, POMC, and DNMT3A. The third most significantly associated SNP (rs924043, P = 8.06×10−9) lies in an intergenic region on 6q27, where the region of association is approximately 900 kb and harbors multiple genes including WDR27, C6orf120, PHF10, TCTE3, C6orf208, LOC154449, DLL1, FAM120B, PSMB1, TBP, and PCD2. These latest associated regions add to the growing repertoire of gene networks predisposing to T1D.
Author Summary
Although there is clearly a large genetic component to type 1 diabetes (T1D), uncovering the genes contributing to this disease has proven challenging. However, in the past three years there has been considerable progress in this regard, with advances in genetic screening technologies allowing investigators to scan the genome for variants conferring risk for disease without prior hypotheses. Such genome-wide association studies have revealed multiple regions of the genome to be robustly and consistently associated with T1D. More recent findings have come from combining multiple datasets from independent investigators in meta-analyses, which have more power to pick up additional variants contributing to the trait. In the current study, we describe the largest meta-analysis of T1D genome-wide genotyped datasets to date, which combines six large studies. As a consequence, we have uncovered three new signals residing at the chromosomal locations 13q22, 2p23, and 6q27, which went on to be replicated in independent sample sets. These latest associated regions add to the growing repertoire of gene networks predisposing to T1D.
PMCID: PMC3183083  PMID: 21980299
17.  Examination of All Type 2 Diabetes GWAS Loci Reveals HHEX-IDE as a Locus Influencing Pediatric BMI 
Diabetes  2009;59(3):751-755.
A number of studies have found that BMI in early life influences the risk of developing type 2 diabetes later in life. Our goal was to investigate if any type 2 diabetes variants uncovered through genome-wide association studies (GWAS) impact BMI in childhood.
Using data from an ongoing GWAS of pediatric BMI in our cohort, we investigated the association of pediatric BMI with 20 single nucleotide polymorphisms at 18 type 2 diabetes loci uncovered through GWAS, consisting of ADAMTS9, CDC123-CAMK1D, CDKAL1, CDKN2A/B, EXT2, FTO, HHEX-IDE, IGF2BP2, the intragenic region on 11p12, JAZF1, KCNQ1, LOC387761, MTNR1B, NOTCH2, SLC30A8, TCF7L2, THADA, and TSPAN8-LGR5. We randomly partitioned our cohort exactly in half in order to have a discovery cohort (n = 3,592) and a replication cohort (n = 3,592).
Our data show that the major type 2 diabetes risk–conferring G allele of rs7923837 at the HHEX-IDE locus was associated with higher pediatric BMI in both the discovery (P = 0.0013 and survived correction for 20 tests) and replication (P = 0.023) sets (combined P = 1.01 × 10−4). Association was not detected with any other known type 2 diabetes loci uncovered to date through GWAS except for the well-established FTO.
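The statement that the discovery P value "survived correction for 20 tests" can be checked with a short calculation. The sketch below assumes a Bonferroni correction at a family-wise alpha of 0.05; the abstract does not name the correction method, so both the method and the function names are illustrative assumptions rather than the authors' procedure.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


def survives_bonferroni(p_value: float, n_tests: int, alpha: float = 0.05) -> bool:
    """True if p_value is below the Bonferroni-adjusted threshold."""
    return p_value < bonferroni_threshold(alpha, n_tests)


# Values taken from the abstract: 20 SNPs tested, discovery P = 0.0013.
# Assumption: family-wise alpha of 0.05 (not stated in the abstract).
threshold = bonferroni_threshold(0.05, 20)
discovery_passes = survives_bonferroni(0.0013, 20)

# The replication set tests a single pre-selected SNP, so it is compared
# here against the nominal (uncorrected) alpha.
replication_nominal = 0.023 < 0.05

print(f"per-test threshold: {threshold:.4f}")
print(f"discovery survives correction: {discovery_passes}")
print(f"replication nominally significant: {replication_nominal}")
```

Under these assumptions the per-test threshold is 0.05 / 20 = 0.0025, which the discovery P value of 0.0013 falls below.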
Our data show that the same genetic HHEX-IDE variant, which is associated with type 2 diabetes from previous studies, also influences pediatric BMI.
PMCID: PMC2828649  PMID: 19933996
18.  Examination of Type 2 Diabetes Loci Implicates CDKAL1 as a Birth Weight Gene 
Diabetes  2009;58(10):2414-2418.
A number of studies have found that reduced birth weight is associated with type 2 diabetes later in life; however, the underlying mechanism for this correlation remains unresolved. Recently, association has been demonstrated between low birth weight and single nucleotide polymorphisms (SNPs) at the CDKAL1 and HHEX-IDE loci, regions that were previously implicated in the pathogenesis of type 2 diabetes. In order to investigate whether type 2 diabetes risk–conferring alleles associate with low birth weight in our Caucasian childhood cohort, we examined the effects of 20 such loci on this trait.
Using data from an ongoing genome-wide association study in our cohort of 5,465 Caucasian children with recorded birth weights, we investigated the association of the previously reported type 2 diabetes–associated variation at 20 loci including TCF7L2, HHEX-IDE, PPARG, KCNJ11, SLC30A8, IGF2BP2, CDKAL1, CDKN2A/2B, and JAZF1 with birth weight.
Our data show that the minor allele of rs7756992 (P = 8 × 10−5) at the CDKAL1 locus is strongly associated with lower birth weight, whereas a perfect surrogate for variation previously implicated for the trait at the same locus only yielded nominally significant association (P = 0.01; r² with rs7756992 = 0.677). However, association was not detected with any of the other type 2 diabetes loci studied.
We observe association between lower birth weight and type 2 diabetes risk–conferring alleles at the CDKAL1 locus. Our data show that the same genetic locus that has been identified as a marker for type 2 diabetes in previous studies also influences birth weight.
PMCID: PMC2750235  PMID: 19592620
19.  Common genetic variants on 5p14.1 associate with autism spectrum disorders 
Nature  2009;459(7246):528-533.
Autism spectrum disorders (ASDs) represent a group of childhood neurodevelopmental and neuropsychiatric disorders characterized by deficits in verbal communication, impairment of social interaction, and restricted and repetitive patterns of interests and behaviour. To identify common genetic risk factors underlying ASDs, here we present the results of genome-wide association studies on a cohort of 780 families (3,101 subjects) with affected children, and a second cohort of 1,204 affected subjects and 6,491 control subjects, all of whom were of European ancestry. Six single nucleotide polymorphisms between cadherin 10 (CDH10) and cadherin 9 (CDH9)—two genes encoding neuronal cell-adhesion molecules—revealed strong association signals, with the most significant SNP being rs4307059 (P = 3.4 × 10−8, odds ratio = 1.19). These signals were replicated in two independent cohorts, with combined P values ranging from 7.4 × 10−8 to 2.1 × 10−10. Our results implicate neuronal cell-adhesion molecules in the pathogenesis of ASDs, and represent, to our knowledge, the first demonstration of genome-wide significant association of common variants with susceptibility to ASDs.
PMCID: PMC2943511  PMID: 19404256
20.  Autism genome-wide copy number variation reveals ubiquitin and neuronal genes 
Nature  2009;459(7246):569-573.
Autism spectrum disorders (ASDs) are childhood neurodevelopmental disorders with complex genetic origins (refs 1–4). Previous studies focusing on candidate genes or genomic regions have identified several copy number variations (CNVs) that are associated with an increased risk of ASDs (refs 5–9). Here we present the results from a whole-genome CNV study on a cohort of 859 ASD cases and 1,409 healthy children of European ancestry who were genotyped with ~550,000 single nucleotide polymorphism markers, in an attempt to comprehensively identify CNVs conferring susceptibility to ASDs. Positive findings were evaluated in an independent cohort of 1,336 ASD cases and 1,110 controls of European ancestry. Besides previously reported ASD candidate genes, such as NRXN1 (ref. 10) and CNTN4 (refs 11, 12), several new susceptibility genes encoding neuronal cell-adhesion molecules, including NLGN1 and ASTN2, were enriched with CNVs in ASD cases compared to controls (P = 9.5 × 10−3). Furthermore, genes involved in the ubiquitin pathways, including UBE3A, PARK2, RFWD2 and FBXO40, were affected by CNVs, within or surrounding the genes, that were not observed in controls (P = 3.3 × 10−3). We also identified duplications 55 kilobases upstream of complementary DNA AK123120 (P = 3.6 × 10−6). Although these variants may be individually rare, they target genes involved in neuronal cell-adhesion or ubiquitin degradation, indicating that these two important gene networks expressed within the central nervous system may contribute to the genetic susceptibility of ASD.
PMCID: PMC2925224  PMID: 19404257
22.  The role of height-associated loci identified in genome wide association studies in the determination of pediatric stature 
BMC Medical Genetics  2010;11:96.
Human height is considered highly heritable and correlated with certain disorders, such as type 2 diabetes and cancer. Despite environmental influences, genetic factors are known to play an important role in stature determination. A number of genetic determinants of adult height have already been established through genome wide association studies.
We examined 51 single nucleotide polymorphisms (SNPs) corresponding to the 46 previously reported genomic loci for height in 8,184 European American children with height measurements. We leveraged genotyping data from our ongoing GWA study of height variation in children in order to query the 51 SNPs in this pediatric cohort.
Sixteen of these SNPs yielded at least nominally significant association to height, representing fifteen different loci including EFEMP1-PNPT1, GPR126, C6orf173, SPAG17, Histone class 1, HLA class III and GDF5-UQCC. Other loci revealed no evidence for association, including HMGA1 and HMGA2. For the 16 associated variants, the genotype score explained 1.64% of the total variation for height z-score.
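As an illustration of what "the genotype score explained 1.64% of the total variation" means, the sketch below builds an unweighted score (a count of height-increasing alleles per child) and computes the squared correlation between that score and a height z-score. The abstract does not say whether the score was weighted or how the variance was estimated, so the scoring scheme, the function names, and the toy data are all hypothetical.

```python
from statistics import mean


def genotype_score(allele_counts):
    """Unweighted score: total count of trait-increasing alleles (0, 1 or 2 per SNP)."""
    return sum(allele_counts)


def variance_explained(x, y):
    """Squared Pearson correlation (R^2 of a simple linear regression of y on x)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("both variables must vary")
    return (sxy * sxy) / (sxx * syy)


# Hypothetical toy data: 5 children, 3 SNPs each, with made-up height z-scores.
genotypes = [[0, 1, 2], [1, 1, 0], [2, 2, 1], [0, 0, 0], [1, 2, 2]]
height_z = [0.1, -0.2, 0.6, -0.9, 0.4]

scores = [genotype_score(g) for g in genotypes]
r_squared = variance_explained(scores, height_z)
print(f"fraction of variance explained: {r_squared:.3f}")
```

With real data the fraction would be far smaller than in this toy example; the study reports 1.64% for the 16 associated variants.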
Among 46 loci that have been reported to associate with adult height to date, at least 15 also contribute to the determination of height in childhood.
PMCID: PMC2894790  PMID: 20546612
23.  The role of obesity-associated loci identified in genome wide association studies in the determination of pediatric BMI 
Obesity (Silver Spring, Md.)  2009;17(12):2254-2257.
The prevalence of obesity in children and adults in the United States has increased dramatically over the past decade. Besides environmental factors, genetic factors are known to play an important role in the pathogenesis of obesity. A number of genetic determinants of adult BMI have already been established through genome wide association studies. In this study, we examined 25 single nucleotide polymorphisms (SNPs) corresponding to thirteen previously reported genomic loci in 6,078 children with measures of BMI. Fifteen of these SNPs yielded at least nominally significant association to BMI, representing nine different loci including INSIG2, FTO, MC4R, TMEM18, GNPDA2, NEGR1, BDNF, KCTD15 and 1q25. Other loci revealed no evidence for association, namely at MTCH2, SH2B1, 12q13 and 3q27. For the 15 associated variants, the genotype score explained 1.12% of the total variation for BMI z-score. We conclude that among thirteen loci that have been reported to associate with adult BMI, at least nine also contribute to the determination of BMI in childhood as demonstrated by their associations in our pediatric cohort.
PMCID: PMC2860782  PMID: 19478790
24.  Investigation of the locus near MC4R with childhood obesity in Americans of European and African ancestry 
Obesity (Silver Spring, Md.)  2009;17(7):1461-1465.
Recently, a modest but consistently replicated association was demonstrated between obesity and the single nucleotide polymorphism (SNP) rs17782313, 3’ of the MC4R locus, as a consequence of a meta-analysis of genome wide association (GWA) studies of the disease in Caucasian populations. We investigated the association in the context of the childhood form of the disease utilizing data from our ongoing GWA study in a cohort of 728 European American (EA) obese children (BMI ≥ 95th percentile) and 3,960 EA controls (BMI < 95th percentile), as well as 1,008 African American (AA) obese children and 2,715 AA controls. rs571312, rs10871777 and rs476828 (perfect surrogates for rs17782313) yielded odds ratios in the EA cohort of 1.142 (P = 0.045), 1.137 (P = 0.054) and 1.145 (P = 0.042); however, there was no significant association with these SNPs in the AA cohort. When investigating all thirty SNPs present on the Illumina BeadChip at this locus, again there was no evidence for association in AA cases when correcting for the number of tests employed. As such, variants 3’ to the MC4R locus present on the genotyping platform utilized confer a similar magnitude of risk of obesity in Caucasian children as in their adult Caucasian counterparts, but this observation did not extend to African Americans.
PMCID: PMC2860794  PMID: 19265794
25.  Follow-Up Analysis of Genome-Wide Association Data Identifies Novel Loci for Type 1 Diabetes 
Diabetes  2009;58(1):290-295.
OBJECTIVE—Two recent genome-wide association (GWA) studies have revealed novel loci for type 1 diabetes, a common multifactorial disease with a strong genetic component. To fully utilize the GWA data that we had obtained by genotyping 563 type 1 diabetes probands and 1,146 control subjects, as well as 483 case subject–parent trios, using the Illumina HumanHap550 BeadChip, we designed a full stage 2 study to capture other possible association signals.
RESEARCH DESIGN AND METHODS—From our existing datasets, we selected 982 markers with P < 0.05 in both GWA cohorts. Genotyping these in an independent set of 636 nuclear families with 974 affected offspring revealed 75 markers that also had P < 0.05 in this third cohort. Among these, six single nucleotide polymorphisms in five novel loci also had P < 0.05 in the Wellcome Trust Case-Control Consortium dataset and were further tested in 1,303 type 1 diabetes probands from the Diabetes Control and Complications Trial/Epidemiology of Diabetes Interventions and Complications (DCCT/EDIC) plus 1,673 control subjects.
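The staged selection described above, in which a marker is carried forward only if it has P < 0.05 in every cohort examined so far, can be sketched as a simple filter. The marker identifiers, cohort labels, and P values below are hypothetical placeholders, and the function is an illustrative assumption rather than the authors' pipeline.

```python
def markers_passing_all(p_values_by_cohort, alpha=0.05):
    """Return the set of marker IDs with P < alpha in every cohort supplied.

    p_values_by_cohort maps a cohort label to a dict of {marker_id: p_value}.
    Markers missing from any cohort are excluded.
    """
    cohorts = list(p_values_by_cohort.values())
    if not cohorts:
        return set()
    shared = set(cohorts[0])
    for cohort in cohorts[1:]:
        shared &= set(cohort)
    return {m for m in shared if all(c[m] < alpha for c in cohorts)}


# Hypothetical toy input: three cohorts, four made-up markers.
toy = {
    "case_control": {"snp_a": 0.01, "snp_b": 0.20, "snp_c": 0.03, "snp_d": 0.04},
    "trios":        {"snp_a": 0.02, "snp_b": 0.01, "snp_c": 0.04, "snp_d": 0.30},
    "families":     {"snp_a": 0.04, "snp_b": 0.03, "snp_c": 0.60},
}

# Stage 1: markers nominally significant in both GWA cohorts.
stage1 = markers_passing_all({k: toy[k] for k in ("case_control", "trios")})
# Stage 2: of those, markers also nominally significant in the third cohort.
stage2 = markers_passing_all(toy)
print(sorted(stage1), sorted(stage2))
```

Each stage narrows the candidate list, which is how the study moves from 982 markers to 75 and then to the six single nucleotide polymorphisms tested in the final cohort.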
RESULTS—Two markers (rs9976767 and rs3757247) remained significant after adjusting for the number of tests in this last cohort; they reside in UBASH3A (OR 1.16; combined P = 2.33 × 10−8) and BACH2 (1.13; combined P = 1.25 × 10−6).
CONCLUSIONS—Evaluation of a large number of statistical GWA candidates in several independent cohorts has revealed additional loci that are associated with type 1 diabetes. The two genes at these respective loci, UBASH3A and BACH2, are both biologically relevant to autoimmunity.
PMCID: PMC2606889  PMID: 18840781

