1.  Genome-wide association analyses identify multiple loci associated with central corneal thickness and keratoconus 
Lu, Yi | Vitart, Veronique | Burdon, Kathryn P | Khor, Chiea Chuen | Bykhovskaya, Yelena | Mirshahi, Alireza | Hewitt, Alex W | Koehn, Demelza | Hysi, Pirro G | Ramdas, Wishal D | Zeller, Tanja | Vithana, Eranga N | Cornes, Belinda K | Tay, Wan-Ting | Tai, E Shyong | Cheng, Ching-Yu | Liu, Jianjun | Foo, Jia-Nee | Saw, Seang Mei | Thorleifsson, Gudmar | Stefansson, Kari | Dimasi, David P | Mills, Richard A | Mountain, Jenny | Ang, Wei | Hoehn, René | Verhoeven, Virginie J M | Grus, Franz | Wolfs, Roger | Castagne, Raphaële | Lackner, Karl J | Springelkamp, Henriët | Yang, Jian | Jonasson, Fridbert | Leung, Dexter Y L | Chen, Li J | Tham, Clement C Y | Rudan, Igor | Vatavuk, Zoran | Hayward, Caroline | Gibson, Jane | Cree, Angela J | MacLeod, Alex | Ennis, Sarah | Polasek, Ozren | Campbell, Harry | Wilson, James F | Viswanathan, Ananth C | Fleck, Brian | Li, Xiaohui | Siscovick, David | Taylor, Kent D | Rotter, Jerome I | Yazar, Seyhan | Ulmer, Megan | Li, Jun | Yaspan, Brian L | Ozel, Ayse B | Richards, Julia E | Moroi, Sayoko E | Haines, Jonathan L | Kang, Jae H | Pasquale, Louis R | Allingham, R Rand | Ashley-Koch, Allison | Mitchell, Paul | Wang, Jie Jin | Wright, Alan F | Pennell, Craig | Spector, Timothy D | Young, Terri L | Klaver, Caroline C W | Martin, Nicholas G | Montgomery, Grant W | Anderson, Michael G | Aung, Tin | Willoughby, Colin E | Wiggs, Janey L | Pang, Chi P | Thorsteinsdottir, Unnur | Lotery, Andrew J | Hammond, Christopher J | van Duijn, Cornelia M | Hauser, Michael A | Rabinowitz, Yaron S | Pfeiffer, Norbert | Mackey, David A | Craig, Jamie E | Macgregor, Stuart | Wong, Tien Y
Nature genetics  2013;45(2):155-163.
Central corneal thickness (CCT) is associated with eye conditions including keratoconus and glaucoma. We performed a meta-analysis on >20,000 individuals in European and Asian populations that identified 16 new loci associated with CCT at genome-wide significance (P < 5 × 10−8). We further showed that 2 CCT-associated loci, FOXO1 and FNDC3B, conferred relatively large risks for keratoconus in 2 cohorts with 874 cases and 6,085 controls (rs2721051 near FOXO1 had odds ratio (OR) = 1.62, 95% confidence interval (CI) = 1.4–1.88, P = 2.7 × 10−10, and rs4894535 in FNDC3B had OR = 1.47, 95% CI = 1.29–1.68, P = 4.9 × 10−9). FNDC3B was also associated with primary open-angle glaucoma (P = 5.6 × 10−4; tested in 3 cohorts with 2,979 cases and 7,399 controls). Further analyses implicate the collagen and extracellular matrix pathways in the regulation of CCT.
PMCID: PMC3720123  PMID: 23291589
2.  Ischaemic stroke is associated with the ABO locus: the Euroclot study 
Annals of neurology  2013;73(1):16-31.
End-stage coagulation and the structure/function of fibrin are implicated in the pathogenesis of ischaemic stroke. We explored whether genetic variants associated with end-stage coagulation in healthy volunteers account for the genetic predisposition to ischemic stroke and examined their influence on stroke subtype.
Common genetic variants identified through genome-wide association studies of coagulation factors and fibrin structure/function in healthy twins (n=2,100 Stage 1) were examined in ischemic stroke (n=4,200 cases) using 2 independent samples of European ancestry (Stage 2). A third clinical collection having stroke subtyping (total 8,900 cases and 55,000 controls) was used for replication (Stage 3).
Stage 1 identified 524 SNPs from 23 LD blocks having significant association (p<5×10−8) with one or more coagulation/fibrin phenotypes. The most striking associations included SNP rs5985 with factor XIII activity (p=2.6×10−186), rs10665 with FVII (p=2.4×10−47) and rs505922 in the ABO gene with both von Willebrand Factor (vWF; p=4.7×10−57) and factor VIII (p=1.2×10−36). In Stage 2, the 23 independent SNPs were examined in stroke cases/non-cases using MORGAM and WTCCC2 collections. SNP rs505922 was nominally associated with ischaemic stroke, odds ratio = 0.94 (95% confidence interval, 0.88–0.99), p=0.023. Independent replication in Meta-Stroke confirmed the rs505922 association with stroke, beta=0.066 (0.02), p=0.001, a finding specific to large vessel and cardioembolic stroke (p=0.001 and p<0.001, respectively) but not seen with small vessel stroke (p=0.811).
ABO gene variants are associated with large vessel and cardioembolic stroke but not small vessel disease. This work sheds light on the different pathogenic mechanisms underpinning stroke subtype.
PMCID: PMC3582024  PMID: 23381943
GWAS; thrombosis; stroke; coagulation factor; stroke subtype
3.  Common Variants on 8p12 and 1q24.2 Confer Risk of Schizophrenia 
Nature genetics  2011;43(12):1224-1227.
Schizophrenia is a severe mental disorder affecting ~1% of the world population, with heritability of up to 80%. To identify new common genetic risk factors, we performed a genome-wide association study (GWAS) in the Han Chinese population. The discovery sample set consisted of 3,750 patients and 6,468 healthy controls (1,578 cases and 1,592 controls from the Northern Han; 1,238 cases and 2,856 controls from the Central Han; 934 cases and 2,020 controls from the Southern Han); and we followed up the top association signals in an additional independent cohort of 4,383 cases and 4,539 controls from the Han Chinese. Meta-analysis identified genome-wide significant association of common SNPs with schizophrenia on chromosome 8p12 (rs16887244, P=1.27×10−10) and 1q24.2 (rs10489202, P=9.50×10−9). Our findings provide new insights into the pathogenesis of schizophrenia.
PMCID: PMC3773910  PMID: 22037555
4.  Parental origin of sequence variants associated with complex diseases 
Nature  2009;462(7275):868-874.
The effects of susceptibility variants may depend on the parent from which they are inherited. While many associations between sequence variants and human traits have been discovered through genome-wide associations, the impact of parental origin has largely been ignored. Combining genealogy with long range phasing, we demonstrate that for 38,167 Icelanders genotyped using SNP chips, the parental origin of most alleles can be determined. We then focused on SNPs that associate with diseases and are within 500kb of known imprinted genes. Seven independent SNP associations were examined. Five, one each with breast cancer and basal cell carcinoma, and three with type 2 diabetes (T2D), exhibit parental-origin specific associations. These variants are located in two genomic regions, 11p15 and 7q32, each harbouring a cluster of imprinted genes. Furthermore, a novel variant rs2334499 at 11p15 was seen to associate with T2D where the allele that confers risk when paternally inherited is protective when maternally transmitted. We identified a differentially methylated CTCF binding site at 11p15 and demonstrated correlation of rs2334499 with decreased methylation of that site.
PMCID: PMC3746295  PMID: 20016592
5.  Variant in the sequence of the LINGO1 gene confers risk of essential tremor 
Nature genetics  2009;41(3):277-279.
We identified a marker in LINGO1 showing genome-wide significant association (P = 1.2 × 10−9, odds ratio = 1.55) with essential tremor. LINGO1 has potent, negative regulatory influences on neuronal survival and is also important in regulating both central-nervous-system axon regeneration and oligodendrocyte maturation. An increase in the number of fusiform swellings of Purkinje cell axons in LINGO1 knockout models highlights the potential role of LINGO1 in essential tremor pathophysiology.
PMCID: PMC3740956  PMID: 19182806
6.  Variant of TREM2 Associated with the Risk of Alzheimer’s Disease 
The New England journal of medicine  2012;368(2):107-116.
Sequence variants, including the ε4 allele of apolipoprotein E, have been associated with the risk of the common late-onset form of Alzheimer’s disease. Few rare variants affecting the risk of late-onset Alzheimer’s disease have been found.
We obtained the genome sequences of 2261 Icelanders and identified sequence variants that were likely to affect protein function. We imputed these variants into the genomes of patients with Alzheimer’s disease and control participants and then tested for an association with Alzheimer’s disease. We performed replication tests using case–control series from the United States, Norway, the Netherlands, and Germany. We also tested for a genetic association with cognitive function in a population of unaffected elderly persons.
A rare missense mutation (rs75932628-T) in the gene encoding the triggering receptor expressed on myeloid cells 2 (TREM2), which was predicted to result in an R47H substitution, was found to confer a significant risk of Alzheimer’s disease in Iceland (odds ratio, 2.92; 95% confidence interval [CI], 2.09 to 4.09; P = 3.42×10−10). The mutation had a frequency of 0.46% in controls 85 years of age or older. We observed the association in additional sample sets (odds ratio, 2.90; 95% CI, 2.16 to 3.91; P = 2.1×10−12 in combined discovery and replication samples). We also found that carriers of rs75932628-T between the ages of 80 and 100 years without Alzheimer’s disease had poorer cognitive function than noncarriers (P = 0.003).
Our findings strongly implicate variant TREM2 in the pathogenesis of Alzheimer’s disease. Given the reported antiinflammatory role of TREM2 in the brain, the R47H substitution may lead to an increased predisposition to Alzheimer’s disease through impaired containment of inflammatory processes. (Funded by the National Institute on Aging and others.)
PMCID: PMC3677583  PMID: 23150908
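As a rough consistency check on the Icelandic TREM2 result in entry 6, the reported odds ratio and 95% CI can be converted back into an approximate z-score and two-sided P value. The sketch below assumes a Wald-type interval on the log-odds scale and a normal approximation, neither of which the abstract states; the function name `wald_p_from_ci` is purely illustrative.

```python
import math

def wald_p_from_ci(odds_ratio, ci_low, ci_high, z_crit=1.96):
    """Approximate z-score and two-sided P from an OR and its 95% CI.

    Assumption: the interval is a Wald interval, i.e. symmetric around
    log(OR) with half-width z_crit * SE.
    """
    log_or = math.log(odds_ratio)
    # Recover the standard error of log(OR) from the interval width.
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = log_or / se
    # Two-sided tail probability of a standard normal.
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p

# Values reported for rs75932628-T in Iceland (entry 6).
z, p = wald_p_from_ci(2.92, 2.09, 4.09)
print(f"z = {z:.2f}, P = {p:.1e}")
```

Under these assumptions the recovered P value falls in the same order of magnitude as the reported P = 3.42×10−10; an exact match is not expected because the published OR and CI are rounded.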
7.  Common variants on 9q22.33 and 14q13.3 predispose to thyroid cancer in European populations 
Nature genetics  2009;41(4):460-464.
In order to search for sequence variants conferring risk of thyroid cancer we conducted a genome-wide association study in 192 and 37,196 Icelandic cases and controls, respectively, followed by a replication study in individuals of European descent. Here we show that two common variants, located on 9q22.33 and 14q13.3, are associated with the disease. Overall, the strongest association signals were observed for rs965513 on 9q22.33 (OR = 1.75; P = 1.7 × 10−27) and rs944289 on 14q13.3 (OR = 1.37; P = 2.0 × 10−9). The gene nearest to the 9q22.33 locus is FOXE1 (TTF2) and NKX2-1 (TTF1) is among the genes located at the 14q13.3 locus. Both variants contribute to an increased risk of both papillary and follicular thyroid cancer. Approximately 3.7% of individuals are homozygous for both variants, and their estimated risk of thyroid cancer is 5.7-fold greater than that of noncarriers. In a study on a large sample set from the general population, both risk alleles are associated with low concentrations of thyroid stimulating hormone (TSH), and the 9q22.33 allele is associated with low concentration of thyroxin (T4) and high concentration of triiodothyronine (T3).
PMCID: PMC3664837  PMID: 19198613
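The 5.7-fold figure quoted in entry 7 for individuals homozygous for both thyroid cancer variants is what one would obtain from the two reported per-allele odds ratios under a simple multiplicative model. The sketch below assumes independent, multiplicative per-allele effects with no interaction; the abstract does not spell out the model, and `combined_risk` is a hypothetical helper.

```python
def combined_risk(per_allele_ors, copies_per_locus=2):
    """Relative risk versus noncarriers under a multiplicative model.

    Assumption: each risk allele multiplies risk by its per-allele OR,
    and loci act independently.
    """
    risk = 1.0
    for per_allele_or in per_allele_ors:
        risk *= per_allele_or ** copies_per_locus
    return risk

# Per-allele ORs for rs965513 (9q22.33) and rs944289 (14q13.3).
risk = combined_risk([1.75, 1.37])
print(f"{risk:.2f}")  # about 5.75, in line with the reported 5.7-fold
```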
8.  A direct characterization of human mutation based on microsatellites 
Nature genetics  2012;44(10):1161-1165.
Mutations are the raw material of evolution, but have been difficult to study directly. We report the largest study of new mutations to date: 2,058 germline changes discovered by analyzing 85,289 Icelanders at 2,477 microsatellites. The paternal-to-maternal mutation rate ratio is 3.3, and the rate in fathers doubles from age 20 to 58 whereas there is no association with age in mothers. Longer microsatellite alleles are more mutagenic and tend to decrease in length, whereas the opposite is seen for shorter alleles. We use these empirical observations to build a model that we apply to individuals for whom we have both genome sequence and microsatellite data, allowing us to estimate key parameters of evolution without calibration to the fossil record. We infer that the sequence mutation rate is 1.4–2.3×10−8 per base pair per generation (90% credible interval), and that human-chimpanzee speciation occurred 3.7–6.6 million years ago.
PMCID: PMC3459271  PMID: 22922873
9.  Rate of de novo mutations, father’s age, and disease risk 
Nature  2012;488(7412):471-475.
Mutations generate sequence diversity and provide a substrate for selection. The rate of de novo mutations is therefore of major importance to evolution. We conducted a study of genomewide mutation rate by sequencing the entire genomes of 78 Icelandic parent-offspring trios at high coverage. Here we show that in our samples, with an average father’s age of 29.7, the average de novo mutation rate is 1.20×10−8 per nucleotide per generation. Most strikingly, the diversity in mutation rate of single-nucleotide polymorphism (SNP) is dominated by the age of the father at conception of the child. The effect is an increase of about 2 mutations per year. After accounting for random Poisson variation, father’s age is estimated to explain nearly all of the remaining variation in the de novo mutation counts. These observations shed light on the importance of the father’s age on the risk of diseases such as schizophrenia and autism.
PMCID: PMC3548427  PMID: 22914163
10.  Maternally Derived Microduplications at 15q11-q13: Implication of Imprinted Genes in Psychotic Illness 
The American journal of psychiatry  2011;168(4):408-417.
Rare copy number variants have been implicated in different neurodevelopmental disorders, with the same copy number variants often increasing risk of more than one of these phenotypes. In a discovery sample of 22 schizophrenia patients with an early onset of illness (10–15 years of age), the authors observed in one patient a maternally derived 15q11-q13 duplication overlapping the Prader-Willi/Angelman syndrome critical region. This prompted investigation of the role of 15q11-q13 duplications in psychotic illness.
The authors scanned 7,582 patients with schizophrenia or schizoaffective disorder and 41,370 comparison subjects without known psychiatric illness for copy number variants at 15q11-q13 and determined the parental origin of duplications using methylation-sensitive Southern hybridization analysis.
Duplications were found in four case patients and five comparison subjects. All four case patients had maternally derived duplications (0.05%), while only three of the five comparison duplications were maternally derived (0.007%), resulting in a significant excess of maternally derived duplications in case patients (odds ratio=7.3). This excess is compatible with earlier observations that risk for psychosis in people with Prader-Willi syndrome caused by maternal uniparental disomy is much higher than in those caused by deletion of the paternal chromosome.
These findings suggest that the presence of two maternal copies of a fragment of chromosome 15q11.2-q13.1 that overlaps with the Prader-Willi/Angelman syndrome critical region may be a rare risk factor for schizophrenia and other psychoses. Given that maternal duplications of this region are among the most consistent cytogenetic observations in autism, the findings provide further support for a shared genetic etiology between autism and psychosis.
PMCID: PMC3428917  PMID: 21324950
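The odds ratio of 7.3 in entry 10 can be reproduced from the counts given in the abstract (four maternally derived duplications among 7,582 patients versus three among 41,370 comparison subjects). The sketch below computes a crude, unadjusted odds ratio from a 2×2 table; whether the authors used exactly this estimator is an assumption, and `odds_ratio` is an illustrative helper.

```python
def odds_ratio(carrier_cases, total_cases, carrier_controls, total_controls):
    """Crude odds ratio from a 2x2 table of carriers vs. noncarriers."""
    a = carrier_cases
    b = total_cases - carrier_cases          # noncarrier cases
    c = carrier_controls
    d = total_controls - carrier_controls    # noncarrier controls
    return (a * d) / (b * c)

# Maternally derived 15q11-q13 duplications (entry 10).
or_maternal = odds_ratio(4, 7582, 3, 41370)
freq_cases = 100 * 4 / 7582       # percent of case patients
freq_controls = 100 * 3 / 41370   # percent of comparison subjects
print(f"OR = {or_maternal:.1f}")  # about 7.3
print(f"{freq_cases:.2f}% vs {freq_controls:.3f}%")
```

The carrier frequencies computed this way round to the 0.05% and 0.007% quoted in the abstract.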
11.  Geographic Differences in Genetic Susceptibility to IgA Nephropathy: GWAS Replication Study and Geospatial Risk Analysis 
PLoS Genetics  2012;8(6):e1002765.
IgA nephropathy (IgAN), a major cause of kidney failure worldwide, is common in Asians, moderately prevalent in Europeans, and rare in Africans. It is not known if these differences represent variation in genes, environment, or ascertainment. In a recent GWAS, we localized five IgAN susceptibility loci on Chr.6p21 (HLA-DQB1/DRB1, PSMB9/TAP1, and DPA1/DPB2 loci), Chr.1q32 (CFHR3/R1 locus), and Chr.22q12 (HORMAD2 locus). These IgAN loci are associated with risk of other immune-mediated disorders such as type I diabetes, multiple sclerosis, or inflammatory bowel disease. We tested association of these loci in eight new independent cohorts of Asian, European, and African-American ancestry (N = 4,789), followed by meta-analysis with risk-score modeling in 12 cohorts (N = 10,755) and geospatial analysis in 85 world populations. Four susceptibility loci robustly replicated and all five loci were genome-wide significant in the combined cohort (P = 5×10−32–3×10−10), with heterogeneity detected only at the PSMB9/TAP1 locus (I² = 0.60). Conditional analyses identified two new independent risk alleles within the HLA-DQB1/DRB1 locus, defining multiple risk and protective haplotypes within this interval. We also detected a significant genetic interaction, whereby the odds ratio for the HORMAD2 protective allele was reversed in homozygotes for a CFHR3/R1 deletion (P = 2.5×10−4). A seven–SNP genetic risk score, which explained 4.7% of overall IgAN risk, increased sharply with Eastward and Northward distance from Africa (r = 0.30, P = 3×10−128). This model paralleled the known East–West gradient in disease risk. Moreover, the prediction of a South–North axis was confirmed by registry data showing that the prevalence of IgAN–attributable kidney failure is increased in Northern Europe, similar to multiple sclerosis and type I diabetes. Variation at IgAN susceptibility loci correlates with differences in disease prevalence among world populations.
These findings inform genetic, biological, and epidemiological investigations of IgAN and permit cross-comparison with other complex traits that share genetic risk loci and geographic patterns with IgAN.
Author Summary
IgA nephropathy (IgAN) is the most common cause of kidney failure in Asia, has lower prevalence in Europe, and is very infrequent among populations of African ancestry. A long-standing question in the field is whether these differences represent variation in genes, environment, or ascertainment. In a recent genome-wide association study of 5,966 individuals, we identified five susceptibility loci for this trait. In this paper, we study the largest IgAN case-control cohort reported to date, composed of 10,775 individuals of European, Asian, and African-American ancestry. We confirm that all five loci are significant contributors to disease risk across this multi-ethnic cohort. In addition, we identify two novel independent susceptibility alleles within the HLA-DQB1/DRB1 locus and a new genetic interaction between loci on Chr.1q32 and Chr.22q12. We develop a seven–SNP genetic risk score that explains nearly 5% of variation in disease risk. In geospatial analysis of 85 world populations, the genetic risk score closely parallels worldwide patterns of disease prevalence. The genetic risk score also predicts an unsuspected Northward risk gradient in Europe. This genetic prediction is verified by examination of registry data demonstrating, similarly to other immune-mediated diseases such as multiple sclerosis and type I diabetes, a previously unrecognized increase in IgAN–attributable kidney failure in Northern European countries.
PMCID: PMC3380840  PMID: 22737082
12.  Six Novel Susceptibility Loci for Early-Onset Androgenetic Alopecia and Their Unexpected Association with Common Diseases 
PLoS Genetics  2012;8(5):e1002746.
Androgenetic alopecia (AGA) is a highly heritable condition and the most common form of hair loss in humans. Susceptibility loci have been described on the X chromosome and chromosome 20, but these loci explain a minority of its heritable variance. We conducted a large-scale meta-analysis of seven genome-wide association studies for early-onset AGA in 12,806 individuals of European ancestry. While replicating the two AGA loci on the X chromosome and chromosome 20, six novel susceptibility loci reached genome-wide significance (p = 2.62×10−9–1.01×10−12). Unexpectedly, we identified a risk allele at 17q21.31 that was recently associated with Parkinson's disease (PD) at a genome-wide significant level. We then tested the association between early-onset AGA and the risk of PD in a cross-sectional analysis of 568 PD cases and 7,664 controls. Early-onset AGA cases had significantly increased odds of subsequent PD (OR = 1.28, 95% confidence interval: 1.06–1.55, p = 8.9×10−3). Further, the AGA susceptibility alleles at the 17q21.31 locus are on the H1 haplotype, which is under negative selection in Europeans and has been linked to decreased fertility. Combining the risk alleles of six novel and two established susceptibility loci, we created a genotype risk score and tested its association with AGA in an additional sample. Individuals in the highest risk quartile of a genotype score had an approximately six-fold increased risk of early-onset AGA [odds ratio (OR) = 5.78, p = 1.4×10−88]. Our results highlight unexpected associations between early-onset AGA, Parkinson's disease, and decreased fertility, providing important insights into the pathophysiology of these conditions.
Author Summary
While most genome-wide association studies (GWAS) focus on the identification of susceptibility loci for a specific disease, this hypothesis-free approach also enables the identification of unexpected associations between different diseases by taking advantage of the previously published GWAS associations. Androgenetic Alopecia (AGA, also known as male pattern baldness) is the most common type of hair loss in humans. Parkinson's disease is reported to occur more commonly in men than in women; however, there are no studies investigating the link between AGA and Parkinson's disease. Here, we show that a specific genetic locus, chromosome 17q21.31, which is associated with Parkinson's disease, is also a susceptibility locus for early-onset AGA. We further investigate the association between early-onset AGA and Parkinson's disease, irrespective of genotype, directly in a large-scale web-based study. We find that men with early-onset AGA have 28% higher risk of developing Parkinson's disease. The early-onset AGA locus on chromosome 17q21.31 has also been linked to decreased fertility previously. Future studies of this locus may implicate novel biological pathways affecting these three conditions.
PMCID: PMC3364959  PMID: 22693459
13.  A Genome-Wide Association Study identifies a locus on chromosome 7q22 to influence susceptibility for osteoarthritis 
Arthritis and Rheumatism  2010;62(2):499-510.
To identify genes involved in osteoarthritis (OA), the most prevalent form of joint disease, we performed a genome-wide association study (GWAS) in which we tested 500,510 single nucleotide polymorphisms (SNPs) in 1341 OA cases and 3496 Dutch Caucasian controls. SNPs associated with at least two OA-phenotypes were analysed in 14,938 OA cases and approximately 39,000 controls. The C-allele of rs3815148 on chromosome 7q22 (MAF 23%, 172 kb upstream of the GPR22 gene) was consistently associated with a 1.14-fold increased risk (95%CI: 1.09–1.19) for knee- and/or hand-OA (p=8×10−8), and also with a 30% increased risk for knee-OA progression (95%CI: 1.03–1.64, p=0.03). This SNP is in almost complete linkage disequilibrium with rs3757713 (located 68 kb upstream of GPR22) which is associated with GPR22 expression levels in lymphoblast cell lines (p=4×10−12). GPR22 encodes a G-protein coupled receptor with unknown ligand (orphan receptor). Immunohistochemistry experiments showed absence of GPR22 in normal mouse articular cartilage or synovium. However, GPR22-positive chondrocytes were found in the upper layers of the articular cartilage of mouse knee joints that were challenged by in vivo papain treatment or in the presence of interleukin-1 driven inflammation. GPR22-positive chondrocyte-like cells were also found in osteophytes in instability-induced OA. In addition, GPR22 is present in areas of the brain involved in locomotor function. Our findings reveal a novel common variant on chromosome 7q22 that influences susceptibility for prevalence and progression of OA.
PMCID: PMC3354739  PMID: 20112360
14.  Sequence variants at CYP1A1–CYP1A2 and AHR associate with coffee consumption 
Human Molecular Genetics  2011;20(10):2071-2077.
Coffee is the most commonly used stimulant and caffeine is its main psychoactive ingredient. The heritability of coffee consumption has been estimated at around 50%. We performed a meta-analysis of four genome-wide association studies of coffee consumption among coffee drinkers from Iceland (n = 2680), the Netherlands (n = 2791), the Sorbs Slavonic population isolate in Germany (n = 771) and the USA (n = 369) using both directly genotyped and imputed single nucleotide polymorphisms (SNPs) (2.5 million SNPs). SNPs at the two most significant loci were also genotyped in a sample set from Iceland (n = 2430) and a Danish sample set consisting of pregnant women (n = 1620). Combining all data, two sequence variants significantly associated with increased coffee consumption: rs2472297-T located between CYP1A1 and CYP1A2 at 15q24 (P = 5.4 × 10−14) and rs6968865-T near aryl hydrocarbon receptor (AHR) at 7p21 (P = 2.3 × 10−11). An effect of ∼0.2 cups a day per allele was observed for both SNPs. CYP1A2 is the main caffeine metabolizing enzyme and is also involved in drug metabolism. AHR detects xenobiotics, such as polycyclic aryl hydrocarbons found in roasted coffee, and induces transcription of CYP1A1 and CYP1A2. The association of these SNPs with coffee consumption was present in both smokers and non-smokers.
PMCID: PMC3080612  PMID: 21357676
15.  An analysis of single nucleotide polymorphisms of 125 DNA repair genes in the Texas genome-wide association study of lung cancer with a replication for the XRCC4 SNPs 
DNA repair  2011;10(4):398-407.
DNA repair genes are important for maintaining genomic stability and limiting carcinogenesis. We analyzed all single nucleotide polymorphisms (SNPs) of 125 DNA repair genes covered by the Illumina HumanHap300 (v1.1) BeadChips in a previously conducted genome-wide association study (GWAS) of 1,154 lung cancer cases and 1,137 controls and replicated the top hits of XRCC4 SNPs in an independent set of 597 cases and 611 controls in Texas populations. We found that six of 20 XRCC4 SNPs were associated with a decreased risk of lung cancer with a P value of 0.01 or lower in the discovery dataset, of which the most significant SNP was rs10040363 (P for allelic test = 4.89×10−4). Moreover, the data in this region allowed us to impute a potentially functional SNP rs2075685 (imputed P for allelic test = 1.3×10−3). A luciferase reporter assay demonstrated that the rs2075685G>T change in the XRCC4 promoter increased expression of the gene. In the replication study of rs10040363, rs1478486, rs9293329, and rs2075685, however, only rs10040363 achieved a borderline association with a decreased risk of lung cancer in a dominant model (adjusted OR = 0.80, 95% CI = 0.62–1.03, P = 0.079). In the final combined analysis of both the Texas GWAS discovery and replication datasets, the strength of the association was increased for rs10040363 (adjusted OR = 0.77, 95% CI = 0.66–0.89, Pdominant = 5×10−4 and P for trend = 5×10−4) and rs1478486 (adjusted OR = 0.82, 95% CI = 0.71–0.94, Pdominant = 6×10−3 and P for trend = 3.5×10−3). Finally, we conducted a meta-analysis of these XRCC4 SNPs with available data from published GWA studies of lung cancer with a total of 12,312 cases and 47,921 controls, in which none of these XRCC4 SNPs was associated with lung cancer risk. It appeared that rs2075685, although associated with increased expression of a reporter gene and lung cancer risk in the Texas populations, did not have an effect on lung cancer risk in other populations.
This study underscores the importance of replication using published data in larger populations.
PMCID: PMC3062723  PMID: 21296624
XRCC4; variant; Genetic susceptibility; genome-wide association study; replication study
16.  Association of a novel functional promoter variant (rs2075533 C>T) in the apoptosis gene TNFSF8 with risk of lung cancer—a finding from Texas lung cancer genome-wide association study 
Carcinogenesis  2011;32(4):507-515.
Published genome-wide association studies (GWASs) have identified few variants in the known biological pathways involved in lung cancer etiology. To mine the possibly hidden causal single nucleotide polymorphisms (SNPs), we explored all SNPs in the extrinsic apoptosis pathway from our published GWAS dataset for 1154 lung cancer cases and 1137 cancer-free controls. In an initial association analysis of 611 tagSNPs in 41 apoptosis-related genes, we identified only 10 tagSNPs associated with lung cancer risk with a P value <10−2, including four tagSNPs in DAPK1 and three tagSNPs in TNFSF8. Unlike DAPK1 SNPs, TNFSF8 rs2181033 tagged four other predicted functional but untyped SNPs (rs776576, rs776577, rs31813148 and rs2075533) in the promoter region. Therefore, we further tested binding affinity of these four SNPs by performing the electrophoretic mobility shift assay. We found that only the rs2075533T allele modified levels of nuclear proteins bound to DNA, leading to significantly decreased expression of luciferase reporter constructs by 5- to 10-fold in H1299, HeLa and HCT116 cell lines compared with the C allele. We also performed a replication study of the untyped rs2075533 in an independent Texas population but did not confirm the protective effect. We further performed a mini meta-analysis for SNPs of TNFSF8 obtained from four other published lung cancer GWASs with 12,214 cases and 47,721 controls, and we found that only rs3181366 (r2 = 0.69 with the untyped rs2075533) was associated with lung cancer risk (P = 0.008). Our findings suggest a possible role of novel TNFSF8 variants in susceptibility to lung cancer.
PMCID: PMC3066422  PMID: 21292647
17.  Genome-wide significant association between a sequence variant at 15q15.2 and lung cancer risk 
Cancer research  2011;71(4):1356-1361.
Genome-wide association studies (GWAS) have identified three genomic regions, at 15q24-25.1, 5p15.33 and 6p21.33, which associate with risk of lung cancer. Large meta-analyses of GWA data have failed to find additional associations of genome-wide significance. In this study, we sought to confirm 7 variants with suggestive association to lung cancer (P<10−5) in a recently published meta-analysis. In a GWA dataset of 1,447 lung cancer cases and 36,256 controls in Iceland, three correlated variants on 15q15.2 (rs504417, rs11853991 and rs748404) showed a significant association with lung cancer whereas rs4254535 on 2p14, rs1530057 on 3p24.1, rs6438347 on 3q13.31 and rs1926203 on 10q23.31 did not. The most significant variant, rs748404, was genotyped in an additional 1,299 lung cancer cases and 4,102 controls from the Netherlands, Spain and the USA and the results combined with published GWAS data. In this analysis, the T allele of rs748404 reached genome-wide significance (OR=1.15, P=1.1×10−9). Another variant at the same locus, rs12050604, showed association with lung cancer (OR=1.09, P=3.6×10−6) and remained significant after adjustment for rs748404 and vice versa. rs748404 is located 140 kb centromeric of the TP53BP1 gene that has been implicated in lung cancer risk. Two fully correlated, non-synonymous coding variants in TP53BP1, rs2602141 (Q1136K) and rs560191 (E353D), showed association with lung cancer in our sample set; however, this association did not remain significant after adjustment for rs748404. Our data show that one or more lung cancer risk variants of genome-wide significance and distinct from the coding variants in TP53BP1 are located at 15q15.2.
PMCID: PMC3077097  PMID: 21303977
Lung cancer; genome-wide association studies; GWAS; 15q15.2; TP53BP1
19.  CDKN2A Mutations and Melanoma Risk in the Icelandic Population 
Journal of medical genetics  2008;45(5):284-289.
Germline CDKN2A mutations have been observed in 20–40% of high-risk melanoma-prone families; however, little is known about their prevalence in population-based series of melanoma cases and controls.
We resequenced the CDKN2A gene, including the p14ARF variant and promoter regions, in approximately 703 registry-ascertained melanoma cases and 691 population-based controls from Iceland, a country in which the incidence of melanoma has increased rapidly.
We identified a novel germline variant, G89D, that was strongly associated with increased melanoma risk and appeared to be an Icelandic founder mutation. The G89D variant was present in about 2% of Icelandic invasive cutaneous malignant melanoma cases. Relatives of affected G89D carriers were at significantly increased risk of melanoma, head & neck cancers, and pancreatic carcinoma compared to relatives of other melanoma patients. Nineteen other germline variants were identified, but none conferred an unequivocal risk of melanoma.
This population-based study of Icelandic melanoma cases and controls showed a frequency of disease-related CDKN2A mutant alleles ranging from 0.7% to 1.0%, thus expanding our knowledge about the frequency of CDKN2A mutations in different populations. In contrast to North America and Australia, where a broad spectrum of mutations was observed at a similar frequency, functional CDKN2A mutations in Iceland consist of only one or two different variants. Additional genetic and/or environmental factors are likely critical for explaining the high incidence rates for melanoma in Iceland. This study adds to the geographic regions for which population-based estimates of CDKN2A mutation frequencies are available.
PMCID: PMC3236640  PMID: 18178632
melanoma; CDKN2A; G89D; pancreatic cancer; population-based
20.  Recommendations for standardization and phenotype definitions in genetic studies of osteoarthritis: the TREAT-OA consortium 
To address the need for standardization of osteoarthritis (OA) phenotypes by examining the effect of heterogeneity among symptomatic (SOA) and radiographic osteoarthritis (ROA) phenotypes.
Descriptions of OA phenotypes of the 28 studies involved in the TREAT-OA consortium were collected. To investigate whether different OA definitions result in different association results, we created hip OA definitions used within the consortium in the Rotterdam Study-I and tested the association of hip OA with gender, age and BMI using one-way ANOVA. For radiographic OA, we standardized the hip, knee and hand ROA definitions and calculated prevalences of ROA before and after standardization in 9 cohort studies. This procedure could only be performed in cohort studies, and standardization of SOA definitions was not feasible at this moment.
In this consortium, all studies with symptomatic OA phenotypes (knee, hip and hand) used a different definition and/or assessment of OA status. For knee, hip and hand radiographic OA, 5, 4 and 7 different definitions were used, respectively. Different hip OA definitions do lead to different association results. For example, we showed in the Rotterdam Study-I that hip OA defined as “at least definite JSN and one definite osteophyte” was not associated with gender (p=0.22), but hip OA defined as “at least one definite osteophyte” was significantly associated with gender (p=3×10−9). Therefore, a standardization process was undertaken for radiographic OA definitions. Before standardization, a wide range of ROA prevalences was observed in the 9 cohorts studied. After standardization, the range in prevalence of knee and hip ROA was small. Standardization of SOA phenotypes was not possible due to the case-control design of the studies.
Phenotype definitions influence the prevalence of OA and association with clinical variables. ROA phenotypes within the TREAT-OA consortium were standardized to reduce heterogeneity and improve power in future genetics studies.
PMCID: PMC3236091  PMID: 21059398
21.  The chromosome 9p21 risk locus is associated with angiographic severity and progression of coronary artery disease 
European Heart Journal  2010;31(24):3017-3023.
We tested the hypothesis that the 9p21 risk locus promotes atherosclerosis by examining the association between rs10757278 and coronary artery disease (CAD) severity and progression determined by semi-quantitative angiographic scores.
Methods and results
The rs10757278 single nucleotide polymorphism (SNP) was genotyped as the marker for the 9p21 locus in 2334 Caucasian patients undergoing cardiac catheterization (mean age 63, male 67%). Angiographic CAD was assessed using two semi-quantitative scoring systems, one estimating severity (Gensini) and the other extent (Sullivan). A subset of 308 patients who underwent two or more coronary angiograms at least 6 months apart was examined for net change in Gensini and Sullivan scores over time to determine the rate of CAD progression by genotype; these patients were further classified as ‘progressors’ or ‘non-progressors’ based on absolute change per year in angiographic severity score. We replicated the association between the rs10757278 SNP and myocardial infarction and binary (presence/absence) angiographic classifications of CAD. Furthermore, we observed a significant additive association between this SNP and both severity and extent of CAD using angiographic scores, after adjustment for age, gender, body mass index, traditional cardiovascular risk factors, myocardial infarction, and statin use (Gensini P = 0.016, Sullivan P = 0.005). In addition, there was a significant linear association with CAD progression before and after adjustment for covariates (Gensini P = 0.023, Sullivan P = 0.003), with homozygotes for the risk variant having three-fold greater odds of CAD progression compared with the referent group.
The 9p21 risk locus is associated with angiographically defined severity, extent, and progression of CAD, suggesting a role for this locus in influencing atherosclerosis and its progression.
PMCID: PMC3001587  PMID: 20729229
Atherosclerosis; angiography; coronary disease; genetics; genomics; 9p21
22.  Identification of an imprinted master trans-regulator at the KLF14 locus related to multiple metabolic phenotypes 
Nature genetics  2011;43(6):561-564.
Genome-wide association studies have identified many genetic variants associated with complex traits. However, at only a minority of loci have the molecular mechanisms mediating these associations been characterized. In parallel, whilst cis-regulatory patterns of gene expression have been extensively explored, the identification of trans-regulatory effects in humans has attracted less attention. We demonstrate that the Type 2 diabetes and HDL-cholesterol associated cis-acting eQTL of the maternally expressed transcription factor KLF14 acts as a master trans-regulator of adipose gene expression. Expression levels of genes regulated by this trans-eQTL are highly correlated with concurrently measured metabolic traits, and a subset of the trans-genes harbor variants directly associated with metabolic phenotypes. This trans-eQTL network provides a mechanistic understanding of the effect of the KLF14 locus on metabolic disease risk, providing a potential model for other complex traits.
PMCID: PMC3192952  PMID: 21572415
23.  Large Scale Replication Study of the Association between HLA Class II/BTNL2 Variants and Osteoarthritis of the Knee in European-Descent Populations 
PLoS ONE  2011;6(8):e23371.
Osteoarthritis (OA) is the most common form of arthritis and a major cause of disability. This study evaluates the association in Caucasian populations of two single nucleotide polymorphisms (SNPs) mapping to the Human Leukocyte Antigen (HLA) region and deriving from a genome wide association scan (GWAS) of knee OA in Japanese populations. The frequencies for rs10947262 were compared in 36,408 controls and 5,749 knee OA cases from European-descent populations. rs7775228 was tested in 32,823 controls and 1,837 knee OA cases of European descent. The risk (major) allele at rs10947262 in Caucasian samples was not significantly associated with knee OA, with an odds ratio (OR) = 1.07 (95% CI 0.94-1.21; p = 0.28). For rs7775228 the meta-analysis resulted in OR = 0.94 (95% CI 0.81-1.09; p = 0.42) for the allele associated with risk in the Japanese GWAS. In Japanese individuals these two SNPs are in strong linkage disequilibrium (LD) (r2 = 0.86) with the HLA class II haplotype DRB1*1502 DQA1*0103 DQB1*0601 (frequency 8%). In Caucasian and Chinese samples, using imputed data, these SNPs appear not to be in LD with that haplotype (r2<0.07). The rs10947262 and rs7775228 variants are not associated with risk of knee OA in European-descent populations, and they do not appear to tag the same HLA class II haplotype as they do in Japanese individuals.
PMCID: PMC3154440  PMID: 21853121
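As a reading aid for the odds ratios quoted in the abstract above, the following minimal sketch shows how a two-sided p-value can be approximated from a reported OR and its 95% confidence interval. It assumes a Wald test on the log-odds scale with a symmetric interval; this is an illustrative assumption, not the method the authors describe, and the function name `p_from_or_ci` is hypothetical. Because the published figures are rounded, the result is only expected to land near the reported p-values.

```python
import math


def p_from_or_ci(or_est, ci_low, ci_high, z_crit=1.959964):
    """Approximate a two-sided p-value from an odds ratio and its 95% CI.

    Assumes the interval is a Wald interval, symmetric on the log scale:
    log(OR) +/- z_crit * SE. This is an assumption for illustration only.
    """
    # Standard error of log(OR), recovered from the width of the interval.
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    # Wald z statistic for the null hypothesis OR = 1 (log(OR) = 0).
    z = math.log(or_est) / se
    # Two-sided normal tail probability: 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2)).
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p


if __name__ == "__main__":
    # Values as quoted in the abstract (rounded in the source).
    for label, est, lo, hi in [
        ("rs10947262", 1.07, 0.94, 1.21),
        ("rs7775228", 0.94, 0.81, 1.09),
    ]:
        z, p = p_from_or_ci(est, lo, hi)
        print(f"{label}: z = {z:.2f}, approximate p = {p:.2f}")
```

Both intervals include 1, so the approximate p-values are well above 0.05, which agrees with the abstract's conclusion of no significant association.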
24.  A sequence variant on 17q21 is associated with age at onset and severity of asthma 
A sequence variant (rs7216389-T) near the ORMDL3 gene on chromosome 17q21 was recently found to be associated with childhood asthma. We sought to evaluate the effect of rs7216389-T on asthma subphenotypes and its correlation with expression levels of neighboring genes. The association of rs7216389-T with asthma was replicated in six European and one Asian study cohort (N=4917 cases, N=34 589 controls). In addition, we found that the association of rs7216389-T was confined to cases with early onset of asthma, particularly in early childhood (age: 0–5 years, OR=1.51, P=6.89·10−9) and adolescence (age: 14–17 years, OR=1.71, P=5.47·10−9). A weaker association was observed for onset between 6 and 13 years of age (OR=1.17, P=0.035), but none for adult-onset asthma (OR=1.07, P=0.12). Cases were further stratified by sex, asthma severity and atopy status. An association with greater asthma severity was observed among early-onset asthma cases (P=0.0012), but no association with sex or atopy status was observed among the asthma cases. An association between sequence variants and the expression of genes in the 17q21 region was assessed in white blood cell RNA samples collected from Icelandic individuals (n=743). rs7216389 was associated with the expression of the GSDMB and ORMDL3 genes. However, other sequence variants showing a weaker association with asthma than rs7216389 were more strongly associated with the expression of both genes. Thus, the contribution of rs7216389-T to the development of asthma is unlikely to operate only through an impact on the expression of the ORMDL3 or GSDMB genes.
PMCID: PMC2987388  PMID: 20372189
childhood asthma; single-nucleotide polymorphism; expression; ORMDL3; GSDMB
25.  Replication of Lung Cancer Susceptibility Loci at Chromosomes 15q25, 5p15, and 6p21: A Pooled Analysis From the International Lung Cancer Consortium 
Genome-wide association studies have identified three chromosomal regions at 15q25, 5p15, and 6p21 as being associated with the risk of lung cancer. To confirm these associations in independent studies and investigate heterogeneity of these associations within specific subgroups, we conducted a coordinated genotyping study within the International Lung Cancer Consortium based on independent studies that were not included in previous genome-wide association studies.
Genotype data for single-nucleotide polymorphisms at chromosomes 15q25 (rs16969968, rs8034191), 5p15 (rs2736100, rs402710), and 6p21 (rs2256543, rs4324798) from 21 case–control studies for 11 645 lung cancer case patients and 14 954 control subjects, of whom 85% were white and 15% were Asian, were pooled. Associations between the variants and the risk of lung cancer were estimated by logistic regression models. All statistical tests were two-sided.
Associations between 15q25 and the risk of lung cancer were replicated in white ever-smokers (rs16969968: odds ratio [OR] = 1.26, 95% confidence interval [CI] = 1.21 to 1.32, Ptrend = 2 × 10−26), and this association was stronger for those diagnosed at younger ages. There was no association in never-smokers or in Asians between either of the 15q25 variants and the risk of lung cancer. For the chromosome 5p15 region, we confirmed statistically significant associations in whites for both rs2736100 (OR = 1.15, 95% CI = 1.10 to 1.20, Ptrend = 1 × 10−10) and rs402710 (OR = 1.14, 95% CI = 1.09 to 1.19, Ptrend = 5 × 10−8) and identified similar associations in Asians (rs2736100: OR = 1.23, 95% CI = 1.12 to 1.35, Ptrend = 2 × 10−5; rs402710: OR = 1.15, 95% CI = 1.04 to 1.27, Ptrend = .007). The associations between the 5p15 variants and lung cancer differed by histology; odds ratios for rs2736100 were highest in adenocarcinoma and for rs402710 were highest in adenocarcinoma and squamous cell carcinomas. This pattern was observed in both ethnic groups. Neither of the two variants on chromosome 6p21 was associated with the risk of lung cancer.
In this international genetic association study of lung cancer, previous associations found in white populations were replicated and new associations were identified in Asian populations. Future genetic studies of lung cancer should include detailed stratification by histology.
PMCID: PMC2897877  PMID: 20548021

