Search tips
Search criteria 


Logo of nihpaAbout Author manuscriptsSubmit a manuscriptHHS Public Access; Author Manuscript; Accepted for publication in peer reviewed journal;
Pharmacogenet Genomics. Author manuscript; available in PMC 2014 August 1.
Published in final edited form as:
PMCID: PMC3003940

Genome-Wide Association Studies in Pharmacogenomics: Successes and Lessons


As genotyping technology has progressed, genome-wide association studies (GWAS) have matured into efficient and effective tools for mapping genes underlying human phenotypes. Recent studies have demonstrated the utility of the GWAS approach for examining pharmacogenomic traits, including drug metabolism, efficacy, and toxicity. Application of GWAS to pharmacogenomic outcomes presents unique challenges and opportunities. In the current review, we discuss the potential promises and potential caveats of this approach specifically as it relates to pharmacogenomic studies. Concerns with study design, power and sample size, and analysis are reviewed. We further examine the features of successful pharmacogenomic GWAS, and describe consortia efforts that are likely to expand the reach of pharmacogenomic GWAS in the future.

Keywords: Genome-wide association, GWAS, pharmacogenetic, pharmacogenomic, drug response, drug metabolism, toxicity


Since 2005, genome-wide association studies (GWAS) have matured into a powerful tool to identify single nucleotide polymorphisms (SNPs) that can be reproducibly associated with a variety of human phenotypes. Currently, well over 300 papers have reported significant associations of common variants with a range of phenotypes and diseases [1]. These successes have provided numerous insights into the relationship among genetic variants, biological pathways, and human traits, as well as shown how proper study design and analysis can lead to the success of GWAS. A key lesson from this first generation of GWAS is that no single approach will be appropriate for all phenotypes [2].

The genetics of drug-response outcomes, broadly referred to here as pharmacogenetic/pharmacogenomic outcomes, are a particular category of phenotypes that present unique challenges and opportunities in gene discovery [3]. In this review we discuss the advantages and limitations of GWAS as applied to pharmacogenomic outcomes. Some of these challenges are variations on general concerns for disease gene identification, whereas others are unique to pharmacogenomic outcomes.

Like studies of disease phenotypes, the success of any pharmacogenomic GWAS will depend on the effect size and allele frequency of genetic variants that influence the trait, the sample size available to detect those variants, the population under study (treatment protocol, dosage, patient features including self-reported race/ethnicity, etc.), and study design (observational study or randomized controlled trial). Unlike most disease phenotypes, pharmacogenomic outcomes often have clear, clinically defined phenotypes and well understood mechanisms that may underlie variation in drug response, including known systems of transport and metabolism, as well as sites of drug action. In addition, larger genetic effects may exist for pharmacogenomic traits than for disease phenotypes, providing greater statistical power for genetic association studies.

An important potential limitation for pharmacogenomic GWAS is sample size. GWAS for traits like height or QT or complex diseases like diabetes need and benefit from large numbers and currently mega-meta-analyses are identifying and validating associated loci. Such large sample sets are generally not possible for pharmacogenomic outcomes since they usually include by definition both a disease (often with low prevalence) as well as a well-curated drug response phenotype (which further reduces the available study population).

In this article, we discuss key issues for GWAS, including the strengths and limitations of this approach. We then elucidate issues of heightened importance in GWAS of pharmacogenomic traits. We discuss appropriate study designs and analysis strategies, and describe lessons from successful pharmacogenomic GWAS. We end with a discussion of ongoing efforts to develop consortia for the purpose of obtaining large sample sizes for drug response outcomes.


There are clear, well-understood advantages to a genome-wide association approach to phenotype association discovery. GWAS are conventionally intended as an unbiased scan of the genome, interrogating the majority of common genetic variation for disease association. In contrast to a candidate gene approach, whether narrow or broad in scope, GWAS allow the identification of totally novel susceptibility factors that promise to provide us with better biological understanding of phenotypes [4]. There are many candidate mechanisms that drive variability in drug responses: metabolism, transport, targets, target partners, immunologic pathways (e.g. for allergic reactions), etc. that have directed many successful candidate gene studies [5],However, they cannot identify genes outside of the current knowledge of those mechanisms. GWAS allow such novel discovery.

GWAS have distinct advantages as compared to more traditional linkage based approaches [6]. There are three key general advantages of GWAS approaches for gene identification, each of which are exaggerated for pharmacogenomic outcomes:

  • Case-control cohorts are generally less expensive and easier to collect than extended pedigrees or nuclear families. This is especially true in drug response studies where it is rare for multiple family members to have well-characterized responses to drug challenges, i.e formal linkage analysis has not been feasible for drug response phenotypes. GWAS do not require the ascertainment of pharmacogenomic interventions in related individuals.
  • Association studies have higher statistical power to detect small to modest genetic effects as compared to linkage studies [6]. For pharmacogenomic studies, especially for rare toxicities where sample sizes are limited, this advantage in power may be the difference between success and failure in gene mapping.
  • Because linkage disequilibrium (LD) typically stretches over tens of kilobases as opposed to several megabases [6], association signals are more finely localized than linkage signals, which should lead to more rapid identification of causal variants by rapidly narrowing down regions for follow up in functional studies – critical for novel mechanistic insights – and, thus, to more rapid translation of findings.

There are additional advantages to GWAS that are more specific to pharmacogenomic outcomes. First, GWAS provide context for understanding the relative importance of genetic contributors to pharmacogenomic traits that may otherwise be unavailable. The genetic component of human phenotypes can be assessed by estimating heritability (the proportion of variation in a trait due to genetic factors) through methods such as variance components analysis, segregation analysis, etc. Each of these methods requires family data, which, as noted above, is usually difficult to collect for pharmacogenomic outcomes [7].

Another specific application of GWAS in pharmacogenomics is the ability to rule out – with pre-specified confidence intervals – contributions by unidentified genes to a drug response phenotype. Because pharmacogenomic GWAS can directly investigate the role of genetic variation on clinical outcomes, the findings from pharmacogenomic GWAS can be more rapidly translated to clinical practice. As translation to the bedside is one of the goals of pharmacogenomic gene mapping [8], it is important to ensure that any unanticipated important genetic contribution to variability in a drug response is not missed [9]. Of equal importance is the identification of novel mechanisms, both for drug response and/or adverse drug reactions. So, having identified variants in gene X or Y as contributors to a variable drug response, it is key to ensure that there is no other important genetic contributor before mounting a trial. Understanding the influence of genetic variants in drug response can limit unanticipated variability in a drug treatment [9]. The role of GWAS in this process is evident in the evaluation of the genetic component of warfarin dosing [9]. The strong association of variants in VKORC1 and CYP2C9 for stable warfarin dosing were well established [1012], but before the National Heart, Lung and Blood Institute (NHLBI) in the United States would mount a large clinical trial it was important to determine if there were other genetic variants that also had large effects on stable warfarin dosing. GWAS [13, 14] have now ruled out large contributions by other loci, thereby allowing clinical trials to proceed [15]. Similarly, a GWAS for clopidogrel effect on ADP-induced platelet aggregation identified only one associated locus, at CYP2C9/19, laying the groundwork for design of clinical trials [16]. As genotyping platforms with increased SNP density become available, the coverage of genetic variation in the human genome will become more complete, providing greater confidence that clinically important genetic effects on pharmacogenomic traits will not be missed. Thus while many variants in drug metabolism genes have been shown to confer large clinical effects, that have often been identified without GWAS (e.g. by well informed candidate gene studies), even GWAS with “negative” results add this crucial additional information [17].


Common Disease Common Variant Hypothesis

Despite the advantage of GWAS studies discussed above, there are important caveats that must be remembered in their design and application. While many of these caveats are true of GWAS in general, the impact of these concerns may be different in pharmacogenomic studies than in general trait mapping.

A key assumption in GWAS is what is known as the common disease/common variant hypothesis [18]. The common disease/common variant hypothesis proposes that most of the genetic risk for common, complex diseases is attributable to relatively common (minor allele frequency >0.05) polymorphisms [18]. The alternative to the common disease/common variant hypothesis is that multiple rare variants cause disease at high prevalence in the population through a variety of mechanisms, Such variants can represent genetic heterogeneity of variants in a single gene, or multiple rare variants within genes in the same pathway that have cumulative effects. These two hypotheses have important implications—common variants are thought to impart subtle effects on gene function, often through changes to gene regulation. Rare variants may have larger effects on gene function, such as nonsynonymous variants that alter the amino acid sequence of the resulting protein, and as a result lead to large changes in disease risk or trait values. As a result, it is likely that both common and rare variants will contribute to common phenotypes, but the relative proportions will influence the appropriate methods for discovering associated variants. The GWAS approach is well powered to detect common variants with modest effects. GWAS is less effective at testing rare variation, a problem that is confounded by the DNA microarrays used in these studies, which have been designed to capture common variation. Even “next generation” GWAS that will reliably interrogate (directly or indirectly) all variation with minor allele frequency > 0.005 may be insufficient to identify enough of the contributory variation to allow us to understand biology if most of that variation has minor allele frequency < .005, as the sample sizes required to achieve sufficient statistical power for such effects may be prohibitive. As “next generation” sequencing becomes more accessible, and whole genome sequencing becomes more affordable, more rare variant analysis will be possible in pharmacogenomics.

Sources of Bias

An important concern in GWAS studies for pharmacogenomics is of the potential for bias in the selection of genetic variants [2]. Although large numbers of variants with low minor allele frequency are included in the densest GWAS platforms, GWAS have little power, given sample sizes available, to detect significant associations with low minor allele frequency (MAF) SNPs. Additionally, it is widely recognized that genotype quality is not as high for rare variants as it is for more common variants. Consequently, a common approach is to not assess the significance of associations with rare variants (MAF < .01). This further compounds the limited statistical power to detect associations with less common genetic variants. Moreover, SNPs included on high throughput platforms must pass stringent tests for ease of genotyping, which leads regions with gene duplications (and pseudogenes) to be poorly represented on high-throughput genotyping products, and many of these – such as CYPs or the HLA locus are precisely the genes of greatest interest for pharmacogenomic study. The human cytochrome P-450 (CYP) family of genes that encodeenzymes active in xenobiotic metabolism have been associated with a large number of pharmacogenomic outcomes [19]. They are known to be highly polymorphic, with a wide range of allele frequencies across populations, and contain complex structural variation, with unique haplotypic structure and copy number variations [20]. The coverage of these types of variation is limited on current GWAS genotyping platforms [21].

Study Design

Experimental design is a crucial component of any successful GWAS, and pharmacogenomic studies have different advantages and limitations than traditional disease studies. The importance of proper definition and collection of phenotype data has become increasingly appreciated in the context of GWAS [17]. An important advantage in pharmacogenomic studies is that multiple response phenotypes are often collected within the same study, such as efficacy and adverse events, allowing a broader dissection of trait genetics in a single study.

However, because all pharmacogenomic outcomes are responses to the environmental exposure of the drug and because these drugs are given in response to a disease condition, there may be complex interactions between disease and drug response relevant in phenotype definition. Precise definitions are essential for both the disease and drug response phenotypes, which are often discrete diagnoses from these complex relationships. For example, in some but not all cases, rare adverse drug reactions may represent a “tail” of response distributions and where to define that cut-off within the distribution can be a challenge. The SEARCH Collaborative Group demonstrated a successful approach to address this issue by combining subjects with both definite and incipient statin-induced myopathy into a single case definition [22]. In other cases, a rare adverse reaction is an unexpected outcome often unrelated to the desired mechanism of action [17].

One efficient use of resources to collect pharmacogenomic phenotypes is to collect samples within the context of clinical trials, which streamlines the collection procedures. The use of clinical trial data for GWAS studies is not only an efficient use of resources, but has the advantage that similarly treated “controls” for the phenotype of interest are built into the trials. However, because some trials are not designed for GWAS mapping, the study designs used for collection may not be ideal for pharmacogenomic analysis (e.g. multiple drugs used in treatment arms, etc) [23]. Obviously, this “challenge” is inherent to the treatment of diseases like cancer or end stage congestive heart failure in which it would be unethical to fail to treat patients with the current standard of care for this illness. If pharmacogenomic efforts are sub-studies of clinical trials, sample sizes may decrease, which reduces the power of the pharmacogenomic component. Because meeting recruitment targets is a primary goal in most clinical trials, genomic and pharmacogenomic efforts are often included only as sub-studies to which subjects may or may not consent; as a result, the power and generalizability of genomic studies is compromised. Genetic studies added as an afterthought may be viewed as creating a barrier to recruitment and are thus may not be a priority for sponsors. Collecting drug response phenotypes in health care systems with electronic medical records is another method of accruing subjects that is now being explored.

Sample size limitations are a challenge in any GWAS study, but are amplified in many pharmacogenomic studies. Particularly when studying rare drugreactions or adverse events, it is by definition not feasible to recruit thousands of patients with rare outcomes. This is a particular limitation in pharmacogenomic GWAS studies as the replication of association results in independent populations has become the “gold standard” for validation of results [24]. If the collection of a reasonable sample size for a discovery cohort is at the edge of practicality, this makes the collection of a well-powered replication cohort often impossible. Consortia efforts (discussed below) have been motivated by this limitation, to combine samples from across the world to increase power and potentially identify replication cohorts to maximize power and provide validation to associated signals. However even the establishment of networks of investigators cannot necessarily overcome these limitations, and the field must look to creative approaches of validation/replication possibly involving functional studies or examination of related intermediate phenotypes.

There are unique “challenges” associated with validation/replication for pharmacogenomics. Clinical trials are expensive, and every study is unique since they are designed to represent an advance over previous studies to answer novel therapeutic questions. Therefore, in pharmacogenomics greater emphasis may have to be placed on functional validation of GWAS “signals” and on biological plausibility. Additionally, one must recognize that the larger the sample size, the more likely that features which confound the genotype/phenotype relationship will be undocumented or uncontrolled, thus diluting the “purity” of the phenotype and potentially reducing power [25].

Besides sample size, there are other practical limitations in study design for pharmacogenomic studies. As mentioned previously, family based designs are generally impractical with drug response outcomes, which means the field relies heavily on cohort or case-control studies for GWAS [5]. While the number of cases may be limited by event frequency as discussed above, finding and selecting appropriate controls presents additional challenges. While GWAS of common diseases have taken advantage of the use of shared controls across studies, this is not often possible in pharmacogenomic studies, as typically controls must also be exposed to the drug of interest (though this may not be necessary in all cases). Other matching criteria must also be considered, such as disease interactions, population admixture, and additional environmental and clinical exposures.


As GWAS have become more prevalent, methodologies for the analysis and interpretation of results have co-evolved. Many tools have been developed and evaluated in the context of GWAS studies, and have resulted in the many successes seen to date. However, there are still many challenges in the analysis strategies used for GWAS in general, as well as particular challenges for pharmacogenomics, as discussed below.

Standard Analytical Approaches

The majority of previous GWAS studies have relied on the use of traditional statistical methodologies for analysis, and several tools have become widely used in the field. Software packages such as PLINK [26], have become very popular to implementing logistic regression (for case-control or cohort studies), linear regression (for quantitative traits), and family based association tests for GWAS studies.

After various types of corrections for multiple testing (Bonferroni, permutation approaches, etc) results of these analyses are typically prioritized with replication strategies. For single samples, the union of significant results from several analytical approaches (committee-based approaches) or measures of reliability from internal model validation is often used toprioritize robust signals. When more than one sample is available, multistage replication strategies are often employed to discover, prioritize, and validate signals. Finally, when multiple samples are available, meta-analysis is often used to obtain more comprehensive measure of association signals [27]. Challenges in sample collection (discussed above) can limit the use of such multistage replication and meta-analysis strategies in pharmacogenomics. One alternative approach for replication, or at least prioritization, of association signals in pharmacogenetic studies is to utilize non-clinical GWAS studies of large collections of human tissue, cell lines, and genetic model organisms [28].

Detecting Complex Predictive Models

Such traditional approaches have been very powerful for identifying strong single-locus associations (“low-hanging fruit”) for a wide range of phenotypes in both common diseases and pharmacogenomic outcomes (reviewed below), and are typically applied in a way that fits within the “unbiased” intentions of GWAS association studies. Despite the successes of these approaches, their limitations for detecting and prioritizing more complex models have become a hot topic in the literature [29].

As many successful GWAS have been published, the sum of the genetic contributions of associated variants in many common traits is far below the estimated heritability of the traits. These gaps in explained heritability are potentially clarified by several potential etiologies. Low power to detect low effect sizes, the presence of rare variants contributing to phenotypes, unmeasured nucleotide or structuralvariation, complex methylation/epigenetic mechanisms, and gene-gene/gene-environment interactions are all hypothesized to contribute to the unexplained trait variation [29]. In response to these limitations, new analytical approaches are evolving to detect complex genetic risk models, discussed below. These limitations are leading to refinement of methods for GWAS analysis, and these may be especially appropriate for pharmacogenomic studies.

Expert-Knowledge Driven Analysis

While this “unbiased” intent of GWAS studies is to detect potentially new genetic associations that might not have been considered as candidate genes, there has been a recent appreciation for the fact that these simple analytical approaches ignore the large amount of expert knowledge available for a particular outcome. In response, there has recently been rapid development in the use of network and pathway analysis for analysis of GWAS data [3033]. Literature searches (automated or hand-curated), databases of previous results, etc. are being exploited to improve the power of GWAS. Because there is much known about the mechanism and metabolism of many of the drugs evaluated in pharmacogenomic studies, there is very well directed guidance for such knowledge-driven analysis. The Pharmacogenomics Knowledge Base (PharmGKB) [34]is an important resource and data repository that summarizes and curates drug response/gene relationships via gene variant annotation, hand-curated literature review, and important pharmacogenomic genes and pathways. An example of the potential of pathway-based analysis is discussed below.


Arguably the most important demonstrations of the utility and challenges of GWAS studies in pharmacogenomics are the empirical results of successful studies. A brief description of the outcomes evaluated in pharmacogenomic GWAS and the strongest signals identified is listed in Table 1. Details of each study can be found in the references provided.

Table 1
Details of Pharmacogenomic GWAS

The potential and drawbacks of an agnostic, unbiased approach for genetic association studies in pharmacogenetics are illustrated by a GWAS of the activity of a well-known polymorphic drug metabolizing enzyme, thiopurine methyltransferase(TPMT) in lymphoblastoid cell lines from the HapMap project [35]. The goal of the experiment was to assess whether the TPMT polymorphism could be “rediscovered” in this fashion [36]. Although common polymorphisms in TPMT were well tagged, and TPMT polymorphisms were associated with TPMT activity, the GWAS indicated that 96 genes were ranked higher than was TPMT itself. The extent to which these higher ranked genes are false vs. true positives is not yet clear, but indicate the difficulty of using GWAS approaches even for putatively monogenic traits.

An example of a GWAS for drug pharmacokinetics is provided by an analysis of methotrexate clearance determined in over 3000 courses of the drug given to 434 children with leukemia [37]. Many candidate gene studies have previously been conducted to identify genetic variation associated with methotrexate pharmacokinetic variability, with limited success. Using GWAS, the SLCO1B1 gene was represented by multiple polymorphisms in several LD blocks, a finding that was replicated in an independent cohort of patients, suggesting that there are multiple mechanisms by which alteration of OATP1B1 (encoded by SLCO1B1) could affect methotrexate pharmacokinetics. Although methotrexate had been shown to be an OATP1B1 substrate, it was a rather weak one [38, 39], and so the gene had not risen to the top of candidate gene lists. This finding has implications for both toxicity to methotrexate, and to possible drug interactions with widely used OATP1B1 substrates, such as statins.

The utility of pathway-based analysis is demonstrated by Hartford et al. 2007 [40], who performed a GWAS examining etoposide-induced leukemia with MLL. They prioritized variant associations based on expression results, to identify alterations in three biological pathways: adhesion, Wnt signaling and regulation of actin. Results in an independent validation cohort confirmed the alterations in the adhesion pathway. None of the alterations identified were significant based on traditional association analysis, demonstrating the potential of more complex modeling to identify pathway-level associations.

While most of the published studies identified variants at a genome-wide significance level, many of them found strong potential signals that did not stand up to traditional analyses [4143]. These negative results may represent true negative results, but it is highly likely that many of these studies were limited by many of the challenges discussed above (power, coverage, etc).


In order to address many of the limitations discussed above, particularly in regards to limited sample sizes and lack of traditional replication cohorts, researchers are successfully combining resources and establishing worldwide collaborations to support large-scale GWAS. Given the complexities of drug response phenotypes, this approach seems especially appealing in the application of GWAS to pharmacogenomics. By combining cohorts from around the globe, pharmacogenomic studies will have higher power to detect and validate response-determining variants.

The SEARCH Collaborative Group [22]demonstrates the success of such a collaboration. The SEARCH Collaborative Group examined a rare outcome of statin therapy – myopathy, defined as markedly elevated creatinine kinase. In its most extreme form, this can result in the potentially fatal adverse effect of rhabdomyolysis, but these cases are exceedingly rare. The SEARCH Collaboration also found that myopathy was rare (~0.1%) with low dose simvastatin, so they focused their efforts on 98 cases identified in 6031 patients receiving high doses (80 mg/day) of the drug. A GWAS that studied 85 of these cases and90 controls identified rs4363657, in perfect LD with a known functional non-synonymous SNP in SLCO1B1 at genome-wide significance. The 5-year incidence of myopathy was 18% in individuals homozygous for the risk allele (2.1% of the study group), 3% in heterozygotes, and 0.6% in those with no risk allele. The result was replicated in a separate cohort of patients receiving a lower dose of 40 mg/day (relative risk 2.6 per C allele).

The success of this study illustrates several important points in the study design of pharmacogenomic GWAS. First, large collaborative samples can provide a valuable resource for collecting a critical mass of subjects with a rare phenotype. Second, rare phenotypes are sampled from the extreme tail of drug response distributions. As a result, genetic variants that influence these traits may have larger genetic effect sizes, and therefore be detectable with small sample sizes, than more common outcomes. Third, similar outcomes can sometimes be combined into a single case group. Here, in the initial association phase, definite and incipient myopathy patients were considered together. Fourth, replication of an association should take place in a similar population. In this study, the replication cohort was treated with a lower dose, 40 mg of simvastatin daily as compared to 80 mg in the initial group. We note that selecting cases from lower dose regimen for a follow up study may be preferable to the converse (i.e., higher doses in the follow up cohort) as those cases have a more extreme phenotype (by developing toxicity at a lower dose). This can limit the dilution of the association signal in the confirmatory study.

Several additional pharmacogenomics consortia have been established to evaluate a number of drug response outcomes, including the International Severe Irinotecan Neutropenia Consortium (, and the International Tamoxifen Pharmacogenomics Consortium ( These groups have pooled data from around the world to investigate genetic predictors of drug response with high power and comparison across global populations. While the initial work of these consortia has typically been focused on candidate/known genetic effects, they are moving towards GWAS. For example, the International Warfarin Pharmacogenetics Consortia (IWPC) ( originally combined data for over 4000 individuals from 24 international sites, to develop and test warfarin dosing algorithms [44], and are currently using the cohort data for a GWAS (through the IWPC-GWAS consortium) to identify and confirm previous findings, and potentially discover novel variants that explain potential trait variation across multiple populations. Such collaborations are extremely important for rare events, such as adverse events. TheInternational Serious Adverse Events Consortium ( one important effort in pharmacogenomics for adverse events, pulling together commercial, academic, and industry partners to collect data for well-powered GWAS.

These combined datasets represent exciting resources for pharmacogenomics GWAS, but are not without challenges. Concerns with consistent data collection, storage, data-ownership issues, etc. can be concerns in these collaborative efforts.


Genome-wide association studies have proven to be an exciting tool for gene mapping in common human traits, and are demonstrating their potential in pharmacogenomic outcomes as well. As pharmacogenomic GWAS mature, there is an increased appreciation for issues that are specifically related to these unique phenotypes. Practical considerations, related to study design and available sample sizes highlight the need for creative methods of replication, beyond the traditional replication cohorts that are used for common disease genetics, and the necessity of combining samples across consortia. The complex physiology of drug response outcomes highlights the need for analytical methods that incorporate this complexity, using the wealth of information available about drug mechanisms and pathways.


The authors are grateful to the Pharmacogenetics Research Network (PGRN) Publications Committee for helpful advice. Some of the work described in this review was funded by the NIH/NIGMS Pharmacogenetics Research Network grants U01 GM61393, U01 HL65962, U01 HG04603, U01 GM61388, U01 GM63340, UO1 GM61390 and Database (U01GM61374 This work was additionally supported by CA21765, the American Lebanese Syrian Associated Charities (ALSAC); and by CureSearch. Dr. Relling receives a portion of the income St. Jude receives from licensing patent rights related to TPMT polymorphisms.This publication [or project] was partially supported by NIH/NCRR/OD UCSF-CTSI Grant Number KL2 RR024130. Its contents are solely the responsibility of the authors and do not necessarily represent the official views of the NIH.

The current article has been partially funded by U01 GM61393, U01 HL65962, U01 HG04603,U01 GM61388, U01 GM63340, UO1 GM61390 and Database (U01GM61374 This work was additionally supported by CA21765, the American Lebanese Syrian Associated Charities (ALSAC); and by CureSearch.


1. Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, Manolio TA. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci USA. 2009;106(23):9362–9367. [PubMed]
2. McCarthy MI, Abecasis GR, Cardon LR, Goldstein DB, Little J, Ioannidis JP, Hirschhorn JN. Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat Rev Genet. 2008;9(5):356–369. [PubMed]
3. Weinshilboum R, Wang L. Pharmacogenomics: bench to bedside. Nat Rev Drug Discov. 2004;3(9):739–748. [PubMed]
4. Hirschhorn JN, Daly MJ. Genome-wide association studies for common diseases and complex traits. Nat Rev Genet. 2005;6(2):95–108. [PubMed]
5. Roden DM, Altman RB, Benowitz NL, Flockhart DA, Giacomini KM, Johnson JA, Krauss RM, McLeod HL, Ratain MJ, Relling MV, et al. Pharmacogenomics: challenges and opportunities. Ann Intern Med. 2006;145(10):749–757. [PubMed]
6. Risch N, Merikangas K. The future of genetic studies of complex human diseases. Science. 1996;273(5281):1516–1517. [PubMed]
7. Weiss ST, Silverman EK, Palmer LJ. Case-control association studies in pharmacogenetics. Pharmacogenomics J. 2001;1(3):157–158. [PubMed]
8. Roses AD. Pharmacogenetics in drug discovery and development: a translational perspective. Nat Rev Drug Discov. 2008;7(10):807–817. [PubMed]
9. Limdi NA, Veenstra DL. Expectations, validity, and reality in pharmacogenetics. J Clin Epidemiol. 2009 [PMC free article] [PubMed]
10. Carlquist JF, Horne BD, Muhlestein JB, Lappe DL, Whiting BM, Kolek MJ, Clarke JL, James BC, Anderson JL. Genotypes of the cytochrome p450 isoform, CYP2C9, and the vitamin K epoxide reductase complex subunit 1 conjointly determine stable warfarin dose: a prospective study. J Thromb Thrombolysis. 2006;22(3):191–197. [PubMed]
11. Aquilante CL, Langaee TY, Lopez LM, Yarandi HN, Tromberg JS, Mohuczy D, Gaston KL, Waddell CD, Chirico MJ, Johnson JA. Influence of coagulation factor, vitamin K epoxide reductase complex subunit 1, and cytochrome P450 2C9 gene polymorphisms on warfarin dose requirements. Clin Pharmacol Ther. 2006;79(4):291–302. [PubMed]
12. Rieder MJ, Reiner AP, Gage BF, Nickerson DA, Eby CS, McLeod HL, Blough DK, Thummel KE, Veenstra DL, Rettie AE. Effect of VKORC1 haplotypes on transcriptional regulation and warfarin dose. N Engl J Med. 2005;352(22):2285–2293. [PubMed]
13. Cooper GM, Johnson JA, Langaee TY, Feng H, Stanaway IB, Schwarz UI, Ritchie MD, Stein CM, Roden DM, Smith JD, et al. A genome-wide scan forcommon genetic variants with a large influence on warfarin maintenance dose. Blood. 2008;112(4):1022–1027. [PubMed]
14. Takeuchi F, McGinnis R, Bourgeois S, Barnes C, Eriksson N, Soranzo N, Whittaker P, Ranganath V, Kumanduri V, McLaren W, et al. A genome-wide association study confirms VKORC1, CYP2C9, and CYP4F2 as principal genetic determinants of warfarin dose. PLoS Genet. 2009;5(3):e1000433. [PMC free article] [PubMed]
15. Kangelaris KN, Bent S, Nussbaum RL, Garcia DA, Tice JA. Genetic testing before anticoagulation? A systematic review ofpharmacogenetic dosing of warfarin. J Gen Intern Med. 2009;24(5):656–664. [PMC free article] [PubMed]
16. Shuldiner AR, O’Connell JR, Bliden KP, Gandhi A, Ryan K, Horenstein RB, Damcott CM, Pakyz R, Tantry US, Gibson Q, et al. Association of cytochrome P450 2C19 genotype with the antiplatelet effect and clinical efficacy of clopidogrel therapy. JAMA. 2009;302(8):849–857. [PMC free article] [PubMed]
17. Crowley JJ, Sullivan PF, McLeod HL. Pharmacogenomic genome-wide association studies: lessons learned thus far. Pharmacogenomics. 2009;10 (2):161–163. [PMC free article] [PubMed]
18. Altshuler D, Daly MJ, Lander ES. Genetic mapping in human disease. Science. 2008;322(5903):881–888. [PMC free article] [PubMed]
19. Ingelman-Sundberg M. Genetic polymorphisms of cytochrome P450 2D6 (CYP2D6): clinical consequences, evolutionary aspects and functional diversity. Pharmacogenomics J. 2005;5(1):6–13. [PubMed]
20. Daly AK, Brockmoller J, Broly F, Eichelbaum M, Evans WE, Gonzalez FJ, Huang JD, Idle JR, Ingelman-Sundberg M, Ishizaki T, et al. Nomenclature for human CYP2D6 alleles. Pharmacogenetics. 1996;6(3):193–201. [PubMed]
21. Peters EJ, McLeod HL. Ability of whole-genome SNP arrays to capture ‘must have’ pharmacogenomic variants. Pharmacogenomics. 2008;9(11):1573–1577. [PubMed]
22. Link E, Parish S, Armitage J, Bowman L, Heath S, Matsuda F, Gut I, Lathrop M, Collins R. SLCO1B1 variants and statin-induced myopathy--a genomewide study. N Engl J Med. 2008;359(8):789–799. [PubMed]
23. Guessous I, Gwinn M, Khoury MJ. Genome-wide association studies in pharmacogenomics: untapped potential for translation. Genome Med. 2009;1(4):46. [PMC free article] [PubMed]
24. Ioannidis JP, Ntzani EE, Trikalinos TA, Contopoulos-Ioannidis DG. Replication validity of genetic association studies. Nat Genet. 2001;29 (3):306–309. [PubMed]
25. Thornton-Wells TA, Moore JH, Haines JL. Dissecting trait heterogeneity: a comparison of three clustering methods applied to genotypic data. BMC Bioinformatics. 2006;7:204. [PMC free article] [PubMed]
26. Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–575. [PubMed]
27. Cantor RM, Lange K, Sinsheimer JS. Prioritizing GWAS results: A review of statistical methods and recommendations for their application. Am J Hum Genet. 86(1):6–22. [PubMed]
28. Wang L, Weinshilboum RM. Pharmacogenomics: candidate gene identification, functional validation and mechanisms. Hum Mol Genet. 2008;17(R2):R174–179. [PMC free article] [PubMed]
29. Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, McCarthy MI, Ramos EM, Cardon LR, Chakravarti A, et al. Finding the missing heritability of complex diseases. Nature. 2009;461(7265):747–753. [PMC free article] [PubMed]
30. Elbers CC, van Eijk KR, Franke L, Mulder F, van der Schouw YT, Wijmenga C, Onland-Moret NC. Using genome-wide pathway analysis to unravel the etiology of complex diseases. Genet Epidemiol. 2009;33(5):419–431. [PubMed]
31. Eleftherohorinou H, Wright V, Hoggart C, Hartikainen AL, Jarvelin MR, Balding D, Coin L, Levin M. Pathway analysis of GWAS provides new insights into genetic susceptibility to 3 inflammatory diseases. PLoS One. 2009;4(11):e8068. [PMC free article] [PubMed]
32. Bush WS, Dudek SM, Ritchie MD. Biofilter: a knowledge-integration system for the multi-locus analysis of genome-wide association studies. Pac Symp Biocomput. 2009:368–379. [PMC free article] [PubMed]
33. Torkamani A, Topol EJ, Schork NJ. Pathway analysis of seven common diseases assessed by genome-wide association. Genomics. 2008;92 (5):265–272. [PMC free article] [PubMed]
34. Klein TE, Chang JT, Cho MK, Easton KL, Fergerson R, Hewett M, Lin Z, Liu Y, Liu S, Oliver DE, et al. Integrating genotype and phenotype information: an overview of the PharmGKB project. Pharmacogenetics Research Network and Knowledge Base. Pharmacogenomics J. 2001;1(3):167–170. [PubMed]
35. The International HapMap Project. Nature. 2003;426(6968):789–796. [PubMed]
36. Jones TS, Yang W, Evans WE, Relling MV. Using HapMap tools in pharmacogenomic discovery: the thiopurine methyltransferase polymorphism. Clin Pharmacol Ther. 2007;81(5):729–734. [PubMed]
37. Trevino LR, Shimasaki N, Yang W, Panetta JC, Cheng C, Pei D, Chan D, Sparreboom A, Giacomini KM, Pui CH, et al. Germline genetic variation in an organic anion transporter polypeptide associated with methotrexate pharmacokinetics and clinical effects. J Clin Oncol. 2009;27(35):5972–5978. [PMC free article] [PubMed]
38. Abe T, Unno M, Onogawa T, Tokui T, Kondo TN, Nakagomi R, Adachi H, Fujiwara K, Okabe M, Suzuki T, et al. LST-2, a human liver-specific organic anion transporter, determines methotrexate sensitivity in gastrointestinal cancers. Gastroenterology. 2001;120(7):1689–1699. [PubMed]
39. Tirona RG, Leake BF, Merino G, Kim RB. Polymorphisms in OATP-C: identification of multiple allelic variants associated with altered transportactivity among European-and African-Americans. J Biol Chem. 2001;276(38):35669–35675. [PubMed]
40. Hartford C, Yang W, Cheng C, Fan Y, Liu W, Trevino L, Pounds S, Neale G, Raimondi SC, Bogni A, et al. Genome scan implicates adhesion biological pathways in secondary leukemia. Leukemia. 2007;21(10):2128–2136. [PubMed]
41. Comabella M, Craig DW, Morcillo-Suarez C, Rio J, Navarro A, Fernandez M, Martin R, Montalban X. Genome-wide scan of 500,000 single-nucleotide polymorphisms among responders and nonresponders to interferon beta therapy in multiple sclerosis. Arch Neurol. 2009;66(8):972–978. [PubMed]
42. Alkelai A, Greenbaum L, Rigbi A, Kanyas K, Lerer B. Genome-wide association study of antipsychotic-induced parkinsonism severity among schizophrenia patients. Psychopharmacology (Berl) 2009;206(3):491–499. [PubMed]
43. Lavedan C, Licamele L, Volpi S, Hamilton J, Heaton C, Mack K, Lannan R, Thompson A, Wolfgang CD, Polymeropoulos MH. Association of the NPAS3 gene and five other loci with response to the antipsychotic iloperidone identified in a whole genome association study. Mol Psychiatry. 2009;14 (8):804–819. [PubMed]
44. Klein TE, Altman RB, Eriksson N, Gage BF, Kimmel SE, Lee MT, Limdi NA, Page D, Roden DM, Wagner MJ, et al. Estimation of the warfarin dose with clinical and pharmacogenetic data. N Engl J Med. 2009;360(8):753–764. [PMC free article] [PubMed]
45. Thompson JF, Hyde CL, Wood LS, Paciga SA, Hinds DA, Cox DR, Hovingh GK, Kastelein JJ. Comprehensive whole-genome and candidate gene analysis for response to statin therapy in the Treating to New Targets (TNT) cohort. Circ Cardiovasc Genet. 2009 Apr;2(2):173–81. [PubMed]
46. Aberg K, Adkins DE, Bukszar J, Webb BT, Caroff SN, Miller DD, Sebat J, Stroup S, Fanous AH, Vladimirov VI, et al. Genomewide Association Study of Movement-Related Adverse Antipsychotic Effects. Biol Psychiatry. 2009 [PMC free article] [PubMed]
47. Garriock HA, Kraft JB, Shyn SI, Peters EJ, Yokoyama JS, Jenkins GD, Reinalda MS, Slager SL, McGrath PJ, Hamilton SP. A genomewide association study of citalopram response in major depressive disorder. Biol Psychiatry. 67(2):133–138. [PMC free article] [PubMed]
48. Suppiah V, Moldovan M, Ahlenstiel G, Berg T, Weltman M, Abate ML, Bassendine M, Spengler U, Dore GJ, Powell E, et al. IL28B is associated with response to chronic hepatitis C interferon-alpha and ribavirin therapy. Nat Genet. 2009;41(10):1100–1104. [PubMed]
49. Tanaka Y, Nishida N, Sugiyama M, Kurosaki M, Matsuura K, Sakamoto N, Nakagawa M, Korenaga M, Hino K, Hige S, et al. Genome-wide association of IL28B with response to pegylated interferon-alpha and ribavirin therapy for chronic hepatitis C. Nat Genet. 2009;41(10):1105–1109. [PubMed]
50. Ising M, Lucae S, Binder EB, Bettecken T, Uhr M, Ripke S, Kohli MA, Hennings JM, Horstmann S, Kloiber S, et al. A genomewide association study points to multiple loci that predict antidepressant drug treatment outcome in depression. Arch Gen Psychiatry. 2009;66(9):966–975. [PubMed]
51. Laje G, Allen AS, Akula N, Manji H, John Rush A, McMahon FJ. Genome-wide association study of suicidal ideation emerging during citalopram treatment of depressed outpatients. Pharmacogenet Genomics. 2009;19 (9):666–674. [PMC free article] [PubMed]
52. McClay JL, Adkins DE, Aberg K, Stroup S, Perkins DO, Vladimirov VI, Lieberman JA, Sullivan PF, van den Oord EJ. Genome-wide pharmacogenomic analysis of response to treatment with antipsychotics. Mol Psychiatry. 2009 [PMC free article] [PubMed]
53. Ge D, Fellay J, Thompson AJ, Simon JS, Shianna KV, Urban TJ, Heinzen EL, Qiu P, Bertelsen AH, Muir AJ, et al. Genetic variation in IL28B predicts hepatitis C treatment-induced viral clearance. Nature. 2009;461(7262):399–401. [PubMed]
54. Teichert M, Eijgelsheim M, Rivadeneira F, Uitterlinden AG, van Schaik RH, Hofman A, De Smet PA, van Gelder T, Visser LE, Stricker BH. A genome-wide association study of acenocoumarol maintenance dosage. Hum Mol Genet. 2009 Oct 1;18(19):3758–68. [PubMed]
55. Daly AK, Donaldson PT, Bhatnagar P, Shen Y, Pe’er I, Floratos A, Daly MJ, Goldstein DB, John S, Nelson MR, Graham J, Park BK, Dillon JF, Bernal W, Cordell HJ, Pirmohamed M, Aithal GP, Day CP. DILIGEN Study; International SAE Consortium. HLA-B*5701 genotype is a major determinant of drug-induced liver injury due to flucloxacillin. Nat Genet. 2009 Jul;41(7):816–9. [PubMed]
56. Perlis RH, Smoller JW, Ferreira MA, McQuillin A, Bass N, Lawrence J, Sachs GS, Nimgaonkar V, Scolnick EM, Gurling H, et al. A genomewide association study of response to lithium for prevention of recurrence in bipolar disorder. Am J Psychiatry. 2009;166(6):718–725. [PMC free article] [PubMed]
57. Yang JJ, Cheng C, Yang W, Pei D, Cao X, Fan Y, Pounds SB, Neale G, Trevino LR, French D, et al. Genome-wide interrogation of germline genetic variation associated with treatment response in childhood acute lymphoblastic leukemia. JAMA. 2009;301(4):393–403. [PMC free article] [PubMed]
58. French D, Yang W, Cheng C, Raimondi SC, Mullighan CG, Downing JR, Evans WE, Pui CH, Relling MV. Acquired variation outweighs inherited variation in whole genome analysis of methotrexate polyglutamate accumulation in leukemia. Blood. 2009;113(19):4512–4520. [PubMed]
59. Mick E, Neale B, Middleton FA, McGough JJ, Faraone SV. Genome-wide association study of response to methylphenidate in 187 children with attention-deficit/hyperactivity disorder. Am J Med Genet B Neuropsychiatr Genet. 2008;147B(8):1412–1418. [PubMed]
60. Liu C, Batliwalla F, Li W, Lee A, Roubenoff R, Beckman E, Khalili H, Damle A, Kern M, Furie R, et al. Genome-wide association scan identifies candidate polymorphisms associated with differential response to anti-TNF treatment in rheumatoid arthritis. Mol Med. 2008;14(9-10):575–581. [PubMed]
61. Sarasquete ME, Garcia-Sanz R, Marin L, Alcoceba M, Chillon MC, Balanzategui A, Santamaria C, Rosinol L, de la Rubia J, Hernandez MT, et al. Bisphosphonate-related osteonecrosis of the jaw is associated with polymorphisms of the cytochrome P450 CYP2C8 in multiple myeloma: a genome-wide single nucleotide polymorphism analysis. Blood. 2008;112(7):2709–2712. [PubMed]
62. Turner ST, Bailey KR, Fridley BL, Chapman AB, Schwartz GL, Chai HS, Sicotte H, Kocher JP, Rodin AS, Boerwinkle E. Genomic association analysis suggests chromosome 12 locus influencing antihypertensive response to thiazide diuretic. Hypertension. 2008;52(2):359–365. [PMC free article] [PubMed]
63. Volpi S, Heaton C, Mack K, Hamilton JB, Lannan R, Wolfgang CD, Licamele L, Polymeropoulos MH, Lavedan C. Whole genome association study identifies polymorphisms associated with QT prolongation during iloperidone treatment of schizophrenia. Mol Psychiatry. 2009;14 (11):1024–1031. [PubMed]
64. Lavedan C, Volpi S, Polymeropoulos MH, Wolfgang CD. Effect of a ciliary neurotrophic factor polymorphism on schizophrenia symptom improvement in an iloperidone clinical trial. Pharmacogenomics. 2008 Mar;9(3):289–301. [PubMed]
65. Inada T, Koga M, Ishiguro H, Horiuchi Y, Syu A, Yoshio T, Takahashi N, Ozaki N, Arinami T. Pathway-based association analysis of genome-wide screening data suggest that genes associated with the gamma-aminobutyric acid receptor signaling pathway are involved in neuroleptic-induced, treatment-resistant tardive dyskinesia. Pharmacogenet Genomics. 2008;18(4):317–323. [PubMed]
66. Byun E, Caillier SJ, Montalban X, Villoslada P, Fernandez O, Brassat D, Comabella M, Wang J, Barcellos LF, Baranzini SE, et al. Genome-wide pharmacogenomic analysis of the response to interferon beta therapy in multiple sclerosis. Arch Neurol. 2008;65(3):337–344. [PubMed]
67. Kindmark A, Jawaid A, Harbron CG, Barratt BJ, Bengtsson OF, Andersson TB, Carlsson S, Cederbrant KE, Gibson NJ, Armstrong M, et al. Genome-wide pharmacogenetic investigation of a hepatic adverse event without clinical signs of immunopathology suggests an underlying immune pathogenesis. Pharmacogenomics J. 2008;8(3):186–195. [PubMed]