Admixture, a common form of gene flow between populations, refers to the process whereby two or more genetically and phenotypically diverse populations begin to mate and form a new, mixed or ‘hybrid’ population
13–14. Each chromosome of an admixed individual resembles a mosaic of chromosomal segments derived from a particular ancestral population (). The use of such populations to map genes was proposed over 50 years ago
15, but has come into prominence recently due to the availability of genome-wide sets of highly informative markers and adequate statistical tools to successfully conduct these studies
16–19. Markers with different frequency distributions among ancestral populations can be used to adjust for population stratification among admixed populations. Such markers are often referred to as ancestry informative markers (AIMs), because they can distinguish the ancestral origin of the haplotype on which they reside. If an ancestral population carries a genetic risk allele at a higher frequency than other(s), then genomes of affected “mixed-ancestry” offspring will share a greater level of ancestry (DNA) from that population around that disease susceptibility locus, compared with the background ancestry level in the genome-wide average or compared with the ancestry sharing among discordant individuals around the same location
20.
Populations like African Americans, African Caribbeans and Mexican Americans were formed within the past 400 years (i.e., within approximately 15 generations)
21. Stretches of DNA with contiguous European and African ancestry have therefore not had sufficient time to break up due to recombination; hence, allelic associations in these populations typically extend over distances as large as 20–30 cM
22–23. For studies mapping genes contributing to variability in drug outcome within these populations, the large amount of linkage disequilibrium (i.e., linkage between markers with ancestral information) in these admixed populations will translate into smaller requirements for both marker saturation and sample size
19, 24–25. This principle is important, since it is estimated that about 20% of the genetic material in today’s African-American population originated from a non-African, predominantly Caucasian, source
26.
Gene mapping can be accomplished within admixed populations through the application of three fundamental steps. First, a panel of AIMs that differentiate well between ancestral populations must be designed. Next, individuals are genotyped using the panel (e.g., following case control design), and the mosaic of ancestries is inferred for each individual. Finally, the inferred ancestral profiles are scanned in search of an aberration skewed toward the ancestral population with the higher risk, as expected based upon prior association with the locus of interest. Given the recent and common origin of all human populations, any two unrelated individuals share more than 99.8 % of their DNA sequence, and variation
within populations is by far greater than variation
between populations
27. The admixture between geographically isolated populations such as Europeans, Africans and Native Americans, has only a small average effect (0.2%) on the genetic variation of the gene pool. For most genomic regions, the parental populations have similar allele frequencies and, at these frequencies, admixture may be of limited consequence
28. However, at other loci, there have been marked changes in allele frequency in the time since the separation of parental populations, and it is at these loci that difference in frequencies of risk alleles can be leveraged to identify the loci impacting clinically recognizable traits
9, 29–31. Stated another way, admixture mapping is most applicable when variability in a given drug outcome is significantly different between the ancestral populations from which the admixed population has been formed. When such a trait is studied, admixed individuals demonstrating greater variability are expected to show an elevated genomic contribution from the ancestral population with the higher prevalence of the trait around the associated genetic loci.
The arguments in favor of admixture mapping are compelling, and the statistical methods are improving rapidly. Until recently, the availability of admixed populations suitable for study (with differences in disease and/or allele frequencies) has been somewhat limited
18, 32. Most pharmacogenetic association studies conducted to date have involved retrospective genotyping of archived DNA from previous randomized treatment trials
33. Nearly a decade ago, international efforts were initiated to launch the construction of DNA biobanks that were either based in the community (population-based) or in the context of routine clinical care (practice-based)
34. Biobanks typically include biological samples (i.e., serum, plasma and DNA) linked to structured clinical databases (i.e., comprehensive electronic medical records). These biobanks are uniquely suited for studies quantifying the impact of ancestry in admixed population and play a role in the pathway to personalized medicine in which treatments will be no longer be “one size fits all” but instead “tailored” to the molecular and genetic profiles of each patient based on pharmacogenetic predictors “treatment plans that fall in line with a “one-size-fits-one” approach. For example, the eMERGE network (electronic Medical Records and Genomics) represents a group of five large academic medical institutions within the United States that have collected DNA linked to secure encrypted clinical data extracted from dense electronic medical records
33. The participating institutions, Vanderbilt University (coordinating center), Northwestern University, Marshfield Clinic, Mayo Clinic, and Group Health, have begun the process of standardizing clinical phenotypes within the context of disease (onset and progression), as well as treatment outcome (efficacy and toxicity), for ongoing genome-wide association studies using densely populated high-throughput SNP scans (
http://www.gwas.net). In many cases, these data are available along with self-reported race and family structure across multiple generations. Similar efforts are also underway at Harvard Pilgrim, and Fallon Healthcare (in the Northeastern United States), Kaiser Permanente Georgia (in the Southeastern United States), HealthPartners, Henry Ford, and Marshfield Clinic (in the Midwestern United States), Kaiser Permanente Colorado, Kaiser Permanente Northwest, Group Health Cooperative, Lovelace, Kaiser Permanente Southern California, Kaiser Permanente Northern California, and Kaiser Permanente Hawaii (in the Western United States)
35. In some cases, race-specific biobanks are also being developed. The African-American DNA biobank at Howard University in Washington, D.C. represents the largest resource of its kind
36.