Human Y-chromosome haplogroup structure is largely circumscribed by continental boundaries. One notable exception to this general pattern is the young haplogroup R1a that exhibits post-Glacial coalescent times and relates the paternal ancestry of more than 10% of men in a wide geographic area extending from South Asia to Central East Europe and South Siberia. Its origin and dispersal patterns are poorly understood as no marker has yet been described that would distinguish European R1a chromosomes from Asian. Here we present frequency and haplotype diversity estimates for more than 2000 R1a chromosomes assessed for several newly discovered SNP markers that introduce the onset of informative R1a subdivisions by geography. Marker M434 has a low frequency and a late origin in West Asia bearing witness to recent gene flow over the Arabian Sea. Conversely, marker M458 has a significant frequency in Europe, exceeding 30% in its core area in Eastern Europe and comprising up to 70% of all M17 chromosomes present there. The diversity and frequency profiles of M458 suggest its origin during the early Holocene and a subsequent expansion likely related to a number of prehistoric cultural developments in the region. Its primary frequency and diversity distribution correlates well with some of the major Central and East European river basins where settled farming was established before its spread further eastward. Importantly, the virtual absence of M458 chromosomes outside Europe speaks against substantial patrilineal gene flow from East Europe to Asia, including to India, at least since the mid-Holocene.
Y chromosome; haplogroup R1a; human evolution; population genetics
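The frequency and haplotype diversity estimates mentioned above can be illustrated with a minimal sketch. It assumes Nei's unbiased estimator of haplotype diversity, which the abstract does not name explicitly. The function name and the STR haplotypes in the example are hypothetical and are not data from the study.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity: n / (n - 1) * (1 - sum(p_i ** 2)).

    `haplotypes` is a sequence of hashable haplotype labels, one per chromosome.
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two chromosomes")
    counts = Counter(haplotypes)
    # Sum of squared haplotype frequencies in the sample
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


# Hypothetical STR haplotypes (tuples of repeat counts), for illustration only.
sample = [(13, 25, 16), (13, 25, 16), (13, 24, 16), (14, 25, 15)]
print(haplotype_diversity(sample))
```

The estimator is 0 when every chromosome carries the same haplotype and 1 when all haplotypes in the sample are distinct, which is why it is commonly read alongside frequency when judging where a lineage is oldest.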
The phylogenetic relationships of numerous branches within the core Y-chromosome haplogroup R-M207 support a West Asian origin of haplogroup R1b, its initial differentiation there followed by a rapid spread of one of its sub-clades carrying the M269 mutation to Europe. Here, we present phylogeographically resolved data for 2043 M269-derived Y-chromosomes from 118 West Asian and European populations assessed for the M412 SNP that largely separates the majority of Central and West European R1b lineages from those observed in Eastern Europe, the Circum-Uralic region, the Near East, the Caucasus and Pakistan. Within the M412 dichotomy, the major S116 sub-clade shows a frequency peak in the upper Danube basin and Paris area with declining frequency toward Italy, Iberia, Southern France and the British Isles. Although this frequency pattern closely approximates the spread of the Linearbandkeramik (LBK) Neolithic culture, an advent that led to a number of prehistoric cultural developments during the past ≤10 thousand years, more complex pre-Neolithic scenarios remain possible for the L23(xM412) components in Southeast Europe and elsewhere.
Y-chromosome; haplogroup R1b; human evolution; population genetics
To better define the structure and origin of the Bulgarian paternal gene pool, we have examined the Y-chromosome variation in 808 Bulgarian males. The analysis was performed by high-resolution genotyping of biallelic markers and by analyzing the STR variation within the most informative haplogroups. We found that the Y-chromosome gene pool in modern Bulgarians is primarily represented by Western Eurasian haplogroups with ∼ 40% belonging to haplogroups E-V13 and I-M423, and 20% to R-M17. Haplogroups common in the Middle East (J and G) and in South Western Asia (R-L23*) occur at frequencies of 19% and 5%, respectively. Haplogroups C, N and Q, distinctive for Altaic and Central Asian Turkic-speaking populations, occur at the negligible frequency of only 1.5%. Principal Component analyses group Bulgarians with European populations, apart from Central Asian Turkic-speaking groups and South Western Asia populations. Within the country, the genetic variation is structured in Western, Central and Eastern Bulgaria indicating that the Balkan Mountains have been permeable to human movements. The lineage analysis provided the following interesting results: (i) R-L23* is present in Eastern Bulgaria since the post glacial period; (ii) haplogroup E-V13 has a Mesolithic age in Bulgaria from where it expanded after the arrival of farming; (iii) haplogroup J-M241 probably reflects the Neolithic westward expansion of farmers from the earliest sites along the Black Sea. On the whole, in light of the most recent historical studies, which indicate a substantial proto-Bulgarian input to the contemporary Bulgarian people, our data suggest that a common paternal ancestry between the proto-Bulgarians and the Altaic and Central Asian Turkic-speaking populations either did not exist or was negligible.
Knowledge of high-resolution Y-chromosome haplogroup diversification within Iran provides important geographic context regarding the spread and compartmentalization of male lineages in the Middle East and southwestern Asia. At present, the Iranian population is characterized by an extraordinary mix of different ethnic groups speaking a variety of Indo-Iranian, Semitic and Turkic languages. Despite these features, only a few studies have investigated the multiethnic components of the Iranian gene pool. In this survey, 938 Iranian male DNAs belonging to 15 ethnic groups from 14 Iranian provinces were analyzed for 84 Y-chromosome biallelic markers and 10 STRs. The results show an autochthonous but non-homogeneous ancient background mainly composed of J2a sub-clades with different external contributions. The phylogeography of the main haplogroups allowed us to identify post-glacial and Neolithic expansions toward western Eurasia, but also recent movements towards the Iranian region from western Eurasia (R1b-L23), Central Asia (Q-M25), Asia Minor (J2a-M92) and southern Mesopotamia (J1-Page08). In spite of the presence of important geographic barriers (the Zagros and Alborz mountain ranges, and the Dasht-e Kavir and Dasht-e Lut deserts) which may have limited gene flow, AMOVA analysis revealed that language, in addition to geography, has played an important role in shaping the present-day Iranian gene pool. Overall, this study provides a portrait of the Y-chromosomal variation in Iran, useful for depicting a more comprehensive history of the peoples of this area as well as for reconstructing ancient migration routes. In addition, our results provide evidence for the important role of the Iranian plateau as source and recipient of gene flow between culturally and genetically distinct populations.
Huntington disease (HD) results from CAG expansion in the huntingtin (HTT) gene. Although HD occurs worldwide, there are large geographic differences in its prevalence. The prevalence in populations derived from Europe is 10–100 times greater than in East Asia. The European general population chromosomes can be grouped into three major haplogroups (group of similar haplotypes): A, B and C. The majority of HD chromosomes in Europe are found on haplogroup A. However, in the East-Asian populations of China and Japan, we find the majority of HD chromosomes are associated with haplogroup C. The highest risk HD haplotypes (A1 and A2), are absent from the general and HD populations of China and Japan, and therefore provide an explanation for why HD prevalence is low in East Asia. Interestingly, both East-Asian and European populations share a similar low level of HD on haplogroup C. Our data are consistent with the hypothesis that different HTT haplotypes have different mutation rates, and geographic differences in HTT haplotypes explain the difference in HD prevalence. Further, the bias for expansion on haplogroup C in the East-Asian population cannot be explained by a higher average CAG size, as haplogroup C has a lower average CAG size in the general East-Asian population compared with other haplogroups. This finding suggests that CAG-tract size is not the only factor important for CAG instability. Instead, the expansion bias may be because of genetic cis-elements within the haplotype that influence CAG instability in HTT, possibly through different mutational mechanisms for the different haplogroups.
Huntington disease; prevalence; CAG expansion; CAG instability; haplotypes; Cis-elements
To determine the human Y-chromosome haplogroup backgrounds of intermediate-sized variant alleles displayed by short tandem repeat (STR) loci DYS392, DYS449, and DYS385, and to evaluate the potential of each intermediate variant to elucidate new phylogenetic substructure within the human Y-chromosome haplogroup tree.
Molecular characterization of lineages was achieved using a combination of Y-chromosome haplogroup defining binary polymorphisms and up to 37 short tandem repeat loci. DNA sequencing and median-joining network analyses were used to evaluate Y-chromosome lineages displaying intermediate variant alleles.
We show that DYS392.2 occurs on a single haplogroup background, specifically I1*-M253, and likely represents a new phylogenetic subdivision in this European haplogroup. Intermediate variants DYS449.2 and DYS385.2 both occur on multiple haplogroup backgrounds, and when evaluated within specific haplogroup contexts, delineate new phylogenetic substructure, with DYS449.2 being informative within haplogroup A-P97 and DYS385.2 in haplogroups D-M145, E1b1a-M2, and R1b*-M343. Sequence analysis of variant alleles observed within the various haplogroup backgrounds showed that the nature of the intermediate variant differed, confirming the mutations arose independently.
Y-chromosome short tandem repeat intermediate variant alleles, while relatively rare, typically occur on multiple haplogroup backgrounds. This distribution indicates that such mutations arise at a rate generally intermediate to those of binary markers and Y-STR loci. As a result, intermediate-sized Y-STR variants can reveal phylogenetic substructure within the Y-chromosome phylogeny not currently detected by either binary or Y-STR markers alone, but only when such variants are evaluated within a haplogroup context.
Archaeological studies have revealed a series of cultural changes around the Last Glacial Maximum in East Asia; whether these changes left any signatures in the gene pool of East Asians remains unclear. To achieve deeper insights into the demographic history of modern humans in East Asia around the Last Glacial Maximum, we extensively analyzed mitochondrial DNA haplogroup M9a'b, a specific haplogroup that was suggested to have some potential for tracing the migration around the Last Glacial Maximum in East Eurasia.
A total of 837 M9a'b mitochondrial DNAs (583 from the literature, while the remaining 254 were newly collected in this study) pinpointed from over 28,000 subjects residing across East Eurasia were studied here. Fifty-nine representative samples were further selected for total mitochondrial DNA sequencing so we could better understand the phylogeny within M9a'b. Based on the updated phylogeny, an extensive phylogeographic analysis was carried out to reveal the differentiation of haplogroup M9a'b and to reconstruct the dispersal histories.
Our results indicated that southern China and/or Southeast Asia likely served as the source of some post-Last Glacial Maximum dispersal(s). The detailed dissection of haplogroup M9a'b revealed the existence of an inland dispersal in mainland East Asia during the post-glacial period. It was this dispersal that expanded not only to western China but also to northeast India and the south Himalaya region. A similar phylogeographic distribution pattern was also observed for haplogroup F1c, thus substantiating our proposition. This inland post-glacial dispersal was in agreement with the spread of the Mesolithic culture originating in South China and northern Vietnam.
Most heritable surnames, like Y chromosomes, are passed from father to son. These unique cultural markers of coancestry might therefore have a genetic correlate in shared Y chromosome types among men sharing surnames, although the link could be affected by mutation, multiple founders for names, nonpaternity, and genetic drift. Here, we demonstrate through an analysis of 1,678 Y-chromosomal haplotypes within 40 British surnames a remarkably high degree of coancestry that generally increases as surnames become rarer. On average, the proportion of haplotypes lying within descent clusters is 62% but ranges from 0% to 87%. The shallow time depth of many descent clusters within names, the lack of a detectable effect of surname derivation on diversity, and simulations of surname descent suggest that genetic drift through variation in reproductive success is important in structuring haplotype diversity. Modern patterns therefore provide little reliable information about the original founders of surnames some 700 years ago. A comparative analysis of published data on Y diversity within Irish surnames demonstrates a relative lack of surname frequency dependence of coancestry, a difference probably mediated through distinct Irish and British demographic histories including even more marked genetic drift in Ireland.
surnames; Y chromosome; haplotype; haplogroup; genetic drift
The tribe Bovini contains a number of commercially and culturally important species, such as cattle. Understanding their evolutionary time scale is important for distinguishing between post-glacial and domestication-associated population expansions, but estimates of bovine divergence times have been hindered by a lack of reliable calibration points. We present a Bayesian phylogenetic analysis of 481 mitochondrial D-loop sequences, including 228 radiocarbon-dated ancient DNA sequences, using a multi-demographic coalescent model. By employing the radiocarbon dates as internal calibrations, we co-estimate the bovine phylogeny and divergence times in a relaxed-clock framework. The analysis yields evidence for significant population expansions in both taurine and zebu cattle, European aurochs and yak clades. The divergence age estimates support domestication-associated expansion times (less than 12 kyr) for the major haplogroups of cattle. We compare the molecular and palaeontological estimates for the Bison–Bos divergence.
divergence times; demographic model; population expansion; ancient DNA; time dependency
It has been proposed that the distribution patterns and coalescence ages found in Europeans for mitochondrial DNA (mtDNA) haplogroups V, H1 and H3 are the result of a post-glacial expansion from a Franco-Cantabrian refuge that recolonized central and northern areas. In contrast, in this refined mtDNA study of the Cantabrian Cornice that contributes 413 partial and 9 complete new mtDNA sequences, including a large Basque sample and a sample of Asturians, no experimental evidence was found to support the human refuge-expansion theory. In fact, all measures of gene diversity point to the Cantabrian Cornice in general and the Basques in particular, as less polymorphic for V, H1 and H3 than other southern regions in Iberia or in Central Europe. Genetic distances show the Cantabrian Cornice is a very heterogeneous region with significant local differences. The analysis of several minor subhaplogroups, based on complete sequences, also suggests different focal expansions over a local and peninsular range that did not affect continental Europe. Furthermore, all detected clinal trends show stronger longitudinal than latitudinal profiles. In Northern Iberia, it seems that the highest diversity values for some haplogroups with Mesolithic coalescence ages are centred on the Mediterranean side, including Catalonia and South-eastern France.
mtDNA haplogroups; humans; Franco-Cantabrian refuge theory
The Tuareg presently live in the Sahara and the Sahel. Their ancestors are commonly believed to be the Garamantes of the Libyan Fezzan, ever since it was suggested by authors of antiquity. Biological evidence, based on classical genetic markers, however, indicates kinship with the Beja of Eastern Sudan. Our study of mitochondrial DNA (mtDNA) sequences and Y chromosome SNPs of three different southern Tuareg groups from Mali, Burkina Faso and the Republic of Niger reveals a West Eurasian-North African composition of their gene pool. The data show that certain genetic lineages could not have been introduced into this population earlier than ∼9000 years ago whereas local expansions establish a minimal date at around 3000 years ago. Some of the mtDNA haplogroups observed in the Tuareg population were involved in the post-Last Glacial Maximum human expansion from Iberian refugia towards both Europe and North Africa. Interestingly, no Near Eastern mtDNA lineages connected with the Neolithic expansion have been observed in our population sample. On the other hand, the Y chromosome SNPs data show that the paternal lineages can very probably be traced to the Near Eastern Neolithic demic expansion towards North Africa, a period that is otherwise concordant with the above-mentioned mtDNA expansion. The time frame for the migration of the Tuareg towards the African Sahel belt overlaps that of early Holocene climatic changes across the Sahara (from the optimal greening ∼10 000 YBP to the extant aridity beginning at ∼6000 YBP) and the migrations of other African nomadic peoples in the area.
Tuareg; genetic diversity; phylogeography
Linguistic and genetic studies on Roma populations inhabiting Europe have unequivocally traced these populations to the Indian subcontinent. However, the exact parental population group and time of the out-of-India dispersal have remained disputed. In the absence of archaeological records and with only scanty historical documentation of the Roma, comparative linguistic studies were the first to identify their Indian origin. Recently, molecular studies on the basis of disease-causing mutations and haploid DNA markers (i.e. mtDNA and Y-chromosome) supported the linguistic view. The presence of Indian-specific Y-chromosome haplogroup H1a1a-M82 and mtDNA haplogroups M5a1, M18 and M35b among Roma has corroborated their South Asian origins and later admixture with Near Eastern and European populations. However, previous studies have left unanswered questions about the exact parental population groups in South Asia. Here we present a detailed phylogeographical study of Y-chromosomal haplogroup H1a1a-M82 in a data set of more than 10,000 global samples to discern a more precise ancestral source of European Romani populations. The phylogeographical patterns and diversity estimates indicate an early origin of this haplogroup in the Indian subcontinent and its further expansion to other regions. Tellingly, the short tandem repeat (STR) based network of H1a1a-M82 lineages displayed the closest connection of Romani haplotypes with the traditional scheduled caste and scheduled tribe population groups of northwestern India.
While it is generally accepted that patterns of intra-specific genetic differentiation are substantially affected by glacial history, population genetic processes occurring during Pleistocene glaciations are still poorly understood. In this study, we address the question of the genetic consequences of Pleistocene glaciations for European grey wolves. Combining our data with data from published studies, we analysed phylogenetic relationships and geographic distribution of mitochondrial DNA haplotypes for 947 contemporary European wolves. We also compared the contemporary wolf sequences with published sequences of 24 ancient European wolves.
We found that haplotypes representing two haplogroups, 1 and 2, overlap geographically, but substantially differ in frequency between populations from south-western and eastern Europe. A comparison between haplotypes from Europe and other continents showed that both haplogroups are spread throughout Eurasia, while only haplogroup 1 occurs in contemporary North American wolves. All ancient wolf samples from western Europe that dated from between 44,000 and 1,200 years B.P. belonged to haplogroup 2, suggesting the long-term predominance of this haplogroup in this region. Moreover, a comparison of current and past frequencies and distributions of the two haplogroups in Europe suggested that haplogroup 2 became outnumbered by haplogroup 1 during the last several thousand years.
Parallel haplogroup replacement, with haplogroup 2 being totally replaced by haplogroup 1, has been reported for North American grey wolves. Taking into account the similarity of diets reported for the late Pleistocene wolves from Europe and North America, the correspondence between these haplogroup frequency changes may suggest that they were associated with ecological changes occurring after the Last Glacial Maximum.
It is generally accepted that the most ancient European mitochondrial haplogroup, U5, has evolved essentially in Europe. To resolve the phylogeny of this haplogroup, we completely sequenced 113 mitochondrial genomes (79 U5a and 34 U5b) of central and eastern Europeans (Czechs, Slovaks, Poles, Russians and Belorussians), and reconstructed a detailed phylogenetic tree that incorporates previously published data. Molecular dating suggests that the coalescence time estimate for U5 is ∼25–30 thousand years (ky), and ∼16–20 and ∼20–24 ky for its subhaplogroups U5a and U5b, respectively. Phylogeographic analysis reveals that expansions of U5 subclusters started earlier in central and southern Europe than in eastern Europe. In addition, during the Last Glacial Maximum central Europe (probably the Carpathian Basin) apparently represented the area of intermingling between human flows from refugial zones in the Balkans, the Mediterranean coastline and the Pyrenees. Age estimates of ∼15 ky or less for many U5 subclusters in eastern Europeans are consistent with the view that during the Ice Age eastern Europe was an inhospitable place for modern humans.
Studies of both survival after sepsis and sperm motility in human populations have shown significant associations with common European mitochondrial DNA haplogroups, and have led to proposals that mitochondria bearing haplogroup H have different bioenergetic capacities than those bearing haplogroup T. However, the validity of such associations assumes that there are no non-random influences of nuclear genes or other factors. Here, we removed the effect of any differences in nuclear genes by constructing transmitochondrial cybrids harbouring mitochondria with either haplogroup H or haplogroup T in cultured A549 human lung carcinoma cells with identical nuclear backgrounds. We compared the bioenergetic capacities and coupling efficiencies of mitochondria isolated from these cells, and of mitochondria retained within the cells, as a critical experimental test of the hypothesis that these haplogroups affect mitochondrial bioenergetics. We found that there were no functionally-important bioenergetic differences between mitochondria bearing these haplogroups, using either isolated mitochondria or mitochondria within cells.
ρ0, mtDNA-less; Δψ, mitochondrial membrane potential; TPMP, triphenylmethylphosphonium; FCCP, carbonyl cyanide p-trifluoromethoxyphenylhydrazone; Oxidative phosphorylation; Coupling efficiency; Sepsis; Sperm motility; Cybrid
The host genetic basis of differential outcomes in HIV infection, progression, viral load set point and highly active antiretroviral therapy (HAART) responses was examined for the common Y haplogroups in European Americans and African Americans. Accelerated progression to acquired immune deficiency syndrome (AIDS) and related death was discovered among European American subjects with Y chromosome haplogroup I (Y-I). Additionally, Y-I haplogroup subjects on HAART took a longer time to HIV-1 viral suppression and were more likely to fail HAART. Both the accelerated progression and longer time to viral suppression results observed in haplogroup Y-I were significant after false-discovery-rate corrections. A higher frequency of AIDS-defining illnesses was also observed in haplogroup Y-I. These effects were independent of the previously identified autosomal AIDS restriction genes. When the Y-I haplogroup subjects were further subdivided into six I subhaplogroups, no one subhaplogroup accounted for the effects on HIV progression, viral load or HAART response. Adjustment of the analyses for population stratification found significant and concordant haplogroup Y-I results. The Y chromosome haplogroup analyses of HIV infection and progression in African Americans were not significant. Our results suggest that one or more loci on the Y chromosome found on haplogroup Y-I have an effect on AIDS progression and treatment responses in European Americans.
Population history can be reflected in group genetic ancestry, where genomic variation captured by the mitochondrial DNA (mtDNA) and non-recombining portion of the Y chromosome (NRY) can separate female- and male-specific admixture processes. Genetic ancestry may influence genetic association studies due to differences in individual admixture within recently admixed populations like African Americans.
We evaluated the genetic ancestry of Senegalese as well as European Americans and African Americans from Philadelphia. Senegalese mtDNA consisted of ∼12% U haplotypes (U6 and U5b1b haplotypes, common in North Africa) while the NRY haplotypes belonged solely to haplogroup E. In Philadelphia, we observed varying degrees of admixture. While African Americans have 9–10% mtDNAs and ∼31% NRYs of European origin, these results are not mirrored in the mtDNA/NRY pools of European Americans: they have less than 7% mtDNAs and less than 2% NRYs from non-European sources. Additionally, there is <2% Native American contribution to Philadelphian African American ancestry and the admixture from combined mtDNA/NRY estimates is consistent with the admixture derived from autosomal genetic data. To further dissect these estimates, we have analyzed our samples in the context of different demographic groups in the Americas.
We found that sex-biased admixture in African-derived populations is present throughout the Americas, with continual influence of European males, while Native American females contribute mainly to populations of the Caribbean and South America. The high non-European female contribution to the pool of European-derived populations is consistently characteristic of Iberian colonization. These data suggest that genomic data correlate well with historical records of colonization in the Americas.
Mitochondrial haplogroups could influence individual susceptibility to mitochondrial DNA (mtDNA) damage, and human longevity, as indicated by previous studies with Caucasian (European) or Asian cohorts. Here, we compared the frequency of mtDNA haplogroups in a group of Spanish (Caucasian) centenarians (n = 65, aged 100–108 years, 58 women, most from the central part of Spain) and a group of healthy young adults (n = 138, 62 women, aged 20–40 years) of the same ethnic origin. We did not find significant differences between centenarians and the control group (P > 0.2). Only two centenarians (both women) had the haplogroup J, which hampered comparison with the control group (n = 15, five women). Our data confirm that the potential effects of mitochondrial haplogroups on human longevity might be population/geographic specific, with important differences between studies (notably, with regard to the previously reported potential benefit brought about by the haplogroup J) arising from the different living environment and ethnic background of the study cohorts.
Genetics; Mitochondria; Centenarians
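The haplogroup J counts reported above (2 of 65 centenarians versus 15 of 138 controls) form a 2x2 table with a small cell, the situation in which an exact test is usually preferred. The abstract does not state which test the authors applied, so the following is only a sketch, assuming a two-sided Fisher's exact test. The function name is hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table with the same
    margins that is no more probable than the observed one.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Probability that the top-left cell equals x given fixed margins
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    total = sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9))
    return min(1.0, total)


# Counts taken from the abstract: carriers and non-carriers of haplogroup J.
centenarians_j, centenarians_n = 2, 65
controls_j, controls_n = 15, 138
p_value = fisher_exact_two_sided(
    centenarians_j, centenarians_n - centenarians_j,
    controls_j, controls_n - controls_j,
)
print(p_value)
```

With only two carriers in one group, any such p-value is sensitive to a change of a single individual, which matches the abstract's remark that the low count hampered comparison.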
This study aims to establish the likely origin of EEJ (Eastern European Jews) by genetic distance analysis of autosomal markers and haplogroups on the X and Y chromosomes and mtDNA.
According to the autosomal polymorphisms, the investigated Jewish populations do not share a common origin, and EEJ are closer to Italians in particular and to Europeans in general than to the other Jewish populations. The similarity of EEJ to Italians and Europeans is also supported by the X-chromosomal haplogroups. In contrast, according to the Y-chromosomal haplogroups, EEJ are closest to the non-Jewish populations of the Eastern Mediterranean. MtDNA shows a mixed pattern, but overall EEJ are more distant from most populations and hold a marginal rather than a central position. The autosomal genetic distance matrix has a very high correlation (0.789) with geography, whereas the X-chromosomal, Y-chromosomal and mtDNA matrices have lower correlations (0.540, 0.395 and 0.641, respectively).
The close genetic resemblance to Italians accords with the historical presumption that Ashkenazi Jews started their migrations across Europe in Italy and with historical evidence that conversion to Judaism was common in ancient Rome. The reasons for the discrepancy between the biparental markers and the uniparental markers are discussed.
This article was reviewed by Damian Labuda (nominated by Jerzy Jurka), Kateryna Makova and Qasim Ayub (nominated by Dan Graur).
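The correlations between genetic distance matrices and geography reported above can be illustrated with a short sketch. The abstract does not specify how the correlation was computed, so this assumes a Pearson correlation over the upper-triangle entries of two symmetric distance matrices, the statistic used in a Mantel-type comparison. The function names and the toy matrices are hypothetical.

```python
from math import sqrt


def upper_triangle(matrix):
    """Return the entries above the diagonal of a square matrix, row by row."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for constant input")
    return sxy / sqrt(sxx * syy)


def matrix_correlation(d1, d2):
    """Correlate two symmetric distance matrices over the same populations."""
    return pearson(upper_triangle(d1), upper_triangle(d2))


# Toy 3-population matrices (hypothetical values, not from the study).
genetic = [[0.0, 0.1, 0.4], [0.1, 0.0, 0.3], [0.4, 0.3, 0.0]]
geographic = [[0, 500, 2500], [500, 0, 2100], [2500, 2100, 0]]
print(matrix_correlation(genetic, geographic))
```

Because the entries of a distance matrix are not independent, the significance of such a coefficient is normally assessed by permuting population labels rather than by a standard correlation test.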
The extinct aurochs (Bos primigenius primigenius) was a large type of cattle that ranged over almost the whole Eurasian continent. The aurochs is the wild progenitor of modern cattle, but it is unclear whether European aurochs contributed to this process. To provide new insights into the demographic history of aurochs and domestic cattle, we have generated high-confidence mitochondrial DNA sequences from 59 archaeological skeletal finds, which were attributed to wild European cattle populations based on their chronological date and/or morphology. All pre-Neolithic aurochs belonged to the previously designated P haplogroup, indicating that this represents the Late Glacial Central European signature. We also report one new and highly divergent haplotype in a Neolithic aurochs sample from Germany, which points to greater variability during the Pleistocene. Furthermore, the Neolithic and Bronze Age samples that were classified with confidence as European aurochs using morphological criteria all carry P haplotype mitochondrial DNA, suggesting continuity of Late Glacial and Early Holocene aurochs populations in Europe. Bayesian analysis indicates that recent population growth gives a significantly better fit to our data than a constant-sized population, an observation consistent with a postglacial expansion scenario, possibly from a single European refugial population. Previous work has shown that most ancient and modern European domestic cattle carry haplotypes previously designated T. This, in combination with our new finding of a T haplotype in a very Early Neolithic site in Syria, lends persuasive support to a scenario whereby gracile Near Eastern domestic populations, carrying predominantly T haplotypes, replaced P haplotype-carrying robust autochthonous aurochs populations in Europe, from the Early Neolithic onward. During the period of coexistence, it appears that domestic cattle were kept separate from wild aurochs and introgression was
ancient DNA; aurochs; starburst network; mitochondrial haplotypes; domestication
The Koreans are generally considered a northeast Asian group because of their geographical location. However, recent findings from Y chromosome studies showed that the Korean population contains lineages from both southern and northern parts of East Asia. To understand the genetic history and relationships of Korea more fully, additional data and analyses are necessary.
Methodology and Results
We analyzed mitochondrial DNA (mtDNA) sequence variation in the hypervariable segments I and II (HVS-I and HVS-II) and haplogroup-specific mutations in coding regions in 445 individuals from seven East Asian populations (Korean, Korean-Chinese, Mongolian, Manchurian, Han (Beijing), Vietnamese and Thai). In addition, published mtDNA haplogroup data (N = 3307), mtDNA HVS-I sequences (N = 2313), Y chromosome haplogroup data (N = 1697) and Y chromosome STR data (N = 2713) were analyzed to elucidate the genetic structure of East Asian populations. All the mtDNA profiles studied here were classified into subsets of haplogroups common in East Asia, with just two exceptions. In general, the Korean mtDNA profiles revealed similarities to other northeastern Asian populations in analyses of individual haplogroup distributions, genetic distances between populations and molecular variance, although a minor southern contribution was also suggested. Reanalysis of Y-chromosomal data confirmed both the overall similarity to other northeastern populations and a larger paternal contribution from southeastern populations.
The present work provides evidence that the peopling of Korea can be seen as a complex process, interpreted as an early northern Asian settlement with at least one subsequent male-biased southern-to-northern migration, possibly associated with the spread of rice agriculture.
The geographic distribution of genetic diversity in domestic animals, particularly in mitochondrial DNA, has often been used to infer centers of domestication. The underlying presumption is that phylogeographic patterns among domesticates were established during, or shortly after, domestication. Human activities are assumed not to have altered the haplogroup frequencies to any great extent. We tested this hypothesis by analyzing 24 mtDNA sequences from ancient Scandinavian dogs. Breeds originating in northern Europe are characterized by a high frequency of mtDNA sequences belonging to a haplogroup that is rare in other populations (HgD). This has been suggested to indicate a possible origin of the haplogroup (perhaps even a separate domestication) in central or northern Europe.
The sequences observed in the ancient samples do not include the haplogroup indicative of northern European breeds (HgD). Instead, several of them correspond to haplogroups that are uncommon in the region today and that are thought to have an Asian origin.
We find no evidence for local domestication. We conclude that the processes responsible for current domestic haplogroup frequencies should be interpreted with caution if the interpretation is based only on contemporary data. Such frequencies tell not only the story of the animals themselves, but also that of humans.
We present a method for improving the power of linkage analysis by detecting chromosome segments shared identical by descent (IBD) by individuals not known to be related. Existing Markov chain Monte Carlo methods sample descent patterns on pedigrees conditional on observed marker data. These patterns can be stored as IBD graphs, which express shared ancestry only, rather than specific family relationships. A model for IBD between unrelated individuals allows the estimation of coancestry between individuals in different pedigrees. IBD graphs on separate pedigrees can then be combined using these estimates. We report results from analyses of three sets of simulated marker data on two different pedigrees. We show that when families share a gene for a trait due to shared ancestry on the order of tens of generations, our method can detect a linkage signal when independent analyses of the families do not.
linkage; pedigrees; gene coancestry; IBD
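The reference to shared ancestry "on the order of tens of generations" can be made concrete with a standard back-of-the-envelope calculation. The sketch below is not the method of the abstract above. It assumes a single common ancestor, crossovers occurring as a Poisson process without interference (the Haldane model), and a path of 2g meioses between two individuals whose common ancestor lived g generations ago. The function names are hypothetical.

```python
import math


def expected_ibd_segment_cm(generations: int) -> float:
    """Mean length (in cM) of an IBD segment shared through one common
    ancestor `generations` back, assuming Poisson crossovers (Haldane model).

    The two individuals are separated by 2 * generations meioses, so segment
    lengths are exponential with rate 2 * generations per Morgan.
    """
    if generations < 1:
        raise ValueError("generations must be at least 1")
    meioses = 2 * generations
    return 100.0 / meioses  # 1 Morgan = 100 cM


def prob_segment_unbroken(generations: int, distance_morgans: float) -> float:
    """Probability that no crossover falls within `distance_morgans` in any
    of the 2 * generations meioses linking the two individuals."""
    if generations < 1 or distance_morgans < 0:
        raise ValueError("need generations >= 1 and distance_morgans >= 0")
    return math.exp(-2 * generations * distance_morgans)


if __name__ == "__main__":
    for g in (10, 20, 50):
        print(g, expected_ibd_segment_cm(g), prob_segment_unbroken(g, 0.05))
```

Under these assumptions a common ancestor ten generations back gives a mean shared segment of about 5 cM, which illustrates why dense marker data are needed to detect such sharing between nominally unrelated individuals.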
The ability of the Y chromosome to retain a record of its evolution has seen it become an essential tool of molecular anthropology. In the last few years, however, it has also found use in forensic genetics, providing information on the geographic origin of individuals. This has been aided by the development of efficient screening methods and an increased knowledge of geographic distribution. In this study, we describe the development of single base extension assays used to resolve 61 Y chromosome haplogroups, mainly within haplogroups A, B and E, found in Africa.
Seven multiplex assays, which incorporated 60 Y chromosome markers, were developed. These resolved Y chromosomes to 61 terminal branches of the major African haplogroups A, B and E, while also including a few Eurasian haplogroups found occasionally in African males. Following their validation, the assays were used to screen 683 individuals from Southern Africa, including southeastern Bantu speakers (BAN), Khoe-San (KS) and South African Whites (SAW). Of the 61 haplogroups that the assays collectively resolved, 26 were found in the 683 samples. While haplogroup sharing was common between the BAN and KS, the frequencies of these haplogroups varied appreciably. Both groups showed low levels of assimilation of Eurasian haplogroups and only two individuals in the SAW clearly had Y chromosomes of African ancestry.
The use of these single base extension assays in screening increased haplogroup resolution and sampling throughput, while saving time and DNA. Their use, together with the screening of short tandem repeat markers, would considerably improve resolution, thus refining the geographic ancestry of individuals.
Modern linkage-based approaches employing extended pedigrees are becoming powerful tools for localizing complex quantitative trait loci. For these linkage mapping methods, it is necessary to reconstruct extended pedigrees that include living individuals, using extensive pedigree records. Unfortunately, such records are not always easy to obtain, and application of the linkage-based approaches has been restricted. Within a finite population under random mating, latent inbreeding, rather than non-random inbreeding by consanguineous marriages, is expected to occur and is attributable to coalescence in a finite population. Interestingly, it has been revealed that significant random inbreeding exists even in general human populations. Random inbreeding can therefore be used to detect hidden coancestry between individuals at a particular chromosomal position, and could also be applied in linkage mapping methods. Here we present a novel method, named the finite population based linkage mapping (FPL) method, to detect linkage between a quantitative trait and a marker via random inbreeding in a finite population without pedigree records. We show how to estimate coancestry between individuals at a chromosomal position by using multipoint Bayesian estimation. Subsequently, we describe the FPL method for detecting linkage via interval mapping using a nonparametric test. We show with simulated data that the FPL method works. For a random sample from a finite population, the FPL method is more powerful than a standard pedigree-based linkage mapping method that uses the genotypes of all parents of the sample. In addition, the FPL method was applied to actual microsatellite genotype data from 750 Japanese individuals, unrelated according to pedigree records, to map a known psoriasis susceptibility locus.
For samples without pedigree records, the results suggest that the FPL method requires only a limited number of individuals and would therefore be preferable to other methods that use thousands of individuals.
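As background to the notion of random inbreeding in a finite population, the classical result for an idealized randomly mating diploid population of constant size N (the Wright-Fisher model) gives the expected inbreeding coefficient after t generations as F_t = 1 - (1 - 1/(2N))^t. The sketch below only evaluates this textbook formula. It is not the FPL method or its multipoint Bayesian coancestry estimator, and the function name and the population sizes are illustrative assumptions.

```python
def expected_inbreeding(pop_size: int, generations: int) -> float:
    """Expected inbreeding coefficient F_t after `generations` generations
    of random mating in an idealized diploid population of constant size
    `pop_size` (Wright-Fisher model), starting from F_0 = 0.

    F_t = 1 - (1 - 1 / (2N)) ** t
    """
    if pop_size < 1:
        raise ValueError("pop_size must be at least 1")
    if generations < 0:
        raise ValueError("generations must be non-negative")
    return 1.0 - (1.0 - 1.0 / (2.0 * pop_size)) ** generations


if __name__ == "__main__":
    # Hypothetical population sizes, chosen only for illustration.
    for n in (1_000, 10_000, 100_000):
        print(n, [round(expected_inbreeding(n, t), 5) for t in (10, 50, 100)])
```

The values grow slowly with t and shrink with N, which is consistent with the description of random inbreeding as latent: small per pair of individuals, yet nonzero in any finite population.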