We have analyzed 7137 samples from 125 different caste, tribal and religious groups of India and 99 samples from three populations of Nepal for the length variation in the COII/tRNALys region of mtDNA. Samples showing length variation were subjected to detailed phylogenetic analysis based on HVS-I and informative coding region sequence variation. The overall frequencies of the 9-bp deletion and insertion variants in South Asia were 1.8% and 0.5%, respectively. We have also defined a novel deep-rooting haplogroup M43 and identified the rare haplogroup H14 in Indian populations carrying the 9bp-deletion by complete mtDNA sequencing. Moreover, we redefined haplogroup M6 and dissected it into two well-defined subclades. The presence of haplogroups F1 and B5a in Uttar Pradesh suggests minor maternal contribution from Southeast Asia to Northern India. The occurrence of haplogroup F1 in the Nepalese sample implies that Nepal might have served as a bridge for the flow of eastern lineages to India. The presence of R6 in the Nepalese, on the other hand, suggests that the gene flow between India and Nepal has been reciprocal.
South Asia; 9bp indel; mtDNA; Haplogroup
We have analyzed 7,137 samples from 125 different caste, tribal and religious groups of India and 99 samples from three populations of Nepal for the length variation in the COII/tRNALys region of mtDNA. Samples showing length variation were subjected to detailed phylogenetic analysis based on HVS-I and informative coding region sequence variation. The overall frequencies of the 9-bp deletion and insertion variants in South Asia were 1.9 and 0.6%, respectively. We have also defined a novel deep-rooting haplogroup M43 and identified the rare haplogroup H14 in Indian populations carrying the 9-bp deletion by complete mtDNA sequencing. Moreover, we redefined haplogroup M6 and dissected it into two well-defined subclades. The presence of haplogroups F1 and B5a in Uttar Pradesh suggests minor maternal contribution from Southeast Asia to Northern India. The occurrence of haplogroup F1 in the Nepalese sample implies that Nepal might have served as a bridge for the flow of eastern lineages to India. The presence of R6 in the Nepalese, on the other hand, suggests that the gene flow between India and Nepal has been reciprocal.
South Asia; 9bp indel; mtDNA; Haplogroup
Macrohaplogroups 'M' and 'N' have evolved almost in parallel from a founder haplogroup L3. Macrohaplogroup N in India has already been defined in previous studies and recently the macrohaplogroup M among the Indian populations has been characterized. In this study, we attempted to reconstruct and re-evaluate the phylogeny of Macrohaplogroup M, which harbors more than 60% of the Indian mtDNA lineage, and to shed light on the origin of its deep rooting haplogroups.
Using 11 whole mtDNA and 2231 partial coding sequence of Indian M lineage selected from 8670 HVS1 sequences across India, we have reconstructed the tree including Andamanese-specific lineage M31 and calculated the time depth of all the nodes. We defined one novel haplogroup M41, and revised the classification of haplogroups M3, M18, and M31.
Our result indicates that the Indian mtDNA pool consists of several deep rooting lineages of macrohaplogroup 'M' suggesting in-situ origin of these haplogroups in South Asia, most likely in the India. These deep rooting lineages are not language specific and spread over all the language groups in India. Moreover, our reanalysis of the Andamanese-specific lineage M31 suggests population specific two clear-cut subclades (M31a1 and M31a2). Onge and Jarwa share M31a1 branch while M31a2 clade is present in only Great Andamanese individuals. Overall our study supported the one wave, rapid dispersal theory of modern humans along the Asian coast.
In search of the ancestors of Native American mitochondrial DNA (mtDNA) haplogroups, we analyzed the mtDNA of 531 individuals from nine indigenous populations in Siberia. All mtDNAs were subjected to high-resolution RFLP analysis, sequencing of the control-region hypervariable segment I (HVS-I), and surveyed for additional polymorphic markers in the coding region. Furthermore, the mtDNAs selected according to haplogroup/subhaplogroup status were completely sequenced. Phylogenetic analyses of the resulting data, combined with those from previously published Siberian arctic and sub-arctic populations, revealed that remnants of the ancient Siberian gene pool are still evident in Siberian populations, suggesting that the founding haplotypes of the Native American A–D branches originated in different parts of Siberia. Thus, lineage A complete sequences revealed in the Mansi of the Lower Ob and the Ket of the Lower Yenisei belong to A1, suggesting that A1 mtDNAs occasionally found in the remnants of hunting-gathering populations of northwestern and northern Siberia belonged to a common gene pool of the Siberian progenitors of Paleoindians. Moreover, lineage B1, which is the most closely related to the American B2, occurred in the Tubalar and Tuvan inhabiting the territory between the upper reaches of the Ob River in the west, to the Upper Yenisei region in the east. Finally, the sequence variants of haplogroups C and D, which are most similar to Native American C1 and D1, were detected in the Ulchi of the Lower Amur. Overall, our data suggest that the immediate ancestors of the Siberian/Beringian migrants who gave rise to ancient (pre-Clovis) Paleoindians have a common origin with aboriginal people of the area now designated the Altai-Sayan Upland, as well as the Lower Amur/Sea of Okhotsk region.
mtDNA variation; native Siberians; Native Americans
Recent advances in the understanding of the maternal and paternal heritage of south and southwest Asian populations have highlighted their role in the colonization of Eurasia by anatomically modern humans. Further understanding requires a deeper insight into the topology of the branches of the Indian mtDNA phylogenetic tree, which should be contextualized within the phylogeography of the neighboring regional mtDNA variation. Accordingly, we have analyzed mtDNA control and coding region variation in 796 Indian (including both tribal and caste populations from different parts of India) and 436 Iranian mtDNAs. The results were integrated and analyzed together with published data from South, Southeast Asia and West Eurasia.
Four new Indian-specific haplogroup M sub-clades were defined. These, in combination with two previously described haplogroups, encompass approximately one third of the haplogroup M mtDNAs in India. Their phylogeography and spread among different linguistic phyla and social strata was investigated in detail. Furthermore, the analysis of the Iranian mtDNA pool revealed patterns of limited reciprocal gene flow between Iran and the Indian sub-continent and allowed the identification of different assemblies of shared mtDNA sub-clades.
Since the initial peopling of South and West Asia by anatomically modern humans, when this region may well have provided the initial settlers who colonized much of the rest of Eurasia, the gene flow in and out of India of the maternally transmitted mtDNA has been surprisingly limited. Specifically, our analysis of the mtDNA haplogroups, which are shared between Indian and Iranian populations and exhibit coalescence ages corresponding to around the early Upper Paleolithic, indicates that they are present in India largely as Indian-specific sub-lineages. In contrast, other ancient Indian-specific variants of M and R are very rare outside the sub-continent.
We analyzed the frequency of four mitochondrial DNA haplogroups in 424 individuals from 21 Colombian Amerindian tribes. Our results showed a high degree of mtDNA diversity and genetic heterogeneity. Frequencies of mtDNA haplogroups A and C were high in the majority of populations studied. The distribution of these four mtDNA haplogroups from Amerindian populations was different in the northern region of the country compared to those in the south. Haplogroup A was more frequently found among Amerindian tribes in northern Colombia, while haplogroup D was more frequent among tribes in the south. Haplogroups A, C and D have clinal tendencies in Colombia and South America in general. Populations belonging to the Chibcha linguistic family of Colombia and other countries nearby showed a strong genetic differentiation from the other populations tested, thus corroborating previous findings. Genetically, the Ingano, Paez and Guambiano populations are more closely related to other groups of south eastern Colombia, as also inferred from other genetic markers and from archeological data. Strong evidence for a correspondence between geographical and linguistic classification was found, and this is consistent with evidence that gene flow and the exchange of customs and knowledge and language elements between groups is facilitated by close proximity.
mitochondrial DNA; Amerindian; Colombia; Chibcha; genetic relationships
Linguistic and genetic studies on Roma populations inhabited in Europe have unequivocally traced these populations to the Indian subcontinent. However, the exact parental population group and time of the out-of-India dispersal have remained disputed. In the absence of archaeological records and with only scanty historical documentation of the Roma, comparative linguistic studies were the first to identify their Indian origin. Recently, molecular studies on the basis of disease-causing mutations and haploid DNA markers (i.e. mtDNA and Y-chromosome) supported the linguistic view. The presence of Indian-specific Y-chromosome haplogroup H1a1a-M82 and mtDNA haplogroups M5a1, M18 and M35b among Roma has corroborated that their South Asian origins and later admixture with Near Eastern and European populations. However, previous studies have left unanswered questions about the exact parental population groups in South Asia. Here we present a detailed phylogeographical study of Y-chromosomal haplogroup H1a1a-M82 in a data set of more than 10,000 global samples to discern a more precise ancestral source of European Romani populations. The phylogeographical patterns and diversity estimates indicate an early origin of this haplogroup in the Indian subcontinent and its further expansion to other regions. Tellingly, the short tandem repeat (STR) based network of H1a1a-M82 lineages displayed the closest connection of Romani haplotypes with the traditional scheduled caste and scheduled tribe population groups of northwestern India.
The domestic pig currently indigenous to the Tibetan highlands is supposed to have been introduced during a continuous period of colonization by the ancestors of modern Tibetans. However, there is no direct genetic evidence of either the local origin or exotic migration of the Tibetan pig.
Methods and Findings
We analyzed mtDNA hypervariable segment I (HVI) variation of 218 individuals from seven Tibetan pig populations and 1,737 reported mtDNA sequences from domestic pigs and wild boars across Asia. The Bayesian consensus tree revealed a main haplogroup M and twelve minor haplogroups, which suggested a large number of small scale in situ domestication episodes. In particular, haplogroups D1 and D6 represented two highly divergent lineages in the Tibetan highlands and Island Southeastern Asia, respectively. Network analysis of haplogroup M further revealed one main subhaplogroup M1 and two minor subhaplogroups M2 and M3. Intriguingly, M2 was mainly distributed in Southeastern Asia, suggesting for a local origin. Similar with haplogroup D6, M3 was mainly restricted in Island Southeastern Asia. This pattern suggested that Island Southeastern Asia, but not Southeastern Asia, might be the center of domestication of the so-called Pacific clade (M3 and D6 here) described in previous studies. Diversity gradient analysis of major subhaplogroup M1 suggested three local origins in Southeastern Asia, the middle and downstream regions of the Yangtze River, and the Tibetan highlands, respectively.
We identified two new origin centers for domestic pigs in the Tibetan highlands and in the Island Southeastern Asian region.
An early dispersal of biologically and behaviorally modern humans from their African origins to Australia, by at least 45 thousand years via southern Asia has been suggested by studies based on morphology, archaeology and genetics. However, mtDNA lineages sampled so far from south Asia, eastern Asia and Australasia show non-overlapping distributions of haplogroups within pan Eurasian M and N macrohaplogroups. Likewise, support from the archaeology is still ambiguous.
In our completely sequenced 966-mitochondrial genomes from 26 relic tribes of India, we have identified seven genomes, which share two synonymous polymorphisms with the M42 haplogroup, which is specific to Australian Aborigines.
Our results showing a shared mtDNA lineage between Indians and Australian Aborigines provides direct genetic evidence of an early colonization of Australia through south Asia, following the "southern route".
More than a half of the northern Asian pool of human mitochondrial DNA (mtDNA) is fragmented into a number of subclades of haplogroups C and D, two of the most frequent haplogroups throughout northern, eastern, central Asia and America. While there has been considerable recent progress in studying mitochondrial variation in eastern Asia and America at the complete genome resolution, little comparable data is available for regions such as southern Siberia – the area where most of northern Asian haplogroups, including C and D, likely diversified. This gap in our knowledge causes a serious barrier for progress in understanding the demographic pre-history of northern Eurasia in general. Here we describe the phylogeography of haplogroups C and D in the populations of northern and eastern Asia. We have analyzed 770 samples from haplogroups C and D (174 and 596, respectively) at high resolution, including 182 novel complete mtDNA sequences representing haplogroups C and D (83 and 99, respectively). The present-day variation of haplogroups C and D suggests that these mtDNA clades expanded before the Last Glacial Maximum (LGM), with their oldest lineages being present in the eastern Asia. Unlike in eastern Asia, most of the northern Asian variants of haplogroups C and D began the expansion after the LGM, thus pointing to post-glacial re-colonization of northern Asia. Our results show that both haplogroups were involved in migrations, from eastern Asia and southern Siberia to eastern and northeastern Europe, likely during the middle Holocene.
Only a limited number of complete mitochondrial genome sequences belonging to Native American haplogroups were available until recently, which left America as the continent with the least amount of information about sequence variation of entire mitochondrial DNAs. In this study, a comprehensive overview of all available complete mitochondrial DNA (mtDNA) genomes of the four pan-American haplogroups A2, B2, C1, and D1 is provided by revising the information scattered throughout GenBank and the literature, and adding 14 novel mtDNA sequences. The phylogenies of haplogroups A2, B2, C1, and D1 reveal a large number of sub-haplogroups but suggest that the ancestral Beringian population(s) contributed only six (successful) founder haplotypes to these haplogroups. The derived clades are overall starlike with coalescence times ranging from 18,000 to 21,000 years (with one exception) using the conventional calibration. The average of about 19,000 years somewhat contrasts with the corresponding lower age of about 13,500 years that was recently proposed by employing a different calibration and estimation approach. Our estimate indicates a human entry and spread of the pan-American haplogroups into the Americas right after the peak of the Last Glacial Maximum and comfortably agrees with the undisputed ages of the earliest Paleoindians in South America. In addition, the phylogenetic approach also indicates that the pathogenic status proposed for various mtDNA mutations, which actually define branches of Native American haplogroups, was based on insufficient grounds.
Sakha – an area connecting South and Northeast Siberia – is significant for understanding the history of peopling of Northeast Eurasia and the Americas. Previous studies have shown a genetic contiguity between Siberia and East Asia and the key role of South Siberia in the colonization of Siberia.
We report the results of a high-resolution phylogenetic analysis of 701 mtDNAs and 318 Y chromosomes from five native populations of Sakha (Yakuts, Evenks, Evens, Yukaghirs and Dolgans) and of the analysis of more than 500,000 autosomal SNPs of 758 individuals from 55 populations, including 40 previously unpublished samples from Siberia. Phylogenetically terminal clades of East Asian mtDNA haplogroups C and D and Y-chromosome haplogroups N1c, N1b and C3, constituting the core of the gene pool of the native populations from Sakha, connect Sakha and South Siberia. Analysis of autosomal SNP data confirms the genetic continuity between Sakha and South Siberia. Maternal lineages D5a2a2, C4a1c, C4a2, C5b1b and the Yakut-specific STR sub-clade of Y-chromosome haplogroup N1c can be linked to a migration of Yakut ancestors, while the paternal lineage C3c was most likely carried to Sakha by the expansion of the Tungusic people. MtDNA haplogroups Z1a1b and Z1a3, present in Yukaghirs, Evens and Dolgans, show traces of different and probably more ancient migration(s). Analysis of both haploid loci and autosomal SNP data revealed only minor genetic components shared between Sakha and the extreme Northeast Siberia. Although the major part of West Eurasian maternal and paternal lineages in Sakha could originate from recent admixture with East Europeans, mtDNA haplogroups H8, H20a and HV1a1a, as well as Y-chromosome haplogroup J, more probably reflect an ancient gene flow from West Eurasia through Central Asia and South Siberia.
Our high-resolution phylogenetic dissection of mtDNA and Y-chromosome haplogroups as well as analysis of autosomal SNP data suggests that Sakha was colonized by repeated expansions from South Siberia with minor gene flow from the Lower Amur/Southern Okhotsk region and/or Kamchatka. The minor West Eurasian component in Sakha attests to both recent and ongoing admixture with East Europeans and an ancient gene flow from West Eurasia.
mtDNA; Y chromosome; Autosomal SNPs; Sakha
After several years of research, there is now a consensus that America was populated from Asia through Beringia, probably at the end of the Pleistocene. But many details such as the timing, route(s), and origin of the first settlers remain uncertain. In the last decade genetic evidence has taken on a major role in elucidating the peopling of the Americas. To study the early peopling of South America, we sequenced the control region of mitochondrial DNA from 300 individuals belonging to indigenous populations of Chile and Argentina, and also obtained seven complete mitochondrial DNA sequences. We identified two novel mtDNA monophyletic clades, preliminarily designated B2l and C1b13, which together with the recently described D1g sub-haplogroup have locally high frequencies and are basically restricted to populations from the extreme south of South America. The estimated ages of D1g and B2l, about ∼15,000 years BP, together with their similar population dynamics and the high haplotype diversity shown by the networks, suggests that they probably appeared soon after the arrival of the first settlers and agrees with the dating of the earliest archaeological sites in South America (Monte Verde, Chile, 14,500 BP). One further sub-haplogroup, D4h3a5, appears to be restricted to Fuegian-Patagonian populations and reinforces our hypothesis of the continuity of the current Patagonian populations with the initial founders. Our results indicate that the extant native populations inhabiting South Chile and Argentina are a group which had a common origin, and suggest a population break between the extreme south of South America and the more northern part of the continent. Thus the early colonization process was not just an expansion from north to south, but also included movements across the Andes.
R-lineage mitochondrial DNA represents over 90% of the European population and is significantly present all around the planet (North Africa, Asia, Oceania, and America). This lineage played a major role in migration “out of Africa” and colonization in Europe. In order to determine an accurate dating of the R lineage and its sublineages, we analyzed 1173 individuals and complete mtDNA sequences from Mitomap. This analysis revealed a new coalescence age for R at 54.500 years, as well as several limitations of standard dating methods, likely to lead to false interpretations. These findings highlight the association of a striking under-accumulation of synonymous mutations, an over-accumulation of non-synonymous mutations, and the phenotypic effect on haplogroup J. Consequently, haplogroup J is apparently not a Neolithic group but an older haplogroup (Paleolithic) that was subjected to an underestimated selective force. These findings also indicated an under-accumulation of synonymous and non-synonymous mutations localized on coding and non-coding (HVS1) sequences for haplogroup R0, which contains the major haplogroups H and V. These new dates are likely to impact the present colonization model for Europe and confirm the late glacial resettlement scenario.
A Neolithic domestication of taurine cattle in the Fertile Crescent from local aurochsen (Bos primigenius) is generally accepted, but a genetic contribution from European aurochsen has been proposed. Here we performed a survey of a large number of taurine cattle mitochondrial DNA (mtDNA) control regions from numerous European breeds confirming the overall clustering within haplogroups (T1, T2 and T3) of Near Eastern ancestry, but also identifying eight mtDNAs (1.3%) that did not fit in haplogroup T. Sequencing of the entire mitochondrial genome showed that four mtDNAs formed a novel branch (haplogroup R) which, after the deep bifurcation that gave rise to the taurine and zebuine lineages, constitutes the earliest known split in the mtDNA phylogeny of B. primigenius. The remaining four mtDNAs were members of the recently discovered haplogroup Q. Phylogeographic data indicate that R mtDNAs were derived from female European aurochsen, possibly in the Italian Peninsula, and sporadically included in domestic herds. In contrast, the available data suggest that Q mtDNAs and T subclades were involved in the same Neolithic event of domestication in the Near East. Thus, the existence of novel (and rare) taurine haplogroups highlights a multifaceted genetic legacy from distinct B. primigenius populations. Taking into account that the maternally transmitted mtDNA tends to underestimate the extent of gene flow from European aurochsen, the detection of the R mtDNAs in autochthonous breeds, some of which are endangered, identifies an unexpected reservoir of genetic variation that should be carefully preserved.
Genetic affinities between aboriginal Taiwanese and populations from Oceania and Southeast Asia have previously been explored through analyses of mitochondrial DNA (mtDNA), Y chromosomal DNA, and human leukocyte antigen loci. Recent genetic studies have supported the “slow boat” and “entangled bank” models according to which the Polynesian migration can be seen as an expansion from Melanesia without any major direct genetic thread leading back to its initiation from Taiwan. We assessed mtDNA variation in 640 individuals from nine tribes of the central mountain ranges and east coast regions of Taiwan. In contrast to the Han populations, the tribes showed a low frequency of haplogroups D4 and G, and an absence of haplogroups A, C, Z, M9, and M10. Also, more than 85% of the maternal lineages were nested within haplogroups B4, B5a, F1a, F3b, E, and M7. Although indicating a common origin of the populations of insular Southeast Asia and Oceania, most mtDNA lineages in Taiwanese aboriginal populations are grouped separately from those found in China and the Taiwan general (Han) population, suggesting a prevalence in the Taiwanese aboriginal gene pool of its initial late Pleistocene settlers. Interestingly, from complete mtDNA sequencing information, most B4a lineages were associated with three coding region substitutions, defining a new subclade, B4a1a, that endorses the origin of Polynesian migration from Taiwan. Coalescence times of B4a1a were 13.2 ± 3.8 thousand years (or 9.3 ± 2.5 thousand years in Papuans and Polynesians). Considering the lack of a common specific Y chromosomal element shared by the Taiwanese aboriginals and Polynesians, the mtDNA evidence provided here is also consistent with the suggestion that the proto-Oceanic societies would have been mainly matrilocal.
An extensive phylogenetic analysis of mtDNA from nine Taiwanese tribes reveals an unambiguous genetic link between aboriginal Taiwanese and Polynesian populations, to the exclusion of mainland Asians.
It has been proposed that the distribution patterns and coalescence ages found in Europeans for mitochondrial DNA (mtDNA) haplogroups V, H1 and H3 are the result of a post-glacial expansion from a Franco-Cantabrian refuge that recolonized central and northern areas. In contrast, in this refined mtDNA study of the Cantabrian Cornice that contributes 413 partial and 9 complete new mtDNA sequences, including a large Basque sample and a sample of Asturians, no experimental evidence was found to support the human refuge-expansion theory. In fact, all measures of gene diversity point to the Cantabrian Cornice in general and the Basques in particular, as less polymorphic for V, H1 and H3 than other southern regions in Iberia or in Central Europe. Genetic distances show the Cantabrian Cornice is a very heterogeneous region with significant local differences. The analysis of several minor subhaplogroups, based on complete sequences, also suggests different focal expansions over a local and peninsular range that did not affect continental Europe. Furthermore, all detected clinal trends show stronger longitudinal than latitudinal profiles. In Northern Iberia, it seems that the highest diversity values for some haplogroups with Mesolithic coalescence ages are centred on the Mediterranean side, including Catalonia and South-eastern France.
mtDNA haplogroups; humans; Franco-Cantabrian refuge theory
The frequencies of four mitochondrial Native American DNA haplogroups were determined in 1526 unrelated individuals from 11 Departments of Colombia and compared to the frequencies previously obtained for Amerindian and Afro-Colombian populations. Amerindian mtDNA haplogroups ranged from 74% to 97%. The lowest frequencies were found in Departments on the Caribbean coast and in the Pacific region, where the frequency of Afro-Colombians is higher, while the highest mtDNA Amerindian haplogroup frequencies were found in Departments that historically have a strong Amerindian heritage. Interestingly, all four mtDNA haplogroups were found in all Departments, in contrast to the complete absence of haplogroup D and high frequencies of haplogroup A in Amerindian populations in the Caribbean region of Colombia. Our results indicate that all four Native American mtDNA haplogroups were widely distributed in Colombia at the time of the Spanish conquest.
admixture; Colombia; haplogroup; Mestizo; mitochondrial DNA
The Koreans are generally considered a northeast Asian group because of their geographical location. However, recent findings from Y chromosome studies showed that the Korean population contains lineages from both southern and northern parts of East Asia. To understand the genetic history and relationships of Korea more fully, additional data and analyses are necessary.
Methodology and Results
We analyzed mitochondrial DNA (mtDNA) sequence variation in the hypervariable segments I and II (HVS-I and HVS-II) and haplogroup-specific mutations in coding regions in 445 individuals from seven east Asian populations (Korean, Korean-Chinese, Mongolian, Manchurian, Han (Beijing), Vietnamese and Thais). In addition, published mtDNA haplogroup data (N = 3307), mtDNA HVS-I sequences (N = 2313), Y chromosome haplogroup data (N = 1697) and Y chromosome STR data (N = 2713) were analyzed to elucidate the genetic structure of East Asian populations. All the mtDNA profiles studied here were classified into subsets of haplogroups common in East Asia, with just two exceptions. In general, the Korean mtDNA profiles revealed similarities to other northeastern Asian populations through analysis of individual haplogroup distributions, genetic distances between populations or an analysis of molecular variance, although a minor southern contribution was also suggested. Reanalysis of Y-chromosomal data confirmed both the overall similarity to other northeastern populations, and also a larger paternal contribution from southeastern populations.
The present work provides evidence that peopling of Korea can be seen as a complex process, interpreted as an early northern Asian settlement with at least one subsequent male-biased southern-to-northern migration, possibly associated with the spread of rice agriculture.
The archaeology of North Africa remains enigmatic, with questions of population continuity versus discontinuity taking centre-stage. Debates have focused on population transitions between the bearers of the Middle Palaeolithic Aterian industry and the later Upper Palaeolithic populations of the Maghreb, as well as between the late Pleistocene and Holocene.
Improved resolution of the mitochondrial DNA (mtDNA) haplogroup U6 phylogeny, by the screening of 39 new complete sequences, has enabled us to infer a signal of moderate population expansion using Bayesian coalescent methods. To ascertain the time for this expansion, we applied both a mutation rate accounting for purifying selection and one with an internal calibration based on four approximate archaeological dates: the settlement of the Canary Islands, the settlement of Sardinia and its internal population re-expansion, and the split between haplogroups U5 and U6 around the time of the first modern human settlement of the Near East.
A Bayesian skyline plot placed the main expansion in the time frame of the Late Pleistocene, around 20 ka, and spatial smoothing techniques suggested that the most probable geographic region for this demographic event was to the west of North Africa. A comparison with U6's European sister clade, U5, revealed a stronger population expansion at around this time in Europe. Also in contrast with U5, a weak signal of a recent population expansion in the last 5,000 years was observed in North Africa, pointing to a moderate impact of the late Neolithic on the local population size of the southern Mediterranean coast.
Genetic studies of the Arabian Peninsula are scarce even though the region was the center of ancient trade routes and empires and may have been the southern corridor for the earliest human migration from Africa to Asia. A total of 120 mtDNA Saudi Arab lineages were analyzed for HVSI/II sequences and for haplogroup confirmatory coding diagnostic positions. A phylogeny of the most abundant haplogroup (preHV)1 (R0a) was constructed based on 13 whole mtDNA genomes.
The Saudi Arabian group showed greatest similarity to other Arabian Peninsula populations (Bedouin from the Negev desert and Yemeni) and to Levantine populations. Nearly all the main western Asia haplogroups were detected in the Saudi sample, including the rare U9 clade. Saudi Arabs had only a minority sub-Saharan Africa component (7%), similar to the specific North-African contribution (5%). In addition, a small Indian influence (3%) was also detected.
The majority of the Saudi-Arab mitochondrial DNA lineages (85%) have a western Asia provenance. Although the still large confidence intervals, the coalescence and phylogeography of (preHV)1 haplogroup (accounting for 18 % of Saudi Arabian lineages) matches a Neolithic expansion in Saudi Arabia.
A comprehensive review of uniparental systems in South Amerindians was undertaken. Variability in the Y-chromosome haplogroups were assessed in 68 populations and 1,814 individuals whereas that of Y-STR markers was assessed in 29 populations and 590 subjects. Variability in the mitochondrial DNA (mtDNA) haplogroup was examined in 108 populations and 6,697 persons, and sequencing studies used either the complete mtDNA genome or the highly variable segments 1 and 2. The diversity of the markers made it difficult to establish a general picture of Y-chromosome variability in the populations studied. However, haplogroup Q1a3a* was almost always the most prevalent whereas Q1a3* occurred equally in all regions, which suggested its prevalence among the early colonizers. The STR allele frequencies were used to derive a possible ancient Native American Q-clade chromosome haplotype and five of six STR loci showed significant geographic variation. Geographic and linguistic factors moderately influenced the mtDNA distributions (6% and 7%, respectively) and mtDNA haplogroups A and D correlated positively and negatively, respectively, with latitude. The data analyzed here provide rich material for understanding the biological history of South Amerindians and can serve as a basis for comparative studies involving other types of data, such as cultural data.
genetics; language and geography; mitochondrial DNA; Native Americans; South Amerindians; Y-chromosome
The presence of Africans in Britain has been recorded since Roman times, but has left no apparent genetic trace among modern inhabitants. Y chromosomes belonging to the deepest-rooting clade of the Y phylogeny, haplogroup A, are regarded as African-specific, and no examples have been reported from Britain or elsewhere in western Europe. We describe the presence of a haplogroup A1 chromosome in an indigenous British male; comparison with African examples suggests a western African origin. Seven out of eighteen men carrying the same rare east-Yorkshire surname as the original male also carry haplogroup A1 chromosomes, and documentary research resolves them into two genealogies with most-recent-common-ancestors living in Yorkshire in the late eighteenth century. Analysis using 77 Y-STRs (short tandem repeats) is consistent with coalescence a few generations earlier. Our findings represent the first genetic evidence of Africans among ‘indigenous’ British, and emphasise the complexity of human migration history, as well as the pitfalls of assigning geographical origin from Y-chromosomal haplotypes.
Y chromosome; haplogroup; African; surnames; genealogy; Y-STRs
A Southwest Asian origin and dispersal to North Africa in the Early Upper Palaeolithic era has been inferred in previous studies for mtDNA haplogroups M1 and U6. Both haplogroups have been proposed to show similar geographic patterns and shared demographic histories.
We report here 24 M1 and 33 U6 new complete mtDNA sequences that allow us to refine the existing phylogeny of these haplogroups. The resulting phylogenetic information was used to genotype a further 131 M1 and 91 U6 samples to determine the geographic spread of their sub-clades. No southwest Asian specific clades for M1 or U6 were discovered. U6 and M1 frequencies in North Africa, the Middle East and Europe do not follow similar patterns, and their sub-clade divisions do not appear to be compatible with their shared history reaching back to the Early Upper Palaeolithic. The Bayesian Skyline Plots testify to non-overlapping phases of expansion, and the haplogroups’ phylogenies suggest that there are U6 sub-clades that expanded earlier than those in M1. Some M1 and U6 sub-clades could be linked with certain events. For example, U6a1 and M1b, with their coalescent ages of ~20,000–22,000 years ago and earliest inferred expansion in northwest Africa, could coincide with the flourishing of the Iberomaurusian industry, whilst U6b and M1b1 appeared at the time of the Capsian culture.
Our high-resolution phylogenetic dissection of both haplogroups and coalescent time assessments suggest that the extant main branching pattern of both haplogroups arose and diversified in the mid-later Upper Palaeolithic, with some sub-clades concomitantly with the expansion of the Iberomaurusian industry. Carriers of these maternal lineages have been later absorbed into and diversified further during the spread of Afro-Asiatic languages in North and East Africa.
mtDNA haplogroups M1 and U6; Afro-Asiatic languages; North Africa
The current human mitochondrial (mtDNA) phylogeny does not equally represent all human populations but is biased in favour of representatives originally from north and central Europe. This especially affects the phylogeny of some uncommon West Eurasian haplogroups, including I and W, whose southern European and Near Eastern components are very poorly represented, suggesting that extensive hidden phylogenetic substructure remains to be uncovered. This study expanded and re-analysed the available datasets of I and W complete mtDNA genomes, reaching a comprehensive 419 mitogenomes, and searched for precise correlations between the ages and geographical distributions of their numerous newly identified subclades with events of human dispersal which contributed to the genetic formation of modern Europeans. Our results showed that haplogroups I (within N1a1b) and W originated in the Near East during the Last Glacial Maximum or pre-warming period (the period of gradual warming between the end of the LGM, ∼19 ky ago, and the beginning of the first main warming phase, ∼15 ky ago) and, like the much more common haplogroups J and T, may have been involved in Late Glacial expansions starting from the Near East. Thus our data contribute to a better definition of the Late and postglacial re-peopling of Europe, providing further evidence for the scenario that major population expansions started after the Last Glacial Maximum but before Neolithic times, but also evidencing traces of diffusion events in several I and W subclades dating to the European Neolithic and restricted to Europe.