The Austro-Asiatic linguistic family, which is considered to be the oldest of all the families in India, has a substantial presence in Southeast Asia. However, the possibility of any genetic link among the linguistic sub-families of the Indian Austro-Asiatics on the one hand and between the Indian and the Southeast Asian Austro-Asiatics on the other has not been explored till now. Therefore, to trace the origin and historic expansion of Austro-Asiatic groups of India, we analysed Y-chromosome SNP and STR data of the 1222 individuals from 25 Indian populations, covering all the three branches of Austro-Asiatic tribes, viz. Mundari, Khasi-Khmuic and Mon-Khmer, along with the previously published data on 214 relevant populations from Asia and Oceania.
Our results suggest a strong paternal genetic link, not only among the subgroups of Indian Austro-Asiatic populations but also with those of Southeast Asia. However, maternal link based on mtDNA is not evident. The results also indicate that the haplogroup O-M95 had originated in the Indian Austro-Asiatic populations ~65,000 yrs BP (95% C.I. 25,442 – 132,230) and their ancestors carried it further to Southeast Asia via the Northeast Indian corridor. Subsequently, in the process of expansion, the Mon-Khmer populations from Southeast Asia seem to have migrated and colonized Andaman and Nicobar Islands at a much later point of time.
Our findings are consistent with the linguistic evidence, which suggests that the linguistic ancestors of the Austro-Asiatic populations have originated in India and then migrated to Southeast Asia.
Human settlement and migrations along sides of Bay-of-Bengal have played a vital role in shaping the genetic landscape of Bangladesh, Eastern India and Southeast Asia. Bangladesh and Northeast India form the vital land bridge between the South and Southeast Asia. To reconstruct the population history of this region and to see whether this diverse region geographically acted as a corridor or barrier for human interaction between South Asia and Southeast Asia, we, for the first time analyzed high resolution uniparental (mtDNA and Y chromosome) and biparental autosomal genetic markers among aboriginal Bangladesh tribes currently speaking Tibeto-Burman language. All the three studied populations; Chakma, Marma and Tripura from Bangladesh showed strikingly high homogeneity among themselves and strong affinities to Northeast Indian Tibeto-Burman groups. However, they show substantially higher molecular diversity than Northeast Indian populations. Unlike Austroasiatic (Munda) speakers of India, we observed equal role of both males and females in shaping the Tibeto-Burman expansion in Southern Asia. Moreover, it is noteworthy that in admixture proportion, TB populations of Bangladesh carry substantially higher mainland Indian ancestry component than Northeast Indian Tibeto-Burmans. Largely similar expansion ages of two major paternal haplogroups (O2a and O3a3c), suggested that they arose before the differentiation of any language group and approximately at the same time. Contrary to the scenario proposed for colonization of Northeast India as male founder effect that occurred within the past 4,000 years, we suggest a significantly deep colonization of this region. Overall, our extensive analysis revealed that the population history of South Asian Tibeto-Burman speakers is more complex than it was suggested before.
Human genetic diversity observed in Indian subcontinent is second only to that of Africa. This implies an early settlement and demographic growth soon after the first 'Out-of-Africa' dispersal of anatomically modern humans in Late Pleistocene. In contrast to this perspective, linguistic diversity in India has been thought to derive from more recent population movements and episodes of contact. With the exception of Dravidian, which origin and relatedness to other language phyla is obscure, all the language families in India can be linked to language families spoken in different regions of Eurasia. Mitochondrial DNA and Y chromosome evidence has supported largely local evolution of the genetic lineages of the majority of Dravidian and Indo-European speaking populations, but there is no consensus yet on the question of whether the Munda (Austro-Asiatic) speaking populations originated in India or derive from a relatively recent migration from further East.
Here, we report the analysis of 35 novel complete mtDNA sequences from India which refine the structure of Indian-specific varieties of haplogroup R. Detailed analysis of haplogroup R7, coupled with a survey of ~12,000 mtDNAs from caste and tribal groups over the entire Indian subcontinent, reveals that one of its more recently derived branches (R7a1), is particularly frequent among Munda-speaking tribal groups. This branch is nested within diverse R7 lineages found among Dravidian and Indo-European speakers of India. We have inferred from this that a subset of Munda-speaking groups have acquired R7 relatively recently. Furthermore, we find that the distribution of R7a1 within the Munda-speakers is largely restricted to one of the sub-branches (Kherwari) of northern Munda languages. This evidence does not support the hypothesis that the Austro-Asiatic speakers are the primary source of the R7 variation. Statistical analyses suggest a significant correlation between genetic variation and geography, rather than between genes and languages.
Our high-resolution phylogeographic study, involving diverse linguistic groups in India, suggests that the high frequency of mtDNA haplogroup R7 among Munda speaking populations of India can be explained best by gene flow from linguistically different populations of Indian subcontinent. The conclusion is based on the observation that among Indo-Europeans, and particularly in Dravidians, the haplogroup is, despite its lower frequency, phylogenetically more divergent, while among the Munda speakers only one sub-clade of R7, i.e. R7a1, can be observed. It is noteworthy that though R7 is autochthonous to India, and arises from the root of hg R, its distribution and phylogeography in India is not uniform. This suggests the more ancient establishment of an autochthonous matrilineal genetic structure, and that isolation in the Pleistocene, lineage loss through drift, and endogamy of prehistoric and historic groups have greatly inhibited genetic homogenization and geographical uniformity.
The human population history in Southeast Asia was shaped by numerous migrations and population expansions. Their reconstruction based on archaeological, linguistic or human genetic data is often hampered by the limited number of informative polymorphisms in classical human genetic markers, such as the hypervariable regions of the mitochondrial DNA. Here, we analyse housekeeping gene sequences of the human stomach bacterium Helicobacter pylori from various countries in Southeast Asia and we provide evidence that H. pylori accompanied at least three ancient human migrations into this area: i) a migration from India introducing hpEurope bacteria into Thailand, Cambodia and Malaysia; ii) a migration of the ancestors of Austro-Asiatic speaking people into Vietnam and Cambodia carrying hspEAsia bacteria; and iii) a migration of the ancestors of the Thai people from Southern China into Thailand carrying H. pylori of population hpAsia2. Moreover, the H. pylori sequences reflect iv) the migrations of Chinese to Thailand and Malaysia within the last 200 years spreading hspEasia strains, and v) migrations of Indians to Malaysia within the last 200 years distributing both hpAsia2 and hpEurope bacteria. The distribution of the bacterial populations seems to strongly influence the incidence of gastric cancer as countries with predominantly hspEAsia isolates exhibit a high incidence of gastric cancer while the incidence is low in countries with a high proportion of hpAsia2 or hpEurope strains. In the future, the host range expansion of hpEurope strains among Asian populations, combined with human motility, may have a significant impact on gastric cancer incidence in Asia.
The Mlabri are a group of nomadic hunter-gatherers inhabiting the rural highlands of Thailand. Little is known about the origins of the Mlabri and linguistic evidence suggests that the present-day Mlabri language most likely arose from Tin, a Khmuic language in the Austro-Asiatic language family. This study aims to examine whether the genetic affinity of the Mlabri is consistent with this linguistic relationship, and to further explore the origins of this enigmatic population.
We conducted a genome-wide analysis of genetic variation using more than fifty thousand single nucleotide polymorphisms (SNPs) typed in thirteen population samples from Thailand, including the Mlabri, Htin and neighboring populations of the Northern Highlands, speaking Austro-Asiatic, Tai-Kadai and Hmong-Mien languages. The Mlabri population showed higher LD and lower haplotype diversity when compared with its neighboring populations. Both model-free and Bayesian model-based clustering analyses indicated a close genetic relationship between the Mlabri and the Htin, a group speaking a Tin language.
Our results strongly suggested that the Mlabri share more recent common ancestry with the Htin. We thus provided, to our knowledge, the first genetic evidence that supports the linguistic affinity of Mlabri, and this association between linguistic and genetic classifications could reflect the same past population processes.
The faunal and floral relationship of northward-drifting India with its neighboring continents is of general biogeographic interest as an important driver of regional biodiversity. However, direct biogeographic connectivity of India and Southeast Asia during the Cenozoic remains largely unexplored. We investigate timing, direction and mechanisms of faunal exchange between India and Southeast Asia, based on a molecular phylogeny, molecular clock-derived time estimates and biogeographic reconstructions of the Asian freshwater crab family Gecarcinucidae.
Although the Gecarcinucidae are not an element of an ancient Gondwana fauna, their subfamily Gecarcinucinae, and probably also the Liotelphusinae, evolved on the Indian Subcontinent and subsequently dispersed to Southeast Asia. Estimated by a model testing approach, this dispersal event took place during the Middle Eocene, and thus before the final collision of India and the Tibet-part of Eurasia.
We postulate that the India and Southeast Asia were close enough for exchange of freshwater organisms during the Middle Eocene, before the final Indian-Eurasian collision. Our data support geological models that assume the Indian plate having tracked along Southeast Asia during its move northwards.
Tibeto-Burman populations of India provide an insight into the peopling of India and aid in understanding their genetic relationship with populations of East, South and Southeast Asia. The study investigates the genetic status of one such Tibeto-Burman group, Adi of Arunachal Pradesh based on 15 autosomal microsatellite markers. Further the study examines, based on 9 common microsatellite loci, the genetic relationship of Adi with 16 other Tibeto-Burman speakers of India and 28 neighboring populations of East and Southeast Asia. Overall, the results support the recent formation of the Adi sub-tribes from a putative ancestral group and reveal that geographic contiguity is a major influencing factor of the genetic affinity among the Tibeto-Burman populations of India.
Recent advances in the understanding of the maternal and paternal heritage of south and southwest Asian populations have highlighted their role in the colonization of Eurasia by anatomically modern humans. Further understanding requires a deeper insight into the topology of the branches of the Indian mtDNA phylogenetic tree, which should be contextualized within the phylogeography of the neighboring regional mtDNA variation. Accordingly, we have analyzed mtDNA control and coding region variation in 796 Indian (including both tribal and caste populations from different parts of India) and 436 Iranian mtDNAs. The results were integrated and analyzed together with published data from South, Southeast Asia and West Eurasia.
Four new Indian-specific haplogroup M sub-clades were defined. These, in combination with two previously described haplogroups, encompass approximately one third of the haplogroup M mtDNAs in India. Their phylogeography and spread among different linguistic phyla and social strata was investigated in detail. Furthermore, the analysis of the Iranian mtDNA pool revealed patterns of limited reciprocal gene flow between Iran and the Indian sub-continent and allowed the identification of different assemblies of shared mtDNA sub-clades.
Since the initial peopling of South and West Asia by anatomically modern humans, when this region may well have provided the initial settlers who colonized much of the rest of Eurasia, the gene flow in and out of India of the maternally transmitted mtDNA has been surprisingly limited. Specifically, our analysis of the mtDNA haplogroups, which are shared between Indian and Iranian populations and exhibit coalescence ages corresponding to around the early Upper Paleolithic, indicates that they are present in India largely as Indian-specific sub-lineages. In contrast, other ancient Indian-specific variants of M and R are very rare outside the sub-continent.
Sakha – an area connecting South and Northeast Siberia – is significant for understanding the history of peopling of Northeast Eurasia and the Americas. Previous studies have shown a genetic contiguity between Siberia and East Asia and the key role of South Siberia in the colonization of Siberia.
We report the results of a high-resolution phylogenetic analysis of 701 mtDNAs and 318 Y chromosomes from five native populations of Sakha (Yakuts, Evenks, Evens, Yukaghirs and Dolgans) and of the analysis of more than 500,000 autosomal SNPs of 758 individuals from 55 populations, including 40 previously unpublished samples from Siberia. Phylogenetically terminal clades of East Asian mtDNA haplogroups C and D and Y-chromosome haplogroups N1c, N1b and C3, constituting the core of the gene pool of the native populations from Sakha, connect Sakha and South Siberia. Analysis of autosomal SNP data confirms the genetic continuity between Sakha and South Siberia. Maternal lineages D5a2a2, C4a1c, C4a2, C5b1b and the Yakut-specific STR sub-clade of Y-chromosome haplogroup N1c can be linked to a migration of Yakut ancestors, while the paternal lineage C3c was most likely carried to Sakha by the expansion of the Tungusic people. MtDNA haplogroups Z1a1b and Z1a3, present in Yukaghirs, Evens and Dolgans, show traces of different and probably more ancient migration(s). Analysis of both haploid loci and autosomal SNP data revealed only minor genetic components shared between Sakha and the extreme Northeast Siberia. Although the major part of West Eurasian maternal and paternal lineages in Sakha could originate from recent admixture with East Europeans, mtDNA haplogroups H8, H20a and HV1a1a, as well as Y-chromosome haplogroup J, more probably reflect an ancient gene flow from West Eurasia through Central Asia and South Siberia.
Our high-resolution phylogenetic dissection of mtDNA and Y-chromosome haplogroups as well as analysis of autosomal SNP data suggests that Sakha was colonized by repeated expansions from South Siberia with minor gene flow from the Lower Amur/Southern Okhotsk region and/or Kamchatka. The minor West Eurasian component in Sakha attests to both recent and ongoing admixture with East Europeans and an ancient gene flow from West Eurasia.
mtDNA; Y chromosome; Autosomal SNPs; Sakha
The Cham people are the major Austronesian speakers of Mainland Southeast Asia (MSEA) and the reconstruction of the Cham population history can provide insights into their diffusion. In this study, we analyzed non-recombining region of the Y chromosome markers of 177 unrelated males from four populations in MSEA, including 59 Cham, 76 Kinh, 25 Lao, and 17 Thai individuals. Incorporating published data from mitochondrial DNA (mtDNA), our results indicated that, in general, the Chams are an indigenous Southeast Asian population. The origin of the Cham people involves the genetic admixture of the Austronesian immigrants from Island Southeast Asia (ISEA) with the local populations in MSEA. Discordance between the overall patterns of Y chromosome and mtDNA in the Chams is evidenced by the presence of some Y chromosome lineages that prevail in South Asians. Our results suggest that male-mediated dispersals via the spread of religions and business trade might play an important role in shaping the patrilineal gene pool of the Cham people.
Snake envenomation is a serious public health threat in the rural areas of Asian and African countries. To date, the only proven treatment for snake envenomation is antivenom therapy. Cross-neutralization of heterologous venoms by antivenom raised against venoms of closely related species has been reported. The present study examined the cross neutralizing potential of a newly developed polyvalent antivenom, termed Neuro Polyvalent Snake Antivenom (NPAV). NPAV was produced by immunization against 4 Thai elapid venoms.
In vitro neutralization study using mice showed that NPAV was able to neutralize effectively the lethality of venoms of most common Asiatic cobras (Naja spp.), Ophiophagus hannah and kraits (Bungarus spp.) from Southeast Asia, but only moderately to weakly effective against venoms of Naja from India subcontinent and Africa. Studies with several venoms showed that the in vivo neutralization potency of the NPAV was comparable to the in vitro neutralization potency. NPAV could also fully protect against N. sputatrix venom-induced cardio-respiratory depressant and neuromuscular blocking effects in anesthetized rats, demonstrating that the NPAV could neutralize most of the major lethal toxins in the Naja venom.
The newly developed polyvalent antivenom NPAV may find potential application in the treatment of elapid bites in Southeast Asia, especially Malaysia, a neighboring nation of Thailand. Nevertheless, the applicability of NPAV in the treatment of cobra and krait envenomations in Southeast Asian victims needs to be confirmed by clinical trials. The cross-neutralization results may contribute to the design of broad-spectrum polyvalent antivenom.
Snake envenomation is a serious public health threat in the rural areas of Asia and Africa. To date, the only proven treatment for snake envenomation is antivenom therapy. Owing to the difficulties in the diagnosis of the biting species, there is a need to develop polyvalent antivenoms that could cross-neutralize venoms of medically important venomous snakes in the various regions. Recently, Thai Red Cross Society from Thailand has developed a new polyvalent antivenom for treatment of cobra and krait venoms. The polyvalent antivenom, termed “Neuro Polyvalent Snake Antivenom (NPAV),” is raised against venoms of two Thai cobras and two Thai kraits. Our results indicated that the polyvalent antivenom can effectively neutralize venoms from many Southeast Asian cobras, kraits and king cobra but is less effective against Indian cobra venoms. Studies using anesthetized rats showed that NPAV can effectively protect against cobra venom-induced cardio-respiratory depressant and neuromuscular blocking effects, confirming that the antivenom can effectively neutralize the major lethal toxins of common cobra venoms. This new antivenom may find potential application in the treatment of elapid bites in Southeast Asia, especially Malaysia, a neighboring nation of Thailand.
Austronesian is a linguistic family spread in most areas of the Southeast Asia, the Pacific Ocean, and the Indian Ocean. Based on their linguistic similarity, this linguistic family included Malayo-Polynesians and Taiwan aborigines. The linguistic similarity also led to the controversial hypothesis that Taiwan is the homeland of all the Malayo-Polynesians, a hypothesis that has been debated by ethnologists, linguists, archaeologists, and geneticists. It is well accepted that the Eastern Austronesians (Micronesians and Polynesians) derived from the Western Austronesians (Island Southeast Asians and Taiwanese), and that the Daic populations on the mainland are supposed to be the headstream of all the Austronesian populations.
In this report, we studied 20 SNPs and 7 STRs in the non-recombining region of the 1,509 Y chromosomes from 30 China Daic populations, 23 Indonesian and Vietnam Malayo-Polynesian populations, and 11 Taiwan aboriginal populations. These three groups show many resemblances in paternal lineages. Admixture analyses demonstrated that the Daic populations are hardly influenced by Han Chinese genetically, and that they make up the largest proportion of Indonesians. Most of the population samples contain a high frequency of haplogroup O1a-M119, which is nearly absent in other ethnic families. The STR network of haplogroup O1a* illustrated that Indonesian lineages did not derive from Taiwan aborigines as linguistic studies suggest, but from Daic populations.
We show that, in contrast to the Taiwan homeland hypothesis, the Island Southeast Asians do not have a Taiwan origin based on their paternal lineages. Furthermore, we show that both Taiwan aborigines and Indonesians likely derived from the Daic populations based on their paternal lineages. These two populations seem to have evolved independently of each other. Our results indicate that a super-phylum, which includes Taiwan aborigines, Daic, and Malayo-Polynesians, is genetically educible.
An early dispersal of biologically and behaviorally modern humans from their African origins to Australia, by at least 45 thousand years via southern Asia has been suggested by studies based on morphology, archaeology and genetics. However, mtDNA lineages sampled so far from south Asia, eastern Asia and Australasia show non-overlapping distributions of haplogroups within pan Eurasian M and N macrohaplogroups. Likewise, support from the archaeology is still ambiguous.
In our completely sequenced 966-mitochondrial genomes from 26 relic tribes of India, we have identified seven genomes, which share two synonymous polymorphisms with the M42 haplogroup, which is specific to Australian Aborigines.
Our results showing a shared mtDNA lineage between Indians and Australian Aborigines provides direct genetic evidence of an early colonization of Australia through south Asia, following the "southern route".
The population of India harbors one of the world’s most highly diverse gene pools, owing to the influx of successive waves of immigrants over regular periods in time. Several phylogenetic studies involving mitochondrial DNA and Y chromosomal variation have demonstrated Europeans to have been the first settlers in India. Nevertheless, certain controversy exists, due to the support given to the thesis that colonization was by the Austro-Asiatic group, prior to the Europeans. Thus, the aim was to investigate pre-historic colonization of India by anatomically modern humans, using conserved stretches of five amino acid (EPIYA) sequences in the cagA gene of Helicobacter pylori. Simultaneously, the existence of a pathogenic relationship of tyrosine phosphorylation motifs (TPMs), in 32 H. pylori strains isolated from subjects with several forms of gastric diseases, was also explored. High resolution sequence analysis of the above described genes was performed. The nucleotide sequences obtained were translated into amino acids using MEGA (version 4.0) software for EPIYA. An MJ-Network was constructed for obtaining TPM haplotypes by using NETWORK (version 4.5) software. The findings of the study suggest that Indian H. pylori strains share a common ancestry with Europeans. No specific association of haplotypes with the outcome of disease was revealed through additional network analysis of TPMs.
Helicobacter pylori; EPIYA motifs; tyrosine phosphorylation motifs; haplotypes; anatomically modern humans
Heart failure is a leading cause of death of people in South Asia, and cardiomyopathy is a major cause of heart failure. Myosin binding protein C (MYBPC3) is expressed in the heart muscle, where it regulates the cardiac response to adrenergic stimulation and is important for the structural integrity of the sarcomere. Mutations in the MYBPC3 gene are associated with hypertrophic or dilated cardiomyopathies. A 25-base-pair deletion in intron 32 causes skipping of the downstream exon and is associated with familial cardiomyopathy. To date, this deletion is found primarily in India and South Asia, although it is also found at low frequency in Southeast Asia. In order to better characterize the distribution of this variant, we determined its frequency in 447 individuals from 19 populations, including 10 populations from India and neighboring populations from Pakistan and Nepal. The deletion frequency is over 8% in some of our Indian samples, and it is not present in any of the populations we sampled outside of India. The differences in the deletion frequencies among populations in India are consistent with patterns of variation previously reported and with patterns we observed among Indian populations based on high-density SNP chip data. Our results indicate the MYBPC3 deletion is primarily found among Indian populations, and that its distribution is consistent with genome-wide patterns of variation in India.
India is a country with enormous social and cultural diversity due to its positioning on the crossroads of many historic and pre-historic human migrations. The hierarchical caste system in the Hindu society dominates the social structure of the Indian populations. The origin of the caste system in India is a matter of debate with many linguists and anthropologists suggesting that it began with the arrival of Indo-European speakers from Central Asia about 3500 years ago. Previous genetic studies based on Indian populations failed to achieve a consensus in this regard. We analysed the Y-chromosome and mitochondrial DNA of three tribal populations of southern India, compared the results with available data from the Indian subcontinent and tried to reconstruct the evolutionary history of Indian caste and tribal populations.
No significant difference was observed in the mitochondrial DNA between Indian tribal and caste populations, except for the presence of a higher frequency of west Eurasian-specific haplogroups in the higher castes, mostly in the north western part of India. On the other hand, the study of the Indian Y lineages revealed distinct distribution patterns among caste and tribal populations. The paternal lineages of Indian lower castes showed significantly closer affinity to the tribal populations than to the upper castes. The frequencies of deep-rooted Y haplogroups such as M89, M52, and M95 were higher in the lower castes and tribes, compared to the upper castes.
The present study suggests that the vast majority (>98%) of the Indian maternal gene pool, consisting of Indio-European and Dravidian speakers, is genetically more or less uniform. Invasions after the late Pleistocene settlement might have been mostly male-mediated. However, Y-SNP data provides compelling genetic evidence for a tribal origin of the lower caste populations in the subcontinent. Lower caste groups might have originated with the hierarchical divisions that arose within the tribal groups with the spread of Neolithic agriculturalists, much earlier than the arrival of Aryan speakers. The Indo-Europeans established themselves as upper castes among this already developed caste-like class structure within the tribes.
Molecular anthropological studies of the populations in and around East Asia have resulted in the discovery that most of the Y-chromosome lineages of East Asians came from Southeast Asia. However, very few Southeast Asian populations had been investigated, and therefore, little was known about the purported migrations from Southeast Asia into East Asia and their roles in shaping the genetic structure of East Asian populations. Here, we present the Y-chromosome data from 1,652 individuals belonging to 47 Mon-Khmer (MK) and Hmong-Mien (HM) speaking populations that are distributed primarily across Southeast Asia and extend into East Asia. Haplogroup O3a3b-M7, which appears mainly in MK and HM, indicates a strong tie between the two groups. The short tandem repeat network of O3a3b-M7 displayed a hierarchical expansion structure (annual ring shape), with MK haplotypes being located at the original point, and the HM and the Tibeto-Burman haplotypes distributed further away from core of the network. Moreover, the East Asian dominant haplogroup O3a3c1-M117 shows a network structure similar to that of O3a3b-M7. These patterns indicate an early unidirectional diffusion from Southeast Asia into East Asia, which might have resulted from the genetic drift of East Asian ancestors carrying these two haplogroups through many small bottle-necks formed by the complicated landscape between Southeast Asia and East Asia. The ages of O3a3b-M7 and O3a3c1-M117 were estimated to be approximately 19 thousand years, followed by the emergence of the ancestors of HM lineages out of MK and the unidirectional northward migrations into East Asia.
East Asia harbors substantial genetic, physical, cultural and linguistic diversity, but the detailed structures and interrelationships of those aspects remain enigmatic. This question has begun to be addressed by a rapid accumulation of molecular anthropological studies of the populations in and around East Asia, especially by Y chromosome studies. The current Y chromosome evidence suggests multiple early migrations of modern humans from Africa via Southeast Asia to East Asia. After the initial settlements, the northward migrations during the Paleolithic Age shaped the genetic structure in East Asia. Subsequently, recent admixtures between Central Asian immigrants and northern East Asians enlarged the genetic divergence between southern and northern East Asia populations. Cultural practices, such as languages, agriculture, military affairs and social prestige, also have impacts on the genetic patterns in East Asia. Furthermore, application of Y chromosome analyses in the family genealogy studies offers successful showcases of the utility of genetics in studying the ancient history.
East Asian populations; Y-chromosome; Migrations; Genetic structures
Humans reached present-day Island Southeast Asia (ISEA) in one of the first major human migrations out of Africa. Population movements in the millennia following this initial settlement are thought to have greatly influenced the genetic makeup of current inhabitants, yet the extent attributed to different events is not clear. Recent studies suggest that south-to-north gene flow largely influenced present-day patterns of genetic variation in Southeast Asian populations and that late Pleistocene and early Holocene migrations from Southeast Asia are responsible for a substantial proportion of ISEA ancestry. Archaeological and linguistic evidence suggests that the ancestors of present-day inhabitants came mainly from north-to-south migrations from Taiwan and throughout ISEA approximately 4,000 years ago. We report a large-scale genetic analysis of human variation in the Iban population from the Malaysian state of Sarawak in northwestern Borneo, located in the center of ISEA. Genome-wide single-nucleotide polymorphism (SNP) markers analyzed here suggest that the Iban exhibit greatest genetic similarity to Indonesian and mainland Southeast Asian populations. The most common non-recombining Y (NRY) and mitochondrial (mt) DNA haplogroups present in the Iban are associated with populations of Southeast Asia. We conclude that migrations from Southeast Asia made a large contribution to Iban ancestry, although evidence of potential gene flow from Taiwan is also seen in uniparentally inherited marker data.
Linguistic and genetic studies on Roma populations inhabited in Europe have unequivocally traced these populations to the Indian subcontinent. However, the exact parental population group and time of the out-of-India dispersal have remained disputed. In the absence of archaeological records and with only scanty historical documentation of the Roma, comparative linguistic studies were the first to identify their Indian origin. Recently, molecular studies on the basis of disease-causing mutations and haploid DNA markers (i.e. mtDNA and Y-chromosome) supported the linguistic view. The presence of Indian-specific Y-chromosome haplogroup H1a1a-M82 and mtDNA haplogroups M5a1, M18 and M35b among Roma has corroborated that their South Asian origins and later admixture with Near Eastern and European populations. However, previous studies have left unanswered questions about the exact parental population groups in South Asia. Here we present a detailed phylogeographical study of Y-chromosomal haplogroup H1a1a-M82 in a data set of more than 10,000 global samples to discern a more precise ancestral source of European Romani populations. The phylogeographical patterns and diversity estimates indicate an early origin of this haplogroup in the Indian subcontinent and its further expansion to other regions. Tellingly, the short tandem repeat (STR) based network of H1a1a-M82 lineages displayed the closest connection of Romani haplotypes with the traditional scheduled caste and scheduled tribe population groups of northwestern India.
Genetic affinities between aboriginal Taiwanese and populations from Oceania and Southeast Asia have previously been explored through analyses of mitochondrial DNA (mtDNA), Y chromosomal DNA, and human leukocyte antigen loci. Recent genetic studies have supported the “slow boat” and “entangled bank” models according to which the Polynesian migration can be seen as an expansion from Melanesia without any major direct genetic thread leading back to its initiation from Taiwan. We assessed mtDNA variation in 640 individuals from nine tribes of the central mountain ranges and east coast regions of Taiwan. In contrast to the Han populations, the tribes showed a low frequency of haplogroups D4 and G, and an absence of haplogroups A, C, Z, M9, and M10. Also, more than 85% of the maternal lineages were nested within haplogroups B4, B5a, F1a, F3b, E, and M7. Although indicating a common origin of the populations of insular Southeast Asia and Oceania, most mtDNA lineages in Taiwanese aboriginal populations are grouped separately from those found in China and the Taiwan general (Han) population, suggesting a prevalence in the Taiwanese aboriginal gene pool of its initial late Pleistocene settlers. Interestingly, from complete mtDNA sequencing information, most B4a lineages were associated with three coding region substitutions, defining a new subclade, B4a1a, that endorses the origin of Polynesian migration from Taiwan. Coalescence times of B4a1a were 13.2 ± 3.8 thousand years (or 9.3 ± 2.5 thousand years in Papuans and Polynesians). Considering the lack of a common specific Y chromosomal element shared by the Taiwanese aboriginals and Polynesians, the mtDNA evidence provided here is also consistent with the suggestion that the proto-Oceanic societies would have been mainly matrilocal.
An extensive phylogenetic analysis of mtDNA from nine Taiwanese tribes reveals an unambiguous genetic link between aboriginal Taiwanese and Polynesian populations, to the exclusion of mainland Asians.
A new fossil leaf impression of Alphonsea Hk. f. & T. of the family Annonaceae is described from the Late Oligocene sediments of Makum Coalfield, Assam, India. This is the first authentic record of the fossil of Alphonsea from the Tertiary rocks of South Asia. The Late Oligocene was the time of the last significant globally warm climate and the fossil locality was at 10°–15°N palaeolatitude. The known palaeoflora and sedimentological studies indicate a fluvio-marine deltaic environment with a mosaic of mangrove, fluvial, mire and lacustrine depositional environments. During the depositional period the suturing between the Indian and Eurasian plates was not complete to facilitate the plant migration. The suturing was over by the end of the Late Oligocene/beginning of Early Miocene resulting in the migration of the genus to Southeast Asia where it is growing profusely at present. The present study is in congruence with the earlier published palaeofloral and molecular phylogenetic data. The study also suggests that the Indian plate was not only a biotic ferry during its northward voyage from Gondwana to Asia but also a place for the origin of several plant taxa.
The fulvous fruit bat (Rousettus leschenaulti) and the greater short-nosed fruit bat (Cynopterus sphinx) are two abundant and widely co-distributed Old World fruit bats in Southeast and East Asia. The former species forms large colonies in caves while the latter roots in small groups in trees. To test whether these differences in social organization and roosting ecology are associated with contrasting patterns of gene flow, we used mtDNA and nuclear loci to characterize population genetic subdivision and phylogeographic histories in both species sampled from China, Vietnam and India. Our analyses from R. leschenaulti using both types of marker revealed little evidence of genetic structure across the study region. On the other hand, C. sphinx showed significant genetic mtDNA differentiation between the samples from India compared with China and Vietnam, as well as greater structuring of microsatellite genotypes within China. Demographic analyses indicated signatures of past rapid population expansion in both taxa, with more recent demographic growth in C. sphinx. Therefore, the relative genetic homogeneity in R. leschenaulti is unlikely to reflect past events. Instead we suggest that the absence of substructure in R. leschenaulti is a consequence of higher levels of gene flow among colonies, and that greater vagility in this species is an adaptation associated with cave roosting.
Chloroquine-resistant Plasmodium falciparum (CRPF) malaria isolates in Southeast Asia and sub-Saharan Africa share the same Plasmodium falciparum chloroquine resistance transporter (PfCRT) haplotype (CVIET; amino acids 72 to 76). It is believed that CRPF malaria emerged in Southeast Asia and spread to sub-Saharan Africa via the Indian subcontinent. Based on this assumption, we hypothesized that CRPF isolates in India should possess the same drug resistance haplotype (PfCRT haplotype CVIET) as P. falciparum isolates in Southeast Asia and Africa and that the prevalence of CRPF may be higher and more widespread in India than appreciated. To test this postulate, we utilized a standardized real-time PCR assay to assess the prevalence and distribution of PfCRT haplotypes in P. falciparum isolates (n = 406) collected from Western, Central, and Eastern states in India and compared them to isolates from South America and Africa. Based on the proportion of isolates possessing the molecular marker K76T, the prevalence of chloroquine resistance was high in all five regions of India studied (91%), as well as in Uganda (98%) and Suriname (100%). All isolates from Suriname contained the chloroquine-resistant SVMNT haplotype typical of South American isolates, and 98% of isolates from Uganda possessed the chloroquine-resistant CVIET haplotype characteristic of Southeast Asian/African strains. However, of 246 P. falciparum isolates from across India that contained the molecular marker for chloroquine resistance, 81% contained the SVMNT haplotype. In conclusion, the prevalence of CRPF malaria was high in geographically dispersed regions of India, and the primary haplotype observed, SVMNT, did not support a presumed geographic spread from contiguous Southeast Asia.
Arab forces conquered the Indus Delta region in 711 A.D. and, although a Muslim state was established there, their influence was barely felt in the rest of South Asia at that time. By the end of the tenth century, Central Asian Muslims moved into India from the northwest and expanded throughout the subcontinent. Muslim communities are now the largest minority religion in India, comprising more than 138 million people in a predominantly Hindu population of over one billion. It is unclear whether the Muslim expansion in India was a purely cultural phenomenon or had a genetic impact on the local population. To address this question from a male perspective, we typed eight microsatellite loci and 16 binary markers from the Y chromosome in 246 Muslims from Andhra Pradesh, and compared them to published data on 4,204 males from China, Central Asia, other parts of India, Sri Lanka, Pakistan, Iran, the Middle East, Turkey, Egypt and Morocco. We find that the Muslim populations in general are genetically closer to their non-Muslim geographical neighbors than to other Muslims in India, and that there is a highly significant correlation between genetics and geography (but not religion). Our findings indicate that, despite the documented practice of marriage between Muslim men and Hindu women, Islamization in India did not involve large-scale replacement of Hindu Y chromosomes. The Muslim expansion in India was predominantly a cultural change and was not accompanied by significant gene flow, as seen in other places, such as China and Central Asia.
Y-chromosomal polymorphism; India; Muslim; Hindu