To examine the evolutionary relationships of the newly identified
Plasmodium parasites, we constructed phylogenetic trees for a subset of the diagnostic
cytB sequences. This analysis showed that all sequences, except for one
P. ovale-like strain, fell into one large monophyletic clade that also included
P. reichenowi and
P. falciparum (
Supplementary Fig. 2). Parasites related to
P. reichenowi and
P. falciparum have previously been classified into a subgenus, termed
Laverania, to distinguish them from more divergent
Plasmodium species
18. Our results thus indicated that parasites from this subgenus were common and widespread among wild ape populations. However, the topology of the
Laverania clade was highly unusual, characterized by only few discrete clades and multiple sequences with very short branches attached to internal branches. Moreover, repeated PCR analysis of the same fecal samples yielded sequences that clustered variably in different parts of the tree (
Supplementary Fig. 2). These findings indicated simultaneous infection with genetically diverse
Plasmodium parasites and suggested that conventional (bulk) PCR amplification had generated
in vitro recombinants. To examine this possibility, we re-analyzed the same
Plasmodium positive fecal samples by single genome amplification (SGA), a molecular strategy that has been used extensively to characterize the genetic identity and quasispecies complexity of human and simian immunodeficiency viruses (HIV/SIV)
19-23. Fecal DNA was diluted so that fewer than 30% of all PCR reactions yielded an amplification product, which ensured amplification of single
Plasmodium templates in most reactions
19-23. All amplicons were sequenced directly and sequences containing mixed bases indicative of more than one amplified template were discarded. Using this approach to characterize the genetic complexity of malaria parasites in fecal samples, we could eliminate both
Taq polymerase-induced recombination (template switching) and nucleotide misincorporations in finished sequences, thereby ensuring an accurate representation of plasmodial variants as they existed
in vivo21-23.
depicts the phylogenetic relationships of a subset of SGA derived mitochondrial
cytB sequences (the entire set of 697 sequences is analyzed in
Supplementary Fig. 3). As in the corresponding tree of bulk PCR-derived sequences (
Supplementary Fig. 2), all SGA derived sequences, except for seven
P. ovale, P. vivax and
P. malariae-like strains, grouped within the
Laverania radiation. However, unlike in the bulk PCR tree,
Laverania sequences in the SGA tree clustered in a strictly host species-specific manner, forming three chimpanzee (C1-C3) and three gorilla (G1-G3) specific clades, each supported by high bootstrap values. Interestingly, this host specificity did not extend to the subspecies level, since
P. t. ellioti, P. t. troglodytes and
P. t. schweinfurthii derived sequences were interspersed; however,
cytB sequences from
P. t. schweinfurthii segregated into distinct subclades within two of the three chimpanzee lineages (C1, C2), suggesting a phylogeographic distribution of certain
Plasmodium variants (
Supplementary Figs. 3a and b). None of 363 chimpanzee derived
Plasmodium cytB sequences was closely related to human
P. falciparum. Instead, all human sequences grouped within a single clade of parasites (G1) that infected western gorillas at numerous sites in Cameroon (LB, BB, CP, NK, BQ, DD, MM, LM), the Central African Republic (DS, ND) and the Republic of Congo (GT) ( and
Supplementary Fig. 3d). A notable finding of the SGA analysis, which was obscured by bulk PCR analysis, was that most apes were co-infected with parasites representing multiple different plasmodial lineages, including variants from (i) the same
Laverania clade, (ii) different
Laverania clades, or (iii)
Laverania and non-
Laverania clades (
Supplementary Table 5). Of 65 chimpanzee and 53 gorilla samples characterized, 48 (74%) and 37 (70%), respectively, harbored more than one genetically distinct parasite strain, and 36 (55%) and 23 (43%) contained members of two or more major
Plasmodium clades (
Supplementary Fig. 3). Given this high frequency of co-infection with divergent parasites, conventional recombination-prone PCR approaches are not appropriate for generating ape
Plasmodium sequences for phylogenetic analysis. Moreover, previously reported ape
Plasmodium sequences
9-12 must be interpreted with caution since they were subject to these same confounding factors.
To test the robustness of the phylogenetic relationships depicted in , we used SGA to amplify additional genomic regions from
cytB positive fecal samples, targeting loci in the mitochondrial, apicoplast and nuclear
Plasmodium genomes. These regions included 390 bp of the caseinolytic protease C (
clpC) gene (n=126), 772 bp of the lactase dehydrogenase (
ldh) gene (n=46), and 3.4 kb (n=165) and 3.3 kb (n=127) fragments that together spanned the entire mitochondrial genome (
Supplementary Fig. 1a). Phylogenetic analyses of each of these genomic loci revealed very similar topologies. In trees of
clpC (
Supplementary Fig. 4),
ldh (
Supplementary Fig. 5) and mitochondrial sequences (
Supplementary Figs. 6 and 7),
Laverania sequences formed the same number of chimpanzee (C1-C3) and gorilla (G1-G3) specific clades, albeit with some variations in the relationships among these lineages. Importantly, there was no evidence of recombination between chimpanzee and gorilla specific parasites, although many of them infected apes at the same field sites. This suggested that
Laverania parasites are largely host specific (recombination between parasites infecting the same host species could not be assessed because of mixed
Plasmodium infections). These findings, together with the extent of genetic diversity that distinguishes the various clades, argue strongly for the existence of six distinct
Plasmodium species within the
Laverania subgenus (
Supplementary Figs. 3-8). Formal classification of these lineages must await additional taxonomic evaluation.