General Features of the
Baumannia Genome and Predicted Genes
The genome of
Baumannia consists of one circular chromosome of 686,192 base pairs (bp) with an average G + C content of 33.23% (
). The genome size closely matches an earlier estimate from gel electrophoresis [
16].
Baumannia has neither a strong GC skew pattern nor a
dnaA homolog—two features commonly used to identify origins of replication in bacteria. A putative origin was identified and designated as position 1, based on a weak but clear transition in oligonucleotide skew.
| Table 1General Features of the Genomes of
Baumannia and Other Insect Endosymbionts
|
A total of 46 noncoding RNA genes were identified: six rRNAs (two sets of 16S, 5S, and 23S), one small RNA, and 39 tRNAs including at least one for each of the 20 amino acids. A total of 605 putative protein-coding genes (CDSs) were identified in the genome, and 89.9% of these can be assigned a putative biological function. An overview of the
Baumannia genome and its encoded genes is illustrated in
, and features of these genes are summarized in
Table S1. Only four of the CDSs lack detectable homologs in GenBank or other complete genomes and thus can be considered “orphan” genes.
Evolution of
Baumannia and the Genomes of Intracellular Organisms
Genome sequences have been found to be very useful in providing for better resolution and accuracy in phylogenetic trees than is achieved using single genes such as rRNA genes [
17]. Although there are many ways to build genome-based trees, one particularly powerful approach is to identify orthologous genes between species and to combine alignments of these genes into a single alignment. We built a tree for
Baumannia and related species from 45 ribosomal proteins using this concatenation approach (
A). This tree supports the rRNA-based grouping of
Baumannia with the insect endosymbionts of the genera
Buchnera, Wigglesworthia (symbionts of tsetse flies)
, and
Blochmania (symbionts of ants) [
16]. However, the branching order is different in the protein tree with
Baumannia being the deepest branching symbiont. As in prior genomic studies [
18], the insect endosymbionts in the tree in
A are monophyletic (i.e., they share a common ancestor to the exclusion of all other species for which genomes are available). A possible close relationship of
Baumannia to the other symbionts in the group is further supported by the presence of a substantial number of segments of conserved gene order (
B).
All of these insect endosymbionts, including
Baumannia, exhibit many genome-level trends commonly found in intracellular organisms when compared to free-living relatives, including a smaller genome, lower G + C content, a higher average predicted isoelectric point for encoded proteins, and more rapidly evolving proteins (
,
A). Of critical importance to understanding these trends is that they occur in all types of intracellular organisms (e.g., mutualists and pathogens) from across the tree of life (archaea, bacteria, and eukaryotes). Much research has focused on trying to understand the mechanisms underlying these trends for which there are two major hypotheses: the loss of DNA repair genes resulting in subsequent changes in mutation patterns or changes in population genetic parameters that lead to more genetic drift [
19,
20].
As a global explanation, the population genetic forces have more support (e.g., [
21–
23]), but the issue is far from resolved. One reason for this lack of resolution is that it is usually difficult to reconstruct the early events in the evolution of intracellularity. This insect endosymbiont group has many advantages that have made it a model system for resolving these early events. The addition of the
Baumannia genome further improves the utility of this group for reasons we detail below.
One limitation of studies of the evolution of intracellular organisms is that the evolutionary separation between free-living and intracellular species is usually very large. For example, although much can be learned about recent mitochondrial evolution by comparative analysis of mitochondrial genomes, it is not even known what subgroup of
Alphaproteobacteria contains the closest free-living relative of these organelles. This is because the mitochondrial symbiosis originated billions of years ago. The insect endosymbionts lack this limitation both because their symbioses evolved relatively recently and because of the large diversity of genomes available for the
Gammaproteobacteria. To make the most use of these benefits, it is imperative to have an accurate picture of the phylogeny of the symbionts. The addition of the
Baumannia genome is useful in this regard because its proteins appear to be evolving more slowly (as indicated by shorter branch lengths in
A) than those in the other endosymbionts. Having one organism with relatively short branch lengths in this group makes it more likely that the monophyly of the insect endosymbionts in trees is a reflection of their true history and not an artifact of phylogenetic reconstruction known as long-branch attraction.
The branch-length finding is an example of how
Baumannia can be considered as a “missing link” in that it is an intermediate in many ways between the other insect endosymbionts and free-living species. This is the case not only for branch length but also for phylogenetic position (it is the deepest branching species), isoelectric point (pI), and G + C content (
). By filling in the gaps between the free-living and intracellular species, the
Baumannia genome should allow better inferences of the early events in the evolution of intracellularity.
Baumannia is not intermediate in value between free-living species and other insect endosymbionts for all “intracellular” features. For example, its genome size is smaller than that of some of the other endosymbionts. This is an important finding since the absolute values for many other features are highly correlated, both in this group and in other symbiont groups [
24]. An example of this is shown for pI and G + C content (
). Another way of looking at this is that the
Baumannia genome has shrunk more than one might expect based on its other intracellular features. This decoupling of the rates of change of different features can be useful in understanding the patterns of evolution in intracellular species. For example, one explanation for the pattern in
Baumannia is that although it has experienced more gene loss than some of the other insect endosymbionts, it has maintained the most complete set of DNA repair genes for the group (
). This retention of repair functions may have slowed its rate of change in other features, such as sequence change. If true, this suggests that, although the general differences between intracellular and free-living species may be due to population genetic forces, the variation among intracellular species may be due in part to variation in DNA repair. Consistent with this is the finding that species with the longest branch lengths in the trees (
Wigglesworthia and
Blochmania,
A) are those that are missing the mismatch repair genes (
).
| Table 2Homologs of Genes Known to Be Involved in DNA Repair and Recombination in the Complete Genomes of Insect Endosymbionts |
The differential loss of repair genes among organisms that share many other genome properties allows the insect endosymbiont group to serve as a model for studying the long-term effects of loss of various repair processes. For example, the consequences for genome evolution of losing
recA can be examined by comparing
Baumannia and
Wigglesworthia, which retain it, to
Buchnera, which lack it. The same logic can be used to study the effects of the loss of the DNA replication initiation gene
dnaA which is missing from
Baumannia (see above),
Wigglesworthia, and
Blochmannia [
18,
25] but is present in the other insect endosymbionts. Although the species without
recA may be able to survive with little or no recombination, those lacking
dnaA must make use of alternative initiation pathways. Some alternatives such as pathways based on
priA and
recA [
26] can be ruled out since at least one of these is missing from each of the species missing
dnaA. The
recBCD genes may play some role in initiation. This would explain why the
recBCD genes are present in all insect endosymbionts (
) including those missing
recA, which is required for the “normal” role of
recBCD in recombination.
Single Nucleotide Polymorphisms Are Abundant in the
Baumannia Population
Genetic variation among individuals is both a complication of genome sequencing projects of uncultured species and a valuable source of information about microbial populations. For the
Baumannia data, we used a stringent search protocol that may have missed some true polymorphisms but should have eliminated variation that was due to sequencing errors or cloning artifacts (see
Materials and Methods). In total, we identified 104 single nucleotide polymorphisms (SNPs) and two insertion-deletion differences (indels) that fit these criteria. Details of the locations and types of polymorphisms are given in
.
| Table 3Categorizations of Polymorphisms Detected in the Assembled
Baumannia Genome
|
Since our DNA was isolated from the symbionts of hundreds of hosts, one major question is whether the observed polymorphisms were between symbionts within one host or between hosts. We used polymerase chain reaction surveys of individual insects to address this question. Of the 40 insects for which sequences were obtained individually for two loci, 35 showed identity to the consensus sequence for the
Baumannia genome and five possessed the alternative alleles that were present as minority bases at four sites (two per fragment). No polymorphism was detected within individual hosts. Thus, the polymorphisms that we identified are real, and they reflect differences between symbionts of different hosts.
Since the
Baumannia can be treated as maternally inherited markers, the finding of significant levels of polymorphism between hosts suggests that the sampled population contains individuals from two separate origins. This is somewhat in conflict with theories suggesting a single introduction of a small number of individuals into California [
10] but is consistent with results from recent mitochondrial analysis [
27].
Sequence polymorphisms have been detected in genomic studies of other insect endosymbionts [
3,
28]. The most relevant one for comparison to
Baumannia is a study of the ant endosymbiont
Blochmannia pennsylvanicus, although we note that the criteria they used for detecting a polymorphism were somewhat less stringent than ours [
28]. The percentage of the SNPs that are in coding regions is different in the two species (81% in
Baumannia and 65% in
B. pennsylvanicus), but this is in line with differences in gene-coding density (88% in
Baumannia and 76% in
B. pennsylvanicus). For both species, the percentage of SNPs in protein-coding genes that represent synonymous differences is higher than expected from random changes given the genomic base compositions (52% in
Baumannia and 62% in
B. pennsylvanicus). This indicates ongoing purifying selection in both genomes. The most significant difference between the species is the higher ratio of transitions to transversions in
B. pennsylvanicus (2.9 versus 1.4 in
Baumannia;
). We propose that this is due to the absence of mismatch repair genes in
B. pennsylvanicus (as discussed above)
, which in other species leads to an increase in transition mutations [
29]. An absence of mismatch repair would also explain the higher incidence of indels in
B. pennsylvanicus.
Metabolic Reconstructions Provide Insight into the Biology of
Baumannia
Predictions of the metabolism of an organism from its genome sequence are critical to studies of uncultured organisms because of the difficulty of experimental studies. We have generated such a prediction for
Baumannia (
). Although all such predictions should be viewed as hypotheses, not facts, they are greatly improved by having closely related species for which experimental studies are available. This is yet another advantage of working on the insect symbionts in the
Gammaproteobacteria. For example, almost all
Baumannia genes have clearcut orthologs in well-studied organisms such as
Escherichia coli.
As expected, based on its small genome size,
Baumannia has a relatively limited repertoire of synthetic capabilities. There are some important features of its predicted metabolism, and we discuss these in this and the next few sections of this paper, calling attention in particular to those of relevance to the host-symbiont interaction.
Baumannia is predicted to synthesize its own cell wall and plasma membrane, processes known to be lost in some intracellular species. It is, however, apparently unable to synthesize the lipopolysaccharide (LPS) commonly found in the outer membrane of other Gram-negative bacteria. The same is true for
Buchnera species but not for
Wigglesworthia and
Blochmannia. The functional significance of this difference is unclear. On one hand, lipid A (the lipid component of the LPS) is generally highly toxic to animal cells; thus, LPS may be disadvantageous for endosymbionts and discarded during their evolution. Alternatively, the difference may reflect differences in the packaging of symbionts within the host bacteriocytes.
Buchnera and
Baumannia cells are surrounded by host-derived vesicles, while
Wigglesworthia and
Blochmannia directly contact the host cytoplasm.
The findings in regard to sugar metabolism are consistent with
Baumannia acquiring sugars from its host and using them for energy metabolism. For import, a complete mannose phosphotransferase permease system is present including an Enzyme II
Man complex, the phosphotransferase system Enzyme I, and histidyl phosphorylatable protein PtsH. Imported sugars could then be fed into glycolysis. However, since the tricarboxylic acid cycle appears to be incomplete, apparently reducing power must come from other sources such as glycolysis itself, a pyruvate dehydrogenase complex, and an
mqo type malate dehydrogenase. An intact electron transport chain consisting of NADH dehydrogenase I, cytochrome
o oxidase, and ATP synthase is present.
The most striking aspects of the metabolism of
Baumannia relates to what it apparently does and does not do in terms of the synthesis of essential nutrients missing from the hosts' xylem diet.
Baumannia Is a Vitamin and Cofactor Machine
A large fraction of the
Baumannia genome (83 genes, 13.7% of the total) encodes proteins predicted to have roles in pathways for the synthesis of a diverse set of vitamins, cofactors, prosthetic groups and related compounds (
,
Table S1). These include thiamine (vitamin B
1), riboflavin (vitamin B
2), niacin (vitamin B
3), pantothenic acid (vitamin B
5), pyridoxine (vitamin B
6), as well as biotin and folic acid. More detail on the pathways and the basis for the predictions is given below.
For the synthesis of riboflavin, folate, pyridoxal 5′-phosphate, and thiamine, complete pathways for de novo synthesis could be identified with
Baumannia's ability to produce endogenously important precursors such as ribulose-5-phosphate, phosphoenolpyruvate, pyruvate, dihydroxyacetonephosphate, glyceraldehyde-3-phosphate, erythrose-4-phosphate, guanosine triphosphate, 5-aminoimidazole ribonucleotide, 5′-phosphoribosylglycinamide, and 5,10-methylene-tetrahydrofolate.
For some compounds, although homologs of enzymes carrying out key steps in other species could not be identified, candidates for alternatives are present suggesting the pathways are complete. For example, the step normally carried out by erythrose 4-phosphate dehydrogenase
(Epd) in the pyridoxal 5′-phosphate pathway might be carried out by glyceraldehyde 3-phosphate dehydrogenase (GapA) as seen in some other species [
30].
There are some compounds for which we could identify homologs of all known genes in biosynthetic pathways. However, some enzymes in these pathways are still unknown in any organism, and thus we could not identify them here. This is true for the pyrimidine phosphatase in the riboflavin pathway and the dihydroneopterin monophosphate dephosphorylase in the folic acid pathway. We believe it is likely that these pathways are complete in
Baumannia and that, due to its ultracompact gene pool,
Baumannia provides an ideal opportunity to identify the genes encoding the enzymes for these steps.
Perhaps most interesting are the pathways for which we could identify genes underlying many downstream steps but for which
Baumannia would need to import some intermediates to feed those steps. For example,
Baumannia encodes genes for the last three steps for siroheme synthesis, and the last step of heme O pathway, but candidate genes underlying the upstream steps could not be identified. Thus,
Baumannia needs to import prophobilinogen and protoheme as substrates for these incomplete pathways. This pattern is particularly apparent in that
Baumannia appears to be able to synthesize many cofactors from amino acids but is unable to synthesize the amino acid precursors. Examples of such pathways and the amino acid required include thiamin (tyrosine), biotin (alanine), pyridine nucleotides (aspartate), and folate and pyridoxal 5′-phosphate (glutamine and glutamate). This suggests that
Baumannia must import these amino acids. The lack of amino acid biosynthesis pathways also makes it a necessity for
Baumannia to import 2-ketovaline as a precursor for the synthesis of pantothenate and coenzyme A.
Due to the diversity of vitamin and cofactor synthesis pathways that are present, we conclude that
Baumannia is providing its host with these compounds due to their low abundance in its diet. In this respect
Baumannia is more similar to
Wigglesworthia, the symbiont of tsetse flies, than to
Buchnera.
Amino Acid Biosynthetic Pathways Are Generally Absent from
Baumannia and Likely Are Found in Another Organism in the System
In contrast to what is seen for vitamin and cofactor synthesis,
Baumannia is predicted to encode a very limited set of amino acid synthesis pathways. The few capabilities that are present include histidine biosynthesis, synthesis of methionine if external homoserine is provided, and the ability to make chorismate but not to use it as substrate for production of aromatic amino acids as in most bacterial species. Except for histidine, no complete pathways for the synthesis of any amino acids essential to the host are present.
The lack of amino acid synthesis pathways is apparently compensated by an ability to import amino acids from the environment using a general amino acid ABC transporter, an arginine/lysine ABC transporter, a lysine permease, and a proton/sodium-glutamate symport protein, although the gene for the latter is disrupted by one frameshift. The import of amino acids is apparently used not just for making proteins but also for energy metabolism. The latter is evident by the presence of the aspartate ammonia-lyase AspA, which could be used to convert
l-aspartate to fumarate, which in turn can be fed into the tricarboxylic acid cycle.
The absence of essential amino acid synthesis pathways from
Baumannia implies that both the host and
Baumannia must obtain amino acids from some external source or sources. The sole diet of
H. coagulata is xylem sap [
10], in which essential amino acids are rare to absent; however, a substantial portion of the nitrogen in xylem occurs in the form of certain nonessential amino acids, including glutamine, aspartic acid, and asparagine (e.g., [
11,
14,
31,
32]). The essential amino acid synthesis pathways have not been found in any animal species studied to date, and nutritional studies in insects indicate that these compounds are required nutrients in insects as in mammals. Thus, the most plausible alternative is that another organism that is reliably present in the “ecosystem” of the host body is synthesizing the missing amino acids.
Analysis of Leftover Shotgun Sequence Reads Reveals the Presence of Amino Acid Synthesis Genes in Organisms Other than
Baumannia
The most likely candidate for another organismal source of the amino acid synthesis pathways is
Sulcia, the other coevolving symbiont found in bacteriomes mentioned above. Although we did not set out to sequence the
Sulcia genome as part of this project, we realized we might have inadvertently acquired some of it since many sequence reads from the shotgun sequencing did not assemble with the
Baumannia genome. These reads derived from cells of other organisms that were present in the tissue samples we used to isolate DNA for the
Baumannia sequencing. An initial search of these sequence reads revealed the presence of homologs of genes with roles in the synthesis of essential amino acids. However, we could not conclude that these reads were from
Sulcia, since there could have been cells of other organisms in the sample as well. To sort the extra reads into taxonomic bins, we adapted methods we have used to sort sequences from environmental shotgun sequencing projects (see
Materials and Methods) and were able to assign non-
Baumannia sequences to three main groups: host,
Wolbachia related, and
Sulcia related.
The finding of some
Wolbachia in the sample was not surprising since rRNA surveys have shown that these alphaproteobacterial relatives of Rickettsia are found in many sharpshooters including
H. coagulata. We note that we did not detect any sequences from the previously sequenced phytopathogen
X. fastidiosa, which colonizes the surface of the foregut and is not present in the bacteriomes that we used for DNA isolation. In addition, although some of our sequences show high identity to sequences annotated as being from a phytoplasma, we believe this annotation is incorrect. The “phytoplasma” DNA was isolated from the saliva of the leafhopper
Orosius albicinctus [
33]. However, all the sequences in our sample that showed matches to sequences annotated as “phytoplasma”-like show phylogenetic relationships to the
Bacteroidetes phylum. In addition,
Sulcia is known to be a symbiont of species in the Deltocephalinae, the leafhopper subfamily containing
O. albicinctus [
9]. Thus, the putative “phytoplasma”-like sequences with matches in our sample are likely from the
Sulcia symbiont of
O. albicinctus. Why these sequences appeared in samples from salivary secretions is unclear.
Amino Acid Synthesis Pathways Are in
Sulcia and Not Other Organisms in the Sample
Of the essential amino acid synthesis genes identified in the extra shotgun sequence reads, the vast majority (31 of 32) were assigned to the
Sulcia bin. In contrast, only one gene (
argB) was found in the
Wolbachia bin and none were found in the host bin. We therefore sought to obtain as much sequence information as possible from the
Sulcia symbionts in this system. First, we completed the sequence of all plasmid clones for which at least one read had been assigned to the
Sulcia bin. In addition, we constructed a new library from tissue thought to contain more of the
Sulcia symbiont than the library used for the initial sequencing. End-sequencing of this library identified some additional
Sulcia-derived clones, and these, too, were completely sequenced. After conducting another round of assembly and assigning contigs and sequences to taxonomic bins, we were able to assign 146,384 bp of unique sequence to
Sulcia. In these data, we identified 166 protein-coding genes. A phylogenetic analysis of a concatenated alignment of ribosomal proteins groups this protein set within the Bacteroidetes, thus supporting our assignment of these sequences to
Sulcia (
).
Although theoretically we could obtain a complete genome sequence of
Sulcia by very deep sequencing of the samples we have obtained, this was not practical given limited funds. Nevertheless, analysis of the incomplete genome is quite revealing. First, among the 166 predicted proteins are 31 that underlie steps or whole pathways for the synthesis of amino acids essential for the host (
). These include the complete pathway of threonine biosynthesis and nearly complete pathways for the synthesis of leucine, valine, and isoleucine (the only gene not sampled is
ilvE encoding the branched chain amino acid aminotransferase). In addition, multiple genes in the pathways for the synthesis of lysine, arginine, and tryptophan are present. We believe it is likely that these pathways are present and that the missing genes are in the unsequenced parts of the genome.
One question that remains is where
Sulcia gets all of the nitrogen for these amino acids. One possibility is that it acquires and then converts nitrogenous organic compounds, particularly the nonessential amino acids known to be present in xylem (e.g., [
14,
32]). Alternatively, it is possible that
Sulcia assimilates nitrogen from compounds such as ureides or ammonium, which are found in xylem (e.g., [
14,
32,
34]). It has been proposed that
X. fastidiosa, the plant pathogen vectored by
H. coagulata, makes use of the ammonium in xylem as a nitrogen source [
35]. Alternatively,
Sulcia could garner inorganic nitrogen from the host, for which ammonium is a waste product [
10,
13]. Host waste is apparently is a source of nitrogen for
Blattabacterium, close relatives of
Sulcia that are symbionts of cockroaches [
36]. Although some insect genomes encode enzymes that may allow for this (e.g., glutamine synthetase or glutamate synthase (e.g., [
37]), it is not yet known whether these capabilities are present in sharpshooters. Whatever the source of its nitrogen, the genome analysis indicates that
Sulcia apparently can make the amino acids required by the host.
The other abundant organism in our DNA was
Wolbachia, an unlikely candidate as the source of these compounds.
Wolbachia cannot be an obligate symbiont of sharpshooters because it infects only some individuals. Screening of individual
H. coagulata indicates that some do not contain
Wolbachia ([
16], two of 40 insects were uninfected in our screens); and screening of individuals of the closely related species,
Homalodisca literata (a synonym of
H. lacerta), revealed no cases of
Wolbachia infection. Also, although we have sampled only a fraction of the
Wolbachia genome, the absence of amino acid synthesis pathways is consistent with the complete lack of essential amino acid biosynthesis in any of several sequenced
Wolbachia genomes (two complete and many incomplete) [
23,
38,
39].
We therefore conclude that
Sulcia is most likely the sole provider of essential amino acids for
H. coagulata. Thus, this member of the
Bacteroidetes phylum appears to function in a similar way to
Buchnera and
Blochmannia species in the
Gammaproteobacteria.
Sulcia and
Baumannia Complement Each Other
We found very few genes in the partial
Sulcia genome for vitamin or cofactor synthesis. Since the
Sulcia genome appears to be quite small and we have apparently sampled a large fraction of it, we can speculate that few such genes are likely to be present. Thus, in the 146 kb of sequence assigned to
Sulcia, we have already found many of the core housekeeping types of genes (e.g., 40 ribosomal proteins and ten tRNA synthetases (
,
Table S2). A very small genome size is consistent with phylogenetic reconstructions indicating that
Sulcia is an extremely old symbiont, originating in the Permian [
9].
The paucity of vitamin and cofactor synthesis pathways in
Sulcia suggests the possibility that
Sulcia and
Baumannia play complementary, nonoverlapping roles in this symbiotic system. Not only do they appear to provide different resources for the host (
Sulcia provides the amino acids and
Baumannia the vitamins and cofactors) but, based on the current evidence, each does not provide the resources made by the other (
). Indeed, the single essential amino acid biosynthetic pathway present in the
Baumannia genome, that for histidine, is correspondingly the sole essential amino acid pathway with multiple steps for which no genes were detected in
Sulcia. Thus, although
Baumannia and the host apparently depend on
Sulcia for the majority of essential amino acids,
Sulcia and the host may depend on
Baumannia for histidine. The complementarity between host and each symbiont extends to mutual dependence between the symbionts, which appear to depend on each other for these required compounds and for intermediates in other metabolic processes. For example, we predict that
Sulcia can make homoserine, which, as discussed above, could be the substrate for methionine synthesis in
Baumannia. In addition, the valine pathway in
Sulcia could be the source of the 2-ketovaline for pantothenate and coenzyme A biosynthesis in
Baumannia. Exchange of intermediates may be occurring for many aspects of metabolism. In the case of ubiquinone, a key component of the electron transport chain,
Baumannia lacks genes encoding the needed biosynthetic enzymes and thus likely needs to import ubiquinone. The same appears to be true for menaquinone. Strikingly, even though only four of the 166 proteins in
Sulcia are predicted to be involved in pathways of cofactor synthesis, two are for production of menaquinone and ubiquinone production, which are among the few cofactors whose synthesis is not carried out by
Baumannia.
| Table 4The Complementarity of Amino Acid Biosynthesis and Cofactor Biosynthesis Pathways between
Baumannia and
Sulcia
|
The coresidence of
Sulcia and
Baumannia, presented here from
H. coagulata, is representative of a symbiotic pair that is distributed in most or all sharpshooters, a xylem-feeding insect group [
9,
16]. Thus, the possibility of metabolic complementarity that is suggested by the genome analyses reflects long coevolution of the three lineages represented by the insects and the two bacteria. The two symbionts occur in close proximity within the yellow portion of the host bacteriomes [
16], and
Baumannia cells often appear to adhere to the surface of the much larger
Sulcia cells. This arrangement is illustrated in images from our in situ hybridizations for
H. literata, a close relative of
H. coagulata (
).
Conclusions
The glassy-winged sharpshooter,
H. coagulata, feeds on xylem sap, which has very low levels of many nutrients required by insects and other animals [
10]. Sequence analysis suggests the occurrence of an obligate symbiosis among three organisms:
H. coagulata, the gamma-proteobacterial endosymbiont
Baumannia, and the
Bacteroidetes bacterial symbiont
Sulcia. The two bacterial symbionts co-occur within the cytosol of sharpshooter bacteriocytes, sometimes residing within the same cells. The main function of
Baumannia, as revealed by its genomic sequence, is to provide cofactors, especially water-soluble B-family vitamins, to the host. Partial sequences from
Sulcia suggest that it provides essential amino acids to the host. The two endosymbionts appear to show functional complementarity and show little overlap in biosynthetic pathways, although full sequencing of the
Sulcia genome is needed for a comprehensive view of the contributions of these two organisms. Our analysis shows the added insight possible from assigning sequences to organisms rather than treating environmental samples as a representative of a communal gene set.
Many questions remain regarding this fusion of separate lineages into a single metabolic system. For example, the different organisms must balance their contributions to the shared metabolism through coordinated growth and gene expression, and the mechanisms underlying this integration are not known. Also, these bacterial genomes have undergone major reduction in size while apparently maintaining their complementary capabilities, raising the question of how the steps in genome reduction have been coordinated. The sharpshooters and their obligate bacterial endosymbionts provide a simple model of genomic coevolution, a process that has likely been central in the evolution of most organisms living in stable associations.