Phospholipase C, A1, A2
Phospholipase C, A
1, and A
2 (PLC, PLA1, PLA2) activity was reported in Ureaplasma serovars 3, 4, and 8 by DeSilva and Quinn
[
20,
21,
23]. It is important to note that the assay used by DeSilva measures combined activity of PLC and phospholipase D (PLD) because both cleavage products are in the soluble fraction and the radioactively labeled hydrogen would be found in both cleavage products
[
39]. PLC activity has been reported in
Ureaplasma diversum cells as well, and has been suggested to play a role in ureaplasma invasion in mammalian cells
[
40]. However, the detection method used the artificial substrate p-nitrophenylphosphorylcholine (p-NPPC), which can be hydrolyzed by several other enzymes that can hydrolyze phosphate esters, including PLD
[
41]. All 14 ATCC ureaplasma serovar genomes and the genome of the previously sequenced clinical isolate of UPA3 were extensively evaluated for the presence of PLC, PLA1, and PLA2 genes. No genes showed significant similarity to known sequences of PLC, PLA1, or PLA2 in any of the genomes. HMMs developed for known PLC, PLA1, and PLA2 did not detect any ureaplasma genes with significant similarity. This suggested that ureaplasma may encode phospholipases that are either very degenerate or have evolved separately from known phospholipases as previously suggested by Glass et al.
[
25], or that no phospholipase genes are present in
Ureaplasma spp. It is interesting to note that a PLD domain containing protein was easily identified. In all serovars this protein is annotated as cardiolipin synthase (UPA3_0627 [GenBank YP_001752673]).
We used two PLC assays to test ureaplasmas for PLC activity: Invitrogen’s Amplex® Red Phosphatidylcholine-Specific Phospholipase C Assay Kit, which detects also PLD activity, and the original PLC assay published by DeSilva and Quinn. We were not able to detect PLC or PLD activity in ureaplasma cultures of serovars 3 and 8. Our attempts to repeat De Silva and Quinn’s PLC assay using L-a-dipalmitoylphosphatidylcholine - (choline-methyl-3

H) with UPA3 and UUR8 cultures grown to exponential phase and processed to collect the cell membranes and cleared cell lysates as described in their original publications
[
20,
21,
23] failed to replicate the specific activity levels they reported in ureaplasma cultures. Because we were not able to find PLC, either computationally or experimentally, we believe that this gene is not present in ureaplasmas. However, a study done by Park et al. suggests implication of PLD in the signaling cascade that activates COX-2, leading to production of prostaglandins and initiation of labor
[
42]. Since all ureaplasma serovars and the four sequenced clinical isolates contain a gene with PLD domains, a future functional characterization of this gene would be of interest. We have not been able to find computationally the genes encoding PLA1 and PLA2 in ureaplasmas.
Multiple Banded Antigen (MBA) Superfamily
The original classification of ureaplasma isolates into distinct serovars was largely based on differences in the major ureaplasma surface antigen called the multiple banded antigen (MBA) (8–10, 12). MBA consists of an N-terminal conserved domain and a C-terminal variable domain. The conserved domain contains a signal peptide, lipoprotein attachment site, and one transmembrane domain. While the conserved
mba domains for all 14 serovars had been sequenced previously, for most serovars sequencing of the variable domain, which was thought to be serovar specific, was only partial
[
15,
50,
51]. Our whole genome data confirmed that variable regions usually consist of tandem repeating sequence/units (TRU). Only in UUR13 is the conserved domain attached to a variable domain that does not contain any tandem repeats. The same variable domain is found also in UUR12 and UUR4; however it is not attached to the conserved domain of the
mba in these serovars
. The MBA is recognized by the Toll-like receptors 1, 2, and 6, and is capable of inducing the cytokine, NF-κB and antibody production
[
52]. It is conceivable that ureaplasmas would have evolved strategies to vary the MBA in order to evade this response. Ureaplasma isolates can vary the number of the tandem repeats of their
mba gene in response to challenge with antibodies presumably by slipped strand mutagenesis
[
53]. Furthermore,
mba can phase vary with neighboring genes, and UPA3 was recently shown to produce a chimeric genes though phase variation by fusing the N- terminal part of the
mba paralog UU172 [GenBank: CBI70486] to its neighboring gene UU171 [GenBank: NP_078003] and by fusing the N-terminal part of UU375 [GenBank: NP_078209.1] to its neighboring gene UU376 [GenBank: NP_078210.1]
[
54,
55]. These findings suggest that
mba and some
mba paralogous genes might be involved in strategies for evading the host immune system employed by ureaplasmas.
One of the surprises of our whole genome analysis and comparison of the 14 ATCC serovars showed the
mba genes to be part of a large complex gene superfamily comprising 183 UPA and UUR genes and 22 subfamilies (Figure

). There were a limited number of unique variable domains as shown in Table

. We found that all UUR serovars and UPA1 and 6 had more than one tandem repeating unit type in their
mba locus. Although some of the TRUs in the loci have not yet been observed to be attached to the conserved domain of the
mba, they are surrounded by inverted repeats that contain a putative recombinase recognition site. This suggested that these TRUs were involved with the
mba and contributed to surface antigen variation. We consider genes without tandem repeats that are in the
mba locus and have the putative recombination recognition site to be part of the MBA superfamily. The UPA serovars had a simpler MBA phase variation systems than the UUR serovars: the UPA conserved domain was surrounded by inverted single base pair repeats, containing the 25 base pair putative recombinase recognition site (Figures

and
). The inverted repeats and a site-specific recombinase were potentially involved in inverting the orientation of the transcriptional promoter and conserved domain in order for expression to occur with one or the other TRU. A list of all genes encoding potential recombinases or transposases is provided in the Additional file
5: 19UU_Recombinases.xls. In most serovars a recombinase or a transposase is located in close proximity to the
mba locus. Experimental evidence is needed to determine which recombinase is responsible for the rearrangement of the locus. It is interesting to note that one TRU was short and had a high copy number (18 nt - UPA1, 12 nt - UPA6, repeated >30X) and the other one was long and had a low copy number (327 nt -UPA1, 336 nt - UPA6, repeated <5X). Rearrangements of the
mba locus were evident in the smaller contigs of unfinished serovar genomes (Figures

and
). UPA1 genome sequencing data clearly shows a sub-population in which the conserved domain of the
mba is attached to the alternative TRU ([GenBank: NZ_ABES01000008] -gcontig_1106430400161, [GenBank: NZ_ABES01000003] - gcontig_106430400170; Figure
& Table

) and another subpopulation in which another gene is present between the two TRUs ([GenBank: NZ_ABES01000002] - gcontig_1106430400172). The high repeat number of the
mba TRUs, and the existence of a subpopulation in the culture being sequenced that has a rearrangement of the
mba locus, represent an ambiguity for the assembly software, resulting in the generation of smaller alternative contigs that cannot be assembled into the chromosome. The alternative 327 nt
mba TRU of UPA1 is on a 1399 nt long contig [GenBank: NZ_ABES01000008] that contains only this gene, and it ends truncating the 327 nt TRU at only 2.3 repeats compared to 4 repeats on the main contig. Furthermore, comparing the two variations of the
mba locus makes evident the break-points where the flip of the conserved domain occurred. This coincides with the sites of the inverted repeats suspected to be part of the mechanism for MBA phase-variation. This represents sequencing evidence that this serovar could express both variations of the MBA at different times.
| Table 5Tandem Repeating Units (TRUs) identified in thembalocus |
All UUR serovars have more than two TRUs in close proximity to each other. Serovars UUR7 and UUR11 have only 2 TRUs each, whereas UUR2 and UUR5 have 6 TRUs each, which is the maximum number of TRUs observed. The largest
mba loci are around 10

KB and have 6 TRUs and some non-TRU
mba genes. Each
mba locus contains only one conserved domain. The loci are always located adjacent to the DNA pol III alpha subunit (except UPA14) and on the other side of the loci there is a putative Xer-C site- specific recombinase. Next to each TRU there is a putative 25 nt recombinase recognition sequence [ACTTT(T/C)TCT(G/C)TTTGATAATT(C/A)AAAT]. The same recognition site is located next to some non-TRU genes in the loci, therefore making them likely to be involved in this phase variable superfamily. Furthermore, serovar 13 has a non-TRU variable domain fused to the conserved domain of the
mba, confirming that the variable unit does not necessarily require tandem repeats. An interesting observation is that UUR4, 12 and 13 have the same
mba locus composition in 3 different rearrangements (Figure

). Most TRUs were found to be present in more than one serovar. By carefully analyzing small contigs in unfinished ureaplasma genomes, we identified variations of the
mba loci. For example, on a small contig of UUR8 gcontig_1118434609926 [GenBank: NZ_AAYN02000001] we saw a partial
mba locus arranged alternatively by duplicating one of the TRUs in the locus. Examining the sequencing and assembly data of such contigs confirms that these contigs are not misassembled, but rather represent a subpopulation of the sequenced culture. The proposed mechanism for variation of the ureaplasma
mba locus resembles the previously reported variable loci of
Mycoplasma bovis:
vsp,
Mycoplasma pulmonis: vsa and
Mycoplasma agalactiae: vpma[
56]. The involvement of a site-specific Xer-like recombinase and inverted repeats was experimentally proven for the
M. pulmonis vsa locus
[
57] and the
vpma locus of
M. agalactiae[
58]
, and suggested for the phase variation of the
vsp locus in
M. bovis[
56]
. We believe that a Xer-like recombinase is likely to be involved in the phase variation of the
mba locus of
Ureaplasma spp and a putative recombinase recognition site has been determined. The
mba locus resembles the
M. pulmonis vsa locus in that it has only one promoter and one conserved domain per
mba locus, which needs to be moved in front of a variable domain to make a functional surface MBA.
Examination of the
mba loci of the four sequenced UUR clinical isolates that cannot be assigned to a serovar shows that the
mba conserved domain is UUR specific. Due to the repetitive nature of the
mba TRUs the loci are broken into multiple contigs, making it impossible to determine the exact order of the genes in the
mba loci without further sequencing. Isolate 2033 had 4 identifiable TRUs (
mba333bp,
mba213bp.1,
mba30bp and the non-tandemly repeating unit of UUR13; see Table

). Of these,
mba30bp was found attached to the conserved domain of the MBA and is the equivalent of the active TRU in UUR4. The same TRU was also present in the mba loci of UUR12 and UUR13. Isolate 2608 contained 3 identifiable TRUs (
mba24bp.1,
mba267bp, and
mba330bp). The conserved domain was found attached to
mba24bp.1, as in UUR5; this TRU was also present in UUR2 and UUR8. Clinical isolate 4318 had 3 identifiable TRUs (
mba24bp.1,
mba276bp, and
mba333bp). The conserved domain was attached to
mba24bp.1. Isolate 4155 had 5 identifiable TRUs (
mba24bp.1,
mba45bp,
mba213bp.2,
mba252bp.1, and
mba276bp). The conserved domain was attached to
mba276bp; this TRU had not been previously seen attached to a conserved domain in any of the 14 ATCC type strains, including the clinical UPA3 described by Glass et al.
[
25]. This is a further confirmation that the TRUs found in the
mba locus are part of this phase variable system, which trough recombination should be capable to present on the surface of the ureaplasma cell different TRUs at different times. It would be interesting to investigate whether some TRUs are more immunogenic than others and therefore may contribute to differential pathogenicity. As mentioned earlier the
mba variable domain has been used as one of the determinants of serovar classification. It is interesting to note that serovars 4 and 12, which have an identical set of MBA genes, have a percent difference at the nucleotide level in a whole genome comparison (Table

) of only 0.06 or 0.07% (value depends on which genome is used as reference sequence), making these serovars almost identical, with the exception of some minor rearrangements and small insertion/deletion events (see Additional file
2: Figure S5). In addition, we observed two chimeric
U. parvum strains in a clinical isolate that had exchanged through horizontal gene transfer their
mba genes
[
26]. Taken together, these observation suggest that the
mba locus is dynamic and can comprise of a different set of variable domains at different times, therefore making this gene an unsuitable target for serovar differentiation.