Chlamydia genome architecture
The two C.trachomatis
and the two C.pneumoniae
genomes sequenced to date are highly conserved in gene content and order (Fig. and Table ). Scatter plots based on the results of Fasta3 searches are presented in Figure A and B. The C.trachomatis
MoPn and serovar D plots were almost linear (Fig. A), indicating that despite evolutionary separation that has allowed an average difference in orthologous genes of ~10%, there have been no major rearrangements in the chromosomes. The exception to the overall synteny is in an area of ~50 kb near to the predicted termination origin, which appears to be a ‘plasticity zone’ (13
). The C.pneumoniae
AR39 and CWL029 chromosomes were essentially identical, with only a few small deletions and ~300 single nucleotide polymorphisms (SNPs) distinguishing the two strains, although the AR39 sequence included a novel infecting bacteriophage (described later). Only when the C.trachomatis
genome was compared to the C.pneumoniae
genome (Fig. B) was there evidence of chromosomal rearrangements. It appears that there have been several large DNA inversions (inverted diagonals on the scatter plot) in the period since the two species had diverged from their common ancestor. The C.pneumoniae
chromosome also has a plasticity zone near its termination origin where there has been a higher rate of DNA reorganization, although this region is more extended in the C.pneumoniae
genome than in the C.trachomatis
genome (~160 versus ~50 kb). From Figure it is notable that many of the divergent genes (red or blue ticks) in the chlamydiae are clustered, suggesting that they are in units involved in a similar cellular function. The significance of these groups of divergent genes is that they might represent determinants of strain-specific functions, for instance host tropism or specific virulence activity.
Figure 1 Comparison of the C.pneumoniae and C.trachomatis genomes. The DNA genomes of C.trachomatis MoPn, C.trachomatis serovar D and C.pneumoniae (CWL029 and AR39 are effectively identical at this level of resolution) are represented as circles. Genomes are (more ...)
Figure 2 Dot plots of gene similarities between (A) C.trachomatis MoPn and C.trachomatis serovar D and (B) C.trachomatis MoPn and C.pneumoniae AR39. The criterion for match was a Fasta3 score of <10–8. The genomes have been rotated to better show (more ...)
It is interesting to note that the multiple large inversions in the Chlamydia
chromosomes occur around the axes of the origins of replication and termination (Fig. B). Recombination across the origins has been seen in other eubacteria (14
) but the chlamydiae provide one of the clearest illustrations of how this phenomenon affects the architecture of the genome. Another significant feature of the chlamydiae genomes is tandemly repeated genes. Figure A and B charts the positions of the duplicated genes of C.trachomatis
and those that have the nearest sequence match to another gene in the same chromosome. Mostly, these genes are situated next to each other, indicating a recent recombination event.
Figure 3 Gene duplications in (A) C.pneumoniae AR39 and (B) C.trachomatis MoPn. Duplicated genes in each chromosome were identified as having a score in a Fasta3 comparison of <10–8. The location of each gene and its duplicate on the chromosome (more ...)
The origin-linked inversions and tandem duplication events are more clearly observable in Chlamydia than in other bacteria because of the apparent paucity of intra-genomic DNA rearrangement and the rarity of sequences from extraneous sources. There are no IS elements or other dispersed repeated sequences in the genomes to promote intramolecular rearrangements and disrupt the synteny of the genetic organization. Judging from a lack of variation in the ratio of GC to AT nucleotides across the genomes (data not shown) there are no regions from any of the four sequenced Chlamydia genomes that have recently been transferred from an evolutionarily diverged organism.
The C.trachomatis plasticity zones
Considering that Chlamydia are isolated from genetic exchange with other bacteria owing to their obligate intracellular ecological niche, it is not surprising that there is a great deal of overall synteny between the C.trachomatis genomes. For the same reasons, it is significant that there is one segment of each genome, the plasticity zone (Fig. ), that has undergone genetic reorganization to a much higher degree than the rest of the chromosome. Outside the single plasticity zone, syntenic differences between the C.trachomatis strains is limited to three novel genes together with rare gene duplications in the MoPn genome. Two of the novel MoPn genes encode DNA helicases, while the other specifies uracil phosphoribosyl transferase (upp).
The C.trachomatis plasticity zone extends between the conserved orthologs dbsB (disulphide bond oxidoreductase) and ycfV, encoding an ABC-transporter of unknown substrate specificity. The distance between the 3′-ends of these genes is 22 922 nt in serovar D and 50 624 nt in MoPn. This size difference in the plasticity zones (27 702 nt) is of the same order as the total difference between the two genomes (26 893 nt; Table ). The genetic composition of the C.trachomatis plasticity zones are outlined in Figure . There are several differences in the plasticity zone between the human and mouse trophic genomes that suggest an influence on Chlamydia pathogenesis. Chlamydia trachomatis MoPn, in common with C.pneumoniae, contains guaAB and adenosine deaminase (add) apparently arranged as a single operon. In the same location relative to the 5′-end of the opp gene, C.trachomatis serovar D has the trpRBA trypophan biosynthesis cluster. This arrangement suggests strongly that in the human C.trachomatis strain trpRBA has replaced the guaAB and adenosine deaminase genes.
Figure 4 Gene map of C.trachomatis MoPn and serovar D plasticity zones. Schematic diagram showing the gene content of the plasticity zones of the two C.trachomatis strains between the conserved dbsB and ycrfV loci. The line across the two homologs of the large (more ...)
Another striking difference between the two C.trachomatis
plasticity zones is the presence of a 9675 nt gene, ORF TC0439, encoding a putative toxin protein of predicted molecular weight 364 kDa. The protein bears an overall similarity of 53% to a 3192 amino acid putative toxin encoded by the Escherichia coli
0157:H7 virulence plasmid (17
). Both the MoPn and 0157:H7 toxins have similarity at their N-terminus to the N-terminus of large clostridial toxins (LCTs; 19
). This portion of LCT molecules has been shown to interfere with eukaryotic cell chemistry by glycosylating GTP-binding proteins of the Ras superfamily. A conserved motif in LCTs and yeast glycosyltransferases (LxxxGGxYxDxD) (17
) was found at the N-terminus of the MoPn and E.coli
toxins, suggesting similar activity by the latter proteins. In addition to the catalytic region, LCTs contain domains for recognizing cell surface receptors and translocation through the outer membrane. The C-terminal ligand-recognizing portions typically contain multiple repeated motifs. Hydrophobicity plots of the Chlamydia
toxin (data not shown) indicate potential MSD in the center of the molecules but the C-termini are not repetitive in the manner of LCTs. Adjacent to the toxin are two other very similar large toxin-encoding genes but these ORFs contain multiple frameshift mutations (Fig. ). The serovar D strain also contains what appears to have been an entire toxin gene that has accumulated numerous frameshift mutations, arguing that there has been selection against expression of the entire toxin in the human trophic strain but not in the mouse trophic strain.
The plasticity zone is also the location of an unusual family of genes encoding phospholipase D–endonuclease (PLD) superfamily proteins previously reported by Kalman et al
). These proteins have little overall similarity to other PLD enzymes and lack a type II secretion signal sequence but contain conserved duplicated HKD motifs typical of this family (20
). Chlamydia trachomatis
serovar D contains four PLD paralogs between ycfV
and the toxin genes arranged in an operon. MoPn contains five paralogs in this location and two on the other side of the toxin genes in the opposite orientation (Fig. ). PLD genes on the same genome are generally more closely related to each other than to paralogs from the other strain, indicating that frequent intragenomic duplication and deletion has occurred in this gene family.
Overall, the C.trachomatis plasticity zones are the location of several genes suspected to be involved in pathogenesis, such as the trp genes, the large toxin and the unusual PLD-like enzymes, suggesting that these regions might be sites for horizontal gene exchange. The GC content and codon adaptation values (21) of genes in this area are in line with other chlamydial loci, providing no evidence of recent horizontal movement of genes from outside the genus. More likely, genetic rearrangements in this area are endogenous, with deletions or duplications of existing loci (the latter case explaining the expansion of the numerous PLD paralogs). Chlamydia pneumoniae has either deleted the toxin and PLD genes from this region at an earlier point in its evolution or C.trachomatis acquired these genes after the speciation event. The mechanisms driving rearrangements at the plasticity locus are not clear. There are no features reminiscent of ‘pathogenicity islands’ as seen in other Gram-negative bacteria, such as long flanking repeats and associated transposase or recombinase genes. It is likely important that the plasticity zone is close to the predicted termination origin. Perhaps genomic rearrangement is facilitated at this site by stalled replication forks caused by lack of processivity of the chlamydial DNA polymerase enzyme at the termination origin.
Why is MoPn a mouse pathogen and serovar D a human pathogen?
The extraordinary similarity in gene content and order in the C.trachomatis
MoPn and serovar D genomes is surprising given prior reports which suggested that the two biovars exhibited only low to moderate homology by DNA:DNA hybridization and amplified fragment length polymorphism (AFLP) studies (22
). On the one hand, the extraordinary conservation offers encouragement for investigators using the MoPn biovar to model disease caused by human biovars of C.trachomatis
. On the other hand, there are no macroscopic features deduced from comparative genomic analysis to explain the observed differences in host range and pathogenicity between the two C.trachomatis
biovars. Instead, host species tropism might be attributable to a few genes on the chromosomes that influence the ecology of infection within a species. The finding that serovar D contains tryptophan biosynthesis genes not present in MoPn has potential significance. One of the primary host defences against chlamydial infection is the pro-inflammatory cytokine interferon-γ (IFN-γ). IFN-γ modulates the depletion of intracellular tryptophan through induction of indoleamine 2,3-dioxygenase (23
). The trp
genes of serovar D might allow for increased survival inside a tryptophan-depleted human cell, thus producing persistent infection more readily than MoPn.
Serovar D may require persistent infection in order to achieve successful transmission from human to human through density-independent sexual contact, whereas acute high level respiratory infection with MoPn may facilitate aerosol transmission under the density-dependent conditions of a rodent colony (24
). In this regard, the large toxin encoded by the MoPn genome may be an important virulence determinant that promotes acute high level infection and might be the reason why MoPn replicates more readily in vivo
and in vitro
compared to human C.trachomatis
isolates. It is notable that serovar D appears to have accumulated mutations in its copy of the toxin gene that prevent expression of the entire molecule, suggesting that the toxin could be an example of a virulence determinant important in infection of one host (mice) but unnecessary or disadvantageous for pathogenesis in a second host (human). In addition, several of the few genes specific to MoPn but not found in serovar D (guaAB
, adenine deaminase, upp
) are involved in scavenging of nucleotides. This differential capacity for nucleotide metabolism could also contribute to defining the host range of tissues each organism is capable of infecting (discussed later).
Comparison of the C.pneumoniae AR39 and CWL029 genomes
strain sequenced by TIGR (AR39) and the strain (CWL029) reported previously (5
) are the most similar published genomes to date. Comparison of the 1.23 Mb chromosomes by the MUMmer suffix tree analysis method (25
) revealed only 296 SNPs and 21 single base frameshift mutations. There are two small insertions in the AR39 strain (25 and 85 bp) and five insertions in the CWL029 isolate (5, 5, 7, 89 and 305 bp). Previous studies based on AFLP (22
) highlighted close similarities of C.pneumoniae
isolates but suggested that AR39 might be a phylogenetically separated isolate with a sequence difference of 6% from the main set of strains. However, the genome data on these two C.pneumoniae
isolates show that they are close enough to have diverged within recent human history.
The C.pneumoniae AR39 genome data contain 304 polymorphisms (areas where there is a mixture of two variant sequences). Most of these variants are either SNPs or variations in the number of repeated nucleotides. The largest polymorphism by far is the deletion of one unit of a tandem 1649 bp repeat containing a tyrosine transport protein gene and partial ORFs of a tyrosine permease and glucosamine-fructose 6-phosphate aminotransferase. In comparison, the C.trachomatis MoPn sequence has only three polymorphic areas (see Materials and Methods). The reason for the differences in numbers of polymorphisms between the two chlamydiae is unknown.
Having whole genome sequence data for two strains as closely related as the C.pneumoniae
strains provides a unique opportunity to observe the process of mutagenic change. Many of the mutations (including polymorphisms in the AR39 sequence) occur in intergenic regions of the chromosome, suggesting a predominantly neutral phenotypic effect. Comparing the AR39 and the CWL029 genomes, only 161 of 1165 proteins are not identical. By far the majority of mutation events are purine–purine or pyrimidine–pyrimidine transitions (90%), in line with other studies of C.trachomatis omp1
gene polymorphisms (26
). There was no indication of clustering of the SNPs at any particular genomic location.
Given the high degree of similarity between the C.pneumoniae chromosomes, the small differences that are observed become important, as they offer potential targets for strain differentiation assays and for ideas about gene function. One notable change is the apparent loss of a 393 bp iterated segment in AR39 from the large polymorphic outer membrane protein pmp6, showing how cell surface variability could be generated in otherwise very similar bacteria.
One of the most intriguing differences between the two C.pneumoniae
chromosomes is in the area upstream of the uridine kinase gene (Fig. ) where there is a 23 nt sequence in AR39 that is in an inverted orientation relative to the CWL029 genome. Phase variable expression of key virulence determinants involving inversion of promoter DNA mediated by site-specific recombinases is a common feature of Gram-negative bacteria, for example the hin
systems of Salmonella
and Moraxella bovis
, respectively (27
). The likelihood of a recombinase-mediated DNA inversion in the C.pneumoniae
genome is indicated by the fact that the 23 nt flipped segment is flanked by a 15 nt inverted repeat sequence (Fig. ). The 23 nt inverted segment contains a reasonable consensus –10 RNA polymerase binding site (TATAGT; Fig. ), therefore, it is possible that inversion of this piece of DNA may result in switching on or off of expression/transcription of the uridine kinase gene. In CWL029 the putative –10 site is orientated for transcription of the gene; in AR39 it is in the opposite orientation. As uridine kinase is apparently a key enzyme in nucleoside metabolism in C.pneumoniae
(see below), a phase variation system that appears to result in potential lack of expression of the protein is a fascinating observation. Chlamydia pneumoniae
contains two genes specifying homologs to integrase/recombinase enzymes although neither are situated near the uridine kinase gene and both are conserved in the other Chlamydia
genomes. Neither the inverted repeats nor the inverted 23 nt are found anywhere else in any of the Chlamydia
Figure 5 Regions upstream of the uridine kinase gene in C.pneumoniae CWL029 and AR39. A comparison of the sequence between ygeD and uridine kinase of the CWL029 and AR39 strains is shown. Differences between the sequences (asterisks) are limited to the 23 bp (more ...)
The C.pneumoniae AR39 bacteriophage
AR39 genome includes a 4524 nt circular molecule homologous to members of the single-stranded (ss)DNA microviridae class of bacteriophages previously reported in Chlamydia
). The phage genome reported here bears 49% nucleotide sequence identity to the Chp1 phage from an avian strain of Chlamydia psittaci
. We therefore believe that we have sequenced the dsDNA circular intracellular replicative form of an ssDNA C.pneumoniae
bacteriophage present in the sample lysate.
The discovery of a small ssDNA bacteriophage genome during sequencing of the C.pneumoniae
AR39 genome was unexpected. The phage was not found in the otherwise almost identical C.pneumoniae
CWL029 strain sequenced by Kalman et al
). A map of the phage genome is shown in Figure . Three genes encode products homologous to microviridae structural proteins VP1–VP3. The C.pneumoniae
phage also contains sequence homologous to the
X174 gene A nicking/closing protein ORF4. Interestingly, the C.pneumoniae
chromosome contains a truncated version of this gene with 73% identity to the truncated phage gene 4 product, suggesting that at some time in the past the C.pneumoniae
phage was integrated into the chromosome. This gene was seen in both the AR39 and CWL029 genomes. This is the first C.pneumoniae
-infecting bacteriophage described and while it has fundamental organizational similarities with other microviridae, the level of sequence divergence from its nearest neighbour, Chp1, indicates that it is a novel branch of this virus family. The presence of extra phage bands is possibly the reason for the figure of 6% divergence of the AR39 isolate from the main group of C.pneumoniae
strains reported by Meijer et al
) using AFLP.
Figure 6 Comparison of C.pneumoniae AR39 phage with other sequences. The figure details putative ORFs of the 4524 nt ssDNA phage sequence (inner circle) with similarities to previously sequenced proteins. Chp1, C.psittaci Chp1 bacteriophage; Spv, Spiroplasma (more ...)
The presence of a bacteriophage in an obligate intracellular pathogen raises interesting biological questions. For instance, does the phage infect the RB or EB of Chlamydia
? How does the phage transfer between and co-infect new host organisms? Pioneering work on the C.psittaci
virus by Richmond et al
) suggested that the RB is the target for phage replication, with the phage identified as multiple electron-dense particles in the cytoplasmic compartment. While nothing like the crystalline structures noted by Richmond have been described in C.pneumoniae
, it is interesting to speculate that the intracellular and periplasmic particles termed ‘minibodies’ (33
) observed in C.pneumoniae
strains AR39 and TW183 could actually be associated with the virus.
It is possible that the C.pneumoniae
phage may play a role in pathogenesis. Lysis of intracellular C.pneumoniae
could cause the release of cell-activating proteins such as Chlamydia
heat shock protein 60 (35
) or of multiple immunogenic epitopes which could result in an enhanced inflammatory response to pathogenic epitopes such as the MAxxxST motif (36
). Alternatively, lytic phage may reduce the antigenic mass and promote the persistence of C.pneumoniae
by preventing the accumulation of a strong anti-chlamydial immune response, thereby preventing immune-mediated clearance.
As well as its potential importance in C.pneumoniae pathogenesis, the phage has exciting promise as a genetic vector for a bacterium where genetic analysis has so far proved difficult. The finding of a phage gene apparently inserted into the chromosome is encouraging as it suggests that the virus might co-integrate at some frequency. It is also significant that the C.pneumoniae chromosomally located partial ORF4 is found within the plasticity zone; another indication that plasticity zones might have increased susceptibility to uptake of foreign DNA than the rest of the genome.
Comparative genomics of C.trachomatis and C.pneumoniae
Kalman et al
) noted that ~80% of the C.pneumoniae
serovar D predicted coding sequences were orthologs. Unsurprisingly, given the synteny of the C.trachomatis
genomes, the number of shared orthologs between MoPn and C.pneumoniae
is of the same order (854/924). Despite the number of orthologous proteins and their relatively high similarity, as well as the overall homology in genome organization (Fig. ), there is only a relatively low level similarity in the nucleotide sequence of orthologous genes between C.pneumoniae
(81.5%). This argues for conservation for the basic functions necessary for intracellular growth in the chlamydiae despite a long separation of the C.pneumoniae
An overall comparison between the C.pneumoniae
genomes is shown in Figure . The C.pneumoniae
genome is ~0.15 Mb larger than that of C.trachomatis
and contains ~200 genes not found in C.trachomatis
. As detailed by Kalman et al
), most of the ‘extra’ genes found in C.pneumoniae
are either expansions of paralogous families (for instance, there are 21 pmp
outer membrane protein genes in C.pneumoniae
but only nine in C.trachomatis
) or encode hypothetical proteins without current database matches. Many of the additional C.pneumoniae
genes are located in the plasticity zone portion of the genome (Fig. ). Proteins with homologs of known function encoded by C.pneumoniae
but by neither C.trachomatis
MoPn nor serovar D include tryptophan hydroxylase, genes involved in biotin synthesis and uridine monophosphate synthase and uridine kinase. It is interesting that both the MoPn and C.pneumoniae
genomes contained guaAB
and adenosine deaminase homologs whereas these genes are not present in the serovar D genome.
The few C.trachomatis genes without homologs in C.pneumoniae are restricted to the plasticity zone, with the exception of the three apparently inserted genes of MoPn: two DNA helicases and uracil phosphoribosyltransferase (upp). Plasticity zone genes unique to C.trachomatis include those encoding the large toxins, the family of PLD-like proteins lacking signal sequence, the tryptophan biosynthesis cluster of serovar D and several proteins without homologs in other species.
An important result of comparative genome sequencing is the identification of proteins conserved within bacterial species. Table lists the Chlamydia orthologous proteins that have >90% sequence identity over >90% of their length. Presumably sequence conservation at this level when the general level of similarity of orthologs between C.pneumoniae and C.trachomatis is ~65% reflects strong conservative selection on the protein. Most of the proteins in Table are conserved across all bacteria: ribosomal proteins, σ factors and transcriptional elongation factors, for example. Some of the conserved proteins elaborate highly specific structures important in the chlamydial lifestyle, such as SctN and SctV, type III secretion transporters. One of the conserved proteins, encoded by TC0313, is a hypothetical protein without homologs in another organism. Possibly this protein fulfills a unique role in chlamydial virulence or intracellular survival and may be an important subject for studies on pathogenesis and molecular typing.
Highly conserved chlamydial proteinsa
It was recently reported (36
) that the chlamydial 60 kDa cysteine-rich outer membrane protein (omp2
) contains a conserved MAxxxST motif that can induce autoimmune inflammatory heart disease in mice through molecular mimicry with heart muscle α-myosin proteins. Comparative genomic analysis revealed another chlamydial protein with the conserved MAxxxST structure: a homolog of the E.coli
cell division protein FtsH. In common with FtsH found in other bacteria, the chlamydial protein contains ATPase and zinc metalloprotease motifs. However, the FtsH homologs of the Chlamydiae
contain a 400 amino acid N-terminal domain with multiple transmembrane helices not seen in any other organism. It is possible that FtsH plays a unique role in the outer envelope of Chlamydia
and might be recognized by the immune system.
All four Chlamydia
genomes contain highly conserved determinants for a complete type III secretion system spread over three chromosomal regions. The high level of similarity between the proteins in dispersed locations is a further argument for the key role of these systems in survival of the bacterium in the intracellular vacuole. The chlamydial type III systems have homologs to other type III structural, targeting and regulation proteins and chaperones in Yersinia
and Gram-negative plant pathogens (37
) but there are no obvious matches to known type III secreted effector proteins. This situation is common with other type III systems and reflects the versatility and adaptability of these important pathogenesis mechanisms. Identification of the effectors is therefore a critical focus for research. Potential effectors revealed by genomic analysis include MoPn TC0044, which has a conserved serine/threonone kinase motif and is located within type III gene cluster 2. This molecule, when injected into the infected cell via type III secretion, might interfere with intracellular signaling in a manner beneficial to the parasitic Chlamydia
. Other possible effectors are encoded by TC0042, TC0867 and TC0868, which are also situated close to the type III gene clusters and have low level similarity to other proteins such as E.coli
EspB, Salmonella typhimurium
SspB and SspC and the plant pathogen Pseudomona aeruginosa
Harpin HrpO (37
Different strategies for nucleotide salvaging among different strains of Chlamydia
One of the interesting insights to emerge from comparative chlamydial genome sequencing is the different pathways used by the four chlamydial strains for acquiring nucleotides. During the course of evolution toward an obligate intracellular lifestyle, Chlamydia
spp. appear to have abandoned much of their genome necessary for self-sustaining existence (2
). A vital set of genes missing from the chlamydial genomes are those necessary for de novo
synthesis and/or salvage of three of the four ribonucleotides, making them dependent on import of nucleotides from the host. Genome sequence analysis indicates that all four Chlamydia
contain a CTP synthetase which converts UTP to CTP. All four Chlamydia
genomes also encode two proteins known to be dedicated to the transport of nucleotide triphosphates, Tlc1 and Tlc2, both homologs of Tlc, an ATP/ADP translocase from the obligate intracellular parasite Rickettsia prowazakii
). The Tlc1 protein of C.trachomatis
serovar L2 is an ATP/ADP tranlocase, whereas the Tlc2 protein, although sharing a high degree of sequence similarity, is a more general NTP transporter, apparently utilizing an H+
pump to energize the process (39
). These differences are likely true for the four completely sequenced chlamydial genomes.
As discussed earlier, four of the genes present in the C.trachomatis MoPn but not serovar D genomes are involved in nucleoside/nucleobase anabolism. The guaAB–add operon should allow for conversion of ATP to GTP, while upp facilitates biosynthesis of UTP from uracil. MoPn upp is an interesting gene. It is apparently inserted into the genome between loci that are adjacent in both the serovar D and C.pneumoniae genomes. The protein has most identity (56%) with upp gene products from Gram-positive bacteria but also contains an ~100 amino acid N-terminus without database homology, suggesting that the molecule has a second function. Although C.pneumoniae contains guaAB–add, these proteins are probably not expressed due to frameshift mutations. Like MoPn, C.pneumoniae can also synthesize UTP, however, uridine kinase is the salvage enzyme employed in C.pneumoniae, rather than upp, which is employed in MoPn. An interesting finding in the C.pneumoniae genome arises from the observation that uridine kinase may undergo phase variable expression (Fig. ), hinting that either there is some other novel determinant in the genome that provides for UTP synthesis or that, under certain conditions, the cell can import UTP directly. The human C.trachomatis serovar D does not appear to contain additional nucleotide biosynthesis genes.
Based on these observations regarding nucleotide metabolism, we speculate that the MoPn biovar is the least dependent on its host cell in its requirement for ATP to initiate purine biosynthesis and uracil for pyrimidine biosynthesis. Chlamydia pneumoniae appears dependent only on scavenging uridine for pyrimidine anabolism. Serovar D appears dependent on the host cell for three of the four ribonucleotides. The key difference in the different chlamydial strains could lie in the independent evolution of the substrate specificity of the Tlc2 transporter, possibly a simple ATP transporter in MoPn, an ATP/GTP transporter in C.pneumoniae and a more general NTP importer in serovar D.
With regard to deoxyribonucleotide biosynthesis, all chlamydiae contain a ribonucleotide diphosphate reductase for the generation of dNDPs. Ribonucleotide reductase directly provides dATP, dGTP and dCTP, but not dTTP. Interestingly, none of the chlamydial genomes contain homologs of either thymidylate synthase or thymidine kinase, the only two enzymes known to be capable of dTMP biosynthesis. Previous studies have shown that Chlamydia
cannot obtain thymidine nucleotides from the host (40
), therefore, it still remains unclear as to how Chlamydia
obtain the dTTP required for DNA synthesis.