|Home | About | Journals | Submit | Contact Us | Français|
The gram-negative plant-pathogenic bacterium Xanthomonas campestris pv. vesicatoria is the causative agent of bacterial spot disease in pepper and tomato plants, which leads to economically important yield losses. This pathosystem has become a well-established model for studying bacterial infection strategies. Here, we present the whole-genome sequence of the pepper-pathogenic Xanthomonas campestris pv. vesicatoria strain 85-10, which comprises a 5.17-Mb circular chromosome and four plasmids. The genome has a high G+C content (64.75%) and signatures of extensive genome plasticity. Whole-genome comparisons revealed a gene order similar to both Xanthomonas axonopodis pv. citri and Xanthomonas campestris pv. campestris and a structure completely different from Xanthomonas oryzae pv. oryzae. A total of 548 coding sequences (12.2%) are unique to X. campestris pv. vesicatoria. In addition to a type III secretion system, which is essential for pathogenicity, the genome of strain 85-10 encodes all other types of protein secretion systems described so far in gram-negative bacteria. Remarkably, one of the putative type IV secretion systems encoded on the largest plasmid is similar to the Icm/Dot systems of the human pathogens Legionella pneumophila and Coxiella burnetii. Comparisons with other completely sequenced plant pathogens predicted six novel type III effector proteins and several other virulence factors, including adhesins, cell wall-degrading enzymes, and extracellular polysaccharides.
Xanthomonas campestris pv. vesicatoria (also designated Xanthomonas axonopodis pv. vesicatoria  or Xanthomonas euvesicatoria ) is a gram-negative, rod-shaped γ-proteobacterium with a high genomic G+C content. Members of the genus Xanthomonas represent an omnipresent group of plant-pathogenic bacteria which infect most economically important crop plants and cause a broad variety of diseases (54). X.campestris pv. vesicatoria, the causative agent of bacterial spot disease on pepper (Capsicum spp.) and tomato (Lycopersicon spp.) plants, enters the plant tissue through stomata and wounds. Bacterial colonization of plant intercellular spaces is locally restricted and induces macroscopically visible disease symptoms, so-called water-soaked lesions that later become necrotic (91). The disease results in defoliation and severely spotted fruits, both of which cause massive yield losses. Bacterial spot disease occurs worldwide but is most pernicious in regions with a warm and humid climate.
Pathogenicity of X. campestris pv. vesicatoria depends on a type III protein secretion system (TTSS) (11, 17), which is highly conserved among plant and animal pathogenic bacteria (24, 97). In X. campestris pv. vesicatoria, the TTSS is encoded by the chromosomal hrp gene cluster (hypersensitive response and pathogenicity) (11) and translocates effector proteins intothe plant cell (96). Once inside the plant cytoplasm, the effectors modulate host cell processes, such as suppression of the plant basal defense mechanisms, for the benefit of the pathogen (1, 13). Some effectors, termed avirulence proteins, are recognized by the host plant. This leads to the induction of the plant defense, including a rapid, locally restricted cell death reaction, the hypersensitive response, which ultimately arrests bacterial growth (1). So far, only a small number of type III effectors have been identified in X. campestris pv. vesicatoria, and their molecular functions are widely unknown (17, 19, 70, 82).
The expression of the Hrp TTSS is controlled in X. campestris pv. vesicatoria in planta by two key regulatory proteins, HrpG and HrpX (107, 109). The OmpR family regulator HrpG is activated by an unknown mechanism and controls the expression of a genome-wide regulon, including hrp and effector genes and several TTSS-unrelated genes (71). Many, but not all, HrpG-regulated genes are also regulated by HrpX, an AraC-type regulator which is controlled by HrpG. Since some HrpX-regulated genes contain a conserved sequence motif in their promoter region (plant-inducible promoter [PIP] box; TTCGC-N15-TTCGC), it is speculated that HrpX controls their expression by binding to this promoter element (35, 107).
Recently, genome analyses and comparative genomics have been used to identify novel virulence factors. Genome sequencing of the phytopathogenic bacteria Xanthomonas axonopodis pv. citri, Xanthomonas campestris pv. campestris (26), Xanthomonas oryzae pv. oryzae (53), Pseudomonas syringae pv. tomato DC3000 (14), Erwinia carotovora subsp. atroseptica (8), and Ralstonia solanacearum (87) proved to be a milestone for the identification of putative effectors and other candidate pathogenicity factors by bioinformatic approaches (19). All of these pathogens have different lifestyles and host specificities. For example, X. axonopodis pv. citri causes citrus canker, whereas X. campestris pv. campestris is the causative agent of black rot, a systemic disease on a wide range of crucifers. Xanthomonas oryzae pv. oryzae causes bacterial blight on rice, which leads to massive yield losses. R. solanacearum, which belongs to the β-proteobacteria, has an unusually wide host range and, as X. campestris pv. vesicatoria does, also infects tomato. The identification of the complete repertoire of virulence factors of these bacteria and their biological functions is a prerequisite to understanding the pathogen-plant interaction, e.g., different host range and infection strategies.
Here, we describe the complete genome sequence of X.campestris pv. vesicatoria strain 85-10, which is pathogenic for pepper but not for tomato plants (pepper race 2) (20, 63). Strain 85-10 was chosen because it is a well-established model for bacterium-plant interactions. For instance, type III secretion and several avirulence genes have been studied in this strain in great detail (17, 19). By comparison with other plant-pathogenic bacteria, we have gained novel insights into the genome evolution of xanthomonads and detected unique features of strain 85-10, such as its specific set of candidate virulence determinants.
Genomic DNA from X. campestris pv. vesicatoria 85-10 (20) was extracted from cells grown at 30°C in NYG liquid cultures (25) using the Genomic-tip 100/G kit (QIAGEN GmbH, Hilden, Germany). DNA shotgun libraries with insert sizes of 1 kb and 2 to 3 kb were constructed in pGEM-T (Promega GmbH, Mannheim, Germany), and 8-kb fragments were cloned into pTrueBlue-rop (MoBiTec GmbH, Göttingen, Germany) by MWG Biotech AG (Ebersberg, Germany). Plasmid clones were end sequenced on ABI 3700 sequencers (PE Applied Biosystems) by MWG Biotech AG. Base calling was done using TraceTuner (PE Applied Biosystems). High-quality reads were defined by a minimal length of 250 bp, with an averaging quality value of ≥20 in a sliding window of 30 bp. Finally, 70,042 high-quality reads, a total of 49,029 (5.66 X), 14,183 (1.64 X), and 6,830 (0.79 X) end sequences (X's indicate genome equivalents) from libraries with 1-kb, 2- to 3-kb, and 8-kb inserts, respectively, were established.
Base calling, quality control, and elimination of vector DNA sequences of the sequences were performed using the software package BioMake (Bielefeld University, unpublished) as described previously (47). Sequences were assembled using the PHRAP assembly tool (www.phrap.org), resulting in 139 contigs. For finishing of the genome sequence, the Consed/Autofinish software package (38, 39) was used, supplemented by the in-house tool BACCardI (5).
For gap closure and assembly validation, a fosmid library with inserts of approximately 35 to 38 kb was constructed using the EpiFOS fosmid library production kit (Epicenter, Madison, WI). A total of 768 fosmid clones were end sequenced on ABI 3730xl and ABI 377 sequencers by the Max Planck Institute for Developmental Biology (Tübingen, Germany) and IIT GmbH (Bielefeld, Germany), respectively. Remaining gaps were closed by sequencing on shotgun and fosmid clones carried out by IIT GmbH on LI-COR 4200L (LI-COR Inc., Lincoln, NE) and ABI 377 sequencers. Additionally, gaps were closed on fosmids using an ABI 3730xl DNA analyzer by the Max Planck Institute for Developmental Biology. To obtain a high-quality genome sequence, all regions of the consensus sequence were polished to at least phred40 quality by primer walking. Collectively, 1,731 sequencing reads were added to the primary shotgun assembly for the finishing and polishing of the genomic sequence. Repetitive elements, i.e., insertion sequence (IS) elements and rRNA operons, were sequenced completely by primer walking on fosmids and, in some cases, on shotgun clones. For assembly validation, fosmid end sequences were mapped onto the genome sequence employing BACCardI (5).
The genome was annotated using GenDB 2.0 (62). Briefly, a combined gene prediction strategy (58) was applied on the assembled sequences using Glimmer (88) and CRITICA (4). Putative ribosomal binding sites and tRNA genes were identified with RBSfinder (94) and tRNAscan-SE (57), respectively. Prior to the manual annotation of each predicted gene, an automatic functional annotation was computed based on different analyses. Similarity searches were performed against different databases, including SWISS-PROT and TrEMBL (9), KEGG (48), Pfam (6), TIGRFAM (41), and InterPro (65). Additionally, SignalP (67), helix-turn-helix (32), and TMHMM (52) were applied. Finally, each gene was functionally classified by assigning a clusters of orthologous groups (COG) number and corresponding COG category (98) and gene ontology numbers (42). Putative membrane transporters were compared against the transporter classification (TC) database for functional assignment (16). Potential substrates of the twin-arginine translocation (TAT) secretion pathway were predicted using TATFind 1.2(30).
For comparative analyses, the annotated genome sequences of the following bacteria were imported into GenDB: X. campestris pv. campestris ATCC 33913 (GenBank accession no. AE008922) (26), X. axonopodis pv. citri 306 and plasmids pXAC33 and pXAC64 (GenBank accession nos. AE008923, AE008924, and AE008925, respectively) (26), X. oryzae pv. oryzae KACC10331 (GenBank accession no. AE013598) (53), Erwinia carotovora subsp. atroseptica SCRI1043 (GenBank accession no. BX950851) (8), Xylella fastidiosa 9a5c (GenBank accession no. AE003849) (90), Pseudomonas syringae pv. tomato DC3000 (GenBank accession nos. AE016853 to AE016855) (14), and Ralstonia solanacearum GMI1000 (GenBank accession nos. AL646052 and AL646053) (87). Homology searches were conducted on the nucleotide and amino acid sequence level using BLAST (2). Comparisons of chromosomal sequences were carried out with the Artemis Comparison Tool (ACT) (www.sanger.ac.uk /Software/ACT).
Genomic regions with atypical G+C content were identified using a “sliding window” technique with a window size of 2,000 bp and a step size of 1,000 bp. For this, the G+C content was treated as a Gaussian distribution, and regions with differences of at least 1.5 standard deviations from the mean were calculated.
The nucleotide sequences of the chromosome of X.campestris pv. vesicatoria strain 85-10 and its four plasmids, pXCV2, pXCV19, pXCV38, and pXCV183, were submitted to GenBank (accession numbers: AM039948, AM039949, AM039950, AM039951, and AM039952). Sequence files giving more annotation details are available in the supplementary material at https://www.genetik.uni-bielefeld.de/GenoMik/genomeproj.html.
The X. campestris pv. vesicatoria strain 85-10 genome sequence was established by employing a whole-genome shotgun approach. In the final consensus sequence, each base matched at least phred40 quality. The assembly of the high-quality sequence was validated by a complete fosmid map (Fig. (Fig.11).
X. campestris pv. vesicatoria contains a single, circular chromosome of 5,178,466 bp (Fig. (Fig.1)1) and four extrachromosomal elements (Table (Table1).1). While the presence of plasmids pXCV2 (1,852 bp), pXCV38 (38,116 bp), and pXCV183 (182,572 bp) was known previously (unpublished data), the genome project revealed the existence of a fourth plasmid, which was designated pXCV19 (19,146 bp) (Table (Table11).
The G+C content averages 64.75% for the chromosome and varies between 56.59% and 60.73% for the plasmids. The origin of replication is clearly detectable by a bias of G toward the leading strand (GC skew) (76). The dnaA gene is located in this region, and, therefore, its start codon has been defined as the zero point of the chromosome (Fig. (Fig.1).1). The GC skew analysis of the chromosome indicates a bidirectional replication mechanism, but the chromosome is not clearly divided into two equal replichores (Fig. (Fig.1A).1A). The location of the predicted replication terminus appears to be skewed from the 180°position opposite to oriC, resulting in an approximately 370-kb size difference between the replichores.
X. campestris pv. vesicatoria contains a total of 4,726 predicted coding sequences (CDS) (Table (Table1).1). The chromosome shows an average coding capacity of 87.13%, which is typical for most bacteria. There is no significant asymmetry in the distribution of CDS on the chromosome between the leading strand (2,223 CDS; 49.5%) and the lagging strand (2,264 CDS; 50.5%). Based on the manual annotation, biological roles were assigned to 3,080 of the 4,726 CDS. The remaining 1,646 CDS comprise 697 conserved hypothetical CDS and 949 CDS of unknown function (Fig. (Fig.1A;1A; Table Table11).
X. campestris pv. vesicatoria contains two rRNA operons organized in the order 16S-23S-5S and located in a region of approximately 500 kb (between 4,600,000 bp and 5,100,000 bp) on the left replichore (Fig. (Fig.1B).1B). Altogether, 56 genes for tRNAs representing all 20 amino acids were identified. A total of 54 tRNA genes are located on the chromosome (Fig. (Fig.1B),1B), whereas two genes for tRNAs both recognizing the AUA codon are exclusively carried on the plasmids pXCV19 and pXCV183 (Table (Table11).
The genome of X. campestris pv. vesicatoria contains 66 IS elements. A total of58 IS elements are located on the chromosome, and 8 are located on the four plasmids (Table (Table1;1; see supplementary Table Table11 at https://www.genetik.uni-bielefeld.de/GenoMik/genomeproj.html). A total of 48 elements belong to the IS3 family, 10 elements belong to the IS5 family, and 4 elements belong to the IS6 family. Four IS elements could not be grouped with a known family. In X. campestris pv. vesicatoria, the IS3 family member IS476/IS1477 (20 copies) is the most abundant element. IS476 has been characterized in X. campestris pv. vesicatoria and shown to inactivate the avirulence gene avrBs1 (50). The IS3 family is also highly abundant in X. axonopodis pv. citri, whereas in X. campestris pv. campestris, most IS elements belong to the IS3 and IS5 families.
An indication of high genomic plasticity is the presence of atypical DNA regions, for example, regions that differ in their G+C contents from the average of the genome. Most likely, these regions were acquired by horizontal gene transfer (31). The chromosomal CDS range in their G+C contents from 40.1% to 75.1%, with an average of 65.1%. More than 85% of the CDS have a G+C content between 60% and 70%. Analysis of the 30 CDS with the lowest G+C contents (40.1% to 50.4%) revealed that 50% are ORFans, i.e., genes that are restricted to a particular genome and that possess no known homologs. With one exception, these ORFans are short (average length, 475 bp). A total of 21 ORFans are found in the vicinity of IS elements. In contrast, there are no ORFans among the 30 CDS with the highest G+C contents (72.3% to 75.1%). The small size and low G+C content of the ORFans in X. campestris pv. vesicatoria are similar to observations made in Escherichia coli (28).
To identify alien nucleotide sequences, the chromosome was analyzed for regions with significant deviations from the genomic mean in G+C content (see Materials and Methods). These regions vary in size between 2 kb and 26 kb and contain 50 out of 58 IS elements and 15 tRNA genes. The presence of IS elements and tRNA genes is typical for genomic islands (31). Interestingly, the two largest regions (3,870,000 to 3,899,000 bp and 3,190,000 to 3,215,000 bp) carry parts of a type IV secretion system (virB6 or virB9, virB8, and virD4) and components of the type 4 pilus (i.e., pilE and fimT). The location of parts of the type IV apparatus within a genomic region of atypical G+C content, an indicator for horizontal gene transfer, has also been observed for Wolinella succinogenes, a member of the -proteobacteria (3). Interestingly, another ~15-kb atypical region carries the complete xanthan biosynthesis cluster gumA to gumM and is flanked on one side by IS1477. An ~14-kb region harboring genes with homology to the filamentous phage Lf (~2,784,000 to 2,798,000 bp) could also be identified by this approach. Additionally, smaller atypical regions encode putative virulence determinants, such as type III effector proteins and adhesins (see below).
In summary, features such as the high number of mobile elements and deviations in G+C content suggest a highly flexible genome. This is advantageous for pathogen evolution driven by the need for continuous adaptation to the host in order to evade or suppress coevolving host defense mechanisms. This idea is corroborated by the presence of a large number of virulence determinants (see below).
Previously, the complete genomic sequences of three different xanthomonads, X. axonopodis pv. citri, X. campestris pv. campestris, and X.oryzae pv. oryzae, causal agents of citrus canker, the systemic black rot disease of crucifers, and rice blight, respectively, were determined (26, 53). All four xanthomonads have similar chromosome sizes; however, they differ in their plasmid contents. X.campestris pv. vesicatoria harbors four plasmids, and X. axonopodis pv. citri carries two plasmids, whereas X.campestris pv. campestris and X. oryzae pv. oryzae lack any plasmids. A comparison of the chromosomal sequences of the four xanthomonads revealed that X. campestris pv. vesicatoria is most closely related to X. axonopodis pv. citri (Fig. (Fig.1A;1A; Fig. Fig.2).2). Since the genome structure of X. oryzae pv. oryzae is completely different from the three other xanthomonads (Fig. (Fig.2B),2B), we focused our comparative analyses of the genomic organization on X. axonopodis pv. citri and X. campestris pv. campestris. Most genes are syntenic to each other among these three xanthomonads; however, striking differences are found between the flagellar gene cluster and the replication terminus of the chromosomes (Fig. (Fig.1A;1A; Fig. Fig.2A).2A). Integration events in this region are probably the reason for the asymmetric replichores (see above).
The DNA sequences of X. campestris pv. vesicatoria and X.campestris pv. campestris chromosomes differ in the locations of some large gene clusters (Fig. (Fig.2A).2A). For instance, two gene clusters coding for different type II secretion systems and their flanking regions are colinear between X. campestris pv. vesicatoria and X. axonopodis pv. citri, whereas their positions are exchanged in the chromosome of X. campestris pv. campestris compared to the other two xanthomonads (Fig. (Fig.2A).2A). The same is true for the chromosomal location of the hrp gene cluster, which is found at the same position in X. campestris pv. vesicatoria and X. axonopodis pv. citri but at a different location in X. campestris pv. campestris (Fig. (Fig.2A).2A). Thus, the chromosomes of X. campestris pv. vesicatoria and X. axonopodis pv. citri are not completely colinear with X. campestris pv. campestris (Fig. (Fig.2A2A).
Using a cutoff E value of 10−30, comparisons of the predicted protein sequences of the four xanthomonads revealed that 2,999 CDS (66.8%) of the X. campestris pv. vesicatoria chromosome are orthologs of genes in X. axonopodis pv. citri, X. campestris pv. campestris, and X.oryzae pv. oryzae, thus representing the conserved chromosomal backbone of these four species. A total of 184 CDS (4.1%), 87 CDS (1.9%), and 45 CDS (1.0%) were considered orthologs of genes present in only X. axonopodis pv. citri, X. campestris pv. campestris, and X.oryzae pv. oryzae, respectively. A total of 548 CDS (12.2%) are unique to X. campestris pv. vesicatoria, most of which encode hypothetical and conserved hypothetical proteins.
We also compared X. campestris pv. vesicatoria with the following other plant-pathogenic bacteria: X. fastidiosa, P. syringae pv. tomato, R. solanacearum and E. carotovora subsp. atroseptica. Because of the low overall DNA sequence identity, we limited the comparisons to predicted protein sequences. Using the same cutoff E value as above, there are 1,714 CDS (38.2%) in X. campestris pv. vesicatoria which are homologous to predicted proteins of X. fastidiosa, 1,780 CDS (39.7%) of P.syringae pv. tomato, 1,600 CDS (35.7%) of R. solanacearum (chromosome and megaplasmid), and 1,434 CDS (32.0%) of E.carotovora subsp. atroseptica.
Many Xanthomonas strains carry plasmids, but little is known about their function and relevance for pathogenicity (102). X.campestris pv. vesicatoria strain 85-10 harbors four different plasmids (Table (Table1).1). The smallest plasmid, pXCV2, is basically identical to pXV64 of X. campestris pv. vesicatoria strain Xv64 (106). The presence of plasmid pXCV19 was detected only due to the efforts of the genome sequencing project and contains almost exclusively genes necessary for plasmid partitioning and conjugation. pXCV38 shows similarities to the X. axonopodis pv. citri plasmid pXAC33; however, pXCV38 lacks the type III effector genes pthA1 and pthA2. Instead, pXCV38 contains nine genes encoding a putative type IV secretion system of the Vir/Tra type (XCVc0028 to XCVc0033, XCVc0035, XCVc0041, and XCVc0042) which probably serve for conjugal transfer.
Intriguingly, the largest plasmid in strain 85-10, pXCV183, encodes a putative type IV secretion system which is most similar to the Icm/Dot system of human pathogens (21). We could identify 14 of the 22 components found in Legionella pneumophila and Coxiella burnetii (21, 111) (see supplementary Table Table22 at https://www.genetik.uni-bielefeld.de/GenoMik/genomeproj.html). We alsofound candidate genes which could fulfill the function of the missing components. This is the first report of a putative Icm/Dot-like type IV secretion system in a plant-pathogenic bacterium. The essential role of Icm/Dot type IV secretion for the virulence of Coxiella and Legionella species (21, 111) suggests that this system might contribute to the virulence of Xanthomonas.
One interesting question concerns the segregational stability of the plasmids, which appears to be maintained by different mechanisms. pXCV183 contains a putative postsegregational killing system (Fig. (Fig.3)3) of the ζ toxin (XCVd0099) antitoxin (XCVd0100) type (59), which is also present on pXAC64. Additionally, the presence of an isoleucin tRNA gene on plasmids pXCV183 and pXCV19, which is missing in the chromosome, might contribute to plasmid stability. This feature is unique for X. campestris pv. vesicatoria relative to X. axonopodis pv. citri.
All basic genes for information storage and processing (i.e., DNA replication, recombination, repair, transcription, and translation) are present. Interestingly, there are three genes encoding histone-like nuclear structuring proteins (H-NS). Mutations in these genes often have pleiotropic effects, and a regulatory function of these genes in virulence has been implicated (80).
Most of the transcriptional regulators in X. campestris pv. vesicatoria belong to one of the following families: LysR (40genes), AraC (18 genes), TetR (17 genes), LacI (10 genes), and MarR (10 genes). In addition, X. campestris pv. vesicatoria contains several putative two-component signal transduction systems. Approximately half of the 68 response regulators possess a predicted DNA binding motif as an output domain, while the rest contain other output domains, such as diguanylate cyclase (GGDEF domain), cyclic diguanylate (c-diGMP) phosphodiesterase (EAL domain), metal-dependent phosphohydrolase (HD domain), and CheB-like methylesterase. Altogether, 29 GGDEF and 14 EAL domains were found. These protein domains are responsible for the synthesis and turnover of the secondary messenger c-diGMP. Originally, this signal molecule was identified as an allosteric regulator of the cellulose synthase of Glucoacetobacter xylinum (86). In several bacteria, cellulose serves as an extracellular matrix involved in biofilm formation (84). Interestingly, X. campestris pv. vesicatoria also possesses a cellulose synthase operon (XCV3640 to XCV3644) with a c-diGMP binding subunit (XCV3642). However, this catalytic subunit is probably not functional due to an internal stop codon (XCV3643 to XCV3644).
Although histidine kinases and methyl-accepting proteins (see below) are the main sensors of extracellular signals, they are by no means the only ones present in X. campestris pv. vesicatoria. As known for most bacteria, PAS (31 proteins) and GAF (17 proteins) domains are the most common cytoplasmic signaling domains found in X. campestris pv. vesicatoria (99).
Finally, the genome of X. campestris pv. vesicatoria encodes 15 RNA polymerase σ factors (69), including 1 primary σ factor (RpoD), 1 heat-shock factor (RpoH), 1 flagellar-specific factor (FliA), 10 alternate extracytoplasmic function-type factors, and 2 σ54 family factors (RpoN). Interestingly, one of the alternate sigma factors (XCV1276) carries a C-terminal extension of approximately 200 amino acids which is unique to X.campestris pv. vesicatoria, X. campestris pv. campestris, and X.oryzae pv. oryzae.
The high number and modular diversity of regulatory proteins relative to those of other bacteria, e.g., E. coli (36), suggest that xanthomonads are able to cope with complex and changing environmental conditions. Future studies will reveal whether any of these proteins are components of the regulatory cascade which controls expression of virulence factors like the TTSS.
Quorum sensing is a mechanism by which bacteria regulate the expression of certain genes in response to their population density (103). Only a few xanthomonads appear to produce the typical autoinducer molecule l-homoserine lactone, and there is no indication that X.campestris pv. vesicatoria does so (22). Two different alternative autoinducers have been well studied in X. campestris pv. campestris. The diffusible signal factor (DSF), an α,β-unsaturated fatty acid (105), is involved in the regulation of the synthesis of extracellular enzymes, exopolysaccharides, and cyclic glucans. Eight genes (rpfA to H) which are involved in DSF production and perception are also present in X. campestris pv. vesicatoria, thus providing circumstantial evidence for the existence of a DSF regulatory system in X. campestris pv. vesicatoria. A second diffusible factor (DF), a butyrolactone, is involved in the regulation of pigment (xanthomonadin) and exopolysaccharide synthesis in X. campestris pv. campestris (77). Next to the xanthomonadin biosynthesis gene cluster in X. campestris pv. vesicatoria is an operon that might correspond to the pigB locus of X. campestris pv. campestris, which has been implicated in the synthesis of DF (77). Both quorum-sensing systems participate in the regulation of exopolysaccharide, an important virulence factor in X. campestris pv. campestris (75, 78). Based on the conserved gene content, these systems are probably functional in X. campestris pv. vesicatoria.
X. campestris pv. vesicatoria carries a single unipolar flagellum. We identified a 147-kb region in the chromosome (116 genes, XCV1929 to XCV2044) which is almost exclusively devoted to chemotaxis and flagellar biosynthesis. The most interesting feature of this large chromosomal region is the presence of 14 tandemly repeated genes encoding methyl-accepting chemotaxis proteins. Similar gene clusters were also found inX.axonopodis pv. citri (10 mcp genes), X. campestris pv. campestris (10 mcp genes), and X. oryzae pv. oryzae (7 mcp genes). Intriguingly, this region appears to be highly dynamic since its size and gene order differ among the four Xanthomonas strains analyzed. The following other chemotaxis-related genes are also present in several copies which are dispersed in the chromosome: 10 more mcp genes, 3 cheA homologs, 2 cheB homologs, 4 cheR homologs, and 6 cheW homologs. It will be interesting to elucidate which environmental stimuli are sensed by this complex chemotactic system.
Bacterial motility is not limited to swimming. Type 4 pili enable movement by retraction and mediate bacterial adhesion to plant tissue. Immunofluorescence studies showed that purified type 4 pili of X. campestris pv. hyacinthi attached to stomata of hyacinth leaves, suggesting a role for these surface structures in the first stages of yellow disease (100). In X.campestris pv. vesicatoria strain 85-10, several operons are predicted to encode components of type 4 pili. Although type 4 pilins are very diverse in their primary structure, they can be identified by the presence of a prepilin peptidase-processing site followed by a transmembrane α-helix. Using this criterion, 15 type 4-related pilin genes were found, 8 of which are part of the two different type II protein secretion systems (see below). So far, the function of type 4 pili in pathogenicity of X. campestris pv. vesicatoria is not clear (74).
Besides type 4 pili, several other surface structures might be implicated in the adherence of gram-negative bacterial pathogens. X. campestris pv. vesicatoria encodes several proteins which could act as adhesins (XCV1860, XCV1861, XCV2103, XCV3670, XCV3672, XCV4203, and XCV4444), such as homologs of the Yersinia proteins YapH and YadA (43), HecA from Erwinia chrysanthemi (83), and the filamentous hemagglutinin from Bordetella pertussis (56). In E. chrysanthemi, HecA is encoded next to the TTSS cluster and plays a role in virulence (51). Homologs of hecA are also present in the genomes of Xylella spp. and R. solanacearum (83). The nonfimbrial adhesin YadA forms a sheath-like structure on the surface of Yersinia and mediates adherence to epithelial cells and autoagglutination (43). Homologs of YadA have been found in several plant and animal pathogenic bacteria (73), e.g., in X.oryzae pv. oryzae, where XadA was shown to be an important virulence factor (79).
Gram-negative bacteria exhibit complex sets of polysaccharides on their surfaces, which often contribute to their pathogenic interactions with plant and animal cells (29). The typical exopolysaccharide of the genus Xanthomonas is xanthan, the structure and biosynthesis of which have been well studied in X. campestris pv. campestris (7). The gum gene cluster which controls xanthan biosynthesis in X. campestris pv. campestris isconserved in X. campestris pv. vesicatoria (XCV2776 to XCV2789). Mutant studies in X. campestris pv. campestris demonstrated that xanthan gum is not essential for pathogenicity but contributes to bacterial aggressiveness against the host (49).
Lipopolysaccharides from Xanthomonas play a role as elicitors of plant defense reactions (61, 66). Lipopolysaccharide biosynthesis in X. campestris pv. vesicatoria strain 85-10 appears to follow the ABC transporter-dependent pathway (encoded by wzm and wzt). Next to the genes for the ABC transporter and the lipopolysaccharide core biosynthesis (rmd and gmd), several genes for O-antigen synthesis (wxc) are predicted. Interestingly, the wxc genes have counterparts in X.campestris pv. campestris but not in X. axonopodis pv. citri or X.oryzae pv. oryzae. However, all of the glycosyltransferase genes required for O-antigen modification in X. campestris pv. campestris (104) are missing in X. campestris pv. vesicatoria. Instead, two deviant glycosyltransferase genes (wbdA1 and wbdA2) were found within this gene cluster, and they encode homologs of mannosyltransferases (see supplementary Table Table33 at https://www.genetik.uni-bielefeld.de/GenoMik/genomeproj.html). Thus, it seems that the O-antigen of X. campestris pv. vesicatoria is basically a polymannan, in contrast to the xylosylated polyrhamnan of X. campestris pv. campestris (64).
Surprisingly, a novel gene cluster of 11 genes (XCV1723 to XCV1733), which encodes a putative polysaccharide polymerase and a chain length regulator, both characteristic for heteropolymer polysaccharide biosynthesis, was identified. In addition, this cluster harbors three predicted glycosyltransferases and enzymes modifying sugar nucleotides or transferring amino or methyl groups. Homologs of this gene cluster were also found in the genomes of X. axonopodis pv. citri, X. campestris pv. campestris, and X. oryzae pv. oryzae. It is not clear which carbohydrate is produced by this novel gene cluster.
Candidate transport proteins have been classified according to the TC database (16). Most transport systems belong to the class of secondary transporters, such as proteins of the major facilitator superfamily (TC 2.A.1; 38 proteins) and the resistance-nodulation-cell division superfamily (TC 2.A.6; 15 proteins). Some secondary transporters are encoded in the close vicinity of accessory components (i.e., membrane fusion proteins, TC 8.A.1) which might be important for efflux of drugs, heavy metal cations, oligosaccharides, or proteins. Altogether there are 17 complete secondary transporter-type export systems, 10 of which also contain an outer membrane factor (TC 1.B.17). Often, secondary transporters lead to multidrug resistance (68). For Erwinia amylovora, it was shown that multidrug efflux contributes to virulence (15). Predominance of secondary transporters is in accordance with the fact that X. campestris pv. vesicatoria has an aerobic lifestyle.
Additionally, genes for 28 primary transporters (traffic ATPases; TP 3.A.1) were identified. Eight of these are associated with a gene encoding a periplasmic binding protein, indicating that these systems are responsible for import. One traffic ATPase (XCV1571 and XCV1573) is encoded next to genes for a membrane fusion protein (XCV1570) and an outer membrane factor (XCV1569). We believe that this gene cluster encodes a type I protein secretion system (see below) which exports several predicted extracellular proteases and peptidases.
Bacterial secretion systems are important for the interaction of pathogens with the host (18). X. campestris pv. vesicatoria contains genes for all known protein transport systems in gram-negative bacteria, namely the Sec, signal recognition particle, and TAT pathways; at least one putative type I, two type II, one type III, and two putative type IV secretion systems of different types; four type V autotransporters; and two two-partner secretion systems (Table (Table2;2; supplementary Table Table3).3). The presence of all known secretion systems was highlighted before for only the plant pathogens E. carotovora subsp. atroseptica (8) and R. solanacearum (37).
The Sec pathway is important for the export of many proteins into the periplasmic space (Table (Table2;2; see supplementary Table Table3).3). The TAT system (tatA-tatC) offers an alternative route to the periplasm for folded proteins (81), but so far the function of this pathway has not been elucidated for Xanthomonas. Predicted candidate substrates of the TAT pathway are 66 proteins (Table (Table2;2; see supplementary Table Table44 at https://www.genetik.uni-bielefeld.de/GenoMik/genomeproj.html), e.g., putative xylosidases and cellulases.
Type II secretion systems of pathogenic bacteria secrete a vast number of proteins, including possible virulence determinants, into plant intercellular space (89). X. campestris pv. vesicatoria, like X. axonopodis pv. citri and X. campestris pv. campestris (26), harbors two type II secretion systems, whereas X. oryzae pv. oryzae encodes only one (Fig. (Fig.2).2). Remarkably, the gene for a prepilin leader peptidase is lacking. Instead, this task might be fulfilled by the type 4 pilus assembly leader peptidase (XCV3355) (45). In X. campestris pv. vesicatoria strain 85-10, likely substrates include cellulases (nine candidates), β-glucosidases (five candidates), pectate lyases (four candidates), polygalacturonases (three candidates), and xylanases (five candidates) (Table (Table2;2; see supplementary Table Table3).3). These proteins probably exhibit plant cell wall-degrading activity. The nature and number of these candidates are similar in all four xanthomonads. Additionally, we found a disrupted homolog of Pat-1 from the gram-positive tomato pathogen Clavibacter michiganensis subsp. michiganensis (XCV4424 and XCV4425). This gene, which encodes a putative secreted serine protease that is essential for wilt induction on tomato (33), is not present in X. axonopodis pv. citri and X. campestris pv. campestris. However, a nondisrupted homolog is present in X. oryzae pv. oryzae (XOO0986), suggesting a possible role of this gene in virulence.
In addition to the well-studied secretion systems, X. campestris pv. vesicatoria contains homologs of raxST, raxA, raxB, and raxC (XCV1244 to XCV1246 and XCV3591), which are involved in AvrXa21 activity in X. oryzae pv. oryzae. The nature of AvrXa21 is unknown, but it leads to specific recognition of bacteria in rice plants expressing the Xa21 resistance gene (27). RaxA and RaxB show similarity to a double-glycine leader peptide-dependent secretion system and might, in concert with RaxC, secrete the so-far-unknown substrate (27).
The main virulence determinant of X. campestris pv. vesicatoria is the TTSS (17, 19), which is encoded by the hrp pathogenicity gene cluster (11) located in a 35.3-kb chromosomal genomic island (459,555 bp to 494,868 bp). Interestingly, the regions flanking the 16.2-kb core cluster (hrcC-hpaB) encode accessory components and substrates of the TTSS and are not conserved among the different xanthomonads (19, 53, 72).
Most promoters in the hrp gene cluster contain PIP boxes (TTCGC-N15-TTCGC), the supposed binding motif for the transcriptional regulator HrpX (35, 107). In addition to 8 known PIP boxes, we identified 15 novel boxes within a reasonable distance upstream of the CDS (Table (Table3).3). The genome contains 189 imperfect PIP boxes (TTCG-N16-TTCG); however, no clear association with pathogenicity-related genes is evident. Future work will elucidate how many of these genes are regulated by HrpG and HrpX and might play a role in pathogenicity. However, it is known already that not all HrpX-regulated genes possess this motif in their promoter regions (71). The genome sequence enables us to determine the complexities of HrpG- and HrpX-dependent regulation of gene expression by transcriptome analyses.
TTSS of plant pathogens translocate a large number of effector proteins into the plant cells (1). So far, 14 type III effectors of X. campestris pv. vesicatoria strain 85-10 are known (19, 60, 70, 82). Analysis of the X. campestris pv. vesicatoria genome sequence led to the identification of six new candidate effectors (XCVd0105, XCV0294, XCV1298, XCV2280, XCV3786, and XCV4428) (Table (Table4),4), most of which have homologs in other xanthomonads and different P. syringae strains but not in E. carotovora subsp. atroseptica (Table (Table4).4). Type III effectors of P. syringae were designated according to the new unified nomenclature (55). It appears that homologs of HopX2 (formerly HopPmaB), HopH1 (formerly HopPtoH), HopQ1-1 (formerly HopPtoQ), and XopP are distributed most widely among plant-pathogenic bacteria. For almost all effectors, homologous proteins can be found in other pathogenic bacteria. The only exception is XopC, which is unique for X. campestris pv. vesicatoria (70). It should be noted that in contrast to other X. campestris pv. vesicatoria strains, strain 85-10 does not contain homologs of the well-studied AvrBs3 family of effectors (17). Xanthomonads also lack effectors of the YopT family, which is found in many animal and plant-pathogenic bacteria, such as P.syringae and R. solanacearum. In summary, the set of effectors differs between several species and even between closely related bacteria like the four xanthomonads (Table (Table4).4). The specific effector set of a given strain is certainly important for its interaction with the plant, thus potentially determining host range.
Ten of the 20 candidate and proven effector genes of X.campestris pv. vesicatoria show significantly lower G+C contents, indicative for acquisition by horizontal gene transfer (31). This idea is corroborated by the fact that some of the predicted effector genes are located in the vicinity of mobile genetic elements. It is noteworthy that we identified two novel effector integrons similar to a previously described element (Fig. (Fig.3)3) (70). All three integrons are flanked by inverted repeats that were first identified next to genes coding for effectors of the AvrBs3 family (10, 70). One of the newly identified integrons is located on the chromosome and is associated with a gene (XCV2280) encoding a homolog of the effector HopX2 from P. syringae (Table (Table4)4) (40). The other integron is located on the plasmid pXCV183 and harbors the putative postsegregational killing system. This integron is flanked by a type III effector locus, which encodes AvrBs1 and a putative tyrosine phosphatase homologous to the effector HopAO1 (formerly HopPtoD2) from P. syringae pv. tomato DC3000 (12, 34). We were surprised that these are the only two effector genes present on the plasmids of X. campestris pv. vesicatoria strain 85-10. Similar effector integrons are also present in the X.axonopodis pv. citri chromosome and plasmid pXAC64 (70).
In addition to protein secretion systems, other factors might be important for the pathogen-plant interaction. We identified 65 proteins with possible functions in defense of X. campestris pv. vesicatoria against antimicrobial substances (Table (Table2;2; see supplementary Table Table3).3). Interestingly, genes coding for a copper-efflux system of X. campestris pv. vesicatoria (XCV3747 and XCV3748) were first described to be carried on the same plasmid as AvrBs1 (92), but in strain 85-10, they are located on the chromosome. The known streptomycin resistance of this strain is encoded on a transposon (93). Additionally, we identified genes coding for catalases (five candidates), peroxidases (five candidates), and superoxide dismutases (four candidates), which might function in detoxification of reactive oxygen species produced during the plant defense reaction (see supplementary Table Table33).
In this study, we present the complete genome sequence of X. campestris pv. vesicatoria, which revealed novel insights into the genome plasticity and pathogenicity of this important plant pathogen. All xanthomonads carry a high number of mobile elements; however, their set differs in each strain. Several regions of atypical G+C content are associated with both IS elements and tRNA genes and contain putative pathogenicity determinants which might have been acquired by horizontal gene transfer. The presence of four plasmids, two of which carry a tRNA gene not present on the chromosome, was unexpected. The role of the plasmids in pathogenicity is currently being addressed by plasmid curing.
Intriguingly, X. campestris pv. vesicatoria possesses all known types of protein secretion systems. Previous studies revealed that the TTSS is essential for pathogenicity. Here, several novel type III effectors were predicted and these will be analyzed in the future. In addition, the role of the other secretion systems in the interaction with the plant will be examined with a focus on cell wall-degrading enzymes and adhesins. One of the highlights of our genome analysis is the finding of a candidate Icm/Dot-like type IV secretion system in X. campestris pv. vesicatoria.
We were surprised by the number of genes devoted to regulation, a process that is poorly understood in X. campestris pv. vesicatoria. The identification of two quorum-sensing systems and of a complex and apparently dynamic chemotactic system will stimulate the study of regulation in the context of pathogenicity.
Finally, the first identification of genes possibly involved in adhesion will open new directions in the study of the complete arsenal of virulence factors in X. campestris pv. vesicatoria.
We gratefully thank all the people involved in this project.
This work was supported by the GenoMik initiative of the German Federal Ministry of Education and Research (BMBF) and grants to Ulla Bonas, Alfred Pühler, and the competence network center Bielefeld (BMBF grant numbers 031U213D and 0313105). D. H. Nies and G. Mittenhuber were supported by an HWP grant.