Bacteroides vulgatus is among the most commonly isolated microbes from the human gastrointestinal tract, and it has been found to constitute part of the core gut microbiota in healthy humans (
6,
11).
B. vulgatus is generally considered to be a beneficial gut commensal, but studies also suggest that important intraspecies variations may exist, with specific strains of
B. vulgatus shown to be capable of promoting or protecting against colitis (e.g., see references
2,
7, and
12).
B. vulgatus is also an occasional opportunistic pathogen, and consistent with this it is capable of attaching to and invading colonic epithelial cells and of inducing proinflammatory cytokines (
5). In addition,
B. vulgatus can bind to host structural factors (
10), and strains isolated from subjects with ulcerative colitis show increased adhesion to tissue-cultured cells in comparison to that of strains isolated from healthy subjects (
8). A key objective of the Human Microbiome Project is to provide information on the pan-genome of key microbial species.
Bacteroides spp. have been extensively shaped by lateral gene transfer (e.g., see references
1,
9, and
13), and we thus describe here the draft genome sequence of
B. vulgatus PC510 isolated as part of the Australian Human Gut Microbiome Project to provide a greater insight into the genetic variability of this genus and species.
B. vulgatus PC510 was isolated from a pooled healthy human fecal sample plated on medium 10 (
4), with inulin and Hi-maize as the sole carbohydrate sources. We applied a 454 Life Sciences GS FLX system at the J. Craig Venter Institute (JCVI) to generate 4,781,702 bp of DNA sequence at 61× coverage. The individual sequence reads were then assembled into 117 contigs using the Newbler assembler, version 2.3. The contig N50 was approximately 94.2 kb, and the largest contig assembled was approximately 490.4 kb. Then, the JCVI prokaryotic annotation pipeline was used to annotate the DNA sequences.
The G+C content of the draft genome is 42%, with 89.6% of the genome predicted to be coding. The draft genome contains 4,091 genes, with 3,956 protein-coding genes and 63 structural RNAs. As expected,
B. vulgatus PC510 encodes a diverse array of proteins with predicted roles in carbohydrate metabolism. In particular, we identified arylsulfatases, hexosaminidases, fucosidases, and a sialidase with putative roles in harvesting host glycans, as well as several genomic loci encoding proteins with likely roles in capsule production. The genome sequence also revealed the presence of a putative plasminogen/laminin binding protein, a heme utilization regulon, and an ortholog of a lipoprotein recently implicated in the activation of the NF-κB signaling pathway in an HT-29-based cell line (
3).
Finally, we identified integron, transposon, and phage-like elements, suggesting that the genome of B. vulgatus PC510 has been extensively shaped by lateral gene transfer, consistent with other Bacteroides spp. Further intraspecies analyses may provide important genetic insights into the functional capability of B. vulgatus, and future studies should focus particularly on species isolated from subjects with gastrointestinal disorders, including ulcerative colitis and Crohn's disease.
Nucleotide sequence accession number. This whole-genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession number ADKO00000000. The version described here is the first version, ADKO01000000. The genome project data are also available at GenBank under the Genome project identifier 42763.