The oral microbial community (microbiota) plays a critical role in human health and disease. Alterations in the oral microbiota may be associated with disorders such as gingivitis, periodontitis, childhood caries, alveolar osteitis, oral candidiasis and endodontic infections. In the immunosuppressed population, the spectrum of potential oral disease is even broader, encompassing candidiasis, necrotizing gingivitis, parotid gland enlargement, Kaposi’s sarcoma, oral warts and other diseases. Here, we used 454 pyrosequencing of bacterial 16S rRNA genes to examine the oral microbiome of saliva, mucosal and tooth samples from HIV-positive and negative children. Patient demographics and clinical characteristics were collected from a cross-section of patients undergoing routine dental care. Multiple specimens from different sampling sites in the mouth were collected for each patient. The goal of the study was to observe the potential diversity of the oral microbiota among individual patients, sample locations, HIV status and various dental characteristics. We found that there were significant differences in the microbiome among the enrolled patients, and between sampling locations. The analysis was complicated by uneven enrollment in the patient cohorts, with only five HIV-negative patients enrolled in the study and by the rapid improvement in the health of HIV-infected children between the time the study was conceived and completed. The generally good oral health of the HIV-negative patients limited the number of dental plaque samples that could be collected. We did not identify significant differences between well-controlled HIV-positive patients and HIV-negative controls, suggesting that well-controlled HIV-positive patients essentially harbor similar oral flora compared to patients without HIV. Nor were significant differences in the oral microbiota identified between different teeth or with different dental characteristics. Additional studies are needed to better characterize the oral microbiome in children and those with poorly-controlled HIV infections.
Staphylococcus aureus CC30 is distinguished by its ability initiate complicated infections. This ability is due to acquisition of unique genes, SNPs and metabolic changes that attenuate virulence until the conditions become favorable for bacteremia and subsequent hematogenous seeding.
Background. The contemporary Staphylococcus aureus clonal complex (CC) 30 lineage is associated with complicated infections, including endocarditis and osteomyelitis. This lineage diverged from the phage-type 80/81 S aureus clone responsible for a major bacterial epidemic of the 20th century. The genome and transcriptome features that contribute to complicated infections of the CC30 lineage are unknown.
Methods. Twenty-nine clinical methicillin-resistant S aureus (MRSA) strains (8 from CC30 and 21 from other major CCs were evaluated for virulence using murine and Galleria mellonella sepsis models. Genomic features of CC30 were identified by comparative genome sequencing and RNA-Seq transcriptome analysis of the 29 strains and 31 previously sequenced S aureus genomes.
Results. The CC30 isolates displayed lower virulence in the sepsis models compared with other CCs [P < .0001]. Comparisons of orthologous proteins and transcriptome analysis identified genes (eg, nitric oxide reductase) and changes in metabolic pathways (eg, pyrimidine metabolism) that contribute to the distinct CC30 phenotype. Previously reported nonsynonymous single-nucleotide polymorphisms (SNPs) were found in accessory gene regulator C (agrC) and α-hemolysin (hla), molecules important for virulence. Additional nonsynonymous SNPs conserved across clinical CC30 isolates when compared with the first sequenced contemporary CC30 clone, MRSA-16, were identified in multiple genes, suggesting continuing evolutionary divergence in this lineage.
Conclusions. Genomic and transcriptional analyses suggest that the CC30 lineage has acquired metabolic features that contribute to persistent and complicated infections. Absence of sepsis-induced mortality in animal models may be due in part to its unique genomic profile and suggests that specific genotypes of S aureus elicit distinct types of infection types.
genome; genotype; hematogenous infections; Staphylococcus aureus; transcriptome
We report the complete sequence of ER2796, a laboratory strain of Escherichia coli K-12 that is completely defective in DNA methylation. Because of its lack of any native methylation, it is extremely useful as a host into which heterologous DNA methyltransferase genes can be cloned and the recognition sequences of their products deduced by Pacific Biosciences Single-Molecule Real Time (SMRT) sequencing. The genome was itself sequenced from a long-insert library using the SMRT platform, resulting in a single closed contig devoid of methylated bases. Comparison with K-12 MG1655, the first E. coli K-12 strain to be sequenced, shows an essentially co-linear relationship with no major rearrangements despite many generations of laboratory manipulation. The comparison revealed a total of 41 insertions and deletions, and 228 single base pair substitutions. In addition, the long-read approach facilitated the surprising discovery of four gene conversion events, three involving rRNA operons and one between two cryptic prophages. Such events thus contribute both to genomic homogenization and to bacteriophage diversification. As one of relatively few laboratory strains of E. coli to be sequenced, the genome also reveals the sequence changes underlying a number of classical mutant alleles including those affecting the various native DNA methylation systems.
Phylogenomic footprinting is an approach for ab initio identification of genome-wide regulatory elements in bacterial species based on sequence conservation. The statistical power of the phylogenomic approach depends on the degree of sequence conservation, the length of regulatory elements, and the level of phylogenetic divergence among genomes. Building on an earlier model, we propose a binomial model that uses synonymous tree lengths as neutral expectations for determining the statistical significance of conserved intergenic spacer (IGS) sequences. Simulations show that the binomial model is robust to variations in the value of evolutionary parameters, including base frequencies and the transition-to-transversion ratio. We used the model to search for regulatory sequences in the Lyme disease species group (Borrelia burgdorferi sensu lato) using 23 genomes. The model indicates that the currently available set of Borrelia genomes would not yield regulatory sequences shorter than five bases, suggesting that genome sequences of additional B. burgdorferi sensu lato species are needed. Nevertheless, we show that previously known regulatory elements are indeed strongly conserved in sequence or structure across these Borrelia species. Further, we predict with sufficient confidence two new RpoS binding sites, 39 promoters, 19 transcription terminators, 28 noncoding RNAs, and four sets of coregulated genes. These putative cis- and trans-regulatory elements suggest novel, Borrelia-specific mechanisms regulating the transition between the tick and host environments, a key adaptation and virulence mechanism of B. burgdorferi. Alignments of IGS sequences are available on BorreliaBase.org, an online database of orthologous open reading frame (ORF) and IGS sequences in Borrelia.
While bacterial genomes contain mostly protein-coding genes, they also house DNA sequences regulating the expression of these genes. Gene regulatory sequences tend to be conserved during evolution. By sequencing and comparing related genomes, one can therefore identify regulatory sequences in bacteria based on sequence conservation. Here, we describe a statistical framework by which one may determine how many genomes need to be sequenced and at what level of evolutionary relatedness in order to achieve a high level of statistical significance. We applied the framework to Borrelia burgdorferi, the Lyme disease agent, and identified a large number of candidate regulatory sequences, many of which are known to be involved in regulating the phase transition between the tick vector and mammalian hosts.
Smoking has been identified in observational studies as a risk factor for bacterial vaginosis (BV), a condition defined in part by decimation of Lactobacillus spp. The anti-estrogenic effect of smoking and trace amounts of benzo[a]pyrene diol epoxide (BPDE) may predispose women to BV. BPDE increases bacteriophage induction in Lactobacillus spp. and is found in the vaginal secretions of smokers. We compared the vaginal microbiota between smokers and non-smokers and followed microbiota changes in a smoking cessation pilot study.
In 2010–2011, 20 smokers and 20 non-smokers were recruited to a cross-sectional study (Phase A) and 9 smokers were enrolled and followed for a 12-week smoking cessation program (Phase B). Phase B included weekly behavioral counseling and nicotine patches to encourage smoking cessation. In both phases, participants self-collected mid-vaginal swabs (daily, Phase B) and completed behavioral surveys. Vaginal bacterial composition was characterized by pyrosequencing of barcoded 16S rRNA genes (V1-V3 regions). Vaginal smears were assigned Nugent Gram stain scores. Smoking status was evaluated (weekly, Phase B) using the semi-quantitative NicAlert® saliva cotinine test and carbon monoxide (CO) exhalation.
In phase A, there was a significant trend for increasing saliva cotinine and CO exhalation with elevated Nugent scores (P value <0.005). Vaginal microbiota clustered into three community state types (CSTs); two dominated by Lactobacillus (L. iners, L. crispatus), and one lacking significant numbers of Lactobacillus spp. and characterized by anaerobes (termed CST-IV). Women who were observed in the low-Lactobacillus CST-IV state were 25-fold more likely to be smokers than those dominated by L. crispatus (aOR: 25.61, 95 % CI: 1.03-636.61). Four women completed Phase B. One of three who entered smoking cessation with high Nugent scores demonstrated a switch from CST-IV to a L.iners-dominated profile with a concomitant drop in Nugent scores which coincided with completion of nicotine patches. The other two women fluctuated between CST-IV and L. iners-dominated CSTs. The fourth woman had low Nugent scores with L. crispatus-dominated CSTs throughout.
Smokers had a lower proportion of vaginal Lactobacillus spp. compared to non-smokers. Smoking cessation should be investigated as an adjunct to reducing recurrent BV. Larger studies are needed to confirm these findings.
Vaginal microbiota; Smoking cessation; Cigarette; 16S rRNA gene analysis; Bacterial vaginosis
The bacterial genus Borrelia (phylum Spirochaetes) consists of two groups of pathogens represented respectively by B. burgdorferi, the agent of Lyme borreliosis, and B. hermsii, the agent of tick-borne relapsing fever. The number of publicly available Borrelia genomic sequences is growing rapidly with the discovery and sequencing of Borrelia strains worldwide. There is however a lack of dedicated online databases to facilitate comparative analyses of Borrelia genomes.
We have developed BorreliaBase, an online database for comparative browsing of Borrelia genomes. The database is currently populated with sequences from 35 genomes of eight Lyme-borreliosis (LB) group Borrelia species and 7 Relapsing-fever (RF) group Borrelia species. Distinct from genome repositories and aggregator databases, BorreliaBase serves manually curated comparative-genomic data including genome-based phylogeny, genome synteny, and sequence alignments of orthologous genes and intergenic spacers.
With a genome phylogeny at its center, BorreliaBase allows online identification of hypervariable lipoprotein genes, potential regulatory elements, and recombination footprints by providing evolution-based expectations of sequence variability at each genomic locus. The phylo-centric design of BorreliaBase (http://borreliabase.org) is a novel model for interactive browsing and comparative analysis of bacterial genomes online.
Lyme disease; Vector-borne relapsing fever; Genome browser; Recombination; Population genomics
High throughput sequencing has accelerated the determination of genome sequences for thousands of human infectious disease pathogens and dozens of their vectors. The scale and scope of these data are enabling genotype-phenotype association studies to identify genetic determinants of pathogen virulence and drug/insecticide resistance, and phylogenetic studies to track the origin and spread of disease outbreaks. To maximize the utility of genomic sequences for these purposes, it is essential that metadata about the pathogen/vector isolate characteristics be collected and made available in organized, clear, and consistent formats. Here we report the development of the GSCID/BRC Project and Sample Application Standard, developed by representatives of the Genome Sequencing Centers for Infectious Diseases (GSCIDs), the Bioinformatics Resource Centers (BRCs) for Infectious Diseases, and the U.S. National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), informed by interactions with numerous collaborating scientists. It includes mapping to terms from other data standards initiatives, including the Genomic Standards Consortium’s minimal information (MIxS) and NCBI’s BioSample/BioProjects checklists and the Ontology for Biomedical Investigations (OBI). The standard includes data fields about characteristics of the organism or environmental source of the specimen, spatial-temporal information about the specimen isolation event, phenotypic characteristics of the pathogen/vector isolated, and project leadership and support. By modeling metadata fields into an ontology-based semantic framework and reusing existing ontologies and minimum information checklists, the application standard can be extended to support additional project-specific data fields and integrated with other data represented with comparable standards. The use of this metadata standard by all ongoing and future GSCID sequencing projects will provide a consistent representation of these data in the BRC resources and other repositories that leverage these data, allowing investigators to identify relevant genomic sequences and perform comparative genomics analyses that are both statistically meaningful and biologically relevant.
The function of the appendix is largely unknown, but its microbiota likely contributes to function. Alterations in microbiota may contribute to appendicitis, but conventional culture studies have not yielded conclusive information. We conducted a pilot, culture-independent 16S rRNA-based microbiota study of paired appendix and rectal samples.
We collected appendix and rectal swabs from 21 children undergoing appendectomy, six with normal appendices and fifteen with appendicitis (nine perforated). After DNA extraction, we amplified and sequenced 16S rRNA genes and analyzed sequences using CLoVR. We identified organisms differing in relative abundance using ANOVA (p<0.05) by location (appendix vs. rectum), disease (appendicitis vs. normal), and disease severity (perforated vs. non-perforated).
We identified 290 taxa in the study's samples. Three taxa were significantly increased in normal appendices vs. normal rectal samples: Fusibacter (p = 0.009), Selenomonas (p = 0.026), and Peptostreptococcus (p = 0.049). Five taxa were increased in abundance in normal vs. diseased appendices: Paenibacillaceae (p = 0.005), Acidobacteriaceae GP4 (p = 0.019), Pseudonocardinae (p = 0.019), Bergeyella (p = 0.019) and Rhizobium (p = 0.045). Twelve taxa were increased in the appendices of appendicitis patients vs. normal appendix: Peptostreptococcus (p = 0.0003), Bilophila (p = 0.0004), Bulleidia (p = 0.012), Fusobacterium (p = 0.018), Parvimonas (p = 0.003), Mogibacterium (p = 0.012), Aminobacterium (p = 0.019), Proteus (p = 0.028), Actinomycineae (p = 0.028), Anaerovorax (p = 0.041), Anaerofilum (p = 0.045), Porphyromonas (p = 0.010). Five taxa were increased in appendices in patients with perforated vs. nonperforated appendicitis: Bulleidia (p = 0.004), Fusibacter (p = 0.005), Prevotella (p = 0.021), Porphyromonas (p = 0.030), Dialister (p = 0.035). Three taxa were increased in rectum samples of patients with appendicitis compared to the normal patients: Bulleidia (p = 0.034), Dialister (p = 0.003), and Porphyromonas (p = 0.026).
Specific taxa are more abundant in normal appendices compared to the rectum, suggesting that a distinctive appendix microbiota exists. Taxa with altered abundance in diseased and severely diseased (perforated) samples may contribute to appendicitis pathogenesis, and may provide microbial signatures in the rectum useful for guiding both treatment and diagnosis of appendicitis.
Lyme disease is caused by spirochete bacteria from the Borrelia burgdorferi sensu lato (B. burgdorferi s.l.) species complex. To reconstruct the evolution of B. burgdorferi s.l. and identify the genomic basis of its human virulence, we compared the genomes of 23 B. burgdorferi s.l. isolates from Europe and the United States, including B. burgdorferi sensu stricto (B. burgdorferi s.s., 14 isolates), B. afzelii (2), B. garinii (2), B. “bavariensis” (1), B. spielmanii (1), B. valaisiana (1), B. bissettii (1), and B. “finlandensis” (1).
Robust B. burgdorferi s.s. and B. burgdorferi s.l. phylogenies were obtained using genome-wide single-nucleotide polymorphisms, despite recombination. Phylogeny-based pan-genome analysis showed that the rate of gene acquisition was higher between species than within species, suggesting adaptive speciation. Strong positive natural selection drives the sequence evolution of lipoproteins, including chromosomally-encoded genes 0102 and 0404, cp26-encoded ospC and b08, and lp54-encoded dbpA, a07, a22, a33, a53, a65. Computer simulations predicted rapid adaptive radiation of genomic groups as population size increases.
Intra- and inter-specific pan-genome sizes of B. burgdorferi s.l. expand linearly with phylogenetic diversity. Yet gene-acquisition rates in B. burgdorferi s.l. are among the lowest in bacterial pathogens, resulting in high genome stability and few lineage-specific genes. Genome adaptation of B. burgdorferi s.l. is driven predominantly by copy-number and sequence variations of lipoprotein genes. New genomic groups are likely to emerge if the current trend of B. burgdorferi s.l. population expansion continues.
Borrelia burgdorferi; Lyme borreliosis; Pan-genome; Single-nucleotide polymorphisms; Phylogenetic tree; Genome evolution simulation
Crohn's disease (CD) is an inflammatory bowel disease of complex etiology, although dysbiosis of the gut microbiota has been implicated in chronic immune-mediated inflammation associated with CD. Here we combined shotgun metagenomic and metaproteomic approaches to identify potential functional signatures of CD in stool samples from six twin pairs that were either healthy, or that had CD in the ileum (ICD) or colon (CCD). Integration of these omics approaches revealed several genes, proteins, and pathways that primarily differentiated ICD from healthy subjects, including depletion of many proteins in ICD. In addition, the ICD phenotype was associated with alterations in bacterial carbohydrate metabolism, bacterial-host interactions, as well as human host-secreted enzymes. This eco-systems biology approach underscores the link between the gut microbiota and functional alterations in the pathophysiology of Crohn's disease and aids in identification of novel diagnostic targets and disease specific biomarkers.
Chronic wounds contain complex polymicrobial communities of sessile organisms that have been underappreciated because of limitations of standard culture techniques. The aim of this work is to combine recently developed next-generation investigative techniques to comprehensively describe the microbial characteristics of chronic wounds. Tissue samples were obtained from 15 patients with chronic wounds presenting to the Johns Hopkins Wound Center. Standard bacteriological cultures demonstrated an average of 3 common bacterial species in wound samples. By contrast, high-throughput pyrosequencing revealed increased bacterial diversity with an average of 17 genera in each wound. Data from microbial community profiling of chronic wounds was compared to published sequenced analyses of bacteria from normal skin. Increased proportions of anaerobes, Gram-negative rods and Gram-positive cocci were found in chronic wounds. In addition, chronic wounds had significantly lower populations of Propionibacterium compared to normal skin. Using epifluorescence microscopy, wound bacteria were visualized in highly organized thick confluent biofilms or as scattered individual bacterial cells. Fluorescent in-situ hybridization allowed for the visualization of Staphylococcus aureus cells in a wound sample. Quorum sensing molecules were measured by bioassay to evaluate signaling patterns amongst bacteria in the wounds. A range of autoinducer-2 activities were detected in the wound samples. Collectively, these data provide new insights into the identity, organization, and behavior of bacteria in chronic wounds. Such information may provide important clues to effective future strategies in wound healing.
Chronic Wound; Microbiome; Biofilm; Microbiology; Pyrosequencing; Epifluorescence Microscopy; Fluorescent In-Situ Hybridization; Quorum Sensing
Obesity has been linked to the human gut microbiota; however, the contribution of gut bacterial species to the obese phenotype remains controversial because of conflicting results from studies in different populations. To explore the possible dysbiosis of gut microbiota in obesity and its metabolic complications, we studied men and women over a range of body mass indices from the Old Order Amish sect, a culturally homogeneous Caucasian population of Central European ancestry. We characterized the gut microbiota in 310 subjects by deep pyrosequencing of bar-coded PCR amplicons from the V1–V3 region of the 16S rRNA gene. Three communities of interacting bacteria were identified in the gut microbiota, analogous to previously identified gut enterotypes. Neither BMI nor any metabolic syndrome trait was associated with a particular gut community. Network analysis identified twenty-two bacterial species and four OTUs that were either positively or inversely correlated with metabolic syndrome traits, suggesting that certain members of the gut microbiota may play a role in these metabolic derangements.
It has been known for decades that human Lyme disease is caused by the three spirochete species Borrelia burgdorferi, Borrelia afzelii, and Borrelia garinii. Recently, Borrelia valaisiana, Borrelia spielmanii, and Borrelia bissettii have been associated with Lyme disease. We report the complete genome sequences of B. valaisiana VS116, B. spielmanii A14S, and B. bissettii DN127.
Human Lyme disease is commonly caused by several species of spirochetes in the Borrelia genus. In Eurasia these species are largely Borrelia afzelii, B. garinii, B. burgdorferi, and B. bavariensis sp. nov. Whole-genome sequencing is an excellent tool for investigating and understanding the influence of bacterial diversity on the pathogenesis and etiology of Lyme disease. We report here the whole-genome sequences of four isolates from two of the Borrelia species that cause human Lyme disease, B. afzelii isolates ACA-1 and PKo and B. garinii isolates PBr and Far04.
Methicillin-resistant Staphylococcus aureus (MRSA) strains are leading causes of hospital-acquired infections in the United States, and clonal cluster 5 (CC5) is the predominant lineage responsible for these infections. Since 2002, there have been 12 cases of vancomycin-resistant S. aureus (VRSA) infection in the United States—all CC5 strains. To understand this genetic background and what distinguishes it from other lineages, we generated and analyzed high-quality draft genome sequences for all available VRSA strains. Sequence comparisons show unambiguously that each strain independently acquired Tn1546 and that all VRSA strains last shared a common ancestor over 50 years ago, well before the occurrence of vancomycin resistance in this species. In contrast to existing hypotheses on what predisposes this lineage to acquire Tn1546, the barrier posed by restriction systems appears to be intact in most VRSA strains. However, VRSA (and other CC5) strains were found to possess a constellation of traits that appears to be optimized for proliferation in precisely the types of polymicrobic infection where transfer could occur. They lack a bacteriocin operon that would be predicted to limit the occurrence of non-CC5 strains in mixed infection and harbor a cluster of unique superantigens and lipoproteins to confound host immunity. A frameshift in dprA, which in other microbes influences uptake of foreign DNA, may also make this lineage conducive to foreign DNA acquisition.
Invasive methicillin-resistant Staphylococcus aureus (MRSA) infection now ranks among the leading causes of death in the United States. Vancomycin is a key last-line bactericidal drug for treating these infections. However, since 2002, vancomycin resistance has entered this species. Of the now 12 cases of vancomycin-resistant S. aureus (VRSA), each was believed to represent a new acquisition of the vancomycin-resistant transposon Tn1546 from enterococcal donors. All acquisitions of Tn1546 so far have occurred in MRSA strains of the clonal cluster 5 genetic background, the most common hospital lineage causing hospital-acquired MRSA infection. To understand the nature of these strains, we determined and examined the nucleotide sequences of the genomes of all available VRSA. Genome comparison identified candidate features that position strains of this lineage well for acquiring resistance to antibiotics in mixed infection.
Lyme disease is the most common tick-borne human illness in North America. In order to understand the molecular pathogenesis, natural diversity, population structure and epizootic spread of the North American Lyme agent, Borrelia burgdorferi sensu stricto, a much better understanding of the natural diversity of its genome will be required. Towards this end we present a comparative analysis of the nucleotide sequences of the numerous plasmids of B. burgdorferi isolates B31, N40, JD1 and 297. These strains were chosen because they include the three most commonly studied laboratory strains, and because they represent different major genetic lineages and so are informative regarding the genetic diversity and evolution of this organism. A unique feature of Borrelia genomes is that they carry a large number of linear and circular plasmids, and this work shows that strains N40, JD1, 297 and B31 carry related but non-identical sets of 16, 20, 19 and 21 plasmids, respectively, that comprise 33–40% of their genomes. We deduce that there are at least 28 plasmid compatibility types among the four strains. The B. burgdorferi ∼900 Kbp linear chromosomes are evolutionarily exceptionally stable, except for a short ≤20 Kbp plasmid-like section at the right end. A few of the plasmids, including the linear lp54 and circular cp26, are also very stable. We show here that the other plasmids, especially the linear ones, are considerably more variable. Nearly all of the linear plasmids have undergone one or more substantial inter-plasmid rearrangements since their last common ancestor. In spite of these rearrangements and differences in plasmid contents, the overall gene complement of the different isolates has remained relatively constant.
Accurate protein identification in large-scale proteomics experiments relies upon a detailed, accurate protein catalogue, which is derived from predictions of open reading frames based on genome sequence data. Integration of mass spectrometry-based proteomics data with computational proteome predictions from environmental metagenomic sequences has been challenging because of the variable overlap between proteomic datasets and corresponding short-read nucleotide sequence data. In this study, we have benchmarked several strategies for increasing microbial peptide spectral matching in metaproteomic datasets using protein predictions generated from matched metagenomic sequences from the same human fecal samples. Additionally, we investigated the impact of mass spectrometry-based filters (high mass accuracy, delta correlation), and de novo peptide sequencing on the number and robustness of peptide-spectrum assignments in these complex datasets. In summary, we find that high mass accuracy peptide measurements searched against non-assembled reads from DNA sequencing of the same samples significantly increased identifiable proteins without sacrificing accuracy.
Human Lyme disease is caused by a number of related Borrelia burgdorferi sensu lato species. We report here the complete genome sequence of Borrelia sp. isolate SV1 from Finland. This isolate is to date the closest known relative of B. burgdorferi sensu stricto, but it is sufficiently genetically distinct from that species that it and its close relatives warrant its candidacy for new-species status. We suggest that this isolate should be named “Borrelia finlandensis.”
Borrelia burgdorferi is a causative agent of Lyme disease in North America and Eurasia. The first complete genome sequence of B. burgdorferi strain 31, available for more than a decade, has assisted research on the pathogenesis of Lyme disease. Because a single genome sequence is not sufficient to understand the relationship between genotypic and geographic variation and disease phenotype, we determined the whole-genome sequences of 13 additional B. burgdorferi isolates that span the range of natural variation. These sequences should allow improved understanding of pathogenesis and provide a foundation for novel detection, diagnosis, and prevention strategies.
The composition of the oral microbiota from 10 individuals with healthy oral tissues was determined using culture-independent techniques. From each individual, 26 specimens, each from different oral sites at a single point in time, were collected and pooled. An eleventh pool was constructed using portions of the subgingival specimens from all 10 individuals. The 16S rRNA gene was amplified using broad-range bacterial primers, and clone libraries from the individual and subgingival pools were constructed. From a total of 11 368 high-quality, non-chimeric, near full-length sequences, 247 species-level phylotypes (using a 99% sequence identity threshold) and 9 bacteria phyla were identified. At least 15 bacterial genera were conserved among all 10 individuals, with significant interindividual differences at the species and strain level. Comparisons of these oral bacterial sequences to near full-length sequences found previously in the large intestines and feces of other healthy individuals suggest that the mouth and intestinal tract harbor distinct sets of bacteria. Co-occurrence analysis demonstrated significant segregation of taxa when community membership was examined at the level of genus, but not at the level of species, suggesting that ecologically-significant, competitive interactions are more apparent at a broader taxonomic level than species. This study is one of the more comprehensive, high-resolution analyses of bacterial diversity within the healthy human mouth to date, and highlights the value of tools from macroecology for enhancing our understanding of bacterial ecology in human health.
oral microbiota; ribosomal RNA sequences; human microbial ecology
Microbial pathogens have evolved sophisticated mechanisms for evasion of host innate and adaptive immunities. PFam54 is the largest paralogous gene family in the genomes of Borrelia burgdorferi, the Lyme disease bacterium. One member of PFam54, the complement-regulator acquiring surface proteins 1 (BbCRASP-1), is able to abort the alternative pathway of complement activation via binding human complement regulator factor H (FH). The gene coding for BbCRASP-1 exists in a tandem array of PFam54 genes in the B. burgdorferi genome, a result apparently of repeated gene duplications. To help elucidate the functions of the large number of PFam54 genes, we performed phylogenomic and structural analyses of the PFam54 gene array from ten B. burgdorferi genomes. Analyses based on gene tree, genome synteny, and structural models revealed rapid adaptive evolution of this array through gene duplication, gene loss, and functional diversification. Individual PFam54 genes, however, do not show high intra-population sequence polymorphisms as genes providing evasion from adaptive immunity generally do. PFam54 members able to bind human FH are not monophyletic, suggesting that human FH affinity, however strong, is an incidental rather than main function of these PFam54 proteins. The large number of PFam54 genes existing in any single B. burgdorferi genome may target different innate-immunity proteins of a single host species or the same immune protein of a variety of host species. Genetic variability of the PFam54 gene array suggests that universally present PFam54 lineages such as BBA64, BBA65, BBA66, and BBA73 may be better candidates for the development of broad-spectrum vaccines or drugs than strain-restricted lineages such as BbCRASP-1.
Borrelia burgdorferi; Complement-Regulator Acquiring Surface Protein-1; Lyme disease; Factor H; immune evasion; cspA
Caldicellulosiruptor saccharolyticus is an extremely thermophilic, gram-positive anaerobe which ferments cellulose-, hemicellulose- and pectin-containing biomass to acetate, CO2, and hydrogen. Its broad substrate range, high hydrogen-producing capacity, and ability to coutilize glucose and xylose make this bacterium an attractive candidate for microbial bioenergy production. Here, the complete genome sequence of C. saccharolyticus, consisting of a 2,970,275-bp circular chromosome encoding 2,679 predicted proteins, is described. Analysis of the genome revealed that C. saccharolyticus has an extensive polysaccharide-hydrolyzing capacity for cellulose, hemicellulose, pectin, and starch, coupled to a large number of ABC transporters for monomeric and oligomeric sugar uptake. The components of the Embden-Meyerhof and nonoxidative pentose phosphate pathways are all present; however, there is no evidence that an Entner-Doudoroff pathway is present. Catabolic pathways for a range of sugars, including rhamnose, fucose, arabinose, glucuronate, fructose, and galactose, were identified. These pathways lead to the production of NADH and reduced ferredoxin. NADH and reduced ferredoxin are subsequently used by two distinct hydrogenases to generate hydrogen. Whole-genome transcriptome analysis revealed that there is significant upregulation of the glycolytic pathway and an ABC-type sugar transporter during growth on glucose and xylose, indicating that C. saccharolyticus coferments these sugars unimpeded by glucose-based catabolite repression. The capacity to simultaneously process and utilize a range of carbohydrates associated with biomass feedstocks is a highly desirable feature of this lignocellulose-utilizing, biofuel-producing bacterium.
Whole-genome sequencing has been skewed toward bacterial pathogens as a consequence of the prioritization of medical and veterinary diseases. However, it is becoming clear that in order to accurately measure genetic variation within and between pathogenic groups, multiple isolates, as well as commensal species, must be sequenced. This study examined the pangenomic content of Escherichia coli. Six distinct E. coli pathovars can be distinguished using molecular or phenotypic markers, but only two of the six pathovars have been subjected to any genome sequencing previously. Thus, this report provides a seminal description of the genomic contents and unique features of three unsequenced pathovars, enterotoxigenic E. coli, enteropathogenic E. coli, and enteroaggregative E. coli. We also determined the first genome sequence of a human commensal E. coli isolate, E. coli HS, which will undoubtedly provide a new baseline from which workers can examine the evolution of pathogenic E. coli. Comparison of 17 E. coli genomes, 8 of which are new, resulted in identification of ∼2,200 genes conserved in all isolates. We were also able to identify genes that were isolate and pathovar specific. Fewer pathovar-specific genes were identified than anticipated, suggesting that each isolate may have independently developed virulence capabilities. Pangenome calculations indicate that E. coli genomic diversity represents an open pangenome model containing a reservoir of more than 13,000 genes, many of which may be uncharacterized but important virulence factors. This comparative study of the species E. coli, while descriptive, should provide the basis for future functional work on this important group of pathogens.
The extent of chromosome rearrangements in Pyrococcus isolates from marine hydrothermal vents in Vulcano Island, Italy, was evaluated by high-throughput genomic methods. The results illustrate the dynamic nature of the genomes of the genus Pyrococcus and raise the possibility of a connection between rapidly changing environmental conditions and adaptive genomic properties.