Search tips
Search criteria 


Logo of aemPermissionsJournals.ASM.orgJournalAEM ArticleJournal InfoAuthorsReviewers
Appl Environ Microbiol. 2008 August; 74(15): 4898–4909.
Published online 2008 May 16. doi:  10.1128/AEM.02884-07
PMCID: PMC2519371

Diversity of Human Vaginal Bacterial Communities and Associations with Clinically Defined Bacterial Vaginosis[down-pointing small open triangle]


Bacterial vaginosis (BV) is a common syndrome associated with numerous adverse health outcomes in women. Despite its medical importance, the etiology and microbial ecology of BV remain poorly understood. We used broad-range PCR to census the community structure of the healthy and BV-affected vaginal microbial ecosystems and synthesized current publicly available bacterial 16S rRNA gene sequence data from this environment. The community of vaginal bacteria detected in subjects with BV was much more taxon rich and diverse than in subjects without BV. At a 97% sequence similarity cutoff, the number of operational taxonomic units (OTUs) per patient in 28 subjects with BV was nearly three times greater than in 13 subjects without BV: 14.8 ± 0.7 versus 5.2 ± 0.75 (mean ± standard error). OTU-based analyses revealed previously hidden diversity for many vaginal bacteria that are currently poorly represented in GenBank. Our sequencing efforts yielded many novel phylotypes (123 of our sequences represented 38 OTUs not previously found in the vaginal ecosystem), including several novel BV-associated OTUs, such as those belonging to the Prevotella species complex, which remain severely underrepresented in the current NCBI database. Community composition was highly variable among subjects at a fine taxonomic scale, but at the phylum level, Actinobacteria and Bacteroidetes were strongly associated with BV. Our data describe a previously unrecognized extent of bacterial diversity in the vaginal ecosystem. The human vagina hosts many bacteria that are only distantly related to known species, and subjects with BV harbor particularly taxon-rich and diverse bacterial communities.

A growing body of evidence suggests that human health is fundamentally affected by the composition and function of the vast human microbial ecosystem. Microbial cells in and on our bodies are approximately 10 times more numerous than our own cells and contain, in aggregate, about 100 times more genes, leading to the suggestion that humans and our microbial symbionts be considered superorganisms (12). There is increasing evidence that microbial communities strongly influence human health and quality of life. For example, in the gastrointestinal tract, the taxonomic composition of the microbial community may affect the propensity to develop obesity (30). In the vagina, bacterial vaginosis (BV) is marked by dramatic shifts in the types and relative proportions of a diverse community of bacteria as the vaginal ecosystem changes from a healthy to a diseased state (7, 9).

BV is strongly associated with several adverse health outcomes, including preterm labor and delivery (13, 14), pelvic inflammatory disease (29), and the acquisition and transmission of sexually transmitted diseases, such as human immunodeficiency virus (23-25). BV is the most common cause of vaginal discharge among women of reproductive age and results in millions of health care visits annually in the United States (26). Because BV is fundamentally a result of changes to the vaginal microbial community, full understanding and successful treatment will require knowledge of how the healthy community is altered in its taxonomic composition, community structure, and, ultimately, function.

Historically, a search for single etiologic agents in BV and the constraints of traditional cultivation techniques limited appreciation of the full extent of vaginal microbial diversity in this syndrome. BV has long been recognized as a complex condition, but until recently, lactobacilli found in the healthy vagina were thought to be replaced by a relatively small suite of taxa typically including Gardnerella vaginalis, Prevotella spp., Porphyromonas spp., Mobiluncus spp., and Mycoplasma hominis (7, 27). The advent of molecular tools, such as broad-range bacterial 16S rRNA gene PCR, has enabled community assessments independent of cultivation-based techniques and has greatly expanded our knowledge of the phylogenetic range and taxonomic diversity of bacteria found in the vagina. For example, intensive sequencing has revealed extensive diversity, not only within the Lactobacillus species complex (15), but also within other taxa most closely related to Atopobium, Clostridium, Leptotrichia, Prevotella, Peptostreptococcus, and Peptoniphilus, species not previously thought to be common in the human vagina (3, 9, 15, 33, 34).

Although there is agreement in the literature that no single agent likely causes BV, there is no consensus about what constitutes a pathogenic bacterial community in this syndrome. Comprehensive cultivation-independent comparisons of the vaginal bacterial communities from subjects with and without clinically defined BV have been rare. Without such fundamental data, our understanding of the pathogenesis of BV is constrained. In one study that compared broad-range bacterial community composition among patients with objectively defined BV status, subjects with BV had nearly four times as many taxa as subjects without BV, including uncultivated Clostridia-like bacteria, which were shown to be highly specific for BV (9).

In this paper, we extend this previous work to present a systematic phylogenetic analysis of vaginal bacterial community composition from subjects with defined BV status as determined by commonly accepted clinical criteria. We also synthesize current public 16S rRNA gene sequence data from bacteria found in the vaginal ecosystem (with inclusion of studies where BV status was not assessed) and place these data in a phylogenetic context. Our goal was to provide a comprehensive picture of our current knowledge of the community structure of the vaginal bacterial ecosystem.


Sample collection and clinical assessment of BV status.

Vaginal swabs were collected from subjects attending either the Public Health-Seattle and King County Sexually Transmitted Diseases Clinic or the Harborview Medical Center Women's Research Clinic and immediately frozen for later analysis, as previously described (9). BV status was assessed using Amsel clinical criteria (1) for all subjects and confirmed using Gram stain criteria (Nugent [18] scores) for subjects attending the Women's Research Clinic. In this paper, we analyze data collected from 34 vaginal samples, including 21 collected from subjects with clinically defined BV and 13 samples collected from subjects without BV. This analysis includes data on six vaginal samples not previously reported (9) and provides a more exhaustive phylogenetic analysis based on the presence of high-quality sequence data (detailed below) representative of the two patient populations.

Written informed consent was obtained from all participants in the study, which was also approved by the Institutional Review Boards at the Fred Hutchinson Cancer Research Center and the University of Washington.

Molecular methods.

DNA was obtained from swabs using the UltraClean Soil DNA extraction kit (MoBio, Carlsbad, CA), chosen after testing guanidinium-based lysis protocols with and without bead beating. Out of the 969 sequences generated, 3 were detected only in the protocol without bead beating and 7 were detected only in the bead-beating protocol. Accordingly, we did not find that bead beating substantially affected our results. For broad-range PCR, the primers 338F (5′-ACTCCTRCGGGAGGCAGCAG-3′) and 1407R (5′-GACGGGCGGTGTGTRCA-3′) were used with a thermal-cycling protocol of denaturation at 95°C, followed by 21 to 25 cycles of 95°C for 30 s, 55°C for 30s, and 72°C for 90s, with a final extension at 72°C for 7 min.

Successfully amplified products were cloned using the Topo blunt cloning kit (Invitrogen, Carlsbad, CA), and PCR-amplified inserts were sequenced using vector primers and BigDye version 3 (Applied Biosystems, Foster City, CA). Single reads were generated for 736 sequences, with an average length of 811 base pairs. Two reads were obtained for 233 sequences, resulting in about 1-kb sequence lengths.

Data analysis.

Raw sequence data were edited, and contigs were assembled for each clone using Sequencher (Gene Codes, Ann Arbor, MI). The 969 sequences obtained from our sequencing efforts and all publicly available sequences from vaginal bacteria (554 sequences, listed below) were then submitted to the GreenGenes 16S rRNA gene database (5) for alignment using the NAST algorithm (4) and taxonomic classification of each sequence. All sequences were checked for chimeric anomalies using the Mallard program (2).

To classify sequences based on self-similarities rather than matches to an external database, sequences were grouped into operational taxonomic units (OTUs) with cutoffs of 99%, 97%, and 95% sequence similarity using the DOTUR software package implemented with the furthest-neighbor option, in which all of the sequences within an OTU are at least x% similar to all of the other sequences within the OTU (22). The commonly accepted phylogenetic species definition of 97% 16S sequence similarity (32) was used to define a core data set of representative sequences that were used for phylogenetic analyses.

Sequences were incorporated into ARB (17) for phylogenetic analysis and tree construction. Sequences from a nonredundant list of three nearest-neighbor isolates for each sequence were also added to the database. Alignments were manually corrected based on the secondary structure of the 16S rRNA and tree construction performed using the maximum-likelihood method in ARB (FastdnaML) with a 75% similarity filter constructed for each phylum. Bootstrap values (100 resamplings) were generated for maximum-likelihood trees created in PHYLIP (8) using the DNAML algorithm and neighbor-joining trees created in ARB and superimposed on nodes supported by all three methods.

Publicly available sequences were collected from previous surveys of vaginal bacterial diversity, which included 439 sequences from Hyman et al. (15) (AY958774 to AY959212), 7 unpublished sequences from Zozaya-Hinchliffe et al. (EF120360 to EF120366), 25 sequences from Verhelst et al. (31) (AJ585206 to AJ619714), 15 unpublished sequences from Zhang et al. (DQ666091 to DQ666105), and 68 sequences from Zhou at al (33) (AY267541 and -2, AY269020 to -34, AY271931 to -53, AY283264 to -75, AY335493 to -504, and DQ987868 to -9). Because subject BV status was not available for most of these studies, the corresponding sequences were excluded from comparisons of bacterial community structure in subjects with and without BV but were included in phylogenetic analyses of vaginal bacterial diversity.

Nucleotide sequence accession numbers.

Sequences from the 97 OTUs generated in this study were deposited in GenBank (AY738660, AY738684, AY738687, AY738691, AY738694, AY738697, AY738701 to -5, EF428974, and EU188937 to EU189021) and are listed in Table S1 in the supplemental material.


The structures of the bacterial communities were dramatically different in samples from women with and without BV. Increased taxonomic richness and diversity of BV-associated flora relative to normal vaginal flora was evident in a variety of analyses. The mean number of taxa detected per patient was significantly higher (P < 0.001) in subjects with BV than in subjects without BV for all phylotype definitions (Fig. (Fig.1A).1A). Similarly, diversity was higher in subjects with BV; the Shannon diversity index was 1.4 to 4.1 times greater for subjects with BV than for subjects without BV, depending on the taxonomic definition (Table (Table11).

FIG. 1.
(A) Numbers of taxa per subject using four different taxon definitions. Taxon definitions of 99%, 97%, and 95% OTUs were assigned using the DOTUR package, and NCBI definitions were based on taxonomic classifications using the GreenGenes ...
Richness and diversity values and best matches to taxon designations in NCBI

High interpatient variability of vaginal bacterial community structure was evident in several analyses, particularly for subjects with BV. First, the total number of phylotypes found across all subjects in each clinical group was dramatically higher than the per-patient mean (Fig. (Fig.1).1). Second, many taxa were relatively rare; for subjects with BV, 47 of the 97 OTUs identified were found in fewer than 25% of subjects, while only a single taxon, most closely related to Bifidobacterium/Gardnerella, was found in all subjects with BV (Fig. (Fig.22).

FIG. 2.
Proportions of subjects for which each of the 97 OTUs classified at a 97% sequence similarity were encountered. The list indicates genus-level identification of each OTU based on the NCBI taxonomy. Note that sequences designated Bifidobacterium ...

There were dramatic differences in the relative proportions of major groups of bacteria between subjects with and without BV. Relative to subjects without BV, those with BV had much higher proportions of Actinobacteria (4.5 times) and Bacteroidetes (6.7 times) and a much lower proportion of Firmicutes (0.45 times) (Fig. (Fig.3B).3B). Several phylogenetic groups were found only in samples from subjects with BV; Fusobacteria, candidate division TM7, and Betaproteobacteria together accounted for 9% of clones from subjects with BV but were not detected in clone libraries from subjects without BV (Fig. (Fig.3B3B).

FIG. 3.
Taxonomic affiliations and relative proportions of sequences summarized at the genus (A) and phylum (B) levels. The taxa in panel A represent the closest matches from the NCBI database, with the numbers in parentheses representing the mean percent match ...

Subjects without BV had bacterial communities dominated by Lactobacillus species, accounting for 86% of all sequences from subjects without BV (Fig. (Fig.3).3). In contrast, subjects with BV did not possess a single dominant taxon but rather harbored a diverse array of vaginal bacteria, many present at low relative abundance. Of phylotypes unique to subjects with BV, nine were singletons (encountered only once), and the four most common (sequences most closely related to Olsenella, Megasphaera, Streptobacillus, and Shuttleworthia) accounted for 4%, 7%, 8%, and 9%, respectively, of all sequences retrieved from subjects with BV (Fig. (Fig.33).

For several taxonomic groups in particular, the true extent of diversity is not reflected in the current GenBank database. For example, even though about 20% of sequences from subjects with BV were classified as Prevotella according to the best match in GenBank, this group of sequences actually contained 21 different taxa based on a species definition of 97% sequence similarity (Fig. (Fig.3A).3A). Similarly, within the group of sequences most closely related to Lactobacillus in GenBank, there were actually 15 different phylotypes (Fig. (Fig.3A).3A). Estimates of taxonomic richness and diversity based on best matches to named taxa may severely underestimate the true extent of bacterial diversity in the vagina.

Taxonomic designations of sequences using two different classification schemes, NCBI and the Ribosomal Database Project (RDP) (, were largely congruent (Tables (Tables2,2, ,3,3, and and4)4) but also demonstrated important discrepancies. For example, sequences classified as Bifidobacterium by NCBI belonged to Gardnerella in the RDP scheme (Tables (Tables22 and and4).4). Similarly, sequences classified as Clostridium by NCBI were called Acetivibrio by RDP (Table (Table2).2). In general, NCBI designations split sequences into more groups, while the RDP scheme lumped sequences together: the 60 sequences classified as Acetivibrio by RDP belonged to four different taxonomic groups in the NCBI scheme, and 47 sequences classified as Coriobacteriaceae by RDP represented five taxonomic groups according to NCBI (Table (Table22).

Comparison of NCBI and RDP taxonomic designations for Actinobacteria, Bacteroidetes, and Firmicutes sequences obtained from subjects with BV (n = 580)
Comparison of NCBI and RDP taxonomic designations for Firmicutes and Fusobacteria sequences obtained from subjects with BV (n = 249)
Comparison of NCBI and RDP taxonomic designations for sequences obtained from subjects without BV (n = 140)

In the entire data set of 1,523 publicly available sequences, we found a total of 195 OTUs using the bacterial species definition of 97% 16S rRNA gene sequence similarity, including 38 novel OTUs representing 123 sequences unique to the data presented here. Of these, 31 OTUs (115 sequences) were found only in subjects with BV. We encountered 59 OTUs in our sequencing set that were also reported elsewhere in the data set of publicly available sequences, while 98 OTUs were detected in other studies (all but 5 of these by Hyman et al. [15]) but not in our patient populations.

Six sequences deposited in GenBank (AY959158, AY958981, AY959016, AY959073, AY958933, and AY959044) were found to be consistent with chimeric data (P < 0.001) and so were excluded from our analyses.

Phylogenetic analyses of BV-associated bacteria.

The true diversity and novelty of BV-associated bacteria were apparent when sequences generated in this study were compared to publicly available sequences from previous surveys of vaginal bacterial diversity and their nearest cultivated and uncultivated matches.

Within the phylum Bacteroidetes, sequences in our data set were most closely related to one of three genera: Prevotella, Bacteroides, and Porphyromonas (Fig. (Fig.4).4). Most (196/222) belonged to the Prevotella group but were only distantly related to their nearest relative and represented many novel sequence types not found in previous studies of vaginal bacteria. In the Prevotella species complex, we found 13 novel OTUs (Fig. (Fig.4).4). Two novel OTUs representing 24 sequences were most closely related to Porphyromonas uenonis (Fig. (Fig.4)4) and were found only in subjects with BV.

FIG. 4.
Phylogenetic relationships among representative taxa belonging to the phylum Bacteroidetes. The tree was reconstructed using the maximum-likelihood method in ARB with a 75% similarity column filter. Representative taxa were defined using a 97% ...

Within the Actinobacteria, most sequences (276/302) belonged to clades affiliated with Atopobium, Eggerthella, or Gardnerella, and all of the shared OTUs across studies belonged to one of these three groups (Fig. (Fig.5).5). Of these 306 sequences, only eight representing two OTUs in the Gardnerella group were from subjects confirmed not to have BV (Fig. (Fig.5).5). Our sequencing also revealed several novel OTUs in this phylum: one related to Atopobium, two most similar to Mobiluncus, and two Gardnerella-like OTUs (Fig. (Fig.5).5). Of the 17 OTUs not found in our sequences, 13 were singletons representing a wider phylogenetic range outside of the three main BV-associated genera of the Actinobacteria, and all were from patients with unknown BV status (Fig. (Fig.55).

FIG. 5.
Phylogenetic relationships among representative taxa belonging to the phylum Actinobacteria. The tree reconstruction methods, color scheme, and bootstrap representations are as described in the legend to Fig. Fig.44.

Within the Firmicutes, the majority of sequences (313/387) belonged to one of four main groups related to known taxa: Shuttleworthia, Megasphaera/Veillonella, Dialister, and Clostridium (Fig. (Fig.6).6). Of these, only a single sequence was from a subject confirmed not to have BV. We identified 11 novel OTUs representing 35 sequences within the Firmicutes; of these, only three sequences from two OTUs related to Clostridium perfringens and Ureaplasma were from subjects without BV (Fig. (Fig.66).

FIG. 6.
Phylogenetic relationships among representative taxa belonging to the phylum Firmicutes. The tree reconstruction methods, color scheme, and bootstrap representations are as described in the legend to Fig. Fig.44.

Previously identified (9) BV-associated bacteria appear to be truly novel taxa that are commonly encountered in subjects with BV. Ninety-three sequences most closely related to Shuttleworthia encompass those previously designated (9) as BV-associated bacterium 1 (BVAB-1) (Fig. (Fig.6).6). Three additional OTUs closely related to BVAB-1 were also identified (Fig. (Fig.6).6). Sequence types designated BVAB-2 and BVAB-3 (9) were most closely related to Acetovibrio cellulolyticus and several Clostridium strains and together accounted for 47 sequences (Fig. (Fig.6).6). Three additional OTUs (two of them novel) closely related to BVAB-2 represented an additional 27 sequences (Fig. (Fig.6).6). All three of the BVAB phylotypes had low similarities to their nearest known cultivated neighbors and together accounted for 14% of all sequences from subjects with BV.


The microbial ecosystem resident on the human vaginal epithelium remains poorly understood in terms of census, microbe-microbe interactions, and microbial interactions with the host. In particular, details about the relationship between newly described fastidious BV-associated bacteria and the manifestations of BV are just starting to emerge. In this study, we compared vaginal bacterial community structures between subjects with and without clinically defined BV and synthesized publicly available data to better understand the diversity of healthy and diseased vaginal bacterial communities.

For patients afflicted with BV, there is a profound shift in the types of bacteria present in the vagina and their absolute and relative abundances. Data presented here clearly demonstrate that BV is associated with a dramatic increase in the taxonomic richness and diversity of the vaginal bacterial community. Our analyses highlight three main points.

First, the true extent of vaginal bacterial diversity may be severely underestimated by attributing taxonomic identities to sequences simply according to their best match to currently known taxa. Although the use of the NCBI taxonomic classification scheme gives a familiar nomenclature for sequences obtained from any environment, in the case of the vaginal ecosystem, it also clearly hides a significant amount of diversity. Multiple OTUs within sequence groups matching a single taxon as defined by NCBI and low similarity scores to these taxa demonstrate that we are just beginning to reveal the true extent of the diversity of the vaginal flora.

Vaginal bacteria are generally underrepresented in the NCBI database, but for some groups in particular, the true extent of diversity is hidden by their poor representation in GenBank. For example, sequences classified as Prevotella actually contained 21 different OTUs using a 97% definition. Similarly, within sequences classified as Lactobacillus, there were 15 different OTUs. The diversity contained within the Lactobacillus species complex of the vagina has been noted previously by other researchers (15, 34), but the functional importance of this level of diversity remains unknown. Ecological theory suggests that each phylotype occupies its own niche in its environment and thus must be somehow unique, but the details of how BV-associated microbes partition niche space in the vaginal ecosystem and the correspondence between 16S rRNA-defined phylotypes and various functional capabilities remain relatively unexplored.

It is likely that additional sampling will reveal even greater richness and diversity in subjects with BV. When a taxon was defined using a 97% OTU definition, we observed a total of 61 taxa across all subjects with BV, but richness estimates of the true number of taxa present ranged from 63 to 111 (95% confidence interval), with an average estimate of 75 taxa (Table (Table1).1). For subjects without BV, our sampling effort was more complete; we encountered 25 taxa with a 97% OTU definition, and estimates of the total ranged from 20 to 30 (95% confidence interval), with an average of 21 (Table (Table11).

The loss of information caused by lumping multiple OTUs into a single taxon and the subsequent underestimation of the true richness and diversity of the vaginal ecosystem highlight the second important result of our analyses. The application of an OTU-based analysis to the full data set of our sequences and those from previous surveys of the vaginal ecosystem revealed many novel phylotypes associated with BV. We found 38 OTUs not previously encountered in the vaginal ecosystem, including 31 OTUs that were found only in subjects with BV. These OTUs were found across all of the major groups of BV-related taxa (Fig. (Fig.4,4, ,5,5, and and6),6), providing additional evidence that many more taxa are associated with BV than was previously thought.

Third, although the vaginal bacterial communities found in subjects with and without BV are distinctly different, the structures of these vaginal bacterial communities have high intersubject variability within each clinical group. For both groups, the total number of taxa encountered across all subjects was at least four times greater than the per-patient mean (Fig. (Fig.1),1), and many taxa were relatively uncommon (Fig. (Fig.2).2). This variability in community membership and structure has important implications for understanding the etiology of BV and for developing diagnostic tools. Why is the taxonomic composition of BV-associated bacteria so different for each patient? High interpatient variability has been observed previously for microbial communities in the vagina (3) and other areas of the human body (6, 11). Stochastic differences in colonization could generate this pattern, but so could other factors, such as differences in host immune response, expression of ligands for bacterial attachment to epithelial cells, the chemical and physical environments of the host, or intra- and interspecific microbial competition.

Although no single bacterium can be identified as uniquely associated with BV, aggregating our data at higher taxonomic levels made it clear that several taxonomic groups do have a strong association. Common practice has generally been to describe sampled communities using the finest level of taxonomic discrimination possible, but aggregating data into higher taxonomic groups could be more informative and may avoid currently unresolved problems in classifying many BV-related bacteria at the genus or species level. For example, when treated at the genus level, taxa such as Eggerthella, Mobiluncus, and Slackia were relatively rare in subjects with BV, but when considered as a phylum, Actinobacteria accounted for 26% of all sequences from subjects with BV versus 6% from those without. Actinobacteria and Bacteroidetes in particular were much more common in subjects with BV than in those without BV (4.5 times and 6.7 times, respectively). The ability to identify a set of signature taxa common to BV could have important clinical and diagnostic implications.

Aggregating taxonomic data may also reduce problems associated with the current poorly resolved systematics of many BV-associated bacteria. For example, under the NCBI naming scheme, sequences classified as Gardnerella were relatively rare in subjects with BV, which contradicts generally held notions of a strong association between BV and Gardnerella. However, the 110 sequences classified as Bifidobacterium by NCBI were all classified as Gardnerella by the RDP (Table (Table2).2). The discrepancies among different taxonomic classification schemes highlight the current state of systematics for many BV-associated taxa and emphasize the importance of OTU-based analyses to capture the true diversity within a pool of sequences.

The polymicrobial nature of BV clearly raises the possibility of interactions among vaginal microbes, including syntrophies, which might contribute to the pathogenesis of BV (see reference 20 for a review). Ammonia transfer from Prevotella bivia to G. vaginalis has been demonstrated (19), but much remains to be learned about interspecific interactions of vaginal microbes and their contribution to the etiology of BV. Subjects with BV have volatile amines in vaginal fluid (the basis for the “whiff test” used in the clinical diagnosis of BV) with elevated levels of trimethylamine, putrescine, cadaverine, and tyramine, but the microbes responsible for generating these metabolic products are not clearly identified. An interesting recent paper has shown the importance of direct competition between lactobacilli and G. vaginalis, apparently independently of pH and H2O2 production (21). Clearly, much work remains to be done to tease apart the microbial interactions, metabolic processes, and host factors that lead to BV; it is our hope that increased knowledge about the composition and structure of BV-associated bacterial communities will facilitate these studies.


The data presented here reflect different sample sizes for subjects with and without BV. However, initial screening of samples using restriction fragment length polymorphisms indicated that the extreme differences in taxonomic richness between the two patient populations justified the different sample sizes in order to achieve similar sampling depths. It is axiomatic that representative sampling of any community requires that the sampling effort be tied to the structure of the community; depauperate communities naturally require less sampling effort than taxon-rich and diverse communities to achieve equivalent sampling saturation. In the case of subjects without BV, fewer sequences were obtained due to many fewer restriction fragment length polymorphism patterns because of the dominance by Lactobacillus. Richness estimators (Table (Table1)1) indicated subjects without BV were adequately sampled, and additional diversity will be discovered in subjects with BV. Future work utilizing techniques such as pyrosequencing could provide greatly improved coverage of BV-associated genetic diversity but with more limited phylogenetic resolution due to limited sequence lengths.

Although the use of cloning to identify bacteria from complex samples has been criticized for potentially biased amplification (28), the clone library approach has also compared favorably to PCR-independent methods, such as fluorescence in situ hybridization, in its estimation of the relative proportions of various taxonomic groups (16). The partial 16S rRNA gene sequences obtained in our study (~800 to 1,000 bp) may contribute to some of the discrepancies in taxonomic assignments between the NCBI and RDP classifications. Full-length sequences (~1.5 kb) would likely improve phylogenetic resolution for some taxa, but in this study, we opted for a relatively high-throughput approach to compare a large number of sequences from subjects with and without BV.

Comparison to previous results.

In our phylogenetic analysis, it is apparent that novel OTUs found in our study generally belong to different phylogenetic clades than OTUs not encountered in our study. This is perhaps most striking for the Bacteroidetes (Fig. (Fig.4),4), for which we found 13 OTUs that had not been discovered in previous studies of vaginal bacteria. Similarly, for the Actinobacteria (Fig. (Fig.5),5), we found only two OTUs representing five sequences that did not group with either Atopobium, Eggerthella, or Gardnerella/Bifidobacterium, while Hyman et al. found 15 OTUs outside of this main group. This pattern could reflect different study populations, the inherent variability of BV, PCR primer bias, or different sampling intensities across studies. Most (82/98) of the OTUs not found by us have two or fewer members represented, and all but six of these were found only by Hyman et al. (15), suggesting that their intensive sequencing effort did find uncommon members of the community. However, it is also possible that these sequences could originate from low-level PCR contaminants, which are more likely to become evident with more intensive sampling of clone libraries (10). Taq polymerase, used for PCR, is known to be contaminated with low levels of bacterial 16S rRNA genes. Demographic differences among patient populations have also been demonstrated (34) and may explain these differences.


The structures of the vaginal bacterial communities differ dramatically between subjects with and without BV. BV is associated with increased taxonomic richness and diversity. At a species or genus level, the composition of the vaginal bacterial community has high interpatient variability, yet at higher taxonomic levels, several bacterial groups are strongly associated with BV, most notably Actinobacteria and Bacteroidetes. Our data describe a previously unrecognized extent of diversity in the vaginal ecosystem in general and of BV-associated bacteria in particular. The true extent of diversity within several key taxonomic groups is grossly underrepresented in the current NCBI database. The most prominent of these are Prevotella-like sequences, commonly found in subjects with BV, and Lactobacillus-like sequences, common in subjects without BV.

Using Web-based tools freely available to the research community, our analysis provides a comprehensive census of vaginal bacterial communities and their association with BV. It is our hope that the data presented here will stimulate the formulation of new hypotheses about the metabolic functions, syntrophic interactions, and niche partitioning of bacteria colonizing the vaginal ecosystem. Continuing investigations of BV will almost certainly reveal complex syntrophies, cell-to-cell signaling, and bacterial-host interactions that will shed light on how consortia of bacteria interact to form pathogenic communities in the human host.

Supplementary Material

[Supplemental material]


This work was supported by the National Institute of Allergy and Infectious Diseases with grants to D.N.F. (RO1 AI061628) and to J.M.M. (AI052228).

We thank Andrew Millard for assistance with Perl scripting.


[down-pointing small open triangle]Published ahead of print on 16 May 2008.

Supplemental material for this article may be found at


1. Amsel, R., C. A. Spiegel, K. C. Chen, D. Eschenbach, and K. K. Holmes. 1983. Nonspecific vaginitis: diagnostic criteria and microbial and epidemiologic associations. Am. J. Med. 74:14-22. [PubMed]
2. Ashelford, K. E., N. A. Chuzhanova, J. C. Fry, A. J. Jones, and A. J. Weightman. 2006. New screening software shows that most recent large 16S rRNA gene clone libraries contain chimeras. Appl. Environ. Microbiol. 72:5734-5741. [PMC free article] [PubMed]
3. Burton, J. P., and G. Reid. 2002. Evaluation of the bacterial vaginal flora of 20 postmenopausal women by direct (Nugent score) and molecular (polymerase chain reaction and denaturing gradient gel electrophoresis) techniques. J. Infect. Dis. 186:1770-1780. [PubMed]
4. DeSantis, T. Z., P. Hugenholtz, K. Keller, E. L. Brodie, N. Larsen, Y. M. Piceno, R. Phan, and G. L. Andersen. 2006. NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genes. Nucleic Acids Res. 34:394-399. [PMC free article] [PubMed]
5. DeSantis, T. Z., P. Hugenholtz, N. Larsen, M. Rojas, E. L. Brodie, K. Keller, T. Huber, D. Dalevi, P. Hu, and G. L. Andersen. 2006. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl. Environ. Microbiol. 72:5069-5072. [PMC free article] [PubMed]
6. Eckburg, P. B., E. M. Bik, C. N. Bernstein, E. Purdom, L. Dethlefsen, M. Sargent, S. R. Gill, K. E. Nelson, D. A. Relman, Z. Gao, C. H. Tseng, Z. Pei, and M. J. Blaser. 2005. Diversity of the human intestinal microbial flora. Science 308:1635-1638. [PMC free article] [PubMed]
7. Eschenbach, D. A. 1993. History and review of bacterial vaginosis. Am. J. Obstet. Gynecol. 169:441-445. [PubMed]
8. Felsenstein, J. 1989. PHYLIP: Phylogeny Inference Package (version 3.2). Cladistics 5:164-166.
9. Fredricks, D. N., T. L. Fiedler, and J. M. Marrazzo. 2005. Molecular identification of bacteria associated with bacterial vaginosis. N. Engl. J. Med. 353:1899-1911. [PubMed]
10. Fredricks, D. N., and J. M. Marrazzo. 2005. Molecular methodology in determining vaginal flora in health and disease: its time has come. Curr. Infect. Dis. Rep. 7:463-470. [PubMed]
11. Gao, Z., C. H. Tseng, Z. H. Pei, and M. J. Blaser. 2007. Molecular analysis of human forearm superficial skin bacterial biota. Proc. Natl. Acad. Sci. USA 104:2927-2932. [PubMed]
12. Gill, S. R., M. Pop, R. T. Deboy, P. B. Eckburg, P. J. Turnbaugh, B. S. Samuel, J. I. Gordon, D. A. Relman, C. M. Fraser-Liggett, and K. E. Nelson. 2006. Metagenomic analysis of the human distal gut microbiome. Science 312:1355-1359. [PMC free article] [PubMed]
13. Gravett, M. G., D. Hummel, D. A. Eschenbach, and K. K. Holmes. 1986. Preterm labor associated with subclinical amniotic-fluid infection and with bacterial vaginosis. Obstet. Gynecol. 67:229-237. [PubMed]
14. Hillier, S. L., R. P. Nugent, D. A. Eschenbach, M. A. Krohn, R. S. Gibbs, D. H. Martin, M. F. Cotch, R. Edelman, J. G. Pastorek, A. V. Rao, D. McNellis, J. A. Regan, J. C. Carey, and M. A. Klebanoff. 1995. Association between bacterial vaginosis and preterm delivery of a low-birth-weight infant. N. Engl. J. Med. 333:1737-1742. [PubMed]
15. Hyman, R. W., M. Fukushima, L. Diamond, J. Kumm, L. C. Giudice, and R. W. Davis. 2005. Microbes on the human vaginal epithelium. Proc. Natl. Acad. Sci. USA 102:7952-7957. [PubMed]
16. Juretschko, S., A. Loy, A. Lehner, and M. Wagner. 2002. The microbial community composition of a nitrifying-denitrifying activated sludge from an industrial sewage treatment plant analyzed by the full-cycle rRNA approach. Syst. Appl. Microbiol. 25:84-99. [PubMed]
17. Ludwig, W., O. Strunk, R. Westram, L. Richter, H. Meier, Yadhukumar, A. Buchner, T. Lai, S. Steppi, G. Jobb, W. Forster, I. Brettske, S. Gerber, A. W. Ginhart, O. Gross, S. Grumann, S. Hermann, R. Jost, A. Konig, T. Liss, R. Lussmann, M. May, B. Nonhoff, B. Reichel, R. Strehlow, A. Stamatakis, N. Stuckmann, A. Vilbig, M. Lenke, T. Ludwig, A. Bode, and K. H. Schleifer. 2004. ARB: a software environment for sequence data. Nucleic Acids Res. 32:1363-1371. [PMC free article] [PubMed]
18. Nugent, R. P., M. A. Krohn, and S. L. Hillier. 1991. Reliability of diagnosing bacterial vaginosis is improved by a standardized method of Gram stain interpretation. J. Clin. Microbiol. 29:297-301. [PMC free article] [PubMed]
19. Pybus, V., and A. B. Onderdonk. 1997. Evidence for a commensal, symbiotic relationship between Gardnerella vaginalis and Prevotella bivia involving ammonia: potential significance for bacterial vaginosis. J. Infect. Dis. 175:406-413. [PubMed]
20. Pybus, V., and A. B. Onderdonk. 1999. Microbial interactions in the vaginal ecosystem, with emphasis on the pathogenesis of bacterial vaginosis. Microbes Infect. 1:285-292. [PubMed]
21. Saunders, S., A. Bocking, J. Challis, and G. Reid. 2007. Effect of Lactobacillus challenge on Gardnerella vaginalis biofilms. Colloids Surf. B 55:138-142. [PubMed]
22. Schloss, P. D., and J. Handelsman. 2005. Introducing DOTUR, a computer program for defining operational taxonomic units and estimating species richness. Appl. Environ. Microbiol. 71:1501-1506. [PMC free article] [PubMed]
23. Schmid, G., L. Markowitz, R. Joesoef, and E. Koumans. 2000. Bacterial vaginosis and HIV infection. Sex. Transm. Infect. 76:3-4. [PMC free article] [PubMed]
24. Sewankambo, N., R. H. Gray, M. J. Wawer, L. Paxton, D. McNairn, F. WabwireMangen, D. Serwadda, C. J. Li, N. Kiwanuka, S. L. Hillier, L. Rabe, C. A. Gaydos, T. C. Quinn, and J. KondeLule. 1997. HIV-1 infection associated with abnormal vaginal flora morphology and bacterial vaginosis. Lancet 350:546-550. [PubMed]
25. Sha, B. E., M. R. Zariffard, Q. J. Wang, H. Y. Chen, J. Bremer, M. H. Cohen, and G. T. Spear. 2005. Female genital-tract HIV load correlates inversely with Lactobacillus species but positively with bacterial vaginosis and Mycoplasma hominis. J. Infect. Dis. 191:25-32. [PubMed]
26. Sobel, J. D. 2005. What's new in bacterial vaginosis and trichomoniasis? Infect. Dis. Clin. N. Am. 19:387-406. [PubMed]
27. Spiegel, C. A. 2002. Bacterial vaginosis. Rev. Med. Microbiol. 13:43-51.
28. Suzuki, M. T., and S. J. Giovannoni. 1996. Bias caused by template annealing in the amplification of mixtures of 16S rRNA genes by PCR. Appl. Environ. Microbiol. 62:625-630. [PMC free article] [PubMed]
29. Sweet, R. L. 1995. Role of bacterial vaginosis in pelvic inflammatory disease. Clin. Infect. Dis. 20:S271-S275. [PubMed]
30. Turnbaugh, P. J., R. E. Ley, M. A. Mahowald, V. Magrini, E. R. Mardis, and J. I. Gordon. 2006. An obesity-associated gut microbiome with increased capacity for energy harvest. Nature 444:1027-1031. [PubMed]
31. Verhelst, R., H. Verstraelen, G. Claeys, G. Verschraegen, J. Delanghe, L. Van Simaey, C. De Ganck, M. Temmerman, and M. Vaneechoutte. 2004. Cloning of 16S rRNA genes amplified from normal and disturbed vaginal microflora suggests a strong association between Atopobiurn vaginae, Gardnerella vaginalis and bacterial vaginosis. BMC Microbiol. 4:16. [PMC free article] [PubMed]
32. Wayne, L. G., D. J. Brenner, R. R. Colwell, P. A. D. Grimont, O. Kandler, M. I. Krichevsky, L. H. Moore, W. E. C. Moore, R. G. E. Murray, E. Stackebrandt, M. P. Starr, and H. G. Truper. 1987. Report of the ad-hoc-committee on reconciliation of approaches to bacterial systematics. Int. J. Syst. Bacteriol. 37:463-464.
33. Zhou, X., S. J. Bent, M. G. Schneider, C. C. Davis, M. R. Islam, and L. J. Forney. 2004. Characterization of vaginal microbial communities in adult healthy women using cultivation-independent methods. Microbiology 150:2565-2573. [PubMed]
34. Zhou, X., C. J. Brown, Z. Abdo, C. C. Davis, M. A. Hansmann, P. Joyce, J. A. Foster, and L. J. Forney. 2007. Differences in the composition of vaginal microbial communities found in healthy Caucasian and black women. ISME J. 1:121-133. [PubMed]

Articles from Applied and Environmental Microbiology are provided here courtesy of American Society for Microbiology (ASM)