Streptococcus agalactiae, also referred to as Group B Streptococcus, is a frequent resident of the rectovaginal tract in humans, and a major cause of neonatal infection. The pathogen can also infect adults with underlying disease, particularly the elderly and immunocompromised ones. In addition, S. agalactiae is a known fish pathogen, which compromises food safety and represents a zoonotic hazard. This study provides valuable structural, functional and evolutionary genomic information of a human S. agalactiae serotype Ia (ST-103) GBS85147 strain isolated from the oropharynx of an adult patient from Rio de Janeiro, thereby representing the first human isolate in Brazil. We used the Ion Torrent PGM platform with the 200 bp fragment library sequencing kit. The sequencing generated 578,082,183 bp, distributed among 2,973,022 reads, resulting in an approximately 246-fold mean coverage depth and was assembled using the Mira Assembler v3.9.18. The S. agalactiae strain GBS85147 comprises of a circular chromosome with a final genome length of 1,996,151 bp containing 1,915 protein-coding genes, 18 rRNA, 63 tRNA, 2 pseudogenes and a G + C content of 35.48 %.
Electronic supplementary material
The online version of this article (doi:10.1186/s40793-016-0158-6) contains supplementary material, which is available to authorized users.
Streptococcus agalactiae; Human pathogenic bacteria; Oropharynx; Complete genome sequence; Ion torrent
Studies have detected mis-assemblies in genomes of the species Corynebacterium pseudotuberculosis. These new discover have been possible due to the evolution of the Next-Generation Sequencing platforms, which have provided sequencing with accuracy and reduced costs. In addition, the improving of techniques for construction of high accuracy genomic maps, for example, Whole-genome mapping (WGM) (OpGen Inc), have allow high-resolution assembly that can detect large rearrangements.
In this work, we present the resequencing of Corynebacterium pseudotuberculosis strain 1002 (Cp1002). Cp1002 was the first strain of this species sequenced in Brazil, and its genome has been used as model for several studies in silico of caseous lymphadenitis disease. The sequencing was performed using the platform Ion PGM and fragment library (200 bp kit). A restriction map was constructed, using the technique of WGM with the enzyme KpnI. After the new assembly process, using WGM as scaffolder, we detected a large inversion with size bigger than one-half of genome. A specific analysis using BLAST and NR database shows that the inversion occurs between two homology RNA ribosomal regions.
In conclusion, the results showed by WGM could be used to detect mismatches in assemblies, providing genomic maps with high resolution and allow assemblies with more accuracy and completeness. The new assembly of C. pseudotuberculosis was deposited in GenBank under the accession no. CP012837.
Electronic supplementary material
The online version of this article (doi:10.1186/s12864-016-2673-7) contains supplementary material, which is available to authorized users.
Genomics; Sequencing; Optical mapping; Mis-assembly
Corynebacterium pseudotuberculosis causes significant loss to goat and sheep farmers because it is the causal agent of the infectious disease caseous lymphadenitis, which may lead to outcomes ranging from skin injury to animal death (Ruiz et al., 2011) . This bacterium was grown under osmotic (2 M), acid (pH) and heat (50 °C) stress and under control (Normal-BHI brain heart infusion) conditions, which simulate the conditions faced by the bacteria during the infectious process. Subsequently, cDNA of each condition was sequenced by the SOLiD3 Plus platform using the RNA-Seq technique , , . The data produced was processed to evaluate the differential gene expression, which is helpful to understand the adaptation mechanisms during the infection in the host. The sequencing data of all conditions are available in the European Bioinformatics Institute (EBI) repository under accession number E-MTAB-2017.
Corynebaterium pseudotuberculosis; RNA-seq; SOLiD; Infection stresses
Lactococcus lactis subsp. lactis NCDO 2118 is a nondairy lactic acid bacterium, a xylose fermenter, and a gamma-aminobutyric acid (GABA) producer isolated from frozen peas. Here, we report the complete genome sequence of L. lactis NCDO 2118, a strain with probiotic potential activity.
The completion of whole-genome sequencing for Corynebacterium pseudotuberculosis strain 1002 has contributed to major advances in research aimed at understanding the biology of this microorganism. This bacterium causes significant loss to goat and sheep farmers because it is the causal agent of the infectious disease caseous lymphadenitis, which may lead to outcomes ranging from skin injury to animal death. In the current study, we simulated the conditions experienced by the bacteria during host infection. By sequencing transcripts using the SOLiDTM 3 Plus platform, we identified new targets expected to potentiate the survival and replication of the pathogen in adverse environments. These results may also identify possible candidates useful for the development of vaccines, diagnostic kits or therapies aimed at the reduction of losses in agribusiness.
Under the 3 simulated conditions (acid, osmotic and thermal shock stresses), 474 differentially expressed genes exhibiting at least a 2-fold change in expression levels were identified. Important genes to the infection process were induced, such as those involved in virulence, defence against oxidative stress, adhesion and regulation, and many genes encoded hypothetical proteins, indicating that further investigation of the bacterium is necessary. The data will contribute to a better understanding of the biology of C. pseudotuberculosis and to studies investigating strategies to control the disease.
Despite the veterinary importance of C. pseudotuberculosis, the bacterium is poorly characterised; therefore, effective treatments for caseous lymphadenitis have been difficult to establish. Through the use of RNAseq, these results provide a better biological understanding of this bacterium, shed light on the most likely survival mechanisms used by this microorganism in adverse environments and identify candidates that may help reduce or even eradicate the problems caused by this disease.
Differential gene expression; Transcripts; RNAseq; SOLID™; Stress; C. pseudotuberculosis
Since the first successful attempt at sequencing the Corynebacterium pseudotuberculosis genome, large amounts of genomic, transcriptomic and proteomic data have been generated. C. pseudotuberculosis is an interesting bacterium due to its great zoonotic potential and because it causes considerable economic losses worldwide. Furthermore, different strains of C. pseudotuberculosis are capable of causing various diseases in different hosts. Currently, we seek information about the phylogenetic relationships between different strains of C. pseudotuberculosis isolates from different hosts across the world and to employ these data to develop tools to diagnose and eradicate the diseases these strains cause. In this review, we present the latest findings on C. pseudotuberculosis that have been obtained with the most advanced techniques for sequencing and genomic organization. We also discuss the development of in silico tools for processing these data to prompt a better understanding of this pathogen.
Corynebacterium pseudotuberculosis; SOLiD next generation sequencing; Ion Torrent next generation sequencing; SDS-PAGE; mass spectrometry; RNA-seq
Corynebacterium pseudotuberculosis is of major veterinary importance because it affects many animal species, causing economically significant livestock diseases and losses. Therefore, the genomic sequencing of various lines of this organism, isolated from different hosts, will aid in the development of diagnostic methods and new prevention and treatment strategies and improve our knowledge of the biology of this microorganism. In this study, we present the genome of C. pseudotuberculosis Cp31, isolated from a buffalo in Egypt.
Exiguobacterium antarcticum is a psychotropic bacterium isolated for the first time from microbial mats of Lake Fryxell in Antarctica. Many organisms of the genus Exiguobacterium are extremophiles and have properties of biotechnological interest, e.g., the capacity to adapt to cold, which make this genus a target for discovering new enzymes, such as lipases and proteases, in addition to improving our understanding of the mechanisms of adaptation and survival at low temperatures. This study presents the genome of E. antarcticum B7, isolated from a biofilm sample of Ginger Lake on King George Island, Antarctic peninsula.
The bacterium Corynebacterium pseudotuberculosis is of major veterinary importance because it affects livestock, particularly sheep, goats, and horses, in several countries, including Australia, Brazil, the United States, and Canada, resulting in significant economic losses. In the present study, we describe the complete genome of the Corynebacterium pseudotuberculosis Cp316 strain, biovar equi, isolated from the abscess of a North American horse.
Corynebacterium pseudotuberculosis is a pathogen of great veterinary and economic importance, since it affects livestock, mainly sheep and goats, worldwide, together with reports of its presence in camels in several Arabic, Asiatic, and East and West African countries, as well as Australia. In this article, we report the genome sequence of Corynebacterium pseudotuberculosis strain Cp162, collected from the external neck abscess of a camel in the United Kingdom.
Here, we report the whole-genome sequences of two ovine-pathogenic Corynebacterium pseudotuberculosis isolates: strain 3/99-5, which represents the first C. pseudotuberculosis genome originating from the United Kingdom, and 42/02-A, the second from Australia. These genome sequences will contribute to the objective of determining the global pan-genome of this bacterium.
Corynebacterium pseudotuberculosis equi is a Gram-positive pathogenic bacterium which affects a variety of hosts. Besides the great economic losses it causes to horse-breeders, this organism is also known to be an important infectious agent to cattle and buffaloes. As an outcome of the efforts in characterizing the molecular basis of its virulence, several complete genome sequences were made available in recent years, enabling the large-scale assessment of genes throughout distinct isolates. Meanwhile, the RNA-seq stood out as the technology of choice for comprehensive transcriptome studies, which may bring valuable information regarding active genomic regions, despite of the still impeditive associated costs. In an attempt to increase the use of generated reads per instrument run, by effectively eliminating unwanted rRNAs from total RNA samples without relying on any commercially available kits, we applied denaturing high-performance liquid chromatography (DHPLC) as an alternative method to assess the transcriptional profile of C. pseudotuberculosis. We have found that the DHPLC depletion method, allied to Ion Torrent sequencing, allows mapping of transcripts in a comprehensive way and identifying novel transcripts when a de novo approach is used. These data encourage us to use DHPLC in future transcriptional evaluations in C. pseudotuberculosis.
Corynebacterium pseudotuberculosis causes disease in several animal species, although distinct biovars exist that appear to be restricted to specific hosts. In order to facilitate a better understanding of the differences between biovars, we report here the complete genome sequence of the equine pathogen Corynebacterium pseudotuberculosis strain 1/06-A.
Corynebacterium pseudotuberculosis is a facultative intracellular pathogen and the causative agent of several infectious and contagious chronic diseases, including caseous lymphadenitis, ulcerative lymphangitis, mastitis, and edematous skin disease, in a broad spectrum of hosts. In addition, Corynebacterium pseudotuberculosis infections pose a rising worldwide economic problem in ruminants. The complete genome sequences of 15 C. pseudotuberculosis strains isolated from different hosts and countries were comparatively analyzed using a pan-genomic strategy. Phylogenomic, pan-genomic, core genomic, and singleton analyses revealed close relationships among pathogenic corynebacteria, the clonal-like behavior of C. pseudotuberculosis and slow increases in the sizes of pan-genomes. According to extrapolations based on the pan-genomes, core genomes and singletons, the C. pseudotuberculosis biovar ovis shows a more clonal-like behavior than the C. pseudotuberculosis biovar equi. Most of the variable genes of the biovar ovis strains were acquired in a block through horizontal gene transfer and are highly conserved, whereas the biovar equi strains contain great variability, both intra- and inter-biovar, in the 16 detected pathogenicity islands (PAIs). With respect to the gene content of the PAIs, the most interesting finding is the high similarity of the pilus genes in the biovar ovis strains compared with the great variability of these genes in the biovar equi strains. Concluding, the polymerization of complete pilus structures in biovar ovis could be responsible for a remarkable ability of these strains to spread throughout host tissues and penetrate cells to live intracellularly, in contrast with the biovar equi, which rarely attacks visceral organs. Intracellularly, the biovar ovis strains are expected to have less contact with other organisms than the biovar equi strains, thereby explaining the significant clonal-like behavior of the biovar ovis strains.
In this work we report the genome of Corynebacterium pseudotuberculosis strain 267, isolated from a llama. This pathogen is of great veterinary and economic importance, as it is the cause of caseous lymphadenitis in several livestock species around the world and causes significant losses due to the high cost of treatment.
The Actinobacteria, Corynebacterium pseudotuberculosis strain P54B96, a nonmotile, non-sporulating and a mesophile bacterium, was isolated from liver, lung and mediastinal lymph node lesions in an antelope from South Africa. This strain is interesting in the sense that it has been found together with non-tuberculous mycobacteria (NTMs) which could nevertheless play a role in the lesion formation. In this work, we describe a set of features of C. pseudotuberculosis P54B96, together with the details of the complete genome sequence and annotation. The genome comprises of 2.34 Mbp long, single circular genome with 2,084 protein-coding genes, 12 rRNA, 49 tRNA and 62 pseudogenes and a G+C content of 52.19%. The analysis of the genome sequence provides means to better understanding the molecular and genetic basis of virulence of this bacterium, enabling a detailed investigation of its pathogenesis.
s: biovar ovis; Gram-positive pathogen; caseous lymphadenitis/cheesy gland disease; liver lesion; Antelope; genome sequencing; Ion Torrent
Pan-genomic studies aim, for instance, at defining the core, dispensable and unique genes within a species. A pan-genomics study for vaccine design tries to assess the best candidates for a vaccine against a specific pathogen. In this context, rather than studying genes predicted to be exported in a single genome, with pan-genomics it is possible to study genes present in different strains within the same species, such as virulence factors. The target organism of this pan-genomic work here presented is Corynebacterium pseudotuberculosis, the etiologic agent of caseous lymphadenitis (CLA) in goat and sheep, which causes significant economic losses in those herds around the world. Currently, only a few antigens against CLA are known as being the basis of commercial and still ineffective vaccines. In this regard, the here presented work analyses, in silico, five C. pseudotuberculosis genomes and gathers data to predict common exported proteins in all five genomes. These candidates were also compared to two recent C. pseudotuberculosis in vitro exoproteome results.
The complete genome of five C. pseudotuberculosis strains (1002, C231, I19, FRC41 and PAT10) were submitted to pan-genomics analysis, yielding 306, 59 and 12 gene sets, respectively, representing the core, dispensable and unique in silico predicted exported pan-genomes. These sets bear 150 genes classified as secreted (SEC) and 227 as potentially surface exposed (PSE). Our findings suggest that the main C. pseudotuberculosis in vitro exoproteome could be greater, appended by a fraction of the 35 proteins formerly predicted as making part of the variant in vitro exoproteome. These genomes were manually curated for correct methionine initiation and redeposited with a total of 1885 homogenized genes.
The in silico prediction of exported proteins has allowed to define a list of putative vaccine candidate genes present in all five complete C. pseudotuberculosis genomes. Moreover, it has also been possible to define the in silico predicted dispensable and unique C. pseudotuberculosis exported proteins. These results provide in silico evidence to further guide experiments in the areas of vaccines, diagnosis and drugs. The work here presented is the first whole C. pseudotuberculosis in silico predicted pan-exoproteome completed till today.
In this work, we report the whole-genome sequence of Corynebacterium pseudotuberculosis bv. equi strain CIP 52.97 (Collection Institut Pasteur), isolated in 1952 from a case of ulcerative lymphangitis in a Kenyan horse, which has evidently caused significant losses to agribusiness. Therefore, obtaining this genome will allow the detection of important targets for postgenomic studies, with the aim of minimizing problems caused by this microorganism.
In this work, we report the complete genome sequence of a Corynebacterium pseudotuberculosis PAT10 isolate, collected from a lung abscess in an Argentine sheep in Patagonia, whose pathogen also required an investigation of its pathogenesis. Thus, the analysis of the genome sequence offers a means to better understanding of the molecular and genetic basis of virulence of this bacterium.
This work reports the completion and annotation of the genome sequence of Corynebacterium pseudotuberculosis I19, isolated from an Israeli dairy cow with severe clinical mastitis. To present the whole-genome sequence, a de novo assembly approach using 33 million short (25-bp) mate-paired SOLiD reads only was applied. Furthermore, the automatic, functional, and manual annotations were attained with the use of several algorithms in a multistep process.
Corynebacterium pseudotuberculosis, a Gram-positive, facultative intracellular pathogen, is the etiologic agent of the disease known as caseous lymphadenitis (CL). CL mainly affects small ruminants, such as goats and sheep; it also causes infections in humans, though rarely. This species is distributed worldwide, but it has the most serious economic impact in Oceania, Africa and South America. Although C. pseudotuberculosis causes major health and productivity problems for livestock, little is known about the molecular basis of its pathogenicity.
Methodology and Findings
We characterized two C. pseudotuberculosis genomes (Cp1002, isolated from goats; and CpC231, isolated from sheep). Analysis of the predicted genomes showed high similarity in genomic architecture, gene content and genetic order. When C. pseudotuberculosis was compared with other Corynebacterium species, it became evident that this pathogenic species has lost numerous genes, resulting in one of the smallest genomes in the genus. Other differences that could be part of the adaptation to pathogenicity include a lower GC content, of about 52%, and a reduced gene repertoire. The C. pseudotuberculosis genome also includes seven putative pathogenicity islands, which contain several classical virulence factors, including genes for fimbrial subunits, adhesion factors, iron uptake and secreted toxins. Additionally, all of the virulence factors in the islands have characteristics that indicate horizontal transfer.
These particular genome characteristics of C. pseudotuberculosis, as well as its acquired virulence factors in pathogenicity islands, provide evidence of its lifestyle and of the pathogenicity pathways used by this pathogen in the infection process. All genomes cited in this study are available in the NCBI Genbank database (http://www.ncbi.nlm.nih.gov/genbank/) under accession numbers CP001809 and CP001829.