The recently described Type VI Secretion System (T6SS) represents a new paradigm of protein secretion in bacteria maintaining pathogenic or symbiotic interactions with eukaryotic organisms (reviewed in references [
1-
4]). In this context, T6SSs have been linked to a variety of functions such as adherence, cytotoxicity, host-cell invasion, biofilm formation, survival within macrophages, and persistence within the host [
2,
4].
T6SS are encoded in loci initially known as IAHP (IcmF associated homologous protein) clusters. These gene clusters were characterized by the presence of approximately 15 ORFs surrounding a homolog of IcmF, a protein associated with type IV secretion in
Legionella pneumophila [
5-
7]. In addition to the variable genetic architecture presented by IHAPs in several microbial genomes, low level of sequence identity between components of the system in different bacteria has hampered the identification of new T6SS loci. In spite of this,
in silico analyses identified a set of 13 conserved proteins defined as the T6SS "core components" [
8]. These components carry a distinct COG ID (Cluster of Orthologous Groups of proteins), most of which are unique for T6SS function [
1,
8]. In addition, there are 15 conserved accessory proteins widely distributed among T6SS loci that include specific transcriptional and post-transcriptional regulators [
8].
Most T6SS core components correspond to structural elements of the secretion machinery. The DotU and IcmF homologs (COG3523 and COG3455, respectively) are conserved inner membrane proteins essential for secretion [
9,
10]. A role for IcmF and DotU homologs in membrane stabilization of the T6SS apparatus has been proposed based on observations made for homolog proteins involved in stabilization of the T4SS apparatus in
L. pneumophila [
6]. The ClpV homologs (COG0542) belong to the AAA+ family of ATPases and are hypothesized to energize the system, enabling protein secretion [
9,
11,
12]. Recently, it has been reported that
V. cholerae ClpV interacts through its N-terminal domain with a tubular structure formed by VipA (IglA, COG3516) and VipB (IglB, COG 3517), two conserved and essential T6SS components. Remodeling of VipA/VipB tubules by ClpV-mediated threading is crucial for type VI protein secretion [
13]. In addition, it has been recently proposed that the VipA/VipB structure corresponds to a structural and possibly functional homolog of the tail sheath in bacteriophage T4, as both structures present similar dimensions, symmetry and overall organization [
14].
Several other T6SS core components resemble proteins of bacteriophage origin. This is the case of the hemolysin coregulated protein Hcp (COG3157), the valine-glycine repeat protein VgrG (COG3501) and the gp25-like protein (COG3518), which are homologs to components of the tail and baseplate in bacteriophage T4 [
14,
15]. Interestingly, both Hcp and VgrG appear to be structural and secreted components. Thus, the detection of both proteins in culture supernatants has become an indicator of T6SS functionality [
10,
16,
17]. Some VgrG proteins known as "evolved VgrGs" present a C-terminal extension including "effector domains" that have been linked to a variety of functions, such as crosslinking of host actin, degradation of the peptidoglycan layer, and ADP-ribosylation of host proteins [
4,
16]. The evolved VgrGs have been identified in a limited number of bacteria and are usually encoded outside T6SS loci, scattered throughout the genome.
Following the initial characterization of IAHP clusters [
5], several
in silico analyses have identified T6SS loci in bacterial genome sequences available. In 2008, a screen for homologs to the conserved T6SS components VipA/IglA and VipB/IglB identified 37 T6SS loci in 29 bacteria [
1]. Later that year, a similarity search for orthologs of T6SS components described in
Vibrio cholerae,
Pseudomonas aeruginosa and
Burkholderia mallei identified T6SS loci in 42 bacterial species [
18]. Even though both analyses greatly expanded the number of T6SS loci reported, they were based on the detection of homologs to T6SS components present in only three organisms, overlooking the existence of phylogenetically distant T6SS loci. A more recent study addressed this limitation by analyzing the presence of conserved protein domains rather than orthologous proteins, to identify 176 T6SS loci from 92 different bacteria [
8].
The first report for a T6SS in
Salmonella corresponds to the genetic characterization of the
Salmonella Pathogenicity Island 6 (SPI-6), formerly known as SCI (
Salmonella enterica centisome 7 island), which is adjacent to the tRNA-encoding gene
aspV [
19]. SPI-6 is a ~47 kb genomic island presenting a mosaic structure characterized by the presence of the
saf and
tcf fimbrial operons. Analysis of the region upstream of the fimbrial operons revealed numerous ORFs encoding putative periplasmic, outer membrane and secreted proteins, suggesting that SPI-6 encoded a novel secretion system [
19]. After the classification of SPI-6 as an IAHP locus [
5] and the discovery of T6SSs in
V. cholerae and
P. aeruginosa [
9,
20] it was clear that SPI-6 encoded a T6SS, although there is no experimental evidence on the functionality of this system. Conflicting reports associate SPI-6 T6SS with
Salmonella pathogenesis. A
S. Typhimurium mutant in STM0285 (IcmF homolog) presented an increased intracellular growth in macrophages and was hypervirulent in BALB/c mice [
21]. On the other hand, mutations in STM0272 (ClpV homolog) and STM0291 (Rhs element) have been reported to cause a ~30% reduction in the ability of
S. Typhimurium to replicate in macrophages [
22]. In addition, overexpression of a dominant negative version of STM0272 impaired the ability of
S. Typhimurium to invade epithelial cells [
12].
Despite the recent advances in our understanding of the structural and genomic organization of T6SS gene clusters, every in silico analysis performed so far to identify these loci in bacteria has taken into account a limited number of the currently available Salmonella genome sequences. Considering that many bacterial species harbor multiple T6SS in their genomes, it is plausible to think that there could be more than one T6SS in the genus Salmonella in addition to SPI-6 T6SS waiting to be uncovered. In the present work we performed a genome-wide in silico analysis of all currently available Salmonella genome sequences to identify T6SS loci. Our analysis revealed the presence of 3 novel T6SS gene clusters encoded in differentially-distributed genomic islands presenting distinctive evolutionary histories.