All cells possess quality control mechanisms that ensure degradation of defective protein and mRNA. Eukaryotic cells have a surveillance system called nonsense-mediated mRNA decay (NMD) that removes aberrant mRNAs containing a premature termination codon in their protein coding regions. The NMD pathway is triggered during the first round of translation of the aberrant mRNA thus ensuring that synthesis of truncated, potentially harmful proteins is prevented (
1,
2). A related mechanism is operational in bacteria. Here, quality control of gene expression is accomplished by an interplay between tmRNA (
3,
4) and ‘toxins’ that cleave mRNA at the ribosomal A-site (
5–
8). tmRNA is both a tRNA and an mRNA that recognizes ribosomes that are locked by translation of broken (or non-stop) mRNAs. Ribosomes trapped on non-stop mRNAs cannot terminate translation by the regular termination pathway. Such ribosomes are rescued by tmRNA in a reaction called
trans-translation that simultaneously mediates ribosome recycling and tagging of incomplete proteins for degradation by cellular proteases. Recently, two families of ribonucleases, RelE and MazF, which block translation by cleavage of mRNAs, were identified (
5,
6,
8,
9). Such mRNAs lack their natural stop-codons and tmRNA is needed to release ribosomes locked at their termini. Consistently, tmRNA counteracted the toxic effect of RelE and MazF overexpression and cells devoid of tmRNA became hypersensitive to the toxins (
5,
6). Together these results suggest that the RelE and MazF toxins function in quality control of gene expression.
The
relBE locus of
E.coli encodes RelE toxin and RelB antitoxin. RelB counteracts RelE activity by direct protein–protein interaction (
10). RelB also represses
relBE transcription and RelE acts as a co-repressor of
relBE transcription (
10). The
mazEF locus has a very similar genetic organization (
11). Nutritional stresses, such as amino acid and glucose starvation, activates RelE and MazF to inhibit translation by mRNA cleavage (
5,
12). Activation of RelE and MazF depends on Lon protease (
5,
12). During nutritional stress, Lon degrades RelB and most likely also MazE (
5,
12). In wild-type cells, the simultaneous degradation of RelB and MazF antitoxins has two effects that act in concert: it increases RelE and MazE activities and it increases the transcription rates of the
relBE and
mazEF operons (
5,
12). In turn, theincreased transcription rates sustain toxin synthesis during the stress period. Ectopic expression of RelE or MazF inhibited translation and conferred rapid loss of colony formation (
5,
12). However, cell viability could be fully regained by later induction of
relB transcription, thus indicating that even efficient overproduction of RelE or MazF did not confer cell death (
12,
13).
Toxin–antitoxin loci were discovered due to their ability to stabilize plasmids by post-segregational killing (PSK) (
14,
15). Plasmid stabilization is a consequence of the differential stabilities of the toxins and antitoxins: since the antitoxins are metabolically unstable, cells that lose a TA locus experience activation of the toxin that, in turn, prevents further cell growth of the plasmid-free cells. In a growing bacterial population, this results in phenotypic stabilization of plasmids that carry a TA locus (
16). During the years, seven plasmid-encoded TA families have been described (
15,
17). The TA loci belonging to these seven families are listed in the order of discovery:
ccd of F (
14),
parD/
pem of R1/R100 (
18),
vapBC of a
Salmonella dublin virulence plasmid (
19),
phd/
doc of P1 (
20),
parDE of RK2 (
21),
higBA of Rts1 (
22) and
relBE of P307 (
23). All TA loci belonging to these seven families have the same modular genetic set-up and overall similar regulatory and phenotypic properties, except for
higBA that has a reversed gene order (
higB toxin gene is located upstream of
higA that encodes the antitoxin) (
17). The elucidation of the cellular targets of the toxins has been of particular interest: CcdB of F and ParE of RK2 inhibit DNA replication by inhibiting DNA gyrase (
24,
25) and PemK of R1/R100 and RelE of P307 inhibit translation by mRNA cleavage (
23,
26). Indirect evidence suggests that Doc inhibits translation (
27), whereas the targets of VapC and HigB are not yet known.
Toxin–antitoxin loci are also present on bacterial chromosomes, often in multiple copies. Thus, the chromosome of
E.coli K-12 encodes three
relBE homologous loci (
relBE,
dinJ yafQ and
yoefM yoeB) (
17,
28) and two
mazEF homologous loci (initially called
chpA and
chpB for
chromosomal
homologs of
plasmid-encoded genes) (
29). Recent work has shown that chromosomal TA loci are surprisingly abundant in both bacteria and archaea, and exhibit very complex phylogenetic patterns (
17,
30–
32). While we performed this work, it was described that the RelE, ParE and HigB toxins constitute a large superfamily of toxins (
28). It was also proposed that the VapC PIN-domain proteins are ribonucleases that may constitute an evolutionary link between NMD in eukaryotes and quality control of gene expression in prokaryotes (
28). More recently, the structure of the first VapC toxin was solved (
33).
Here, we present an exhaustive search for TA loci in 126 totally sequenced prokaryotic genomes. We identify 671 complete TA loci belonging to the seven known TA gene families. Strikingly, we find that TA loci are surprisingly abundant in free-living prokaryotes, but are virtually absent from restricted and obligate host-associated organisms. The marine bacterium Vibrio cholerae has 13 TA loci, all located within the mega-integron on chromosome II. All 13 TA loci have closely linked attC sites, strongly suggesting that they are bona fide integron elements that are transposed via the integron-encoded integrase. The overall phylogenetic pattern supports that TA loci are stress-response elements that function in quality control of gene expression particularly beneficial to slowly growing free-living prokaryotes. The extensive database mining presented here will be highly useful in the further characterization of prokaryotic TA loci.