It has been 20 years since the first bacterial response regulator genes were sequenced (
35,
39,
89) and the CheY-like receiver (phosphoacceptor) domain was identified as their common regulatory module (
95,
124,
138; see references
55,
57,
127, and
150 for reviews and reference
43 for a historical perspective). Structural characterization of the CheY-like receiver domain (hereafter, the REC domain) from
Escherichia coli and other organisms confirmed that it is an autonomously folding, evolutionarily stable, compact structural unit (
125,
126,
147) whose conformation undergoes a distinctive change upon phosphorylation (
65,
75). These conformational changes, which increase the propensity of the REC domain to form dimers, are used by bacterial and archaeal cells to transmit and propagate a wide variety of environmental and intracellular signals (
29,
127,
150).
The typical scheme of bacterial two-component signal transduction involves signal sensing (ligand binding or other conformational change) by a sensory histidine kinase that leads to its autophosphorylation, followed by phosphoryl transfer to the Asp residue in the N-terminal REC domain of the cognate response regulator, which affects the properties of the C-terminal DNA-binding domain (
127,
150). Structural characterization of the DNA-binding domains of various response regulators revealed several variations on the common helix-turn-helix (HTH) theme, exemplified by the NarL-type, OmpR-type “winged helix,” Spo0A-type, and Fis-like structures (
12,
78,
79,
87,
88,
100). Sequence comparisons revealed additional types of DNA-binding domains in certain response regulators, some of which have been experimentally verified (
94,
113).
In addition to forming associations with various DNA-binding domains, the REC domain can function as a stand-alone module. Its propensity to protein-protein interactions plays a key role in the chemotaxis machinery, as well as in some other regulatory pathways (
10,
130). It also forms other types of response regulators by associating with certain enzymatic or protein-binding domains, e.g., with the methylesterase (CheB) and chemotaxis modulator (CheW) domains, both of which participate in the adaptation to attractants during chemotaxis (
61). In the CheB-type response regulator, the N-terminal REC domain packs against the active site of the C-terminal methylesterase domain and inhibits methylesterase activity by restricting access to the active site (
34). The conformational change occurring upon phosphorylation of the REC domain apparently disrupts this interaction and relieves the methylesterase domain from inhibition by REC (
34). In the
Bacillus subtilis CheV protein, the presence of the N-terminal CheW-like domain was shown to stabilize the phosphorylated state of the C-terminal REC domain (
40,
61).
The REC domain can also form combinations with other signaling domains, e.g., in the
Caulobacter crescentus PleD response regulator, which controls flagellum ejection and stalk formation during the
C. crescentus life cycle (
1,
2,
52). In PleD, the N-terminal REC domain is fused to an inactivated REC domain and a C-terminal GGDEF domain, which has diguanylate cyclase activity that produces bis-(3′→5′)-cyclic diguanosine monophosphate (c-di-GMP), a secondary messenger in bacteria (
99,
112; see references
58 and
109 for reviews). In
Vibrio cholerae response regulator VieA, which controls biofilm formation by this organism (
137), the REC domain is associated with the EAL domain, a c-di-GMP-specific phosphodiesterase (
22,
30,
114,
131). A combination of the REC domain with another c-di-GMP-specific phosphodiesterase, the HD-GYP domain, controls biosynthesis of extracellular polysaccharide in
Xanthomonas campestris (
45,
111,
121).
The response regulator sets encoded in
E. coli,
B. subtilis,
Synechococystis sp.,
Streptococcus pneumoniae, and several other bacteria have been tabulated and classified (
48,
51,
74,
90,
91,
97,
146,
148). There have been several attempts to compile comprehensive lists of response regulators in all sequenced microbial genomes (
11,
66,
70,
141), as well as to list response regulators in microbial genome databases, such as KEGG, SENTRA, and COG (
60,
84,
134). However, the extreme diversity of REC domain combinations encoded in recently sequenced microbial genomes continues to generate problems during genome annotation (
43,
157). We have already discussed the most common errors in annotating signaling proteins and suggested that, unless exact function of the newly described protein is known, its annotation should be based on its domain composition, rather than on sequence similarity of any given domain (
38,
109). To assist in this process, I present here a comprehensive domain-based classification of the response regulators encoded in completely sequenced genomes of 200 bacterial and archaeal species and describe domain architectures that involve the REC domain and known DNA-binding, enzymatic, and ligand-binding domains, as well as several new domains, not described previously.