P2CS (http://www.p2cs.org) is a specialized database for prokaryotic two-component systems (TCSs), virtually ubiquitous signalling proteins which regulate a wide range of physiological processes. The primary aim of the database is to annotate and classify TCS proteins from completely sequenced prokaryotic genomes and metagenomes. Information within P2CS can be accessed through a variety of routes—TCS complements can be browsed by metagenome, replicon or sequence cluster (and these genesets are available for download by users). Alternatively a variety of database-wide or taxon-specific searches are supported. Each TCS protein is fully annotated with sequence-feature information including replicon context, while properties of the predicted proteins can be queried against several external prediction servers to suggest homologues, interaction networks, sub-cellular localization and domain complements. Another unique feature of P2CS is the analysis of ORFeomes to identify TCS genes missed during genome annotation. Recent innovations for P2CS include a CGView representation of the distribution of TCS genes around a replicon, categorization of TCS genes based on gene organization, an expanded domain-based classification scheme, a P2CS ‘gene cart’ and categorization on the basis of sequence clusters.



This article has been
bp of each other and encoded on the same strand. TCS gene loci are categorized as orphans (one gene), pairs (two genes), triads (three genes), tetrads (four genes) or pentads+ (five or more genes). In the current version of P2CS-containing 77357 TCS genes, triads and tetrads are relatively common (5% of all TCS genes). However, larger clusters are rare (26 pentads, five hexads, one heptad and one decad, together accounting for only 0.2% of all TCS genes). The decad is a particularly intriguing locus comprising genes Glov_1564 to Glov_1573 from Geobacter lovleyi SZ, together encoding nine receiver and three transmitter domains. Gene organization summaries are presented on each replicon page, while each gene page describes the gene organization and allows navigation to other genes in the locus.