There are at least three different types of intrachromosomal recombination that can effect genotypic variation in bacteria (see reference
34 for a review). These include RecA-dependent homologous recombination, site-specific recombination, and illegitimate recombination (
34). Slipped-strand mispairing is one type of RecA-independent illegitimate recombination that occurs within the bacterial chromosome and can affect either transcription of the gene involved or translation of its mRNA. There are now many examples of slipped-strand mispairing involving homopolymeric or heteropolymeric nucleotide repeats that result in phase variation in bacteria (see references
14,
38, and
39 for reviews). Simple sequence repeats, including poly(A) tracts similar to the
uspA2H gene feature that is the subject of the present study, have been reported to be more common in bacterial pathogens than in other bacteria (
34), perhaps because of their involvement with contingency genes (
5,
6,
38).
In
M. catarrhalis, the UspA1 and Hag surface appendages (
30) are known to undergo a phase variation that involves simple sequence repeats. More specifically, the poly(G) tracts in the
uspA1 and
hag genes have been reported to affect the expression of these two genes in different ways. In
M. catarrhalis strain O35E, a decrease from 10 to 9 G residues in the poly(G) tract located in the 5′ UTR of the
uspA1 gene causes a reduction in the level of
uspA1 mRNA and the amount of UspA1 protein (
23). In contrast, the poly(G) tract found within the
hag ORF affects Hag expression by allowing frameshift mutations that cause premature termination of translation (
25; Sasaki et al., 99th Gen. Meet. Am. Soc. Microbiol.). Both the UspA1 protein and the Hag protein function as adhesins, albeit for different human cell types (
11,
15,
18,
22), and the ability of
M. catarrhalis to alter the expression of these two different gene products via phase variation may affect the ability of this organism to colonize the mucosal surface of the nasopharynx or permit the selection of a population with a phenotypic trait that is advantageous for a particular environmental niche.
A change in the number of A residues in the
uspA2H poly(A) tract of strain O46E had multiple phenotypic effects. The UspA2H-negative variant O46E.U2V lost its ability to resist killing by normal human serum, a trait that is most often associated with disease isolates of this organism (
17,
40,
41). This UspA2H-negative variant also lost its ability to attach to Chang cells in vitro via this protein. What adhesin molecule is directly involved in the attachment of
M. catarrhalis to the human nasopharyngeal mucosa in vivo is not known, but clearly this variant has lost one facet of its attachment capability. In addition, this variant also lacked the rapid-autoagglutination phenotype apparently caused by expression of the UspA2H protein in strain O46E. This resulted in the variant growing in suspension in broth cultures whereas the O46E parent strain rapidly fell out of suspension. This rapid-autoagglutination phenotype likely affected the ability of the O46E strain to attach to other
M. catarrhalis bacteria and form microcolonies or otherwise interact with host cells. Interestingly, the lack of autoagglutination by the O46E.U2V variant is similar to that of a mutant or variant of
M. catarrhalis strain 4223 that did not readily form aggregates and which was cleared from the lungs of mice more slowly than its wild-type parent strain (
21). This same nonaggregating 4223 variant lacked or had altered expression of several antigens, including HMW-OMP (i.e., UspA2).
The amino terminus of the predicted mature UspA2H protein closely resembles that of UspA1 proteins (
22), and the signal peptides are virtually identical in both molecules. In fact, the same poly(A) tract that is present at the 5′ end of the
uspA2H ORF is also present at the 5′ end of the
uspA1 ORF. However, the 5′ UTR of the
uspA2H gene of strain O46E is very different from that of the
uspA1 gene, lacking the poly(G) tract that is characteristic of all of the
M. catarrhalis uspA1 genes analyzed to date (
9). The observed change from eight to seven A residues in the
uspA2H poly(A) tract of the O46E strain is most likely the result of slipped-strand mispairing (
14,
38) and occurred at a frequency of 6.5 × 10
−4. Efforts to determine the frequency of the change from seven to eight A residues were not successful, although there are reports of homopolymeric repeats containing as few as 6 nt in other bacteria being unstable and undergoing reversible loss or gain of nucleotides (
13,
33). The presence of a poly(A) tract containing eight A residues near the 5′ end of the ORF of the
uspA1 gene in several of the
M. catarrhalis strains included in this study raises the possibility that expression of these particular UspA1 proteins could be subject to phase variation involving the deletion of an A residue from this poly(A) tract, resulting in a frameshift mutation that would cause premature termination of translation. In this event, these particular strains could experience phase-variable expression of their UspA1 protein by either of two different mechanisms, one involving the poly(G) tract in the 5′ UTR and the other involving the poly(A) tract within the ORF.
The lack of expression of the UspA2H protein by the O46E.U2V variant described in the present study was the result of a frameshift that occurred immediately downstream from the
uspA2H translation initiation codon. This, in turn, resulted in the occurrence of a premature translational stop codon 74 nt downstream from the poly(A) tract. It is also apparent that this change in the poly(A) tract affected
uspA2H mRNA levels, as measured by Northern blot analysis. It has been known for many years that the stability of some mRNA transcripts is negatively affected by early or premature translation termination (
28). In the present study, both native
M. catarrhalis uspA2H transcripts and
uspA2H-
lacZ fusion-derived transcripts were apparently more readily degraded when the number of A residues in the poly(A) tract was seven instead of eight. Moreover, when this frameshift in the O46E.U2V variant was corrected, either by converting the poly(A) tract from seven to eight A residues or by insertion of a G residue 2 nt downstream from the poly(A) tract, this restoration of the reading frame resulted in readily detectable levels of
uspA2H transcripts. It could be inferred from these data that the frameshift-induced premature termination of translation and a possible reduction in the number of ribosomes on the
uspA2H transcripts from O46E.U2V may have resulted in increased accessibility of these transcripts to endoribonucleolytic enzymes. Differences in RNA secondary structure were apparently not involved (data not shown). Whether the observed frameshift occurs in the host environment and whether the resultant altered phenotype would affect the colonization ability or virulence of
M. catarrhalis remain to be determined.
Finally, in silico analysis of the available nucleotide sequence of the genome of
M. catarrhalis ATCC 43617 (GenBank accession numbers AX067426 to AX067466) revealed the presence of homopolymeric nucleotide repeats containing eight or more nucleotides located just downstream from the beginning of at least nine different ORFs (data not shown), in a manner similar to that observed for the poly(A) tract near the start of the
uspA2H ORF. These included the previously described poly(G) tract in the
hag gene (
25,
30; Sasaki et al., 99th Gen. Meet. Am. Soc. Microbiol.) and the poly(A) tract within the
uspA1 ORF. In addition, at least 20 homopolymeric nucleotide repeats containing eight or more nucleotides were found to be located immediately upstream (within 100 nt) from ORFs (data not shown). Furthermore, it has been reported that several
M. catarrhalis strains contained loci with the tetranucleotide repeat 5′-CAAC-3′ (
29) and that this repeat was associated with phase-variable restriction-modification systems (
36). Taken together, these data suggest that
M. catarrhalis likely contains at least two dozen potentially phase-variable genes.