|Home | About | Journals | Submit | Contact Us | Français|
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
The Correia Repeat Enclosed Element (CREE) of the Neisseria spp., with its inverted repeat and conserved core structure, can generate a promoter sequence at either or both ends, can bind IHF, and can bind RNase III and either be cleaved by it or protected by it. As such, the presence of this element can directly control the expression of adjacent genes. Previous work has shown differences in regulation of gene expression between neisserial strains and species due to the presence of a CREE. These interruptions perhaps remove the expression of CREE-associated genes from ancestral neisserial regulatory networks.
Analysis of the chromosomal locations of the CREE in Neisseria gonorrhoeae strain FA1090 and N. gonorrhoeae strain NCCP11945 has revealed that most of the over 120 copies of the element are conserved in location between these genome sequences. However, there are some notable exceptions, including differences in the presence and sequence of CREE 5' of copies of the opacity protein gene opa, differences in the potential to bind IHF, and differences in the potential to be cleaved by RNase III.
The presence of CREE insertions in one strain relative to the other, CREE within a prophage region, and CREE disrupting coding sequences, provide strong evidence of mobility of this element in N. gonorrhoeae. Due to the previously demonstrated role of these elements in altering transcriptional control and the findings from comparing the two gonococcal genome sequences, it is suggested that regulatory differences orchestrated by CREE contribute to the differences between strains and also between the closely related yet clinically distinct species N. gonorrhoeae, Neisseria meningitidis, and Neisseria lactamica.
The genome sequences of the Neisseria spp. contain 100 or more copies of a repetitive sequence that has demonstrated roles in gene regulation and is involved in chromosomal changes such as gene inactivation and rearrangements. The Correia Repeat (CR) of 26 bp was first described in 1986 , and has only been found in Neisseria spp., both pathogenic (Neisseria gonorrhoeae, the gonococcus, and Neisseria meningitidis, the meningococcus) and commensal (Neisseria lactamica) [2,3]. Often it is found as an inverted repeat with a characteristic core , the Correia Repeat Enclosed Element (CREE) , which is most commonly 153–157 bp or 104–108 bp (Figure (Figure1).1). The CREE has features of an IS element including the presence of the inverted repeat CR sequences  and duplication of the target sequence [6,7]. In addition, there are vast differences in their placement in the neisserial genome sequences and some coding sequences have been disrupted by CREE [3,5,7-9]. This suggests they are, or have been, mobile genetic elements. Unlike IS elements that encode their own mobilizing transposase, the CREE have never been demonstrated to encode proteins and the mobilizing mechanisms for CR and CREE distribution in the chromosome have not been determined.
Two functional σ70 promoters have been described as being generated by CREE insertion 5' of genes [10,11] (Figure (Figure2).2). The first of these, the Black promoter, is at the end of the CREE closest to the gene . The Black promoter is comprised of a -35 and a partial -10 element within the CREE. Insertion in some locations completes the -10 promoter element (Figure (Figure2).2). Functional Black promoters are present 5' of uvrB , drg , lst , and mtrCDE . The second σ70 promoter, the Snyder promoter, was identified at the end of the CREE farthest from the gene. In this case the -35 element comes from the native sequence and the -10 is contained completely within the first 6 bases of the CREE (Figure (Figure2).2). A functional Snyder promoter was first identified 5' of dcw gene dcaC in N. lactamica .
The endoribonuclease RNase III is involved in the processing of stable RNAs such as rRNA and some mRNA transcripts . The stem-loop of the CREE generated by the inverted repeats (Figures (Figures11 and and2)2) has been determined to be a binding site for RNase III when the CREE sequence is present within the mRNA . The binding of RNase III regulates gene expression post-transcriptionally either through cleavage or protection from cleavage by RNase III, depending on the CR inverted repeat sequence [14,16,17].
The CREE may have other functions as well. For example, some of the longer 153–157 bp CREE contain an IHF binding site (Figures (Figures11 and and2).2). This may be involved in end synapsis during element transposition  or participate in the modulation of regulation of associated genes . CREE may also be hotspots for genomic recombination and rearrangement [5,7,18,19]. Additionally, CREE are associated with gene loss through deletion [18,20] and gene disruption [8,21,22].
Despite all of these known functions, we still do not know whether CREE are currently mobile within these species or whether their mobilization occurred in the distant past in the evolution of these bacteria and the mobilizing element has now been lost. None are present in known or suspected regions of horizontally transferred DNA [6,9,23,24], which suggests that mobility has been lost. However, diversity in the distribution in the N. meningitidis sequenced chromosomes [5,8,9,25] and rearrangements between the genome sequences strongly suggest that CREE are mobile in meningococci.
The state of the CREE in N. gonorrhoeae could previously only be investigated in the context of its comparison with N. meningitidis due to the availability of only one gonococcal genome sequence [GenBank: RefSeq NC_002946]. In the meningococcus over 250 CREE have been identified [5,8,9,25], yet less than half this number have been reported for gonococcal strain FA1090 [5,6]. Is this the result of a species difference in CREE copy number between the gonococcus and the meningococcus? Or, has the relatively longer laboratory propagation of strain FA1090 resulted in the deletion of genetic material including CREE sequences? If the latter is the case then comparison with a more recently acquired and less passaged gonococcal isolate should provide evidence of CREE-mediated deletion events. Likewise, variations in CREE locations between two gonococcal genome sequences might suggest that mobilization has occurred in N. gonorrhoeae since its split with the other neisserial species and that such CREE movement might still be occurring. The recent publication of the complete genome sequence of N. gonorrhoeae strain NCCP11945, an isolate from a 2002 vaginal smear , has opened the way to better understand this human sexually-transmitted pathogen through comparative genome analysis. Here we investigate the CREE of N. gonorrhoeae strains FA1090 and NCCP11945.
A new analysis of the N. gonorrhoeae strain FA1090 genome sequence was conducted, identifying a total of 123 CREE and 9 single CR sequences (see Additional file 1). A total of 131 CREE and 6 single CR sequences were identified in the genome sequence of N. gonorrhoeae strain NCCP11945 (see Additional file 2). The CREE are fairly evenly distributed in the genomes (Figure (Figure3).3). For both genome sequences, 17 of the CREE contain a perfect IHF binding site based on the published CREE IHF binding site sequence , although these 17 are different between the two strains (see Additional file 3). The IHF-binding site-containing CREE are 154 to 157 bp, except for a 143 bp variant that has an 11 bp deletion in its CR in strain NCCP11945 (Figure (Figure33 and Additional file 3). The potential to be cleaved by RNase III is dependent upon the symmetry of the CR ends of the CREE . In strain FA1090, 92 have symmetrical CR ends and 100 CREE have symmetrical ends in strain NCCP11945.
Of the 123 CREE in strain FA1090, 120 are at the same locations in strain NCCP11945 (see Additional file 4). Of these, 83 (69.16%) are nearly identical with 3 or fewer nucleotide differences in sequence and no differences in length. Some of these are 5' of genes that are integral to gonococcal biology (Table (Table11).
The CREE have become associated with various systems within the gonococcus including those involved in LPS biosynthesis, pilus expression, DNA repair, iron acquisition, adhesion, competence, and pathogen-specific gene regulation. It is possible that variations in the presence of CREE associated with such genes may account for differences in strain behaviour and phenotypes that differ between the gonococcus and the meningococcus. For example, in N. gonorrhoeae the MtrCDE efflux pump system, involved in antibiotic resistance  and in vitro survival , is regulated by the repressor MtrR  and the activator MtrA . In N. meningitidis, a CREE has inserted within the regulatory region for mtrCDE and removed the expression of these genes from control by MtrR and MtrA . In this case, the CREE is present between the native promoter  and the genes, thus the CREE sequence is part of the mRNA transcript and its cleavage by RNase III has been demonstrated . This CREE also contains an IHF-binding site and its presence in the CREE was shown to have a negative impact on transcription levels .
Of the CREE identified in the genome sequences of N. gonorrhoeae strain FA1090 and N. gonorrhoeae strain NCCP11945, thirteen are only present in one of the strains (see Additional files 1, 2, and 4). Four of these are within larger regions of difference. Fragments of CREE are found at three other sites. The remaining six are clear insertions of the element, often at a TA target site as has been described . However, two insertions appear to have occurred at CA sites (strain NCCP11945 positions 1,577,266 and 2,185,647) and indeed some CREE have terminal CA sequences rather than TA (see Additional files 1 and 2). These insertions of CREE in just one of the gonococcal strains support the mobility of the element within the gonococcus and suggest that the mobilization mechanism for the CREE could not have been lost before the speciation event that generated N. gonorrhoeae and N. meningitidis.
Also identified was one CREE in strain FA1090 that corresponded to two different CREE in strain NCCP11945 (Figure (Figure4).4). The associated chromosomal rearrangement has not created fragments of CREE in either strain.
It is clear that the gonococci differ from their meningococcal relatives in the number of CREE copies sustained in their genome sequences, with N. meningitidis genomes having over 250 [8,9,25]. The commensal N. lactamica, a close relative of the pathogenic neisserial species, has been reported to have three times fewer CREE than N. meningitidis . A quick search suggests that there are approximately 100 CREE in the N. lactamica ST-640 genome sequence (data not presented). Therefore, the commensal and the gonococcus share similar CREE copy numbers.
Coupled with the functions of the CREE, this suggests that N. gonorrhoeae and N. lactamica retain more of the ancestral Neisseria regulatory networks than N. meningitidis, where many of these may now be under the transcriptional control of CREE. This is certainly true for the efflux pump system encoded by mtrCDE, where both the gonococcal repression  and activation  systems have been subverted through CREE insertion in the meningococcus . It has been difficult to assign a species-specific gene set to N. gonorrhoeae, N. meningitidis, and N. lactamica, largely due to horizontal exchange between these naturally competent species . The small number of genetic islands identified that were thought to be unique to one species are either present in both pathogenic species or are not present in all strains of the species in which they were originally found. For example, the Gonococcal Genetic Island is not present in all gonococci  and was found in strains of N. meningitidis . The meningococcal capsule is absent from some N. meningitidis strains including some that have caused invasive disease [32,33]. Other Islands of Horizontal Transfer found in N. meningitidis are strain-specific [9,25]. Added to this, only six 'virulence' genes present in all pathogenic Neisseria genome sequences were absent from the non-pathogen N. lactamica . Evidence increasingly supports the idea that regulation rather than gene complement differentiates the species.
The single CR sequences found in the two gonococcal genome sequences were often associated with a partial core sequence (6 out of 9 and 4 out of 6). In addition, the vast majority of the CR are part of CREE (see Additional files 1 and 2). This suggests that the 26 bp repeats identified by Correia et al.  are fragments of CREE, having lost the remainder of the sequence through deletion events. Cases of partial CR (<26 bp) are also found in strain FA1090 that correspond to whole CREE 5' of NGK_2270 (mafA) and between NGK_2168 and NGK_2169 in strain NCCP11945 (see Additional file 3). The CREE itself should therefore be seen as the functional unit, rather than the chance occurrence of an inverted pair of CR. This is especially evident in light of the conservation of the core sequences (Figure (Figure11).
The gonococcal CREE are evenly distributed in the chromosomes (Figure (Figure3).3). Most are of the 154–156 bp or 105–107 bp types, however a small number are shorter than these at 69–73 bp. Sequence feature characteristics of the different lengths of CREE illustrated in Figure Figure11 show a general conservation of CREE structure based on length. Although some length variants shown in Figure Figure33 were found, in each case these are modifications of one of the basic CREE structures through deletions and tandem repeat duplications. All CREE lengths can have the variations in the inverted repeat end sequences (Figure (Figure1;1; AACAAAAA or AAATTTAAA) that have been described previously . Symmetry of these inverted repeats generates the potential for RNase III cleavage . The potential for an IHF-binding site is present only in the longer of the CREE (Figure (Figure1).1). CREE core sequence directionality has a CCGGTACGG end and a TCAGGACAA end (Figure (Figure1).1). The 71 to 73 bp CREE are the exception to this structure, having two CCGGTACGG ends, yet these still retain a directionality to their core sequence (Figure (Figure11).
A total of 18 potential Black promoters and 15 potential Snyder promoters were found associated with N. gonorrhoeae strain FA1090 CREE, with 7 of these CREE containing both (see Additional file 1). While few potential Black and Snyder promoters were identified, there are 76 CREE in strain FA1090 positioned 5' of a gene. This suggests that the presence of the element in this location influences the regulation or expression of these gene(s). It is likely that RNase III has a role in post-transcriptional regulation of these genes. Of these 76 CREE, 57 have symmetrical inverted repeat ends and are therefore potential substrates for RNase III cleavage if the CREE sequence is part of the transcript. Asymmetrical sites are also influenced by RNase III through binding, therefore RNase III is thought to influence the longevity of all mRNAs containing CREE sequence. Seven CREE differ between the strains in their end symmetry, with four of these located 5' of annotated CDSs (NGO0407, NGO0452, NGO1221, and NGO1347). Likewise, nine CREE 5' of CDSs contain IHF-binding sites that may influence the binding and action of other proteins, including RNA polymerase and RNase III.
The Opa proteins of the Neisseria spp. are a family of phase variably expressed, antigenically variable outer membrane proteins involved in attachment and invasion of host cells. Multiple copies of the opa gene are found in neisserial genomes and variations in their sequences can mediate different interactions with different host cell receptors [35,36]. There are 11 opa genes in strain FA1090  and in strain NCCP11945 (Table (Table2).2). Most of the strain FA1090 opa genes are unannotated and in several cases the strain NCCP11945 annotation has not identified the initiation codon due to the frame-shift generated by the phase variable CTTCT tract within the gene . The majority of the opa genes are associated with a 5' CREE, which in some cases contains an IHF-binding site sequence consensus (Table (Table22).
In all cases, there appears to be a σ70 promoter between the CREE and opa, which would mean that the CREE sequence is not part of the mRNA transcript and that it is not therefore targeted by RNase III. There are sequence differences in the IHF-binding sites between the strains, but whether the sites that differ from the published IHF-binding site sequence are still able to bind IHF is not known. The CREE 5' of NGK_1799 in strain NCCP11945 is not present in strain FA1090, where a 17 bp fragment of the end of the CREE remains 5' of NGO1513. The CREE 5' of NGK_1847 (unannotated in strain FA1090) is only 105 bp and therefore does not carry the IHF-binding site (Figure (Figure11).
Indeed, there are no CREE at all present 5' of the opa copies NGK_0096, NGK_0847, NGK_2170, and NGK_2410 (all unannotated in strain FA1090) (Table (Table2).2). The regions upstream of these genes are otherwise similar, which would suggest that the CREE was mobile after the gene duplication events that generated 11 copies of opa, three to four times the number of opa genes found in N. meningitidis [8,9,25]. Indeed all of the copies of opa found in N. meningitidis strain Z2491 have a CREE 98–99 bp 5' (NMA1676, NMA1890, NMA2043), with similarly placed CREE for three out of four opa genes in meningococcal strain MC58 (NMB0442, NMB1465, NMB1636, but not NMB0926) [8,9,34]. When the phase variable tract is ON in opa the influences of the various upstream sequences, including the CREE, may have further regulatory effects on expression of this important adhesin, perhaps through binding of IHF.
Of the 5 previously reported genes disrupted by CREE insertion [8,21,22], 4 are associated with CREE in these two gonococcal genome sequences (Table (Table3).3). In the case of NMA2121, this CDS is not present in strain FA1090 as it is part of a Minimal Mobile Element [22,39,40], which facilitates horizontal exchange of gene cassettes between genome sequences.
Three additional coding sequence disruptions by CREE were identified in strains FA1090 and NCCP11945 (Table (Table3).3). In two of these, the disruption is shared by both strains, while the third has no CREE in this region in strain NCCP11945. The fact that the CREE has inserted into identifiable coding regions supports the hypothesis that this element is, or was, mobile. That disruption of some CDSs is seen only in the gonococcus supports mobilization of the CREE within N. gonorrhoeae since its split from N. meningitidis.
It has been proposed that CREE are not found in regions of horizontal transfer and that their absence can be taken as additional evidence for the horizontal origin of a region, given their otherwise even distribution in the chromosome [6,9,23,24]. The CREE sequences 38 bp 5' of NGO1004, 213 bp 5' of NGO1006, and 21 bp 3' of NGO1020, present in both gonococcal strains, are within a 13 kb region between rpoD and tRNA-Arg containing 17 CDSs annotated as "putative phage associated proteins". This demonstrates that the CREE has been mobile at some time since the acquisition of this prophage sequence in the Neisseria. While it might be possible to use the absence of CREE as additional evidence indicating foreign sequence origin, as for the meningococcal IHTs  and the Gonococcal Genetic Island , the presence of CREE does not preclude the possibility of horizontal transfer of the region.
The CREE are less numerous in the gonococcal genome sequences (~120 copies) than in the meningococcal genome sequences (~250 copies). Many of the gonococcal CREE (71 of 113 in strain FA1090) are 5' of genes and in this position are likely to be involved in the regulation of genes through generation of promoter sequences, binding of IHF, and mRNA stability control by RNase III. CREE are associated with virulence- and host survival-associated determinants such as Opa, HpuA, and MafB, as well as regulators of factors important in pathogenesis. Differences in CREE insertions between the two strains and the presence of CREE within a prophage region, upstream of multiple copies of opa, and within coding sequences provide strong evidence of mobility of this element in the gonococcus and therefore since the speciation event that differentiated the gonococcus from the meningococcus. The regulatory influence of the CREE and the copy number differences between N. gonorrhoeae, N. meningitidis, and N. lactamica may contribute to the different behaviours of these pathogens through differences in their regulatory networks.
The genome sequences of N. gonorrhoeae strain FA1090 [GenBank: RefSeq NC_002946] and N. gonorrhoeae strain NCCP11945 [GenBank: RefSeq NC_011035] were searched for CR sequences using the tools within xBASE http://xbase.bham.ac.uk. Each genome sequence was searched using 'fuzznuc' for three different patterns based on the conserved CR sequences identified from previous studies of this element [5,6,8,25]: TATAG [CT]GGATTAACAAAAATCAGGAC; TATAG [CT]GGATTAAATTTAAACCGGTAC; TATAG [CT]GGATTAACAAAAACCGGTAC; and TATAG [CT]GGATTAAATTTAAATCAGGAC. Square brackets will find either of the letters within them. In each case, up to 3 mismatches were allowed. Increasing to 4 or 5 mismatches did not identify any additional CREE. Pattern finding has been successfully used previously to identify CREE [3,5]. Each identified location was then investigated manually to determine if the site contained a lone CR or a CREE. Each location was catalogued and the length and sequence of the CREE was determined (see Additional files 1 and 2). As a confirmation, searches with the above sequences were also conducted using BLASTN, as has been done previously [5-7], against the gonococcal genome sequences and also against N. lactamica ST-640 (data not presented). IHF sites were found by searching within the CREE sequences for the sequence CTTCAGCACCTTA  (see Additional file 3). RNase III cleavage potential was determined based on CR sequence and symmetry between the ends of the CREE.
Using xBASE http://www.xbase.bham.ac.uk, WebACT http://www.webact.org, and progressive Mauve http://asap.ahabs.wisc.edu/software/mauve, the genome sequence data from gonococcal strains FA1090 and NCCP11945 were comparatively aligned and visualized. For each CREE location the presence of a corresponding CREE in the other genome sequence, if present, was catalogued (see Additional file 4).
For each site where CREE were found in both the gonococcal genome sequences, the sequences of the CREE were aligned using EMBOSS needle  and the calculated percent identity was recorded (see Additional file 4). All of the identified CREE sequences were aligned together using ClustalW2  from EMBL-EBI.
Correia repeats; Correia elements (CE); Correia; Correia full (~150 bp); Correia internal deletion (~105 bp); Neisseria miniature insertion sequences (nemis); neisserial repetitive sequences; pseudo-transposable neisserial small repetitive elements (SRE); N. gonorrhoeae inverted repeats; Correia sequences; 152-bp repetitive sequence.
Correia repeats; Correia repeat unit; 26-bp repeat; 26-bp neisserial repeat (NR).
LS conceived of the study, carried out the comparative genome analyses, data collection, and drafted the manuscript. JC and MP assisted in the interpretation of the data and suggested improvements to the presentation of the results and their discussion in the manuscript. All authors read and approved the final manuscript.
CREE in N. gonorrhoeae strain FA1090. This spreadsheet contains the locations, sequence, and features of all of the CREE identified in the genome sequence of N. gonorrhoeae strain FA1090.
CREE in N. gonorrhoeae strain NCCP11945. This spreadsheet contains the locations, sequence, and features of all of the CREE identified in the genome sequence of N. gonorrhoeae strain NCCP11945.
Comparison of long CREE and IHF-binding sites between N. gonorrhoeae strains FA1090 and NCCP11945. This spreadsheet contains comparative data on the long CREE in N. gonorrhoeae strain FA1090 and N. gonorrhoeae strain NCCP11945 and the presence of an IHF-binding site that corresponds to the sequence published .
Comparison of CREE locations between N. gonorrhoeae strains FA1090 and NCCP11945. This spreadsheet contains the CREE that are in common between N. gonorrhoeae strain FA1090 and N. gonorrhoeae strain NCCP11945.
LS was supported by a BBSRC grant (BBE111791) awarded to MP and D.J. Stekel. This research made use of xBASE: a comprehensive resource for comparative bacterial genomics , which is also supported by this BBSRC grant.