|Home | About | Journals | Submit | Contact Us | Français|
Riemerella anatipestifer (Hendrickson and Hilbert 1932) Segers et al. 1993 is the type species of the genus Riemerella, which belongs to the family Flavobacteriaceae. The species is of interest because of the position of the genus in the phylogenetic tree and because of its role as a pathogen of commercially important avian species worldwide. This is the first completed genome sequence of a member of the genus Riemerella. The 2,155,121 bp long genome with its 2,001 protein-coding and 51 RNA genes consists of one circular chromosome and is a part of the Genomic Encyclopedia of Bacteria and Archaea project.
No strain designation has been published for the type strain of Riemerella anatipestifer; therefore it will be referred to in this publication as ATCC 11845T, after the earliest known deposit (= DSM 15868 = ATCC 11845 = JCM 9532). Strain ATCC 11845T is the type strain of R. anatipestifer which is the type species of the genus Riemerella. The organism was described for the first time by Hendrickson and Hilbert in 1932 as 'Pfeifferella anatipestifer' , was subsequently known as 'Pasteurella anapestifer'  and renamed by Bruner and Fabricant in 1954 as Moraxella anatipestifer [3,4]. However, since the organism is closely related to neither the genus Moraxella nor to Pasteurella  it was reclassified as a novel genus by Segers et al. in 1993 . The reclassification was confirmed by subsequent 16S rRNA gene sequence analysis [7,8]. The generic name was given in honor to Riemer, who first described R. anatipestifer infections in geese in 1904 and referred to the disease as septicemia anserum exsudativa . The species epithet is derived from the Latin noun 'anas/atis' meaning 'duck', and the Latin adjective 'pestifer' meaning 'pestilence-carrying' referring to the pathogenic effect the species has on water-fowl, especially ducks. The type strain of the species was isolated from blood of ducklings on Long Island, New York, and identified by Bruner in 1954 . Further isolates were obtained from all kinds of avian hosts, however, pigeons and mammals are not infected by this species. R. anatipestifer is distributed worldwide and causes serious problems in agricultural flocks of duck, goose and turkey , but it has also been found in the upper respiratory tract of clinically healthy birds . There is no indication that the organism can survive outside of its host. Currently, there are two species in the genus Riemerella. The only other validly published species of the genus is R. columbina which is mainly associated with respiratory disease in pigeons . Here we present a summary classification and a set of features for R. anatipestifer ATCC 11845T, together with the description of the complete genomic sequencing and annotation.
A representative genomic 16S rRNA sequence from strain ATCC 11845T was compared using NCBI BLAST under default settings (e.g., considering only the high-scoring segment pairs (HSPs) from the best 250 hits) with the most recent release of the Greengenes database  and the relative frequencies, weighted by BLAST scores, of taxa and keywords (reduced to their stem ) were determined. The five most frequent genera were Riemerella (79.2%), Chryseobacterium (17.1%), Bergeyella (2.6%), “Rosa” (0.6%; misnomer) and Cloacibacterium (0.5%) (166 hits in total). Regarding the 124 hits to sequences from members of the species, the average identity within HSPs was 99.5%, whereas the average coverage by HSPs was 95.2%. Among all other species, the one yielding the highest score was “Rosa chinensis”, apparently a severe misannotation, which corresponded to an identity of 99.8% and an HSP coverage of 91.9%. The highest-scoring environmental sequence was EF219033 ('structure and significance Rhizobiales biofouling biofilms on reverse osmosis membrane treating MBR effluent clone RO224'), which showed an identity of 96.0% and a HSP coverage of 97.8%. The five most frequent keywords within the labels of environmental samples which yielded hits were 'skin' (10.3%), 'human' (4.9%), 'biota, cutan, lesion, psoriat' (4.0%) and 'fossa' (4.0%) (84 hits in total). Environmental samples which yielded hits of a higher score than the highest scoring species were not found.
Figure 1 shows the phylogenetic neighborhood of R. anatipestifer in a 16S rRNA based tree. The sequences of the three identical 16S rRNA gene copies in the genome differ by one nucleotide from the previously published 16S rRNA sequence (U60101).
The cells of R. anatipestifer are generally rod-shaped (0.3-0.5 × 1.0-2.5 µm) with round ends (Figure 2) . R. anatipestifer is a Gram-negative, non spore-forming bacterium (Table 1). The organism is described as non-motile. Gliding motility is not observed. Only three genes associated with motility have been found in the genome (see below). The organism is a capnophilic chemoorganotroph which prefers microaerobic conditions for growth. The optimum temperature for growth is 37°C, most strains can grow at 45°C but not at 4°C. Catalase and oxidase are present, thiamine is required for growth . R. anatipestifer is not able to reduce nitrate and does not produce hydrogen sulfide. The organism tolerates 10% bile in serum but no growth occurs on agar containing 40% bile in serum . Many biochemical reactions are negative or strain-dependent: Hinz et al. have stated that “R. anatipestifer is not easy to identify because it is characterized more by the absence than by the presence of specific biochemical properties” . The organism has proteolytic activity but its capacity to utilize carbohydrates is strain-dependent and has been discussed controversially. It has been described that carbohydrates are used oxidatively and that R. anatipestifer is able to produce acid from glucose and maltose, less often from fructose, dextrin, mannose, trehalose, inositol, arabinose and rhamnose . The production of indole is strain-dependent; the type strain does not produce indole . Esculin is not hydrolyzed by most R. anatipestifer strains, a trait useful for distinguishing these strains from R. columbina strains . Strain ATCC 11845T exhibits positive reactions for alkaline and acid phosphatase, ester lipase C8, leucine arylamidase, valine arylamidase, cystine arylamidase, phophoamidase, α-glucosidase and esterase C4. It does not produce α- and β-galactosidases, β-glucuronidase, β-glucosidase, α-mannosidase, β-glucosaminidase, lipase C14, fucosidase, trypsin, ornithine and lysine decarboxylases and phenylalanine deaminase .
R. anatipestifer is generally susceptible to enrofloxacin, amoxicillin, chloramphenicol, novobiocin, spiramycin, lincomycin and tetracyclines. Antibiotic resistance of the organism is steadily increasing: resistance to penicillin G, streptomycin and sulfonamides has been reported and more than 90% of all strains are resistant to polymyxin B, colistin, gentamycin, neomycin and kanamycin . The transmission of R. anatipestifer in ducks occurs vertically through the egg as well as horizontally via the respiratory route. The disease affects primarily young ducks where it typically involves the respiratory tract and nervous system. Ocular and nasal discharge are often typical for the onset of the disease, lameness can be observed at a later state. The mortality ranges between 1 and 10%, surviving animals may be stunted [34,35]. Vaccination of flocks has proven a valuable course of protection, however, immunity is serovar-specific and more than 20 serovars of R. anatipestifer are known . It is remarkable that R. anatipestifer persists post-infection on duck farms; biofilm formation of the organism is discussed as one possible explanation .
Few data are available for R. anatipestifer strain ATCC 11845T. The sole respiratory quinone found in this species is menaquinone . Which specific quinone is present in R. anatipestifer remains unclear from the literature: whereas Segers et al. specify menaquinone 7 as sole respiratory quinone in the type strain , Vancanneyt et al. claim menaquinone 6 is the major respiratory quinone of the type species . Typically, representatives of the genus Riemerella contain branched-chain fatty acids in high percentages. Major fatty acids of R. anatipestifer are iso-C15:0 (50-60%), iso-C13:0 (15-20%), 3-hydroxy iso-C17:0 (13-18%), 3-hydroxy anteiso-C15:0 (8-11%) and anteiso-C15:0 (6-8%) .
This organism was selected for sequencing on the basis of its phylogenetic position , and is part of the Genomic Encyclopedia of Bacteria and Archaea project . The genome project is deposited in the Genomes On Line Database  and the complete genome sequence is deposited in GenBank. Sequencing, finishing and annotation were performed by the DOE Joint Genome Institute (JGI). A summary of the project information is shown in Table 2.
R. anatipestifer ATCC 11845T, DSM 15868, was grown microaerobically in DSMZ medium 535 (Trypticase Soy Broth Medium)  at 37°C. DNA was isolated from 0.5-1 g of cell paste using MasterPure Gram-positive DNA purification kit (Epicentre MGP04100) following the standard protocol as recommended by the manufacturer, with modification st/DL for cell lysis as described in Wu et al. . DNA is available through the DNA Bank Network .
The draft genome was generated at the DOE Joint Genome Institute (JGI) using a combination of Illumina and 454 technologies (Roche). For this genome, we constructed and sequenced an Illumina GAii shotgun library which generated 26,937,600 reads totaling 969.8 Mb, a 454 Titanium standard library which generated 238,617 reads and a paired end 454 library with an average insert size of 14.3 kb which generated 112,671 reads totaling 141.8 Mb of 454 data. All general aspects of library construction and sequencing performed at the JGI can be found at . The initial draft assembly contained 28 contigs in one scaffold. The 454 Titanium standard data and the 454 paired end data were assembled together with Newbler, version 2.3. The Newbler consensus sequences were computationally shredded into 2 kb overlapping fake reads (shreds). Illumina sequencing data was assembled with VELVET, version 0.7.63 , and the consensus sequences were computationally shredded into 1.5 kb overlapping fake reads (shreds). We integrated the 454 Newbler consensus shreds, the Illumina VELVET consensus shreds and the read pairs in the 454 paired end library using parallel phrap, version SPS - 4.24 (High Performance Software, LLC). The software Consed  was used in the following finishing process. Illumina data was used to correct potential base errors and increase consensus quality using the software Polisher developed at JGI . Possible mis-assemblies were corrected using gapResolution , Dupfinisher , or sequencing cloned bridging PCR fragments with subcloning. Gaps between contigs were closed by editing in Consed, by PCR and by Bubble PCR (J-F Cheng, unpublished) primer walks. A total of 388 additional reactions were necessary to close gaps and to raise the quality of the finished sequence. The error rate of the completed genome sequence is less than 1 in 100,000. Together, the combination of the Illumina and 454 sequencing platforms provided 508.6 × coverage of the genome.
Genes were identified using Prodigal  as part of the Oak Ridge National Laboratory genome annotation pipeline, followed by a round of manual curation using the JGI GenePRIMP pipeline . The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) nonredundant database, UniProt, TIGR-Fam, Pfam, PRIAM, KEGG, COG, and InterPro databases. Additional gene prediction analysis and functional annotation was performed within the Integrated Microbial Genomes - Expert Review (IMG-ER) platform .
The genome consists of a 2,155,121 bp long chromosome with a G+C content of 35.0% (Table 3 and Figure 3). Of the 2,052 genes predicted, 2,001 were protein-coding genes, and 51 RNAs; 29 pseudogenes were also identified. The majority of the protein-coding genes (64.1%) were assigned with a putative function while the remaining ones were annotated as hypothetical proteins. The distribution of genes into COGs functional categories is presented in Table 4.
We would like to gratefully acknowledge the help of Sabine Welnitz (DSMZ) for growing R. anatipestifer cultures. This work was performed under the auspices of the US Department of Energy Office of Science, Biological and Environmental Research Program, and by the University of California, Lawrence Berkeley National Laboratory under contract No. DE-AC02-05CH11231, Lawrence Livermore National Laboratory under Contract No. DE-AC52-07NA27344, and Los Alamos National Laboratory under contract No. DE-AC02-06NA25396, UT-Battelle and Oak Ridge National Laboratory under contract DE-AC05-00OR22725, as well as German Research Foundation (DFG) INST 599/1-2.