|Home | About | Journals | Submit | Contact Us | Français|
Adult stable flies are blood feeders, a nuisance, and mechanical vectors of veterinary diseases. To enable efficient feeding, blood sucking insects have evolved a sophisticated array of salivary compounds to disarm their host's hemostasis and inflammatory reaction. While the sialomes of several blood sucking Nematocera flies have been described, no thorough description has been made so far of any Brachycera, except for a detailed proteome analysis of a tabanid (Xu et al., 2008). In this work we provide an insight into the sialome of the muscid Stomoxys calcitrans, revealing a complex mixture of serine proteases, endonucleases, Kazal-containing peptides, anti-thrombins, antigen-5 related proteins, antimicrobial peptides, and the usual finding of mysterious secreted peptides that have no known partners, and may reflect the very fast evolution of salivary proteins due to the vertebrate host immune pressure. Supplemental tables S1 and S2 can be downloaded from http://exon.niaid.nih.gov/transcriptome/S_calcitrans/T1/Sc-tb1-web.xls and http://exon.niaid.nih.gov/transcriptome/S_calcitrans/T2/Sc-tb2-web.xls.
The habit of blood feeding evolved independently several times in Diptera (Grimaldi and Engel, 2005), including the Nematocera and Brachycera. Evolution to blood feeding is associated with the expression of specialized saliva that is pharmacologically active against vertebrate hemostasis and inflammation. Because inflammation and hemostasis are complex and redundant phenomena, the salivary potion of blood feeders is also complex, containing vasodilators, anti-clotting substances and enzymes that destroy vertebrate agonists and matrix components (Ribeiro, 1995). Sialotranscriptome analysis (from the Greek, Sialo=Saliva) of mosquitoes, sand flies, biting midges and black flies are revealing a vast repertoire of polypeptides recruited to serve this function (Andersen et al., 2009; Arca et al., 2007; Arca et al., 2005; Calvo et al., 2004; Campbell et al., 2005; Ribeiro et al., 2004; Valenzuela et al., 2003).
It has been proposed that all blood sucking Nematocera, except for sand flies, have a common blood feeding ancestor (Grimaldi and Engel, 2005). While the blood feeding Nematocera salivary proteins have some common protein families, such as the very divergent D7 protein family also found in sand flies (Valenzuela et al., 2002a), most other protein families are family or genus specific. For example, the salivary anticlotting protein of Aedes aegypti is a member of the serpin family (Stark and James, 1998), while a novel peptide named anophelin is an anti-thrombin in Anopheles albimanus (Valenzuela et al., 1999). As another example, while salivary apyrase activity, responsible for ADP hydrolysis, is found in both sand flies and mosquitoes, the gene families recruited for these functions are completely different in these two organisms (Champagne et al., 1995; Valenzuela et al., 2001). While the Brachycera contain diverse families of blood sucking arthropods of medical and veterinary importance including the tsetse, tabanids, keds and stable flies, no detailed transcriptome analysis has been done so far from any of these flies, although a detailed sialoproteome analysis has been made from the tabanid Tabanus yao (Xu et al., 2008).
The stable fly, Stomoxys calcitrans is an important pest of cattle, their larvae growing in decaying faeces and straw, the adults biting and blood feeding on mammals (Campbell et al., 2001; Taylor and Berkebile, 2006). They can also vector pathogens by mechanical transmission. It is the aim of this study to provide a preliminary description of the sialome of S. calcitrans.
Stable flies were reared from a colony collected in Manhattan, KS in 1990. Eggs were collected with a wet black cloth wick. The larvae were reared in a fermented mixture of wheat bran (Farmers' Cooperative Association, Frederick, MD), vermiculite (Hummert International, St. Louis, MO), and Calf Manna (Manna PRO Corporation, Chesterfield, MO). Adult flies were fed with 1 M sucrose or Gatorade. For reproduction, the flies were fed daily with citrated bovine blood supplied in saturated cotton pads. The flies were kept at 26 ± 1°C, a photoperiod of 16:8 h (L:D), and at 65 ± 5% relative humidity. Flies from this colony maintain the ability to feed well on cattle (A. Broce, unpublished observations).
The salivary glands were dissected from male and female adults (not blood-fed) between 1 and 7 days after eclosion. The glands were immediately transferred to either TRI Reagent (Sigma) on ice for mRNA purification or to 10 mM Hepes, pH 7.0, 150 mM NaCl (HBS) on ice for extraction of proteins. Usually, groups of 20 salivary glands were placed in 20 μl of HBS in a 0.5 ml centrifuge tube. The salivary glands in TRI Reagent or HBS were stored at -80°C.
Total mRNA was purified using TRI Reagent (Sigma) according to the manufacturer's protocol. A long distance polymerase chain reaction (PCR)-based cDNA library was constructed in a λ Triple Ex2 vector following the procedures from the SMART™ cDNA Library Construction Kit (Clontech, USA). Briefly, 1.1 μg of total RNA purified from salivary glands of male and female adult stable flies was used to prepare first–strand cDNA in a 10 μl reaction volume at 42°C for 1 hour with PowerScript™ Reverse Transcriptase and CDS III primer (Clontech, USA). Half of the above first strand cDNA reaction volume (5 μl) was used as template for double stranded cDNA amplification in a reaction volume of 100 μl by long distance PCR using SMART IV oligonucleotide as forward primer and CDS III primer as reverse primer. The two primers incorporated Sfi IA and Sfi IB sites at the two ends of cDNAs suitable for digestion with Sfi I. PCR was conducted on a Perkin Elmer 9600 thermal cycler (Perkin Elmer, USA) using a Clontech Advantage 2 PCR kit (Clontech, USA) with denaturation at 95°C for 20 s, followed by 20 cycles of 95°C for 5 s and 68°C for 6 min.
The double stranded cDNA was treated with proteinase K at 45°C for 20 min and then extracted with phenol:chloroform:isoamyl alcohol. The cDNA was digested with Sfi I, size fractionated with a CHROMA SPIN-400 column (Clontech), and ligated into the λ TripleEx2 vector (Clontech) containing digested asymmetrical Sfi I sites. A packaging reaction was performed with Gigapack III Gold Packaging Extract (Stratagene) according to manufacturer's instructions. Titer of the unamplified cDNA library was determined, and the library was then amplified by plating 4 × 106 plaques and eluting resulting phage with 0.01% gelatin, 100 mM NaCl, 10 mM MgSO4, 35 mM Tris· Cl, pH 7.5.
In an initial test, 106 plaques were randomly selected for sequencing. The λ TripleEx2 phage was converted to pTripleEx2 plasmid by incubating 150 μl of the eluted phage with 200 μl overnight culture of E.coli BM25.8 (OD600nm at 1.1) at 31°C. Each culture was spread onto an LB/Ampicillin plate and grown at 31°C overnight. One colony from each plate was grown in 3 ml LB/ampicillin medium. Plasmids were purified from the cultures using QIAprep Spin Miniprep Kit (Qiagen) according to manufacturer's protocol. These DNA samples were then analyzed by digestion with BstXI and SphI or EcoRI and XhoI, followed by electrophoresis on a 1.1% agarose gel to assess insert size. Plasmid DNA samples were sequenced at the DNA Sequencing Facility in the Department of Plant Pathology, Kansas State University, using the following primers from the SMART™ cDNA Library Construction Kit (Clontech, USA): 5′-seqencing primer TCCGAGATCTGGACGAGC and 3′-sequencing primer TAATACGACTCACTATAGGG. Larger scale sequencing of the cDNA library was performed as described elsewhere (Francischetti et al., 2002; Valenzuela et al., 2002b).
ESTs were trimmed of primer and vector sequences, clusterized, and compared to other databases as described before (Valenzuela et al., 2003). The blast tool (Altschul et al., 1997) and CAP3 assembler (Huang, 1992) were used for the EST clusterization, by using a decreasing word size inclusion strategy. First, all ESTs were blasted against all ESTs with a word size of 100. All matches were fed as input to the CAP3 assembler, and a fasta of the assembled data was obtained, including quality files. This assembly done with word size of 100 was in turn used for the next assembly round, but now using a word size of 80, then 60 then 40 to produce the final assembly shown in supplemental table S1. This assembly strategy is easy to parallelize, allowing for very large data sets to be clusterized. The final output was piped into a tab delimited file that was imported into an Excel spreadsheet. These operations were automated by a program written in Visual Basic named Megacluster. We submitted all translated sequences (starting with a Met) to the Signal P (Nielsen et al., 1997) server to help identify putative secreted peptides. Deduced coding sequences (CDS) were also sent to the above prediction server, as well as to the SecretomeP (Bendtsen et al., 2004) to identify non-classical secreted proteins, to the TMHMM server (Sonnhammer et al., 1998) to detect membrane helices, the NETOGLYC server to detect possible mucin-type galactosylations (Hansen et al., 1998) and to the ProP server (Duckert et al., 2004) to identify putative furin processed protein cleavage sites. BLAST searches were done locally from executables obtained at the NCBI FTP site (ftp://ftp.ncbi.nih.gov/blast/executables/) against the non redundant protein database of the NCBI, the gene ontology fasta subset (Lewis et al., 2000), the Conserved Domains Database of NCBI (Marchler-Bauer et al., 2002) containing the KOG (Tatusov et al., 2003), Pfam (Bateman et al., 2000) and Smart (Schultz et al., 2000) motifs, and to custom downloaded databases containing mitochondrial and rRNA nucleotide sequences available at the NCBI.
To obtain salivary gland extracts (SGE), 20 salivary glands in 20 μl HBS were quickly frozen in liquid nitrogen and thawed in a 37°C water bath, and the procedure was repeated for another 3 cycles. After centrifuging (13, 000 × g, 2 min), the supernatant was analyzed by electrophoresis. The SGE sample was treated with NuPAGE 4 × LDS sample buffer (Invitrogen) without adding reducing agent and heated at 80°C for 10 min. The sample containing approximately 20 μg protein was applied to a NuPAGE® Novex 12% Bis-Tris Gel 1.0 mm, 10 well (Invitrogen). SeeBlue™ molecular weight standards (Invitrogen) were applied to an adjacent well. Electrophoresis was carried out in MOPS buffer (Invitrogen) at 200 V for 30 min. The proteins were transferred to a polyvinylidene difluoride (PVDF) membrane using 10 mM CAPS buffer, pH 11.0, 10% methanol as transfer buffer on a mini trans-blot electrophoretic transfer cell (Bio-Rad). Immediately before the transfer, the gel was pre-equilibrated with the transfer buffer for 1 min, and the membrane was first washed with 100% methanol for 30 s and then washed with water for 1 min. The transfer was conducted with constant current at 250 mA for 2 h. After transfer, the membrane was washed three times with deionized water, 5 min per wash. The membrane was stained for 5 min with 0.025% Coomassie Blue in 40% methanol. It was destained for 10 min in 50% methanol and then washed with water several times and air-dried. Stained protein bands were excised and analyzed by Edman degradation sequencing using a Procise sequencer (Perkin Elmer).
Amino-terminal de-blocking with pyroglutamate aminopeptidase was carried out similarly to a published protocol (Coligan et al., 1995) with minor modification. Briefly, the PVDF membrane was incubated in 200 μl 0.5% polyvinylpyrrolidone (PVP-40, Sigma) in 0.1 M acetic acid for 30 min at 37°C and then rinsed with water. Twenty microliters (2 mU) pyroglutamate aminopeptidase (PGAP, TaKaRa) and 80 μl freshly prepared 1 × PGAP buffer consisting of 50 mM sodium phosphate, 10 mM DTT, 1 mM EDTA, pH 7.0 was added to the tube, which was flushed with nitrogen and incubated at 50°C for 5 h. The membrane was rinsed three times with water and then air dried. Protein on the membrane was sequenced by Edman degradation using an Applied Biosystem Procise 492 Protein Sequencer at the Kansas State University Biotechnology Core Facility.
To identify cDNA sequences that matched the amino-terminal sequence of protein bands from the salivary gland extracts, a search program written by Dr. Ribeiro (Francischetti et al., 2002; Valenzuela et al., 2002b) was used to check the N-terminal sequencing information against three possible protein translations of each cDNA from cDNA library sequencing. The program was also used to resolve complex amino-terminal sequence information from a protein band that contained a mixture of proteins.
A total of 820 cDNA clones were used to assemble a database (Supplemental Table S1) that yielded 240 clusters of related sequences, 187 of which contained only one EST. The 240 clusters were compared, using the programs blastx, blastn, or RPS-BLAST (Altschul et al., 1997), to the nonredundant (NR) protein database of the National Center of Biological Information (NCBI), National Library of Medicine, NIH, to a gene ontology database (Ashburner et al., 2000), to the conserved domains database of the NCBI (Marchler-Bauer et al., 2002), and to a custom prepared subset of the NCBI nucleotide database containing either mitochondrial or rRNA sequences.
Manual annotation of the contigs resulted in four broad categories of expressed genes (Fig. 1). The putatively secreted (S) category contained 61% of the sequences, the housekeeping (H) category had 29%, 0.73 % derived from transposable elements and 9% of the ESTs could not be classified and belong to the unknown (U) class. The transcripts of the U class could represent novel proteins or derive from the less conserved 3/ or 5/ untranslated regions of genes, as was indicated for the sialotranscriptome of An. gambiae (Arca et al., 2005). The fourth class of transcripts were associated with transposable elements (TE). They may represent either the presence of active transposition in the fly, or more likely, the expression of sequences suppressing transposition. Low level expression of TE sequences have been a relatively common finding in previous sialotranscriptomes. It is possible that amplification of the library led to decreased complexity or altered representation of sequences in the original mRNA. However, sequences most highly represented in the cDNA library were also the most abundant proteins detected in salivary protein extracts (described below), suggesting that amplification did not account for over-representation of these sequences.
The 240 ESTs attributed to H genes expressed in the salivary glands (SGs) of S. calcitrans were further characterized into 16 subgroups according to function (Table 1 and Supplemental Table S1). Transcripts associated with the protein synthesis machinery represented 45% of all transcripts associated with a housekeeping function, an expected result for the secretory nature of the organ. Energy metabolism accounted for 16.7% of the transcripts. Twenty percent of the transcripts were classified as either ‘Hypothetical conserved’ or ‘Conserved secreted’ proteins. These represent highly conserved proteins of unknown function, presumably associated with cellular function but still uncharacterized.
A total of 498 ESTs represent putative S. calcitrans salivary components (Table 2 and Supplemental Table S1). These include previously known gene families, such as enzymes, antigen 5 proteins and antimicrobial peptides. However, several putative proteins, including some that appear to be multigenic, have no similarities to known proteins, as will be described further below.
In addition to obtaining transcriptome information, we tried to identify the major proteins expressed in the salivary gland. Proteins extracted from 20 salivary glands were analyzed by SDS-PAGE. The proteins were transferred to PVDF membrane, and Coomassie blue stained bands were excised for Edman degradation sequencing. The gel showed about 20 clearly visible protein bands, with a predominant doublet at ~26-27 kDa. Amino-terminal sequencing information was successfully obtained for nine proteins bands. Sequences for the predominant double bands at 26-27 kDa were obtained only after deblocking with pyroglutamine aminopeptidase. The nine sequences obtained by Edman degradation matched predicted amino acid sequences from clusters of the cDNA database in the S category (Fig 2). Other bands did not yield useful information either because of their low signal, or because of mixed sequencing information that could not be assigned to any of the deduced amino acid sequence from the clusters of the cDNA database. The polypeptide sequences associated with amino-terminal sequences found by Edman degradation had EST representation between three to one hundred and thirty three sequences, with an average of 52.2 EST's sequences per polypeptide (Supplemental table S2), while the remaining proteins have an average of 2.98 sequences per polypeptide. Accordingly, the predominant proteins in the salivary gland correlated with the mRNA abundance in the transcriptome. This was also observed in other proteome analysis of salivary glands in Anopheles stephensi (Valenzuela et al., 2003).
Of the nine amino terminal sequences, four peptides related to thrombostasin were found; their position in the gel indicates that these proteins may be processed to smaller peptides, as described further in section 18.104.22.168. The predominant double bands at 26-27 kDa were related to the antigen 5 (Ag5) family of proteins. A 39 kDa protein related to an endonuclease of Culex pipiens quinquefasciatus was found. Two proteins at about 19 kDa were a glutamine-rich protein and member of the Hyp-16 family. Their function in blood feeding is unknown.
From the sequenced cDNAs, a total of 74 protein sequences was derived, 27 of which encode putative secreted products (Table 2, Table 3, and Supplemental Table S2). Table 3 presents a summary of the secreted subset, with links to GenBank. The following presentation is a guide for browsing Supplemental Table S2.
Six different serine proteases are presented in Table 3, 4 of which are full length and 2 are fragments. When compared to the NR protein database, they provide best matches to Drosophila proteins. The proteins named SC-4-4-4 and SC-4-17-3 are most closely related, approaching 60 % similarity. These enzymes may be associated with fibrinogenolytic activity, as demonstrated for Tabanus flies (Xu et al., 2008), but may also be related to digestion of host extracellular matrix components.
A secreted endonuclease similar to a salivary homologue from tsetse is described. A previously described salivary endonuclease of Culex quinquefasciatus was suggested to function by decreasing the viscosity of the feeding matrix and producing pharmacologically active DNA fragments that might assist blood feeding (Calvo and Ribeiro, 2006).
Thrombostasin (Zhang et al., 2002) is a protein with anti-thrombin activity obtained from the salivary gland of horn flies. The S. calcitrans thrombostasin homologue has only 41% sequence similarity to the horn fly protein, and presents only low complexity matches to other unrelated proteins due to the repeating charged amino acids that it contains. The sequences of two additional Stomoxys thrombostasin family members are also presented, thrombostasin-2 having only 37 % identity to its paralogue, and no predicted similarity to the Haematobia protein. The full length sequence of thrombostasin-3 is also presented in supplemental table S2. It has numerous QHDGESNEESDE repeats, which gives an acidic pI of 3.9 for the predicted protein. Such acidic residues may interact with the anion binding exosites of thrombin and Factor Xa. Edman degradation products for the 3 proteins were found in the PAGE experiment (Fig 2). Similarly to the horn fly thrombostasin, the Stomoxys homologue is also processed from a preproprotein. Indeed the ProP server predicts several furin putative cleavage sites, including at the regions found by the Edman degradation products (see column AI on supplemental file S2). The Stomoxys thrombostasins are abundantly expressed, being represented by 98, 36 and 49 EST's for thrombostasins-1-3, respectively.
The Kazal domain is ubiquitously found in plants and animals and is normally associated with protease inhibitors (Kanost, 1999; Schlott et al., 2002). Exceptionally, the vasodilator of tabanids was identified as belonging to this protein family (Takac et al., 2006; Xu et al., 2008). Table 3 presents the sequences of two polypeptides of this family, both producing highest similarities to Drosophila peptides. They have relatively small EST representation, with 1 and 5 ESTs. The function of these proteins as vasodilators remains to be confirmed.
The antigen 5 family, also known as CAP or CRISP families, was originally described in the venom of wasps and belong to a ubiquitous family of proteins found in plants and animals (Schreiber et al., 1997). Members of these families function in diverse ways, being toxins and ion channel blockers in lizard and snake venom (Nobile et al., 1996; Yamazaki et al., 2003; Yamazaki and Morita, 2004), or proteases in Conus snails (Milne et al., 2003). Antigen-5 related proteins have been described from virtually all sialotranscriptomes from hematophagous arthropods done to date. Three very closely related proteins of this family are described in Table 3, which may be products of the same gene as alleles or splice variants. One variant possibly lacking an exon produces a product 3 kDa smaller that could explain the two bands with same Edman product found in the PAGE experiment (Fig 2). The mature protein starts with a glutamine, and treatment with pyroglutamase was necessary to produce the Edman products. Overall these proteins have a total of 133 matching ESTs and represent the most abundantly expressed gene in the salivary glands of S. calcitrans. Recently a recombinant member of this family was shown to bind the Fc portion of immunoglobulins, and may function as a complement inhibitor (Ameri et al., 2008).
Three unrelated peptides that might serve an antimicrobial function are reported in their full length form (Table 3). SC-4-22-3 is 53% similar to a Drosophila protein containing GGY repeats, which was shown in C. elegans to have antimicrobial function (Couillault et al., 2004). Stomoxin is a member of the cecropin family previously reported from the gut of the stable fly (Boulanger et al., 2002; Landon et al., 2006), and is here reported to be found in the insect salivary glands. The third AMP has the PFAM Attacin C-terminal region, being a member of the Diptericin family. Truncated fragments of a homolog of sarcotoxin were also found (Supplemental table S1).
Hyp 16 family found in mosquito sialotranscriptomes: Two distantly related salivary peptides from S. calcitrans, producing mature peptides of 13.7 and 15.6 kDa are also distantly related to mosquito proteins. Alignment of these two peptides with the mosquito proteins indicates a conserved amino acid backbone G-x(2)-N-x(2)-I-x(6)-C-D-x(3)-C-P-x(5)-C-x(3)-K-x(15)-C-x(4)-G-x(4)-K-x(8,9)-P-x(8)-E (Fig 3). Interestingly, two of the mosquito proteins, deriving from An. stephensi and from Ae. albopictus, were found in previously described sialotranscriptomes (Arca et al., 2007; Valenzuela et al., 2003). The molecular model database (MMDB) of NCBI (Wang et al., 2007) has structural matches for both proteins indicating these to be related to the kissing bug salivary protein named nitrophorin I, which is a lipocalin carrier of heme, NO, and binds histamine (Andersen et al., 2005), suggesting this family derives from a lipocalin structure and may function by sequestering agonists of inflammation or hemostasis, as do many tick and kissing bug lipocalins, as well as the D7 family of mosquitoes (Calvo et al., 2006). The Stomoxys proteins are represented by 53 and 42 EST's each, and their predicted amino terminal sequences were found in the PAGE experiment (Fig 2). Their abundant expression is compatible with an agonist sequestration function.
11.4 kDa conserved family: Two related proteins similar to other secreted insect proteins of unknown function were deduced from the S. calcitrans sialotranscriptome. The closest D. melanogaster relative, gi:24653255 is annotated in Flybase as an unknown protein that is highly expressed in larval and adult midguts, as indicated in flyatlas. They may function in immunity or in some conserved function associated with the endoplasmic reticulum or Golgi.
Q-rich secreted polypeptides: Two proteins containing repeats involving the amino acid motif QDNGA are described. These may derive from alternate splicing of the same gene. Their repeat nature suggests interaction with matrix proteins, possibly collagen. Sequences with large contents of glutamine were also found abundantly expressed in black fly sialomes (Andersen et al., 2009). A total of 7 transcripts are associated with the 2 Q-rich polypeptides. Evidence for secretion of SC-4-13-4 was found in the PAGE experiment (Fig 2).
Orphan secreted proteins: Table 3 additionally depicts the full length sequence of 3 putative secreted peptides. SC-4-1-3 is 98% identical to the tsetse Fb12 protein, but has very weak similarity to Drosophila proteins of the same size, both of unknown function. The remaining 2 proteins do not match any other protein of the NR database in a significant way, and include the relatively well expressed protein encoded by SC-4-39-1, which was assembled from 16 ESTs. Unique proteins such a these are commonly found in sialotranscriptomes of blood sucking arthropods and possibly reflect the very fast pace of evolution observed for these proteins possibly due to host immune pressure.
The EST set acquired in this study allowed for the description of 46 coding sequences, mostly full length, associated with housekeeping functions, including a set of conserved hypothetical proteins that might be related with protein synthesis or protein modification. A CDS fragment coding for a protein similar to insect proteins annotated as transposable elements is also presented.
From the analysis of previous sialotranscriptomes, it is becoming clear that the “generic” salivary potion of any blood feeding arthropod, even those not descending from a common blood sucker ancestor, consists of the somewhat unrelated classes of enzymes, protease inhibitors, vasodilator agonists, serotonin- and histamine-binding proteins, antigen 5 proteins, antimicrobial peptides and a large group of mysterious, unknown secreted proteins. This disjointed list is found in triatomines, mosquitoes, sand flies, biting midges and ticks.
Among the enzymes, ATP-diphosphohydrolases are commonly found in the saliva of hematophagous arthropods feeding on mammals, but not on those feeding on birds or reptiles (Ribeiro, 2000; Ribeiro et al., 1989; Ribeiro et al., 1985). They degrade ATP and ADP to AMP thus eliminating important agonists of neutrophil and platelet aggregation. Different protein families can perform this function. Mosquito and triatomine bugs of the genus Triatoma have recruited members of the 5′-nucleotidase family (Champagne et al., 1995; Faudry et al., 2004; Sun et al., 2006), while the bed bug Cimex lectularius and sand flies have recruited a unique enzyme, the Cimex type of apyrase (Valenzuela et al., 2001; Valenzuela et al., 1998), now found to be ubiquitous in animals. In these insects, transcripts for these enzymes are found abundantly, but in this Stomoxys sialotranscriptome we found no evidence for the expression of any protein of these families. Previous work with salivary gland homogenates of Haematobia irritans, also a blood feeding Muscidae, did not identify salivary apyrase activity or anti-platelet activity (Cupp et al., 1998; Kerlin and Hughes, 1992), perhaps explaining the absence of transcripts coding for known ATPdiphosphohydrolases in Stomoxys. This negative finding was interpreted as resulting from bovine platelets being exceptional in their response to ADP. Bovine platelets respond with a monophasic, not diphasic wave of platelet aggregation by ADP, and require higher agonist concentrations, when compared to human platelets, for example (Bondy and Gentry, 1989; Dodds, 1977).
Serine proteases are also commonly found in the saliva of blood feeding arthropods, but only in Tabanus were they characterized to have a fibrinogenolytic, and thus anti-clotting, action (Xu et al., 2008). The S. calcitrans sialotranscriptome revealed transcripts coding for an endonuclease, which could help to decrease the skin viscosity, as does hyaluronidase. Hyaluronidase activity, however, was measured, and not detected, in stable fly salivary gland homogenates (Volfova et al., 2008).
The S. calcitrans sialotranscriptome reveals only one family of putative proteins containing domains associated with protease inhibitors, namely for the Kazal domain. However, Kazal domain containing peptides were verified to be vasodilatory peptides in horse flies (Takac et al., 2006; Xu et al., 2008), suggesting the S. calcitrans peptides could be performing a similar function. On the other hand, the S. calcitrans sialotranscriptome abounds with transcripts coding for homologues of the horn fly thrombin inhibitor named thrombostasin (Zhang et al., 2002). Without the previous discovery of the Haematobia thrombin inhibitor we would have no clue for the function of this protein in Stomoxys. We also call the attention for the bioinformatic tool ProP (Duckert et al., 2004) that could identify correctly the furin cleavage and processing of the S. calcitrans thrombostasin peptides as detected by Edman degradation (Fig 2) (Duckert et al., 2004). Future sialotranscriptome analysis may benefit from this tool, mainly when attempting to predict the function of relatively large proteins containing repetitive domains.
Antigen-5 related proteins are ubiquitously found in sialotrancriptomes, but little is known about their function in the saliva of blood-feeding arthropods. However, the S. calcitrans protein here described has immunoglobulin binding capacity (Ameri et al., 2008) and may inhibit the classical pathway of complement activation. In Tabanus yao members of this protein family acquired an RGD motif and inhibit platelet aggregation by interfering with the platelet receptor for fibrinogen, which causes platelet cross-linking (Xu et al., 2008).
Three antimicrobial peptides were discovered in the S. calcitrans sialotranscriptome, one of which, named stomoxyn, was previously discovered in the fly's gut (Boulanger et al., 2002; Landon et al., 2006). Salivary antimicrobial peptides, when ingested with the blood meal may prevent microorganism growth in the meal, and when left in the host skin may prevent wound infection, a good husbandry practice on the part of the stable fly.
Regarding biogenic amine binders, the MMDB indicates lipocalin matches for the Hyp16 members, which are highly expressed. The 11.4 kDa conserved family member SC-4-47-2 is also highly expressed. These proteins are strong candidates as biogenic amine binders, based on their high expression levels and the observation that other salivary proteins in this family are always associated with this function (Calvo et al., 2006).
We additionally obtained the CDS for glutamine-rich proteins possibly associated with matrix (collagen?) binding and three additional proteins for which we have no clue for their function, characterizing the mysterious group always associated with sialotranscriptomes.
Finally, when comparing the complexity of both the sialotranscriptome and the SDS gel of S. calcitrans with that of mosquitoes, or triatomine bugs, the lower complexity of Stomoxys is apparent. While mosquitoes have been blood feeders for over 150 MY, when Aedes and Anopheles shared a common blood feeding ancestor (Grimaldi and Engel, 2005; Krzywinski et al., 2006), it is likely that stable flies are relative newcomers to the art, and still have some evolutionary time ahead to perfect their “magic potion” as well as their feeding apparatus, as evidenced by their cruder mouthparts that inflict a quite painful bite. Indeed horn and stable flies appear to be opportunistically migrating into this feeding mode after emergence of large mammals that are relatively helpless in defending themselves. This contrasts with the more sophisticated potion of Tabanids, elegantly demonstrated by Xu et. al (Xu et al., 2008). Tabanids are vectors of apicomplexan parasites to turtles (DeGiusti et al., 1973) indicative of a long association with feeding on blood of vertebrates, possibly starting before the radiation of mammals, ~60 MYA. Similarly, tsetse, also a Muscidae, appears to have an ancient origin as evidenced by the preference for some species to feed on reptiles, by their efficient transmission of Trypanosoma protozoa and their fine feeding apparatus (Gordon and Crewe, 1948; Krafsur, 2009; Okiwelu and Maiga, 1981).
Additional file 1 Supplemental Table S1: Hyperlinked Excel spreadsheet and associated files with EST assembly results. NOTE: This is a compressed ZIP file that should be expanded to a new directory. After this is done, start Excel and then open the file ending in .xls so the hyperlinks will work.
Additional file 2 Supplemental Table S2: Hyperlinked Excel spreadsheet with deducted protein sequences. See note above.
This work was supported by the Intramural Research Program of the Division of Intramural Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, NIH Grant GM41247, and Kansas State University Agricultural Experimental Station Grant, 418343. We thank Kent Hampton for insect rearing.
Because JMCR is a government employees and this is a government work, the work is in the public domain in the United States. Notwithstanding any other agreements, the NIH reserves the right to provide the work to PubMedCentral for display and use by the public, and PubMedCentral may tag or modify the work consistent with its customary practices. You can establish rights outside of the U.S. subject to a government use license.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.