|Home | About | Journals | Submit | Contact Us | Français|
We report the proteomes of four life cycle stages of the Apicomplexan parasite Eimeria tenella. A total of 1868 proteins were identified, with 630, 699, 845 and 1532 found in early oocysts (unsporulated), late oocysts (sporulated), sporozoites and second-generation merozoites, respectively. A MudPIT shotgun approach identified 812 sporozoite, 1528 merozoite and all of the oocyst proteins, whereas 2D gel proteomics identified 230 sporozoite and 98 merozoite proteins. Comparing the invasive stages, we find moving junction components RON2 in both, whilst AMA-1 and RON4 are found only in merozoites and AMA-2 and RON5 are only found in sporozoites, suggesting stage specific moving junction proteins. During early oocyst to sporozoite development, refractile body and most ‘glideosome’ proteins are found throughout, whilst microneme and most rhoptry proteins are only found after sporulation. Quantitative analysis indicates glycolysis and gluconeogenesis are the most abundant metabolic groups detected in all stages. The mannitol cycle ‘off shoot’ of glycolysis was not detected in merozoites but was well represented in the other stages. However, in merozoites we find more protein associated with oxidative phosphorylation, suggesting a metabolic shift mobilising greater energy production. We find a greater abundance of protein linked to transcription, protein synthesis and cell cycle in merozoites than in sporozoites, which may be residual protein from the preceding massive replication during schizogony.
Eimeria species are parasitic protozoa belonging to the phylum Apicomplexa including species of great veterinary and medical significance such as Cryptosporidium, Neospora, Plasmodium, Theileria and Toxoplasma. Eimeria tenella is one of seven species that cause coccidiosis in chickens, a serious intestinal disease which results in economic losses of around $2.4 billion per annum worldwide . Eimeria parasites have complex developmental life cycles with an exogenous phase in the environment during which oocysts excreted from the chicken undergo differentiation (sporulation) and become infective, and an endogenous phase in the intestine during which there are two or more (depending on the species) rounds of discrete, expansive asexual reproduction (schizogony) followed by sexual differentiation, fertilisation and shedding of unsporulated oocysts.
The unsporulated oocyst results from fertilisation of gametes and develops by the deposition of proteins (for example Gams 56, 82 and 230) from two distinct wall forming bodies into a multi-layered oocyst cell wall . After shedding, unsporulated oocysts make contact with air and moisture and rapidly undergo meiosis (completed by ~ 9–12 hours of sporulation) and cell division to give rise to 8 haploid sporozoites (completed by ~24 hours of sporulation) . When ingested by a chicken the sporozoites are liberated by mechanical abrasion of the oocyst wall in the chicken’s gizzard followed by enzymatic digestion of the sporocyst wall in the lumen of the upper intestine. Sporozoites migrate to their preferred sites of development (in the case of E. tenella, this is the caecum) where they invade villus enterocytes. Two species (Eimeria brunetti and Eimeria praecox) undergo their entire endogenous development within the villi but others, including E. tenella, develop within enterocytes of the crypts ; the mechanism by which sporozoites transfer from villi to crypts is not understood but may involve transportation within host intra-epithelial lymphocytes  or free migration of sporozoites between or across cells . E. tenella undergoes two massive and distinct waves of schizogony in the crypts, which produce large numbers of first and second generation merozoites. A third round of schizogony, initiated by invasion of second generation merozoites and characterised by much smaller schizonts, is know to occur and may be obligatory , although it is possible that invasion of second generation merozoites may also initiate gametogony.
Sporozoites and merozoites of E. tenella share many features related to their invasive natures including proteins released from micronemes, which are important for host binding and invasion , rhoptry proteins secreted during invasion to form the parasitophorous vacuole within which the parasite resides , the use of actin based ‘glideosome’ to power host invasion  and the possession of GPI-linked variant surface antigens (SAGs), which may mediate binding to the host . However sporozoites and merozoites also differ in some characteristics as follows: (1) The sporozoite stage is much longer-lived than any of the merozoite stages as it can remain dormant for many weeks within the oocyst until ingestion and excystation within the gut of a chicken. (2) After excystation, sporozoites migrate a considerable distance from the upper part of the small intestine along the gut lumen to invade enterocytes of the caecum  whereas merozoites invade locally and rapidly. (3) Sporozoites contain a unique pair of organelles termed the ‘refractile bodies’ that are hypothesised to be protein storage organelles  and which fragment and reduce in size in first-generation merozoites and are absent from second-generation merozoites. (4) Successive merozoite generations (but not sporozoites) are punctuated by intracellular schizont stages characterised by rapid and massive cell replication; producing up to 900, 350 and 16 daughter merozoites in the first, second and third generations respectively.
Eimeria genome sequencing is ongoing, thus previous global analysis of gene expression has largely been achieved by EST studies [13–15]. A previous proteomic analysis has identified a limited number of sporozoite proteins . Here we describe the first large scale proteomic analysis of E. tenella unsporulated and sporulated oocysts, sporozoites and second-generation merozoites. We compare and contrast the distribution and abundance of proteins in the zoites and correlate these with possible shared or stage-specific invasive mechanisms. Furthermore proteins associated with organelles are followed from development of early oocyst to sporozoites.
E. tenella (Houghton strain) oocysts were propagated, harvested from caeca and sporulated using standard protocols . Early (unsporulated) oocysts were harvested after 9–12 hours, whilst as late oocysts were sporulated in vitro for 24–29 hours. Sporozoites were hatched from cleaned oocysts by in vitro excystation and purified over nylon wool and DE-52 cellulose columns . Second-generation merozoites were recovered from the caecal mucosa of chickens 112 hours after inoculation with 5 × 105 sporulated oocysts .
Proteins from 108 purified sporozoites were resolved by 2D-PAGE and identified by mass spectrometry as previously described . Briefly, proteins were solubilised in urea and resolved on either pH3-10 or pH4-7 gels. Protein spots were stained with colloidal Coomassie, excised, digested with trypsin and subjected to MS/MS analysis as described .
Proteins from 108 purified merozoites were solubilised in 7 M urea, 2 M thiourea, 4% (w/v) CHAPS, 2% (v/v) IPG Buffer, 40 mM DTT and resolved on either pH 3–10 or pH 4–7 2D gels. Protein spots were silver stained and destained with 15mM potassium ferricyanide/50mM sodium thiosulphate for 5 min, reduced with 10mM DTT/100mM ammonium bicarbonate for 30 min and alkylated with 55mM iodoacetamide/100mM ammonium bicarbonate for 20 min. Spots were washed with 100mM ammonium bicarbonate and dehydrated with 100% (v/v) acetonitrile. Proteins were digested with 6ng/μl trypsin/50mM ammonium bicarbonate for 5 hours at 37°C and peptides were extracted in 1% (v/v) formic acid/2% (v/v) acetonitrile and secondly in 50% (v/v) acetonitrile. Tryptic peptides (5 μl) were desalted and concentrated on a C18 TRAP (180μm × 20mm, 5μm Symmetry, Waters), for 3 min at 10 μl/min, and resolved on a 1.7μm BEH 130 C18 column (100μm × 100mm, Waters) at 400 nl/min attached to Waters UPLC Acquity HPLC. Peptides were eluted at 400 nl/min with a linear gradient of 0–50% (v/v) acetonitrile/0.1% (v/v) formic acid over 30 min, followed by 85% (v/v) acetonitrile/0.1% (v/v) formic acid for 7 min. Ionised peptides were analysed by a Waters Quadrupole Time of Flight Premier Mass Spectrometer in data directed acquisition mode, tuned using 200 fmol/μl human glutamate fibrinopeptide B in (v/v) 0.1% formic acid/(v/v) 50% acetonitrile/water. Peaklists were created from raw spectral data with ProteinLynx Global server version 2.2.5 (Waters), using the following criteria: external calibration with lockmass of 785.8426 (M+H2+) of glutamate fibrinopeptide B, adaptive type background subtraction combining all scans and deisotoping with a threshold of 5%.
The resulting peak lists from both sporozoites and merozoites were submitted to MASCOT version 2.0 (Matrix Science, UK) and searched against a locally installed protein database composed of: E. tenella predicted proteins (Workflow-based Automatic Genome Annotation or WAGAs). Also included were predicted proteins from possible contaminants Gallus gallus (NCBI, 112806 Assembly) and Homo sapiens (NCBI, 030408 Assembly). Database search settings selected trypsin as the enzyme, allowed one missed cleavage, carbamidomethylated cysteine as fixed modification, oxidised methionine as potential variable modification, peptide tolerance of ± 200 ppm, MS/MS tolerance of ± 0.2 Da and +2 and +3 peptide charge states. The WAGA E. tenella predicted proteins were derived from the 020407 Genome Assembly (The Wellcome Trust Sanger Institute) and gene models were predicted using gene prediction programs GlimmerHMM, SNAP, PHAT, TwinScan, GeneFinder and Apicomplexa gene models which were refined using 70,000 ESTs, 8775 peptides detected by MudPIT or gel LC-MS/MS, (Xikun Wu (IAH) and The Wellcome Trust Sanger Institute, unpublished data). Sporozoite proteins were analysed from a total of four gels (pH 3–10A, pH 3–10B, pH 4-7A and pH4-7B) and merozoite proteins were identified from two gels (pH 3–10 and pH 4–7).
Sporozoite, merozoite, early and late oocyst proteins were each analysed by MudPIT as previously described . Briefly, proteins were solubilised in 2M urea 10 mM Tris-HCl pH 8.5 and centrifuged at 16,000 g for 30 min at 4°C to separate soluble from insoluble fractions. Both fractions were digested with endoproteinase Lys-C and trypsin . Peptide mixtures were analyzed by MudPIT . A quaternary Agilent 1100 series HPLC coupled directly to a Finnigan LTQ-ion trap mass spectrometer (Thermo, San Jose, CA) equipped with a nano-LC electrospray ionization source  resolved peptide mixtures by strong cation exchange liquid chromatography upstream of reverse-phase liquid chromatography . Fully-automated 12 step chromatography was carried out on each sample. Good quality spectra were filtered as previously described  and compared, using SEQUEST (version 27), to a database of predicted proteins from E. tenella (WAGA), chicken, human and a reverse sequence database was included to estimate the false positive rate. No enzyme specificity was considered for any search. All searches were parallelized and performed on a Beowulf computer cluster consisting of 100 1.2 GHz Athlon CPUs. Results were assembled and filtered using DTASelect (version 2.0) to estimate the false positive rate (<6%). Triplicate samples of each life cycle stage were analysed and combined to create a non-redundant catalogue.
The abundance of a protein detected by MudPIT is reflected in the number of peptides detected , spectrum count  and sum of cross correlation scores (Xcorr) . The Xcorr score quantitates the relatedness of experimental tandem mass spectra to in silico generated tandem mass spectra from sequence databases. This calculation includes the number of fragment ions in the mass spectrum, their relative abundance, continuity of ion series and presence of immonium ions for certain amino acids in the spectrum; all of which are proportional to the concentration of the precursor ion. To assess the abundance of protein associated with a functional category we summed the a) peptide number, b) spectrum count or c) sum of Xcorr of each protein detected associated with the functional category. We normalised for the total a) peptides, b) spectrum or c) sum of Xcorr of all proteins detected in each stage and observed trends consistent in the three different measures of protein abundance.
Predicted Eimeria protein sequences with matching peptides (WAGAs_detected.fasta attached) were identified. Identifications were attributed by combining data from several sources in order of importance; 1) similarity to E. tenella published sequences deposited in NCBI (BLAST searches), 2) predicted domains (InterProScan ), 3) Plasmodium falciparum, Theileria anulata and Toxoplasma gondii orthologues (identified by Inparanoid ), 4) a significant BLAST score to Swiss-Prot or 5) NCBI redundant databases. In addition, characterised SAG family sequences were provided by Chandra Subramaniam and Fiona Tomley (unpublished data). E. tenella orthologues of described Apicomplexan rhoptry protein sequences were found by Inparanoid and BLAST. All identified proteins were assigned one functional classification using the MIPS Functional Catalogue Database (FunCatDB)  as modified for Plasmodium . E. tenella proteins with orthologues in P. falciparum or S. cerevisiae with ascribed MIPS [29, 30] were assigned the same category. Proteins lacking orthologous MIPS annotation were manually assigned a category. Proteins with metabolic functions were further classified into pathways using a) InterPro domain to Gene Ontology mapping (http://www.geneontology.org/external2go/interpro2go), b) KAAS (KEGG Automatic Annotation Server at http://www.genome.jp/kegg/kaas/), which uses BLAST comparisons to the KEGG pathway database and c) additional manual curation.
Proteins associated with the secretory pathway were identified by the presence of either a signal peptide or transmembrane domain(s), assessed by Phobius . Secretory proteins putatively imported to the apicoplast were identified with PATS , or modified by the addition of a GPI anchor were predicted by the big-PI predictor  or PredGPI (http://gpcr.biocomp.unibo.it/predgpi/pred.htm). Cytoplasmic proteins putatively imported to the nucleus or mitochondria were identified with NucPred  and PlasMit , respectively.
A total of 1868 non-redundant proteins was detected from all of the stages that were analysed (Table S1).
Merozoite lysates analysed by MudPIT or LC-MS/MS identified a non-redundant dataset of 1532 proteins. MudPIT detected 1528 of these proteins with a minimum of two peptides (Table S2) and LC-MS/MS, using a combination of broad (pH 3–10) and narrow (pH 4–7) range 2-DE gels (Figs. 1 and S1), resulted in 136 protein spots (Table S3) from which 1810 peptides were identified within a non-redundant dataset of 98 proteins. The majority of proteins identified by gel LC-MS/MS (91 proteins) were also identified by MudPIT (Table S1), generally by a significant number of peptides, suggesting that these are abundant molecules.
Sporozoite lysates analysed by either MudPIT or LC-MS/MS identified a total of 845 non-redundant proteins. MudPIT analysis of sporozoite lysates identified 812 proteins, with two or more peptide matches (Table S4). Sporozoite proteins were also resolved on four 2-DE gels; pH 3-10A, pH 3-10B, pH 4-7A, pH 4-7B (Figs 2, S2-S4). Electrospray mass spectrometry identified 13,254 peptides from 371 spots, identifying 230 individual proteins (Tables S5-S8), including an additional 36 proteins not found by MudPIT.
The E. tenella WAGA database contains a total of 6659 predicted proteins. Thus, we detect ~23%, 13%, 9% and 10% of all predicted proteins in the merozoite, sporozoite, early and late oocysts, respectively. The percentage of chicken proteins identified within each Eimeria life cycle preparation ranged between, 7.3% with sporozoites and 3.6% with early oocyst, indicating low contamination with host proteins. Figure 3 illustrates the degree of overlap between the datasets derived from each life cycle stage.
Analysis of proteins by 2D gel and LC-MS/MS revealed that individual proteins are often found in multiple spots. Sporozoite proteins analysed from gels pH3-10A, pH3-10B, and pH4-7A show 81/144, 47/102 and 27/92 proteins respectively are found in more than one spot (Tables S5-S8). Merozoite proteins analysed from pH4-7A and pH3-10 show 28/76 and 10/38 proteins are also found in more than one spot (Table S3). Several proteins, if detected, are consistently found in multiple spots and a selection of these is shown in Table S11.
We detected 47 SAG surface proteins in the merozoite whilst only four were found in the sporozoite. Microneme proteins MIC1-5 & MIC7-9 [36–39] were detected in both merozoites and sporozoites and the majority of microneme proteins (MIC1-5 & MIC7) were also identified in the fully sporulated oocyst but all were absent from the early oocyst. Microneme proteins AMA-1 was detected only in sporozoites and AMA-2 was only detected in merozoites. Rhoptry neck proteins RON2, RON3 and RON4 were found in the merozoite, whilst RON2 and RON5 were found in both the sporozoite and late oocyst. The rhoptry proteases, Tg subtilisin 2 orthologue was found in the merozoite and Tg toxopain orthologue was identified in all stages analysed. No rhoptry proteins were found in the early oocyst apart from toxopain. In contrast the oocyst wall proteins Gam56 and Gam 82  were readily detected in both early and late oocysts but were not found in either of the zoite stages.
‘Glideosome’ proteins actin, myosin A, myosin A tail interacting protein (MTIP), aldolase, profilin, actin depolymerizing factor (ADF) were detected in both of the motile stages, the merozoite and sporozoite. All described ‘glideosome’ proteins, with the exception of myosin A and MTIP were also detected in the early oocyst.
Proteins previously described as residing in the refractile bodies or at the sporozoite apex, such as Eimepsin , S07 antigen , and pyridine nucleotide transhydrogenase  were found in the sporozoite, early and late oocyst proteomes but were absent or expressed at low levels in the second-generation merozoites. Putative refractile body proteins lactate dehydrogenase, carbonyl reductase and haloacid dehalogenase-like hydrolase  were found in all stages analysed.
The mannitol cycle is an ‘off shoot’ of the glycolytic pathway and component proteins; mannitol-1-phosphate dehydrogenase, mannitol-1-phosphatase and mannitol dehydrogenase are detected in the unsporulated, sporulated oocysts and sporozoites but were not found in merozoites.
To investigate the functional composition of the proteomes described, we classified each protein detected into a MIPS functional category (Table S1). We reasoned that if a developmental stage performs a greater range of roles associated with a functional category, it might be expected to use a greater diversity of proteins. The range of proteins associated with each functional category is represented by the number of different proteins detected in the functional category normalised for total number of different proteins detected, for each life cycle stage (Fig. 4). The majority of proteins detected are of unknown function or are involved in metabolism. A considerable number of proteins are categorised as protein synthesis and protein fate, whilst those associated with the cell cycle are few in number. There appears to be no major differences between the life cycle stages in the range of proteins associated with the functional groups.
We reasoned that even if two parasite life cycle stages perform a similar range of functions (reflected in associated protein diversity), one stage may be performing a function to a greater degree and this may be reflected in the protein abundance associated with the functional categories. Thus, we estimated the relative amount of protein associated with each functional category in each life cycle stage as assessed by a) the sum of peptide number, b) the sum of spectrum count or c) sum of Xcorr (Fig. 5A, B and C). We find metabolism to be the largest category of protein abundance in all stages analysed, except the merozoite. Identification of a larger number of merozoite proteins has possibly allowed identification of a greater number of lower abundance proteins of unknown function. Compared to the sporozoite we detect in the merozoite, two-fold more protein associated with transcription, 1.5 fold more involved in protein synthesis and 1.7 fold more linked with cell cycle functions. Also compared to the sporozoite, we detect over 1.5 fold more protein associated with protein synthesis and cell cycle functions in the early oocyst.
We analysed proteins associated with metabolism in more detail, categorising them further into the main metabolic pathways (Table S1). As mentioned in results 3.3 proteins associated with the mannitol cycle were found in all stages apart from merozoites. The range of proteins associated with the remaining main metabolic pathways appears similar in all the life cycle stages (Fig. S5). Analysis of the protein abundances’ found, indicates that the most abundant group of metabolic proteins are involved in glycolysis and gluconeogenesis in all life cycle stages (Fig. 6A, C & E). Despite the apparently small contribution of protein to oxidative phosphorylation, it is three times more abundant in merozoites than the sporozoite or the oocyst stages (Fig. 6B, D & F), despite using a similar repertoire of proteins (Fig. S5). Proteins associated with nucleotide metabolism were more abundant in merozoites than the other stages (2 fold more than sporozoites) (Table S12). In comparison, proteins of other metabolic pathways are equally abundant (Table S12).
Approximately 25% of sporozoite proteins are identified as being part of the secretory pathway compared to 30% of merozoite proteins (Fig. S6). The merozoite is predicted to add a GPI anchor to 68 proteins compared to only 16 sporozoite proteins detected (mostly SAG proteins). As expected the majority of proteins in all stages are cytoplasmic, ranging between 59%-66% in merozoites and sporozoites, respectively. Only 13 proteins are predicted to target to the Apicoplast using the prediction program PATS (Table S1).
Our description of the sporozoite proteome extends the previous description of 28 sporozoite proteins  and adds the first proteomes of second-generation merozoites and both unsporulated and sporulated oocysts. We detect 28% of all E. tenella predicted proteins in all stages analysed, with 23% of predicted proteins found in the merozoite. The published proteomes of P. falciparum merozoites  and Toxoplasma gondii tachyzoites  describe detection of 15% and 29% of their respective predicted proteins. Thus comparatively our detection coverage of merozoite proteins is good. We find 13%, 9% and 10% of predicted proteins from sporozoites, early and late oocysts respectively, suggesting that the protein detection coverages although respectable, are unlikely to be exhaustive.
The 288 proteins found in both sporozoites and merozoites but not in the early oocysts (Fig. 3A) may have functions associated with invasive mechanisms shared by the zoites. Proteins found in both zoites include microneme proteins involved in host adhesion and ‘glideosome’ proteins which drive motility, an essential component process of invasion. During parasite invasion of the host cell an intimate contact, the moving junction, is formed and includes proteins secreted from the micronemes (AMA-1) or rhoptry neck (RON2, RON4 and RON5) in T. gondii [46, 47]. These proteins are thought to play a conserved function in P. falciparum  and are therefore likely to act similarly in E. tenella. Indeed RON2 and RON5 has been identified in enriched E. tenella sporozoite rhoptry fractions (Tomley unpublished data). Our detection of an isoform of AMA-1 and RON2 in both zoites suggests a similar moving junction is common to invasion of both. However, we find AMA-1 and RON4 in merozoites only and AMA-2 and RON5 in sporozoites only. The stage specific expression of AMA-1 and AMA-2 has been confirmed by transcription analysis (Tomley unpublished data). This suggests that moving junction proteins differ between merozoites and sporozoites. Both RON4 and RON5 locate to the cytoplasmic face of the host cell membrane of the moving junction , but as both zoites invade chicken gut enterocytes, it remains unclear why distinct proteins would be present in the moving junctions of the two invasive stages.
Additional, protein differences between the zoites include the large range of SAG surface proteins found in the merozoite whilst few are detected in the sporozoite. SAG proteins are hypothesised to assist with host cell binding and greater merozoite SAG diversity may increase the avidity of binding to the host receptors aiding rapid invasion of this short lived zoite.
Proteins shared across the development of early/late oocysts and then to sporozoites (Fig. 3B) highlight differences in protein development associated with specific organelles. Refractile body proteins, the majority of ‘glideosome’ proteins and the rhoptry protease toxopain are found in the early oocyst in addition to the sporulated oocyst and sporozoite. Indeed, SO7 antigen, ADF and profilin proteins have been previously reported in all four stages [40, 50, 51]. As the early oocyst has neither refractile bodies, nor a pellicle  to support ‘glideosome’ activity, nor rhoptries with proteins which require proteolytic processing, these proteins are likely to be found at alternative locations and may perform additional roles in the early oocyst. In contrast microneme proteins and the majority of rhoptry proteins are only found in the later two stages of oocyst to sporozoite development. Micronemes and rhoptries are first visible by electron microscopy after sporozoite formation in the late oocyst . Indeed, detailed analysis of MICs 1, 2, 3, 4 and 5 proteins showed they are initially found 22.5 hours after initiation of sporulation, concurrent with the formation of recognisable sporozoites . This suggests microneme and most rhoptry proteins are not pre-synthesised but synthesis coincides with development of the apical complex during sporozoite formation.
Most metabolic proteins detected appear to be associated with gluconeogenesis or glycolysis in all life cycle stages (Fig. 6A, C, E). If protein abundance reflects the contribution to energy production, gluconeogenesis and glycolysis are likely to be the major contributors, similar to Toxoplasma tachyzoites . The mannitol cycle appears well represented in oocyst stages and sporozoites but we do not identify these proteins in the merozoite. This is consistent with previous reports of substantial up-regulation of the mannitol cycle during sexual development . Importantly, the drug Megasul inhibits mannitol-1-phosphate, blocking oocyst development and transmission of the parasite to subsequent hosts . On the other hand, we detect more protein linked to oxidative phosphorylation in merozoites compared to the other life cycle stages. Every stage has proteins involved in oxidative phosphorylation (Fig. S5) and electron microscopy shows the mitochondria of all stages contain similar internal membrane structures to support oxidative phosphorylation. However, the dramatic difference in protein abundance observed suggests a metabolic shift to use oxygen to mobilise greater energy production in the merozoite. Indeed, sporozoite excystation is adapted to low oxygen conditions . It has also been suggested that the parasite uses anaerobic glycolysis during schizogony but switches to aerobic metabolism immediately before merozoite release . The glycolytic proteins pyruvate kinase and enolase relocate from the cytoplasm/nucleus to the parasite apex in both merozoites and sporozoites in contact with host cells . Moreover both proteins are secreted under conditions identical to microneme release i.e. incubation with FCS at 41°C and secretion is inhibited by the kinase inhibitor staurosporine . These data suggest a role for these proteins at the time of invasion. Furthermore, in Toxoplasma tachyzoites five metabolic proteins involved in glycolysis or lactic acid fermentation relocate during parasite invasion from the cytoplasm to the pellicle . Thus, proteins involved in ATP production including those with oxidative phosphorylation roles may produce energy necessary for invasion and the merozoite may have a greater invasive energy requirement.
Transcription, protein synthesis and nucleotide metabolism all show greater protein abundances detected in the merozoite than the sporozoite (Fig. 5 and Table S12). Greater commitment to protein synthesis in the merozoite compared to the sporozoite is consistent with previous reports of large scale EST analysis. Analysis of 13,450 merozoite and 13,255 sporozoite ESTs revealed 9.5% of merozoite but only 0.2% of sporozoite ESTs were ribosomal proteins . Whilst the amount of protein involved in the cell cycle is small, we detect more in merozoites than sporozoites. Consistent with the greater abundance of cell cycle proteins, we detect MORN-domain proteins only in the merozoite. It is possible that these increased protein abundances are residual protein related to the massive replication (~350 fold) recently undergone during schizogony. Consistent with replication during sporulation (8 fold), we detect a greater abundance of protein linked to cell cycle and protein synthesis in early oocysts than sporozoites (over 1.5 fold more for both categories).
The PATS algorithm predicts few proteins to target to the apicoplast in Eimeria. PATS uses P. falciparum sequences to model apicoplast targeting and the programs authors suggest ‘it may not work’ with the related Apicomplexan T. gondii (http://gecco.org.chemie.uni-frankfurt.de/pats/pats-index.php). Indeed PATS fails to predict the described E. tenella apicoplast protein enoyl reductase . This suggests that the apicoplast targeting sequence in Plasmodium differs from Eimeria and PATS under-predicts Eimeria apicoplast targeted proteins.
Many proteins identified by 2D gel LC-MS/MS were found in multiple protein spots. For highly abundant proteins such as SO7, incomplete protein solubilisation and resolution on the 2D gel can lead to detection in multiple spots, however, in many cases differential protein resolution is likely to be the result of proteolytic processing and post-translational modifications. For example enolase is post translationally modified by either glycosylation or phosphorylation which regulates it’s location and function in vivo [61, 62]. Also, MIC1 belongs to the Thrombospondin-Related Anonymous Protein-like protein family  which undergo cleavage at sequential protein developmental points which are critical to invasion .
E. tenella gene architecture has proved extremely complex, exhibiting a large number of introns which are frequently alternatively spliced. Previous gene models were based on gene prediction programs which often disagreed. Our peptide data, together with EST data, was used to select detected gene models, such that the previous redundant database of 41,430 putative predicted proteins was condensed to the non-redundant WAGA database of 6,659 proteins (Wu and Tomley et al manuscript in preparation). Thus, this proteomic analysis has provided 8775 detected peptides with which to re-annotate Eimeria tenella gene models. In addition, we provide the first evidence that the 579 ‘hypothetical proteins’ detected are indeed expressed proteins and combined with evidence of expression patterns in the different life cycles stages, this work may inform the development of new intervention targets.
We extend our generous thanks to Derek Huntley for bioinformatic support and to Dominic Kurian and Lawrence Hunt for technical help with proteomic analysis of the merozoite samples. This work was supported by BBSRC Grant BBS/B/03858. John R Yates III is supported by NIH P41 RR011823 and Judith Helena Prieto is supported by NIH-NIAID grant number R21 AI072615-01.