Here we present a comprehensive data set used for a systematic investigation of the genetic organization, evolution, regulation, and expansion of the coccidian
SRS gene superfamily. The proteins they encode are the major constituents expressed on the surface coats of prevalent coccidian parasites, including
T. gondii,
N. caninum, and
S. neurona. Comparative genomic analyses identified 182
SRS genes that were distributed throughout the genome, which is in contrast to surface antigens described in other protozoan species, where their subtelomeric location is thought to drive antigenic variation or sequential expression of a single gene on the parasite surface coat (i.e.,
VSG genes in
T. brucei) or promote rapid sequence variation via increased rates of ectopic recombination (
var genes in
P. falciparum) (
24,
25). In contrast, distinct, largely nonoverlapping sets of
SRS mRNA transcripts were found to be codominantly expressed in a strain-dependent and developmental life cycle stage-dependent fashion. Although no data currently exist for merozoites, gametes, and developing oocysts that represent the sexual stages of
T. gondii development, it is likely that their surface coats will likewise be populated by a mutually exclusive set of SRS proteins. Hence, the genetic mechanism(s) that regulates
SRS gene expression is clearly different from that of the
vsg or
var genes, whereby only a single locus is expressed at any one time. Analysis of the expression data is a first step toward understanding how
SRS genes are regulated as a system. Stage specificity data will become more robust as new RNA sequence data sets emerge for bradyzoites sampled from cysts
in vivo, for merozoites growing in feline enterocytes, and for oocysts undergoing sporulation. By moving away from single-gene studies, future work will be able to look for factors that regulate the coordinated coexpression of these life cycle stage-specific
SRS genes, whether it be by shared transcription factor binding sites and promoters or via epigenetic factors.
The size of the SRS superfamily, its sequence diversity, and its limited taxonomic distribution point to a relatively recent and rapid expansion coupled to a high mutation rate. The tolerance of these events is likely due to the SRS structural fold serving as a scaffold for the design of new biological properties, similar to the proposed evolution of the
T. brucei transferrin receptor (TfR) from the
VSG gene (
26). Given the plasticity of this gene family, the assignment of SRS domains was nontrivial. The subfamily HMMs identified related domains, which were curated by match length and the presence of at least four cysteine residues. Domains that failed either of the latter two criteria were labeled as “degraded.” This led to the assignment of an SRS domain to
SRS35, the
SAG4 gene. It is noteworthy that
SAG4.2 does not meet the HMM cutoff and contains too few cysteines; however, it is likely that this gene was duplicated from an
SRS gene and evolved to fulfill a new functional niche. It is possible that a similar event led to the establishment of the
SUSA gene family (
5). Structural analysis and cross-species comparisons will help reveal the role of such SRS-related sequences.
The eight domain subfamilies present an excellent opportunity to study the SRS superfamily. An important caveat is that intrasubfamily relationships must remain ambiguous. Within each multigene locus, member genes have similar architectures of domain families and similar gene structures (number of introns) and, with the exception of the
SRS38 locus, are transcribed in the same direction. These strong associations between domain families and the small number of observed domain architectures (only 9 of a potential 36 nondirectional two-domain combinations) imply that exon switching is not a significant mechanism for increasing protein divergence. Taken together, these data suggest that these loci arose by tandem gene duplication. Whether the expansion is due to duplications of single genes or whole loci is unclear. Considering a subset of two-domain proteins (families 7 and 8), it appears that a mixture of the two is most likely (). The situation is further complicated by convergent evolution. A recent analysis of the expanded
SRS gene family in
N. caninum has revealed intralocus genetic exchange as an important mechanism homogenizing
SRS genes (
27).
The spectrum of biological functions for known SRS proteins has led to the hypothesis that the SRS family has evolved distinct subsets that either function as adhesins (e.g., SRS57) or immune decoys (e.g., SRS34A) or possess some form of dual activity (e.g., SRS29B). In fact, the tachyzoite-specific SRS29B, SRS34A, and SRS29C proteins elicit high-titer antibodies during the acute infection phase (
28). Such immune dominance has been postulated to focus immunity against the disseminating tachyzoite stage to limit their proliferation and away from the transmissible tissue-encysting bradyzoite form (which no longer expresses SRS29B, SRS34A, and SRS29C). Alternatively, immunity to these dominant antigens might function, at least in part, to provide a differentiation signal to switch tachyzoites into bradyzoites. This could represent a novel type of surface coat variation that is coupled to the developmental program of the parasite. These two mechanisms are not necessarily mutually exclusive, and the latter possibility is further supported by previous studies that implicate components of host immunity as differentiation signals (
29). At least 29 orthologs of the
SRS superfamily are present in other tissue-dwelling coccidians. In contrast, no
SRS orthologs were identified in other apicomplexan parasites (
Eimeria,
Plasmodium, or
Cryptosporidium) on the basis of primary sequence similarity searches. Recent studies, however, identified an SRS-like fold in the 10-member Pfs230-related 6-Cys family of gamete surface proteins, potentially extending the role of SRS domains as fertility factors in gamete-gamete recognition and attachment during fertilization (
15,
30). It is unclear whether this represents the convergent evolution of unrelated proteins or is the result of divergent evolution of genes that arose from the same ancestral gene, possibly a precursor of the 6-Cys proteins that are a feature of the apicomplexan genomes (
1).
Toxoplasma presumably regulates the expression of specific SRS antigens to successfully initiate infection and regulate the development of host immunity in order to establish patent, transmissible infections. Differential expression of certain alleles of the
ROPK genes
ROP5,
ROP16, and
ROP18 has previously been shown to facilitate parasite infection by actively altering immunity-related GTPases (IRGs) and cytokine signaling to promote parasite infection (
4,
31–
33). Integration of the SRS expression and polymorphism data identified numerous parasite lineage-specific differences and prioritized nine
SRS genes for functional investigation to determine their role in parasite virulence. Targeted disruption of the
SRS29B and
SRS34A genes in the virulent type I RH strain did not affect parasite replication
in vitro but did have a dramatic effect on mouse virulence. These DKO parasites showed a strong and specific upregulation in their SRS29C surface expression to a level found on the surface of mouse avirulent type II and III parasites. Upregulation was not at the level of transcription (Fig. 6C) but more likely to be the result of increased mRNA stability or to be regulated posttranscriptionally, since similar levels of
SRS29C message were detected in type I strains, which express low levels of SRS29C protein, and the DKO parasites and type II and III strains, which are high expressers. Importantly, the SRS29C-I and SRS29C-II strains that transgenically overexpress the two SRS29C alleles in the virulent RH strain possessed similar lethal dose kinetics in mice, suggesting that the SRS29C expression level, rather than polymorphism, is the more relevant factor modulating the acute-virulence phenotype. The decreased virulence seen is most likely the function of an altered host immune response because no intrinsic growth defect was observed in the transgenic parasites and they were not impaired in the ability to invade host cells. Our preliminary results measuring systemic gamma interferon (IFN-γ) levels at the peak of infection showed that parasites expressing high levels of SRS29C protein had substantially reduced levels of IFN-γ during
in vivo infection (V. Pszenny et al., unpublished data). Hence, the absolute levels of IFN-γ produced in mice infected with the SRS29C-I or SRS29C-II parasites relative to the WT RH parent strain are controlled throughout infection. Murine infections with type I strains typically produce dysregulated levels of interleukin-12 and IFN-γ and induce severe immunopathology and death (
34,
35). It is possible that antigens such as SRS29C have evolved to fine-tune and/or negatively regulate the proinflammatory capacity of other immunodominant antigens (SRS29B) or polymorphic effector molecules that directly alter cytokine induction and/or signaling (ROP5, ROP16, ROP18, ROP38, GRA15) in a parasite strain- and host genetics-dependent fashion, resulting in dysregulated immunity and gross pathology (
4,
8,
31,
32,
36). Ultimately, these two conceivably opposing mechanisms might ensure the persistence of the bradyzoite form of the parasite by actively subverting and/or fine-tuning the potent proinflammatory immune response that tachyzoite-specific antigens such as SRS29B, ROP16, ROP18, and GRA15 induce. This notion is strongly reinforced by a recent study implicating
SRS genes that regulate the persistence of parasite cysts (
7). This incredibly successful
Toxoplasma parasite might thus have evolved such an intricate mechanism to allow for early parasite proliferation and dissemination to sites of chronic infection in the host (e.g., brain, heart) before a potent host response is mounted against the tachyzoite form to limit the parasite burden and the pathology of infection.
It is conceivable that Toxoplasma’s complex virulence traits are multifactorial, with a variety of different positive and/or negative modulatory factors that cooperatively function to confer various degrees of virulence among parasite strains. Ultimately, this would allow the efficient expansion of the range of hosts and ecological niches this successful parasite occupies. Attempts to alter the expression levels and/or engineer a targeted deletion of SRS29C in the avirulent type II Prugniaud strain are ongoing. Genetic disruption of the endogenous gene and/or subsequent transgenic manipulation to allow allele-specific and regulated expression of SRS29C will be required to elucidate the mechanistic basis governing the role of SRS29C in the acute-virulence phenotype across a variety of parasite genotypes.
In conclusion, this report highlights the importance of integrating comparative and functional genomic data sets to generate testable scientific hypotheses that are assumptionless in their approach. The identification of SRS29C as an important regulator of virulence capable of altering parasite pathogenesis and the development of protective immunity underpins the critical role apicomplexan parasite surface-expressed antigens play among eukaryotic pathogens that interact with and subvert host immune responses in order to establish patent, transmissible infections.