The human parasite
Entamoeba histolytica (
E. histolytica) is a unicellular protozoal parasite that phylogenetically is placed on one of the lowermost branches of the eukaryotic tree, closest to
Dictyostelium discoideum [
1]. It is an unusual organism, having 9,938 predicted genes, with slightly less than one third (31.8%) of its predicted proteins having no homologues in GenBank [
2]. Humans are its only natural hosts, and
E. histolytica is spread by ingestion of contaminated food or water via the fecal-oral route and thus tends to endemically infect people under circumstances where hygiene is poor [
3]. It has a simple life cycle, alternating between infective quadrinucleate cysts and invasive motile trophozoites [
3]. 80% of people infected with
E. histolytica are colonized asymptomatically; in the remaining 20%, trophozoites invade into the intestinal epithelium, resulting in clinical disease [
3]. It is estimated that there are 50 million symptomatic cases of amebic colitis and 100,000 deaths per year worldwide due to
E. histolytica [
4].
The discovery that double-stranded RNA (dsRNA) can initiate post-transcriptional sequence-specific gene silencing of cellular genes [
5] via translational repression or degradation of mRNA in most eukaryotic cells has become an important tool in assessing and manipulating gene function. This mechanism of RNA interference (RNAi) may have evolved as a defense against viruses and transposable elements with dsRNA intermediates [
6,
7]. The small RNA intermediates in this process, short interfering RNAs (siRNAs), result from dsRNA being cleaved at 21- to 23- nucleotide intervals [
8] by an RNase III-type protein, Dicer [
9], and are then incorporated into the RNA-induced silencing complex (RISC), which includes Argonaute "Slicer" protein [
8,
10]. The antisense strand of the siRNA is used to guide the RISC to its target mRNA, which is then cleaved by Argonaute [
11,
12]. RNAi effects can be amplified by the action of RNA-dependent RNA polymerases (RdRPs). siRNAs act as primers for RdRPs, which form new dsRNAs using the target mRNA as a template, which are subsequently cleaved into siRNAs with sequences corresponding to target mRNAs but differing from the original dsRNAs [
13,
14]. Genes encoding RdRPs have been identified in many organisms, but not in flies or mammals [
12].
E. histolytica possesses the molecular machinery for RNAi. It has a gene [GenBank:
XM_645408] [
2,
15,
16] encoding a protein which has a single RNase III domain and possesses RNase III activity, and could perform the Dicer role as a dimer. It also has two Argonaute homologs [GenBank:
XM_651344,
XM_651422] [
2,
15-
17] and an RdRP [GenBank:
XM_646217] [
2,
15]. Exploitation of RNAi for knockdown of gene expression is an attractive approach for
E. histolytica, as there is no evidence for meiotic division or detectable homologous recombination of genes [
18-
20], thus it has not been possible to generate gene knockouts [
18,
21]. Multiple copies of the genome, and even nuclei, occur in the parasite due to an apparent lack of the normal cell cycle regulatory checkpoints [
22,
23]. Homologous
Entamoeba cell-cycle regulation genes are divergent from typical eukaryotic versions and may not have equivalent function [
19]. This presents difficulties in studying gene function or in isolating recessive mutations [
18]. The study of the function of individual genes in the past has been limited to other techniques, such as the over-expression of wild-type or mutant genes, and other methods of gene inactivation such as antisense [
21,
24]. Methods of RNAi used in
E. histolytica have included the use of long dsRNA expressed by an
E. histolytica RNA polymerase II promoter, which was successfully used to knock down expression of the
E. histolytica proteins Diaphanous, Klp5 and EhSTIRP [
18,
25,
26], and the soaking of trophozoites in artificial siRNAs to knock down γ-tubulin expression [
20]. These reports of RNAi use in
E. histolytica showed knockdown of a single gene or of a gene family. Here, we report in this study the success of the method of expression of short hairpin RNAs driven by the
E. histolytica U6 promoter to knock down protein expression in
E. histolytica of three unrelated genes.
Short hairpin RNAs (shRNAs) have a similar structure to siRNAs except the sense and antisense strands are connected at one end by a short loop, and function like siRNAs to knock down gene expression [
27]. shRNAs can be produced from an expression vector as a single transcript from a RNA polymerase III promoter. The eukaryotic U6 promoter offers two advantages over other RNA polymerase III promoters: the promoter region immediately upstream of the transcribed sequence for the U6 small nuclear RNA gene includes all the required regulatory elements [
28,
29], and the termination sequence consists of 4 to 5 thymidine residues rather than a poly-A tail [
28,
29]. A variety of shRNA loop and stem lengths have been tested, with the loop UUCAAGAGA [
28] used in a number of mammalian shRNA constructs, including Gou et al (2003) [
30], and is also used in the constructs in this study. Longer hairpins with 29-base pair stems appear to be better inhibitors of gene expression than ones with shorter 19–21 bp stems [
31]. Increased effectiveness has also been seen for similarly-sized longer artificial siRNAs, with only one siRNA apparently generated per longer shRNA or siRNA [
31,
32].
Genes selected for knockdown: The three genes selected for knockdown in this study, Igl, URE3-BP, and EhC2A, are genes involved in amebic virulence under study in our laboratory; they were selected since we wanted to create an additional tool for studying the function and role of these genes in amebic virulence.
Igl, the intermediate subunit of the galactose- and N-acetyl-D-galactosamine- (Gal/GalNAc) inhibitable lectin [
33,
34], is a 150 kDa protein. The Gal/GalNAc lectin, the major defined amebic adhesin, is a virulence factor mediating adherence to target cells in the first step of contact-dependent cell killing [
3]. The lectin binds to terminal galactose or GalNAc residues in glycoproteins on the surfaces of target cells, and is composed of three subunits: the heavy subunit Hgl (containing a carbohydrate-recognition domain), the intermediate subunit Igl, and the light subunit Lgl [
3]. The integral-membrane Hgl is disulfide-bonded to the GPI (glycosylphosphatidylinositol)-anchored Lgl. Igl is also GPI-anchored to the membrane [
3]. Evidence that Igl is associated non-covalently with the Hgl-Lgl heterodimer includes that Igl and the Hgl-Lgl heterodimer co-migrate in native gel electrophoresis, and affinity-purification of Igl with monoclonal antibodies results in the co-purification of the Hgl-Lgl heterodimer [
3,
33,
34]. Igl is encoded by two unlinked gene copies, Igl1 [GenBank:
AF337950] [
34] and Igl2 [GenBank:
XM_647302] [
2]; [GenBank:
AF337951] [
34], producing ~1100 aa proteins that are 81% identical and contain 32 CXXC repeats. CXXC repeats are also found in a family of transmembrane kinases of
E. histolytica and the
Giardia lamblia variant-specific surface proteins [
35].
URE3-BP, Upstream Regulatory Element 3-Binding Protein [GenBank:
AF291721] [
36], is a 22.6 kDa calcium-regulated transcription factor encoding two EF-hand motifs, which are associated with calcium-binding activity [
36]. URE3-BP binds to the URE3 (Upstream Regulatory Element 3) consensus motif, TATTCTATT, found in the promoter of
hgl5, which is one of the genes encoding the Gal/GalNAc lectin heavy subunit, and is also present in the ferredoxin 1 (
fdx1) promoter, thereby regulating the expression of these genes [
36]. The human neuronal protein DREAM (calsenilin) is the only other known example of a calcium-responsive transcription factor with EF hands [
36].
EhC2A [GenBank:
XM_650207] [
2] is a 22 kDa calcium-binding membrane protein containing a conserved C2 domain, is associated with the ability to bind phospholipids, and has a proline-rich C-terminal tail. This protein was found to be associated to the amebic phagosome [
37]. A C2 domain, identified originally in protein kinase C, is a Ca
2+-binding motif that allows calcium-dependent protein anchoring to or interaction with membranes; these domains are found in a number of signaling proteins in eukaryotes [
38].
A gene for which we have previously shown knockdown is PATMK, Phagosome-Associated Transmembrane Kinase 96 [GenBank:
XM_650501] [
2,
39]. PATMK is a transmembrane kinase family member found in the early phagosome and is involved in the phagocytosis of human erythrocytes [
39]. It contains an intracellular putative kinase domain, a short membrane-spanning region, and an ectodomain containing CXXC-repeats like Igl [
35,
39].
We report here the effectiveness of shRNAs in silencing genes in
Entamoeba histolytica. Expression of 29-bp shRNAs driven by the
E. histolytica U6 promoter was successful in knocking down protein expression of the three different and unrelated genes in
E. histolytica reported in this study, and we previously showed knockdown for a fourth gene [
39]. This method of gene knockdown appears to function well for a variety of gene types, and should be a useful tool for studying gene function in this organism.