|Home | About | Journals | Submit | Contact Us | Français|
Cellulose synthases form rosette terminal complexes in the plasma membranes of Streptophyta and various linear terminal complexes in other taxa. The sequence of a putative CESA from Griffithsia monilis (Rhodophyta, Floridiophyceae) was deduced using a cloning strategy involving degenerate primers, a cDNA library screen, and 5′ and 3′ rapid amplification of cDNA ends (RACE). RACE identified two alternative transcriptional starts and four alternative polyadenylation sites. The first translation start codon provided an open reading frame of 2610bp encoding 870 amino acids and was PCR amplified without introns from genomic DNA. Southern hybridization indicated one strongly hybridizing gene with possible weakly related genes or pseudogenes. Amino acid sequence analysis identified a family 48 carbohydrate-binding module (CBM) upstream of the protein's first predicted transmembrane domain. There are broad similarities in predicted 3D structures of the family 48 modules from CESA, from several glycogen- and starch-binding enzymes, and from protein kinases, but there are substitutions at some residues thought to be involved in ligand binding. The module in G. monilis CESA will be on the cytoplasmic face of the plasma membrane so that it could potentially bind either low molecular weight ligands or starch which is cytosolic rather than inside membrane-bound plastids in red algae. Possible reasons why red algal CESAs have evolved family 48 modules perhaps as part of a system to regulate cellulose synthase activity in relation to cellular carbohydrate status are briefly discussed.
Diverse prokaryotic and eukaryotic organisms synthesize cellulose microfibrils, semi-crystalline aggregates of various sizes that contain parallel β-1,4 glucan chains (Tsekos, 1999; Romling, 2002; Saxena and Brown, 2005). The glycosyltransferases involved are designated CESAs and are integral membrane proteins that show the so-called D, D, D, QxxRW signature; that is, their catalytic domain contains three spaced, probably catalytically important, glutamate residues followed by a characteristic five amino acid motif in which residues two and three are not fixed. The regions containing these four components are often referred to as U1, U2, U3, and U4. CESAs have been described from numerous embryophytes, from one streptophyte and one rhodophyte alga, from various non-plant eukaryotes (e.g. oomycetes, slime moulds), and from many prokaryotes.
In the plasma membrane, CESA proteins organize into cellulose-synthesizing complexes visible by freeze etch electron microscopy. The structure of the complexes differs between various taxa. The Streptophyta—consisting of the streptophyte algae (Mesostigmatales, Chlorokybales, Klebsormidiales, Zygnematales, Charales, and Coleochaetales) and the Embryophyta (Becker and Marin, 2009)—have rosette terminal complexes consisting of six clustered membrane particles (Mueller et al., 1976). Rosette terminal complexes are known to contain CESA proteins (Kimura et al., 1999) and disassemble in some CESA mutants (Arioli et al., 1998; Wang et al., 2008). In other taxa, cellulose synthase forms linear terminal complexes (Becker and Marin, 2009) where the particles resolved by electron microscopy occur in rows rather than forming the fixed six-member clusters characterizing Streptophyta. The length of the rows, the number of rows, and other features vary in different taxa and have been linked to variations in microfibril dimensions (Tsekos, 1999).
The red algae show linear terminal complexes with between one and four rows of particles in different species (Tsekos, 1999). The only reported rhodophyte CESA (Roberts and Roberts, 2009) is from Porphyra yezoensis (Rhodophyta, Bangiophyceae). The Rhodophyta have been reported to exist in the fossil record for >1 billion years, with representatives of Bangiophyceae and Florideophyceae recognizable in late pre-Cambrian strata (Saunders and Hommersand, 2004). Molecular clock estimates also point to very ancient origins. Cell walls of Rhodophyta vary in carbohydrate composition, and most recent cell wall studies focus on the economically important matrix polysaccharides such as the carrageenans (Cole and Sheath, 1990). The walls can contain microfibrillar xylans or mannans as well as or instead of microfibrillar cellulose. The cellulose has received little attention since early electron microscopy and X-ray diffraction studies (Myers and Preston, 1959a, b).
This study predicts the features of a CESA protein encoded by Griffithsia monilis Harvey (Rhodophyta; Florideophyceae). The protein has a family 48 carbohydrate-binding module (CBM48) in its N-terminal region which is related in sequence and predicted 3D structure to the CBMs found in some glycogen- and starch-binding proteins and to the CBMs of protein kinases involved in higher plant sugar-sensing networks. Such CBM48s are taxonomically widely distributed (Cantarel et al., 2009) but have never been reported in a CESA from any taxon.
The source of the G. monilis isolate and the growth conditions have been described (Whitney et al., 2001). Cells in 1.0l cultures were rinsed, blotted dry, ground to a fine powder in liquid nitrogen, and stored at –70°C until required.
CESA sequences from non-plant organisms with linear terminal complexes, from the green alga Mesotaenium caldariorum (Roberts et al., 2002) and the red alga P. yezoensis (Roberts and Roberts, 2009), were used for alignment by the Blockmaker program (the first step in the CODEHOP process; Supplementary Table S1 available at JXB online). The CODEHOP program (Rose et al., 1998) found primers of ≤64-fold degeneracy in regions U1–U4 when the program was weighted heavily towards the two P. yezoensis sequences, and P. yezoensis codon usage data were employed. Supplementary Table S2 shows the six degenerate primers and the other primers used.
Small amounts of total RNA were prepared from ground G. monilis tissue using Total RNA Isolation Reagent (Thermo Scientific). (The manufacturers’ instructions were followed with all molecular biology reagents unless stated otherwise.) Alternatively, total RNA and genomic DNA were both extracted from 3–6g of ground tissue (La Claire and Herrin, 1997) and separated by lithium chloride precipitation. In both cases mRNA was purified from total RNA prior to cDNA synthesis with Dynabeads Oligo dT25 (Dynal). cDNA synthesis was primed with oligo(dT18), with oligo(dT) adaptor primers (for cDNA library construction), or with primers specific to already cloned sequences [for 5′ rapid amplification of cDNA ends (RACE)]. Reverse transcriptase enzyme (MMLV) was from Promega (initial cDNA synthesis), Stratagene (cDNA library construction), or Invitrogen (Superscript III enzyme, for 5′ and 3′ RACE).
Oligo(dT) primer was used with MMLV reverse transcriptase (Promega) to synthesize cDNA from bead-purified mRNA in a total volume of 40μl. PCR (Platinum Taq High Fidelity, Invitrogen) used 1.2μl of this reaction mixture in a total volume of 15μl with 2mM MgCl2 and primer concentrations of 2μM (degenerate) or 0.2μM (pure). PCR used a ‘touchdown’ phase (six cycles each decreasing by 1°C from 60°C) followed by 37 cycles annealing at 58–63°C depending on the primers. Amplified fragments were sequenced after purifying from gels (NucleoSpin Extract II columns, Machery-Nagel) and cloning into pCR2.1 (TA cloning kit, Invitrogen).
Bead-purified mRNA was used in a SMART cDNA library kit (Clontech/BD Biosciences), employing an initial PCR amplification step and size fractionating the cDNA on a 10–40% sucrose gradient. Library plaques were probed by standard techniques (Sambrook et al., 1989) after labelling the amplimer obtained from the degenerate U1F and U4R primers with [γ-32P]dCTP. Two hybridizing plaques obtained after plaque purification were excised into the pTriplEx2 plasmid and sequenced.
‘Cap-switching’ 5′ RACE (method adapted from Frohman, 2006) used cDNA synthesized from purified mRNA using a specific primer (PuGmCA1-R2, Supplementary Table S2 at JXB online) with Superscript III reverse transcriptase (Invitrogen) in a volume of 20μl for 1h at 55°C. Then 1μl of a ‘20×’ addition solution [60mM MgCl2, 40mM MnCl2, 20μM CapFind_A primer (Supplementary Table S2), 2mg ml−1 nuclease-free bovine serum albumin (BSA) and 1× first strand buffer (Invitrogen)] was added together with a further 0.25μl of reverse transcriptase, and incubated (10min 45°C, 10min 50°C, 10min 70°C). After cooling, 0.8μl of RNase H (Invitrogen) was added followed by incubation at 37°C for 20min, purification (Machery-Nagel NucleoSpin Extract II), and elution in 20μl. Subsequent PCR used 1.2μl of this in a reaction volume of 15μl with primers CapFind_Prim and PuGmCA1-R2 or PuGmCA1-R1 (Supplementary Table S2). Gel-purified fragments were either used for further PCR with the nested primer or cloned into pCR2.1 for sequencing.
Approximately 1.5g of frozen ground alga was added to 10ml of fresh GIT buffer [4M guanidinium isothiocyanate, 0.1M TRIS pH 8.0, 10mM EDTA, 2% (w/v) sarcosyl, 5mM dithiothreitol] and incubated at 60°C with gentle agitation for 2h. Caesium chloride (2.1g) was dissolved in the supernatant after centrifugation (3500g, 10min, room temperature) before layering over a 3.0ml, 5.7M CsCl cushion in a 14×95mm polyallomer centrifuge tube and centrifuging (SW40Ti rotor, 20h, 24000rpm, 24°C). Fractions (0.5ml) collected from the top were spotted on an ethidium bromide plate, and DNA-containing fractions were pooled, dialysed overnight against TE buffer (10mM TRIS pH 8.0, 1mM EDTA), and made up to 2.0ml with TE.
DNA was further purified by adding 1.5ml of 2.0M NaCl, 0.4ml of 10% (w/v) cetyltrimethylammonium bromide (CTAB), 0.1ml of 1.0M TRIS pH 7.5, and 50μl of 0.5M EDTA. After gentle mixing at 65°C for 30min, 4.0ml of chloroform was added and gently mixed for 10min at 55°C. After centrifugation (12000g, 10min, 20°C), the upper phase was combined with two volumes of ethanol, centrifuged immediately (12000g, 10min, 25°C), and the DNA pellet was washed twice with cold 70% ethanol. After 10min air drying, the pellet was dissolved in 100–200μl of TE. Successful PCR required one further CTAB treatment, and complete cutting with restriction enzymes required two further CTAB treatments.
DNA (6μg in 50μl) was digested (50–100 enzyme units, 37°C, 17h), precipitated, and dissolved in TE. DNA separated on an agarose gel was capillary transferred to a Hybond N+ membrane (GE Healthcare). The G. monilis CESA open reading frame (ORF; 2.6kb) was amplified from genomic DNA by PCR (primers GmCA-ORF-F and GmCA-ORF-R; Supplementary Table S2 at JXB online), purified on an agarose gel, and extracted as described. A 30ng sample with 1pg of DNA marker ladder was labelled with 5μl of [γ-32P]dCTP (3000Ci mmol−1, 10mCi ml−1, Perkin-Elmer) using a DNA labelling bead (GE Lifesciences). After overnight hybridization (60°C in ExpressHyb, Clontech) and stringency washes at 58°C, the membrane was exposed with two intensifying screens (20h, room temperature).
The protein modelling tools Fugue and Orchestrar provided by the Sybyl suite of programs (SYBYL 8.1, Tripos Inc., St Louis, MO, USA) were used to build 3D protein homology models for the putative glycan-binding domain of G. monilis based on the CESA1 N-terminal protein sequence. Refinement of the resulting protein structure was done using the Amber ff99 forcefield for 1000 steps of Powell optimization.
The cDNA (and protein) sequence generated in this study has been deposited in GenBank as accession GU563823 (protein ID ADK77974).
Six degenerate primers (Supplementary Table S2 at JXB online) used in PCR on G. monilis cDNA amplified five products corresponding to the CESA regions U1–U2, U2–U3, U3–U4, U1–U3, and U1–U4 (Fig. 1). A contig of the sequenced clones showed that all five originated from a single cDNA.
The cloned U1–U4 amplimer was used to probe ~3×105 recombinant plaques in a cDNA library. Two hybridizing plaques were isolated and recombinant plasmids excised. The sequence of the insert in both plasmids ended at the same nucleotide roughly mid-way between U2 and U3 (Fig. 1) and was identical over 1370bp until a 1bp difference in the length of the 3′ poly(A) tract (30bp or 31bp). The sequence at the 5′ end matched that of the U1–U4 amplimer over 282bp.
When further library screening failed to detect longer cDNA clones, two nested reverse primers (PuGmCA1-R2 and PuGmCA1-R1; Supplementary Table S2) were designed from the U1–U2 sequence, and used in 5′ RACE. Two bands of ~1.3kb resulted when primers PuGmCA1-R1 and CapFind_Prim were used to PCR amplify from cDNA generated using primers PuGmCA1-R2 and CapFind_A (Supplementary Fig. S1A at JXB online). Ten of 13 clones examined from the combined bands had inserts of 1341bp and three had 1210bp inserts. The sequences were identical except for additional upstream sequence in the larger clone. This indicated that two alternative transcriptional initiation sites existed (Fig. 1). The 3′ ends of these RACE fragments had 63bp that was identical to the U1–U2 sequence.
The 3′ RACE technique was performed on G. monilis cDNA made using the 3′ RACE primer PolyAfind_A, amplifying with the internal forward primer GmCA1-F2 and the primer PolyAfind_Prim1 (Supplementary Table S2). This generated two amplimer bands (Supplementary Fig. S1B), which were cloned together. Sequencing four clones revealed four different polyadenylation sites at nucleotides 3239, 3256, 3311, and 3539 of the cDNA sequence (Fig. 1).
The sequence of a full-length cDNA of 3539bp that was the consensus of all clones identified in Fig. 1 was predicted. It contained a potential ORF of 2610bp beginning at the first ATG codon. Upstream of this are three in-frame stop codons in the 366bp 5′ untranslated region (UTR), two of them in the 131bp region unique to the longer 5′ RACE product. A BLASTN search with this sequence returned many CESAs, the most similar being the P. yezoensis sequence, followed by CESA sequences from species of the oomycete Phytophthora. Similarities between the two red algal nucleotide sequences extended into the 5′ UTRs where the P. yezoensis sequence terminated 286bp downstream of the G. monilis terminus (Supplementary Fig. S2 at JXB online).
The lack of introns in the coding region of the gene was shown by PCR amplifying from genomic DNA a 2.6kb sequence containing the entire ORF with no insertions (primers GmCA-ORF-F and GmCA0ORF-R in Supplementary Table S2; ORF PCR in Fig. 1). Southern analysis with this as probe revealed bands consistent with the predicted number and position of cutting sites including an internal 981bp band after digestion with PstI (Fig. 2). Additional, fainter bands that occurred in some lanes from enzymes having no internal site (HindIII, XhoI, and BamHI) may indicate weakly related genes or pseudogenes.
There are seven potential in-frame start codons upstream of the first predicted transmembrane domain in the G. monilis CESA (Fig. 1, and Supplementary Fig. S2 at JXB online) giving a potential protein of up to 870 amino acids. In choosing between possible translational start sites, BLASTP searches were made with conceptual translations beginning from the first potential start codon at the 5′ end. These identified a conserved domain (residues 94–166) belonging to the category exemplified by the glycogen-binding region of the β-subunit of the AMP-activated protein kinase of rats and also seen in some higher plant protein kinases, starch-binding proteins, and in enzymes synthesizing or metabolizing α-1,4 glucans. The latter include a putative glycogen synthase encoded by a G. japonica expressed sequence tag (Fig. 3). Such CBMs show wide taxonomic distribution in prokaryotes and eukaryotes and are assigned to family 48 in the classification of CBMs at CAZy (www.cazy.org; Cantarel et al., 2008; Christiansen et al., 2009). The overall sequence conservation shown by the full set of CBM48s is not high, but the G. monilis sequence shows many of the conserved residues highlighted by Christiansen et al. (2009; their Fig. 1). Sequences of CBM48s are quite distinct from those of known cellulose-binding modules, making it very unlikely that the CESA CBM48 binds a nascent cellulose chain (a β-1,4 glucan) before it exits the cytoplasm. Such a CBM48 is also present in the P. yezoensis CESA if it is translated from one of the potential start codons upstream of the one chosen by Roberts and Roberts (2009) that lies within the CBM48 (Fig. 4).
A homology model of the G. monilis CESA CBM48 could be constructed based on the X-ray structure of the glycogen-binding domain of the AMP-activated protein kinase from rat (pdb code: 1Z0N, normalized Z-score: 19.32) (Fig. 5). Some of the other protein sequences classified as CBM48s by Christiansen et al. (2009) (Zea AKIN1, G. japonica putative glycogen synthase, Arabidopsis SEX4) also gave similar predicted structures (Fig. 5). Some structural similarity (Z-score: 8.89) to the CBM20 family that contains other starch-binding domains (Christiansen et al., 2009) did not lead to satisfactory models. Superposition of the overall 3D structures, the cyclodextrin-binding motif of the model, and the X-ray structure showed an excellent steric fit (Figs 5, ,6).6). However, despite the good overall similarity between protein and homology models, the physicochemical properties of some potentially critical amino acids in the G. monilis CESA homology model and, to a lesser degree, in the SEX4 starch-binding protein, differed markedly (Figs 3, ,6)6) from those present in the rat AMP-activated kinase. They included some suggested to be directly involved in binding (Polekhina et al., 2005).
The remaining 11 sequences listed as CBM48s by Christiansen et al. (2009; see their Fig. 1) had significant amino acid insertions compared with those shown here to give a good overall fit with the structure of the rat protein kinase. Their predicted structures did not closely resemble structure 1Z0N but resembled other experimentally determined structures—the N-terminal domain of Escherichia coli branching enzyme (structure 1M7X); Sulfolobus solfataricus glycosyltrehalose trehalohydrolase (1EH9); Klebsiella pneumoniae pullulanase (2FHF); Pseudomonas isoamylase (1BF2); and Bacillus acidopullulyticus pullulanase (2WAN) (data not shown).
The CESA gene in G. monilis encodes a family 48 CBM in the N-terminal domain of the predicted protein. Such modules have been found only in enzymes that metabolize polysaccharides containing α-1,4 glucans (starch, glycogen, or pullulan) or in protein kinases that are part of sugar-sensing networks (see listing at the CAZy database). Specifically, they have not been found in an enzyme such as cellulose synthase that makes β-1,4 glucans, raising many questions about evolution and function.
The use of 5′ and 3′ RACE generated extensive UTR sequence and demonstrated two alternative transcription start sites and at least four alternative polyadenylation sites. The three in-frame stop codons in the 5′ UTR showed that the upstream limit of translation was reached. The nucleotide sequence shows high similarity to the CESA gene from P. yezoensis (Rhodophyta, Bangiophyceae; Roberts and Roberts, 2009; Supplementary Fig. S2 at JXB online). Both CESA-coding regions lacked introns, as did almost all genes in the Cyanidioschyzon merolae genome (Rhodophyta, Bangiophyceae) (Matsuzaki et al., 2004). Southern analysis of G. monilis suggests a single copy with perhaps some pseudogenes or weakly related genes, although no evidence for other genes emerged when using the degenerate primers or library screening. Roberts and Roberts (2009) also reported a single CESA and documented two pseudogenes in P. yezoensis, a contrast with the multiple CESAs invariably found in higher plants. A requirement for several different CESAs to make a functional cellulose synthase may be a feature of organisms with rosette terminal complexes (Somerville, 2006) that many, perhaps all, of the diverse organisms having linear terminal complexes lack. Since higher plants also employ different CESAs to make primary and secondary wall celluloses (Somerville, 2006), the absence of secondary walls in most Rhodophyta (Martone et al., 2009) removes a further driver for genomes to encode multiple CESAs.
BLASTP searches with conceptual translations beginning from the first of seven potential start codons in G. monilis showed a CBM48 between residues 94 and 166 that would be part of the protein if translation began from any of the first six ATGs. The improbability of a nucleotide sequence capable of encoding this CBM being conserved in evolution if it were part of the 5′ UTR allowed the seventh ATG to be excluded as the translation start site (Fig. 1). Further support for the importance of the CBM comes from re-examining the P. yezoensis nucleotide sequence (EU279853.1) since the CESA protein it encodes would also contain a complete family 48 CBM if translation began at one of three start codons upstream of the one that Roberts and Roberts (2009) chose to start the ORF (Fig. 4). Their choice was based on nucleotide context around the fourth ATG, although a recent genomic analysis of C. merolae (Rhodophyta, Bangiophyceae) found comparatively little bias in the identities of nucleotides immediately up- and downstream of start codons (Nakagawa et al., 2008). Using the highest contextual biases observed in C. merolae (at the minus 1 and minus 3 positions; Nakagawa et al., 2008), the most favoured ATG codons in the G. monilis sequence are numbers one, four and five, and number one in the P. yezoensis sequence. All would produce a protein with the complete CBM. The weaker similarity between the potential amino acid sequences on the N-terminal side of the CBM in the two red algae than in the CBM (Fig. 4) and in the catalytic domain (Supplementary Fig. S3 at JXB online), and the lack of resemblance to N-terminal sequences in CESAs from other taxa provide no further clues to choose between the other ATG codons. For reasons of simplicity, the first ATG in both proteins has been favoured here, but it is recognized that protein sequencing will be required to settle the matter definitively.
The CAZy database lists >2000 examples of family 48 CBMs drawn from Archaea (38), Bacteria (1921), and Eukaryota (307). The majority occur in enzymes that metabolize polysaccharides with α-1,4-linked glucose residues such as starch, glycogen, or pullulan, but CBM48s also occur in many eukaryotic protein kinases. Rhodophyte CESAs seem to be the first example of an enzyme that contains such a module that is not a protein kinase or involved in metabolizing α-1,4 glucans.
The extent to which the CBM48 that occurs in rhodophyte CESAs resembles other CBM48s in amino acid sequence and predicted 3D structure was explored. Table 1 of Christiansen et al. (2009) compares the sequences of members of the evolutionarily related CBM families 20, 21, 48, and 53. There is obvious sequence heterogeneity (most obviously seen in some substantial inserts) within the 16 aligned CBM48s that comprise regulatory subunits of protein kinases and various enzymes metabolizing α-1,4 glucans. According to the present homology models, the CBM48 of the G. monilis CESA closely resembles the first five members of the CBM48 family listed by Christiansen et al. (2009). These comprise three protein kinases (the β-subunit of the AMP-activated protein kinase of Rattus norvegicus, AKIN-β-γ-1 of Zea mays, and the β1 regulatory subunit of the SNF-related kinase from Oryza sativa) and two glucan-metabolizing enzymes (SEX4, the starch excess 4 protein of Arabidopsis thaliana, and the putative glycogen synthase from G. japonica). All have the glycogen-binding domain of the β-subunit of the AMP-activated protein kinase of rat as their closest 3D homologue (structure 1Z0N).
Although the structural models support the view that the CESA CBM48 is similar to several other CBM48s, there are non-conservative substitutions in several of the residues that Polekhina et al. (2005) suggest are involved in carbohydrate binding in the glycogen-binding domain of the rat AMP-activated protein kinase (pdb code: 1Z0N) (see their Fig. 6). Some of the mainly hydrophobic and slightly polar residues of the cyclodextrin-binding motif in 1Z0N are mutated to acidic and basic amino acids in the G. monilis CESA as judged by superposition of models (Fig. 6) or by sequence alignments (Fig. 3). Less extensive substitutions of those key residues occurs in other CBM48s. For example, both tryptophan residues (W100 and W133 in the rat kinase) are fully conserved in the other sequences, whereas CESA has a conservative valine substitution at one but replaces the other with an acidic glutamate. Such substitutions raise doubts about the ability of the CESA CBM48 to bind glycans but, partially counteracting these doubts, the Arabidopsis SEX4 protein binds starch (Kerk et al., 2006) in spite of some non-conservative substitutions (Figs 3, ,6).6). An experimental demonstration that the CESA CBM48 shows glycan-binding activity is required to settle the issue of binding. Interestingly, in the context of a CBM48, Rhodophyta have cytosolic rather than plastidic starch (Viola et al., 2001; Deschamps et al., 2008), making it potentially possible for α-1,4-glucan chains to bind to the CESA CBM48 that is predicted to lie on the cytoplasmic face of the plasma membrane.
The presence of a CBM48 in CESAs from members of both the Bangiophyceae and Florideophyceae points to strong conservation over an extended period of evolution since representatives of the two families have been distinguished in the fossil record of late pre-Cambrian strata (Saunders and Hommersand, 2004). The sequenced genome (Matsuzaki et al., 2004) of Cyanidioschyzon merolae (Rhodophyta, Bangiophyceae) does not shed further light on whether CESAs and perhaps other wall polysaccharide synthases contain CBM48s in red algae because it is wall-less. Interestingly, however, none of its limited set of protein kinases contains a CBM48, leaving it without some major components of the sugar-sensing pathways that regulate carbohydrate metabolism in many other eukaryotes. If the absence of protein kinases with CBM48s is widespread in the Rhodophyta, sugar sensing through CBM48s may occur on some individual carbohydrate-synthesizing enzymes including CESAs, and perhaps modulate their activity in response to cellular carbohydrate status.
In conclusion, the CESA of G. monilis has been described and it was shown that it (and probably the P. yezoensis CESA) contains a potential CBM48. Although showing a predicted 3D structure similar to that of other CBM48s, substitutions at key residues leave doubts about its glycan-binding ability and thus a potential role in regulating cellulose synthase.
Supplementary data are available at JXB online.
Figure S1. DNA fragments amplified from G. monilis cDNA by rapid amplification of cDNA ends (RACE) procedures.
Figure S2. Comparison of nucleotide sequences encoding CESAs from G. monilis (GmCESA1) and P. yezoensis (PyCESA1).
Figure S3. Comparison of protein sequences for CESAs from G. monilis and P. yezoensis.
Table S1. CesA proteins used to design degenerate primers for PCR with G. monilis cDNA.
Table S2. Sequences of the six degenerate primers and of other primers used.
We thank the Australian Research Council for support through the Linkage Program (LP0669276) to REW and TA, and Dr Spencer Whitney for supplying the G. monilis culture and advice on purifying genomic DNA from it.