The rumen harbors a complex community of microbes which catalyzes the depolymerization of cellulose and hemicellulose to soluble sugars which are then fermented to short-chain fatty acids. These end products of fermentation are utilized by the mammalian host as the major source of energy (
3,
41). Among the bacterial genera in the rumen,
Prevotella has been reported to be one of the most numerous (
8,
60).
P. ruminicola 23 is one of the most commonly isolated species (
8) from rumen contents and possesses the capacity to degrade plant cell wall polysaccharides, proteins, and peptides (
61). Therefore, this species is thought to play an important role in the utilization of xylans and other hemicelluloses within the rumen.
Two xylan-hydrolyzing enzymes have been previously characterized from
P. ruminicola: a bifunctional GH family 5 xylanase/endoglucanase cloned from the type strain,
P. ruminicola 23
T (
70), and a GH family 10 xylanase with an interrupted catalytic domain cloned from the related strain
P. ruminicola D31d (
71). The recently sequenced genome for
P. ruminicola 23 harbors genes corresponding to these two previously characterized xylanases (data not shown) and also harbors two additional putative xylanase genes that were not previously characterized. One of these genes was annotated as a bifunctional xylanase-esterase (designated
xyn10D-fae1A). To provide further insight into the molecular basis for xylan depolymerization by
P. ruminicola 23, we cloned this gene and characterized the recombinant protein. Our results show that
xyn10D-fae1A encodes a bifunctional GH family 10 xylanase-ferulic acid esterase. Furthermore, directed mutagenesis studies mapped the two catalytic functionalities to distinct regions on the polypeptide. A unique result that was obtained during the course of these experiments was that mutation of the putative nucleophile for the xylanase domain (E280S) led to an approximately threefold increase in ferulic acid esterase activity. While a clear explanation for these data will require further biophysical experiments, these results indicate that the xylanase and esterase domains may be functionally coupled.
The recently sequenced genomes for
Bacteroides intestinalis DSM 17393 (GenBank accession no. EDV07678.1) and
Bacteroides eggerthii DSM 20697 (GenBank accession no. ABVO01000038.1, nucleotides 298769 to 296631) harbor the only putative proteins in the GenBank database with significant sequence identity (64% for both) across the entire protein sequence for Xyn10D-Fae1A. Two genes from
Clostridium thermocellum (
xynY and
xynZ) encode proteins with both GH family 10 xylanase (
26,
31) and ferulic acid esterase (
6) activities; however, in addition, the clostridial proteins have carbohydrate binding modules and dockerin domains. Despite the presence of these accessory domains in
C. thermocellum XynY and XynZ, the natural occurrence of the GH 10 xylanase and CE 1 ferulic acid esterase domains within a single polypeptide sequence from different organisms suggests that this linkage may impart a selective advantage for xylan degradation over the two enzymatic functionalities in isolation. Another rumen bacterium,
Ruminococcus flavefaciens 17, harbors a gene predicted to encode a protein with GH 11 xylanase and CE 1 esterase domains (
1); however, the biochemical properties of this protein have not been characterized. Thus, it is not clear whether the CE 1 domain possesses ferulic acid esterase activity, as is the case for Xyn10D-Fae1A.
A number of different exoglycosidase functions have been identified for GH family 3 enzymes, including β-
d-glucosidase, β-
d-xylosidase, α-
l-arabinofuranosidase, and
N-acetyl-β-
d-glucosaminidase (
21). When we cloned the putative β-
d-glucosidase gene (ORF02829) and characterized the recombinant protein, we found that it releases
pNP from
pNPX and
pNP-α-
l-arabinofuranoside but not
pNP-β-
d-glucopyranoside. The hydrolysis of both
pNPX and
pNP-α-
l-arabinofuranoside (Xyl/Ara) has been reported for some members of GH family 3 (
20,
46,
51,
74). With one exception (
68), all of the characterized family 3 β-
d-xylosidases possess substrate specificity for Xyl/Ara and do not possess β-
d-glucosidase activity. The crystal structure of a family 3 β-
d-glucosidase from barley (
H. vulgare) revealed that Asp120 forms a hydrogen bond to the 6′OH group of the glucose substrate bound at the −1 subsite (
36). Those authors found that Asp120 was highly conserved among biochemically defined family 3 β-
d-glucosidases and also observed that the Xyl/Ara-active enzymes did not possess this conserved aspartate residue (
36). The pentose sugars (β-
d-xylopyranose and α-
l-arabinofuranose) do not possess the additional CH
2OH substituent from C-5 that is present in glucose; thus, it is possible that the larger glutamate residue could function to discriminate between hexoses and pentoses on the basis of steric interactions. It is more difficult to explain why the structurally distinct six-membered ring of xylopyranose and the five-membered ring of arabinofuranose are permitted into the active site (
36,
44). To identify a primary amino acid sequence motif that might distinguish the Xyl/Ara-active enzymes from the glucose-active enzymes, we constructed amino acid sequence alignments for biochemically defined members of these two distinct groups. These alignments revealed a conserved motif (WWSEAL) for the Xyl/Ara enzymes that was not found in the β-
d-glucosidase enzymes (data not shown). This motif may be useful for distinguishing between GH family 3 β-
d-glucosidases and Xyl/Ara-active enzymes based solely on primary sequence analysis.
Xyl3A possesses a PA14 domain that forms an insertion within the C-terminal GH family 3 domain (Table ). The precise function for the PA14 domain remains poorly defined; however, its presence in bacterial, archaeal, and eukaryotic proteins, including glycosidases, glycosyltransferases, proteases, amidases, adhesins, and bacterial toxins (
57), and its predominantly β-sheet structure (
56) suggest a carbohydrate-binding function. Alignment of the amino acid sequences for Xyl3A with other GH family 3 β-
d-xylosidases/α-
l-arabinofuranosidases suggests that the PA14 domain may be inserted within the C-terminal domain (data not shown). The predicted proximity of this region to the active site supports the possibility that the PA14 domain may aid in binding to oligosaccharide substrates. Thus, insertion of the PA14 domain in this region could influence the substrate specificity for the enzyme. A recent report identified a loop within the PA14 domain of two fungal adhesins from
Candida glabrata that is responsible for determining the binding specificity (
14,
75) to eukaryotic membrane-anchored glycoproteins. The amino acid sequence for the PA14 domain of Xyl3A aligns with the ligand specificity determinants for the fungal cell surface adhesins Epa6 and Epa7 (data not shown), which supports the possibility that the PA14 domain could facilitate anchoring of Xyl3A to sugars. Identification of the true functional role for the PA14 domain within Xyl3A will require further studies.
There are a number of residues that have been shown to make hydrogen bond contacts to hydroxyl groups of the glycosyl substrate within the active site for the β-
d-glucan glucohydrolase (ExoI) from
H. vulgare (
65), and directed-mutagenesis studies in the β-glucosidase from
Flavobacterium meningosepticum have confirmed the importance of these residues for catalysis (
49). We identified analogous residues in Xyl3A by amino acid sequence alignments, and based on these alignments, we constructed site-directed mutants that are predicted to have important roles in catalysis (residues indicated in Table ). The significant attenuation in activity for the mutants tested in this study (Table ) provides support for the classification of Xyl3A as a family 3 GH.
In the gene cluster identified in this study, there is a protein just upstream of
xyl3A that was annotated as a hypothetical protein. Based on the short intergenic space (31 nucleotides) between this gene and
xyl3A, it is possible that a single polycistronic mRNA may encode these two gene products. Comparison of the hypothetical protein with other proteins in the GenBank database revealed that this putative protein shares significant sequence identity (30%) with an α-1,2-fucosidase from
Bifidobacterium bifidum (
39,
54). It is unclear whether this protein may function synergistically with Xyn10D-Fae1A and Xyl3A to hydrolyze xylan; however, studies of the substrate specificity of this protein are under way in our laboratory.