Directed evolution and biodiversity mining of environmental DNA are two powerful approaches in enzyme discovery. Directed evolution is an established method for generating enzymes with improved specific activities for industrial, research, and therapeutic applications
1–5. It consists of generating large numbers of protein variants, then assaying those variants
en masse for the desired function. Biodiversity mining exploits the diversity found in unculturable microorganisms, which represents >99% of microorganisms
6, to find novel enzymes. It consists of amplifying environmental DNA
in vitro, expressing the genes in a model organism and testing them
en masse for the desired function. Biodiversity mining has also been successfully applied for the discovery of industrial, research and therapeutic enzymes
7. Directed evolution and biodiversity mining are limited to functional enzyme reactions that are inherently screenable or selectable, such as reactions where the product is fluorescent or an essential metabolite. To address this bottleneck we developed chemical complementation, a general high-throughput assay for enzyme catalysts that relies on the yeast three-hybrid assay to link enzyme catalysis to reporter gene transcription
in vivo. This assay has been previously applied to screen for bond cleavage reactions using a beta-lactamase enzyme and
lacZ as the reporter gene
8. More recently, chemical complementation was adapted to select for bond formation reactions using a glycosynthase enzyme and a
LEU2 reporter gene essential for cell survival
9.
One of the major bottlenecks to cost-competitive conversion of biomass to fermentable sugars for biofuel production is the high enzyme cost for cellulose degradation
10. Technological improvements in the breakdown of cellulose into sugars is often cited as one of the solutions to reducing cost
10. Cellulose, a β-1,4-glucose polymer, represents 38–50% of the total cellulosic biomass composition
11. Cellulose is hydrolyzed into glucose units by an ensemble of three enzymes: endoglucanases, also called cellulases [E.C. 3.2.1.4], exoglucanases [E.C.3.2.1.91], and glucohydrolases [E.C. 3.2.1.21]. Glucose can then be fermented to ethanol by microorganisms. Since the rate-limiting step in enzymatic cellulose depolymerization is performed by exoglucanases and endoglucanases
12, discovery of improved cellulases likely is a straightforward solution to decreasing costs.
Currently, the discovery of improved endoglucanases and other glycosylhydrolase enzymes important for biomass conversion is based on medium-throughput screening technologies that rely on surrogate substrates
12, 13. The two most widely used medium-throughput assays for endoglucanase activity are a halo assay carried out on petri dishes, which detects hydrolysis of carboxymethylcellulose (CMC) based on Congo red staining of carbohydrate reducing ends, and a UV assay, which detects hydrolysis of
p-nitrophenyl cellobioside (pNPC)
12. Because each gene in a screen must be assayed individually, even with automatation techniques only 10
4–10
6 genes can be tested, at best. A selection for cellulase catalysts would allow the search of much larger cellulase catalysts libraries (>10
8) because only cells expressing active enzyme variants survive. The problem is that cellulase catalysis is not inherently selectable.
Here, using chemical complementation, we have developed a
URA3 counter selection for cellulose activity. To our knowledge, this is the first high-throughput selection for cellulase catalysts and should significantly increase the number of variants that can be tested over medium-throughput screens. The method was developed using the endoglucanase Cel7B from
Humicola insolens that catalyzes the hydrolysis of beta-1,4-linked glucosidic bonds in cellulose. First, a
URA3 counter selection was adapted to link chemical dimerizer activated gene transcription to cell death. The classic yeast
URA3 5-fluoroorotic acid counter selection was chosen as the reporter gene because it is the state of the art counter selection in the two hybrid literature. Although the
URA3 counter selection has been previously applied in an n-hybrid system
14, 15, re-engineering of a chemical complementation
URA3 counter selection strain was necessary as previously published strains were either not available or did not meet the parameters required by the chemical complementation strategy. Next, the
URA3 counter selection was shown to detect cellulase activity based on cleavage of a tetrasaccharide chemical dimerizer substrate and decrease expression of the toxic
URA3 reporter. Finally, the utility of the cellulose selection was assessed by isolating cellulases with improved activity from a cellulase family DNA shuffled library.