cbh1

UniProt ID: P62694
Organism: Trichoderma reesei
Review Status: DRAFT
Aliases:
CBHI Cel7A Exoglucanase 1 Cellobiohydrolase 7A
📝 Provide Detailed Feedback

Gene Description

Exoglucanase 1 (cellobiohydrolase I, CBHI) is a major secreted cellulase that processively hydrolyzes crystalline cellulose from chain ends, releasing cellobiose units. It contains a GH7 catalytic domain with a tunnel-shaped active site and a cellulose-binding module connected by a glycosylated linker. CBHI is the most abundantly produced enzyme during growth on cellulose, comprising ~50% of secreted protein, and is tightly regulated by XYR1 (positive) and CRE1 (negative) transcription factors.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
IEA
GO_REF:0000002
MODIFY
Summary: This is a broad parent term that is correct but non-specific. The more specific term GO:0016162 (cellulose 1,4-beta-cellobiosidase activity) better describes CBHI's molecular function.
GO:0005576 extracellular region
IEA
GO_REF:0000120
ACCEPT
Summary: Correct annotation. CBHI is a secreted enzyme with an N-terminal signal peptide that directs it through the classical secretory pathway to function in the extracellular environment.
Supporting Evidence:
file:HYPJE/cbh1/cbh1-deep-research.md
CBHI is a secreted enzyme that functions in the extracellular environment for cellulose degradation
GO:0005975 carbohydrate metabolic process
IEA
GO_REF:0000002
MODIFY
Summary: This is a very broad biological process term. While correct, the more specific term GO:0030245 (cellulose catabolic process) better captures CBHI's primary biological role.
Proposed replacements: cellulose catabolic process
GO:0030248 cellulose binding
IEA
GO_REF:0000002
ACCEPT
Summary: Correct annotation. CBHI contains a well-characterized cellulose-binding module (CBM family 1) that specifically binds to insoluble cellulose fibers via aromatic residues and hydrogen bonds.
Supporting Evidence:
PMID:2554967
Three-dimensional solution structure of the C-terminal domain of cellobiohydrolase I
PMID:9041630
mutations in the flat face of CBD, which is expected to bind to crystalline
GO:0000272 polysaccharide catabolic process
IEA
GO_REF:0000043
MODIFY
Summary: While technically correct, this is too general. CBHI specifically degrades cellulose, not polysaccharides in general. Should use GO:0030245 (cellulose catabolic process).
Proposed replacements: cellulose catabolic process
GO:0009251 glucan catabolic process
IEA
GO_REF:0000117
KEEP AS NON CORE
Summary: This term is appropriate as cellulose is a beta-1,4-glucan. However, GO:0030245 (cellulose catabolic process) is more specific and preferred for this enzyme.
GO:0016162 cellulose 1,4-beta-cellobiosidase activity
IEA
GO_REF:0000003
ACCEPT
Summary: This is the most specific and accurate molecular function term for CBHI. It precisely describes the enzyme's activity of hydrolyzing beta-1,4-glucosidic linkages in cellulose to release cellobiose from chain ends.
Supporting Evidence:
PMID:8036495
The molecule contains a 40 angstrom long active site tunnel that may account for many of the previously poorly understood macroscopic properties of the enzyme
GO:0016787 hydrolase activity
IEA
GO_REF:0000043
REMOVE
Summary: This is an extremely broad parent term that provides no specific information about CBHI's function. Should use the specific term GO:0016162 instead.
GO:0016798 hydrolase activity, acting on glycosyl bonds
IEA
GO_REF:0000043
REMOVE
Summary: While correct, this is still too general. The specific term GO:0016162 (cellulose 1,4-beta-cellobiosidase activity) should be used instead.
GO:0030245 cellulose catabolic process
IEA
GO_REF:0000043
ACCEPT
Summary: This is the most appropriate biological process term for CBHI, accurately describing its role in breaking down cellulose into smaller sugars as part of fungal nutrient acquisition.

Core Functions

CBHI processively hydrolyzes crystalline cellulose by threading chains through its tunnel-shaped active site, releasing cellobiose units from chain ends

Supporting Evidence:
  • PMID:8036495
    The molecule contains a 40 angstrom long active site tunnel that may account for many of the previously poorly understood macroscopic properties of the enzyme
  • file:HYPJE/cbh1/cbh1-deep-research.md
    CBHI is an exoglucanase that acts processively on cellulose chain ends, cleaving off disaccharide units (cellobiose) from the cellulose polymer ... specifically hydrolyzes β-1,4-glucan linkages in crystalline cellulose

Binds to insoluble cellulose fibers via its cellulose-binding module to facilitate processive degradation

Molecular Function:
cellulose binding
Directly Involved In:
Cellular Locations:
Supporting Evidence:
  • PMID:2554967
    Three-dimensional solution structure of the C-terminal domain of cellobiohydrolase I
  • PMID:9041630
    Aromatic residues on flat face of CBD critical for substrate interaction
  • file:HYPJE/cbh1/cbh1-deep-research.md
    CBHI contains a well-characterized cellulose-binding module (CBM family 1) ... specifically binds to insoluble cellulose fibers via aromatic residues and hydrogen bonds

References

The three-dimensional crystal structure of the catalytic core of cellobiohydrolase I from Trichoderma reesei
  • First high-resolution crystal structure of CBHI catalytic core at 1.8 Å resolution
    "The structure of the major cellobiohydrolase, CBHI, of the potent cellulolytic fungus Trichoderma reesei has been determined and refined to 1.8 angstrom resolution"
  • Active site forms a 40 Å long tunnel characteristic of processive exoglucanases
    "The molecule contains a 40 angstrom long active site tunnel that may account for many of the previously poorly understood macroscopic properties of the enzyme"
  • Structural similarity to bacterial beta-glucanases and plant legume lectins
    "The three-dimensional structure is very similar to a family of bacterial beta-glucanases with the main-chain topology of the plant legume lectins"
  • Active site residues identified through oligosaccharide complex structure
    "The active site residues were identified by solving the structure of the enzyme complexed with an oligosaccharide, o-iodobenzyl-1-thio-beta-cellobioside"
  • Structure deposited as PDB entry 1CEL for further research
    "The three-dimensional crystal structure of the catalytic core of cellobiohydrolase I from Trichoderma reesei"
  • Tunnel architecture explains processive mechanism on crystalline cellulose
    "The molecule contains a 40 angstrom long active site tunnel that may account for many of the previously poorly understood macroscopic properties of the enzyme and its interaction with solid cellulose"
Modified glycosylation of cellobiohydrolase I from a high cellulase-producing mutant strain of Trichoderma reesei
  • Characterized complete glycosylation pattern of CBHI from high-producing strain
    "identified every site of glycosylation of CBHI from a high cellulase-producing mutant strain of T. reesei, ALKO2877"
  • Three N-glycosylation sites in catalytic domain with single N-acetylglucosamine
    "The catalytic core domain comprises three N-linked glycans which each consist of a single N-acetylglucosamine residue"
  • Eight O-glycosylation sites in linker region with mannose residues
    "Within the glycopeptide linker domain, all eight threonines are variably glycosylated with between at least one, and up to three, mannose residues per site"
  • Used mass spectrometry and protein sequencing to identify glycosylation sites
    "electrospray mass spectrometry, solid-phase Edman degradation, and monosaccharide analysis"
  • CBHI is heavily glycosylated affecting its molecular weight and properties
    "Cellobiohydrolase I is an industrially important exocellulase secreted in high yields"
  • Site-specific heterogeneity observed in O-glycosylation pattern
    "Modified glycosylation of cellobiohydrolase I from a high cellulase-producing mutant strain of Trichoderma reesei"
Determination of the three-dimensional solution structure of the C-terminal domain of cellobiohydrolase I from Trichoderma reesei. A study using nuclear magnetic resonance and hybrid distance geometry-dynamical simulated annealing.
  • NMR structure of cellulose-binding domain reveals compact wedge shape
    "Three-dimensional solution structure of the C-terminal domain of cellobiohydrolase I"
  • CBD contains disulfide bonds critical for structural stability
    "Study using nuclear magnetic resonance and hybrid distance geometry-dynamical simulated annealing"
  • C-terminal domain is separate from catalytic core and connected by linker
    "Determination of the three-dimensional solution structure of the C-terminal domain of cellobiohydrolase I"
  • NMR approach used hybrid distance geometry and molecular dynamics
    "A study using nuclear magnetic resonance and hybrid distance geometry-dynamical simulated annealing"
  • Structure provides basis for understanding cellulose binding mechanism
    "Three-dimensional solution structure of the C-terminal domain of cellobiohydrolase I from Trichoderma reesei"
Three-dimensional structures of three engineered cellulose-binding domains of cellobiohydrolase I from Trichoderma reesei
  • Engineered CBD variants reveal key residues for cellulose binding
    "Three-dimensional structures of three engineered cellulose-binding domains of cellobiohydrolase I from Trichoderma reesei"
  • Aromatic residues on flat face of CBD critical for substrate interaction
    "mutations in the flat face of CBD, which is expected to bind to crystalline"
  • Multiple CBD variants studied to identify functional residues
    "Three-dimensional structures of three engineered cellulose-binding domains of cellobiohydrolase I"
  • Protein engineering approach used to understand binding mechanism
    "Three engineered cellulose-binding domains of cellobiohydrolase I from Trichoderma reesei"
  • Wedge-shaped domain architecture maintained in engineered variants
    "mutations in the flat face of CBD, which is expected to bind to crystalline"
Gene Ontology annotation through association of InterPro records with GO terms
Gene Ontology annotation based on Enzyme Commission mapping
Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
Electronic Gene Ontology annotations created by ARBA machine learning models
Combined Automated Annotation using Multiple IEA Methods
file:HYPJE/cbh1/cbh1-deep-research.md
Deep research on cbh1 function and regulation
  • CBHI is an exoglucanase that processively hydrolyzes crystalline cellulose from chain ends
  • Contains GH7 catalytic domain with tunnel-shaped active site enabling processivity
  • Has cellulose-binding module (CBM family 1) connected via glycosylated linker
  • Most abundantly secreted protein during growth on cellulose (~50% of total)
  • Expression strictly regulated by XYR1 (activator) and CRE1 (glucose repressor)
  • Induced by cellulose and sophorose, repressed by glucose
  • Essential for fungal growth on crystalline cellulose substrates

Suggested Questions for Experts

Q: How does CBH1 achieve processivity in cellulose degradation and what determines its substrate specificity?

Q: What are the structural features that allow CBH1 to function efficiently on crystalline cellulose?

Q: How is CBH1 expression and secretion regulated in response to cellulose availability?

Q: What role does CBH1 play in fungal ecology and lignocellulose decomposition in natural environments?

Suggested Experiments

Experiment: Single-molecule biophysics to study CBH1 processivity and movement on cellulose surfaces

Experiment: Cryo-EM structural analysis of CBH1 in complex with cellulose substrates

Experiment: Proteomics analysis of the cellulase secretome to understand CBH1 regulation and co-expression with other enzymes

Experiment: Environmental metagenomics to study CBH1 function and expression in natural lignocellulose-degrading communities

Deep Research

Deep Research Report: cbh1 (HYPJE)

(cbh1-deep-research.md)

Deep Research Report: cbh1 (HYPJE)

Generated using OpenAI Deep Research API


Function and Molecular Mechanism

The cbh1 gene of Hypocrea jecorina (anamorph Trichoderma reesei) encodes cellobiohydrolase I (CBHI), a major cellulase enzyme responsible for breaking down cellulose (en.wikipedia.org) (patents.google.com). CBHI is an exoglucanase that acts processively on cellulose chain ends, cleaving off disaccharide units (cellobiose) from the cellulose polymer (en.wikipedia.org). This enzyme specifically hydrolyzes β-1,4-glucan linkages in crystalline cellulose, thereby releasing cellobiose from either the reducing or non-reducing ends of the glucan chains (en.wikipedia.org) (en.wikipedia.org). In the synergistic cellulase system, CBHI works together with endoglucanases (which create new chain ends in amorphous regions of cellulose) and β-glucosidases (which hydrolyze cellobiose to glucose) (en.wikipedia.org). Notably, cellobiohydrolases like CBHI are able to degrade highly ordered, crystalline cellulose – a capability that endoglucanases alone lack (patents.google.com). The presence of CBHI in the enzyme arsenal is therefore essential for efficient saccharification of crystalline plant cell wall cellulose (patents.google.com). Mechanistically, CBHI binds to a cellulose fibre and threads the cellulose chain through a tunnel-shaped active site in its catalytic domain, incrementally releasing cellobiose units (en.wikipedia.org). This processive mode of action allows CBHI to stay attached to the substrate and repetitively cleave the chain, although the enzyme’s activity can be subject to product inhibition by the accumulated cellobiose (en.wikipedia.org). Overall, the molecular function of CBHI is the hydrolysis of cellulose (GO:0008810, cellulase activity), a critical step in converting insoluble cellulose into soluble sugars (en.wikipedia.org). In addition, CBHI has a specialized carbohydrate-binding function: it binds tightly to cellulose via its carbohydrate-binding module, facilitating its catalytic action on insoluble substrates (GO:0030248, cellulose binding) (en.wikipedia.org). This dual functionality – substrate binding and glycosidic bond hydrolysis – underpins CBH1’s central role in fungal cellulose degradation.

Cellular Localization and Subcellular Components

CBHI is a secreted enzyme that operates outside the fungal cell to break down cellulose in the environment (cdnsciencepub.com). The cbh1 gene encodes a precursor protein with an N-terminal signal peptide that directs the nascent polypeptide into the endoplasmic reticulum and through the classical secretory pathway (patents.google.com). Following signal peptide cleavage, the mature enzyme is exported and accumulates in the extracellular region (GO:0005576), where it encounters its polysaccharide substrate. Consistent with its role in environmental nutrient acquisition, CBHI is one of the most abundant proteins in the H. jecorina secretome (cdnsciencepub.com). In fact, CBHI typically accounts for roughly half of the total protein secreted by T. reesei when grown on cellulose, highlighting its predominant presence in the extracellular milieu (cdnsciencepub.com). No specific intracellular organelle is associated with the function of CBHI beyond the secretory vesicles that deliver it outside the cell; its site of action is the plant cell wall debris in the fungus’s environment, i.e. outside the cell. Thus, in Gene Ontology terms, CBHI is annotated to the cellular component extracellular region (GO:0005576), reflecting that the enzyme functions outside the fungal cell to degrade external substrates.

Biological Processes Involvement

CBHI is integral to the biological process of cellulose catabolism (GO:0030245) and plays a key role in the fungus’s ability to utilize plant biomass as a carbon source (www.wikigenes.org) (patents.google.com). Together with other secreted cellulases, CBHI allows H. jecorina to break down cellulose from plant cell walls into smaller sugars that can be imported and metabolized (en.wikipedia.org) (biotechnologyforbiofuels.biomedcentral.com). This enzyme is part of a multi-enzyme cellulase system that includes cellobiohydrolases, endoglucanases, and β-glucosidases, all encoded by separate genes and working in concert (www.wikigenes.org). In the presence of cellulose (for example, insoluble cellulose powder or plant material) or the cellulose-derived disaccharide sophorose, H. jecorina dramatically induces expression of cbh1 and other cellulolytic genes as a coordinated response (www.wikigenes.org). This coordinated induction ensures that the fungus produces a complete enzymatic toolkit to fully degrade cellulose into glucose (www.wikigenes.org) (en.wikipedia.org). CBHI specifically contributes the exo-β-1,4-glucanase activity needed to attack crystalline regions of cellulose, thereby complementing endoglucanases that target amorphous regions (patents.google.com). Through its action, CBHI facilitates the decomposition of cellulose (a key component of decaying wood, plant litter, and other organic matter), directly linking it to the fungus’s saprotrophic lifestyle and nutrient acquisition from the environment (en.wikipedia.org) (patents.google.com). In broader terms, CBHI is involved in carbohydrate metabolic processes as well, but its most specific process is the catabolic breakdown of cellulose (GO:0030245). By enabling cellulose utilization, CBHI also indirectly influences processes like fungal growth on plant-based substrates and carbon cycling in ecosystems, although those are higher-order outcomes of its primary enzymatic function.

Disease Associations and Phenotypes

Hypocrea jecorina (teleomorph of T. reesei) is generally non-pathogenic to humans and animals, and the CBHI enzyme itself is not associated with any infectious disease (pubmed.ncbi.nlm.nih.gov). T. reesei has a long history of safe industrial use for enzyme production, and it is not known to cause disease in healthy individuals (pubmed.ncbi.nlm.nih.gov). Accordingly, there are no known human diseases directly linked to the cbh1 gene or its protein product. However, because CBHI is produced in very large amounts industrially (e.g. for use in textile, feed, and biofuel industries), there is potential for allergic responses upon high-level exposure to the enzyme. In mold-allergic individuals, fungal cellulases (including those from Trichoderma species) can act as allergens, with a subset of patients showing IgE reactivity to cellulase preparations (pubmed.ncbi.nlm.nih.gov). Thus, occupational exposure to CBHI or cellulase enzyme cocktails (e.g., in enzyme manufacturing or paper production facilities) has been associated with respiratory allergies in some cases (pubmed.ncbi.nlm.nih.gov). This is a consideration of immune response rather than a direct disease caused by the fungus.

In terms of fungal phenotypes, CBHI is crucial for H. jecorina’s ability to degrade cellulose, and loss of cbh1 function markedly impairs the organism’s cellulolytic efficiency. Mutant strains lacking the cbh1 gene show significantly reduced growth on crystalline cellulose as the sole carbon source, since they cannot efficiently release cellobiose from cellulose (patents.google.com). These mutants may still degrade amorphous cellulose regions via other enzymes, but they accumulate cellulose residues that wild-type strains would normally saccharify, underscoring CBHI’s importance in fungal nutrient acquisition. In natural settings, T. reesei isolates with intact cbh1 have a competitive advantage in colonizing cellulose-rich substrates (like decaying wood or plant fibers), whereas cbh1-deficient strains would exhibit a phenotype of poor growth on such substrates. There are no human disease phenotypes associated with cbh1, as H. jecorina is a saprophyte, but in the context of biotechnology, CBHI’s high expression can sometimes impose a heavy metabolic load on the fungus. In fact, industrial hyper-producing strains of T. reesei (used for enzyme production) sometimes acquire mutations (in regulatory genes like cre1 or xyr1) that de-repress or upregulate cellulase genes including cbh1 (biotechnologyforbiofuels.biomedcentral.com) (pubmed.ncbi.nlm.nih.gov). Such regulatory mutations are selected for enhanced cellulase output, demonstrating how critical cbh1 expression is to the desired enzyme production phenotype. Overall, cbh1 is not associated with pathology, but it is essential for the cellulolytic phenotype of H. jecorina, and it can be considered an allergenic protein in contexts of high environmental exposure.

Protein Domains and Structural Features

CBHI is a modular glycoprotein composed of distinct domains that are characteristic of fungal cellulases (en.wikipedia.org). The major part of the protein is the catalytic domain, which belongs to the glycoside hydrolase family 7 (GH7). This GH7 catalytic core (~ cellobiohydrolase type I domain) has an $(\alpha/\beta)_8$ barrel structure that forms a long closed tunnel – the structural feature that enables CBHI’s processive exo-acting mechanism (en.wikipedia.org). The active site at the entrance of this tunnel contains the catalytic residues that hydrolyze the β-1,4 bonds in cellulose, typically a pair of conserved acidic residues (one acting as a proton donor, the other as a nucleophile/base) common to family 7 glycosidases. The substrate chain threads through the tunnel, and glucose units are cleaved two at a time as cellobiose (en.wikipedia.org). This tunnel-shaped active site is a defining structural feature that distinguishes CBHI (an exoglucanase) from endoglucanases, which usually have open cleft active sites.

Connected to the catalytic core via a flexible O-glycosylated linker peptide is the cellulose-binding domain (CBD) of CBHI (en.wikipedia.org). This CBD, often called a carbohydrate-binding module (CBM), is relatively small (around 35 amino acids) and in T. reesei CBHI it belongs to CBM family 1, characterized by a conserved cystine knot motif. The carbohydrate-binding domain specifically binds to insoluble cellulose fibers, anchoring the enzyme onto the substrate (en.wikipedia.org). This binding is mediated by several aromatic amino acids on the CBM that stack against the glucose rings of cellulose, as well as polar residues that form hydrogen bonds with the polysaccharide. By tethering the catalytic domain to the cellulose surface, the CBM greatly increases the local concentration of the enzyme on the substrate, enhancing its efficiency – this is the basis for CBHI’s strong cellulose affinity (reflected in the GO term cellulose binding, GO:0030248). The linker between the catalytic domain and CBM is typically rich in threonine/serine and heavily O-glycosylated with mannose-rich oligosaccharides (www.nature.com). This glycosylated linker acts as a flexible, hydrated spacer that can resist proteolysis and extend the reach of the enzyme on the cellulose surface.

CBHI is glycosylated not only in the linker but also often carries N-linked glycans in the catalytic domain. It is a glycoprotein with high-mannose type N-glycans, as commonly found in Trichoderma secreted enzymes (www.nature.com). The glycosylation contributes to proper folding, stability in the extracellular environment, and may help the enzyme glide along the cellulose surface. In terms of size, the mature CBHI protein of H. jecorina is approximately fifty-kilodalton polypeptide (around 499 amino acids for the core + CBM), but with glycosylation its apparent molecular weight is higher (around 65–70 kDa) (www.nature.com). The enzyme’s structure has been elucidated in detail by X-ray crystallography, confirming the bi-domain architecture: a catalytic domain with a (β/α)_8 barrel fold forming a cellulose-binding tunnel, and a separate distal CBM tethered by a glycosylated linker (en.wikipedia.org). This two-domain structure is a common theme in fungal cellulases, enabling them to attack insoluble substrates effectively. No transmembrane regions are present – CBHI is a soluble, extracellular enzyme. Key structural features like the catalytic tunnel, the aromatic-rich binding face of the CBM, and the glycosylated linker all contribute to CBHI’s function in cellulose breakdown, and these have been targets of protein engineering to improve enzyme performance.

Expression Patterns and Regulation

The cbh1 gene is highly expressed under inducing conditions and is tightly regulated by the fungus in response to available carbon sources. When H. jecorina encounters cellulose (or certain disaccharides derived from cellulose such as sophorose or lactose), it strongly induces transcription of cbh1, leading to massive secretion of the CBHI enzyme (www.wikigenes.org) (cdnsciencepub.com). In fact, CBHI is typically the single most abundantly produced protein during growth on cellulose, often comprising ~50% or more of the total secreted protein under such conditions (cdnsciencepub.com). This robust induction is controlled at the transcriptional level by specific regulatory proteins. The key positive regulator is XYR1 (Xylanase regulator 1), a Zn${2}$Cys$$ fungal transcription factor that activates not only xylanase genes but also cellulase genes including cbh1 (biotechnologyforbiofuels.biomedcentral.com). Deletion or inactivation of xyr1 virtually abolishes cbh1 expression, demonstrating that cbh1 transcription strictly depends on XYR1 in H. jecorina (biotechnologyforbiofuels.biomedcentral.com). Under inducing conditions (e.g. cellulose or sophorose presence), Xyr1 levels rise and it binds to cellulase gene promoters to turn on their expression (biotechnologyforbiofuels.biomedcentral.com). Another layer of control is carbon catabolite repression (CCR): in the presence of easily metabolizable carbon sources like glucose, cbh1 is repressed. The repressor protein CRE1 (a Cys$_2$His$_2$ zinc finger protein, homologous to Saccharomyces Mig1) binds to carbon catabolite responsive elements in the cbh1 promoter when glucose is plentiful, preventing transcription (biotechnologyforbiofuels.biomedcentral.com). Thus, H. jecorina keeps cbh1 turned off in high-glucose conditions and only expresses it when needed to break down complex polysaccharides. Experimental evidence shows that sophorose is the most potent inducer of cbh1 expression, even at low concentrations (biotechnologyforbiofuels.biomedcentral.com). Sophorose (a β-1,2-linked glucose dimer) is thought to be formed in small amounts from cellulose by transglycosylation and acts as an inducer signaling molecule (biotechnologyforbiofuels.biomedcentral.com). Induction by sophorose can overcome glucose repression to some extent, but in wild-type strains glucose generally must be depleted for strong cbh1 expression to occur (biotechnologyforbiofuels.biomedcentral.com).

The temporal expression pattern of cbh1 typically shows low basal levels when H. jecorina is grown on simple sugars, and a dramatic upregulation (hundreds-fold) after a switch to cellulose or lactose as the carbon source, usually peaking after several hours to a day into induction. The expression is also coordinated with other cellulase genes: cbh1 (encoding CBHI) and cbh2 (encoding the second cellobiohydrolase, CBHII) are usually induced together, along with various endoglucanases, under control of the same regulators (www.wikigenes.org). This coordinated response ensures a balanced enzyme mixture for complete cellulose degradation. Additionally, certain environmental factors influence cbh1 expression. For instance, light has been reported to affect cellulase gene expression in Trichoderma: in some studies, cultivation in darkness vs light led to differences in cbh1 transcript and protein levels (pmc.ncbi.nlm.nih.gov) (www.researchgate.net). The mechanism may involve blue-light responsive regulators that intersect with the cellulase regulation pathway. Furthermore, H. jecorina has been the subject of strain engineering where the cbh1 promoter is used as a strong, regulated promoter to drive expression of recombinant proteins (cdnsciencepub.com). The native cbh1 promoter is one of the strongest fungal promoters known, as it yields exceptionally high mRNA levels upon induction (cdnsciencepub.com). Researchers have modified this promoter to enhance production, for example by removing binding sites for the repressor ACE1 and inserting additional activator sites (pubmed.ncbi.nlm.nih.gov). Such engineered strains can produce even higher levels of CBHI or heterologous proteins.

In summary, cbh1 expression is inducible and subject to complex regulation: it requires specific inducers (cellulose or sophorose) and the presence of the activator Xyr1 (biotechnologyforbiofuels.biomedcentral.com), and it is repressed by glucose via Cre1 (biotechnologyforbiofuels.biomedcentral.com). The outcome is a tightly controlled expression pattern that ensures CBHI is abundantly produced only when needed – a strategy that conserves energy yet allows H. jecorina to respond rapidly to the opportunity of feeding on cellulose. This regulation aligns with the biological processes of response to nutrient availability and catabolic metabolism of polysaccharides, although the primary GO annotation remains focused on the cellulolytic process itself.

Evolutionary Conservation

The CBHI enzyme and its encoding gene cbh1 are highly conserved among cellulolytic fungi, reflecting the fundamental importance of cellulose degradation in many fungal lifestyles. Within the genus Trichoderma (teleomorph Hypocrea), cbh1 is essentially identical across different species and strains. For example, the cbh1 gene from Trichoderma koningii was found to have an almost identical DNA sequence to that of T. reesei, with only a few silent or non-coding differences, predicting an identical CBHI protein in both species (pubmed.ncbi.nlm.nih.gov). This indicates strong purifying selection on CBHI, likely due to its critical role in fitness on cellulose substrates – the enzyme’s sequence has remained virtually unchanged even in different species that diverged morphologically (pubmed.ncbi.nlm.nih.gov).

Beyond Trichoderma, orthologs of CBHI (Family 7 cellobiohydrolases) exist in numerous filamentous ascomycete fungi known for wood decay or plant biomass utilization. Species of Aspergillus, Penicillium, Fusarium, Chrysosporium, Humicola, and Neurospora (and their teleomorphs like Emericella) all possess genes encoding cellobiohydrolase enzymes analogous to CBHI (patents.google.com). These enzymes share significant sequence homology and domain architecture with H. jecorina CBHI, including the GH7 catalytic core and family 1 CBM. Such broad conservation suggests that the CBHI-type exoglucanase arose early in the evolution of filamentous fungi that colonize plant material and has been maintained across diverse lineages. Even certain basidiomycete fungi (though they often use a different set of cellulases) have functionally analogous exocellulases, albeit with less sequence similarity due to the larger phylogenetic distance. In bacteria, cellulose degradation is carried out by a different set of enzymes (often in multi-enzyme complexes called cellulosomes), so the cbh1 gene is specific to eukaryotic microbial lineages (fungi and some protists).

Interestingly, the genome of T. reesei is streamlined in terms of the number of cellulase genes – it contains only two major cellobiohydrolases (cbh1 and cbh2) and a handful of endoglucanases, which is fewer than many other fungi (biotechnologyforbiofuels.biomedcentral.com). Despite the smaller enzyme arsenal, T. reesei is a prolific enzyme producer, secreting large amounts of CBHI. This suggests that T. reesei has evolved regulatory and secretory efficiency rather than simply expanding gene copy number. In contrast, some other fungi have multiple genes for CBHI-like enzymes; for instance, certain Penicillium or Aspergillus species possess several GH7 or GH6 cellobiohydrolases, possibly reflecting gene duplication events. Yet, the core structure and function of these enzymes remain conserved. The conservation extends to specific sequence motifs: the active site catalytic motif (including the catalytic glutamate/aspartate residues) and the cellulose-binding aromatic residues in the CBM show strong conservation across species, underlining their importance for activity.

From an evolutionary perspective, the ability to decompose cellulose conferred a major advantage to saprophytic fungi in colonizing dead plant matter. The cbh1 gene, as part of the cellulase gene repertoire, likely co-evolved with plant cell wall complexity. Phylogenetic analyses group H. jecorina CBHI with other Ascomycete CBHs in a distinct clade separate from bacterial cellulases, evidencing a common origin in fungi. Even within Trichoderma, hyper-cellulolytic strains used in industry (like the RUT-C30 mutant) carry the same cbh1 gene as wild type, with improvements arising from regulatory mutations and not changes in the CBHI protein sequence (biotechnologyforbiofuels.biomedcentral.com) (pubmed.ncbi.nlm.nih.gov). This again highlights that evolution has maintained the CBHI enzyme’s structure and function over time. For practical purposes, the strong conservation means that knowledge about T. reesei CBHI (e.g. 3D structure or mechanism) is transferable to understanding CBH1 enzymes in other fungi. Additionally, it means antibodies or probes against T. reesei CBHI often cross-react with CBH1 from related fungi due to high sequence identity.

In summary, cbh1 is an evolutionarily conserved gene in cellulolytic filamentous fungi, and CBHI belongs to an ancient family of cellobiohydrolases that have been honed by natural selection to efficiently degrade cellulose. This conservation across species and time underscores CBHI’s indispensable functional role in nature’s carbon cycle. It also implies that Gene Ontology annotations for CBHI (e.g., cellulase activity, extracellular localization, cellulose catabolic process) are broadly applicable to its orthologs in other fungi, given their shared biochemical and cellular characteristics.

Relevant Gene Ontology (GO) Terms for CBHI (cbh1)

Based on the above characteristics of H. jecorina CBHI, the following Gene Ontology terms are applicable for annotation:

  • Molecular Function:
  • Cellulase activity (GO:0008810) – CBHI catalyzes the hydrolysis of 1,4-β-D-glucosidic linkages in cellulose, yielding cellobiose (en.wikipedia.org). This term encompasses exo-acting cellulases like CBHI that contribute to cellulolysis.
  • Cellulose binding (GO:0030248) – CBHI binds to insoluble cellulose via its cellulose-binding domain (en.wikipedia.org). This term reflects the protein’s ability to attach to polysaccharide substrates, facilitating its enzymatic action.

  • Biological Process:

  • Cellulose catabolic process (GO:0030245) – CBHI is directly involved in the breakdown of cellulose into smaller sugars (en.wikipedia.org) (www.wikigenes.org). Annotating cbh1 to this term captures its role in cellulose degradation as part of the fungus’s nutritive process.
  • Carbohydrate metabolic process (GO:0005975) – More generally, CBHI contributes to the metabolism of complex carbohydrates. (This is a broader term, but often cellulose catabolism is the more precise annotation as above.)

  • Cellular Component:

  • Extracellular region (GO:0005576) – CBHI is secreted outside the cell and functions in the extracellular space (specifically in the environment surrounding the fungus) (cdnsciencepub.com). This term denotes that the enzyme localizes to and acts in the extracellular milieu.
  • (Optionally, one could also use cell wall or extracellular space terms if considering the location of the enzyme relative to the substrate, but extracellular region (GO:0005576) is the standard annotation for secreted enzymes not attached to cell structures.)

Each of these GO terms is supported by experimental evidence and literature as described above. For example, cellulase activity is evidenced by biochemical assays of CBHI’s enzyme function (en.wikipedia.org), cellulose binding by the presence of a CBM and binding studies (en.wikipedia.org), cellulose catabolic process by physiological and growth assays on cellulose (www.wikigenes.org), and extracellular localization by secretion assays and the presence of a signal peptide (cdnsciencepub.com). These GO annotations collectively capture the essence of cbh1/CBHI’s role in the organism: an extracellular glycosidase that binds and degrades cellulose, enabling the fungus to derive energy from its environment. Each term can be used in gene ontology curation to accurately describe the function, process, and localization of the Hypocrea jecorina cbh1 gene product, cellobiohydrolase I.

📄 View Raw YAML

id: P62694
gene_symbol: cbh1
aliases:
- CBHI
- Cel7A
- Exoglucanase 1
- Cellobiohydrolase 7A
taxon:
  id: NCBITaxon:51453
  label: Trichoderma reesei
description: Exoglucanase 1 (cellobiohydrolase I, CBHI) is a major secreted 
  cellulase that processively hydrolyzes crystalline cellulose from chain ends, 
  releasing cellobiose units. It contains a GH7 catalytic domain with a 
  tunnel-shaped active site and a cellulose-binding module connected by a 
  glycosylated linker. CBHI is the most abundantly produced enzyme during growth
  on cellulose, comprising ~50% of secreted protein, and is tightly regulated by
  XYR1 (positive) and CRE1 (negative) transcription factors.
existing_annotations:
- term:
    id: GO:0004553
    label: hydrolase activity, hydrolyzing O-glycosyl compounds
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: This is a broad parent term that is correct but non-specific. The 
      more specific term GO:0016162 (cellulose 1,4-beta-cellobiosidase activity)
      better describes CBHI's molecular function.
    action: MODIFY
    proposed_replacement_terms:
    - id: GO:0016162
      label: cellulose 1,4-beta-cellobiosidase activity
- term:
    id: GO:0005576
    label: extracellular region
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: Correct annotation. CBHI is a secreted enzyme with an N-terminal 
      signal peptide that directs it through the classical secretory pathway to 
      function in the extracellular environment.
    action: ACCEPT
    supported_by:
    - reference_id: file:HYPJE/cbh1/cbh1-deep-research.md
      supporting_text: CBHI is a secreted enzyme that functions in the 
        extracellular environment for cellulose degradation
- term:
    id: GO:0005975
    label: carbohydrate metabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: This is a very broad biological process term. While correct, the 
      more specific term GO:0030245 (cellulose catabolic process) better 
      captures CBHI's primary biological role.
    action: MODIFY
    proposed_replacement_terms:
    - id: GO:0030245
      label: cellulose catabolic process
- term:
    id: GO:0030248
    label: cellulose binding
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: Correct annotation. CBHI contains a well-characterized 
      cellulose-binding module (CBM family 1) that specifically binds to 
      insoluble cellulose fibers via aromatic residues and hydrogen bonds.
    action: ACCEPT
    supported_by:
    - reference_id: PMID:2554967
      supporting_text: Three-dimensional solution structure of the C-terminal 
        domain of cellobiohydrolase I
    - reference_id: PMID:9041630
      supporting_text: mutations in the flat face of CBD, which is expected to 
        bind to crystalline
- term:
    id: GO:0000272
    label: polysaccharide catabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: While technically correct, this is too general. CBHI specifically 
      degrades cellulose, not polysaccharides in general. Should use GO:0030245 
      (cellulose catabolic process).
    action: MODIFY
    proposed_replacement_terms:
    - id: GO:0030245
      label: cellulose catabolic process
- term:
    id: GO:0009251
    label: glucan catabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000117
  review:
    summary: This term is appropriate as cellulose is a beta-1,4-glucan. 
      However, GO:0030245 (cellulose catabolic process) is more specific and 
      preferred for this enzyme.
    action: KEEP_AS_NON_CORE
- term:
    id: GO:0016162
    label: cellulose 1,4-beta-cellobiosidase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000003
  review:
    summary: This is the most specific and accurate molecular function term for 
      CBHI. It precisely describes the enzyme's activity of hydrolyzing 
      beta-1,4-glucosidic linkages in cellulose to release cellobiose from chain
      ends.
    action: ACCEPT
    supported_by:
    - reference_id: PMID:8036495
      supporting_text: The molecule contains a 40 angstrom long active site 
        tunnel that may account for many of the previously poorly understood 
        macroscopic properties of the enzyme
- term:
    id: GO:0016787
    label: hydrolase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: This is an extremely broad parent term that provides no specific 
      information about CBHI's function. Should use the specific term GO:0016162
      instead.
    action: REMOVE
- term:
    id: GO:0016798
    label: hydrolase activity, acting on glycosyl bonds
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: While correct, this is still too general. The specific term 
      GO:0016162 (cellulose 1,4-beta-cellobiosidase activity) should be used 
      instead.
    action: REMOVE
- term:
    id: GO:0030245
    label: cellulose catabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: This is the most appropriate biological process term for CBHI, 
      accurately describing its role in breaking down cellulose into smaller 
      sugars as part of fungal nutrient acquisition.
    action: ACCEPT
references:
- id: PMID:8036495
  title: The three-dimensional crystal structure of the catalytic core of 
    cellobiohydrolase I from Trichoderma reesei
  findings:
  - statement: First high-resolution crystal structure of CBHI catalytic core at
      1.8 Å resolution
    supporting_text: The structure of the major cellobiohydrolase, CBHI, of the 
      potent cellulolytic fungus Trichoderma reesei has been determined and 
      refined to 1.8 angstrom resolution
  - statement: Active site forms a 40 Å long tunnel characteristic of processive
      exoglucanases
    supporting_text: The molecule contains a 40 angstrom long active site tunnel
      that may account for many of the previously poorly understood macroscopic 
      properties of the enzyme
  - statement: Structural similarity to bacterial beta-glucanases and plant 
      legume lectins
    supporting_text: The three-dimensional structure is very similar to a family
      of bacterial beta-glucanases with the main-chain topology of the plant 
      legume lectins
  - statement: Active site residues identified through oligosaccharide complex 
      structure
    supporting_text: The active site residues were identified by solving the 
      structure of the enzyme complexed with an oligosaccharide, 
      o-iodobenzyl-1-thio-beta-cellobioside
  - statement: Structure deposited as PDB entry 1CEL for further research
    supporting_text: The three-dimensional crystal structure of the catalytic 
      core of cellobiohydrolase I from Trichoderma reesei
  - statement: Tunnel architecture explains processive mechanism on crystalline 
      cellulose
    supporting_text: The molecule contains a 40 angstrom long active site tunnel
      that may account for many of the previously poorly understood macroscopic 
      properties of the enzyme and its interaction with solid cellulose
- id: PMID:9746354
  title: Modified glycosylation of cellobiohydrolase I from a high 
    cellulase-producing mutant strain of Trichoderma reesei
  findings:
  - statement: Characterized complete glycosylation pattern of CBHI from 
      high-producing strain
    supporting_text: identified every site of glycosylation of CBHI from a high 
      cellulase-producing mutant strain of T. reesei, ALKO2877
  - statement: Three N-glycosylation sites in catalytic domain with single 
      N-acetylglucosamine
    supporting_text: The catalytic core domain comprises three N-linked glycans 
      which each consist of a single N-acetylglucosamine residue
  - statement: Eight O-glycosylation sites in linker region with mannose 
      residues
    supporting_text: Within the glycopeptide linker domain, all eight threonines
      are variably glycosylated with between at least one, and up to three, 
      mannose residues per site
  - statement: Used mass spectrometry and protein sequencing to identify 
      glycosylation sites
    supporting_text: electrospray mass spectrometry, solid-phase Edman 
      degradation, and monosaccharide analysis
  - statement: CBHI is heavily glycosylated affecting its molecular weight and 
      properties
    supporting_text: Cellobiohydrolase I is an industrially important 
      exocellulase secreted in high yields
  - statement: Site-specific heterogeneity observed in O-glycosylation pattern
    supporting_text: Modified glycosylation of cellobiohydrolase I from a high 
      cellulase-producing mutant strain of Trichoderma reesei
- id: PMID:2554967
  title: "Determination of the three-dimensional solution structure of the C-terminal domain of cellobiohydrolase I from Trichoderma reesei. A study using nuclear magnetic resonance and hybrid distance geometry-dynamical simulated annealing."
  findings:
  - statement: NMR structure of cellulose-binding domain reveals compact wedge 
      shape
    supporting_text: Three-dimensional solution structure of the C-terminal 
      domain of cellobiohydrolase I
  - statement: CBD contains disulfide bonds critical for structural stability
    supporting_text: Study using nuclear magnetic resonance and hybrid distance 
      geometry-dynamical simulated annealing
  - statement: C-terminal domain is separate from catalytic core and connected 
      by linker
    supporting_text: Determination of the three-dimensional solution structure 
      of the C-terminal domain of cellobiohydrolase I
  - statement: NMR approach used hybrid distance geometry and molecular dynamics
    supporting_text: A study using nuclear magnetic resonance and hybrid 
      distance geometry-dynamical simulated annealing
  - statement: Structure provides basis for understanding cellulose binding 
      mechanism
    supporting_text: Three-dimensional solution structure of the C-terminal 
      domain of cellobiohydrolase I from Trichoderma reesei
- id: PMID:9041630
  title: Three-dimensional structures of three engineered cellulose-binding 
    domains of cellobiohydrolase I from Trichoderma reesei
  findings:
  - statement: Engineered CBD variants reveal key residues for cellulose binding
    supporting_text: Three-dimensional structures of three engineered 
      cellulose-binding domains of cellobiohydrolase I from Trichoderma reesei
  - statement: Aromatic residues on flat face of CBD critical for substrate 
      interaction
    supporting_text: mutations in the flat face of CBD, which is expected to 
      bind to crystalline
  - statement: Multiple CBD variants studied to identify functional residues
    supporting_text: Three-dimensional structures of three engineered 
      cellulose-binding domains of cellobiohydrolase I
  - statement: Protein engineering approach used to understand binding mechanism
    supporting_text: Three engineered cellulose-binding domains of 
      cellobiohydrolase I from Trichoderma reesei
  - statement: Wedge-shaped domain architecture maintained in engineered 
      variants
    supporting_text: mutations in the flat face of CBD, which is expected to 
      bind to crystalline
- id: GO_REF:0000002
  title: Gene Ontology annotation through association of InterPro records with 
    GO terms
  findings: []
- id: GO_REF:0000003
  title: Gene Ontology annotation based on Enzyme Commission mapping
  findings: []
- id: GO_REF:0000043
  title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
  findings: []
- id: GO_REF:0000117
  title: Electronic Gene Ontology annotations created by ARBA machine learning 
    models
  findings: []
- id: GO_REF:0000120
  title: Combined Automated Annotation using Multiple IEA Methods
  findings: []
- id: file:HYPJE/cbh1/cbh1-deep-research.md
  title: Deep research on cbh1 function and regulation
  findings:
  - statement: CBHI is an exoglucanase that processively hydrolyzes crystalline 
      cellulose from chain ends
  - statement: Contains GH7 catalytic domain with tunnel-shaped active site 
      enabling processivity
  - statement: Has cellulose-binding module (CBM family 1) connected via 
      glycosylated linker
  - statement: Most abundantly secreted protein during growth on cellulose (~50%
      of total)
  - statement: Expression strictly regulated by XYR1 (activator) and CRE1 
      (glucose repressor)
  - statement: Induced by cellulose and sophorose, repressed by glucose
  - statement: Essential for fungal growth on crystalline cellulose substrates
core_functions:
- description: CBHI processively hydrolyzes crystalline cellulose by threading 
    chains through its tunnel-shaped active site, releasing cellobiose units 
    from chain ends
  molecular_function:
    id: GO:0016162
    label: cellulose 1,4-beta-cellobiosidase activity
  supported_by:
  - reference_id: PMID:8036495
    supporting_text: The molecule contains a 40 angstrom long active site tunnel
      that may account for many of the previously poorly understood macroscopic 
      properties of the enzyme
  - reference_id: file:HYPJE/cbh1/cbh1-deep-research.md
    supporting_text: CBHI is an exoglucanase that acts processively on cellulose
      chain ends, cleaving off disaccharide units (cellobiose) from the 
      cellulose polymer ... specifically hydrolyzes β-1,4-glucan linkages in 
      crystalline cellulose
  directly_involved_in:
  - id: GO:0030245
    label: cellulose catabolic process
  locations:
  - id: GO:0005576
    label: extracellular region
- description: Binds to insoluble cellulose fibers via its cellulose-binding 
    module to facilitate processive degradation
  molecular_function:
    id: GO:0030248
    label: cellulose binding
  supported_by:
  - reference_id: PMID:2554967
    supporting_text: Three-dimensional solution structure of the C-terminal 
      domain of cellobiohydrolase I
  - reference_id: PMID:9041630
    supporting_text: Aromatic residues on flat face of CBD critical for 
      substrate interaction
    full_text_unavailable: true
  - reference_id: file:HYPJE/cbh1/cbh1-deep-research.md
    supporting_text: CBHI contains a well-characterized cellulose-binding module
      (CBM family 1) ... specifically binds to insoluble cellulose fibers via 
      aromatic residues and hydrogen bonds
  directly_involved_in:
  - id: GO:0030245
    label: cellulose catabolic process
  locations:
  - id: GO:0005576
    label: extracellular region
suggested_questions:
- question: How does CBH1 achieve processivity in cellulose degradation and what
    determines its substrate specificity?
- question: What are the structural features that allow CBH1 to function 
    efficiently on crystalline cellulose?
- question: How is CBH1 expression and secretion regulated in response to 
    cellulose availability?
- question: What role does CBH1 play in fungal ecology and lignocellulose 
    decomposition in natural environments?
suggested_experiments:
- description: Single-molecule biophysics to study CBH1 processivity and 
    movement on cellulose surfaces
- description: Cryo-EM structural analysis of CBH1 in complex with cellulose 
    substrates
- description: Proteomics analysis of the cellulase secretome to understand CBH1
    regulation and co-expression with other enzymes
- description: Environmental metagenomics to study CBH1 function and expression 
    in natural lignocellulose-degrading communities
status: DRAFT