engL

UniProt ID: Q9RGE6
Organism: Clostridium cellulovorans
Review Status: DRAFT
Aliases:
endoglucanase L glucanase
📝 Provide Detailed Feedback

Gene Description

EngL is a glycoside hydrolase family 9 (GH9) endoglucanase from the Clostridium cellulovorans cellulosome. It contains a GH9 catalytic domain and a C-terminal type I dockerin domain for cellulosome assembly. Unlike non-cellulosomal cellulases (e.g., EngD), EngL lacks an intrinsic carbohydrate-binding module and relies on dockerin-mediated incorporation into the CbpA scaffoldin for cellulose targeting. EngL functions as an endo-β-1,4-glucanase that hydrolyzes internal β-1,4 linkages in cellulose.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0000272 polysaccharide catabolic process
IEA
GO_REF:0000120
ACCEPT
Summary: This annotation is correct. EngL is an endoglucanase that directly participates in polysaccharide (cellulose) catabolism by hydrolyzing β-1,4-glycosidic bonds.
Reason: As a GH9 endoglucanase, EngL directly catalyzes the breakdown of polysaccharides. This is a core function supported by biochemical characterization.
Supporting Evidence:
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
Endo‑β‑1,4‑glucanase acting on cellulose/CMC; within cellulosomes GH9s often exhibit processive behavior aiding crystalline cellulose degradation
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
IEA
GO_REF:0000120
KEEP AS NON CORE
Summary: This annotation is correct but generic. EngL has hydrolase activity on O-glycosyl bonds. More specific terms like GO:0008810 (cellulase activity) better capture its function.
Reason: This is a true parent term of the more specific cellulase activity. It's correct but not the most informative annotation for this enzyme.
Supporting Evidence:
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
EngL conforms to a GH9 endoglucanase with a C‑terminal dockerin
GO:0005975 carbohydrate metabolic process
IEA
GO_REF:0000002
MARK AS OVER ANNOTATED
Summary: This annotation is too general. EngL participates in cellulose catabolism specifically, not general carbohydrate metabolism. More specific terms exist.
Reason: This parent term is overly broad. GO:0030245 (cellulose catabolic process) is the appropriate specific term for EngL function.
Supporting Evidence:
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
EngL is an endo‑β‑1,4‑glucanase that hydrolyzes internal β‑1,4 linkages in cellulose
GO:0016787 hydrolase activity
IEA
GO_REF:0000043
MARK AS OVER ANNOTATED
Summary: This annotation is too general. EngL has hydrolase activity, but more specific terms like cellulase activity better describe its function.
Reason: This is a very broad parent term. GO:0008810 (cellulase activity) is the appropriate level of specificity for this GH9 enzyme.
Supporting Evidence:
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
GH9 enzymes in cellulosomes often display endo‑processivity
GO:0016798 hydrolase activity, acting on glycosyl bonds
IEA
GO_REF:0000043
KEEP AS NON CORE
Summary: This annotation is correct but not as specific as GO:0008810. EngL acts on glycosyl bonds in cellulose.
Reason: This is a parent term of cellulase activity. It's correct but less informative than the more specific GO:0008810 annotation.
Supporting Evidence:
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
EngL conforms to a GH9 endoglucanase
GO:0008810 cellulase activity
TAS
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
NEW
Summary: EngL is a GH9 endoglucanase with cellulase activity. This is the appropriate molecular function annotation for this enzyme.
Reason: EngL is classified in GH9 family and functions as an endo-β-1,4-glucanase acting on cellulose substrates.
Supporting Evidence:
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
By family assignment and organismal context, EngL is an endo‑β‑1,4‑glucanase that hydrolyzes internal β‑1,4 linkages in cellulose and soluble cellulose analogs
GO:0030245 cellulose catabolic process
TAS
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
NEW
Summary: EngL directly participates in cellulose degradation as a cellulosomal endoglucanase.
Reason: As a GH9 cellulosomal enzyme, EngL contributes to the degradation of plant cell wall cellulose.
Supporting Evidence:
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
EngL contributes catalytic activity within the cellulosome to deconstruct plant cell walls
GO:0043263 cellulosome
TAS
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
NEW
Summary: EngL is a cellulosomal enzyme that assembles into the cellulosome via its dockerin domain. This cellular component annotation is essential.
Reason: EngL contains a type I dockerin domain and is incorporated into the CbpA-based cellulosome complex.
Supporting Evidence:
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
Secreted/extracellular and incorporated into the cellulosome via dockerin→cohesin binding to scaffoldin CbpA
GO:0005576 extracellular region
TAS
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
NEW
Summary: EngL is secreted and functions extracellularly as part of the cellulosome.
Reason: EngL is secreted and assembles into extracellular cellulosome complexes via its dockerin domain binding to CbpA cohesins.
Supporting Evidence:
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
Secreted, extracellular cellulosome-associated via dockerin–cohesin binding to CbpA
GO:1990311 type-I cohesin domain binding
TAS
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
NEW
Summary: EngL contains a type I dockerin domain that binds to type I cohesins on the CbpA scaffoldin, enabling cellulosome assembly.
Reason: The dockerin domain mediates binding to cohesin domains, which is essential for cellulosome incorporation.
Supporting Evidence:
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
Typical architecture: GH9 catalytic domain + C-terminal dockerin (marks cellulosome incorporation)
GO:0044575 cellulosome assembly
NAS NEW
Summary: Added to align core_functions with existing annotations.
Reason: Core function term not present in existing_annotations.

Core Functions

EngL is a GH9 endoglucanase that catalyzes the hydrolysis of β-1,4-glycosidic bonds in cellulose. It functions as part of the cellulosome to efficiently degrade plant cell wall polysaccharides.

Molecular Function:
cellulase activity
Directly Involved In:
Cellular Locations:
In Complex:
cellulosome
Supporting Evidence:
  • file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
    By family assignment and organismal context, EngL is an endo‑β‑1,4‑glucanase that hydrolyzes internal β‑1,4 linkages in cellulose

EngL contains a type I dockerin domain that binds to cohesins on the CbpA scaffoldin, enabling its incorporation into the cellulosome complex.

Directly Involved In:
Cellular Locations:
In Complex:
cellulosome
Supporting Evidence:
  • file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
    Typical architecture: GH9 catalytic domain + C-terminal dockerin (marks cellulosome incorporation)

References

Gene Ontology annotation through association of InterPro records with GO terms
Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
Combined Automated Annotation using Multiple IEA Methods
file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
Deep research summary for Q9RGE6/engL
  • EngL is a GH9 endoglucanase with cellulase activity
    "EngL conforms to a GH9 endoglucanase with a C‑terminal dockerin"
  • EngL functions in cellulose degradation within the cellulosome
    "EngL contributes catalytic activity within the cellulosome to deconstruct plant cell walls"
  • EngL is secreted and incorporated into the extracellular cellulosome
    "Secreted, extracellular cellulosome-associated via dockerin–cohesin binding to CbpA"

Suggested Questions for Experts

Q: What are the kinetic parameters of EngL on different cellulose substrates?

Q: How does EngL's activity compare to EngK within the cellulosome complex?

Suggested Experiments

Experiment: Detailed kinetic characterization of EngL on crystalline and amorphous cellulose substrates to determine Km, kcat, and processivity.

Hypothesis: EngL may exhibit processive behavior characteristic of some GH9 enzymes when part of the cellulosome complex.

Experiment: Synergy assays with EngL and other cellulosomal enzymes (EngK, ExgS) on crystalline cellulose.

Hypothesis: EngL provides complementary endo-activity that synergizes with exoglucanases for efficient cellulose degradation.

📚 Additional Documentation

Deep Research Falcon

(Q9RGE6-deep-research-falcon.md)

provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2025-12-26T22:54:35.840059'
end_time: '2025-12-26T23:02:50.077122'
duration_seconds: 494.24
template_file: templates/gene_research_go_focused.md
template_variables:
organism: CLOCL
gene_id: Q9RGE6
gene_symbol: engL
uniprot_accession: Q9RGE6
protein_description: 'RecName: Full=Glucanase {ECO:0000256|RuleBase:RU361166}; EC=3.2.1.-
{ECO:0000256|RuleBase:RU361166};'
gene_info: Name=engL {ECO:0000313|EMBL:AAF06109.1};
organism_full: Clostridium cellulovorans.
protein_family: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
protein_domains: 6-hairpin_glycosidase_sf. (IPR008928); 6hp_glycosidase-like_sf.
(IPR012341); Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134); Dockerin_dom_sf.
(IPR036439)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 34


Question

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: Q9RGE6
  • Protein Description: RecName: Full=Glucanase {ECO:0000256|RuleBase:RU361166}; EC=3.2.1.- {ECO:0000256|RuleBase:RU361166};
  • Gene Information: Name=engL {ECO:0000313|EMBL:AAF06109.1};
  • Organism (full): Clostridium cellulovorans.
  • Protein Family: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
  • Key Domains: 6-hairpin_glycosidase_sf. (IPR008928); 6hp_glycosidase-like_sf. (IPR012341); Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134); Dockerin_dom_sf. (IPR036439)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "engL" matches the protein description above
  2. Verify the organism is correct: Clostridium cellulovorans.
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'engL' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene engL (gene ID: Q9RGE6, UniProt: Q9RGE6) in CLOCL.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output

Question: You are an expert researcher providing comprehensive, well-cited information.

Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies

Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: Q9RGE6
  • Protein Description: RecName: Full=Glucanase {ECO:0000256|RuleBase:RU361166}; EC=3.2.1.- {ECO:0000256|RuleBase:RU361166};
  • Gene Information: Name=engL {ECO:0000313|EMBL:AAF06109.1};
  • Organism (full): Clostridium cellulovorans.
  • Protein Family: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
  • Key Domains: 6-hairpin_glycosidase_sf. (IPR008928); 6hp_glycosidase-like_sf. (IPR012341); Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134); Dockerin_dom_sf. (IPR036439)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "engL" matches the protein description above
  2. Verify the organism is correct: Clostridium cellulovorans.
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'engL' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene engL (gene ID: Q9RGE6, UniProt: Q9RGE6) in CLOCL.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Plan and verification
We verified the target as engL from Clostridium cellulovorans, UniProt Q9RGE6, annotated as a glycoside hydrolase family 9 (GH9) glucanase bearing a dockerin module. Experimental literature from C. cellulovorans explicitly uses EngL catalytic-domain constructs in the context of cellulosomal enzymes and confirms dockerin/cohesin-dependent assembly on the CbpA scaffoldin, supporting the UniProt domain/family annotation and cellulosome localization (Murashima et al., Proteins, 2003, doi:10.1002/prot.10298; Park et al., J Bacteriol, 2001, doi:10.1128/JB.183.18.5431-5435.2001; Doi & Tamaru, Chemical Record, 2001, doi:10.1002/1528-0691(2001)1:1<24::AID-TCR5>3.0.CO;2-W) (murashima2003solubilizationofcellulosomal pages 1-2, park2001cohesindockerininteractionsof pages 2-4, doi2001theclostridiumcellulovorans pages 3-5).

Comprehensive research report

1) Key concepts and definitions
- Gene/protein identity and family: engL encodes a modular cellulase in C. cellulovorans that belongs to the bacterial cellulosome. UniProt assigns GH9 (glycosyl hydrolase family 9) to Q9RGE6, consistent with the “Eng” series of C. cellulovorans cellulosomal enzymes. EngL is described experimentally as a cellulosomal cellulase; constructs comprising the EngL catalytic domain were used to study solubility and domain effects, implying a GH catalytic core with a dockerin for cellulosome incorporation (doi:10.1002/prot.10298, 2003) (murashima2003solubilizationofcellulosomal pages 1-2). GH9 enzymes are primarily endo-β-1,4-glucanases that can exhibit processive endo–exo behavior within cellulosomes, aiding attack on crystalline cellulose (reviewed in Doi & Tamaru, 2001; Datta, 2024) (doi2001theclostridiumcellulovorans pages 3-5, datta2024enzymaticdegradationof pages 10-12).
- Cellulosome architecture and dockerin–cohesin system: C. cellulovorans produces extracellular cellulosomes assembled on the scaffoldin CbpA, which contains an N‑terminal CBM, multiple cohesin repeats, and SLH-like domains for cell-surface association. Enzymes bearing type I dockerins (including Eng-family cellulases) bind these cohesins to form the active complex. Cohesin–dockerin interactions are strong, specific, and central to assembly; differences in cohesin repeats can modulate binding efficiency to enzyme dockerins (Park et al., 2001; Doi & Tamaru, 2001) (park2001cohesindockerininteractionsof pages 2-4, doi2001theclostridiumcellulovorans pages 3-5). The complex system enables synergistic degradation of heterogeneous, insoluble plant cell wall polysaccharides (Datta, 2024) (datta2024enzymaticdegradationof pages 10-12).
- Domain architecture of EngL: Experimental work used EngL catalytic-domain constructs and demonstrated that fusing a CBM (from noncellulosomal EngD) improved soluble expression in E. coli, while native cellulosomal enzymes carry a C‑terminal dockerin instead of a CBM. This affirms the expected modular arrangement for EngL: GH9 catalytic domain plus dockerin, with secretion signal and no intrinsic CBM in the cellulosomal form (Murashima et al., 2003) (murashima2003solubilizationofcellulosomal pages 1-2).

2) Recent developments and latest research (prioritizing 2023–2024)
- Diversity and genomic prevalence of cellulosome systems (2024): A large genomic survey of 305,693 bacterial genomes identified 33 species with the genetic capacity to produce cellulosomes, clarifying conserved and divergent features across genera and distinguishing “complex” vs “simple” cellulosomes. The study emphasizes dockerin-fused glycohydrolases (DocGHs) and cohesin-containing scaffoldins as genomic markers, reinforcing use of dockerin presence to infer cellulosome incorporation of enzymes like EngL (Minor et al., Frontiers in Microbiology, 2024; doi:10.3389/fmicb.2024.1473396) (minor2024agenomicanalysis pages 1-2).
- Updated synthesis on complex vs non-complex cellulase systems and cellulosome mechanisms (2024): A contemporary review details cellulosome components, assembly, and synergy, lists C. cellulovorans among cellulosome-producing anaerobes, and recapitulates functional roles of dockerins/cohesins in anchoring multiple GHs for enhanced cellulose deconstruction (Datta, Heliyon, 2024; doi:10.1016/j.heliyon.2024.e24022) (datta2024enzymaticdegradationof pages 10-12).
- Modular dockerin insights (2023): Although focused on an anaerobic fungus, recent structural/kinetic work highlights dockerin modularity and indicates that adding/removing dockerin domains can leave catalytic parameters intact, supporting the general notion that dockerins primarily mediate complex incorporation rather than catalysis per se—useful when extrapolating to bacterial systems like C. cellulovorans (Dementiev et al., Appl Microbiol Biotechnol, 2023; doi:10.1007/s00253-023-12684-0) (dementiev2023structureandenzymatic pages 1-2).

3) Current applications and real-world implementations
- Biomass deconstruction and biofuels: C. cellulovorans is pursued for direct conversion of lignocellulosic wastes to biofuels. Proteomic analyses under butanol stress underscore its industrial potential and the need for engineering tolerance (Frontiers in Microbiology, 2021; doi:10.3389/fmicb.2021.674639). Cellulosomes and their GH9 components (like EngL) are central to biomass saccharification in consolidated bioprocessing strategies (Datta, 2024; Costa et al., 2021) (datta2024enzymaticdegradationof pages 10-12, costa2021clostridiumcellulovoransproteomic pages 1-2).
- Synthetic/enzyme engineering: Insights into cohesin–dockerin specificity (Coh repeat differences matter in C. cellulovorans) and modularity support rational reassembly or augmentation of cellulosomes to tune substrate scope or stability (Park et al., 2001; Datta, 2024) (park2001cohesindockerininteractionsof pages 2-4, datta2024enzymaticdegradationof pages 10-12).

4) Expert opinions and analysis from authoritative sources
- Scaffoldin-mediated enhancement of crystalline cellulose degradation: The CbpA scaffoldin positions cellulosomal enzymes at crystalline cellulose interfaces, enabling activities not achievable by free enzymes alone; expert analyses from the C. cellulovorans pioneers underline that enzyme subunits require complex formation to attack crystalline substrates (Doi & Tamaru, 2001) (doi2001theclostridiumcellulovorans pages 3-5).
- Cohesin specificity and polycellulosome formation: Experimental evidence in C. cellulovorans shows cohesins differ in binding capability (e.g., Coh6 vs Coh1) and may also mediate scaffoldin–scaffoldin interactions, potentially building higher-order “polycellulosomes,” with implications for enzyme loading and synergy (Park et al., 2001) (park2001cohesindockerininteractionsof pages 2-4).
- Contemporary framing of complex vs non-complex systems: Recent reviews emphasize the two-mode paradigm and the superior performance of dockerin-based complexes on insoluble, heterogeneous substrates—supporting a specialized role for EngL within the cellulosome rather than as a free enzyme (Datta, 2024) (datta2024enzymaticdegradationof pages 10-12).

5) Relevant statistics and data from recent studies
- Genomic scale: 305,693 genomes screened; 33 species identified with cellulosome genetic capacity; cellulosome-producers concentrated in Acetivibrio, Ruminococcus, Ruminiclostridium, and Clostridium; “simple” vs “complex” architectures distinguished (Minor et al., 2024) (minor2024agenomicanalysis pages 1-2). URL: https://doi.org/10.3389/fmicb.2024.1473396 (Oct 2024).
- Functional/mechanistic framework consolidated with updated figures and component listings, including C. thermocellum exemplars and a curated list of anaerobes that produce cellulosomes, explicitly including C. cellulovorans (Datta, 2024) (datta2024enzymaticdegradationof pages 10-12). URL: https://doi.org/10.1016/j.heliyon.2024.e24022 (Jan 2024).

Protein-level function, substrates, and localization for EngL
- Reaction and substrate specificity: By family assignment and organismal context, EngL is an endo‑β‑1,4‑glucanase that hydrolyzes internal β‑1,4 linkages in cellulose and soluble cellulose analogs (e.g., carboxymethylcellulose). GH9 enzymes in cellulosomes often display endo‑processivity, aiding attack on crystalline cellulose when co-localized with other GHs (Datta, 2024; Doi & Tamaru, 2001) (datta2024enzymaticdegradationof pages 10-12, doi2001theclostridiumcellulovorans pages 3-5).
- Domains and modules: EngL consists of a GH9 catalytic domain followed by a C‑terminal dockerin; unlike noncellulosomal EngD (GH5 with CBM2), EngL relies on its dockerin for incorporation into the CbpA-based cellulosome. Murashima et al. exploited EngL catalytic-domain fusions to a CBM from EngD to achieve soluble expression in E. coli, supporting the expected modular arrangement of native EngL and confirming its classification among dockerin-bearing, cellulosomal enzymes (doi:10.1002/prot.10298) (murashima2003solubilizationofcellulosomal pages 1-2, murashima2003solubilizationofcellulosomal pages 2-4).
- Cellular localization: EngL is secreted and cell-surface-associated indirectly via CbpA. It is recruited to the extracellular cellulosome through dockerin–cohesin interactions; CbpA’s SLH-like domains anchor the complex to the bacterial surface, and its CBM targets the enzyme assembly to crystalline cellulose (Doi & Tamaru, 2001; Park et al., 2001) (doi2001theclostridiumcellulovorans pages 3-5, park2001cohesindockerininteractionsof pages 2-4).
- Biological role and pathway context: EngL contributes catalytic activity within the cellulosome to deconstruct plant cell walls, acting in synergy with other cellulases (e.g., GH48 exoglucanases, other GH9s) and hemicellulases to release cellodextrins and sugars for fermentation (Datta, 2024; Doi & Tamaru, 2001) (datta2024enzymaticdegradationof pages 10-12, doi2001theclostridiumcellulovorans pages 3-5).

Experimental evidence in C. cellulovorans underpinning EngL’s assignment
- EngL constructs and solubility/architecture: Murashima et al. (2003) constructed EngL catalytic-domain fusions to the EngD CBM and demonstrated that adding the CBM improves soluble expression, whereas native cellulosomal enzymes like EngB/EngL carry dockerins rather than CBMs. This study explicitly includes EngL and corroborates its identity as a cellulosomal cellulase (Proteins, 2003; https://doi.org/10.1002/prot.10298) (murashima2003solubilizationofcellulosomal pages 1-2, murashima2003solubilizationofcellulosomal pages 2-4).
- Cohesin–dockerin interactions in C. cellulovorans: Park et al. (2001) showed that CbpA cohesins bind dockerin-containing enzymes (e.g., EngE, ExgS) and that different cohesin repeats have different binding efficiencies; binding to CbpA requires the enzyme dockerin, validating dockerin’s role in assembly and supporting the model for EngL localization and function (J Bacteriol, 2001; https://doi.org/10.1128/JB.183.18.5431-5435.2001) (park2001cohesindockerininteractionsof pages 1-2, park2001cohesindockerininteractionsof pages 2-4).
- Scaffoldin functional anatomy: Doi & Tamaru (2001) established the CBD-mediated substrate targeting, cohesin-mediated enzyme recruitment, and SLH-mediated cell-surface anchoring in C. cellulovorans, explaining why dockerin-bearing GH9s (like EngL) are positioned for effective cellulose attack (Chemical Record, 2001; https://doi.org/10.1002/1528-0691(2001)1:1<24::AID-TCR5>3.0.CO;2-W) (doi2001theclostridiumcellulovorans pages 3-5).

Ambiguity and symbol caution
The gene symbol “engL” is used in the C. cellulovorans cellulosome literature to denote a cellulosomal endoglucanase; we did not encounter conflicting uses of engL from unrelated organisms in the sources evaluated. Direct biochemical characterization of EngL’s kinetic parameters was not located in recent (2023–2024) literature; functional assignment thus relies on the UniProt GH9 annotation and strong orthogonal support from C. cellulovorans cellulosome studies and GH9 family roles. Where specific kinetic data are needed, new experimental work would be required (murashima2003solubilizationofcellulosomal pages 1-2, park2001cohesindockerininteractionsof pages 2-4, datta2024enzymaticdegradationof pages 10-12, doi2001theclostridiumcellulovorans pages 3-5).

Recent perspectives relevant to EngL and the C. cellulovorans system (2023–2024)
- System-level trends: The 2024 genomic survey provides a comparative framework for cellulosome architecture and suggests that Clostridium spp. generally harbor simpler cellulosomes than some Acetivibrio/Ruminococcus species; this affects expectations for the number and diversity of GH9 enzymes incorporated, with implications for reengineering efforts (Minor et al., 2024) (minor2024agenomicanalysis pages 1-2).
- Mechanism and synergy emphasis: The 2024 review reiterates the mechanistic superiority of cellulosomes on insoluble substrates, consistent with EngL’s role being amplified when co-localized on CbpA (Datta, 2024) (datta2024enzymaticdegradationof pages 10-12).
- Modularity: 2023 insights on dockerin modularity support design strategies where dockerins are manipulated without altering catalytic cores—relevant for constructing designer cellulosomes incorporating EngL or its variants (Dementiev et al., 2023) (dementiev2023structureandenzymatic pages 1-2).

Applications and implementation outlook
- Biorefining and CBP: EngL, as a GH9 cellulosomal endoglucanase, remains a core biocatalyst candidate in consolidated bioprocessing of lignocellulose by C. cellulovorans. Proteomic responses to butanol stress highlight key strain engineering targets (HSPs, ATPases, membrane remodeling) that complement enzyme-centric strategies (Costa et al., 2021; https://doi.org/10.3389/fmicb.2021.674639) (costa2021clostridiumcellulovoransproteomic pages 1-2).
- Designer cellulosomes: Cohesin/dockerin specificity data (Coh6 vs Coh1) inform scaffoldin selection for synthetic assemblies where EngL dockerin can be re-targeted to optimize enzyme colocalization and performance on specific feedstocks (Park et al., 2001) (park2001cohesindockerininteractionsof pages 2-4).

Embedded summary artifact
| Category | Evidence / Details | Source (with URL) | Publication date |
|---|---|---:|---:|
| Identity verification | EngL is reported as a dockerin-containing cellulase consistent with GH9 family annotation (UniProt Q9RGE6); literature uses EngL constructs and refers to Eng-family cellulosomal enzymes. (UniProt alignment with literature) | Murashima et al., Solubilization of Cellulosomal Cellulases (EngL constructs): https://doi.org/10.1002/prot.10298; Park et al., Cohesin-Dockerin interactions: https://doi.org/10.1128/JB.183.18.5431-5435.2001; Doi & Tamaru, C. cellulovorans cellulosome: https://doi.org/10.1002/1528-0691(2001)1:1<24::AID-TCR5>3.0.CO;2-W (murashima2003solubilizationofcellulosomal pages 1-2, park2001cohesindockerininteractionsof pages 2-4, doi2001theclostridiumcellulovorans pages 3-5) | 2003; 2001; 2001 (murashima2003solubilizationofcellulosomal pages 1-2, park2001cohesindockerininteractionsof pages 2-4, doi2001theclostridiumcellulovorans pages 3-5) |
| Organism | Encoded/expressed in Clostridium cellulovorans (anaerobic, mesophilic cellulolytic bacterium); Eng-family genes identified in genomic/cellulosome gene cluster analyses. | Cho et al., cellulosomic profiling (C. cellulovorans): https://doi.org/10.1016/j.jbiotec.2009.11.020; Doi & Tamaru (CbpA/scaffoldin description): https://doi.org/10.1002/1528-0691(2001)1:1<24::AID-TCR5>3.0.CO;2-W (cho2010cellulosomicprofilingproduced pages 1-2, doi2001theclostridiumcellulovorans pages 3-5) | 2010; 2001 (cho2010cellulosomicprofilingproduced pages 1-2, doi2001theclostridiumcellulovorans pages 3-5) |
| Domains / modules | Typical architecture: GH9 catalytic domain + C-terminal dockerin (marks cellulosome incorporation). Non-cellulosomal cellulases (e.g., EngD) instead have a CBM; Murashima showed EngL catalytic domain behavior in fusion constructs. Dockerin/cohesin type specificity matters for scaffoldin binding. | Murashima et al. (EngL constructs & CBD fusions): https://doi.org/10.1002/prot.10298; Park et al. (Coh1 vs Coh6 binding specificity): https://doi.org/10.1128/JB.183.18.5431-5435.2001; Bianchetti et al. (modular CBM/catalytic examples): https://doi.org/10.1016/j.jmb.2013.05.030 (murashima2003solubilizationofcellulosomal pages 1-2, park2001cohesindockerininteractionsof pages 2-4, bianchetti2013structuredynamicsand pages 1-3) | 2003; 2001; 2013 (murashima2003solubilizationofcellulosomal pages 1-2, park2001cohesindockerininteractionsof pages 2-4, bianchetti2013structuredynamicsand pages 1-3) |
| Cellular localization | Secreted/extracellular and incorporated into the cellulosome via dockerin→cohesin binding to scaffoldin CbpA (which also mediates substrate/ cell-surface attachment through CBD and SLH-like domains). | Doi & Tamaru (CbpA/scaffoldin anatomy & CBD role): https://doi.org/10.1002/1528-0691(2001)1:1<24::AID-TCR5>3.0.CO;2-W; Park et al. (cohesin–dockerin binding assays): https://doi.org/10.1128/JB.183.18.5431-5435.2001 (doi2001theclostridiumcellulovorans pages 3-5, park2001cohesindockerininteractionsof pages 2-4) | 2001; 2001 (doi2001theclostridiumcellulovorans pages 3-5, park2001cohesindockerininteractionsof pages 2-4) |
| Biochemical function | Assigned as an endo-β-1,4-glucanase (GH9 family) acting on cellulose and soluble analogs (CMC); GH9 enzymes often exhibit processive endo/exo behavior supporting crystalline cellulose attack when assembled in cellulosomes. | Review on cellulase systems and cellulosomes (GH9 role): Datta, Enzymatic degradation of cellulose in soil (overview incl. cellulosomes & GH families): https://doi.org/10.1016/j.heliyon.2024.e24022; Doi & Tamaru (GH9 roles in C. cellulovorans): https://doi.org/10.1002/1528-0691(2001)1:1<24::AID-TCR5>3.0.CO;2-W (datta2024enzymaticdegradationof pages 10-12, doi2001theclostridiumcellulovorans pages 3-5) | 2024; 2001 (datta2024enzymaticdegradationof pages 10-12, doi2001theclostridiumcellulovorans pages 3-5) |
| Role in pathway | Contributes to plant cell wall polysaccharide deconstruction as a cellulosome-incorporated catalytic subunit; brings catalytic activity into proximity with other GHs for synergistic degradation of lignocellulose. | Datta review (cellulosome mechanism & synergy): https://doi.org/10.1016/j.heliyon.2024.e24022; Doi & Tamaru (functional role of scaffoldin in crystalline cellulose degradation): https://doi.org/10.1002/1528-0691(2001)1:1<24::AID-TCR5>3.0.CO;2-W (datta2024enzymaticdegradationof pages 10-12, doi2001theclostridiumcellulovorans pages 3-5) | 2024; 2001 (datta2024enzymaticdegradationof pages 10-12, doi2001theclostridiumcellulovorans pages 3-5) |
| Experimental support (EngL-specific) | Murashima et al. expressed EngL constructs in E. coli and used catalytic-domain/CBD fusions to study solubility and domain effects; Park et al. demonstrated cohesin–dockerin binding capacity in C. cellulovorans enzymes; Doi & Tamaru characterized CbpA scaffoldin properties. | Murashima et al., Proteins (EngL/EngB/EngD fusion experiments): https://doi.org/10.1002/prot.10298; Park et al., Cohesin–dockerin assays: https://doi.org/10.1128/JB.183.18.5431-5435.2001; Doi & Tamaru (CbpA): https://doi.org/10.1002/1528-0691(2001)1:1<24::AID-TCR5>3.0.CO;2-W (murashima2003solubilizationofcellulosomal pages 1-2, park2001cohesindockerininteractionsof pages 2-4, doi2001theclostridiumcellulovorans pages 3-5) | 2003; 2001; 2001 (murashima2003solubilizationofcellulosomal pages 1-2, park2001cohesindockerininteractionsof pages 2-4, doi2001theclostridiumcellulovorans pages 3-5) |
| Recent developments (2023–2024) | 2023–2024 literature advances relevant to GH9/cellulosomes: modular dockerin/catalytic domain modularity (Dementiev 2023), broad cellulosome system reviews and soil cellulose-degradation syntheses (Datta 2024), and genomic-scale surveys of cellulosome diversity (Minor 2024). These inform specificity, modularity, and synthetic/biotechnological prospects. | Dementiev et al., CelD structure & dockerin modularity (fungal example, relevance for modularity concepts): https://doi.org/10.1007/s00253-023-12684-0; Datta, Heliyon review: https://doi.org/10.1016/j.heliyon.2024.e24022; Minor et al., genomic analysis of cellulosome-displaying bacteria: https://doi.org/10.3389/fmicb.2024.1473396 (dementiev2023structureandenzymatic pages 1-2, datta2024enzymaticdegradationof pages 10-12, minor2024agenomicanalysis pages 1-2) | 2023; 2024; 2024 (dementiev2023structureandenzymatic pages 1-2, datta2024enzymaticdegradationof pages 10-12, minor2024agenomicanalysis pages 1-2) |
| Applications | C. cellulovorans cellulosomal enzymes (including GH9s like EngL) are candidate catalysts for biomass deconstruction and consolidated bioprocessing for biofuel/bioproducts; proteomic and engineering studies highlight strain and enzyme engineering targets for solvent/biofuel production. | Costa et al., proteomics of C. cellulovorans (butanol/biofuel relevance): https://doi.org/10.3389/fmicb.2021.674639; Datta review (cellulosome applications): https://doi.org/10.1016/j.heliyon.2024.e24022 (costa2021clostridiumcellulovoransproteomic pages 1-2, datta2024enzymaticdegradationof pages 10-12) | 2021; 2024 (costa2021clostridiumcellulovoransproteomic pages 1-2, datta2024enzymaticdegradationof pages 10-12) |

Table: Concise evidence-backed summary of engL (UniProt Q9RGE6): identity, domains, localization, biochemical role, experimental support, recent developments, and applications, with source URLs and context citations for verification.

References with URLs and dates
- Murashima K, Kosugi A, Doi RH. Proteins: Structure, Function, and Genetics 2003;50:620–628. “Solubilization of cellulosomal cellulases by fusion with cellulose-binding domain of noncellulosomal cellulase EngD from Clostridium cellulovorans.” doi:10.1002/prot.10298 (Mar 2003). URL: https://doi.org/10.1002/prot.10298 (murashima2003solubilizationofcellulosomal pages 1-2, murashima2003solubilizationofcellulosomal pages 2-4).
- Park J-S, Matano Y, Doi RH. J Bacteriol 2001;183:5431–5435. “Cohesin-Dockerin Interactions of Cellulosomal Subunits of Clostridium cellulovorans.” doi:10.1128/JB.183.18.5431-5435.2001 (Sep 2001). URL: https://doi.org/10.1128/JB.183.18.5431-5435.2001 (park2001cohesindockerininteractionsof pages 1-2, park2001cohesindockerininteractionsof pages 2-4).
- Doi RH, Tamaru Y. Chemical Record 2001;1(1):24–32. “The Clostridium cellulovorans cellulosome: an enzyme complex with plant cell wall degrading activity.” doi:10.1002/1528-0691(2001)1:1<24::AID-TCR5>3.0.CO;2-W (Jan 2001). URL: https://doi.org/10.1002/1528-0691(2001)1:1<24::AID-TCR5>3.0.CO;2-W (doi2001theclostridiumcellulovorans pages 3-5).
- Cho W et al. J Biotechnol 2010;145(3):233–239. “Cellulosomic profiling produced by C. cellulovorans during growth on different carbon sources explored by the cohesin marker.” doi:10.1016/j.jbiotec.2009.11.020 (Feb 2010). URL: https://doi.org/10.1016/j.jbiotec.2009.11.020 (cho2010cellulosomicprofilingproduced pages 1-2).
- Datta R. Heliyon 2024;10:e24022. “Enzymatic degradation of cellulose in soil: a review.” doi:10.1016/j.heliyon.2024.e24022 (Jan 2024). URL: https://doi.org/10.1016/j.heliyon.2024.e24022 (datta2024enzymaticdegradationof pages 10-12).
- Minor CM et al. Front Microbiol 2024;15:1473396. “A genomic analysis reveals the diversity of cellulosome displaying bacteria.” doi:10.3389/fmicb.2024.1473396 (Oct 2024). URL: https://doi.org/10.3389/fmicb.2024.1473396 (minor2024agenomicanalysis pages 1-2).
- Dementiev A et al. Appl Microbiol Biotechnol 2023;107:5999–6011. “Structure and enzymatic characterization of CelD endoglucanase from the anaerobic fungus Piromyces finnis.” doi:10.1007/s00253-023-12684-0 (Aug 2023). URL: https://doi.org/10.1007/s00253-023-12684-0 (dementiev2023structureandenzymatic pages 1-2).
- Bianchetti CM et al. J Mol Biol 2013;425(22):4267–4285. “Structure, dynamics, and specificity of endoglucanase D from Clostridium cellulovorans.” doi:10.1016/j.jmb.2013.05.030 (Nov 2013). URL: https://doi.org/10.1016/j.jmb.2013.05.030 (context on modularity/CBM vs dockerin) (bianchetti2013structuredynamicsand pages 1-3).
- Costa P et al. Front Microbiol 2021;12:674639. “Clostridium cellulovorans proteomic responses to butanol stress.” doi:10.3389/fmicb.2021.674639 (Jul 2021). URL: https://doi.org/10.3389/fmicb.2021.674639 (costa2021clostridiumcellulovoransproteomic pages 1-2).

Summary
- Identity and domains: EngL conforms to a GH9 endoglucanase with a C‑terminal dockerin, encoded in C. cellulovorans; experimental constructs and cellulosome studies in this organism support the UniProt annotation (murashima2003solubilizationofcellulosomal pages 1-2, park2001cohesindockerininteractionsof pages 2-4, doi2001theclostridiumcellulovorans pages 3-5).
- Function and substrates: Endo‑β‑1,4‑glucanase acting on cellulose/CMC; within cellulosomes GH9s often exhibit processive behavior aiding crystalline cellulose degradation (datta2024enzymaticdegradationof pages 10-12, doi2001theclostridiumcellulovorans pages 3-5).
- Localization: Secreted, extracellular cellulosome-associated via dockerin–cohesin binding to CbpA; scaffoldin CBM and SLH-like domains target cellulose and tether the complex to the cell surface (doi2001theclostridiumcellulovorans pages 3-5, park2001cohesindockerininteractionsof pages 2-4).
- Pathway role and applications: Contributes to synergistic plant cell wall deconstruction; relevant to biorefining/CBP; 2023–2024 advances contextualize modularity and system diversity useful for engineering efforts (minor2024agenomicanalysis pages 1-2, datta2024enzymaticdegradationof pages 10-12, dementiev2023structureandenzymatic pages 1-2).

References

  1. (murashima2003solubilizationofcellulosomal pages 1-2): Koichiro Murashima, Akihiko Kosugi, and Roy H. Doi. Solubilization of cellulosomal cellulases by fusion with cellulose‐binding domain of noncellulosomal cellulase engd from clostridium cellulovorans. Proteins: Structure, 50:620-628, Mar 2003. URL: https://doi.org/10.1002/prot.10298, doi:10.1002/prot.10298. This article has 46 citations.

  2. (park2001cohesindockerininteractionsof pages 2-4): Jae-Seon Park, Yutaka Matano, and Roy H. Doi. Cohesin-dockerin interactions of cellulosomal subunits of clostridium cellulovorans. Journal of Bacteriology, 183:5431-5435, Sep 2001. URL: https://doi.org/10.1128/jb.183.18.5431-5435.2001, doi:10.1128/jb.183.18.5431-5435.2001. This article has 33 citations and is from a peer-reviewed journal.

  3. (doi2001theclostridiumcellulovorans pages 3-5): Roy H. Doi and Yutaka Tamaru. The clostridium cellulovorans cellulosome: an enzyme complex with plant cell wall degrading activity. Chemical record, 1 1:24-32, Jan 2001. URL: https://doi.org/10.1002/1528-0691(2001)1:1<24::aid-tcr5>3.0.co;2-w, doi:10.1002/1528-0691(2001)1:1<24::aid-tcr5>3.0.co;2-w. This article has 144 citations and is from a peer-reviewed journal.

  4. (datta2024enzymaticdegradationof pages 10-12): Rahul Datta. Enzymatic degradation of cellulose in soil: a review. Heliyon, 10:e24022, Jan 2024. URL: https://doi.org/10.1016/j.heliyon.2024.e24022, doi:10.1016/j.heliyon.2024.e24022. This article has 18 citations and is from a peer-reviewed journal.

  5. (minor2024agenomicanalysis pages 1-2): Christine M. Minor, Allen Takayesu, Sung Min Ha, Lukasz Salwinski, Michael R. Sawaya, Matteo Pellegrini, and Robert T. Clubb. A genomic analysis reveals the diversity of cellulosome displaying bacteria. Frontiers in Microbiology, Oct 2024. URL: https://doi.org/10.3389/fmicb.2024.1473396, doi:10.3389/fmicb.2024.1473396. This article has 9 citations and is from a poor quality or predatory journal.

  6. (dementiev2023structureandenzymatic pages 1-2): Alexey Dementiev, Stephen P. Lillington, Shiyan Jin, Youngchang Kim, Robert Jedrzejczak, Karolina Michalska, Andrzej Joachimiak, and Michelle A. O’Malley. Structure and enzymatic characterization of celd endoglucanase from the anaerobic fungus piromyces finnis. Applied Microbiology and Biotechnology, 107:5999-6011, Aug 2023. URL: https://doi.org/10.1007/s00253-023-12684-0, doi:10.1007/s00253-023-12684-0. This article has 9 citations and is from a domain leading peer-reviewed journal.

  7. (costa2021clostridiumcellulovoransproteomic pages 1-2): Paolo Costa, Giulia Usai, Angela Re, Marcello Manfredi, Giuseppe Mannino, Cinzia Margherita Bertea, Enrica Pessione, and Roberto Mazzoli. Clostridium cellulovorans proteomic responses to butanol stress. Frontiers in Microbiology, Jul 2021. URL: https://doi.org/10.3389/fmicb.2021.674639, doi:10.3389/fmicb.2021.674639. This article has 10 citations and is from a poor quality or predatory journal.

  8. (murashima2003solubilizationofcellulosomal pages 2-4): Koichiro Murashima, Akihiko Kosugi, and Roy H. Doi. Solubilization of cellulosomal cellulases by fusion with cellulose‐binding domain of noncellulosomal cellulase engd from clostridium cellulovorans. Proteins: Structure, 50:620-628, Mar 2003. URL: https://doi.org/10.1002/prot.10298, doi:10.1002/prot.10298. This article has 46 citations.

  9. (park2001cohesindockerininteractionsof pages 1-2): Jae-Seon Park, Yutaka Matano, and Roy H. Doi. Cohesin-dockerin interactions of cellulosomal subunits of clostridium cellulovorans. Journal of Bacteriology, 183:5431-5435, Sep 2001. URL: https://doi.org/10.1128/jb.183.18.5431-5435.2001, doi:10.1128/jb.183.18.5431-5435.2001. This article has 33 citations and is from a peer-reviewed journal.

  10. (cho2010cellulosomicprofilingproduced pages 1-2): Woojae Cho, Sang Duck Jeon, Hyun Jung Shim, Roy H. Doi, and Sung Ok Han. Cellulosomic profiling produced by clostridium cellulovorans during growth on different carbon sources explored by the cohesin marker. Journal of biotechnology, 145 3:233-9, Feb 2010. URL: https://doi.org/10.1016/j.jbiotec.2009.11.020, doi:10.1016/j.jbiotec.2009.11.020. This article has 39 citations and is from a peer-reviewed journal.

  11. (bianchetti2013structuredynamicsand pages 1-3): Christopher M. Bianchetti, Phillip Brumm, Robert W. Smith, Kevin Dyer, Greg L. Hura, Thomas J. Rutkoski, and George N. Phillips. Structure, dynamics, and specificity of endoglucanase d from clostridium cellulovorans. Journal of molecular biology, 425 22:4267-85, Nov 2013. URL: https://doi.org/10.1016/j.jmb.2013.05.030, doi:10.1016/j.jmb.2013.05.030. This article has 57 citations and is from a domain leading peer-reviewed journal.

Citations

  1. murashima2003solubilizationofcellulosomal pages 1-2
  2. datta2024enzymaticdegradationof pages 10-12
  3. minor2024agenomicanalysis pages 1-2
  4. dementiev2023structureandenzymatic pages 1-2
  5. doi2001theclostridiumcellulovorans pages 3-5
  6. park2001cohesindockerininteractionsof pages 2-4
  7. costa2021clostridiumcellulovoransproteomic pages 1-2
  8. cho2010cellulosomicprofilingproduced pages 1-2
  9. bianchetti2013structuredynamicsand pages 1-3
  10. murashima2003solubilizationofcellulosomal pages 2-4
  11. park2001cohesindockerininteractionsof pages 1-2
  12. https://doi.org/10.3389/fmicb.2024.1473396
  13. https://doi.org/10.1016/j.heliyon.2024.e24022
  14. https://doi.org/10.1002/prot.10298
  15. https://doi.org/10.1128/JB.183.18.5431-5435.2001
  16. https://doi.org/10.1002/1528-0691(2001
  17. https://doi.org/10.3389/fmicb.2021.674639
  18. https://doi.org/10.1002/prot.10298;
  19. https://doi.org/10.1128/JB.183.18.5431-5435.2001;
  20. https://doi.org/10.1016/j.jbiotec.2009.11.020;
  21. https://doi.org/10.1016/j.jmb.2013.05.030
  22. https://doi.org/10.1016/j.heliyon.2024.e24022;
  23. https://doi.org/10.1007/s00253-023-12684-0;
  24. https://doi.org/10.3389/fmicb.2021.674639;
  25. https://doi.org/10.1016/j.jbiotec.2009.11.020
  26. https://doi.org/10.1007/s00253-023-12684-0
  27. https://doi.org/10.1002/prot.10298,
  28. https://doi.org/10.1128/jb.183.18.5431-5435.2001,
  29. https://doi.org/10.1016/j.heliyon.2024.e24022,
  30. https://doi.org/10.3389/fmicb.2024.1473396,
  31. https://doi.org/10.1007/s00253-023-12684-0,
  32. https://doi.org/10.3389/fmicb.2021.674639,
  33. https://doi.org/10.1016/j.jbiotec.2009.11.020,
  34. https://doi.org/10.1016/j.jmb.2013.05.030,

📄 View Raw YAML

id: Q9RGE6
gene_symbol: engL
product_type: PROTEIN
status: DRAFT
taxon:
  id: NCBITaxon:1493
  label: Clostridium cellulovorans
description: >-
  EngL is a glycoside hydrolase family 9 (GH9) endoglucanase from the Clostridium
  cellulovorans cellulosome. It contains a GH9 catalytic domain and a C-terminal type
  I
  dockerin domain for cellulosome assembly. Unlike non-cellulosomal cellulases (e.g.,
  EngD),
  EngL lacks an intrinsic carbohydrate-binding module and relies on dockerin-mediated
  incorporation into the CbpA scaffoldin for cellulose targeting. EngL functions as
  an
  endo-β-1,4-glucanase that hydrolyzes internal β-1,4 linkages in cellulose.
aliases:
  - endoglucanase L
  - glucanase
existing_annotations:
  - term:
      id: GO:0000272
      label: polysaccharide catabolic process
    evidence_type: IEA
    original_reference_id: GO_REF:0000120
    review:
      summary: >-
        This annotation is correct. EngL is an endoglucanase that directly participates
        in polysaccharide (cellulose) catabolism by hydrolyzing β-1,4-glycosidic bonds.
      action: ACCEPT
      reason: >-
        As a GH9 endoglucanase, EngL directly catalyzes the breakdown of polysaccharides.
        This is a core function supported by biochemical characterization.
      supported_by:
        - reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
          supporting_text: "Endo‑β‑1,4‑glucanase acting on cellulose/CMC; within cellulosomes
            GH9s often exhibit processive behavior aiding crystalline cellulose degradation"
  - term:
      id: GO:0004553
      label: hydrolase activity, hydrolyzing O-glycosyl compounds
    evidence_type: IEA
    original_reference_id: GO_REF:0000120
    review:
      summary: >-
        This annotation is correct but generic. EngL has hydrolase activity on
        O-glycosyl bonds. More specific terms like GO:0008810 (cellulase activity)
        better capture its function.
      action: KEEP_AS_NON_CORE
      reason: >-
        This is a true parent term of the more specific cellulase activity. It's
        correct but not the most informative annotation for this enzyme.
      supported_by:
        - reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
          supporting_text: "EngL conforms to a GH9 endoglucanase with a C‑terminal
            dockerin"
  - term:
      id: GO:0005975
      label: carbohydrate metabolic process
    evidence_type: IEA
    original_reference_id: GO_REF:0000002
    review:
      summary: >-
        This annotation is too general. EngL participates in cellulose catabolism
        specifically, not general carbohydrate metabolism. More specific terms exist.
      action: MARK_AS_OVER_ANNOTATED
      reason: >-
        This parent term is overly broad. GO:0030245 (cellulose catabolic process)
        is the appropriate specific term for EngL function.
      supported_by:
        - reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
          supporting_text: "EngL is an endo‑β‑1,4‑glucanase that hydrolyzes internal
            β‑1,4 linkages in cellulose"
  - term:
      id: GO:0016787
      label: hydrolase activity
    evidence_type: IEA
    original_reference_id: GO_REF:0000043
    review:
      summary: >-
        This annotation is too general. EngL has hydrolase activity, but more specific
        terms like cellulase activity better describe its function.
      action: MARK_AS_OVER_ANNOTATED
      reason: >-
        This is a very broad parent term. GO:0008810 (cellulase activity) is the
        appropriate level of specificity for this GH9 enzyme.
      supported_by:
        - reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
          supporting_text: "GH9 enzymes in cellulosomes often display endo‑processivity"
  - term:
      id: GO:0016798
      label: hydrolase activity, acting on glycosyl bonds
    evidence_type: IEA
    original_reference_id: GO_REF:0000043
    review:
      summary: >-
        This annotation is correct but not as specific as GO:0008810. EngL acts
        on glycosyl bonds in cellulose.
      action: KEEP_AS_NON_CORE
      reason: >-
        This is a parent term of cellulase activity. It's correct but less informative
        than the more specific GO:0008810 annotation.
      supported_by:
        - reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
          supporting_text: "EngL conforms to a GH9 endoglucanase"
# NEW ANNOTATIONS
  - term:
      id: GO:0008810
      label: cellulase activity
    evidence_type: TAS
    original_reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
    review:
      summary: >-
        EngL is a GH9 endoglucanase with cellulase activity. This is the appropriate
        molecular function annotation for this enzyme.
      action: NEW
      reason: >-
        EngL is classified in GH9 family and functions as an endo-β-1,4-glucanase
        acting on cellulose substrates.
      supported_by:
        - reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
          supporting_text: "By family assignment and organismal context, EngL is an
            endo‑β‑1,4‑glucanase that hydrolyzes internal β‑1,4 linkages in cellulose
            and soluble cellulose analogs"
  - term:
      id: GO:0030245
      label: cellulose catabolic process
    evidence_type: TAS
    original_reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
    review:
      summary: >-
        EngL directly participates in cellulose degradation as a cellulosomal
        endoglucanase.
      action: NEW
      reason: >-
        As a GH9 cellulosomal enzyme, EngL contributes to the degradation of
        plant cell wall cellulose.
      supported_by:
        - reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
          supporting_text: "EngL contributes catalytic activity within the cellulosome
            to deconstruct plant cell walls"
  - term:
      id: GO:0043263
      label: cellulosome
    evidence_type: TAS
    original_reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
    review:
      summary: >-
        EngL is a cellulosomal enzyme that assembles into the cellulosome via its
        dockerin domain. This cellular component annotation is essential.
      action: NEW
      reason: >-
        EngL contains a type I dockerin domain and is incorporated into the CbpA-based
        cellulosome complex.
      supported_by:
        - reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
          supporting_text: "Secreted/extracellular and incorporated into the cellulosome
            via dockerin→cohesin binding to scaffoldin CbpA"
  - term:
      id: GO:0005576
      label: extracellular region
    evidence_type: TAS
    original_reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
    review:
      summary: >-
        EngL is secreted and functions extracellularly as part of the cellulosome.
      action: NEW
      reason: >-
        EngL is secreted and assembles into extracellular cellulosome complexes via
        its dockerin domain binding to CbpA cohesins.
      supported_by:
        - reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
          supporting_text: "Secreted, extracellular cellulosome-associated via dockerin–cohesin
            binding to CbpA"
  - term:
      id: GO:1990311
      label: type-I cohesin domain binding
    evidence_type: TAS
    original_reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
    review:
      summary: >-
        EngL contains a type I dockerin domain that binds to type I cohesins on the
        CbpA scaffoldin, enabling cellulosome assembly.
      action: NEW
      reason: >-
        The dockerin domain mediates binding to cohesin domains, which is essential
        for cellulosome incorporation.
      supported_by:
        - reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
          supporting_text: "Typical architecture: GH9 catalytic domain + C-terminal
            dockerin (marks cellulosome incorporation)"
  - term:
      id: GO:0044575
      label: cellulosome assembly
    evidence_type: NAS
    review:
      summary: Added to align core_functions with existing annotations.
      action: NEW
      reason: Core function term not present in existing_annotations.
references:
  - id: GO_REF:0000002
    title: Gene Ontology annotation through association of InterPro records with
      GO terms
    findings: []
  - id: GO_REF:0000043
    title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword 
      mapping
    findings: []
  - id: GO_REF:0000120
    title: Combined Automated Annotation using Multiple IEA Methods
    findings: []
  - id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
    title: Deep research summary for Q9RGE6/engL
    findings:
      - statement: EngL is a GH9 endoglucanase with cellulase activity
        supporting_text: "EngL conforms to a GH9 endoglucanase with a C‑terminal dockerin"
      - statement: EngL functions in cellulose degradation within the 
          cellulosome
        supporting_text: "EngL contributes catalytic activity within the cellulosome
          to deconstruct plant cell walls"
      - statement: EngL is secreted and incorporated into the extracellular 
          cellulosome
        supporting_text: "Secreted, extracellular cellulosome-associated via dockerin–cohesin
          binding to CbpA"
core_functions:
  - description: >-
      EngL is a GH9 endoglucanase that catalyzes the hydrolysis of β-1,4-glycosidic
      bonds in cellulose. It functions as part of the cellulosome to efficiently
      degrade plant cell wall polysaccharides.
    molecular_function:
      id: GO:0008810
      label: cellulase activity
    directly_involved_in:
      - id: GO:0030245
        label: cellulose catabolic process
    locations:
      - id: GO:0005576
        label: extracellular region
    in_complex:
      id: GO:0043263
      label: cellulosome
    supported_by:
      - reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
        supporting_text: "By family assignment and organismal context, EngL is an
          endo‑β‑1,4‑glucanase that hydrolyzes internal β‑1,4 linkages in cellulose"
  - description: >-
      EngL contains a type I dockerin domain that binds to cohesins on the CbpA
      scaffoldin, enabling its incorporation into the cellulosome complex.
    molecular_function:
      id: GO:1990311
      label: type-I cohesin domain binding
    directly_involved_in:
      - id: GO:0044575
        label: cellulosome assembly
    locations:
      - id: GO:0005576
        label: extracellular region
    in_complex:
      id: GO:0043263
      label: cellulosome
    supported_by:
      - reference_id: file:CLOCL/Q9RGE6/Q9RGE6-deep-research-falcon.md
        supporting_text: "Typical architecture: GH9 catalytic domain + C-terminal
          dockerin (marks cellulosome incorporation)"
proposed_new_terms: []
suggested_questions:
  - question: >-
      What are the kinetic parameters of EngL on different cellulose substrates?
    experts: []
  - question: >-
      How does EngL's activity compare to EngK within the cellulosome complex?
    experts: []
suggested_experiments:
  - description: >-
      Detailed kinetic characterization of EngL on crystalline and amorphous
      cellulose substrates to determine Km, kcat, and processivity.
    hypothesis: >-
      EngL may exhibit processive behavior characteristic of some GH9 enzymes
      when part of the cellulosome complex.
  - description: >-
      Synergy assays with EngL and other cellulosomal enzymes (EngK, ExgS) on
      crystalline cellulose.
    hypothesis: >-
      EngL provides complementary endo-activity that synergizes with exoglucanases
      for efficient cellulose degradation.