celA

UniProt ID: A3DC29
Organism: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372)
Review Status: DRAFT
📝 Provide Detailed Feedback

Gene Description

CelA (also known as Cel8A or Endoglucanase A) is a secreted endo-beta-1,4-glucanase (EC 3.2.1.4) belonging to glycosyl hydrolase family 8 (GH8). It catalyzes the endohydrolysis of internal (1->4)-beta-D-glucosidic linkages in cellulose, lichenin, and cereal beta-D-glucans. The enzyme contains a GH8 catalytic domain with an open cleft/groove architecture characteristic of endo-acting glycosidases, plus a C-terminal type I dockerin domain (residues 411-477) that mediates incorporation into the cellulosome complex via cohesin-dockerin interactions with the primary scaffoldin CipA. CelA is one of the major catalytic components of the C. thermocellum cellulosome, consistently identified in proteomic surveys of purified cellulosomes grown on Avicel. The enzyme lacks its own carbohydrate-binding module (CBM) and relies on the CBM of CipA for targeting to insoluble cellulose substrates. Structural studies at atomic resolution (0.94 A) have characterized the active site, identifying residues E95 (proton donor) and D152 (nucleophile) as key catalytic residues.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0000272 polysaccharide catabolic process
IEA
GO_REF:0000120
ACCEPT
Summary: CelA is an endoglucanase that participates in cellulose degradation. While cellulose is a polysaccharide, the more specific term GO:0030245 (cellulose catabolic process) is already annotated and better represents the biological process. This broader term is acceptable but redundant given the more specific annotation. The deep research review confirms CelA's role in cellulose breakdown as part of the cellulosome complex [celA-deep-research-falcon.md].
Reason: This annotation is correct but overly broad. CelA specifically degrades cellulose, which is a type of polysaccharide. The term is inferred from InterPro domain annotations (IPR002105 Dockerin, IPR016134 Dockerin domain, IPR036439 Dockerin superfamily) which correctly associate the protein with polysaccharide degradation. Since GO:0030245 (cellulose catabolic process) is also annotated and is more specific, this broader term is somewhat redundant but not incorrect. For IEA annotations, broader parent terms are acceptable to retain alongside more specific ones.
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
IEA
GO_REF:0000120
ACCEPT
Summary: CelA is a glycosyl hydrolase (GH8 family) that hydrolyzes O-glycosyl compounds, specifically (1->4)-beta-D-glucosidic linkages. This term is correct but a parent of the more specific GO:0008810 (cellulase activity) which is also annotated.
Reason: This annotation is correct. CelA belongs to GH8 and catalyzes hydrolysis of beta-glucosidic bonds in cellulose. The annotation is inferred from ARBA rule ARBA00027782 and InterPro domains IPR002037 (Glyco_hydro_8) and IPR002105 (Dockerin). While GO:0008810 (cellulase activity) is more specific, this parent term is acceptable for an IEA annotation and provides a useful grouping term.
GO:0005975 carbohydrate metabolic process
IEA
GO_REF:0000002
ACCEPT
Summary: CelA participates in carbohydrate metabolism through its role in cellulose degradation. This is a high-level parent term of more specific annotations already present.
Reason: This annotation is correct but very broad. It is inferred from InterPro records IPR002037 (Glyco_hydro_8), IPR008928 (6-hairpin_glycosidase_sf), and IPR012341 (6hp_glycosidase-like_sf). While the more specific GO:0030245 (cellulose catabolic process) better captures CelA's function, this broader term is not incorrect and acceptable for an IEA annotation derived from domain membership.
GO:0008810 cellulase activity
IEA
GO_REF:0000003
ACCEPT
Summary: CelA (Endoglucanase A) is definitively a cellulase with EC 3.2.1.4 activity. It catalyzes the endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin, and cereal beta-D-glucans. This is the core molecular function of the enzyme and the most specific appropriate MF term.
Reason: This annotation accurately represents the core molecular function of CelA. The enzyme is explicitly annotated as EC 3.2.1.4 (endoglucanase/cellulase) in UniProt, and the GO term definition matches exactly: "Catalysis of the endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin and cereal beta-D-glucans." The annotation is derived from the EC number mapping (GO_REF:0000003). Proteomic studies identify CelA/Cel8A as a major catalytic component of the cellulosome with endo-beta-1,4-glucanase activity [PMID:16127726, PMID:17644599]. Structural studies confirm the open cleft architecture characteristic of endo-acting glycosidases [PMID:8805535].
Supporting Evidence:
PMID:16127726
Ten of the components were previously known: the structural protein CipA, the endo-glucanases Cel8A, Cel5G, Cel9N, the cellobiohydrolases Cbh9A, Cel9K, Cel48S, the xylanases Xyn10C, Xyn10Z, and the chitinase Chi18A
PMID:17644599
In total, 41 cellulosomal proteins were detected, including 36 type I dockerin-containing proteins
GO:0016787 hydrolase activity
IEA
GO_REF:0000043
ACCEPT
Summary: CelA is a hydrolase that cleaves glycosidic bonds. This is a very broad parent term of the more specific cellulase activity already annotated.
Reason: This annotation is correct but extremely broad. CelA is indeed a hydrolase (EC 3.2.1.4), but the term GO:0008810 (cellulase activity) provides much more informative functional information. This IEA annotation derived from UniProtKB keyword KW-0378 (Hydrolase) is acceptable but provides minimal functional insight beyond what is captured by more specific terms.
GO:0016798 hydrolase activity, acting on glycosyl bonds
IEA
GO_REF:0000043
ACCEPT
Summary: CelA hydrolyzes glycosyl bonds, specifically (1->4)-beta-D-glucosidic linkages. This term is a parent of GO:0004553 (hydrolase activity, hydrolyzing O-glycosyl compounds) and GO:0008810 (cellulase activity).
Reason: This annotation is correct. CelA acts on glycosyl bonds (specifically beta-1,4 glucosidic bonds in cellulose). The annotation is inferred from UniProtKB keyword KW-0326 (Glycosidase). While more specific terms are available and already annotated, this intermediate-level term is acceptable for an IEA.
GO:0030245 cellulose catabolic process
IEA
GO_REF:0000043
ACCEPT
Summary: CelA is a cellulose-degrading endoglucanase that directly participates in cellulose catabolism as part of the cellulosome complex. This is the appropriate biological process term for this enzyme.
Reason: This annotation accurately represents CelA's biological role. The enzyme catalyzes the endohydrolysis of cellulose chains, producing new chain ends for exoglucanases and contributing directly to cellulose breakdown. The annotation is derived from UniProtKB keyword KW-0136 (Cellulose degradation). Proteomic analyses confirm CelA is a major component of the cellulosome, the multi-enzyme complex responsible for cellulose catabolism in C. thermocellum [PMID:16127726, PMID:17644599, PMID:38234915].
Supporting Evidence:
PMID:38234915
the anchoring of various cellulolytic enzymes, i.e., endoglucanases, cellobiohydrolases, xylanases, to a scaffoldin makes it a complex structure in which different enzymes work synergistically to attack heterogeneous, insoluble cellulose substrates
file:ACET2/celA/celA-deep-research-falcon.md
CelA/Cel8A is an endo-beta-1,4-glucanase (EC 3.2.1.4) that cleaves internal bonds in cellulose chains, producing new chain ends for exoglucanases
GO:0043263 cellulosome
IEA
GO_REF:0000120
NEW
Summary: CelA contains a type I dockerin domain that mediates its incorporation into the cellulosome complex. The cellulosome (GO:0043263) is the appropriate cellular component annotation for this protein.
Reason: CelA has a well-characterized dockerin domain (residues 411-477) that binds to type I cohesins on the CipA scaffoldin. Proteomic surveys consistently identify CelA as a component of purified cellulosomes. This cellular component annotation should be added, as the protein functions as part of the cellulosome complex. The InterPro domains IPR002105 (Dockerin_1_rpt), IPR016134 (Dockerin_dom), and IPR036439 (Dockerin_dom_sf) all indicate cellulosome localization. Note: This annotation may already exist via IEA but was not present in the provided GOA file.
Proposed replacements: cellulosome
Supporting Evidence:
PMID:16127726
To identify the predominant catalytic components, cellulosomes were purified and the components were separated by an adapted two-dimensional gel electrophoresis technique. The apparent major spots were identified by MALDI-TOF/TOF
PMID:38234915
Cellulosome comprises two major subunits: long flexible scaffoldin, which forms the center part containing specific binding sites cohesions, and the enzymes containing a dockerin module that binds to cohesion

Core Functions

Endo-beta-1,4-glucanase activity as part of the cellulosome complex, catalyzing internal cleavage of cellulose chains to produce new chain ends for exoglucanases.

Molecular Function:
cellulase activity
Directly Involved In:
Cellular Locations:
Supporting Evidence:
  • PMID:16127726
    Ten of the components were previously known: the structural protein CipA, the endo-glucanases Cel8A, Cel5G, Cel9N
  • PMID:17644599
    In total, 41 cellulosomal proteins were detected, including 36 type I dockerin-containing proteins

References

Gene Ontology annotation through association of InterPro records with GO terms
Gene Ontology annotation based on Enzyme Commission mapping
Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
Combined Automated Annotation using Multiple IEA Methods
Sequence of a cellulase gene of the thermophilic bacterium Clostridium thermocellum
  • Original cloning and sequencing of the celA gene encoding Endoglucanase A
    "The nucleotide sequence of the celA gene, encoding the extracellular endoglucanase A of Clostridium thermocellum, was determined and compared with the NH2-terminal amino acid sequence of the purified enzyme"
The crystal structure of endoglucanase CelA, a family 8 glycosyl hydrolase from Clostridium thermocellum
  • Crystal structure at 1.65 A resolution confirms GH8 family fold with active site cleft
    "The protein folds into a regular (alpha/alpha)6 barrel formed by six inner and six outer alpha helices. Cello-oligosaccharides bind to an acidic cleft containing at least five D-glucosyl-binding subsites"
Atomic (0.94 A) resolution structure of an inverting glycosidase in complex with substrate
  • High-resolution structure reveals active site and catalytic mechanism
    "The crystal structure of Clostridium thermocellum endoglucanase CelA in complex with cellopentaose has been determined at 0.94 A resolution. The oligosaccharide occupies six D-glucosyl-binding subsites"
Functional subgenomics of Clostridium thermocellum cellulosomal genes
  • CelA/Cel8A identified as one of the major catalytic components of the cellulosome
    "Ten of the components were previously known: the structural protein CipA, the endo-glucanases Cel8A, Cel5G, Cel9N, the cellobiohydrolases Cbh9A, Cel9K, Cel48S, the xylanases Xyn10C, Xyn10Z, and the chitinase Chi18A"
  • Seventy-one putative cellulosomal genes detected in C. thermocellum genome
    "Seventy-one putative cellulosomal genes were detected. One third of these ORFs may be involved in cellulose hydrolysis"
Global view of the Clostridium thermocellum cellulosome revealed by quantitative proteomic analysis
  • Quantitative proteomic analysis of cellulosome composition
    "In total, 41 cellulosomal proteins were detected, including 36 type I dockerin-containing proteins, which count among them all but three of the known docking components and 16 new subunits"
Enzymatic degradation of cellulose in soil - A review
  • 2024 review describing cellulosome architecture including CelA as endoglucanase component
    "the anchoring of various cellulolytic enzymes, i.e., endoglucanases, cellobiohydrolases, xylanases, to a scaffoldin makes it a complex structure in which different enzymes work synergistically to attack heterogeneous, insoluble cellulose substrates"
file:ACET2/celA/celA-deep-research-falcon.md
Deep research review of celA gene function
  • Comprehensive literature synthesis on CelA function and cellulosome localization
    "CelA/Cel8A is an endo-beta-1,4-glucanase (EC 3.2.1.4) that cleaves internal bonds in cellulose chains, producing new chain ends for exoglucanases. Structural evidence shows a groove/open cleft typical of endo-acting enzymes"

Suggested Questions for Experts

Q: What is the precise substrate specificity of CelA compared to other GH8 endoglucanases in the cellulosome? Does it show any preference for specific regions of crystalline vs amorphous cellulose?

Q: How does CelA synergize with exoglucanases (like CelS/Cel48A) and other endoglucanases in the cellulosome to achieve efficient cellulose degradation?

Suggested Experiments

Experiment: Compare the kinetic parameters (kcat, Km) of CelA in free form versus when bound to mini-cellulosomes containing different combinations of other cellulases. Use defined cellulose substrates (Avicel, filter paper, CMC, phosphoric acid swollen cellulose) to assess potential synergistic effects.

Hypothesis: CelA shows optimal activity when incorporated into the cellulosome complex compared to free enzyme, due to proximity effects with other cellulases.

Type: enzyme kinetics

Tags

cellulosome

📚 Additional Documentation

Deep Research Falcon

(celA-deep-research-falcon.md)

provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2025-12-26T17:03:11.950390'
end_time: '2025-12-26T17:08:34.462065'
duration_seconds: 322.51
template_file: templates/gene_research_go_focused.md
template_variables:
organism: ACET2
gene_id: celA
gene_symbol: celA
uniprot_accession: A3DC29
protein_description: 'RecName: Full=Endoglucanase A; Short=EGA; EC=3.2.1.4; AltName:
Full=Cellulase A; AltName: Full=Endo-1,4-beta-glucanase; Flags: Precursor;'
gene_info: Name=celA; OrderedLocusNames=Cthe_0269;
organism_full: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322
/ NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
protein_family: Belongs to the glycosyl hydrolase 8 (cellulase D) family.
protein_domains: 6-hairpin_glycosidase_sf. (IPR008928); 6hp_glycosidase-like_sf.
(IPR012341); Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134); Dockerin_dom_sf.
(IPR036439)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 11


Question

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: A3DC29
  • Protein Description: RecName: Full=Endoglucanase A; Short=EGA; EC=3.2.1.4; AltName: Full=Cellulase A; AltName: Full=Endo-1,4-beta-glucanase; Flags: Precursor;
  • Gene Information: Name=celA; OrderedLocusNames=Cthe_0269;
  • Organism (full): Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  • Protein Family: Belongs to the glycosyl hydrolase 8 (cellulase D) family.
  • Key Domains: 6-hairpin_glycosidase_sf. (IPR008928); 6hp_glycosidase-like_sf. (IPR012341); Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134); Dockerin_dom_sf. (IPR036439)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "celA" matches the protein description above
  2. Verify the organism is correct: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'celA' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene celA (gene ID: celA, UniProt: A3DC29) in ACET2.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output

Question: You are an expert researcher providing comprehensive, well-cited information.

Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies

Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: A3DC29
  • Protein Description: RecName: Full=Endoglucanase A; Short=EGA; EC=3.2.1.4; AltName: Full=Cellulase A; AltName: Full=Endo-1,4-beta-glucanase; Flags: Precursor;
  • Gene Information: Name=celA; OrderedLocusNames=Cthe_0269;
  • Organism (full): Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  • Protein Family: Belongs to the glycosyl hydrolase 8 (cellulase D) family.
  • Key Domains: 6-hairpin_glycosidase_sf. (IPR008928); 6hp_glycosidase-like_sf. (IPR012341); Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134); Dockerin_dom_sf. (IPR036439)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "celA" matches the protein description above
  2. Verify the organism is correct: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'celA' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene celA (gene ID: celA, UniProt: A3DC29) in ACET2.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Plan and verification
- Identity verification: The target is celA (also called Cel8A/Endoglucanase A), a glycoside hydrolase family 8 (GH8) enzyme from Acetivibrio thermocellus (formerly Clostridium thermocellum). In classical proteomics of purified C. thermocellum cellulosomes, CelA/Cel8A is explicitly identified as an endoglucanase and listed among the major catalytic components; domain annotation in those datasets is GH8 with a type I dockerin (GH8-Doc1), consistent with the UniProt A3DC29 record listing an endoglucanase with dockerin motifs in a GH8 fold. Organism is correct for those identifications (C. thermocellum, synonymous with A. thermocellus). The symbol “celA” is potentially ambiguous across species; here we restrict all claims to C. thermocellum/A. thermocellus CelA/Cel8A as supported by cited sources (zverlov2005functionalsubgenomicsof pages 1-2, zverlov2005functionalsubgenomicsof pages 2-4, gold2007globalviewof pages 2-3, datta2024enzymaticdegradationof pages 10-12).

Comprehensive research report

1) Key concepts and definitions
- Protein/gene: celA (Cel8A; Endoglucanase A) encodes a modular, extracellular cellulase deployed within the C. thermocellum/A. thermocellus cellulosome. It belongs to GH family 8 and carries a type I dockerin for incorporation into the primary scaffoldin CipA through cohesin–dockerin interactions (notation GH8-Doc1). Proteomic analyses of purified cellulosomes identify CelA/Cel8A among the major catalytic subunits under cellulose growth (Avicel) (zverlov2005functionalsubgenomicsof pages 2-4, zverlov2005functionalsubgenomicsof pages 1-2, gold2007globalviewof pages 2-3).
- Catalytic activity: Endo-β-1,4-glucanase (EC 3.2.1.4), which cleaves internal β-1,4-glycosidic bonds along cellulose chains; structural descriptions highlight an open cleft/groove at the active site, consistent with endo-acting specificity (bras2012structureandfunction pages 56-59).
- Cellulosome context: The cellulosome is a supramolecular extracellular complex anchored to the cell surface via scaffoldin modules. CipA carries multiple type I cohesins for enzymatic dockerins and a type II dockerin for cell-surface anchoring, together with a CBM that targets insoluble cellulose, enabling synergistic action of endoglucanases, exoglucanases, and hemicellulases (datta2024enzymaticdegradationof pages 10-12).

2) Recent developments and latest research (2023–2024 prioritized)
- Current perspective: 2024 review articles reiterate that much modern understanding of cellulosome architecture and enzyme synergy derives from C. thermocellum; these reviews emphasize cohesin–dockerin specificity, scaffoldin architecture, and the advantage of multi-enzyme colocalization for crystalline cellulose depolymerization. They also underscore cellulosome engineering and designer cellulosomes for biomass conversion as active directions (datta2024enzymaticdegradationof pages 10-12). While this recent literature discusses the C. thermocellum cellulosome broadly, it does not present new CelA-specific kinetics; instead, it consolidates the role of CelA/Cel8A as a cellulosomal endoglucanase in the canonical complex.

3) Current applications and real-world implementations
- Biomass conversion: The cellulosome’s architecture—CBM-mediated substrate targeting and synergistic arrays of catalytic subunits—serves as a template for industrial enzyme cocktails and designer cellulosomes aimed at enhancing saccharification efficiency. The 2024 synthesis emphasizes that cellulosome-inspired designs can outperform free enzyme systems due to proximity and coordinated action, a concept derived largely from studies of C. thermocellum (datta2024enzymaticdegradationof pages 10-12).

4) Expert opinions and analysis from authoritative sources
- Major-component status: Proteomic cataloging established CelA/Cel8A among the prominent catalytic constituents of the native cellulosome, supporting its central role in cellulose deconstruction. Zverlov et al. identified Cel8A within a set of 13 major extracellular catalytic components; Gold and Martin quantified its presence in Avicel-grown cellulosomes, reinforcing its importance (zverlov2005functionalsubgenomicsof pages 1-2, gold2007globalviewof pages 2-3).
- Structural-functional insights: Structural characterization of Cel8A’s endo-acting cleft and arrangements on mini-cellulosomes—three Cel8A enzymes bound to consecutive cohesins, projecting catalytic domains antiparallel—offers a mechanistic rationale for enhanced substrate accessibility within scaffoldin arrays (bras2012structureandfunction pages 56-59). These observations support the design principle that specific spatial arrangements and relative rigidity of linkers can modulate enzymatic synergy.

5) Relevant statistics and data from recent and foundational studies
- Proteomic quantitation: In quantitative proteomics of purified cellulosomes from Avicel-grown cultures, CelA was identified with 14 unique peptides and an emPAI value of 2.46, indicating substantive abundance in the complex under these conditions. Cellulosomes were affinity-purified and analyzed by LC-MS with 15N metabolic labeling for quantitation (gold2007globalviewof pages 2-3). Earlier proteomics also listed Cel8A as one of 13 major catalytic components of the cellulosome prepared from cellulose-grown cultures (zverlov2005functionalsubgenomicsof pages 1-2).
- Domain annotation and size: CelA/Cel8A is annotated as GH8-Doc1 (GH8 catalytic domain plus type I dockerin) with an approximate molecular mass near 53 kDa in proteomic tables (zverlov2005functionalsubgenomicsof pages 2-4).

Functional annotation of celA (Cel8A) in A. thermocellus
- Primary function and substrate specificity: CelA/Cel8A is an endo-β-1,4-glucanase (EC 3.2.1.4) that cleaves internal bonds in cellulose chains, producing new chain ends for exoglucanases. Structural evidence shows a groove/open cleft typical of endo-acting enzymes (bras2012structureandfunction pages 56-59, datta2024enzymaticdegradationof pages 10-12).
- Domain architecture: GH8 catalytic module fused to a type I dockerin (GH8-Doc1). The dockerin targets type I cohesins on the primary scaffoldin CipA; in cellulosomal particles, CelA lacks its own CBM and relies on CipA’s CBM for cellulose targeting. Reported proteomic mass for CelA is ~53 kDa (zverlov2005functionalsubgenomicsof pages 2-4, datta2024enzymaticdegradationof pages 10-12).
- Localization and complex assembly: CelA is secreted and incorporated into the extracellular cellulosome via dockerin–cohesin binding to CipA. The cellulosome is subsequently anchored to the cell surface via type II cohesin–dockerin interactions with anchoring scaffoldins; CipA’s CBM binds crystalline cellulose, positioning CelA for action in a synergistic enzyme array (datta2024enzymaticdegradationof pages 10-12, zverlov2005functionalsubgenomicsof pages 1-2, gold2007globalviewof pages 2-3).
- Evidence for prominence: CelA is consistently detected as a significant component of the C. thermocellum cellulosome in proteomic surveys—both as a major component (Zverlov et al.) and with quantifiable abundance (14 peptides; emPAI 2.46) under Avicel growth (Gold and Martin), supporting a primary role in cellulose deconstruction (zverlov2005functionalsubgenomicsof pages 1-2, gold2007globalviewof pages 2-3).
- Structural/assembly insights: Mini-cellulosome studies show multiple Cel8A catalytic cores bound on consecutive cohesins with antiparallel orientation, and a relatively rigid inter-domain linker, offering a physical basis for enhanced surface coverage and catalytic synergy on insoluble substrates (bras2012structureandfunction pages 56-59).

Limitations and open points
- Kinetic parameters: Specific kcat, Km, and product profiles for A. thermocellus CelA/Cel8A were not reported in the recent 2023–2024 sources here; quantitative kinetics for this precise isozyme were not present in the retrieved excerpts and remain to be detailed from primary enzymology studies (datta2024enzymaticdegradationof pages 10-12).

Embedded summary table
| Attribute | Evidence-based summary | Organism / Strain | Domain architecture | Function / EC | Substrate specificity | Localization / complex | Evidence of abundance (quant.) | Notable structural / assembly insights | Recent 2023–2024 context |
|---|---|---|---|---|---|---|---:|---|---|
| Name / synonyms | Listed as CelA (also called Cel8A / Endoglucanase A) among major cellulosomal components (identified in purified cellulosome proteome) (zverlov2005functionalsubgenomicsof pages 1-2, zverlov2005functionalsubgenomicsof pages 2-4) | Clostridium thermocellum (Acetivibrio thermocellus) (zverlov2005functionalsubgenomicsof pages 1-2, gold2007globalviewof pages 2-3) | — | — | — | — | — | — | — |
| Organism / Strain | Component of the C. thermocellum cellulosome; identified from cellulose-grown cultures (zverlov2005functionalsubgenomicsof pages 1-2, gold2007globalviewof pages 2-3) | Clostridium thermocellum (Acetivibrio thermocellus) (zverlov2005functionalsubgenomicsof pages 1-2) | — | — | — | — | — | — | — |
| Domain architecture | Annotated as GH8 catalytic module plus a type-I dockerin (notation: GH8-Doc1); module analysis based on genome/proteome ORF annotation (zverlov2005functionalsubgenomicsof pages 2-4) | C. thermocellum (zverlov2005functionalsubgenomicsof pages 2-4) | GH8 catalytic domain + Type I dockerin (GH8-Doc1) (zverlov2005functionalsubgenomicsof pages 2-4) | — | — | — | — | — | — |
| Molecular mass | Reported approximate mass ~53 kDa in proteomic table (zverlov2005functionalsubgenomicsof pages 2-4) | C. thermocellum (zverlov2005functionalsubgenomicsof pages 2-4) | GH8-Doc1 architecture consistent with ~53 kDa species (zverlov2005functionalsubgenomicsof pages 2-4) | — | — | — | — | — | — |
| Catalytic function (EC) | Described as an endo-β-1,4-glucanase (endo-activity consistent with EC 3.2.1.4) based on structural/functional description (bras2012structureandfunction pages 56-59) | C. thermocellum (bras2012structureandfunction pages 56-59) | GH8 (endo-β-1,4-glucanase) module (bras2012structureandfunction pages 56-59) | Endo-β-1,4-glucanase (EC 3.2.1.4) — described in literature (bras2012structureandfunction pages 56-59) | Cleaves internal β-1,4 glycosidic bonds in cellulose (endo-activity) (bras2012structureandfunction pages 56-59) | Cellulosomal enzyme (extracellular, scaffoldin-bound) (datta2024enzymaticdegradationof pages 10-12, zverlov2005functionalsubgenomicsof pages 1-2) | — | Groove-shaped binding region and open cleft consistent with endo action (structural description) (bras2012structureandfunction pages 56-59) | Reviews and analyses of cellulosome architecture and enzyme synergy remain active (general 2024 review context) (datta2024enzymaticdegradationof pages 10-12) |
| Substrate specificity | Inferred/observed endo-acting on cellulose (internal chain cleavage); described as endo-β-1,4-glucanase (bras2012structureandfunction pages 56-59) | C. thermocellum (bras2012structureandfunction pages 56-59) | GH8 catalytic signature implies activity on β-1,4-glucans (bras2012structureandfunction pages 56-59) | EC 3.2.1.4 (endo-glucanase) (bras2012structureandfunction pages 56-59) | Internal cleavage of cellulose chains (bras2012structureandfunction pages 56-59) | Functions as part of cellulosome on insoluble cellulose substrates (datta2024enzymaticdegradationof pages 10-12) | — | Endo-type groove and open cleft architecture favor internal bond hydrolysis (bras2012structureandfunction pages 56-59) | Cellulosome synergy (scaffoldin CBM and multi-enzyme arrays) emphasized in recent overviews (datta2024enzymaticdegradationof pages 10-12) |
| Localization / complex | Extracellular, cellulosome-associated enzyme that binds scaffoldin (e.g., CipA) via dockerin–cohesin interactions; isolated from purified cellulosome fractions (datta2024enzymaticdegradationof pages 10-12, zverlov2005functionalsubgenomicsof pages 1-2, gold2007globalviewof pages 2-3) | C. thermocellum (datta2024enzymaticdegradationof pages 10-12, gold2007globalviewof pages 2-3) | Dockerin (type I) mediates Cohesin binding to CipA scaffoldin (zverlov2005functionalsubgenomicsof pages 2-4, datta2024enzymaticdegradationof pages 10-12) | — | Acts on cellulose in cellulosome context (datta2024enzymaticdegradationof pages 10-12) | Localized to extracellular cellulosomal particle; may be positioned by scaffoldin cohesins (datta2024enzymaticdegradationof pages 10-12) | — | Identified in mini‑cellulosome arrangements where multiple Cel8A enzymes bind consecutive cohesins (bras2012structureandfunction pages 56-59) | Cellulosome assembly and anchoring mechanisms remain a focus of recent studies/reviews (datta2024enzymaticdegradationof pages 10-12) |
| Evidence of abundance (proteomics metrics) | Detected in purified Avicel-grown cellulosomes with 14 unique peptides and an emPAI value of 2.46; included among major cellulosomal components (gold2007globalviewof pages 2-3, zverlov2005functionalsubgenomicsof pages 1-2) | C. thermocellum cellulosome (gold2007globalviewof pages 2-3) | — | — | — | Present in extracellular purified cellulosome fraction (gold2007globalviewof pages 2-3) | 14 peptides identified; emPAI = 2.46 (gold2007globalviewof pages 2-3) | Listed among 13 major catalytic components detected in cellulosome proteome (zverlov2005functionalsubgenomicsof pages 1-2) | Proteomic quantification confirms CelA as a substantive component of the cellulosome under cellulose growth (gold2007globalviewof pages 2-3) |
| Notable structural / assembly insights | Structural descriptions: groove-shaped substrate-binding region; in mini-cellulosome three Cel8A enzymes were observed bound to consecutive cohesins, projecting catalytic cores antiparallel to enhance substrate access; linker proposed relatively rigid (bras2012structureandfunction pages 56-59) | C. thermocellum (bras2012structureandfunction pages 56-59) | GH8 catalytic fold with dockerin-mediated attachment (bras2012structureandfunction pages 56-59, zverlov2005functionalsubgenomicsof pages 2-4) | Endo mechanism supported by open cleft/groove architecture (bras2012structureandfunction pages 56-59) | Optimized for internal chain cleavage; spatial arrangement on scaffoldin increases accessibility (bras2012structureandfunction pages 56-59) | Cellulosomal placement (on CipA) positions enzyme for cooperative action (datta2024enzymaticdegradationof pages 10-12, bras2012structureandfunction pages 56-59) | — | Groove/open cleft; antiparallel arrangement when multiple Cel8A bound to consecutive cohesins (bras2012structureandfunction pages 56-59) | Design principles of cellulosomes (multi-enzyme arrays for synergy) are reinforced in recent reviews (datta2024enzymaticdegradationof pages 10-12) |
| Recent 2023–2024 context | No direct new CelA-specific kinetic or expression studies in these contexts; broader 2024 reviews emphasize ongoing work on cellulosome diversity, composition, and engineering for biomass conversion (datta2024enzymaticdegradationof pages 10-12) | C. thermocellum and related Acetivibrio species discussed in genomic and review analyses (datta2024enzymaticdegradationof pages 10-12) | Domain features (dockerin/cohesin interactions) remain central to engineering/design strategies (datta2024enzymaticdegradationof pages 10-12, zverlov2005functionalsubgenomicsof pages 2-4) | — | — | Cellulosome architecture and enzyme complements are active research targets for biofuel/bioprocess applications (datta2024enzymaticdegradationof pages 10-12) | Proteomic identification of CelA in classical studies underpins its inclusion in modern cellulosome models (gold2007globalviewof pages 2-3, zverlov2005functionalsubgenomicsof pages 1-2) | — | Emphasis on exploiting cellulosome enzyme arrays for improved saccharification and synthetic designer cellulosomes in recent reviews (datta2024enzymaticdegradationof pages 10-12) |

Table: Compact, evidence-linked summary of CelA (Cel8A/Endoglucanase A) in the C. thermocellum cellulosome, showing domain architecture, function, localization, proteomics quantitation, structural insights, and recent review context with source citations. This table compiles only claims supported by the provided context IDs.

References (with URLs and dates)
- Zverlov VV, Kellermann J, Schwarz WH. Functional subgenomics of Clostridium thermocellum cellulosomal genes: Identification of the major catalytic components in the extracellular complex and detection of three new enzymes. PROTEOMICS. 2005 Sep;5:3646–3653. https://doi.org/10.1002/pmic.200401199 (zverlov2005functionalsubgenomicsof pages 1-2, zverlov2005functionalsubgenomicsof pages 2-4)
- Gold ND, Martin VJJ. Global View of the Clostridium thermocellum Cellulosome Revealed by Quantitative Proteomic Analysis. J Bacteriol. 2007 Oct;189:6787–6795. https://doi.org/10.1128/JB.00882-07 (gold2007globalviewof pages 2-3)
- Brás JLA. Structure and function relationships in novel cellulosomal enzymes and cohesin–dockerin complexes. 2012. (Structural insights into Cel8A endo-cleft; mini-cellulosome arrangements.) (bras2012structureandfunction pages 56-59)
- Datta R. Enzymatic degradation of cellulose in soil: A review. Heliyon. 2024 Jan;10:e24022. https://doi.org/10.1016/j.heliyon.2024.e24022 (cellulosome architecture; CBM/cohesin–dockerin organization; 2024 perspective) (datta2024enzymaticdegradationof pages 10-12)

References

  1. (zverlov2005functionalsubgenomicsof pages 1-2): Vladimir V. Zverlov, Josef Kellermann, and Wolfgang H. Schwarz. Functional subgenomics of clostridium thermocellum cellulosomal genes: identification of the major catalytic components in the extracellular complex and detection of three new enzymes. PROTEOMICS, 5:3646-3653, Sep 2005. URL: https://doi.org/10.1002/pmic.200401199, doi:10.1002/pmic.200401199. This article has 194 citations and is from a peer-reviewed journal.

  2. (zverlov2005functionalsubgenomicsof pages 2-4): Vladimir V. Zverlov, Josef Kellermann, and Wolfgang H. Schwarz. Functional subgenomics of clostridium thermocellum cellulosomal genes: identification of the major catalytic components in the extracellular complex and detection of three new enzymes. PROTEOMICS, 5:3646-3653, Sep 2005. URL: https://doi.org/10.1002/pmic.200401199, doi:10.1002/pmic.200401199. This article has 194 citations and is from a peer-reviewed journal.

  3. (gold2007globalviewof pages 2-3): Nicholas D. Gold and Vincent J. J. Martin. Global view of the clostridium thermocellum cellulosome revealed by quantitative proteomic analysis. Journal of Bacteriology, 189:6787-6795, Oct 2007. URL: https://doi.org/10.1128/jb.00882-07, doi:10.1128/jb.00882-07. This article has 271 citations and is from a peer-reviewed journal.

  4. (datta2024enzymaticdegradationof pages 10-12): Rahul Datta. Enzymatic degradation of cellulose in soil: a review. Heliyon, 10:e24022, Jan 2024. URL: https://doi.org/10.1016/j.heliyon.2024.e24022, doi:10.1016/j.heliyon.2024.e24022. This article has 113 citations and is from a peer-reviewed journal.

  5. (bras2012structureandfunction pages 56-59): JLA Brás. Structure and function relationships in novel cellulosomal enzymes and cohesindockerin complexes. Unknown journal, 2012.

Citations

  1. bras2012structureandfunction pages 56-59
  2. datta2024enzymaticdegradationof pages 10-12
  3. gold2007globalviewof pages 2-3
  4. zverlov2005functionalsubgenomicsof pages 1-2
  5. zverlov2005functionalsubgenomicsof pages 2-4
  6. https://doi.org/10.1002/pmic.200401199
  7. https://doi.org/10.1128/JB.00882-07
  8. https://doi.org/10.1016/j.heliyon.2024.e24022
  9. https://doi.org/10.1002/pmic.200401199,
  10. https://doi.org/10.1128/jb.00882-07,
  11. https://doi.org/10.1016/j.heliyon.2024.e24022,

📄 View Raw YAML

---
id: A3DC29
gene_symbol: celA
product_type: PROTEIN
status: DRAFT
taxon:
  id: NCBITaxon:203119
  label: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC
    103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372)
description: CelA (also known as Cel8A or Endoglucanase A) is a secreted endo-beta-1,4-glucanase
  (EC 3.2.1.4) belonging to glycosyl hydrolase family 8 (GH8). It catalyzes the endohydrolysis
  of internal (1->4)-beta-D-glucosidic linkages in cellulose, lichenin, and cereal
  beta-D-glucans. The enzyme contains a GH8 catalytic domain with an open cleft/groove
  architecture characteristic of endo-acting glycosidases, plus a C-terminal type
  I dockerin domain (residues 411-477) that mediates incorporation into the cellulosome
  complex via cohesin-dockerin interactions with the primary scaffoldin CipA. CelA
  is one of the major catalytic components of the C. thermocellum cellulosome, consistently
  identified in proteomic surveys of purified cellulosomes grown on Avicel. The enzyme
  lacks its own carbohydrate-binding module (CBM) and relies on the CBM of CipA for
  targeting to insoluble cellulose substrates. Structural studies at atomic resolution
  (0.94 A) have characterized the active site, identifying residues E95 (proton donor)
  and D152 (nucleophile) as key catalytic residues.
existing_annotations:
  - term:
      id: GO:0000272
      label: polysaccharide catabolic process
    evidence_type: IEA
    original_reference_id: GO_REF:0000120
    review:
      summary: CelA is an endoglucanase that participates in cellulose degradation.
        While cellulose is a polysaccharide, the more specific term GO:0030245 (cellulose
        catabolic process) is already annotated and better represents the biological
        process. This broader term is acceptable but redundant given the more specific
        annotation. The deep research review confirms CelA's role in cellulose breakdown
        as part of the cellulosome complex [celA-deep-research-falcon.md].
      action: ACCEPT
      reason: This annotation is correct but overly broad. CelA specifically degrades
        cellulose, which is a type of polysaccharide. The term is inferred from InterPro
        domain annotations (IPR002105 Dockerin, IPR016134 Dockerin domain, IPR036439
        Dockerin superfamily) which correctly associate the protein with polysaccharide
        degradation. Since GO:0030245 (cellulose catabolic process) is also annotated
        and is more specific, this broader term is somewhat redundant but not incorrect.
        For IEA annotations, broader parent terms are acceptable to retain alongside
        more specific ones.
      additional_reference_ids: [file:ACET2/celA/celA-deep-research-falcon.md]
  - term:
      id: GO:0004553
      label: hydrolase activity, hydrolyzing O-glycosyl compounds
    evidence_type: IEA
    original_reference_id: GO_REF:0000120
    review:
      summary: CelA is a glycosyl hydrolase (GH8 family) that hydrolyzes O-glycosyl
        compounds, specifically (1->4)-beta-D-glucosidic linkages. This term is correct
        but a parent of the more specific GO:0008810 (cellulase activity) which is
        also annotated.
      action: ACCEPT
      reason: This annotation is correct. CelA belongs to GH8 and catalyzes hydrolysis
        of beta-glucosidic bonds in cellulose. The annotation is inferred from ARBA
        rule ARBA00027782 and InterPro domains IPR002037 (Glyco_hydro_8) and IPR002105
        (Dockerin). While GO:0008810 (cellulase activity) is more specific, this parent
        term is acceptable for an IEA annotation and provides a useful grouping term.
      additional_reference_ids: [file:ACET2/celA/celA-deep-research-falcon.md]
  - term:
      id: GO:0005975
      label: carbohydrate metabolic process
    evidence_type: IEA
    original_reference_id: GO_REF:0000002
    review:
      summary: CelA participates in carbohydrate metabolism through its role in cellulose
        degradation. This is a high-level parent term of more specific annotations
        already present.
      action: ACCEPT
      reason: This annotation is correct but very broad. It is inferred from InterPro
        records IPR002037 (Glyco_hydro_8), IPR008928 (6-hairpin_glycosidase_sf), and
        IPR012341 (6hp_glycosidase-like_sf). While the more specific GO:0030245 (cellulose
        catabolic process) better captures CelA's function, this broader term is not
        incorrect and acceptable for an IEA annotation derived from domain membership.
      additional_reference_ids: [file:ACET2/celA/celA-deep-research-falcon.md]
  - term:
      id: GO:0008810
      label: cellulase activity
    evidence_type: IEA
    original_reference_id: GO_REF:0000003
    review:
      summary: CelA (Endoglucanase A) is definitively a cellulase with EC 3.2.1.4
        activity. It catalyzes the endohydrolysis of (1->4)-beta-D-glucosidic linkages
        in cellulose, lichenin, and cereal beta-D-glucans. This is the core molecular
        function of the enzyme and the most specific appropriate MF term.
      action: ACCEPT
      reason: 'This annotation accurately represents the core molecular function of
        CelA. The enzyme is explicitly annotated as EC 3.2.1.4 (endoglucanase/cellulase)
        in UniProt, and the GO term definition matches exactly: "Catalysis of the
        endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose, lichenin
        and cereal beta-D-glucans." The annotation is derived from the EC number mapping
        (GO_REF:0000003). Proteomic studies identify CelA/Cel8A as a major catalytic
        component of the cellulosome with endo-beta-1,4-glucanase activity [PMID:16127726,
        PMID:17644599]. Structural studies confirm the open cleft architecture characteristic
        of endo-acting glycosidases [PMID:8805535].'
      additional_reference_ids:
        - file:ACET2/celA/celA-deep-research-falcon.md
        - PMID:16127726
        - PMID:17644599
      supported_by:
        - reference_id: PMID:16127726
          supporting_text: 'Ten of the components were previously known: the structural
            protein CipA, the endo-glucanases Cel8A, Cel5G, Cel9N, the cellobiohydrolases
            Cbh9A, Cel9K, Cel48S, the xylanases Xyn10C, Xyn10Z, and the chitinase
            Chi18A'
        - reference_id: PMID:17644599
          supporting_text: In total, 41 cellulosomal proteins were detected, including
            36 type I dockerin-containing proteins
  - term:
      id: GO:0016787
      label: hydrolase activity
    evidence_type: IEA
    original_reference_id: GO_REF:0000043
    review:
      summary: CelA is a hydrolase that cleaves glycosidic bonds. This is a very broad
        parent term of the more specific cellulase activity already annotated.
      action: ACCEPT
      reason: This annotation is correct but extremely broad. CelA is indeed a hydrolase
        (EC 3.2.1.4), but the term GO:0008810 (cellulase activity) provides much more
        informative functional information. This IEA annotation derived from UniProtKB
        keyword KW-0378 (Hydrolase) is acceptable but provides minimal functional
        insight beyond what is captured by more specific terms.
      additional_reference_ids: [file:ACET2/celA/celA-deep-research-falcon.md]
  - term:
      id: GO:0016798
      label: hydrolase activity, acting on glycosyl bonds
    evidence_type: IEA
    original_reference_id: GO_REF:0000043
    review:
      summary: CelA hydrolyzes glycosyl bonds, specifically (1->4)-beta-D-glucosidic
        linkages. This term is a parent of GO:0004553 (hydrolase activity, hydrolyzing
        O-glycosyl compounds) and GO:0008810 (cellulase activity).
      action: ACCEPT
      reason: This annotation is correct. CelA acts on glycosyl bonds (specifically
        beta-1,4 glucosidic bonds in cellulose). The annotation is inferred from UniProtKB
        keyword KW-0326 (Glycosidase). While more specific terms are available and
        already annotated, this intermediate-level term is acceptable for an IEA.
      additional_reference_ids: [file:ACET2/celA/celA-deep-research-falcon.md]
  - term:
      id: GO:0030245
      label: cellulose catabolic process
    evidence_type: IEA
    original_reference_id: GO_REF:0000043
    review:
      summary: CelA is a cellulose-degrading endoglucanase that directly participates
        in cellulose catabolism as part of the cellulosome complex. This is the appropriate
        biological process term for this enzyme.
      action: ACCEPT
      reason: This annotation accurately represents CelA's biological role. The enzyme
        catalyzes the endohydrolysis of cellulose chains, producing new chain ends
        for exoglucanases and contributing directly to cellulose breakdown. The annotation
        is derived from UniProtKB keyword KW-0136 (Cellulose degradation). Proteomic
        analyses confirm CelA is a major component of the cellulosome, the multi-enzyme
        complex responsible for cellulose catabolism in C. thermocellum [PMID:16127726,
        PMID:17644599, PMID:38234915].
      additional_reference_ids:
        - file:ACET2/celA/celA-deep-research-falcon.md
        - PMID:16127726
        - PMID:38234915
      supported_by:
        - reference_id: PMID:38234915
          supporting_text: the anchoring of various cellulolytic enzymes, i.e., endoglucanases,
            cellobiohydrolases, xylanases, to a scaffoldin makes it a complex structure
            in which different enzymes work synergistically to attack heterogeneous,
            insoluble cellulose substrates
        - reference_id: file:ACET2/celA/celA-deep-research-falcon.md
          supporting_text: CelA/Cel8A is an endo-beta-1,4-glucanase (EC 3.2.1.4) that
            cleaves internal bonds in cellulose chains, producing new chain ends for
            exoglucanases
  - term:
      id: GO:0043263
      label: cellulosome
    evidence_type: IEA
    original_reference_id: GO_REF:0000120
    review:
      summary: CelA contains a type I dockerin domain that mediates its incorporation
        into the cellulosome complex. The cellulosome (GO:0043263) is the appropriate
        cellular component annotation for this protein.
      action: NEW
      reason: 'CelA has a well-characterized dockerin domain (residues 411-477) that
        binds to type I cohesins on the CipA scaffoldin. Proteomic surveys consistently
        identify CelA as a component of purified cellulosomes. This cellular component
        annotation should be added, as the protein functions as part of the cellulosome
        complex. The InterPro domains IPR002105 (Dockerin_1_rpt), IPR016134 (Dockerin_dom),
        and IPR036439 (Dockerin_dom_sf) all indicate cellulosome localization. Note:
        This annotation may already exist via IEA but was not present in the provided
        GOA file.'
      additional_reference_ids:
        - file:ACET2/celA/celA-deep-research-falcon.md
        - PMID:16127726
        - PMID:38234915
      supported_by:
        - reference_id: PMID:16127726
          supporting_text: To identify the predominant catalytic components, cellulosomes
            were purified and the components were separated by an adapted two-dimensional
            gel electrophoresis technique. The apparent major spots were identified
            by MALDI-TOF/TOF
        - reference_id: PMID:38234915
          supporting_text: 'Cellulosome comprises two major subunits: long flexible
            scaffoldin, which forms the center part containing specific binding sites
            cohesions, and the enzymes containing a dockerin module that binds to
            cohesion'
      proposed_replacement_terms:
        - id: GO:0043263
          label: cellulosome
references:
  - id: GO_REF:0000002
    title: Gene Ontology annotation through association of InterPro records with GO
      terms
    findings: []
  - id: GO_REF:0000003
    title: Gene Ontology annotation based on Enzyme Commission mapping
    findings: []
  - id: GO_REF:0000043
    title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
    findings: []
  - id: GO_REF:0000120
    title: Combined Automated Annotation using Multiple IEA Methods
    findings: []
  - id: PMID:3980433
    title: Sequence of a cellulase gene of the thermophilic bacterium Clostridium
      thermocellum
    findings:
      - statement: Original cloning and sequencing of the celA gene encoding Endoglucanase
          A
        supporting_text: The nucleotide sequence of the celA gene, encoding the extracellular
          endoglucanase A of Clostridium thermocellum, was determined and compared
          with the NH2-terminal amino acid sequence of the purified enzyme
  - id: PMID:8805535
    title: The crystal structure of endoglucanase CelA, a family 8 glycosyl hydrolase
      from Clostridium thermocellum
    findings:
      - statement: Crystal structure at 1.65 A resolution confirms GH8 family fold
          with active site cleft
        supporting_text: The protein folds into a regular (alpha/alpha)6 barrel formed
          by six inner and six outer alpha helices. Cello-oligosaccharides bind to
          an acidic cleft containing at least five D-glucosyl-binding subsites
  - id: PMID:11884144
    title: Atomic (0.94 A) resolution structure of an inverting glycosidase in complex
      with substrate
    findings:
      - statement: High-resolution structure reveals active site and catalytic mechanism
        supporting_text: The crystal structure of Clostridium thermocellum endoglucanase
          CelA in complex with cellopentaose has been determined at 0.94 A resolution.
          The oligosaccharide occupies six D-glucosyl-binding subsites
  - id: PMID:16127726
    title: Functional subgenomics of Clostridium thermocellum cellulosomal genes
    findings:
      - statement: CelA/Cel8A identified as one of the major catalytic components
          of the cellulosome
        supporting_text: 'Ten of the components were previously known: the structural
          protein CipA, the endo-glucanases Cel8A, Cel5G, Cel9N, the cellobiohydrolases
          Cbh9A, Cel9K, Cel48S, the xylanases Xyn10C, Xyn10Z, and the chitinase Chi18A'
      - statement: Seventy-one putative cellulosomal genes detected in C. thermocellum
          genome
        supporting_text: Seventy-one putative cellulosomal genes were detected. One
          third of these ORFs may be involved in cellulose hydrolysis
  - id: PMID:17644599
    title: Global view of the Clostridium thermocellum cellulosome revealed by quantitative
      proteomic analysis
    findings:
      - statement: Quantitative proteomic analysis of cellulosome composition
        supporting_text: In total, 41 cellulosomal proteins were detected, including
          36 type I dockerin-containing proteins, which count among them all but three
          of the known docking components and 16 new subunits
  - id: PMID:38234915
    title: Enzymatic degradation of cellulose in soil - A review
    findings:
      - statement: 2024 review describing cellulosome architecture including CelA
          as endoglucanase component
        supporting_text: the anchoring of various cellulolytic enzymes, i.e., endoglucanases,
          cellobiohydrolases, xylanases, to a scaffoldin makes it a complex structure
          in which different enzymes work synergistically to attack heterogeneous,
          insoluble cellulose substrates
  - id: file:ACET2/celA/celA-deep-research-falcon.md
    title: Deep research review of celA gene function
    findings:
      - statement: Comprehensive literature synthesis on CelA function and cellulosome
          localization
        supporting_text: CelA/Cel8A is an endo-beta-1,4-glucanase (EC 3.2.1.4) that
          cleaves internal bonds in cellulose chains, producing new chain ends for
          exoglucanases. Structural evidence shows a groove/open cleft typical of
          endo-acting enzymes
core_functions:
  - description: Endo-beta-1,4-glucanase activity as part of the cellulosome complex,
      catalyzing internal cleavage of cellulose chains to produce new chain ends for
      exoglucanases.
    molecular_function:
      id: GO:0008810
      label: cellulase activity
    directly_involved_in:
      - id: GO:0030245
        label: cellulose catabolic process
    locations:
      - id: GO:0043263
        label: cellulosome
    supported_by:
      - reference_id: PMID:16127726
        supporting_text: 'Ten of the components were previously known: the structural
          protein CipA, the endo-glucanases Cel8A, Cel5G, Cel9N'
      - reference_id: PMID:17644599
        supporting_text: In total, 41 cellulosomal proteins were detected, including
          36 type I dockerin-containing proteins
suggested_questions:
  - question: What is the precise substrate specificity of CelA compared to other
      GH8 endoglucanases in the cellulosome? Does it show any preference for specific
      regions of crystalline vs amorphous cellulose?
    experts: []
  - question: How does CelA synergize with exoglucanases (like CelS/Cel48A) and other
      endoglucanases in the cellulosome to achieve efficient cellulose degradation?
    experts: []
suggested_experiments:
  - hypothesis: CelA shows optimal activity when incorporated into the cellulosome
      complex compared to free enzyme, due to proximity effects with other cellulases.
    description: Compare the kinetic parameters (kcat, Km) of CelA in free form versus
      when bound to mini-cellulosomes containing different combinations of other cellulases.
      Use defined cellulose substrates (Avicel, filter paper, CMC, phosphoric acid
      swollen cellulose) to assess potential synergistic effects.
    experiment_type: enzyme kinetics
tags: [cellulosome]