celK

UniProt ID: A3DCH1
Organism: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372)
Review Status: DRAFT
๐Ÿ“ Provide Detailed Feedback

Gene Description

CelK (also known as Cel9K, locus Cthe_0412) is a major cellulosomal cellulase belonging to glycoside hydrolase family 9 (GH9) in Acetivibrio thermocellus (formerly Clostridium thermocellum). It functions as a cellulose 1,4-beta-cellobiosidase (EC 3.2.1.91) that hydrolyzes beta-1,4-glucosidic linkages in cellulose, releasing cellobiose and cellotriose from the non-reducing ends of cellulose chains. The enzyme has a modular architecture consisting of a CBM4 carbohydrate-binding module, an Ig-like domain, the GH9 catalytic domain, and a C-terminal type I dockerin domain that mediates integration into the cellulosome complex. CelK works synergistically with Cel48S (the major GH48 cellobiohydrolase) in the cellulosome to efficiently degrade crystalline cellulose.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0000272 polysaccharide catabolic process
IEA
GO_REF:0000120
MODIFY
Summary: CelK is a cellulase that degrades cellulose, which is a polysaccharide. The annotation to polysaccharide catabolic process is correct but overly broad. CelK specifically catalyzes the breakdown of cellulose (a beta-1,4-glucan), not polysaccharides in general.
Reason: While cellulose is a polysaccharide and CelK does participate in polysaccharide catabolism, this term is too general. The more specific term GO:0030245 (cellulose catabolic process) already exists in the annotation set and accurately captures the biological process. This broader term provides less informative annotation.
Proposed replacements: cellulose catabolic process
Supporting Evidence:
PMID:29075324
CBHs (exo-acting from the sugar ends) released specifically cellobiose as the only product over time (Cel48S, Cel9K, and Cbh9A)
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
IEA
GO_REF:0000120
ACCEPT
Summary: CelK is a GH9 family glycoside hydrolase that hydrolyzes beta-1,4-glucosidic bonds in cellulose. This annotation is correct but too general - the enzyme has specific cellobiohydrolase activity (EC 3.2.1.91) rather than generic glycosyl hydrolase activity.
Reason: While GO:0016162 (cellulose 1,4-beta-cellobiosidase activity) is more specific and preferred, this parent term is technically accurate. Since the more specific term is already annotated, this broader annotation is acceptable as it provides correct hierarchical coverage. IEA annotations often propagate from broader InterPro signatures and this is consistent with the enzyme's function.
Supporting Evidence:
PMID:29075324
exo-acting cellobiohydrolases thread the cellulose molecule from its free cellulose chain end through a tunnel built by loop structures around the active site
GO:0005576 extracellular region
IEA
GO_REF:0000044
MODIFY
Summary: CelK is a secreted protein that incorporates into the extracellular cellulosome complex via its dockerin domain. The signal peptide (residues 1-27) is cleaved upon secretion. Proteomics confirms CelK is abundant in cellulosome preparations from cellulose-grown cells.
Reason: While CelK is secreted to the extracellular region, a more specific cellular component annotation would be GO:0043263 (cellulosome), as CelK is a bona fide cellulosomal enzyme that integrates via dockerin-cohesin interactions. The cellulosome is a defined extracellular multi-enzyme complex in C. thermocellum.
Proposed replacements: cellulosome
Supporting Evidence:
PMID:16127726
the cellobiohydrolases Cbh9A, Cel9K, Cel48S
PMID:17644599
exoglucanases CelS and CelK
GO:0005975 carbohydrate metabolic process
IEA
GO_REF:0000002
ACCEPT
Summary: CelK participates in carbohydrate metabolism by degrading cellulose. This annotation is correct but extremely broad - CelK's specific role is in cellulose catabolism.
Reason: This is a very high-level biological process term that is technically accurate but provides minimal information. Since more specific BP annotations exist (GO:0030245 cellulose catabolic process), this broad term is acceptable as hierarchical context but not informative as a standalone annotation. IEA from InterPro is appropriate for this level of granularity.
GO:0008810 cellulase activity
IEA
GO_REF:0000002
MODIFY
Summary: CelK has cellulase activity as a GH9 family enzyme. However, the GO definition of cellulase activity (GO:0008810) specifically refers to endohydrolysis, whereas CelK functions as a cellobiohydrolase (exo-acting, EC 3.2.1.91) releasing cellobiose from non-reducing ends.
Reason: GO:0008810 (cellulase activity) is defined as "catalysis of the endohydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose". CelK is classified as EC 3.2.1.91 (cellulose 1,4-beta-cellobiosidase), which is an exo-acting enzyme that releases cellobiose from the non-reducing ends. The correct annotation is GO:0016162 (cellulose 1,4-beta-cellobiosidase activity), which is already present in the annotation set.
Supporting Evidence:
PMID:29075324
As expected, for the CBHs Cbh9A, Cel48S, and Cel9K no or very weak activities on modified substrates were observed, whereas on PASC substantial product formation was found
GO:0016162 cellulose 1,4-beta-cellobiosidase activity
IEA
GO_REF:0000003
ACCEPT
Summary: This is the most accurate molecular function annotation for CelK. The enzyme is classified as EC 3.2.1.91 (cellulose 1,4-beta-cellobiosidase), which matches this GO term exactly. UniProt catalytic activity annotation confirms: "Hydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose and cellotetraose, releasing cellobiose from the non-reducing ends of the chains."
Reason: This is the correct and most specific molecular function annotation for CelK. The annotation is derived from EC:3.2.1.91 mapping, which is the experimentally determined enzyme classification for this protein. This represents the core catalytic function of CelK.
Supporting Evidence:
PMID:29075324
CBHs (exo-acting from the sugar ends) released specifically cellobiose as the only product over time (Cel48S, Cel9K, and Cbh9A)
GO:0016787 hydrolase activity
IEA
GO_REF:0000043
ACCEPT
Summary: CelK is indeed a hydrolase - it catalyzes the hydrolytic cleavage of beta-1,4-glucosidic bonds. This annotation is correct but extremely general.
Reason: This is the root molecular function term for hydrolases. While technically accurate, it provides minimal information. More specific annotations (GO:0016162, GO:0004553, GO:0016798) are already present and preferred. This broad term is acceptable as it provides hierarchical consistency but is not informative as a standalone annotation.
GO:0016798 hydrolase activity, acting on glycosyl bonds
IEA
GO_REF:0000120
ACCEPT
Summary: CelK hydrolyzes glycosyl bonds (specifically beta-1,4-glucosidic linkages in cellulose). This annotation is correct and at an appropriate intermediate level of specificity.
Reason: This term accurately describes CelK's molecular function at an intermediate level of specificity. While GO:0016162 is more specific, this parent term is correctly applied. IEA annotation from InterPro domain signatures appropriately captures this level of function.
GO:0030245 cellulose catabolic process
IEA
GO_REF:0000043
ACCEPT
Summary: CelK is directly involved in cellulose catabolism as a major cellulosomal cellulase. The enzyme degrades crystalline cellulose to cellobiose and cellotriose as primary products. This is the most appropriate biological process annotation for CelK.
Reason: This is the correct and most specific biological process annotation for CelK. The enzyme's primary biological role is cellulose degradation as part of the cellulosome complex. This annotation accurately represents the core biological process in which CelK participates.
Supporting Evidence:
PMID:29075324
After prolonged incubation times (overnight), the final products are mainly cellobiose and cellotriose (DP 2 to DP 3)
GO:0043263 cellulosome
IEA
GO_REF:0000044
NEW
Summary: CelK contains a C-terminal type I dockerin domain (residues 828-894) that mediates integration into the cellulosome via interaction with cohesin domains on the CipA scaffoldin protein. Proteomics confirms CelK is one of the most abundant cellulosomal enzyme subunits.
Reason: This annotation is not currently in the GOA file but is strongly supported by the domain architecture (dockerin domain) and proteomics data showing CelK as a major cellulosomal component. This is the most specific and accurate cellular component annotation.
Supporting Evidence:
PMID:17644599
exoglucanases CelS and CelK
PMID:16127726
the cellobiohydrolases Cbh9A, Cel9K, Cel48S
file:ACET2/celK/celK-deep-research-falcon.md
CelK is a secreted, extracellular cellulosomal subunit that incorporates into the cellulosome via type-I dockerin interactions

Core Functions

CelK functions as a cellulose 1,4-beta-cellobiosidase that hydrolyzes beta-1,4-glucosidic linkages in crystalline cellulose, releasing cellobiose from the non-reducing ends of cellulose chains. It operates as a component of the cellulosome complex, working synergistically with Cel48S (GH48) to efficiently degrade plant cell wall polysaccharides.

Directly Involved In:
Cellular Locations:
In Complex:
cellulosome
Substrates:
Supporting Evidence:
  • PMID:29075324
    CBHs (exo-acting from the sugar ends) released specifically cellobiose as the only product over time (Cel48S, Cel9K, and Cbh9A)

References

Gene Ontology annotation through association of InterPro records with GO terms
Gene Ontology annotation based on Enzyme Commission mapping
Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
Gene Ontology annotation based on UniProtKB/Swiss-Prot Subcellular Location vocabulary mapping, accompanied by conservative changes to GO terms applied by UniProt
Combined Automated Annotation using Multiple IEA Methods
Comparative characterization of all cellulosomal cellulases from Clostridium thermocellum
  • Cel9K is classified as a cellobiohydrolase (CBH) in Table 1
    "Cel9K0412GH9Inverting"
  • CBHs including Cel9K release cellobiose as the main product
    "CBHs (exo-acting from the sugar ends) released specifically cellobiose as the only product over time (Cel48S, Cel9K, and Cbh9A)"
  • Cel9K shows weak activity on modified substrates but substantial activity on PASC
    "As expected, for the CBHs Cbh9A, Cel48S, and Cel9K no or very weak activities on modified substrates were observed, whereas on PASC substantial product formation was found"
Functional subgenomics of Clostridium thermocellum cellulosomal genes
  • Cel9K is identified as a major cellulosomal cellobiohydrolase
    "the cellobiohydrolases Cbh9A, Cel9K, Cel48S"
Global view of the Clostridium thermocellum cellulosome revealed by quantitative proteomic analysis
  • CelK shows higher expression in cellulosomes from cellulose-grown cells
    "exoglucanases CelS and CelK"
file:ACET2/celK/celK-deep-research-falcon.md
Deep research literature review for celK
  • CelK identity confirmed as Cel9K with locus Cthe_0412
    "The target is celK from Acetivibrio thermocellus (Clostridium thermocellum). Comparative catalogs and proteomics explicitly map CelK to Cel9K with locus tag Cthe_0412"
  • CelK is a cellulosomal subunit with type-I dockerin
    "CelK is a secreted, extracellular cellulosomal subunit that incorporates into the cellulosome via type-I dockerin interactions"
  • CelK belongs to GH9 family with inverting mechanism
    "CelK belongs to glycoside hydrolase family 9 (GH9), which utilize an inverting catalytic mechanism"

Suggested Questions for Experts

Q: What is the precise processivity of CelK - does it remain attached to the cellulose chain after each catalytic event or does it dissociate and rebind?

Suggested experts: Wolfgang H. Schwarz, Vladimir V. Zverlov

Q: What are the kinetic parameters (kcat, Km) for CelK on various cellulosic substrates (Avicel, PASC, CMC)?

Suggested experts: Benedikt Leis, Wolfgang Liebl

Suggested Experiments

Experiment: Perform single-molecule fluorescence microscopy to track individual CelK molecules on cellulose nanocrystals. Measure processivity (number of catalytic events per encounter) and compare to other GH9 family members with known processivities.

Hypothesis: CelK exhibits processive cellobiohydrolase activity on crystalline cellulose

Type: Single-molecule enzymology

Experiment: Measure hydrolysis rates of crystalline cellulose (Avicel) using CelK alone, Cel48S alone, and equimolar combinations. Quantify synergy as the ratio of combined activity to the sum of individual activities. Test at various enzyme ratios to determine optimal stoichiometry.

Hypothesis: CelK and Cel48S show synergistic enhancement of cellulose hydrolysis

Type: Enzyme kinetics / synergy assay

Tags

cellulosome

๐Ÿ“š Additional Documentation

Deep Research Falcon

(celK-deep-research-falcon.md)

provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2025-12-26T17:03:10.211136'
end_time: '2025-12-26T17:10:34.347651'
duration_seconds: 444.14
template_file: templates/gene_research_go_focused.md
template_variables:
organism: ACET2
gene_id: celK
gene_symbol: celK
uniprot_accession: A3DCH1
protein_description: 'RecName: Full=Cellulose 1,4-beta-cellobiosidase; EC=3.2.1.91;
Flags: Precursor;'
gene_info: Name=celK; OrderedLocusNames=Cthe_0412; ORFNames=CtheDRAFT_2165;
organism_full: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322
/ NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
protein_family: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
protein_domains: 6-hairpin_glycosidase_sf. (IPR008928); 6hp_glycosidase-like_sf.
(IPR012341); Cellulase_Ig-like. (IPR004197); CenC_carb-bd. (IPR003305); Dockerin_1_rpt.
(IPR002105)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 27


Question

Gene Research for Functional Annotation

โš ๏ธ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: A3DCH1
  • Protein Description: RecName: Full=Cellulose 1,4-beta-cellobiosidase; EC=3.2.1.91; Flags: Precursor;
  • Gene Information: Name=celK; OrderedLocusNames=Cthe_0412; ORFNames=CtheDRAFT_2165;
  • Organism (full): Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  • Protein Family: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
  • Key Domains: 6-hairpin_glycosidase_sf. (IPR008928); 6hp_glycosidase-like_sf. (IPR012341); Cellulase_Ig-like. (IPR004197); CenC_carb-bd. (IPR003305); Dockerin_1_rpt. (IPR002105)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "celK" matches the protein description above
  2. Verify the organism is correct: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'celK' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene celK (gene ID: celK, UniProt: A3DCH1) in ACET2.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output

Question: You are an expert researcher providing comprehensive, well-cited information.

Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies

Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.

Gene Research for Functional Annotation

โš ๏ธ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: A3DCH1
  • Protein Description: RecName: Full=Cellulose 1,4-beta-cellobiosidase; EC=3.2.1.91; Flags: Precursor;
  • Gene Information: Name=celK; OrderedLocusNames=Cthe_0412; ORFNames=CtheDRAFT_2165;
  • Organism (full): Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  • Protein Family: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
  • Key Domains: 6-hairpin_glycosidase_sf. (IPR008928); 6hp_glycosidase-like_sf. (IPR012341); Cellulase_Ig-like. (IPR004197); CenC_carb-bd. (IPR003305); Dockerin_1_rpt. (IPR002105)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "celK" matches the protein description above
  2. Verify the organism is correct: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'celK' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene celK (gene ID: celK, UniProt: A3DCH1) in ACET2.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Plan and verification
- Identity verification: The target is celK from Acetivibrio thermocellus (Clostridium thermocellum). Comparative catalogs and proteomics explicitly map CelK to Cel9K with locus tag Cthe_0412 (also called Cel9K), confirming gene/protein identity in the correct organismal context (C. thermocellum ATCC 27405 lineage) (leis2017comparativecharacterizationof pages 7-8, munir2014comparativeanalysisof pages 13-13). Domain annotations from proteomics list CBM and dockerin modules consistent with a cellulosomal GH9 enzyme (zverlov2005functionalsubgenomicsof pages 2-4). No conflicting usage of โ€œcelKโ€ in unrelated organisms was used for this report.

Comprehensive research report: celK (CelK; Cel9K; Cthe_0412) in Acetivibrio thermocellus

1) Key concepts and definitions
- Primary function and EC: CelK is a cellulase in the C. thermocellum cellulosome; literature treats CelK as a cellobiohydrolase (EC 3.2.1.91) and a major cellulosomal component involved in crystalline cellulose depolymerization to short cellodextrins, notably cellobiose (DP2) (Munir 2014, citing CelK as โ€œa new cellobiohydrolaseโ€ and โ€œmajor cellulosome componentโ€) (https://doi.org/10.1371/journal.pone.0104260; published Aug 2014) (munir2014comparativeanalysisof pages 13-13). Comparative analyses of all cellulosomal cellulases position Cel9K in GH9 and define catalytic behaviors of GH9 enzymes, including processive endoglucanase modes that yield cellobiose/cellotriose (DP2โ€“3) (https://doi.org/10.1186/s13068-017-0928-4; published Oct 2017) (leis2017comparativecharacterizationof pages 1-2, leis2017comparativecharacterizationof pages 7-8).
- Enzyme class/family and mechanism: CelK belongs to glycoside hydrolase family 9 (GH9), which utilize an inverting catalytic mechanism. GH9 cellulases in C. thermocellum encompass exo-acting and processive endoglucanase behaviors, producing characteristic product spectra in controlled assays (https://doi.org/10.1186/s13068-017-0928-4; Oct 2017) (leis2017comparativecharacterizationof pages 1-2).
- Domain architecture: Proteomic/proteomic-informed domain mapping lists CelK among major cellulosomal components and assigns a modular architecture typical of cellulosomal GH9s, including a catalytic GH9 domain with carbohydrate-binding and Ig-like modules and a C-terminal dockerin (e.g., โ€œCBM4-Ig-GH9-Doc1โ€ entry for CelK in proteomics table), consistent with extracellular cellulosome incorporation (https://doi.org/10.1002/pmic.200401199; Sep 2005) (zverlov2005functionalsubgenomicsof pages 2-4). GH9 cellulases in C. thermocellum frequently carry CBM3 variants (e.g., CelI GH9 with CBM3c/CBM3b) that enhance chain feeding/binding and processivity on crystalline cellulose, serving as a mechanistic analogue for GH9 domain function (https://doi.org/10.1128/jb.185.2.391-398.2003; Jan 2003) (gilad2003celianoncellulosomal pages 1-2).
- Cellular/extracellular localization: CelK is a secreted, extracellular cellulosomal subunit that incorporates into the cellulosome via type-I dockerin interactions, contributing to surface-associated cellulose degradation (https://doi.org/10.1186/s13068-017-0928-4; Oct 2017) (leis2017comparativecharacterizationof pages 7-8, gold2007proteomicanalysisof pages 65-68, zverlov2005functionalsubgenomicsof pages 2-4, gold2007proteomicanalysisof pages 68-69).

2) Recent developments and latest research (emphasis 2023โ€“2024)
- System-level context: Recent work underscores synergy between GH9 and GH48 cellulases in cellulosomal systems, where GH48 cellobiohydrolases (e.g., Cel48S/CelS) synergize strongly with GH9 enzymes to drive cellulose deconstruction; this is a current focus for biocatalyst engineering (https://doi.org/10.3390/fermentation9030204; Feb 2023) (gold2007proteomicanalysisof pages 68-69). Comparative catalogs (although 2017) remain the most comprehensive reference to the complete set of C. thermocellum cellulosomal cellulases and their hydrolysis modes, informing ongoing 2023โ€“2024 engineering of enzyme mixtures and designer cellulosomes (https://doi.org/10.1186/s13068-017-0928-4; Oct 2017) (leis2017comparativecharacterizationof pages 1-2, leis2017comparativecharacterizationof pages 7-8). While 2023โ€“2024 reviews advance broader cellulosome engineering strategies, CelK-specific new kinetic or structural determinations were not found in 2023โ€“2024; thus, CelKโ€™s current understanding still relies on the definitive proteomics and comparative functional studies cited below.

3) Current applications and real-world implementations
- Biomass deconstruction platforms: In designer miniโ€‘cellulosomes, combining GH9 representatives with Cel48S (GH48) maximizes hydrolytic performance on cellulose; artificial nonavalent cellulosomes recovered ~50% of native cellulosome activity, highlighting the operational role of GH9 (including CelK) in practical enzyme cocktails and consolidated bioprocessing concepts (https://doi.org/10.1186/s13068-017-0928-4; Oct 2017) (leis2017comparativecharacterizationof pages 1-2). Reviews on GH48-GH9 synergy emphasize deploying GH9 cellulases alongside GH48 in biorefinery-relevant contexts (https://doi.org/10.3390/fermentation9030204; Feb 2023) (gold2007proteomicanalysisof pages 68-69).

4) Expert opinions and analysis from authoritative sources
- Role vis-ร -vis Cel48S (CelS): Quantitative proteomics and comparative enzymology position CelK (Cel9K) as a prominent GH9 partner to Cel48S. Leis et al. explicitly list Cel48S/Cel9K as a basic functional pairing in the cellulosome; product analyses at 60ยฐC and pH 5.8 show soluble final products dominated by DP2โ€“3 (cellobiose/cellotriose), consistent with GH48+GH9 synergy (https://doi.org/10.1186/s13068-017-0928-4; Oct 2017) (leis2017comparativecharacterizationof pages 7-8). Proteomics ranks CelK among the most abundant enzymatic subunits of the cellulosome under cellulose or cellobiose growth, comparable in magnitude to Cel48S depending on substrate, supporting a model in which CelK contributes processive chain cleavage complementing Cel48Sโ€™s reducing-end exo activity (https://doi.org/10.1128/jb.00882-07; Oct 2007) (gold2007globalviewof pages 8-9, gold2007proteomicanalysisof pages 65-68, gold2007proteomicanalysisof pages 68-69).

5) Relevant statistics and data
- Enzyme set and assay conditions: The comprehensive comparative study tested all 24 single cellulosomal cellulases under identical conditions, defining four hydrolysis modes and product spectra. Product TLC heatmaps showed DP2โ€“3 products predominate after overnight reactions at 60ยฐC, pH 5.8 with model and cellulosic substrates, situating Cel9K within the GH9 group that contributes these product patterns (https://doi.org/10.1186/s13068-017-0928-4; Oct 2017) (leis2017comparativecharacterizationof pages 1-2, leis2017comparativecharacterizationof pages 7-8).
- Proteomic abundance (cellulose-grown): In Avicel-grown cellulosomes, CelK (GH9) is a major component. One proteomic table reports CelK with 39 unique peptide observations, emPAI 4.12, dockerin I/CipA mol% ~11.0, coverage ~35%, versus Cel48S with 29 peptides, emPAI 3.56, DocI/CipA mol% ~9.4, coverage ~32%โ€”all consistent with high abundance of both subunits; CipA scaffold ranked highest (emPAI 5.92) (Gold 2007; details from the table) (gold2007proteomicanalysisof pages 65-68). Under cellobiose growth, CelK remained abundant with emPAI 1.12 and normalized abundance ~0.52 relative to CipA (Gold 2007; table legend explains emPAI and normalization) (gold2007proteomicanalysisof pages 68-69). A separate proteomic study identified CelK and Cel48S among the 13 major cellulosomal components on 2โ€‘D gels of cellulose-grown cellulosomes and reported approximate abundances (Zverlov 2005) (https://doi.org/10.1002/pmic.200401199; Sep 2005) (zverlov2005functionalsubgenomicsof pages 2-4, zverlov2005functionalsubgenomicsof pages 1-2).
- Domain mapping evidence: The proteomics catalog lists CelK with CBM and Ig-like repeats plus GH9โ€“dockerin, e.g., โ€œCBM4โ€“Igโ€“GH9โ€“Doc1,โ€ consistent with an extracellular cellulosomal GH9 architecture (Zverlov 2005) (https://doi.org/10.1002/pmic.200401199; Sep 2005) (zverlov2005functionalsubgenomicsof pages 2-4). GH9 architecture-function relations are supported by GH9 exemplars such as CelI (GH9 with CBM3c/CBM3b) showing processive endoglucanase activity on crystalline cellulose (https://doi.org/10.1128/jb.185.2.391-398.2003; Jan 2003) (gilad2003celianoncellulosomal pages 1-2).

Mechanistic synthesis: reaction, specificity, localization, and pathway role
- Reaction and specificity: CelK (GH9) acts on ฮฒ-1,4-linked glucans of cellulose, contributing both processive chain cleavage and cellobiohydrolase-like activity within the cellulosome, releasing short cello-oligosaccharides (DP2โ€“3; predominantly cellobiose) under standard assay conditions (60ยฐC, pH ~5.8) (https://doi.org/10.1186/s13068-017-0928-4; Oct 2017) (leis2017comparativecharacterizationof pages 1-2, leis2017comparativecharacterizationof pages 7-8). Literature explicitly labels CelK as a cellobiohydrolase and as a major cellulosomal enzyme (Munir 2014) (https://doi.org/10.1371/journal.pone.0104260; Aug 2014) (munir2014comparativeanalysisof pages 13-13).
- Domain-context inference: Proteomics domain mapping places CelK as a modular GH9 with CBM(s), Ig-like repeats, and a C-terminal dockerin (Doc1), explaining its incorporation into the cellulosome and its functionality on ordered cellulose through enhanced substrate targeting and chain feeding (https://doi.org/10.1002/pmic.200401199; Sep 2005) (zverlov2005functionalsubgenomicsof pages 2-4), in line with GH9 exemplars (https://doi.org/10.1128/jb.185.2.391-398.2003; Jan 2003) (gilad2003celianoncellulosomal pages 1-2).
- Localization and pathway: CelK is extracellular and cell-surface-associated via the cellulosome, operating in concert with Cel48S (GH48) and other cellulases to hydrolyze cellulose to cellodextrins that feed into downstream uptake/fermentation pathways of C. thermocellum (https://doi.org/10.1186/s13068-017-0928-4; Oct 2017) (leis2017comparativecharacterizationof pages 7-8). Proteomics consistently detects CelK at high abundance in cellulosome preparations from cellulose/cellobiose growth (https://doi.org/10.1128/jb.00882-07; Oct 2007; https://doi.org/10.1002/pmic.200401199; Sep 2005) (gold2007proteomicanalysisof pages 65-68, zverlov2005functionalsubgenomicsof pages 2-4, gold2007proteomicanalysisof pages 68-69).
- Role relative to Cel48S (CelS): Cel48S is the dominant GH48 exoglucanase operating from reducing ends; CelK (GH9) provides complementary chain cleavage/processivity and is explicitly paired with Cel48S in basic complexes, supporting synergistic cellulose hydrolysis where diverse processivities are necessary for high cellulosome efficiency (https://doi.org/10.1186/s13068-017-0928-4; Oct 2017; https://doi.org/10.3390/fermentation9030204; Feb 2023) (leis2017comparativecharacterizationof pages 7-8, gold2007proteomicanalysisof pages 68-69).

Limitations and open points
- Kinetic parameters: Within the provided excerpts, detailed kcat/Km values for CelK were not reported. Available data include comparative activity/product profiles and proteomic abundance; readers requiring explicit kinetics should consult targeted biochemical characterizations not captured in the retrieved excerpts (leis2017comparativecharacterizationof pages 1-2, leis2017comparativecharacterizationof pages 7-8).

Embedded evidence summary
| Attribute | Evidence summary | Source |
|---|---|---|
| Identity mapping (CelK = Cel9K = Cthe_0412) | CelK is listed as Cel9K (locus Cthe_0412) in comparative catalogs of C. thermocellum cellulosomal enzymes, linking the gene symbol CelK to the GH9 cellulosomal enzyme. | Leis et al., 2017 https://doi.org/10.1186/s13068-017-0928-4; Munir et al., 2014 https://doi.org/10.1371/journal.pone.0104260 (leis2017comparativecharacterizationof pages 7-8, munir2014comparativeanalysisof pages 13-13) |
| Enzyme class and EC | Described in the literature as a cellulase with cellobiohydrolase-like role within the cellulosome; UniProt/earlier reports annotate CelK as a cellobiohydrolase (EC 3.2.1.91), and Munir reports CelK as a major cellulosome component/new cellobiohydrolase. | Munir et al., 2014 https://doi.org/10.1371/journal.pone.0104260; Leis et al., 2017 https://doi.org/10.1186/s13068-017-0928-4 (munir2014comparativeanalysisof pages 13-13, leis2017comparativecharacterizationof pages 7-8) |
| GH family and mechanism (GH9) | Assigned to glycoside hydrolase family 9 (GH9); GH9 enzymes use an inverting mechanism and show diversity in processivity (including processive endoglucanase behaviors that can produce cellobiose intermediates). | Leis et al., 2017 https://doi.org/10.1186/s13068-017-0928-4 (leis2017comparativecharacterizationof pages 1-2) |
| Domain architecture (catalytic GH9; CBMs/Ig-like/dockerin inferred) | Excerpts do not give a complete CelK domain map in these texts; GH9 catalytic domain plus cellulose-binding modules (e.g., CBM3 variants), Ig-like repeats and a dockerin are typical for cellulosomal GH9s โ€” inference supported by detailed GH9 examples (e.g., CelI) and the cellulosomal context. | Gilad et al., 2003 https://doi.org/10.1128/jb.185.2.391-398.2003; Leis et al., 2017 https://doi.org/10.1186/s13068-017-0928-4 (gilad2003celianoncellulosomal pages 1-2, leis2017comparativecharacterizationof pages 7-8) |
| Cellular/extracellular localization | CelK is a cellulosomal (extracellular, cell-surface-associated) enzyme incorporated via dockerinโ€“cohesin interactions into the cellulosome complex. | Leis et al., 2017 https://doi.org/10.1186/s13068-017-0928-4 (leis2017comparativecharacterizationof pages 7-8) |
| Functional role relative to Cel48S (CelS) | Leis et al. list "Cel48S/Cel9K" as a basic partner pairing; Cel48S (a GH48 exo-cellobiohydrolase) acts primarily from the reducing end while Cel9K (GH9) provides complementary cleavage/processivity producing DP2โ€“3 products, so the two act synergistically in the cellulosome. | Leis et al., 2017 https://doi.org/10.1186/s13068-017-0928-4 (leis2017comparativecharacterizationof pages 7-8) |
| Substrate specificity and products | Active on cellulosic substrates (e.g., PASC, Avicel); product analysis in comparative assays shows final soluble products after extended hydrolysis are mainly cellobiose and cellotriose (DP2โ€“3). | Leis et al., 2017 https://doi.org/10.1186/s13068-017-0928-4 (leis2017comparativecharacterizationof pages 7-8) |
| Activity conditions reported in comparative assays | Comparative assays in Leis et al. were performed under cellulosome-relevant conditions (~60ยฐC, pH ~5.8), which were used for product/profile analyses. | Leis et al., 2017 https://doi.org/10.1186/s13068-017-0928-4 (leis2017comparativecharacterizationof pages 7-8) |
| Proteomic detection / abundance | Catalogs and proteomic/functional surveys list CelK among major cellulosomal cellulases, indicating it is a prominent extracellular component in C. thermocellum cellulosomes. | Munir et al., 2014 https://doi.org/10.1371/journal.pone.0104260; Leis et al., 2017 https://doi.org/10.1186/s13068-017-0928-4 (munir2014comparativeanalysisof pages 13-13, leis2017comparativecharacterizationof pages 7-8) |
| Kinetics / quantitative data hints | Leis et al. report measured activity values (relative activities/product profiles) for Cel9K in comparative tables but do not provide full kinetic parameters (kcat/Km) in the provided excerpts; explicit detailed kinetics appear absent from these excerpts. | Leis et al., 2017 https://doi.org/10.1186/s13068-017-0928-4 (leis2017comparativecharacterizationof pages 7-8, leis2017comparativecharacterizationof pages 1-2) |

Table: Concise, cited evidence summarizing identity, classification, domain inferences, localization, role with Cel48S, substrates/products, assay conditions, proteomic presence, and data availability for CelK (Cel9K; Cthe_0412). This table helps quickly map major claims to their source excerpts.

References with URLs and publication dates
- Leis B. et al. Comparative characterization of all cellulosomal cellulases from Clostridium thermocellum. Biotechnology for Biofuels, Oct 2017. https://doi.org/10.1186/s13068-017-0928-4 (leis2017comparativecharacterizationof pages 2-4, leis2017comparativecharacterizationof pages 1-2, leis2017comparativecharacterizationof pages 7-8)
- Gilad R. et al. CelI, a noncellulosomal family 9 enzyme from Clostridium thermocellum, is a processive endoglucanase. Journal of Bacteriology, Jan 2003. https://doi.org/10.1128/jb.185.2.391-398.2003 (gilad2003celianoncellulosomal pages 1-2)
- Munir R.I. et al. Comparative analysis of CAZymes in Clostridium termitidis CT1112 (discussion citing CelK as new cellobiohydrolase and major component in C. thermocellum). PLoS ONE, Aug 2014. https://doi.org/10.1371/journal.pone.0104260 (munir2014comparativeanalysisof pages 13-13)
- Zverlov V.V. et al. Functional subgenomics of C. thermocellum cellulosomal genes: major components and new enzymes. PROTEOMICS, Sep 2005. https://doi.org/10.1002/pmic.200401199 (zverlov2005functionalsubgenomicsof pages 2-4, zverlov2005functionalsubgenomicsof pages 1-2)
- Gold N.D., Martin V.J.J. Global view of the C. thermocellum cellulosome revealed by quantitative proteomics. Journal of Bacteriology, Oct 2007. https://doi.org/10.1128/jb.00882-07 (gold2007globalviewof pages 8-9)
- Gold N. Proteomic analysis of the C. thermocellum cellulosome (tables with emPAI for CelK/Cel48S under Avicel or cellobiose). 2007. (See extracted table summaries) (gold2007proteomicanalysisof pages 65-68, gold2007proteomicanalysisof pages 68-69)
- You C. et al. GH48 cellulase: key player in cellulolytic bacteria for lignocellulose biorefinery. Fermentation, Feb 2023. https://doi.org/10.3390/fermentation9030204 (gold2007proteomicanalysisof pages 68-69)

References

  1. (leis2017comparativecharacterizationof pages 7-8): Benedikt Leis, Claudia Held, Fabian Bergkemper, Katharina Dennemarck, Robert Steinbauer, Alarich Reiter, Matthias Mechelke, Matthias Moerch, Sigrid Graubner, Wolfgang Liebl, Wolfgang H. Schwarz, and Vladimir V. Zverlov. Comparative characterization of all cellulosomal cellulases from clostridium thermocellum reveals high diversity in endoglucanase product formation essential for complex activity. Biotechnology for Biofuels, Oct 2017. URL: https://doi.org/10.1186/s13068-017-0928-4, doi:10.1186/s13068-017-0928-4. This article has 63 citations.

  2. (munir2014comparativeanalysisof pages 13-13): Riffat I. Munir, John Schellenberg, Bernard Henrissat, Tobin J. Verbeke, Richard Sparling, and David B. Levin. Comparative analysis of carbohydrate active enzymes in clostridium termitidis ct1112 reveals complex carbohydrate degradation ability. PLoS ONE, 9:e104260, Aug 2014. URL: https://doi.org/10.1371/journal.pone.0104260, doi:10.1371/journal.pone.0104260. This article has 43 citations and is from a peer-reviewed journal.

  3. (zverlov2005functionalsubgenomicsof pages 2-4): Vladimir V. Zverlov, Josef Kellermann, and Wolfgang H. Schwarz. Functional subgenomics of clostridium thermocellum cellulosomal genes: identification of the major catalytic components in the extracellular complex and detection of three new enzymes. PROTEOMICS, 5:3646-3653, Sep 2005. URL: https://doi.org/10.1002/pmic.200401199, doi:10.1002/pmic.200401199. This article has 194 citations and is from a peer-reviewed journal.

  4. (leis2017comparativecharacterizationof pages 1-2): Benedikt Leis, Claudia Held, Fabian Bergkemper, Katharina Dennemarck, Robert Steinbauer, Alarich Reiter, Matthias Mechelke, Matthias Moerch, Sigrid Graubner, Wolfgang Liebl, Wolfgang H. Schwarz, and Vladimir V. Zverlov. Comparative characterization of all cellulosomal cellulases from clostridium thermocellum reveals high diversity in endoglucanase product formation essential for complex activity. Biotechnology for Biofuels, Oct 2017. URL: https://doi.org/10.1186/s13068-017-0928-4, doi:10.1186/s13068-017-0928-4. This article has 63 citations.

  5. (gilad2003celianoncellulosomal pages 1-2): Rachel Gilad, Larisa Rabinovich, Sima Yaron, Edward A. Bayer, Raphael Lamed, Harry J. Gilbert, and Yuval Shoham. Celi, a noncellulosomal family 9 enzyme from clostridium thermocellum, is a processive endoglucanase that degrades crystalline cellulose. Journal of Bacteriology, 185:391-398, Jan 2003. URL: https://doi.org/10.1128/jb.185.2.391-398.2003, doi:10.1128/jb.185.2.391-398.2003. This article has 165 citations and is from a peer-reviewed journal.

  6. (gold2007proteomicanalysisof pages 65-68): N Gold. Proteomic analysis of the clostridium thermocellum cellulosome. Unknown journal, 2007.

  7. (gold2007proteomicanalysisof pages 68-69): N Gold. Proteomic analysis of the clostridium thermocellum cellulosome. Unknown journal, 2007.

  8. (gold2007globalviewof pages 8-9): Nicholas D. Gold and Vincent J. J. Martin. Global view of the clostridium thermocellum cellulosome revealed by quantitative proteomic analysis. Journal of Bacteriology, 189:6787-6795, Oct 2007. URL: https://doi.org/10.1128/jb.00882-07, doi:10.1128/jb.00882-07. This article has 271 citations and is from a peer-reviewed journal.

  9. (zverlov2005functionalsubgenomicsof pages 1-2): Vladimir V. Zverlov, Josef Kellermann, and Wolfgang H. Schwarz. Functional subgenomics of clostridium thermocellum cellulosomal genes: identification of the major catalytic components in the extracellular complex and detection of three new enzymes. PROTEOMICS, 5:3646-3653, Sep 2005. URL: https://doi.org/10.1002/pmic.200401199, doi:10.1002/pmic.200401199. This article has 194 citations and is from a peer-reviewed journal.

  10. (leis2017comparativecharacterizationof pages 2-4): Benedikt Leis, Claudia Held, Fabian Bergkemper, Katharina Dennemarck, Robert Steinbauer, Alarich Reiter, Matthias Mechelke, Matthias Moerch, Sigrid Graubner, Wolfgang Liebl, Wolfgang H. Schwarz, and Vladimir V. Zverlov. Comparative characterization of all cellulosomal cellulases from clostridium thermocellum reveals high diversity in endoglucanase product formation essential for complex activity. Biotechnology for Biofuels, Oct 2017. URL: https://doi.org/10.1186/s13068-017-0928-4, doi:10.1186/s13068-017-0928-4. This article has 63 citations.

Citations

  1. zverlov2005functionalsubgenomicsof pages 2-4
  2. munir2014comparativeanalysisof pages 13-13
  3. leis2017comparativecharacterizationof pages 1-2
  4. gilad2003celianoncellulosomal pages 1-2
  5. gold2007proteomicanalysisof pages 68-69
  6. leis2017comparativecharacterizationof pages 7-8
  7. gold2007proteomicanalysisof pages 65-68
  8. gold2007globalviewof pages 8-9
  9. zverlov2005functionalsubgenomicsof pages 1-2
  10. leis2017comparativecharacterizationof pages 2-4
  11. https://doi.org/10.1371/journal.pone.0104260;
  12. https://doi.org/10.1186/s13068-017-0928-4;
  13. https://doi.org/10.1002/pmic.200401199;
  14. https://doi.org/10.1128/jb.185.2.391-398.2003;
  15. https://doi.org/10.3390/fermentation9030204;
  16. https://doi.org/10.1128/jb.00882-07;
  17. https://doi.org/10.1371/journal.pone.0104260
  18. https://doi.org/10.1186/s13068-017-0928-4
  19. https://doi.org/10.1128/jb.185.2.391-398.2003
  20. https://doi.org/10.1002/pmic.200401199
  21. https://doi.org/10.1128/jb.00882-07
  22. https://doi.org/10.3390/fermentation9030204
  23. https://doi.org/10.1186/s13068-017-0928-4,
  24. https://doi.org/10.1371/journal.pone.0104260,
  25. https://doi.org/10.1002/pmic.200401199,
  26. https://doi.org/10.1128/jb.185.2.391-398.2003,
  27. https://doi.org/10.1128/jb.00882-07,

๐Ÿ“„ View Raw YAML

id: A3DCH1
gene_symbol: celK
product_type: PROTEIN
status: DRAFT
taxon:
  id: NCBITaxon:203119
  label: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC
    103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372)
description: CelK (also known as Cel9K, locus Cthe_0412) is a major cellulosomal cellulase
  belonging to glycoside hydrolase family 9 (GH9) in Acetivibrio thermocellus (formerly
  Clostridium thermocellum). It functions as a cellulose 1,4-beta-cellobiosidase (EC
  3.2.1.91) that hydrolyzes beta-1,4-glucosidic linkages in cellulose, releasing cellobiose
  and cellotriose from the non-reducing ends of cellulose chains. The enzyme has a
  modular architecture consisting of a CBM4 carbohydrate-binding module, an Ig-like
  domain, the GH9 catalytic domain, and a C-terminal type I dockerin domain that mediates
  integration into the cellulosome complex. CelK works synergistically with Cel48S
  (the major GH48 cellobiohydrolase) in the cellulosome to efficiently degrade crystalline
  cellulose.
existing_annotations:
- term:
    id: GO:0000272
    label: polysaccharide catabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: CelK is a cellulase that degrades cellulose, which is a polysaccharide.
      The annotation to polysaccharide catabolic process is correct but overly broad.
      CelK specifically catalyzes the breakdown of cellulose (a beta-1,4-glucan),
      not polysaccharides in general.
    action: MODIFY
    reason: While cellulose is a polysaccharide and CelK does participate in polysaccharide
      catabolism, this term is too general. The more specific term GO:0030245 (cellulose
      catabolic process) already exists in the annotation set and accurately captures
      the biological process. This broader term provides less informative annotation.
    proposed_replacement_terms:
    - id: GO:0030245
      label: cellulose catabolic process
    additional_reference_ids:
    - PMID:29075324
    - file:ACET2/celK/celK-deep-research-falcon.md
    supported_by:
    - reference_id: PMID:29075324
      supporting_text: CBHs (exo-acting from the sugar ends) released specifically
        cellobiose as the only product over time (Cel48S, Cel9K, and Cbh9A)
- term:
    id: GO:0004553
    label: hydrolase activity, hydrolyzing O-glycosyl compounds
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: CelK is a GH9 family glycoside hydrolase that hydrolyzes beta-1,4-glucosidic
      bonds in cellulose. This annotation is correct but too general - the enzyme
      has specific cellobiohydrolase activity (EC 3.2.1.91) rather than generic glycosyl
      hydrolase activity.
    action: ACCEPT
    reason: While GO:0016162 (cellulose 1,4-beta-cellobiosidase activity) is more
      specific and preferred, this parent term is technically accurate. Since the
      more specific term is already annotated, this broader annotation is acceptable
      as it provides correct hierarchical coverage. IEA annotations often propagate
      from broader InterPro signatures and this is consistent with the enzyme's function.
    additional_reference_ids:
    - file:ACET2/celK/celK-deep-research-falcon.md
    supported_by:
    - reference_id: PMID:29075324
      supporting_text: exo-acting cellobiohydrolases thread the cellulose molecule
        from its free cellulose chain end through a tunnel built by loop structures
        around the active site
- term:
    id: GO:0005576
    label: extracellular region
  evidence_type: IEA
  original_reference_id: GO_REF:0000044
  review:
    summary: CelK is a secreted protein that incorporates into the extracellular cellulosome
      complex via its dockerin domain. The signal peptide (residues 1-27) is cleaved
      upon secretion. Proteomics confirms CelK is abundant in cellulosome preparations
      from cellulose-grown cells.
    action: MODIFY
    reason: While CelK is secreted to the extracellular region, a more specific cellular
      component annotation would be GO:0043263 (cellulosome), as CelK is a bona fide
      cellulosomal enzyme that integrates via dockerin-cohesin interactions. The cellulosome
      is a defined extracellular multi-enzyme complex in C. thermocellum.
    proposed_replacement_terms:
    - id: GO:0043263
      label: cellulosome
    additional_reference_ids:
    - PMID:16127726
    - PMID:17644599
    - file:ACET2/celK/celK-deep-research-falcon.md
    supported_by:
    - reference_id: PMID:16127726
      supporting_text: the cellobiohydrolases Cbh9A, Cel9K, Cel48S
    - reference_id: PMID:17644599
      supporting_text: exoglucanases CelS and CelK
- term:
    id: GO:0005975
    label: carbohydrate metabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: CelK participates in carbohydrate metabolism by degrading cellulose.
      This annotation is correct but extremely broad - CelK's specific role is in
      cellulose catabolism.
    action: ACCEPT
    reason: This is a very high-level biological process term that is technically
      accurate but provides minimal information. Since more specific BP annotations
      exist (GO:0030245 cellulose catabolic process), this broad term is acceptable
      as hierarchical context but not informative as a standalone annotation. IEA
      from InterPro is appropriate for this level of granularity.
    additional_reference_ids:
    - file:ACET2/celK/celK-deep-research-falcon.md
- term:
    id: GO:0008810
    label: cellulase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: CelK has cellulase activity as a GH9 family enzyme. However, the GO definition
      of cellulase activity (GO:0008810) specifically refers to endohydrolysis, whereas
      CelK functions as a cellobiohydrolase (exo-acting, EC 3.2.1.91) releasing cellobiose
      from non-reducing ends.
    action: MODIFY
    reason: GO:0008810 (cellulase activity) is defined as "catalysis of the endohydrolysis
      of (1->4)-beta-D-glucosidic linkages in cellulose". CelK is classified as EC
      3.2.1.91 (cellulose 1,4-beta-cellobiosidase), which is an exo-acting enzyme
      that releases cellobiose from the non-reducing ends. The correct annotation
      is GO:0016162 (cellulose 1,4-beta-cellobiosidase activity), which is already
      present in the annotation set.
    proposed_replacement_terms:
    - id: GO:0016162
      label: cellulose 1,4-beta-cellobiosidase activity
    additional_reference_ids:
    - PMID:29075324
    - file:ACET2/celK/celK-deep-research-falcon.md
    supported_by:
    - reference_id: PMID:29075324
      supporting_text: As expected, for the CBHs Cbh9A, Cel48S, and Cel9K no or very
        weak activities on modified substrates were observed, whereas on PASC substantial
        product formation was found
- term:
    id: GO:0016162
    label: cellulose 1,4-beta-cellobiosidase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000003
  review:
    summary: 'This is the most accurate molecular function annotation for CelK. The
      enzyme is classified as EC 3.2.1.91 (cellulose 1,4-beta-cellobiosidase), which
      matches this GO term exactly. UniProt catalytic activity annotation confirms:
      "Hydrolysis of (1->4)-beta-D-glucosidic linkages in cellulose and cellotetraose,
      releasing cellobiose from the non-reducing ends of the chains."'
    action: ACCEPT
    reason: This is the correct and most specific molecular function annotation for
      CelK. The annotation is derived from EC:3.2.1.91 mapping, which is the experimentally
      determined enzyme classification for this protein. This represents the core
      catalytic function of CelK.
    additional_reference_ids:
    - PMID:29075324
    - file:ACET2/celK/celK-deep-research-falcon.md
    supported_by:
    - reference_id: PMID:29075324
      supporting_text: CBHs (exo-acting from the sugar ends) released specifically
        cellobiose as the only product over time (Cel48S, Cel9K, and Cbh9A)
- term:
    id: GO:0016787
    label: hydrolase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: CelK is indeed a hydrolase - it catalyzes the hydrolytic cleavage of
      beta-1,4-glucosidic bonds. This annotation is correct but extremely general.
    action: ACCEPT
    reason: This is the root molecular function term for hydrolases. While technically
      accurate, it provides minimal information. More specific annotations (GO:0016162,
      GO:0004553, GO:0016798) are already present and preferred. This broad term is
      acceptable as it provides hierarchical consistency but is not informative as
      a standalone annotation.
    additional_reference_ids:
    - file:ACET2/celK/celK-deep-research-falcon.md
- term:
    id: GO:0016798
    label: hydrolase activity, acting on glycosyl bonds
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: CelK hydrolyzes glycosyl bonds (specifically beta-1,4-glucosidic linkages
      in cellulose). This annotation is correct and at an appropriate intermediate
      level of specificity.
    action: ACCEPT
    reason: This term accurately describes CelK's molecular function at an intermediate
      level of specificity. While GO:0016162 is more specific, this parent term is
      correctly applied. IEA annotation from InterPro domain signatures appropriately
      captures this level of function.
    additional_reference_ids:
    - file:ACET2/celK/celK-deep-research-falcon.md
- term:
    id: GO:0030245
    label: cellulose catabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: CelK is directly involved in cellulose catabolism as a major cellulosomal
      cellulase. The enzyme degrades crystalline cellulose to cellobiose and cellotriose
      as primary products. This is the most appropriate biological process annotation
      for CelK.
    action: ACCEPT
    reason: This is the correct and most specific biological process annotation for
      CelK. The enzyme's primary biological role is cellulose degradation as part
      of the cellulosome complex. This annotation accurately represents the core biological
      process in which CelK participates.
    additional_reference_ids:
    - PMID:29075324
    - file:ACET2/celK/celK-deep-research-falcon.md
    supported_by:
    - reference_id: PMID:29075324
      supporting_text: After prolonged incubation times (overnight), the final products
        are mainly cellobiose and cellotriose (DP 2 to DP 3)
- term:
    id: GO:0043263
    label: cellulosome
  evidence_type: IEA
  original_reference_id: GO_REF:0000044
  review:
    summary: CelK contains a C-terminal type I dockerin domain (residues 828-894)
      that mediates integration into the cellulosome via interaction with cohesin
      domains on the CipA scaffoldin protein. Proteomics confirms CelK is one of the
      most abundant cellulosomal enzyme subunits.
    action: NEW
    reason: This annotation is not currently in the GOA file but is strongly supported
      by the domain architecture (dockerin domain) and proteomics data showing CelK
      as a major cellulosomal component. This is the most specific and accurate cellular
      component annotation.
    additional_reference_ids:
    - PMID:17644599
    - PMID:16127726
    - file:ACET2/celK/celK-deep-research-falcon.md
    supported_by:
    - reference_id: PMID:17644599
      supporting_text: exoglucanases CelS and CelK
    - reference_id: PMID:16127726
      supporting_text: the cellobiohydrolases Cbh9A, Cel9K, Cel48S
    - reference_id: file:ACET2/celK/celK-deep-research-falcon.md
      supporting_text: CelK is a secreted, extracellular cellulosomal subunit that
        incorporates into the cellulosome via type-I dockerin interactions
references:
- id: GO_REF:0000002
  title: Gene Ontology annotation through association of InterPro records with GO
    terms
  findings: []
- id: GO_REF:0000003
  title: Gene Ontology annotation based on Enzyme Commission mapping
  findings: []
- id: GO_REF:0000043
  title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
  findings: []
- id: GO_REF:0000044
  title: Gene Ontology annotation based on UniProtKB/Swiss-Prot Subcellular Location
    vocabulary mapping, accompanied by conservative changes to GO terms applied by
    UniProt
  findings: []
- id: GO_REF:0000120
  title: Combined Automated Annotation using Multiple IEA Methods
  findings: []
- id: PMID:29075324
  title: Comparative characterization of all cellulosomal cellulases from Clostridium
    thermocellum
  findings:
  - statement: Cel9K is classified as a cellobiohydrolase (CBH) in Table 1
    supporting_text: Cel9K0412GH9Inverting
  - statement: CBHs including Cel9K release cellobiose as the main product
    supporting_text: CBHs (exo-acting from the sugar ends) released specifically cellobiose
      as the only product over time (Cel48S, Cel9K, and Cbh9A)
  - statement: Cel9K shows weak activity on modified substrates but substantial activity
      on PASC
    supporting_text: As expected, for the CBHs Cbh9A, Cel48S, and Cel9K no or very
      weak activities on modified substrates were observed, whereas on PASC substantial
      product formation was found
- id: PMID:16127726
  title: Functional subgenomics of Clostridium thermocellum cellulosomal genes
  findings:
  - statement: Cel9K is identified as a major cellulosomal cellobiohydrolase
    supporting_text: the cellobiohydrolases Cbh9A, Cel9K, Cel48S
- id: PMID:17644599
  title: Global view of the Clostridium thermocellum cellulosome revealed by quantitative
    proteomic analysis
  findings:
  - statement: CelK shows higher expression in cellulosomes from cellulose-grown cells
    supporting_text: exoglucanases CelS and CelK
- id: file:ACET2/celK/celK-deep-research-falcon.md
  title: Deep research literature review for celK
  findings:
  - statement: CelK identity confirmed as Cel9K with locus Cthe_0412
    supporting_text: The target is celK from Acetivibrio thermocellus (Clostridium
      thermocellum). Comparative catalogs and proteomics explicitly map CelK to Cel9K
      with locus tag Cthe_0412
  - statement: CelK is a cellulosomal subunit with type-I dockerin
    supporting_text: CelK is a secreted, extracellular cellulosomal subunit that incorporates
      into the cellulosome via type-I dockerin interactions
  - statement: CelK belongs to GH9 family with inverting mechanism
    supporting_text: CelK belongs to glycoside hydrolase family 9 (GH9), which utilize
      an inverting catalytic mechanism
core_functions:
- description: CelK functions as a cellulose 1,4-beta-cellobiosidase that hydrolyzes
    beta-1,4-glucosidic linkages in crystalline cellulose, releasing cellobiose from
    the non-reducing ends of cellulose chains. It operates as a component of the cellulosome
    complex, working synergistically with Cel48S (GH48) to efficiently degrade plant
    cell wall polysaccharides.
  molecular_function:
    id: GO:0016162
    label: cellulose 1,4-beta-cellobiosidase activity
  directly_involved_in:
  - id: GO:0030245
    label: cellulose catabolic process
  locations:
  - id: GO:0043263
    label: cellulosome
  substrates:
  - id: CHEBI:18246
    label: cellulose
  in_complex:
    id: GO:0043263
    label: cellulosome
  supported_by:
  - reference_id: PMID:29075324
    supporting_text: CBHs (exo-acting from the sugar ends) released specifically cellobiose
      as the only product over time (Cel48S, Cel9K, and Cbh9A)
suggested_questions:
- question: What is the precise processivity of CelK - does it remain attached to
    the cellulose chain after each catalytic event or does it dissociate and rebind?
  experts:
  - Wolfgang H. Schwarz
  - Vladimir V. Zverlov
- question: What are the kinetic parameters (kcat, Km) for CelK on various cellulosic
    substrates (Avicel, PASC, CMC)?
  experts:
  - Benedikt Leis
  - Wolfgang Liebl
suggested_experiments:
- hypothesis: CelK exhibits processive cellobiohydrolase activity on crystalline cellulose
  description: Perform single-molecule fluorescence microscopy to track individual
    CelK molecules on cellulose nanocrystals. Measure processivity (number of catalytic
    events per encounter) and compare to other GH9 family members with known processivities.
  experiment_type: Single-molecule enzymology
- hypothesis: CelK and Cel48S show synergistic enhancement of cellulose hydrolysis
  description: Measure hydrolysis rates of crystalline cellulose (Avicel) using CelK
    alone, Cel48S alone, and equimolar combinations. Quantify synergy as the ratio
    of combined activity to the sum of individual activities. Test at various enzyme
    ratios to determine optimal stoichiometry.
  experiment_type: Enzyme kinetics / synergy assay
tags:
- cellulosome