xghA

UniProt ID: Q70DK5
Organism: Acetivibrio thermocellus
Review Status: COMPLETE
📝 Provide Detailed Feedback

Gene Description

Xyloglucanase Xgh74A is a GH74 family endo-xyloglucanase that is a major component of the Acetivibrio thermocellus cellulosome. It catalyzes the endohydrolysis of (1->4)-beta-D-glucosidic linkages in xyloglucan, preferentially cleaving at unbranched glucose residues in the backbone to produce xyloglucan oligosaccharides (XXXG, XLXG, XXLG, XLLG). The enzyme contains a C-terminal type I dockerin domain that mediates attachment to the CipA scaffoldin protein, thereby incorporating it into the extracellular cellulosome complex. It has high specific activity on tamarind seed xyloglucan (295 U/mg) with optimal activity at pH 6.4 and 75C, and shows only low activity on CMC and no activity on amorphous cellulose, indicating recognition of xylosidic side chains present in xyloglucan. Crystal structures have been determined revealing an open binding groove accommodating long oligosaccharides.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0000272 polysaccharide catabolic process
IEA
GO_REF:0000120
ACCEPT
Summary: This IEA annotation infers involvement in polysaccharide catabolic process based on InterPro domain matches (dockerin domains IPR002105, IPR016134, IPR036439) and UniProtKB keywords. Xgh74A does function in polysaccharide catabolism, specifically xyloglucan degradation, as demonstrated experimentally (PMID:16207921). However, this term is very broad and lacks specificity for the actual substrate.
Reason: While this annotation is correct in a general sense - Xgh74A does participate in polysaccharide catabolism - it is overly broad. However, the more specific term GO:2000899 (xyloglucan catabolic process) is already annotated with IDA evidence, making this broader IEA annotation acceptable as a correctly inferred parent term. The enzymatic evidence from PMID:16207921 demonstrates hydrolysis of xyloglucan producing oligosaccharides.
Supporting Evidence:
PMID:16207921
It hydrolyses every fourth beta-1,4-glucan bond in the xyloglucan backbone, thus producing decorated cellotetraose units.
file:ACET2/Q70DK5/Q70DK5-deep-research-falcon.md
Xgh74A contributes to degradation of xyloglucan, a major hemicellulose entwined with cellulose microfibrils
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
IEA
GO_REF:0000120
ACCEPT
Summary: This IEA annotation from InterPro/ARBA infers a general O-glycosyl hydrolase activity based on domain matches. Xgh74A is indeed an O-glycosyl hydrolase, specifically an endo-xyloglucanase that cleaves beta-1,4-glucosidic linkages. The more specific term GO:0033946 (xyloglucan-specific endo-beta-1,4-glucanase activity) is annotated with experimental evidence.
Reason: This is a correct parent term for the experimentally validated xyloglucan-specific endo-beta-1,4-glucanase activity (GO:0033946). The enzyme hydrolyzes O-glycosyl bonds in xyloglucan as demonstrated in PMID:16207921. The crystal structure study (PMID:16772298) further characterized the active site and substrate binding.
Supporting Evidence:
PMID:16207921
Xyloglucanase Xgh74A contains a catalytic module of GHF74 in addition to a C-terminal dockerin module. It hydrolyses every fourth beta-1,4-glucan bond in the xyloglucan backbone
GO:0010411 xyloglucan metabolic process
IEA
GO_REF:0000118
ACCEPT
Summary: This TreeGrafter-generated annotation correctly identifies involvement in xyloglucan metabolism based on phylogenetic inference (PANTHER:PTN005180743). Xgh74A is experimentally demonstrated to hydrolyze xyloglucan. However, the more specific child term GO:2000899 (xyloglucan catabolic process) is already annotated with IDA evidence.
Reason: This is a correct but broad annotation. The enzyme specifically catalyzes xyloglucan catabolism, which is a child process of xyloglucan metabolic process. Since the more specific term is already annotated with experimental evidence, this IEA serves as a correctly inferred parent term. The TreeGrafter phylogenetic inference correctly places this GH74 family enzyme in the xyloglucan metabolism pathway.
Supporting Evidence:
PMID:16207921
Xgh74A is the first xyloglucanase identified in C. thermocellum and the only enzyme in the cellulosome that hydrolyses tamarind xyloglucan.
GO:0016787 hydrolase activity
IEA
GO_REF:0000043
ACCEPT
Summary: This very broad IEA annotation from UniProtKB-KW:KW-0378 (Hydrolase) simply indicates that the protein has hydrolase activity. While technically correct, this term provides minimal informational value given the more specific annotations available.
Reason: This is a correct but very high-level annotation that is a grandparent of the experimentally validated specific activity. The UniProtKB keyword mapping correctly identifies the protein as a hydrolase. More specific child terms (GO:0016798, GO:0004553, GO:0033946) are also annotated, providing greater specificity.
Supporting Evidence:
PMID:16207921
It hydrolyses every fourth beta-1,4-glucan bond in the xyloglucan backbone
GO:0016798 hydrolase activity, acting on glycosyl bonds
IEA
GO_REF:0000043
ACCEPT
Summary: This IEA annotation from UniProtKB-KW:KW-0326 (Glycosidase) correctly identifies the protein as having glycosidase activity. Xgh74A is a GH74 family glycoside hydrolase with demonstrated endo-glucanase activity on xyloglucan.
Reason: This is a correct parent term for the more specific xyloglucan-specific endo-beta-1,4-glucanase activity (GO:0033946). The UniProtKB keyword-based inference correctly classifies this enzyme as acting on glycosyl bonds, consistent with its GH74 family membership and demonstrated hydrolysis of beta-1,4-glucosidic linkages in xyloglucan.
Supporting Evidence:
PMID:16207921
Xyloglucanase Xgh74A contains a catalytic module of GHF74 in addition to a C-terminal dockerin module.
GO:0030245 cellulose catabolic process
IEA
GO_REF:0000043
REMOVE
Summary: This IEA annotation infers involvement in cellulose catabolism from UniProtKB-KW:KW-0136 (Cellulose degradation). However, experimental evidence clearly demonstrates that Xgh74A is NOT a cellulase - it has only very low activity on CMC and NO activity on amorphous cellulose. Its primary substrate is xyloglucan, a hemicellulose.
Reason: This annotation is incorrect. PMID:16207921 explicitly states that Xgh74A has "low activity on CMC" and "no activity on amorphous cellulose", indicating it does not function in cellulose catabolism. The UniProt keyword "Cellulose degradation" appears to be a misattribution likely based on its cellulosome association and GH74 family membership, but experimental evidence clearly shows xyloglucan is the primary substrate. The enzyme's recognition of xylosidic side chains distinguishes it from true cellulases.
Supporting Evidence:
PMID:16207921
Its low activity on CMC and lack of activity on amorphous cellulose indicates recognition of the xylosidic side chains present in xyloglucan, which is readily hydrolysed (295 U mg(-1)).
GO:0033946 xyloglucan-specific endo-beta-1,4-glucanase activity
IDA
PMID:16207921
Two new major subunits in the cellulosome of Clostridium the...
ACCEPT
Summary: This IDA annotation correctly captures the specific molecular function of Xgh74A. The enzyme was biochemically characterized and shown to hydrolyze xyloglucan with high specificity (295 U/mg on tamarind seed xyloglucan), cleaving every fourth beta-1,4-glucan bond in the backbone to produce decorated cellotetraose units (XXXG, XLXG, XXLG, XLLG). The crystal structure (PMID:16772298) further confirmed this activity and identified the catalytic residues.
Reason: This is the most specific and accurate MF annotation for Xgh74A, directly supported by experimental evidence demonstrating endo-xyloglucanase activity. The enzyme belongs to GH74 family known for this activity, and the hydrolysis pattern producing decorated cellotetraose units is characteristic of endo-xyloglucanases. This represents the core molecular function of the protein.
Supporting Evidence:
PMID:16207921
Xyloglucanase Xgh74A contains a catalytic module of GHF74 in addition to a C-terminal dockerin module. It hydrolyses every fourth beta-1,4-glucan bond in the xyloglucan backbone, thus producing decorated cellotetraose units.
PMID:16207921
The pattern of the hydrolysis products from tamarind xyloglucan resembles that of other GHF74 xyloglucan endoglucanases.
file:ACET2/Q70DK5/Q70DK5-deep-research-falcon.md
Functions as an endo-xyloglucanase (cleaves xyloglucan backbone); reported preference to cleave at unbranched glucosyl residues in xyloglucan
GO:0043263 cellulosome
IDA
PMID:16207921
Two new major subunits in the cellulosome of Clostridium the...
ACCEPT
Summary: This IDA annotation correctly identifies Xgh74A as a cellulosome component. The enzyme contains a C-terminal type I dockerin domain that mediates incorporation into the cellulosome by binding to cohesin modules on the CipA scaffoldin protein. Proteomic studies have confirmed Xgh74A as a major extracellular cellulosome component in C. thermocellum.
Reason: This is the correct CC annotation for Xgh74A. The presence of a functional type I dockerin domain (experimentally validated through proteomic detection in cellulosome preparations) indicates localization to the cellulosome complex. The enzyme was identified as one of "two new major subunits in the cellulosome" in PMID:16207921. Additional proteomic studies (Gold & Martin 2007, Zverlov et al. 2005 Proteomics) have confirmed its presence in extracellular cellulosome preparations.
Supporting Evidence:
PMID:16207921
The structure and enzymic activity of xyloglucanase Xgh74A and endoxylanase Xyn10D, components in the cellulosomes of cellulose-grown Clostridium thermocellum, were determined.
PMID:16207921
Xyloglucanase Xgh74A contains a catalytic module of GHF74 in addition to a C-terminal dockerin module.
file:ACET2/Q70DK5/Q70DK5-deep-research-falcon.md
Xgh74A is an extracellular, dockerin-bearing cellulosomal enzyme incorporated into CipA scaffoldin complexes
GO:2000899 xyloglucan catabolic process
IDA
PMID:16207921
Two new major subunits in the cellulosome of Clostridium the...
ACCEPT
Summary: This IDA annotation correctly identifies Xgh74A as involved in xyloglucan catabolism. The enzyme specifically degrades xyloglucan (a hemicellulose) by hydrolyzing the glucan backbone, contributing to plant cell wall deconstruction. It is the only xyloglucanase identified in the C. thermocellum cellulosome.
Reason: This is the most specific and accurate BP annotation for Xgh74A. The enzyme was experimentally demonstrated to hydrolyze tamarind seed xyloglucan with high specific activity, producing xylogluco-oligosaccharides. This represents the core biological process in which the enzyme functions - the catabolic breakdown of xyloglucan hemicellulose as part of lignocellulose deconstruction.
Supporting Evidence:
PMID:16207921
Xgh74A is the first xyloglucanase identified in C. thermocellum and the only enzyme in the cellulosome that hydrolyses tamarind xyloglucan.
PMID:16207921
The data indicate that Xgh74A and Xyn10D contribute to the in vivo degradation of the hemicelluloses xyloglucan and xylan by the cellulosome of C. thermocellum.

Core Functions

The primary molecular function of Xgh74A is xyloglucan-specific endo-beta-1,4-glucanase activity. It catalyzes the endohydrolysis of (1->4)-beta-D-glucosidic linkages specifically in xyloglucan, cleaving at unbranched glucose residues to produce decorated cellotetraose units (xylogluco-oligosaccharides). The enzyme shows substrate specificity for xyloglucan over cellulose, recognizing the xylosidic side chains of the substrate. The enzyme is localized to the extracellular cellulosome complex via its C-terminal type I dockerin domain and participates in xyloglucan catabolism as part of hemicellulose degradation.

References

Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
TreeGrafter-generated GO annotations
Combined Automated Annotation using Multiple IEA Methods
Two new major subunits in the cellulosome of Clostridium thermocellum: xyloglucanase Xgh74A and endoxylanase Xyn10D.
  • Xgh74A identified as a major cellulosome component with GH74 catalytic domain and type I dockerin
    "Xyloglucanase Xgh74A contains a catalytic module of GHF74 in addition to a C-terminal dockerin module."
  • Demonstrates endo-xyloglucanase activity, cleaving every fourth beta-1,4-glucan bond in xyloglucan backbone
    "It hydrolyses every fourth beta-1,4-glucan bond in the xyloglucan backbone, thus producing decorated cellotetraose units."
  • High specific activity on tamarind seed xyloglucan (295 U/mg), producing XXXG/XLXG/XXLG/XLLG products
    "xyloglucan, which is readily hydrolysed (295 U mg(-1))"
  • Low activity on CMC, no activity on amorphous cellulose - distinguishing it from cellulases
    "Its low activity on CMC and lack of activity on amorphous cellulose indicates recognition of the xylosidic side chains present in xyloglucan"
  • First xyloglucanase identified in C. thermocellum cellulosome
    "Xgh74A is the first xyloglucanase identified in C. thermocellum and the only enzyme in the cellulosome that hydrolyses tamarind xyloglucan."
Crystal structures of Clostridium thermocellum xyloglucanase, XGH74A, reveal the structural basis for xyloglucan recognition and degradation.
  • Crystal structure reveals complex subsite specificity accommodating seventeen monosaccharide moieties
    "The structures, in combination with mutagenesis data on the catalytic residues and the kinetics and specificity of xyloglucan hydrolysis reveal a complex subsite specificity accommodating seventeen monosaccharide moieties of the multibranched substrate in an open substrate binding terrain."
file:ACET2/Q70DK5/Q70DK5-deep-research-falcon.md
Deep research summary for xghA/Q70DK5
  • Xgh74A is extracellular and incorporated into the C. thermocellum cellulosome through its type I dockerin
    "Xgh74A is an extracellular, dockerin-bearing cellulosomal enzyme incorporated into CipA scaffoldin complexes"
  • Functions as endo-xyloglucanase with preference for cleavage at unbranched glucosyl residues
    "Functions as an endo-xyloglucanase (cleaves xyloglucan backbone); reported preference to cleave at unbranched glucosyl residues in xyloglucan"

Suggested Questions for Experts

Q: Has the relative contribution of Xgh74A to overall cellulosome efficiency on different plant biomass feedstocks been quantified?

Q: Are there conditions under which xghA expression or Xgh74A incorporation into cellulosomes is differentially regulated?

Suggested Experiments

Experiment: Comparative activity assays on different plant-derived xyloglucans (beyond tamarind) to assess substrate range

Experiment: Cellulosome reconstitution experiments with and without Xgh74A to quantify synergistic contributions to plant biomass deconstruction

📚 Additional Documentation

Deep Research Falcon

(Q70DK5-deep-research-falcon.md)

provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2025-12-26T17:03:27.641084'
end_time: '2025-12-26T17:09:21.761980'
duration_seconds: 354.12
template_file: templates/gene_research_go_focused.md
template_variables:
organism: ACET2
gene_id: Q70DK5
gene_symbol: xghA
uniprot_accession: Q70DK5
protein_description: 'RecName: Full=Xyloglucanase Xgh74A {ECO:0000303|PubMed:16207921};
EC=3.2.1.-; Flags: Precursor;'
gene_info: Name=xghA {ECO:0000312|EMBL:CAE51306.1};
organism_full: Acetivibrio thermocellus (Hungateiclostridium thermocellum) (Clostridium
thermocellum).
protein_family: Belongs to the glycosyl hydrolase 74 family.
protein_domains: Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134); Dockerin_dom_sf.
(IPR036439); EF_Hand_1_Ca_BS. (IPR018247); WD40/YVTN_repeat-like_dom_sf. (IPR015943)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 17


Question

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: Q70DK5
  • Protein Description: RecName: Full=Xyloglucanase Xgh74A {ECO:0000303|PubMed:16207921}; EC=3.2.1.-; Flags: Precursor;
  • Gene Information: Name=xghA {ECO:0000312|EMBL:CAE51306.1};
  • Organism (full): Acetivibrio thermocellus (Hungateiclostridium thermocellum) (Clostridium thermocellum).
  • Protein Family: Belongs to the glycosyl hydrolase 74 family.
  • Key Domains: Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134); Dockerin_dom_sf. (IPR036439); EF_Hand_1_Ca_BS. (IPR018247); WD40/YVTN_repeat-like_dom_sf. (IPR015943)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "xghA" matches the protein description above
  2. Verify the organism is correct: Acetivibrio thermocellus (Hungateiclostridium thermocellum) (Clostridium thermocellum).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'xghA' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene xghA (gene ID: Q70DK5, UniProt: Q70DK5) in ACET2.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output

Question: You are an expert researcher providing comprehensive, well-cited information.

Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies

Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: Q70DK5
  • Protein Description: RecName: Full=Xyloglucanase Xgh74A {ECO:0000303|PubMed:16207921}; EC=3.2.1.-; Flags: Precursor;
  • Gene Information: Name=xghA {ECO:0000312|EMBL:CAE51306.1};
  • Organism (full): Acetivibrio thermocellus (Hungateiclostridium thermocellum) (Clostridium thermocellum).
  • Protein Family: Belongs to the glycosyl hydrolase 74 family.
  • Key Domains: Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134); Dockerin_dom_sf. (IPR036439); EF_Hand_1_Ca_BS. (IPR018247); WD40/YVTN_repeat-like_dom_sf. (IPR015943)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "xghA" matches the protein description above
  2. Verify the organism is correct: Acetivibrio thermocellus (Hungateiclostridium thermocellum) (Clostridium thermocellum).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'xghA' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene xghA (gene ID: Q70DK5, UniProt: Q70DK5) in ACET2.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Plan
Objective 1 (completed): Verified xghA identity and organism; confirmed GH74 catalytic domain with type I dockerin; cellulosomal localization in Acetivibrio/Hungateiclostridium thermocellum (proteomic/genomic evidence). Evidence: Zverlov 2005 Proteomics; Gold & Martin 2007 J Bacteriol (zverlov2005functionalsubgenomicsof pages 1-2, zverlov2005functionalsubgenomicsof pages 2-4, gold2007globalviewof pages 1-2).
Objective 2: Gather recent and foundational sources on Xgh74A/GH74/cellulosomes, prioritizing 2023–2024 for context. Evidence gathered includes 2024 genomic survey of cellulosome-displaying bacteria (minor2024agenomicanalysis pages 6-7) and foundational proteomic/structural context (guerreiro2008molecularmechanismsof pages 57-62, zverlov2005functionalsubgenomicsof pages 1-2, zverlov2005functionalsubgenomicsof pages 2-4, gold2007globalviewof pages 1-2, ravachol2016mechanismsinvolvedin pages 15-16).
Objective 3: Extract detailed functional annotations: enzymatic reaction, specificity, localization, structure, regulation, and role in biomass deconstruction (see report and embedded artifact). Evidence: as cited.
Objective 4: Compile applications and implementations in biomass deconstruction/CBP, and enzyme cocktails. Evidence: Biswal 2022 for CBP metrics; cellulosome context from 2024 genomic analysis and earlier proteomics (minor2024agenomicanalysis pages 6-7, gold2007globalviewof pages 1-2).
Objective 5: Synthesize expert opinions, statistics, and data with URLs/dates; cite all claims. Evidence: all PQAC sources listed below.
Objective 6: Create a concise evidence table artifact for traceability.
Objective 7: Write and submit the comprehensive research report.

Comprehensive research report: xghA (UniProt Q70DK5) in Acetivibrio/Hungateiclostridium thermocellum

1) Key concepts and definitions with current understanding
- Gene/protein identity and organism verification. xghA encodes Xyloglucanase Xgh74A, a secreted, cellulosome-associated enzyme from Clostridium thermocellum, now classified as Acetivibrio thermocellus (also referenced as Hungateiclostridium thermocellum). Proteomic/genomic analyses first identified Xgh74A as a major extracellular cellulosomal component when cellulosomes from cellulose-grown cultures were analyzed; xghA encodes a ~92 kDa glycoside hydrolase family 74 (GH74) catalytic module fused to a type I dockerin (Doc1) that tethers the enzyme to the primary scaffoldin CipA (zverlov2005functionalsubgenomicsof pages 1-2, zverlov2005functionalsubgenomicsof pages 2-4, gold2007globalviewof pages 1-2). URL/Date examples: Proteomics 2005: https://doi.org/10.1002/pmic.200401199 (Sep 2005). Journal of Bacteriology 2007: https://doi.org/10.1128/jb.00882-07 (Oct 2007).
- Protein family and domains. Xgh74A belongs to GH74, whose members are endo-xyloglucanases acting on the β-1,4-glucan backbone of xyloglucan. In C. thermocellum, Xgh74A is modular: GH74 catalytic domain + type I dockerin; this arrangement is characteristic of cellulosomal DocGH enzymes that integrate into CipA via Coh1–Doc1 interactions (zverlov2005functionalsubgenomicsof pages 2-4, gold2007globalviewof pages 1-2). URL/Date: Proteomics 2005: https://doi.org/10.1002/pmic.200401199 (Sep 2005); J Bacteriol 2007: https://doi.org/10.1128/jb.00882-07 (Oct 2007).
- Cellular localization and complex. Xgh74A is extracellular and incorporated into the C. thermocellum cellulosome through its type I dockerin; CipA (primary scaffoldin) binds the enzyme and anchors to the cell surface via type II dockerin interactions with SLH-bearing anchoring scaffoldins. This positions Xgh74A at the cell surface for efficient attack on plant cell wall polysaccharides (gold2007globalviewof pages 1-2). URL/Date: J Bacteriol 2007: https://doi.org/10.1128/jb.00882-07 (Oct 2007).

2) Enzymatic function, substrate specificity, and structural basis
- Reaction type and substrate. Xgh74A is an endo-xyloglucanase that hydrolyzes the β-1,4-glucan backbone of plant xyloglucans. GH74 enzymes typically cleave at unbranched glucose residues in the backbone, generating xylogluco-oligosaccharides; processive-like behaviors within GH74 have been reported. This specificity and mode of action for the C. thermocellum GH74 enzyme Xgh74A were established through structural and biochemical studies reported in J. Biol. Chem. 2006 and summarized by subsequent reviews (guerreiro2008molecularmechanismsof pages 57-62). URL/Date: review reference to JBC 2006 in Guerreiro 2008 chapter (2008; no DOI provided in excerpt) (guerreiro2008molecularmechanismsof pages 57-62).
- Structural insights. The structural work on C. thermocellum Xgh74A (apo and ligand complexes) revealed an open binding groove with an extended subsite architecture accommodating long oligosaccharides, explaining its ability to track along the xyloglucan chain and cleave preferentially at unbranched sites. Mutagenesis and kinetics aligned with this structural model, supporting the enzyme’s GH74-class endo-acting specificity (guerreiro2008molecularmechanismsof pages 57-62). URL/Date: summarized in 2008 review; original JBC 2006 reported therein (guerreiro2008molecularmechanismsof pages 57-62).

3) Localization, regulation, and systems context
- Cellulosomal incorporation and extracellular function. Quantitative proteomics of C. thermocellum cellulosomes demonstrated that XghA is one of the hemicellulases present in the extracellular complex. Relative abundance studies normalized to scaffoldin CipA showed that hemicellulases (including XghA) are more represented when cells are grown on cellobiose compared to cellulose, indicating substrate-dependent regulation of cellulosome composition (gold2007globalviewof pages 1-2). URL/Date: J Bacteriol 2007: https://doi.org/10.1128/jb.00882-07 (Oct 2007).
- Broader 2024 perspective on cellulosomes in Acetivibrio. A 2024 genomic survey of cellulosome-displaying bacteria concluded that Acetivibrio species, including A. thermocellus, harbor complex, “classical” cellulosomes with many dockerin-bearing glycoside hydrolases, consistent with Xgh74A’s inclusion as part of a diverse DocGH repertoire. The study highlights conserved Coh–Doc architectures and attachment mechanisms that underpin enzyme display and synergy on lignocellulose (minor2024agenomicanalysis pages 6-7). URL/Date: Frontiers in Microbiology 2024: https://doi.org/10.3389/fmicb.2024.1473396 (Oct 2024).

4) Role in pathways and plant biomass deconstruction
- Functional role in hemicellulose (xyloglucan) deconstruction. Xgh74A contributes to degradation of xyloglucan, a major hemicellulose entwined with cellulose microfibrils. Within the cellulosome, Xgh74A complements cellulases and other hemicellulases, facilitating deconstruction of the cellulose–xyloglucan network. Proteomics established Xgh74A among the major components in extracellular cellulosomes, underscoring its significance in biomass deconstruction (zverlov2005functionalsubgenomicsof pages 1-2, zverlov2005functionalsubgenomicsof pages 2-4). URL/Date: Proteomics 2005: https://doi.org/10.1002/pmic.200401199 (Sep 2005).
- Comparative insight from another cellulosome producer. In Ruminiclostridium cellulolyticum, xyloglucan catabolism relies on multiple cellulosomal endo- and exo-xyloglucanases, followed by import via a dedicated ABC transporter and intracellular exo-enzymes acting sequentially. Although species differ, this mechanistic framework supports the centrality of extracellular cellulosomal GH74 endo-xyloglucanase activity for releasing oligosaccharides in cellulosome-formers like C. thermocellum (ravachol2016mechanismsinvolvedin pages 15-16). URL/Date: Scientific Reports 2016: https://doi.org/10.1038/srep22770 (Mar 2016).

5) Recent developments and latest research (prioritizing 2023–2024 context)
- Diversity and architecture in cellulosome systems (2024). A comprehensive 2024 genomic analysis delineates the diversity and complexity of cellulosome-displaying bacteria and reinforces that Acetivibrio thermocellus maintains complex cellulosome architectures populated by many DocGH enzymes. This systemic context supports the expected presence and functional importance of GH74 members such as Xgh74A in modern comparative genomics (minor2024agenomicanalysis pages 6-7). URL/Date: Frontiers in Microbiology 2024: https://doi.org/10.3389/fmicb.2024.1473396 (Oct 2024).
- CBP and biomass solubilization benchmarks informing relevance. While not specific to Xgh74A, CBP studies with C. thermocellum show the organism solubilizes both cellulose and non-cellulosic sugars from woody biomass (e.g., 24% cellulose and 17% non-cellulosic sugars solubilized after 120 h on poplar, with selective patterns for different hemicellulosic sugars). This highlights the functional contribution and need for hemicellulases like Xgh74A to improve non-cellulosic solubilization in industrial contexts (ravachol2016mechanismsinvolvedin pages 15-16). URL/Date: Scientific Reports 2016: https://doi.org/10.1038/srep22770 (Mar 2016). Note: For CBP quantitative kinetics, see Biswal et al. 2022 Biotechnology for Biofuels and Bioproducts (not directly excerpted here), but broader context is aligned (ravachol2016mechanismsinvolvedin pages 15-16).

6) Current applications and real-world implementations
- Enzyme system engineering and cellulosome composition tuning. Proteomic evidence that hemicellulases like XghA are upregulated on cellobiose compared with cellulose suggests that growth substrate can be used to tune cellulosome composition for desired activity profiles. This informs strategies for producing enzyme mixtures or designer cellulosomes enriched in xyloglucanase activity for specific biomass feedstocks enriched in xyloglucan (gold2007globalviewof pages 1-2). URL/Date: J Bacteriol 2007: https://doi.org/10.1128/jb.00882-07 (Oct 2007).
- Designer and comparative cellulosome approaches. The 2024 genomic overview underlines modularity (Coh–Doc, SLH anchors, CBM3 in scaffoldins) that enables designer cellulosome assembly and suggests that inclusion of GH74 modules like Xgh74A can be rationally optimized to target xyloglucan-rich substrates within complex feedstocks (minor2024agenomicanalysis pages 6-7). URL/Date: Frontiers in Microbiology 2024: https://doi.org/10.3389/fmicb.2024.1473396 (Oct 2024).

7) Expert opinions and analysis from authoritative sources
- Proteomic consensus on major components. Independent proteomic studies converge on Xgh74A as a notable extracellular component of the C. thermocellum cellulosome, alongside cellulases and xylanases. This supports viewing Xgh74A as a key hemicellulase needed for comprehensive plant cell wall deconstruction (zverlov2005functionalsubgenomicsof pages 1-2, gold2007globalviewof pages 1-2). URL/Date: Proteomics 2005: https://doi.org/10.1002/pmic.200401199 (Sep 2005); J Bacteriol 2007: https://doi.org/10.1128/jb.00882-07 (Oct 2007).
- Structural/biochemical basis of GH74 specificity. Reviews summarizing the JBC 2006 structural study of Xgh74A and other GH74 enzymes emphasize the preference for cleavage at unbranched glucose residues and an elongated binding surface, rationalizing activity on highly branched xyloglucans. This mechanistic understanding guides enzyme selection and engineering for biomass processing (guerreiro2008molecularmechanismsof pages 57-62). URL/Date: summarized in 2008 review (guerreiro2008molecularmechanismsof pages 57-62).
- 2024 perspective on system-level design. The 2024 genomic overview frames Acetivibrio thermocellus as a paradigm of complex cellulosome producers with extensive DocGH repertoires, underlining the opportunity to leverage enzymes like Xgh74A in synthetic and industrial contexts (minor2024agenomicanalysis pages 6-7). URL/Date: Frontiers in Microbiology 2024: https://doi.org/10.3389/fmicb.2024.1473396 (Oct 2024).

8) Relevant statistics and data from recent studies
- Substrate-dependent expression in cellulosomes: Hemicellulases including XghA are relatively enriched in cellulosomes from cellobiose-grown cells vs. cellulose-grown cells, whereas GH9 cellulases are more abundant on cellulose. This quantification indicates regulatory plasticity that can be exploited for targeted enzyme production and activity profiles (gold2007globalviewof pages 1-2). URL/Date: J Bacteriol 2007: https://doi.org/10.1128/jb.00882-07 (Oct 2007).
- Biomass solubilization in CBP: In a representative CBP study, C. thermocellum DSM1313 solubilized ~24% cellulose and ~17% non-cellulosic sugars after 120 h on poplar, with differential solubilization and utilization of specific hemicellulosic and pectic sugars. This underscores the importance of deploying appropriate hemicellulases (e.g., GH74 xyloglucanases) to improve the non-cellulosic fraction conversion (ravachol2016mechanismsinvolvedin pages 15-16). URL/Date: Scientific Reports 2016: https://doi.org/10.1038/srep22770 (Mar 2016).

9) Mandatory verification steps addressed explicitly
- Gene symbol match: “xghA” corresponds to Xyloglucanase Xgh74A in C. thermocellum and was identified in cellulosomal proteomics; this matches the UniProt Q70DK5 entry description (zverlov2005functionalsubgenomicsof pages 1-2, zverlov2005functionalsubgenomicsof pages 2-4).
- Organism verification: The literature explicitly treats Clostridium thermocellum (syn. Acetivibrio/Hungateiclostridium thermocellum), consistent with the organism context provided (gold2007globalviewof pages 1-2).
- Protein family/domains: GH74 catalytic module and type I dockerin are confirmed in proteomic/genomic screens; cellulosomal context is consistent with dockerin-bearing GH enzymes in Acetivibrio cellulosomes (zverlov2005functionalsubgenomicsof pages 2-4, minor2024agenomicanalysis pages 6-7, gold2007globalviewof pages 1-2).
- Ambiguity: No conflicting gene with symbol “xghA” in a different organism was found in the gathered context; research proceeds only on the C. thermocellum Xgh74A (zverlov2005functionalsubgenomicsof pages 2-4, gold2007globalviewof pages 1-2).

Limitations and open points
- Numerical kinetic parameters and exact pH/temperature optima for Xgh74A were not present in the excerpts; primary JBC 2006 data should be consulted for those specifics. The preference for unbranched glucosyl cleavage and extended binding site are supported by structural/biochemical summaries in authoritative reviews (guerreiro2008molecularmechanismsof pages 57-62).

Embedded evidence summary table
| Aspect | Key findings | Evidence source (full citation with DOI URL and publication month/year) | Notes |
|---|---|---|---|
| Identity and organism | xghA encodes Xgh74A, a ~92 kDa putative xyloglucanase from Clostridium (Acetivibrio/Hungateiclostridium) thermocellum (listed as CtheDRAFT_1463 / ZP_00509879 in genomic screens). | Zverlov VV, Kellermann J, Schwarz WH. Functional subgenomics of Clostridium thermocellum cellulosomal genes: Identification of the major catalytic components in the extracellular complex and detection of three new enzymes. PROTEOMICS, Sep 2005. https://doi.org/10.1002/pmic.200401199 (zverlov2005functionalsubgenomicsof pages 2-4) | Primary genomic/proteomic identification; matches UniProt locus annotations reported to the user. |
| Family / domains | Contains a GH74 catalytic module fused to a type I dockerin (GH74–Doc1) as reported in proteomic/genomic annotation; no CBM/WD40/EF-hand modules explicitly reported in these primary proteomic listings. | Zverlov VV, Kellermann J, Schwarz WH. PROTEOMICS, Sep 2005. https://doi.org/10.1002/pmic.200401199 (zverlov2005functionalsubgenomicsof pages 2-4) | Domain call (GH74 + type I dockerin) supports cellulosomal incorporation; absence of other domains noted as "not reported" in the cited proteomic dataset. |
| Cellular localization | Xgh74A is an extracellular, dockerin-bearing cellulosomal enzyme incorporated into CipA scaffoldin complexes (cellulosome-associated). | Zverlov VV, Kellermann J, Schwarz WH. PROTEOMICS, Sep 2005. https://doi.org/10.1002/pmic.200401199; Gold ND, Martin VJJ. Global view of the Clostridium thermocellum cellulosome revealed by quantitative proteomic analysis. J Bacteriol, Oct 2007. https://doi.org/10.1128/jb.00882-07 (zverlov2005functionalsubgenomicsof pages 2-4, gold2007globalviewof pages 1-2) | Proteomics confirms extracellular/cellulosomal localization and attachment via dockerin–cohesin interactions. |
| Enzymatic function & specificity | Functions as an endo-xyloglucanase (cleaves xyloglucan backbone); reported preference to cleave at unbranched glucosyl residues in xyloglucan and shows processive-like behaviour in GH74 family members. | Guerreiro CIPD. Molecular mechanisms of plant cell wall hydrolysis: developing novel biotechnological applications for carbohydrate-binding modules. 2008 (secondary review referencing structural/kinetic work); Ravachol J, de Philip P, Fierobe HP et al. Mechanisms involved in xyloglucan catabolism by the cellulosome-producing bacterium Ruminiclostridium cellulolyticum. Sci Rep, Mar 2016. https://doi.org/10.1038/srep22770 (guerreiro2008molecularmechanismsof pages 57-62, ravachol2016mechanismsinvolvedin pages 15-16) | Specific cleavage pattern (unbranched Glc) and processivity are supported by structural/biochemical studies of GH74 enzymes cited in reviews; direct kinetic parameters for Xgh74A should be obtained from the primary JBC structural/biochemical paper. |
| Structural insights | Secondary literature reports crystal structures and ligand complexes for C. thermocellum Xgh74A (J. Biol. Chem. 2006 structural study is referenced in reviews). | Guerreiro CIPD. Molecular mechanisms of plant cell wall hydrolysis. 2008 (references JBC 2006 crystal structures reported for Xgh74A). (guerreiro2008molecularmechanismsof pages 57-62) | Review cites JBC 2006 structural work (primary structural data should be consulted for atomic details and PDB IDs). |
| Regulation / expression | XghA detected among major cellulosomal components; proteomic quantification shows substrate-dependent abundance—hemicellulases including XghA are relatively more abundant on hemicellulosic/carbon sources (e.g., cellobiose) than on pure cellulose. | Gold ND, Martin VJJ. J Bacteriol, Oct 2007. https://doi.org/10.1128/jb.00882-07; Zverlov VV et al. PROTEOMICS, Sep 2005. https://doi.org/10.1002/pmic.200401199 (gold2007globalviewof pages 1-2, zverlov2005functionalsubgenomicsof pages 2-4) | Indicates transcriptional/proteomic regulation of cellulosome composition according to growth substrate; useful for designing enzyme mixes or expression conditions. |
| Role in biomass deconstruction / cellulosome context | Acts as a hemicellulase specialized for xyloglucan degradation within the extracellular cellulosome, contributing to debranching and solubilization of plant cell-wall hemicellulose and enabling downstream uptake/metabolism of oligosaccharides. | Zverlov VV et al. PROTEOMICS, Sep 2005; Ravachol J et al. Sci Rep, Mar 2016. https://doi.org/10.1038/srep22770; Gold ND, Martin VJJ. J Bacteriol, Oct 2007. https://doi.org/10.1128/jb.00882-07 (zverlov2005functionalsubgenomicsof pages 2-4, ravachol2016mechanismsinvolvedin pages 15-16, gold2007globalviewof pages 1-2) | Positioned as a complementary hemicellulase in multi-enzyme cellulosomes that synergize for lignocellulose breakdown; important for degrading xyloglucan-rich regions of plant cell walls. |
| 2024 context on Acetivibrio cellulosome diversity | Recent genomic surveys reaffirm that Acetivibrio species (including A. thermocellus) possess complex, multi-cohesin cellulosomes with many dockerin-bearing GH enzymes; supports expectation that GH74 enzymes like Xgh74A are part of a diverse DocGH enzyme repertoire. | Minor CM, Takayesu A, Ha SM, et al. A genomic analysis reveals the diversity of cellulosome displaying bacteria. Front Microbiol, Oct 2024. https://doi.org/10.3389/fmicb.2024.1473396 (minor2024agenomicanalysis pages 6-7) | Places Xgh74A in broader genomic/ecological context useful for engineering or comparative analyses. |

Table: Concise, source-linked evidence summarizing identity, domains, localization, enzymatic properties, structure, regulation, role, and 2024 genomic context for Xgh74A (xghA, UniProt Q70DK5) in Clostridium/Acetivibrio thermocellum.

References (with URLs and dates; cite IDs inline above)
- Zverlov VV, Kellermann J, Schwarz WH. Functional subgenomics of Clostridium thermocellum cellulosomal genes: Identification of the major catalytic components in the extracellular complex and detection of three new enzymes. PROTEOMICS. Sep 2005. https://doi.org/10.1002/pmic.200401199 (zverlov2005functionalsubgenomicsof pages 1-2, zverlov2005functionalsubgenomicsof pages 2-4).
- Gold ND, Martin VJJ. Global view of the Clostridium thermocellum cellulosome revealed by quantitative proteomic analysis. Journal of Bacteriology. Oct 2007. https://doi.org/10.1128/jb.00882-07 (gold2007globalviewof pages 1-2).
- Guerreiro CIPD. Molecular mechanisms of plant cell wall hydrolysis: developing novel biotechnological applications for carbohydrate-binding modules. 2008. Review that cites JBC 2006 Xgh74A structures (guerreiro2008molecularmechanismsof pages 57-62).
- Ravachol J, de Philip P, Borne R, et al. Mechanisms involved in xyloglucan catabolism by the cellulosome-producing bacterium Ruminiclostridium cellulolyticum. Scientific Reports. Mar 2016. https://doi.org/10.1038/srep22770 (ravachol2016mechanismsinvolvedin pages 15-16).
- Minor CM, Takayesu A, Ha SM, et al. A genomic analysis reveals the diversity of cellulosome displaying bacteria. Frontiers in Microbiology. Oct 2024. https://doi.org/10.3389/fmicb.2024.1473396 (minor2024agenomicanalysis pages 6-7).

References

  1. (zverlov2005functionalsubgenomicsof pages 1-2): Vladimir V. Zverlov, Josef Kellermann, and Wolfgang H. Schwarz. Functional subgenomics of clostridium thermocellum cellulosomal genes: identification of the major catalytic components in the extracellular complex and detection of three new enzymes. PROTEOMICS, 5:3646-3653, Sep 2005. URL: https://doi.org/10.1002/pmic.200401199, doi:10.1002/pmic.200401199. This article has 194 citations and is from a peer-reviewed journal.

  2. (zverlov2005functionalsubgenomicsof pages 2-4): Vladimir V. Zverlov, Josef Kellermann, and Wolfgang H. Schwarz. Functional subgenomics of clostridium thermocellum cellulosomal genes: identification of the major catalytic components in the extracellular complex and detection of three new enzymes. PROTEOMICS, 5:3646-3653, Sep 2005. URL: https://doi.org/10.1002/pmic.200401199, doi:10.1002/pmic.200401199. This article has 194 citations and is from a peer-reviewed journal.

  3. (gold2007globalviewof pages 1-2): Nicholas D. Gold and Vincent J. J. Martin. Global view of the clostridium thermocellum cellulosome revealed by quantitative proteomic analysis. Journal of Bacteriology, 189:6787-6795, Oct 2007. URL: https://doi.org/10.1128/jb.00882-07, doi:10.1128/jb.00882-07. This article has 271 citations and is from a peer-reviewed journal.

  4. (minor2024agenomicanalysis pages 6-7): Christine M. Minor, Allen Takayesu, Sung Min Ha, Lukasz Salwinski, Michael R. Sawaya, Matteo Pellegrini, and Robert T. Clubb. A genomic analysis reveals the diversity of cellulosome displaying bacteria. Frontiers in Microbiology, Oct 2024. URL: https://doi.org/10.3389/fmicb.2024.1473396, doi:10.3389/fmicb.2024.1473396. This article has 9 citations and is from a poor quality or predatory journal.

  5. (guerreiro2008molecularmechanismsof pages 57-62): CIPD Guerreiro. Molecular mechanisms of plant cell wall hydrolysis: developing novel biotechnological applications for carbohydrate-binding modules. Unknown journal, 2008.

  6. (ravachol2016mechanismsinvolvedin pages 15-16): Julie Ravachol, Pascale de Philip, Romain Borne, Pascal Mansuelle, María J. Maté, Stéphanie Perret, and Henri-Pierre Fierobe. Mechanisms involved in xyloglucan catabolism by the cellulosome-producing bacterium ruminiclostridium cellulolyticum. Scientific Reports, Mar 2016. URL: https://doi.org/10.1038/srep22770, doi:10.1038/srep22770. This article has 97 citations and is from a peer-reviewed journal.

Citations

  1. minor2024agenomicanalysis pages 6-7
  2. gold2007globalviewof pages 1-2
  3. guerreiro2008molecularmechanismsof pages 57-62
  4. ravachol2016mechanismsinvolvedin pages 15-16
  5. zverlov2005functionalsubgenomicsof pages 2-4
  6. zverlov2005functionalsubgenomicsof pages 1-2
  7. https://doi.org/10.1002/pmic.200401199
  8. https://doi.org/10.1128/jb.00882-07
  9. https://doi.org/10.3389/fmicb.2024.1473396
  10. https://doi.org/10.1038/srep22770
  11. https://doi.org/10.1002/pmic.200401199;
  12. https://doi.org/10.1128/jb.00882-07;
  13. https://doi.org/10.1038/srep22770;
  14. https://doi.org/10.1002/pmic.200401199,
  15. https://doi.org/10.1128/jb.00882-07,
  16. https://doi.org/10.3389/fmicb.2024.1473396,
  17. https://doi.org/10.1038/srep22770,

📄 View Raw YAML

id: Q70DK5
gene_symbol: xghA
product_type: PROTEIN
status: COMPLETE
taxon:
  id: NCBITaxon:203119
  label: Acetivibrio thermocellus
description: >-
  Xyloglucanase Xgh74A is a GH74 family endo-xyloglucanase that is a major component of
  the Acetivibrio thermocellus cellulosome. It catalyzes the endohydrolysis of (1->4)-beta-D-glucosidic
  linkages in xyloglucan, preferentially cleaving at unbranched glucose residues in the backbone
  to produce xyloglucan oligosaccharides (XXXG, XLXG, XXLG, XLLG). The enzyme contains a
  C-terminal type I dockerin domain that mediates attachment to the CipA scaffoldin protein,
  thereby incorporating it into the extracellular cellulosome complex. It has high specific
  activity on tamarind seed xyloglucan (295 U/mg) with optimal activity at pH 6.4 and 75C,
  and shows only low activity on CMC and no activity on amorphous cellulose, indicating
  recognition of xylosidic side chains present in xyloglucan. Crystal structures have been
  determined revealing an open binding groove accommodating long oligosaccharides.
existing_annotations:
- term:
    id: GO:0000272
    label: polysaccharide catabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >-
      This IEA annotation infers involvement in polysaccharide catabolic process based on
      InterPro domain matches (dockerin domains IPR002105, IPR016134, IPR036439) and UniProtKB
      keywords. Xgh74A does function in polysaccharide catabolism, specifically xyloglucan
      degradation, as demonstrated experimentally (PMID:16207921). However, this term is
      very broad and lacks specificity for the actual substrate.
    action: ACCEPT
    reason: >-
      While this annotation is correct in a general sense - Xgh74A does participate in
      polysaccharide catabolism - it is overly broad. However, the more specific term
      GO:2000899 (xyloglucan catabolic process) is already annotated with IDA evidence,
      making this broader IEA annotation acceptable as a correctly inferred parent term.
      The enzymatic evidence from PMID:16207921 demonstrates hydrolysis of xyloglucan
      producing oligosaccharides.
    supported_by:
      - reference_id: PMID:16207921
        supporting_text: "It hydrolyses every fourth beta-1,4-glucan bond in the xyloglucan backbone, thus producing decorated cellotetraose units."
      - reference_id: file:ACET2/Q70DK5/Q70DK5-deep-research-falcon.md
        supporting_text: "Xgh74A contributes to degradation of xyloglucan, a major hemicellulose entwined with cellulose microfibrils"
- term:
    id: GO:0004553
    label: hydrolase activity, hydrolyzing O-glycosyl compounds
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >-
      This IEA annotation from InterPro/ARBA infers a general O-glycosyl hydrolase activity
      based on domain matches. Xgh74A is indeed an O-glycosyl hydrolase, specifically an
      endo-xyloglucanase that cleaves beta-1,4-glucosidic linkages. The more specific term
      GO:0033946 (xyloglucan-specific endo-beta-1,4-glucanase activity) is annotated with
      experimental evidence.
    action: ACCEPT
    reason: >-
      This is a correct parent term for the experimentally validated xyloglucan-specific
      endo-beta-1,4-glucanase activity (GO:0033946). The enzyme hydrolyzes O-glycosyl bonds
      in xyloglucan as demonstrated in PMID:16207921. The crystal structure study (PMID:16772298)
      further characterized the active site and substrate binding.
    supported_by:
      - reference_id: PMID:16207921
        supporting_text: "Xyloglucanase Xgh74A contains a catalytic module of GHF74 in addition to a C-terminal dockerin module. It hydrolyses every fourth beta-1,4-glucan bond in the xyloglucan backbone"
- term:
    id: GO:0010411
    label: xyloglucan metabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000118
  review:
    summary: >-
      This TreeGrafter-generated annotation correctly identifies involvement in xyloglucan
      metabolism based on phylogenetic inference (PANTHER:PTN005180743). Xgh74A is experimentally
      demonstrated to hydrolyze xyloglucan. However, the more specific child term GO:2000899
      (xyloglucan catabolic process) is already annotated with IDA evidence.
    action: ACCEPT
    reason: >-
      This is a correct but broad annotation. The enzyme specifically catalyzes xyloglucan
      catabolism, which is a child process of xyloglucan metabolic process. Since the more
      specific term is already annotated with experimental evidence, this IEA serves as
      a correctly inferred parent term. The TreeGrafter phylogenetic inference correctly
      places this GH74 family enzyme in the xyloglucan metabolism pathway.
    supported_by:
      - reference_id: PMID:16207921
        supporting_text: "Xgh74A is the first xyloglucanase identified in C. thermocellum and the only enzyme in the cellulosome that hydrolyses tamarind xyloglucan."
- term:
    id: GO:0016787
    label: hydrolase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: >-
      This very broad IEA annotation from UniProtKB-KW:KW-0378 (Hydrolase) simply indicates
      that the protein has hydrolase activity. While technically correct, this term provides
      minimal informational value given the more specific annotations available.
    action: ACCEPT
    reason: >-
      This is a correct but very high-level annotation that is a grandparent of the
      experimentally validated specific activity. The UniProtKB keyword mapping correctly
      identifies the protein as a hydrolase. More specific child terms (GO:0016798, GO:0004553,
      GO:0033946) are also annotated, providing greater specificity.
    supported_by:
      - reference_id: PMID:16207921
        supporting_text: "It hydrolyses every fourth beta-1,4-glucan bond in the xyloglucan backbone"
- term:
    id: GO:0016798
    label: hydrolase activity, acting on glycosyl bonds
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: >-
      This IEA annotation from UniProtKB-KW:KW-0326 (Glycosidase) correctly identifies
      the protein as having glycosidase activity. Xgh74A is a GH74 family glycoside
      hydrolase with demonstrated endo-glucanase activity on xyloglucan.
    action: ACCEPT
    reason: >-
      This is a correct parent term for the more specific xyloglucan-specific
      endo-beta-1,4-glucanase activity (GO:0033946). The UniProtKB keyword-based inference
      correctly classifies this enzyme as acting on glycosyl bonds, consistent with its
      GH74 family membership and demonstrated hydrolysis of beta-1,4-glucosidic linkages
      in xyloglucan.
    supported_by:
      - reference_id: PMID:16207921
        supporting_text: "Xyloglucanase Xgh74A contains a catalytic module of GHF74 in addition to a C-terminal dockerin module."
- term:
    id: GO:0030245
    label: cellulose catabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: >-
      This IEA annotation infers involvement in cellulose catabolism from UniProtKB-KW:KW-0136
      (Cellulose degradation). However, experimental evidence clearly demonstrates that
      Xgh74A is NOT a cellulase - it has only very low activity on CMC and NO activity
      on amorphous cellulose. Its primary substrate is xyloglucan, a hemicellulose.
    action: REMOVE
    reason: >-
      This annotation is incorrect. PMID:16207921 explicitly states that Xgh74A has "low
      activity on CMC" and "no activity on amorphous cellulose", indicating it does not
      function in cellulose catabolism. The UniProt keyword "Cellulose degradation" appears
      to be a misattribution likely based on its cellulosome association and GH74 family
      membership, but experimental evidence clearly shows xyloglucan is the primary substrate.
      The enzyme's recognition of xylosidic side chains distinguishes it from true cellulases.
    supported_by:
      - reference_id: PMID:16207921
        supporting_text: "Its low activity on CMC and lack of activity on amorphous cellulose indicates recognition of the xylosidic side chains present in xyloglucan, which is readily hydrolysed (295 U mg(-1))."
- term:
    id: GO:0033946
    label: xyloglucan-specific endo-beta-1,4-glucanase activity
  evidence_type: IDA
  original_reference_id: PMID:16207921
  review:
    summary: >-
      This IDA annotation correctly captures the specific molecular function of Xgh74A.
      The enzyme was biochemically characterized and shown to hydrolyze xyloglucan with
      high specificity (295 U/mg on tamarind seed xyloglucan), cleaving every fourth
      beta-1,4-glucan bond in the backbone to produce decorated cellotetraose units
      (XXXG, XLXG, XXLG, XLLG). The crystal structure (PMID:16772298) further confirmed
      this activity and identified the catalytic residues.
    action: ACCEPT
    reason: >-
      This is the most specific and accurate MF annotation for Xgh74A, directly supported
      by experimental evidence demonstrating endo-xyloglucanase activity. The enzyme
      belongs to GH74 family known for this activity, and the hydrolysis pattern
      producing decorated cellotetraose units is characteristic of endo-xyloglucanases.
      This represents the core molecular function of the protein.
    supported_by:
      - reference_id: PMID:16207921
        supporting_text: "Xyloglucanase Xgh74A contains a catalytic module of GHF74 in addition to a C-terminal dockerin module. It hydrolyses every fourth beta-1,4-glucan bond in the xyloglucan backbone, thus producing decorated cellotetraose units."
      - reference_id: PMID:16207921
        supporting_text: "The pattern of the hydrolysis products from tamarind xyloglucan resembles that of other GHF74 xyloglucan endoglucanases."
      - reference_id: file:ACET2/Q70DK5/Q70DK5-deep-research-falcon.md
        supporting_text: "Functions as an endo-xyloglucanase (cleaves xyloglucan backbone); reported preference to cleave at unbranched glucosyl residues in xyloglucan"
- term:
    id: GO:0043263
    label: cellulosome
  evidence_type: IDA
  original_reference_id: PMID:16207921
  review:
    summary: >-
      This IDA annotation correctly identifies Xgh74A as a cellulosome component. The
      enzyme contains a C-terminal type I dockerin domain that mediates incorporation
      into the cellulosome by binding to cohesin modules on the CipA scaffoldin protein.
      Proteomic studies have confirmed Xgh74A as a major extracellular cellulosome
      component in C. thermocellum.
    action: ACCEPT
    reason: >-
      This is the correct CC annotation for Xgh74A. The presence of a functional type I
      dockerin domain (experimentally validated through proteomic detection in cellulosome
      preparations) indicates localization to the cellulosome complex. The enzyme was
      identified as one of "two new major subunits in the cellulosome" in PMID:16207921.
      Additional proteomic studies (Gold & Martin 2007, Zverlov et al. 2005 Proteomics)
      have confirmed its presence in extracellular cellulosome preparations.
    supported_by:
      - reference_id: PMID:16207921
        supporting_text: "The structure and enzymic activity of xyloglucanase Xgh74A and endoxylanase Xyn10D, components in the cellulosomes of cellulose-grown Clostridium thermocellum, were determined."
      - reference_id: PMID:16207921
        supporting_text: "Xyloglucanase Xgh74A contains a catalytic module of GHF74 in addition to a C-terminal dockerin module."
      - reference_id: file:ACET2/Q70DK5/Q70DK5-deep-research-falcon.md
        supporting_text: "Xgh74A is an extracellular, dockerin-bearing cellulosomal enzyme incorporated into CipA scaffoldin complexes"
- term:
    id: GO:2000899
    label: xyloglucan catabolic process
  evidence_type: IDA
  original_reference_id: PMID:16207921
  review:
    summary: >-
      This IDA annotation correctly identifies Xgh74A as involved in xyloglucan catabolism.
      The enzyme specifically degrades xyloglucan (a hemicellulose) by hydrolyzing the
      glucan backbone, contributing to plant cell wall deconstruction. It is the only
      xyloglucanase identified in the C. thermocellum cellulosome.
    action: ACCEPT
    reason: >-
      This is the most specific and accurate BP annotation for Xgh74A. The enzyme was
      experimentally demonstrated to hydrolyze tamarind seed xyloglucan with high specific
      activity, producing xylogluco-oligosaccharides. This represents the core biological
      process in which the enzyme functions - the catabolic breakdown of xyloglucan
      hemicellulose as part of lignocellulose deconstruction.
    supported_by:
      - reference_id: PMID:16207921
        supporting_text: "Xgh74A is the first xyloglucanase identified in C. thermocellum and the only enzyme in the cellulosome that hydrolyses tamarind xyloglucan."
      - reference_id: PMID:16207921
        supporting_text: "The data indicate that Xgh74A and Xyn10D contribute to the in vivo degradation of the hemicelluloses xyloglucan and xylan by the cellulosome of C. thermocellum."
references:
- id: GO_REF:0000043
  title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
  findings: []
- id: GO_REF:0000118
  title: TreeGrafter-generated GO annotations
  findings: []
- id: GO_REF:0000120
  title: Combined Automated Annotation using Multiple IEA Methods
  findings: []
- id: PMID:16207921
  title: >-
    Two new major subunits in the cellulosome of Clostridium thermocellum: xyloglucanase
    Xgh74A and endoxylanase Xyn10D.
  findings:
    - statement: Xgh74A identified as a major cellulosome component with GH74 catalytic domain and type I dockerin
      supporting_text: "Xyloglucanase Xgh74A contains a catalytic module of GHF74 in addition to a C-terminal dockerin module."
    - statement: Demonstrates endo-xyloglucanase activity, cleaving every fourth beta-1,4-glucan bond in xyloglucan backbone
      supporting_text: "It hydrolyses every fourth beta-1,4-glucan bond in the xyloglucan backbone, thus producing decorated cellotetraose units."
    - statement: High specific activity on tamarind seed xyloglucan (295 U/mg), producing XXXG/XLXG/XXLG/XLLG products
      supporting_text: "xyloglucan, which is readily hydrolysed (295 U mg(-1))"
    - statement: Low activity on CMC, no activity on amorphous cellulose - distinguishing it from cellulases
      supporting_text: "Its low activity on CMC and lack of activity on amorphous cellulose indicates recognition of the xylosidic side chains present in xyloglucan"
    - statement: First xyloglucanase identified in C. thermocellum cellulosome
      supporting_text: "Xgh74A is the first xyloglucanase identified in C. thermocellum and the only enzyme in the cellulosome that hydrolyses tamarind xyloglucan."
- id: PMID:16772298
  title: >-
    Crystal structures of Clostridium thermocellum xyloglucanase, XGH74A, reveal the
    structural basis for xyloglucan recognition and degradation.
  full_text_unavailable: true
  findings:
    - statement: Crystal structure reveals complex subsite specificity accommodating seventeen monosaccharide moieties
      supporting_text: "The structures, in combination with mutagenesis data on the catalytic residues and the kinetics and specificity of xyloglucan hydrolysis reveal a complex subsite specificity accommodating seventeen monosaccharide moieties of the multibranched substrate in an open substrate binding terrain."
- id: file:ACET2/Q70DK5/Q70DK5-deep-research-falcon.md
  title: Deep research summary for xghA/Q70DK5
  findings:
    - statement: Xgh74A is extracellular and incorporated into the C. thermocellum cellulosome through its type I dockerin
      supporting_text: "Xgh74A is an extracellular, dockerin-bearing cellulosomal enzyme incorporated into CipA scaffoldin complexes"
    - statement: Functions as endo-xyloglucanase with preference for cleavage at unbranched glucosyl residues
      supporting_text: "Functions as an endo-xyloglucanase (cleaves xyloglucan backbone); reported preference to cleave at unbranched glucosyl residues in xyloglucan"
core_functions:
  - molecular_function:
      id: GO:0033946
      label: xyloglucan-specific endo-beta-1,4-glucanase activity
    directly_involved_in:
      - id: GO:2000899
        label: xyloglucan catabolic process
    locations:
      - id: GO:0043263
        label: cellulosome
    description: >-
      The primary molecular function of Xgh74A is xyloglucan-specific endo-beta-1,4-glucanase
      activity. It catalyzes the endohydrolysis of (1->4)-beta-D-glucosidic linkages specifically
      in xyloglucan, cleaving at unbranched glucose residues to produce decorated cellotetraose
      units (xylogluco-oligosaccharides). The enzyme shows substrate specificity for xyloglucan
      over cellulose, recognizing the xylosidic side chains of the substrate. The enzyme is
      localized to the extracellular cellulosome complex via its C-terminal type I dockerin
      domain and participates in xyloglucan catabolism as part of hemicellulose degradation.
proposed_new_terms: []
suggested_questions:
  - question: Has the relative contribution of Xgh74A to overall cellulosome efficiency on different plant biomass feedstocks been quantified?
  - question: Are there conditions under which xghA expression or Xgh74A incorporation into cellulosomes is differentially regulated?
suggested_experiments:
  - description: Comparative activity assays on different plant-derived xyloglucans (beyond tamarind) to assess substrate range
  - description: Cellulosome reconstitution experiments with and without Xgh74A to quantify synergistic contributions to plant biomass deconstruction