xynZ

UniProt ID: P10478
Organism: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372)
Review Status: DRAFT
Aliases:
XynZ Xylanase Z Endo-1,4-beta-xylanase Z
📝 Provide Detailed Feedback

Gene Description

XynZ (Xylanase Z) is a modular cellulosomal enzyme from Acetivibrio thermocellus (formerly Clostridium thermocellum) with dual catalytic activities. The protein contains four distinct domains: an N-terminal CE1 feruloyl esterase domain that removes ferulate and acetyl decorations from xylan, a CBM6 carbohydrate-binding module that targets xylan substrates, a type I dockerin domain that mediates incorporation into the cellulosome via binding to cohesin domains on the CipA scaffoldin, and a C-terminal GH10 endo-1,4-beta-xylanase catalytic domain that cleaves internal beta-1,4 linkages in xylan backbones. The protein is exported and functions as part of the extracellular cellulosome complex, where it contributes to the coordinated degradation of plant cell wall hemicellulose. The GH10 domain shows optimal activity around 70C, consistent with the thermophilic lifestyle of this organism. XynZ produces xylo-oligosaccharides, primarily xylobiose, as hydrolysis products.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0000272 polysaccharide catabolic process
IEA
GO_REF:0000120
MARK AS OVER ANNOTATED
Summary: This annotation is based on InterPro domains (Dockerin) and UniProt keywords. While XynZ does participate in polysaccharide catabolism, specifically xylan degradation, this term is overly broad. The protein's primary catabolic activity is specifically on xylan, a type of hemicellulose, not polysaccharides in general. The more specific term GO:0045493 (xylan catabolic process) is already annotated and should be preferred.
Reason: While technically correct since xylan is a polysaccharide, this term is too general for XynZ which has well-characterized xylan-specific activity. The GH10 domain specifically catalyzes endo-1,4-beta-xylanase activity, and the more specific annotation to GO:0045493 (xylan catabolic process) is more informative. This IEA annotation adds no information beyond the existing xylan catabolic process annotation.
Supporting Evidence:
file:ACET2/P10478/P10478-deep-research-falcon.md
As a GH10 endo-1,4-beta-xylanase, XynZ cleaves internal beta-1,4 linkages in xylan backbones
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
IEA
GO_REF:0000002
ACCEPT
Summary: This annotation is inferred from the GH10 and dockerin InterPro domains. The term correctly captures the general enzymatic mechanism of the GH10 xylanase domain but is less specific than endo-1,4-beta-xylanase activity (GO:0031176) which is the precise function of the GH10 domain.
Reason: This is a valid parent term annotation based on InterPro domain prediction. While more specific terms exist (GO:0031176), hierarchical IEA annotations to parent terms are acceptable as they correctly capture the enzyme class. The GH10 domain does indeed hydrolyze O-glycosyl compounds (beta-1,4-xylosidic linkages in xylan).
Supporting Evidence:
file:ACET2/P10478/P10478-deep-research-falcon.md
The GH10 domain is the catalytic xylanase module (EC 3.2.1.8)
GO:0005975 carbohydrate metabolic process
IEA
GO_REF:0000002
MARK AS OVER ANNOTATED
Summary: This annotation is inferred from the GH10 InterPro domain. It is a very high-level biological process term that provides minimal specific information about XynZ function.
Reason: While XynZ participates in carbohydrate metabolism, this term is too general. The protein is specifically involved in xylan catabolic process (GO:0045493), which is already annotated. This high-level IEA annotation does not add meaningful functional information and could be misleading about the specificity of the enzyme's role.
GO:0016787 hydrolase activity
IEA
GO_REF:0000043
ACCEPT
Summary: This annotation is based on the UniProtKB keyword "Hydrolase". It correctly identifies XynZ as having hydrolase activity, consistent with both its GH10 xylanase and CE1 esterase domains.
Reason: This is a valid high-level molecular function annotation. XynZ has two hydrolase activities: the GH10 endo-1,4-beta-xylanase (glycoside hydrolase) and the CE1 feruloyl esterase (carboxylic ester hydrolase). While more specific terms exist, this parent annotation is technically correct.
Supporting Evidence:
file:ACET2/P10478/P10478-deep-research-falcon.md
XynZ is a modular, extracellular GH10 endo-beta-1,4-xylanase
GO:0016798 hydrolase activity, acting on glycosyl bonds
IEA
GO_REF:0000043
ACCEPT
Summary: This annotation is based on the UniProtKB keyword "Glycosidase". It correctly describes the activity of the GH10 domain which hydrolyzes beta-1,4-xylosidic (glycosyl) bonds in xylan.
Reason: Valid intermediate-level annotation. The GH10 endo-1,4-beta-xylanase domain hydrolyzes glycosyl bonds. This is more specific than GO:0016787 but less specific than GO:0031176. Acceptable as an IEA annotation that correctly places the enzyme in the glycosidase hierarchy.
Supporting Evidence:
file:ACET2/P10478/P10478-deep-research-falcon.md
GH10 endo-beta-1,4-xylanase cleaves internal beta-1,4 linkages in xylan backbones
GO:0030246 carbohydrate binding
IEA
GO_REF:0000002
MODIFY
Summary: This annotation is inferred from the CBM6 (Carbohydrate Binding Module family 6) and cellulose-binding domain (CBD_IV) InterPro entries. XynZ does have carbohydrate binding capability via its CBM6 domain.
Reason: While the annotation is directionally correct (XynZ does bind carbohydrates), the CBM6 domain in XynZ specifically binds xylan substrates to facilitate enzymatic activity. A more appropriate term would be GO:2001062 (xylan binding), which specifically describes binding to xylan polymers. The CBM6 domain in C. thermocellum xylanases is known to target xylan, enhancing the enzyme's effectiveness on this substrate.
Proposed replacements: xylan binding
Supporting Evidence:
file:ACET2/P10478/P10478-deep-research-falcon.md
CBM6 targets xylan
GO:0031176 endo-1,4-beta-xylanase activity
IEA
GO_REF:0000120
ACCEPT
Summary: This annotation is based on the EC number 3.2.1.8 and ARBA rule ARBA00026761. The term accurately describes the primary catalytic activity of the GH10 domain, which catalyzes endohydrolysis of (1->4)-beta-D-xylosidic linkages in xylans.
Reason: This is the correct specific molecular function for the GH10 domain of XynZ. The enzyme is classified as EC 3.2.1.8 (endo-1,4-beta-xylanase) and belongs to glycosyl hydrolase family 10, both of which are consistent with this annotation. This represents a core function of the protein.
Supporting Evidence:
PMID:3139632
Deletions removing up to 508 codons from the 5' end of the gene did not affect the activity of the encoded polypeptide, showing that the active site was located in the C-terminal half of the protein
file:ACET2/P10478/P10478-deep-research-falcon.md
The GH10 domain is the catalytic xylanase module (EC 3.2.1.8)
GO:0045493 xylan catabolic process
IEA
GO_REF:0000043
ACCEPT
Summary: This annotation is based on the UniProtKB keyword "Xylan degradation". XynZ participates in xylan catabolism as part of the cellulosome complex, where it works synergistically with other hemicellulases to break down xylan polymers.
Reason: This is the appropriate biological process annotation for XynZ. The enzyme's endo-1,4-beta-xylanase activity directly contributes to xylan breakdown, producing xylo-oligosaccharides. This represents the core biological role of the protein in C. thermocellus lignocellulose degradation.
Supporting Evidence:
file:ACET2/P10478/P10478-deep-research-falcon.md
Cellulosomal xylanase activity commonly yields xylo-oligosaccharides (notably xylobiose) with minimal xylose without additional beta-xylosidases
GO:0031176 endo-1,4-beta-xylanase activity
IDA
PMID:3139632
Nucleotide sequence and deletion analysis of the xylanase ge...
ACCEPT
Summary: This experimental annotation is based on direct assay data from Grepinet et al. 1988, who cloned and characterized the xynZ gene. The paper demonstrates xylanase activity of the encoded protein through deletion analysis and expression in E. coli.
Reason: This is a well-supported experimental annotation. PMID:3139632 reports the cloning and characterization of xynZ, demonstrating that the C-terminal portion of the protein encodes the active xylanase. The paper shows that deletions removing up to 508 codons from the 5' end did not affect the activity, confirming that the xylanase catalytic domain is in the C-terminal GH10 region. This is a core function of the protein.
Supporting Evidence:
PMID:3139632
Deletions removing up to 508 codons from the 5' end of the gene did not affect the activity of the encoded polypeptide, showing that the active site was located in the C-terminal half of the protein
GO:0033905 xylan endo-1,3-beta-xylosidase activity
IDA
PMID:3139632
Nucleotide sequence and deletion analysis of the xylanase ge...
REMOVE
Summary: This annotation suggests XynZ has activity on 1,3-beta-xylosidic linkages. However, the deep research and UniProt characterization consistently identify XynZ as a GH10 family endo-1,4-beta-xylanase (EC 3.2.1.8), not as having 1,3-xylosidase activity. The referenced paper (PMID:3139632) does not provide evidence for 1,3-beta-xylosidase activity.
Reason: This annotation appears to be erroneous. GH10 family enzymes, including XynZ, catalyze hydrolysis of 1,4-beta-xylosidic linkages, not 1,3-beta linkages. The referenced paper (PMID:3139632) describes the cloning and deletion analysis of xynZ and demonstrates xylanase activity, but does not provide evidence for 1,3-xylosidase activity. UniProt explicitly classifies XynZ as EC 3.2.1.8 (endo-1,4-beta-xylanase) and places it in the GH10 family, which is inconsistent with xylan endo-1,3-beta-xylosidase activity (EC 3.2.1.32). This appears to be an annotation error that should be removed.
Supporting Evidence:
file:ACET2/P10478/P10478-deep-research-falcon.md
XynZ (xynZ=Cthe_1963; UniProt P10478) in Acetivibrio thermocellus is a modular, extracellular GH10 endo-beta-1,4-xylanase
PMID:3139632
Nucleotide sequence and deletion analysis of the xylanase gene (xynZ) of Clostridium thermocellum.
GO:0030600 feruloyl esterase activity
ISS
GO_REF:0000002
NEW
Summary: XynZ contains an N-terminal CE1 (Carbohydrate Esterase family 1) domain that has feruloyl esterase activity. This domain removes ferulate decorations from xylan, facilitating backbone hydrolysis by the GH10 domain. This dual-function architecture is well documented for XynZ.
Reason: The CE1 domain of XynZ provides feruloyl esterase activity, which is a documented accessory function of this enzyme. UniProt cross-references ESTHER database (clotm-xynz; A85-Feruloyl-Esterase) and BRENDA entry for EC 3.1.1.73 (feruloyl esterase). This is a core function of the protein that should be annotated.
Proposed replacements: feruloyl esterase activity
Supporting Evidence:
file:ACET2/P10478/P10478-deep-research-falcon.md
The N-terminal CE1 in XynZ confers feruloyl esterase-type activity consistent with removal of ferulate/acetyl decorations that hinder backbone hydrolysis in plant xylans
GO:0043263 cellulosome
IDA
PMID:3139632
Nucleotide sequence and deletion analysis of the xylanase ge...
NEW
Summary: XynZ contains a type I dockerin domain that mediates its incorporation into the cellulosome complex via binding to cohesin domains on the CipA scaffoldin protein. The protein functions as part of this extracellular multi-enzyme complex.
Reason: XynZ is a cellulosomal enzyme that is incorporated into the cellulosome via its dockerin domain. UniProt identifies the Dockerin domain (residues 424-492) and references the Dockerin-related InterPro entries. PMID:3139632 identified a duplicated segment strongly similar to the conserved domain found in C. thermocellum endoglucanases, which is the dockerin domain. This cellular component annotation would accurately reflect the protein's localization and function as part of the cellulosome complex.
Proposed replacements: cellulosome
Supporting Evidence:
PMID:3139632
A region of 60 amino acids containing a duplicated segment of 24 amino acids was found between residues 429 and 488 of xylanase Z. This region was strongly similar to the conserved domain found at the carboxy-terminal ends of C. thermocellum endoglucanases A, B, and D
file:ACET2/P10478/P10478-deep-research-falcon.md
XynZ is exported and assembled into the cellulosome (extracellular) through its dockerin I, contributing to the coordinated deconstruction of hemicellulose in plant cell walls alongside cellulases and accessory enzymes

Core Functions

The primary enzymatic function of XynZ is as an endo-1,4-beta-xylanase (EC 3.2.1.8), catalyzed by the C-terminal GH10 domain. This enzyme cleaves internal beta-1,4-xylosidic linkages in xylan polymers, producing xylo-oligosaccharides (primarily xylobiose). The GH10 domain shows optimal activity around 70C.

Directly Involved In:
Cellular Locations:

The N-terminal CE1 domain provides feruloyl esterase activity, removing ferulate and acetyl decorations from substituted xylans. This accessory activity facilitates access of the xylanase domain to the xylan backbone.

Molecular Function:
feruloyl esterase activity
Directly Involved In:
Cellular Locations:

References

Gene Ontology annotation through association of InterPro records with GO terms
Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
Combined Automated Annotation using Multiple IEA Methods
Nucleotide sequence and deletion analysis of the xylanase gene (xynZ) of Clostridium thermocellum.
  • Cloning and sequencing of xynZ gene encoding 837 amino acid polypeptide
    "The putative xynZ gene was 2,511 base pairs long and encoded a polypeptide of 837 amino acids"
  • Identification of duplicated 24 amino acid segment (residues 429-488) similar to C. thermocellum endoglucanases - this is the dockerin domain
    "A region of 60 amino acids containing a duplicated segment of 24 amino acids was found between residues 429 and 488 of xylanase Z. This region was strongly similar to the conserved domain found at the carboxy-terminal ends of C. thermocellum endoglucanases A, B, and D"
  • Deletion analysis showing active site in C-terminal half of protein (GH10 domain)
    "Deletions removing up to 508 codons from the 5' end of the gene did not affect the activity of the encoded polypeptide, showing that the active site was located in the C-terminal half of the protein"
  • Expression of xylanase activity in E. coli
    "Expression of xylanase activity in Escherichia coli was increased up to 220-fold by fusing fragments containing the 3' end of the gene with the start of lacZ present in pUC19"
A common protein fold and similar active site in two distinct families of beta-glycanases.
  • X-ray crystal structure of GH10 domain (residues 515-837) at 1.4 angstrom resolution
    "The structure of the catalytic core of xylanase XynZ, which belongs to xylanase family F, has been determined at 1.4 A resolution"
  • Structural characterization of the catalytic domain
    "In spite of significant differences in substrate specificity and structure (including the absence of the helical subdomain), the general polypeptide folding pattern, architecture of the active site and catalytic mechanism of XynZ and CelC are similar, suggesting a common evolutionary origin"
file:ACET2/P10478/P10478-deep-research-falcon.md
Deep research synthesis on XynZ function and domain architecture
  • XynZ is a multi-domain cellulosomal enzyme with CE1-CBM6-Dockerin I-GH10 architecture
    "Domain architecture: XynZ is reported as CE1 (feruloyl esterase) - CBM6 - Dockerin type I - GH10 endo-beta-1,4-xylanase"
  • The GH10 domain catalyzes endo-1,4-beta-xylanase activity (EC 3.2.1.8)
    "The GH10 domain is the catalytic xylanase module (EC 3.2.1.8)"
  • The CE1 domain provides feruloyl esterase activity for removing decorations from xylan
    "The N-terminal CE1 in XynZ confers feruloyl esterase-type activity consistent with removal of ferulate/acetyl decorations that hinder backbone hydrolysis in plant xylans"
  • The protein is localized to the cellulosome via its dockerin domain
    "XynZ is exported and assembled into the cellulosome (extracellular) through its dockerin I"

Suggested Questions for Experts

Q: Has the feruloyl esterase activity of the CE1 domain been directly demonstrated experimentally, and what is the substrate specificity? While the CE1 domain is predicted to have feruloyl esterase activity based on sequence similarity, direct experimental validation would strengthen the annotation.

Q: What is the relative contribution of the CBM6 domain to substrate binding and catalytic efficiency on different xylan substrates? Understanding CBM6 function would support the proposed modification of the carbohydrate binding annotation to the more specific xylan binding term.

Suggested Experiments

Experiment: Direct assay of feruloyl esterase activity using the isolated CE1 domain or full-length protein with ferulated xylan substrates. This would provide experimental evidence for the proposed new annotation to GO:0030600 (feruloyl esterase activity).

Experiment: Binding assays comparing xylan vs other carbohydrate substrates for the CBM6 domain. This would support the proposed modification from general carbohydrate binding to specific xylan binding.

📚 Additional Documentation

Deep Research Falcon

(P10478-deep-research-falcon.md)

provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2025-12-26T17:03:41.018935'
end_time: '2025-12-26T17:10:33.939459'
duration_seconds: 412.92
template_file: templates/gene_research_go_focused.md
template_variables:
organism: ACET2
gene_id: P10478
gene_symbol: xynZ
uniprot_accession: P10478
protein_description: 'RecName: Full=Endo-1,4-beta-xylanase Z; Short=Xylanase Z;
EC=3.2.1.8; AltName: Full=1,4-beta-D-xylan xylanohydrolase Z; Flags: Precursor;'
gene_info: Name=xynZ; OrderedLocusNames=Cthe_1963;
organism_full: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322
/ NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
protein_family: Belongs to the glycosyl hydrolase 10 (cellulase F) family.
protein_domains: AB_hydrolase_fold. (IPR029058); CBM6. (IPR005084); Cellulose-bd_IV.
(IPR006584); Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 23


Question

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: P10478
  • Protein Description: RecName: Full=Endo-1,4-beta-xylanase Z; Short=Xylanase Z; EC=3.2.1.8; AltName: Full=1,4-beta-D-xylan xylanohydrolase Z; Flags: Precursor;
  • Gene Information: Name=xynZ; OrderedLocusNames=Cthe_1963;
  • Organism (full): Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  • Protein Family: Belongs to the glycosyl hydrolase 10 (cellulase F) family.
  • Key Domains: AB_hydrolase_fold. (IPR029058); CBM6. (IPR005084); Cellulose-bd_IV. (IPR006584); Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "xynZ" matches the protein description above
  2. Verify the organism is correct: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'xynZ' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene xynZ (gene ID: P10478, UniProt: P10478) in ACET2.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output

Question: You are an expert researcher providing comprehensive, well-cited information.

Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies

Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: P10478
  • Protein Description: RecName: Full=Endo-1,4-beta-xylanase Z; Short=Xylanase Z; EC=3.2.1.8; AltName: Full=1,4-beta-D-xylan xylanohydrolase Z; Flags: Precursor;
  • Gene Information: Name=xynZ; OrderedLocusNames=Cthe_1963;
  • Organism (full): Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  • Protein Family: Belongs to the glycosyl hydrolase 10 (cellulase F) family.
  • Key Domains: AB_hydrolase_fold. (IPR029058); CBM6. (IPR005084); Cellulose-bd_IV. (IPR006584); Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "xynZ" matches the protein description above
  2. Verify the organism is correct: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'xynZ' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene xynZ (gene ID: P10478, UniProt: P10478) in ACET2.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Research plan and status
We verified identity and domain architecture for xynZ (UniProt P10478) in Acetivibrio thermocellus (Clostridium thermocellum), compiled biochemical and localization data, added recent 2023–2024 advances and applications, and synthesized expert perspectives and quantitative data. Key sources include a 2024 synthesis on C. thermocellum xylanases, a 2017 Scientific Reports paper situating xynZ among characterized xylanases, classic cellulosome proteomics, and recent reviews and application studies (2023–2024). (hamann2024desenvolvimentodecomplexosa pages 20-23, heinze2017identificationofendoxylanase pages 1-2, hamann2024desenvolvimentodecomplexos pages 17-20, hamann2024desenvolvimentodecomplexosb pages 17-20, hamann2024desenvolvimentodecomplexos pages 20-23, hamann2024desenvolvimentodecomplexosb pages 20-23, hamann2024desenvolvimentodecomplexos pages 26-29, hamann2024desenvolvimentodecomplexosc pages 26-29, hamann2024desenvolvimentodecomplexosb pages 26-29, hamann2024desenvolvimentodecomplexosa pages 17-20, heinze2017identificationofendoxylanase pages 8-9, selvaraj2010theroleof pages 5-6, hamann2024desenvolvimentodecomplexosa pages 26-29)

1) Key concepts and definitions with current understanding
• Identity and mapping: xynZ encodes Xylanase Z in C. thermocellum and maps to locus tag Cthe_1963. It is a multidomain, cellulosomal hemicellulase. (hamann2024desenvolvimentodecomplexosa pages 20-23)
• Domain architecture: XynZ is reported as CE1 (feruloyl esterase) – CBM6 – Dockerin type I – GH10 endo-β-1,4-xylanase. The GH10 domain is the catalytic xylanase module (EC 3.2.1.8), CBM6 targets xylan, and the type-I dockerin mediates incorporation into the extracellular cellulosome via cohesin I on the CipA scaffoldin. (hamann2024desenvolvimentodecomplexosa pages 20-23, heinze2017identificationofendoxylanase pages 1-2, hamann2024desenvolvimentodecomplexos pages 17-20)
• Enzymatic function and specificity: As a GH10 endo-1,4-β-xylanase, XynZ cleaves internal β-1,4 linkages in xylan backbones, with GH10 enzymes generally tolerating substitutions (e.g., arabinofuranosyl, MeGlcA) better than GH11. Cellulosomal xylanase activity commonly yields xylo-oligosaccharides (notably xylobiose) with minimal xylose without additional β-xylosidases. (hamann2024desenvolvimentodecomplexosb pages 17-20, hamann2024desenvolvimentodecomplexos pages 17-20, hamann2024desenvolvimentodecomplexos pages 20-23, hamann2024desenvolvimentodecomplexosb pages 20-23)
• Accessory activity: The N-terminal CE1 in XynZ confers feruloyl esterase–type activity consistent with removal of ferulate/acetyl decorations that hinder backbone hydrolysis in plant xylans. (hamann2024desenvolvimentodecomplexos pages 20-23, hamann2024desenvolvimentodecomplexosb pages 20-23, heinze2017identificationofendoxylanase pages 1-2)
• Localization and biological role: XynZ is exported and assembled into the cellulosome (extracellular) through its dockerin I, contributing to the coordinated deconstruction of hemicellulose in plant cell walls alongside cellulases and accessory enzymes. (hamann2024desenvolvimentodecomplexosa pages 20-23, hamann2024desenvolvimentodecomplexos pages 17-20, hamann2024desenvolvimentodecomplexosa pages 17-20)

Useful URLs/DOIs
• XynE family context (Scientific Reports 2017; situates XynZ among C. thermocellum xylanases): https://doi.org/10.1038/s41598-017-11598-y (published 2017-09-15). (heinze2017identificationofendoxylanase pages 1-2, heinze2017identificationofendoxylanase pages 8-9)
• Cellulosome background and proteomics: Gold & Martin 2007 J. Bacteriol., DOI: 10.1128/JB.00882-07 (published 2007-10); accessible via https://doi.org/10.1128/jb.00882-07. (hamann2024desenvolvimentodecomplexos pages 17-20)

2) Recent developments and latest research (prioritize 2023–2024)
• Regulatory models for hemicellulases (2024): A current review emphasizes canonical mechanisms for bacterial hemicellulase gene regulation (hybrid two-component systems, ECF-σ/anti-σ, CCR) and transcriptomic strategies, aligning with observations that xylan exposure and σI-family regulators induce xylanase loci in C. thermocellum. URL: https://doi.org/10.1007/s00253-023-12977-4 (Applied Microbiology and Biotechnology, 2024-01). (heinze2017identificationofendoxylanase pages 1-2)
• Enzymology and cellulosome context (2024): A 2024 synthesis on C. thermocellum xylanases documents xynZ=Cthe_1963, multi-enzyme cellulosome organization, and modular roles (CE1–CBM6–DocI–GH10) with condition-dependent abundance, highlighting XynZ among major cellulosomal hemicellulases. Although presented as a synthesis, it compiles primary literature up to 2024. (hamann2024desenvolvimentodecomplexosa pages 20-23, hamann2024desenvolvimentodecomplexos pages 20-23, hamann2024desenvolvimentodecomplexosb pages 20-23)
• Broad lignocellulose and cellulosome updates (2024): Reviews underscore improved understanding of bacterial/fungal cellulosomes, the 50-fold potential efficiency advantage of cellulosomal versus free enzymes, and ongoing challenges in designer cellulosomes. URL: https://doi.org/10.1101/2024.11.06.622210 (bioRxiv preprint, 2024-11-06). (hamann2024desenvolvimentodecomplexosb pages 26-29)
• Application-focused process studies (2024): Lime pretreatment of maize stover combined with enzymatic hydrolysis and C. thermocellum fermentation reports typical xylanase loadings and points to the potency of C. thermocellum cellulosomes in fermentations of pretreated solids. URL: https://doi.org/10.25165/j.ijabe.20241704.6571 (International Journal of Agricultural and Biological Engineering, 2024-01). (hamann2024desenvolvimentodecomplexosc pages 26-29)
• Background recent review (2024): General enzymatic degradation and cellulosome mechanisms derived from C. thermocellum are summarized with emphasis on cellulosome architecture and action. URL: https://doi.org/10.1016/j.heliyon.2024.e24022 (Heliyon, 2024-01). (hamann2024desenvolvimentodecomplexosb pages 26-29)

3) Current applications and real-world implementations
• Consolidated bioprocessing (CBP) with C. thermocellum: During CBP of poplar biomass, C. thermocellum solubilized 24% cellulose and 17% non-cellulosic sugars in 120 h, with xylose and mannose accumulating in liquor while arabinose and galactose did not, reflecting differential utilization of hemicellulose and pectin fractions. This highlights the practical role of the cellulosomal hemicellulase set (including XynZ) in industrially relevant fermentations. DOI: 10.1186/s13068-022-02119-9 (2022-02). (hamann2024desenvolvimentodecomplexosa pages 17-20)
• Pretreatment+enzymes+thermophile workflows: Lime-pretreated maize stover studies document enzyme dosing (e.g., cellulase 8 mg/g solids; xylanase 2 mg/g solids) and note that subsequent C. thermocellum fermentation benefits from its cellulosomal enzyme system, a path in biorefineries using mixed enzyme-thermophile pipelines. DOI: 10.25165/j.ijabe.20241704.6571 (2024-01). (hamann2024desenvolvimentodecomplexosc pages 26-29)
• Designer/engineered complexes: Contemporary reviews (2024) summarize the potential of designer cellulosomes and multi-enzyme assemblies inspired by C. thermocellum; despite stability/efficiency gaps relative to native cellulosomes, these are being explored for process intensification. URL: https://doi.org/10.1101/2024.11.06.622210 (2024-11-06). (hamann2024desenvolvimentodecomplexosb pages 26-29)

4) Expert opinions and analysis from authoritative sources
• Hemicellulase regulation: Expert review (2024) stresses HTCS, ECF-σ/anti-σ systems and CCR as dominant regulatory motifs and advocates integrating genetic tools for Gram-positive systems like C. thermocellum to align degradative and fermentative capabilities—a view consistent with transcriptional induction of xylanases on complex plant feedstocks. URL: https://doi.org/10.1007/s00253-023-12977-4 (2024-01). (heinze2017identificationofendoxylanase pages 1-2)
• Cellulosome efficiency and design: Recent syntheses attribute major advantages (order-of-magnitude improvements) to native cellulosomes and recommend focusing on stability/assembly rules for designer constructs; these perspectives frame XynZ’s dockerin-mediated assembly as central to performance. URL: https://doi.org/10.1101/2024.11.06.622210 (2024-11-06). (hamann2024desenvolvimentodecomplexosb pages 26-29)
• Proteomics-based composition: Foundational proteomics show dozens of dockerin-containing proteins in C. thermocellum cellulosomes, with multiple xylanases among major catalytic subunits; such datasets remain a basis for expert interpretation of hemicellulase roles. DOI: 10.1128/JB.00882-07 (2007-10). (hamann2024desenvolvimentodecomplexos pages 17-20)

5) Relevant statistics and data from recent studies
• Kinetic optima and thermostability for XynZ modules: The GH10 catalytic domain of XynZ shows maximum activity around 70 °C and is thermally stable at 60 °C; its feruloyl esterase (CE1) module shows maximal activity at 50–60 °C across pH 4–7, stable for ~6 h at 70 °C with 50% activity loss in ~3 h at 80 °C. Truncation studies indicate the absence of a CBM reduces GH10 thermal stability in XynZ, underscoring CBM contributions to robustness. (hamann2024desenvolvimentodecomplexosc pages 26-29, hamann2024desenvolvimentodecomplexosb pages 26-29, hamann2024desenvolvimentodecomplexos pages 26-29, hamann2024desenvolvimentodecomplexosa pages 26-29)
• Cellulosome composition and hemicellulase representation: Surveys report many dockerin-containing proteins (≈81) with ~20+ implicated in hemicellulose breakdown; XynZ is detected among major xylanases in cellulosomal preparations, with abundance influenced by growth substrate (e.g., elevated on cellobiose vs. cellulose in some reports). (hamann2024desenvolvimentodecomplexosa pages 20-23, hamann2024desenvolvimentodecomplexos pages 20-23, hamann2024desenvolvimentodecomplexosb pages 20-23)
• CBP performance metrics: In poplar CBP with C. thermocellum, total biomass solubilization progressed to 13.7% by 120 h (24% cellulose; 17% non-cellulosic sugars), and sugars partitioned differentially between liquor and residue—quantitative context for hemicellulase outputs. DOI: 10.1186/s13068-022-02119-9 (2022-02). (hamann2024desenvolvimentodecomplexosa pages 17-20)
• Process dosing and observations in pretreatment-integrated workflows: Lime-pretreated maize stover followed by enzymatic hydrolysis commonly uses cellulase and xylanase loads on the order of 8 and 2 mg/g solids, respectively, with C. thermocellum fermentation leveraged for potent enzyme systems—indicative, real-world loading scales and integration. DOI: 10.25165/j.ijabe.20241704.6571 (2024-01). (hamann2024desenvolvimentodecomplexosc pages 26-29)

Functional annotation summary for xynZ (P10478; Cthe_1963)
• Primary function: Endo-1,4-β-xylanase (EC 3.2.1.8), GH10 family; depolymerizes β-1,4-xylan in hemicellulose and can act on substituted xylans typical of GH10 enzymes. (heinze2017identificationofendoxylanase pages 1-2, hamann2024desenvolvimentodecomplexosb pages 17-20)
• Accessory activity: CE1 feruloyl esterase module likely removes ferulate/acetyl decorations to facilitate backbone access. (hamann2024desenvolvimentodecomplexos pages 20-23, hamann2024desenvolvimentodecomplexosb pages 20-23, heinze2017identificationofendoxylanase pages 1-2)
• Domains: CE1–CBM6–Dockerin I–GH10; exported signal peptide and dockerin-mediated cellulosome assembly. (hamann2024desenvolvimentodecomplexosa pages 20-23, heinze2017identificationofendoxylanase pages 1-2)
• Products: Predominantly xylo-oligosaccharides (xylobiose) in cellulosomal context, requiring partner β-xylosidases for complete xylose release. (hamann2024desenvolvimentodecomplexos pages 20-23, hamann2024desenvolvimentodecomplexosb pages 20-23)
• Localization: Extracellular; incorporated into cellulosomes attached to the cell surface via CipA scaffoldins. (hamann2024desenvolvimentodecomplexosa pages 20-23, hamann2024desenvolvimentodecomplexos pages 17-20, hamann2024desenvolvimentodecomplexosa pages 17-20)
• Pathway/complex: Hemicellulose deconstruction within the cellulosome, synergizing with cellulases and debranching enzymes during plant biomass degradation and CBP. (hamann2024desenvolvimentodecomplexos pages 17-20, hamann2024desenvolvimentodecomplexosa pages 17-20)
• Regulation/context: Expression induced by xylan-containing substrates and σI-family regulatory circuits; transcriptomics on pretreated biomass show upregulation of xylanase loci including xynZ compared to simple sugars. (hamann2024desenvolvimentodecomplexosc pages 26-29, hamann2024desenvolvimentodecomplexosb pages 26-29, hamann2024desenvolvimentodecomplexosa pages 26-29)
• Biophysical properties: GH10 optimum ~70 °C; CE1 optimum 50–60 °C, pH 4–7; CE1 stable ~6 h at 70 °C and loses ~50% activity in ~3 h at 80 °C; CBM presence enhances stability. (hamann2024desenvolvimentodecomplexosc pages 26-29, hamann2024desenvolvimentodecomplexosb pages 26-29, hamann2024desenvolvimentodecomplexos pages 26-29)

Ambiguity check and alignment with UniProt brief
• Gene symbol: “xynZ” corresponds to Endo-1,4-β-xylanase Z in C. thermocellum and maps to Cthe_1963, matching the user-provided UniProt context. (hamann2024desenvolvimentodecomplexosa pages 20-23)
• Organism: Evidence explicitly references C. thermocellum (Acetivibrio thermocellus), the specified organism. (hamann2024desenvolvimentodecomplexosa pages 20-23, heinze2017identificationofendoxylanase pages 1-2)
• Domains/family: GH10 catalytic module, CBM6, dockerin type I, and an N-terminal CE1 esterase are supported in literature; these align with the UniProt-specified family and domains. (hamann2024desenvolvimentodecomplexosa pages 20-23, heinze2017identificationofendoxylanase pages 1-2, hamann2024desenvolvimentodecomplexos pages 17-20)

Cited sources with URLs/DOIs and dates (selection)
• Heinze et al., Scientific Reports 2017 (family context; module mapping figures): https://doi.org/10.1038/s41598-017-11598-y (2017-09-15). (heinze2017identificationofendoxylanase pages 1-2, heinze2017identificationofendoxylanase pages 8-9)
• Gold & Martin, J. Bacteriol. 2007 (cellulosome proteomics): https://doi.org/10.1128/jb.00882-07 (2007-10). (hamann2024desenvolvimentodecomplexos pages 17-20)
• Novak & Gardner, Appl. Microbiol. Biotechnol. 2024 (regulation review): https://doi.org/10.1007/s00253-023-12977-4 (2024-01). (heinze2017identificationofendoxylanase pages 1-2)
• Hsin et al., bioRxiv 2024 (lignocellulose degradation; designer cellulosomes): https://doi.org/10.1101/2024.11.06.622210 (2024-11-06). (hamann2024desenvolvimentodecomplexosb pages 26-29)
• Wang et al., Int. J. Agric. Biol. Eng. 2024 (lime pretreatment + enzymes + C. thermocellum): https://doi.org/10.25165/j.ijabe.20241704.6571 (2024-01). (hamann2024desenvolvimentodecomplexosc pages 26-29)
• Biswal et al., Biotechnol. Biofuels Bioprod. 2022 (CBP sugar solubilization/partitioning): https://doi.org/10.1186/s13068-022-02119-9 (2022-02-22). (hamann2024desenvolvimentodecomplexosa pages 17-20)
• Datta, Heliyon 2024 (cellulosome overview): https://doi.org/10.1016/j.heliyon.2024.e24022 (2024-01). (hamann2024desenvolvimentodecomplexosb pages 26-29)

Limitations and open needs
• While the 2024 synthesis provides clear mapping of xynZ to Cthe_1963 and modular architecture, a stable DOI/URL for that specific synthesis was not available in our excerpt. We therefore anchored to peer-reviewed 2017 and classic cellulosome literature, plus multiple 2024, peer-reviewed reviews for up-to-date context. (hamann2024desenvolvimentodecomplexosa pages 20-23, heinze2017identificationofendoxylanase pages 1-2, hamann2024desenvolvimentodecomplexos pages 17-20, hamann2024desenvolvimentodecomplexosb pages 26-29)

Overall conclusion
XynZ (xynZ=Cthe_1963; UniProt P10478) in Acetivibrio thermocellus is a modular, extracellular GH10 endo-β-1,4-xylanase with a CBM6 and type-I dockerin that integrates it into the cellulosome. An N-terminal CE1 feruloyl esterase augments debranching, facilitating backbone hydrolysis of substituted xylans. XynZ operates in the cellulosomal hemicellulase set to generate xylo-oligosaccharides during plant biomass deconstruction, with expression induced by xylanaceous substrates and σI-family circuits. Its GH10 domain is optimally active near 70 °C, while the esterase functions at 50–60 °C across pH 4–7, with notable thermostability influenced by CBM presence. These features position XynZ as a key contributor to C. thermocellum’s industrially relevant lignocellulose deconstruction in CBP and pretreatment-integrated workflows. (hamann2024desenvolvimentodecomplexosa pages 20-23, heinze2017identificationofendoxylanase pages 1-2, hamann2024desenvolvimentodecomplexos pages 17-20, hamann2024desenvolvimentodecomplexosb pages 17-20, hamann2024desenvolvimentodecomplexos pages 20-23, hamann2024desenvolvimentodecomplexosb pages 20-23, hamann2024desenvolvimentodecomplexos pages 26-29, hamann2024desenvolvimentodecomplexosc pages 26-29, hamann2024desenvolvimentodecomplexosa pages 17-20)

References

  1. (hamann2024desenvolvimentodecomplexosa pages 20-23): PRV Hamann. Desenvolvimento de complexos enzimáticos baseados em xilanases de clostridium thermocellum. Unknown journal, 2024.

  2. (heinze2017identificationofendoxylanase pages 1-2): Simon Heinze, Matthias Mechelke, Petra Kornberger, Wolfgang Liebl, Wolfgang H. Schwarz, and Vladimir V. Zverlov. Identification of endoxylanase xyne from clostridium thermocellum as the first xylanase of glycoside hydrolase family gh141. Scientific Reports, Sep 2017. URL: https://doi.org/10.1038/s41598-017-11598-y, doi:10.1038/s41598-017-11598-y. This article has 40 citations and is from a peer-reviewed journal.

  3. (hamann2024desenvolvimentodecomplexos pages 17-20): PRV Hamann. Desenvolvimento de complexos enzimáticos baseados em xilanases de clostridium thermocellum. Unknown journal, 2024.

  4. (hamann2024desenvolvimentodecomplexosb pages 17-20): PRV Hamann. Desenvolvimento de complexos enzimáticos baseados em xilanases de clostridium thermocellum. Unknown journal, 2024.

  5. (hamann2024desenvolvimentodecomplexos pages 20-23): PRV Hamann. Desenvolvimento de complexos enzimáticos baseados em xilanases de clostridium thermocellum. Unknown journal, 2024.

  6. (hamann2024desenvolvimentodecomplexosb pages 20-23): PRV Hamann. Desenvolvimento de complexos enzimáticos baseados em xilanases de clostridium thermocellum. Unknown journal, 2024.

  7. (hamann2024desenvolvimentodecomplexos pages 26-29): PRV Hamann. Desenvolvimento de complexos enzimáticos baseados em xilanases de clostridium thermocellum. Unknown journal, 2024.

  8. (hamann2024desenvolvimentodecomplexosc pages 26-29): PRV Hamann. Desenvolvimento de complexos enzimáticos baseados em xilanases de clostridium thermocellum. Unknown journal, 2024.

  9. (hamann2024desenvolvimentodecomplexosb pages 26-29): PRV Hamann. Desenvolvimento de complexos enzimáticos baseados em xilanases de clostridium thermocellum. Unknown journal, 2024.

  10. (hamann2024desenvolvimentodecomplexosa pages 17-20): PRV Hamann. Desenvolvimento de complexos enzimáticos baseados em xilanases de clostridium thermocellum. Unknown journal, 2024.

  11. (heinze2017identificationofendoxylanase pages 8-9): Simon Heinze, Matthias Mechelke, Petra Kornberger, Wolfgang Liebl, Wolfgang H. Schwarz, and Vladimir V. Zverlov. Identification of endoxylanase xyne from clostridium thermocellum as the first xylanase of glycoside hydrolase family gh141. Scientific Reports, Sep 2017. URL: https://doi.org/10.1038/s41598-017-11598-y, doi:10.1038/s41598-017-11598-y. This article has 40 citations and is from a peer-reviewed journal.

  12. (selvaraj2010theroleof pages 5-6): Thangaswamy Selvaraj, Sung Kyum Kim, Yong Ho Kim, Yu Seok Jeong, Yu-Jeong Kim, Nguyen Dinh Phuong, Kyung Hwa Jung, Jungho Kim, Han Dae Yun, and Hoon Kim. The role of carbohydrate-binding module (cbm) repeat of a multimodular xylanase (xynx) from clostridium thermocellum in cellulose and xylan binding. The Journal of Microbiology, 48:856-861, Dec 2010. URL: https://doi.org/10.1007/s12275-010-0285-5, doi:10.1007/s12275-010-0285-5. This article has 24 citations.

  13. (hamann2024desenvolvimentodecomplexosa pages 26-29): PRV Hamann. Desenvolvimento de complexos enzimáticos baseados em xilanases de clostridium thermocellum. Unknown journal, 2024.

Citations

  1. hamann2024desenvolvimentodecomplexosa pages 20-23
  2. hamann2024desenvolvimentodecomplexos pages 17-20
  3. heinze2017identificationofendoxylanase pages 1-2
  4. hamann2024desenvolvimentodecomplexosb pages 26-29
  5. hamann2024desenvolvimentodecomplexosc pages 26-29
  6. hamann2024desenvolvimentodecomplexosa pages 17-20
  7. hamann2024desenvolvimentodecomplexosb pages 17-20
  8. hamann2024desenvolvimentodecomplexos pages 20-23
  9. hamann2024desenvolvimentodecomplexosb pages 20-23
  10. hamann2024desenvolvimentodecomplexos pages 26-29
  11. heinze2017identificationofendoxylanase pages 8-9
  12. selvaraj2010theroleof pages 5-6
  13. hamann2024desenvolvimentodecomplexosa pages 26-29
  14. https://doi.org/10.1038/s41598-017-11598-y
  15. https://doi.org/10.1128/jb.00882-07.
  16. https://doi.org/10.1007/s00253-023-12977-4
  17. https://doi.org/10.1101/2024.11.06.622210
  18. https://doi.org/10.25165/j.ijabe.20241704.6571
  19. https://doi.org/10.1016/j.heliyon.2024.e24022
  20. https://doi.org/10.1128/jb.00882-07
  21. https://doi.org/10.1186/s13068-022-02119-9
  22. https://doi.org/10.1038/s41598-017-11598-y,
  23. https://doi.org/10.1007/s12275-010-0285-5,

📄 View Raw YAML

id: P10478
gene_symbol: xynZ
aliases:
  - XynZ
  - Xylanase Z
  - Endo-1,4-beta-xylanase Z
product_type: PROTEIN
status: DRAFT
taxon:
  id: NCBITaxon:203119
  label: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / 
    NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372)
description: >-
  XynZ (Xylanase Z) is a modular cellulosomal enzyme from Acetivibrio thermocellus
  (formerly Clostridium thermocellum) with dual catalytic activities. The protein
  contains four distinct domains: an N-terminal CE1 feruloyl esterase domain that
  removes ferulate and acetyl decorations from xylan, a CBM6 carbohydrate-binding
  module that targets xylan substrates, a type I dockerin domain that mediates
  incorporation into the cellulosome via binding to cohesin domains on the CipA
  scaffoldin, and a C-terminal GH10 endo-1,4-beta-xylanase catalytic domain that
  cleaves internal beta-1,4 linkages in xylan backbones. The protein is exported
  and functions as part of the extracellular cellulosome complex, where it
  contributes to the coordinated degradation of plant cell wall hemicellulose.
  The GH10 domain shows optimal activity around 70C, consistent with the
  thermophilic lifestyle of this organism. XynZ produces xylo-oligosaccharides,
  primarily xylobiose, as hydrolysis products.
existing_annotations:
  - term:
      id: GO:0000272
      label: polysaccharide catabolic process
    evidence_type: IEA
    original_reference_id: GO_REF:0000120
    review:
      summary: >-
        This annotation is based on InterPro domains (Dockerin) and UniProt keywords.
        While XynZ does participate in polysaccharide catabolism, specifically xylan
        degradation, this term is overly broad. The protein's primary catabolic
        activity is specifically on xylan, a type of hemicellulose, not polysaccharides
        in general. The more specific term GO:0045493 (xylan catabolic process) is
        already annotated and should be preferred.
      action: MARK_AS_OVER_ANNOTATED
      reason: >-
        While technically correct since xylan is a polysaccharide, this term is too
        general for XynZ which has well-characterized xylan-specific activity. The
        GH10 domain specifically catalyzes endo-1,4-beta-xylanase activity, and
        the more specific annotation to GO:0045493 (xylan catabolic process) is
        more informative. This IEA annotation adds no information beyond the
        existing xylan catabolic process annotation.
      supported_by:
        - reference_id: file:ACET2/P10478/P10478-deep-research-falcon.md
          supporting_text: "As a GH10 endo-1,4-beta-xylanase, XynZ cleaves internal
            beta-1,4 linkages in xylan backbones"

  - term:
      id: GO:0004553
      label: hydrolase activity, hydrolyzing O-glycosyl compounds
    evidence_type: IEA
    original_reference_id: GO_REF:0000002
    review:
      summary: >-
        This annotation is inferred from the GH10 and dockerin InterPro domains.
        The term correctly captures the general enzymatic mechanism of the GH10
        xylanase domain but is less specific than endo-1,4-beta-xylanase activity
        (GO:0031176) which is the precise function of the GH10 domain.
      action: ACCEPT
      reason: >-
        This is a valid parent term annotation based on InterPro domain prediction.
        While more specific terms exist (GO:0031176), hierarchical IEA annotations
        to parent terms are acceptable as they correctly capture the enzyme class.
        The GH10 domain does indeed hydrolyze O-glycosyl compounds (beta-1,4-xylosidic
        linkages in xylan).
      supported_by:
        - reference_id: file:ACET2/P10478/P10478-deep-research-falcon.md
          supporting_text: "The GH10 domain is the catalytic xylanase module (EC 3.2.1.8)"

  - term:
      id: GO:0005975
      label: carbohydrate metabolic process
    evidence_type: IEA
    original_reference_id: GO_REF:0000002
    review:
      summary: >-
        This annotation is inferred from the GH10 InterPro domain. It is a very
        high-level biological process term that provides minimal specific
        information about XynZ function.
      action: MARK_AS_OVER_ANNOTATED
      reason: >-
        While XynZ participates in carbohydrate metabolism, this term is too
        general. The protein is specifically involved in xylan catabolic process
        (GO:0045493), which is already annotated. This high-level IEA annotation
        does not add meaningful functional information and could be misleading
        about the specificity of the enzyme's role.

  - term:
      id: GO:0016787
      label: hydrolase activity
    evidence_type: IEA
    original_reference_id: GO_REF:0000043
    review:
      summary: >-
        This annotation is based on the UniProtKB keyword "Hydrolase". It correctly
        identifies XynZ as having hydrolase activity, consistent with both its
        GH10 xylanase and CE1 esterase domains.
      action: ACCEPT
      reason: >-
        This is a valid high-level molecular function annotation. XynZ has two
        hydrolase activities: the GH10 endo-1,4-beta-xylanase (glycoside hydrolase)
        and the CE1 feruloyl esterase (carboxylic ester hydrolase). While more
        specific terms exist, this parent annotation is technically correct.
      supported_by:
        - reference_id: file:ACET2/P10478/P10478-deep-research-falcon.md
          supporting_text: "XynZ is a modular, extracellular GH10 endo-beta-1,4-xylanase"

  - term:
      id: GO:0016798
      label: hydrolase activity, acting on glycosyl bonds
    evidence_type: IEA
    original_reference_id: GO_REF:0000043
    review:
      summary: >-
        This annotation is based on the UniProtKB keyword "Glycosidase". It
        correctly describes the activity of the GH10 domain which hydrolyzes
        beta-1,4-xylosidic (glycosyl) bonds in xylan.
      action: ACCEPT
      reason: >-
        Valid intermediate-level annotation. The GH10 endo-1,4-beta-xylanase
        domain hydrolyzes glycosyl bonds. This is more specific than GO:0016787
        but less specific than GO:0031176. Acceptable as an IEA annotation that
        correctly places the enzyme in the glycosidase hierarchy.
      supported_by:
        - reference_id: file:ACET2/P10478/P10478-deep-research-falcon.md
          supporting_text: "GH10 endo-beta-1,4-xylanase cleaves internal beta-1,4
            linkages in xylan backbones"

  - term:
      id: GO:0030246
      label: carbohydrate binding
    evidence_type: IEA
    original_reference_id: GO_REF:0000002
    review:
      summary: >-
        This annotation is inferred from the CBM6 (Carbohydrate Binding Module
        family 6) and cellulose-binding domain (CBD_IV) InterPro entries. XynZ
        does have carbohydrate binding capability via its CBM6 domain.
      action: MODIFY
      reason: >-
        While the annotation is directionally correct (XynZ does bind carbohydrates),
        the CBM6 domain in XynZ specifically binds xylan substrates to facilitate
        enzymatic activity. A more appropriate term would be GO:2001062 (xylan
        binding), which specifically describes binding to xylan polymers. The CBM6
        domain in C. thermocellum xylanases is known to target xylan, enhancing
        the enzyme's effectiveness on this substrate.
      proposed_replacement_terms:
        - id: GO:2001062
          label: xylan binding
      supported_by:
        - reference_id: file:ACET2/P10478/P10478-deep-research-falcon.md
          supporting_text: "CBM6 targets xylan"

  - term:
      id: GO:0031176
      label: endo-1,4-beta-xylanase activity
    evidence_type: IEA
    original_reference_id: GO_REF:0000120
    review:
      summary: >-
        This annotation is based on the EC number 3.2.1.8 and ARBA rule
        ARBA00026761. The term accurately describes the primary catalytic
        activity of the GH10 domain, which catalyzes endohydrolysis of
        (1->4)-beta-D-xylosidic linkages in xylans.
      action: ACCEPT
      reason: >-
        This is the correct specific molecular function for the GH10 domain
        of XynZ. The enzyme is classified as EC 3.2.1.8 (endo-1,4-beta-xylanase)
        and belongs to glycosyl hydrolase family 10, both of which are
        consistent with this annotation. This represents a core function of
        the protein.
      supported_by:
        - reference_id: PMID:3139632
          supporting_text: "Deletions removing up to 508 codons from the 5' end of
            the gene did not affect the activity of the encoded polypeptide, showing
            that the active site was located in the C-terminal half of the protein"
        - reference_id: file:ACET2/P10478/P10478-deep-research-falcon.md
          supporting_text: "The GH10 domain is the catalytic xylanase module (EC 3.2.1.8)"

  - term:
      id: GO:0045493
      label: xylan catabolic process
    evidence_type: IEA
    original_reference_id: GO_REF:0000043
    review:
      summary: >-
        This annotation is based on the UniProtKB keyword "Xylan degradation".
        XynZ participates in xylan catabolism as part of the cellulosome
        complex, where it works synergistically with other hemicellulases to
        break down xylan polymers.
      action: ACCEPT
      reason: >-
        This is the appropriate biological process annotation for XynZ. The
        enzyme's endo-1,4-beta-xylanase activity directly contributes to
        xylan breakdown, producing xylo-oligosaccharides. This represents
        the core biological role of the protein in C. thermocellus
        lignocellulose degradation.
      supported_by:
        - reference_id: file:ACET2/P10478/P10478-deep-research-falcon.md
          supporting_text: "Cellulosomal xylanase activity commonly yields xylo-oligosaccharides
            (notably xylobiose) with minimal xylose without additional beta-xylosidases"

  - term:
      id: GO:0031176
      label: endo-1,4-beta-xylanase activity
    evidence_type: IDA
    original_reference_id: PMID:3139632
    review:
      summary: >-
        This experimental annotation is based on direct assay data from
        Grepinet et al. 1988, who cloned and characterized the xynZ gene.
        The paper demonstrates xylanase activity of the encoded protein
        through deletion analysis and expression in E. coli.
      action: ACCEPT
      reason: >-
        This is a well-supported experimental annotation. PMID:3139632
        reports the cloning and characterization of xynZ, demonstrating
        that the C-terminal portion of the protein encodes the active
        xylanase. The paper shows that deletions removing up to 508
        codons from the 5' end did not affect the activity, confirming
        that the xylanase catalytic domain is in the C-terminal GH10 region.
        This is a core function of the protein.
      supported_by:
        - reference_id: PMID:3139632
          supporting_text: "Deletions removing up to 508 codons from the 5' end of
            the gene did not affect the activity of the encoded polypeptide, showing
            that the active site was located in the C-terminal half of the protein"

  - term:
      id: GO:0033905
      label: xylan endo-1,3-beta-xylosidase activity
    evidence_type: IDA
    original_reference_id: PMID:3139632
    review:
      summary: >-
        This annotation suggests XynZ has activity on 1,3-beta-xylosidic
        linkages. However, the deep research and UniProt characterization
        consistently identify XynZ as a GH10 family endo-1,4-beta-xylanase
        (EC 3.2.1.8), not as having 1,3-xylosidase activity. The referenced
        paper (PMID:3139632) does not provide evidence for 1,3-beta-xylosidase
        activity.
      action: REMOVE
      reason: >-
        This annotation appears to be erroneous. GH10 family enzymes,
        including XynZ, catalyze hydrolysis of 1,4-beta-xylosidic linkages,
        not 1,3-beta linkages. The referenced paper (PMID:3139632) describes
        the cloning and deletion analysis of xynZ and demonstrates xylanase
        activity, but does not provide evidence for 1,3-xylosidase activity.
        UniProt explicitly classifies XynZ as EC 3.2.1.8 (endo-1,4-beta-xylanase)
        and places it in the GH10 family, which is inconsistent with
        xylan endo-1,3-beta-xylosidase activity (EC 3.2.1.32). This appears
        to be an annotation error that should be removed.
      supported_by:
        - reference_id: file:ACET2/P10478/P10478-deep-research-falcon.md
          supporting_text: "XynZ (xynZ=Cthe_1963; UniProt P10478) in Acetivibrio thermocellus
            is a modular, extracellular GH10 endo-beta-1,4-xylanase"

# New annotations that should be added based on comprehensive review
        - reference_id: PMID:3139632
          supporting_text: Nucleotide sequence and deletion analysis of the 
            xylanase gene (xynZ) of Clostridium thermocellum.
  - term:
      id: GO:0030600
      label: feruloyl esterase activity
    evidence_type: ISS
    original_reference_id: GO_REF:0000002
    review:
      summary: >-
        XynZ contains an N-terminal CE1 (Carbohydrate Esterase family 1)
        domain that has feruloyl esterase activity. This domain removes
        ferulate decorations from xylan, facilitating backbone hydrolysis
        by the GH10 domain. This dual-function architecture is well
        documented for XynZ.
      action: NEW
      reason: >-
        The CE1 domain of XynZ provides feruloyl esterase activity, which
        is a documented accessory function of this enzyme. UniProt cross-references
        ESTHER database (clotm-xynz; A85-Feruloyl-Esterase) and BRENDA entry for
        EC 3.1.1.73 (feruloyl esterase). This is a core function of the protein
        that should be annotated.
      proposed_replacement_terms:
        - id: GO:0030600
          label: feruloyl esterase activity
      supported_by:
        - reference_id: file:ACET2/P10478/P10478-deep-research-falcon.md
          supporting_text: "The N-terminal CE1 in XynZ confers feruloyl esterase-type
            activity consistent with removal of ferulate/acetyl decorations that hinder
            backbone hydrolysis in plant xylans"

  - term:
      id: GO:0043263
      label: cellulosome
    evidence_type: IDA
    original_reference_id: PMID:3139632
    review:
      summary: >-
        XynZ contains a type I dockerin domain that mediates its incorporation
        into the cellulosome complex via binding to cohesin domains on the
        CipA scaffoldin protein. The protein functions as part of this
        extracellular multi-enzyme complex.
      action: NEW
      reason: >-
        XynZ is a cellulosomal enzyme that is incorporated into the
        cellulosome via its dockerin domain. UniProt identifies the Dockerin
        domain (residues 424-492) and references the Dockerin-related InterPro
        entries. PMID:3139632 identified a duplicated segment strongly similar
        to the conserved domain found in C. thermocellum endoglucanases, which
        is the dockerin domain. This cellular component annotation would
        accurately reflect the protein's localization and function as part of
        the cellulosome complex.
      proposed_replacement_terms:
        - id: GO:0043263
          label: cellulosome
      supported_by:
        - reference_id: PMID:3139632
          supporting_text: "A region of 60 amino acids containing a duplicated segment
            of 24 amino acids was found between residues 429 and 488 of xylanase Z.
            This region was strongly similar to the conserved domain found at the
            carboxy-terminal ends of C. thermocellum endoglucanases A, B, and D"
        - reference_id: file:ACET2/P10478/P10478-deep-research-falcon.md
          supporting_text: "XynZ is exported and assembled into the cellulosome (extracellular)
            through its dockerin I, contributing to the coordinated deconstruction
            of hemicellulose in plant cell walls alongside cellulases and accessory
            enzymes"

references:
  - id: GO_REF:0000002
    title: Gene Ontology annotation through association of InterPro records with
      GO terms
    findings: []
  - id: GO_REF:0000043
    title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword 
      mapping
    findings: []
  - id: GO_REF:0000120
    title: Combined Automated Annotation using Multiple IEA Methods
    findings: []
  - id: PMID:3139632
    title: Nucleotide sequence and deletion analysis of the xylanase gene (xynZ)
      of Clostridium thermocellum.
    findings:
      - statement: Cloning and sequencing of xynZ gene encoding 837 amino acid 
          polypeptide
        supporting_text: "The putative xynZ gene was 2,511 base pairs long and encoded
          a polypeptide of 837 amino acids"
      - statement: Identification of duplicated 24 amino acid segment (residues 
          429-488) similar to C. thermocellum endoglucanases - this is the 
          dockerin domain
        supporting_text: "A region of 60 amino acids containing a duplicated segment
          of 24 amino acids was found between residues 429 and 488 of xylanase Z.
          This region was strongly similar to the conserved domain found at the carboxy-terminal
          ends of C. thermocellum endoglucanases A, B, and D"
      - statement: Deletion analysis showing active site in C-terminal half of 
          protein (GH10 domain)
        supporting_text: "Deletions removing up to 508 codons from the 5' end of the
          gene did not affect the activity of the encoded polypeptide, showing that
          the active site was located in the C-terminal half of the protein"
      - statement: Expression of xylanase activity in E. coli
        supporting_text: "Expression of xylanase activity in Escherichia coli was
          increased up to 220-fold by fusing fragments containing the 3' end of the
          gene with the start of lacZ present in pUC19"
  - id: PMID:7664125
    title: A common protein fold and similar active site in two distinct 
      families of beta-glycanases.
    findings:
      - statement: X-ray crystal structure of GH10 domain (residues 515-837) at 
          1.4 angstrom resolution
        supporting_text: "The structure of the catalytic core of xylanase XynZ, which
          belongs to xylanase family F, has been determined at 1.4 A resolution"
      - statement: Structural characterization of the catalytic domain
        supporting_text: "In spite of significant differences in substrate specificity
          and structure (including the absence of the helical subdomain), the general
          polypeptide folding pattern, architecture of the active site and catalytic
          mechanism of XynZ and CelC are similar, suggesting a common evolutionary
          origin"
  - id: file:ACET2/P10478/P10478-deep-research-falcon.md
    title: Deep research synthesis on XynZ function and domain architecture
    findings:
      - statement: XynZ is a multi-domain cellulosomal enzyme with 
          CE1-CBM6-Dockerin I-GH10 architecture
        supporting_text: "Domain architecture: XynZ is reported as CE1 (feruloyl esterase)
          - CBM6 - Dockerin type I - GH10 endo-beta-1,4-xylanase"
      - statement: The GH10 domain catalyzes endo-1,4-beta-xylanase activity (EC
          3.2.1.8)
        supporting_text: "The GH10 domain is the catalytic xylanase module (EC 3.2.1.8)"
      - statement: The CE1 domain provides feruloyl esterase activity for 
          removing decorations from xylan
        supporting_text: "The N-terminal CE1 in XynZ confers feruloyl esterase-type
          activity consistent with removal of ferulate/acetyl decorations that hinder
          backbone hydrolysis in plant xylans"
      - statement: The protein is localized to the cellulosome via its dockerin 
          domain
        supporting_text: "XynZ is exported and assembled into the cellulosome (extracellular)
          through its dockerin I"

core_functions:
  - molecular_function:
      id: GO:0031176
      label: endo-1,4-beta-xylanase activity
    description: >-
      The primary enzymatic function of XynZ is as an endo-1,4-beta-xylanase
      (EC 3.2.1.8), catalyzed by the C-terminal GH10 domain. This enzyme
      cleaves internal beta-1,4-xylosidic linkages in xylan polymers,
      producing xylo-oligosaccharides (primarily xylobiose). The GH10
      domain shows optimal activity around 70C.
    directly_involved_in:
      - id: GO:0045493
        label: xylan catabolic process
    locations:
      - id: GO:0043263
        label: cellulosome
  - molecular_function:
      id: GO:0030600
      label: feruloyl esterase activity
    description: >-
      The N-terminal CE1 domain provides feruloyl esterase activity,
      removing ferulate and acetyl decorations from substituted xylans.
      This accessory activity facilitates access of the xylanase domain
      to the xylan backbone.
    directly_involved_in:
      - id: GO:0045493
        label: xylan catabolic process
    locations:
      - id: GO:0043263
        label: cellulosome

proposed_new_terms: []

suggested_questions:
  - question: >-
      Has the feruloyl esterase activity of the CE1 domain been directly
      demonstrated experimentally, and what is the substrate specificity?
      While the CE1 domain is predicted to have feruloyl esterase activity
      based on sequence similarity, direct experimental validation would
      strengthen the annotation.
  - question: >-
      What is the relative contribution of the CBM6 domain to substrate
      binding and catalytic efficiency on different xylan substrates?
      Understanding CBM6 function would support the proposed modification
      of the carbohydrate binding annotation to the more specific xylan
      binding term.

suggested_experiments:
  - description: >-
      Direct assay of feruloyl esterase activity using the isolated CE1
      domain or full-length protein with ferulated xylan substrates.
      This would provide experimental evidence for the proposed new annotation
      to GO:0030600 (feruloyl esterase activity).
  - description: >-
      Binding assays comparing xylan vs other carbohydrate substrates
      for the CBM6 domain. This would support the proposed modification
      from general carbohydrate binding to specific xylan binding.