xynY

UniProt ID: P51584
Organism: Acetivibrio thermocellus
Review Status: DRAFT
📝 Provide Detailed Feedback

Gene Description

XynY (Endo-1,4-beta-xylanase Y) is a modular cellulosomal hemicellulase from Acetivibrio thermocellum (formerly Clostridium thermocellum). The enzyme belongs to glycoside hydrolase family 10 (GH10) and catalyzes the endohydrolysis of 1,4-beta-D-xylosidic linkages in xylans, producing xylo-oligosaccharides. XynY has a complex multi-domain architecture comprising two N-terminal CBM22 carbohydrate-binding modules, a central GH10 catalytic domain, a CE1 feruloyl esterase domain, and a C-terminal type-I dockerin domain that mediates integration into the cellulosome via interaction with cohesin domains of the CipA scaffoldin. The enzyme functions extracellularly as part of the cellulosome complex, contributing to the degradation of plant cell wall hemicellulose. XynY displays optimal activity at pH 6.8 and 75 degrees Celsius, consistent with the thermophilic lifestyle of the organism.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0000272 polysaccharide catabolic process
IEA
GO_REF:0000120
ACCEPT
Summary: This annotation is derived from InterPro domain mappings (dockerin domains IPR002105, IPR016134, IPR036439 and polysaccharide degradation keyword KW-0624). While polysaccharide catabolic process is accurate, XynY specifically functions in xylan catabolism, which is a more specific child term. The annotation is not incorrect but is less informative than the more specific GO:0045493 (xylan catabolic process) which is also annotated.
Reason: The annotation is accurate as xylan is a polysaccharide, and XynY contributes to plant cell wall polysaccharide degradation. However, this is a parent term of the more specific xylan catabolic process (GO:0045493). Both can be retained as the broader term captures the enzyme's role in cellulosome-mediated polysaccharide degradation.
Supporting Evidence:
file:ACET2/P51584/P51584-deep-research-falcon.md
See deep research file for comprehensive analysis of XynY function in polysaccharide degradation
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
IEA
GO_REF:0000002
ACCEPT
Summary: This annotation derives from InterPro domain matches (IPR001000 GH10 domain, IPR002105 dockerin, IPR044846 GH10 family). The GH10 catalytic domain of XynY is indeed a glycoside hydrolase that hydrolyzes O-glycosyl compounds. However, this is a very general parent term; the more specific endo-1,4-beta-xylanase activity (GO:0031176) is also annotated and is more informative.
Reason: This is a correct parent term for endo-1,4-beta-xylanase activity. While not as informative as the more specific term GO:0031176, retaining parent terms in IEA annotations is acceptable as they capture the broader enzymatic class.
GO:0005975 carbohydrate metabolic process
IEA
GO_REF:0000002
ACCEPT
Summary: This annotation derives from InterPro domain matches (IPR001000 GH10 domain, IPR044846 GH10 family). Carbohydrate metabolic process is a very broad parent term. XynY specifically participates in xylan catabolic process (GO:0045493), which is the appropriate specific biological process term.
Reason: This is a correct but very general parent term. The annotation accurately captures that XynY is involved in carbohydrate metabolism (specifically catabolism of xylan). IEA mappings to broad parent terms are acceptable alongside more specific annotations.
GO:0016787 hydrolase activity
IEA
GO_REF:0000043
ACCEPT
Summary: This annotation derives from UniProtKB keyword mapping (KW-0378 Hydrolase). Hydrolase activity is the most general molecular function term for enzymes catalyzing hydrolysis reactions. XynY has much more specific molecular functions including endo-1,4-beta-xylanase activity (GO:0031176).
Reason: This is a correct but very general parent term. While the more specific GO:0031176 (endo-1,4-beta-xylanase activity) is more informative, the broad hydrolase activity term from keyword mapping is acceptable for IEA annotations.
GO:0016798 hydrolase activity, acting on glycosyl bonds
IEA
GO_REF:0000120
ACCEPT
Summary: This annotation derives from InterPro CBM-CenC domain (IPR003305) and glycosidase keyword (KW-0326). The term is accurate as XynY hydrolyzes glycosyl bonds in xylan. This is an intermediate term between the general GO:0016787 (hydrolase activity) and the specific GO:0031176 (endo-1,4-beta-xylanase activity).
Reason: This is a correct intermediate-level molecular function term. XynY's GH10 domain hydrolyzes beta-1,4-xylosidic bonds, which are glycosyl bonds. The annotation is accurate.
GO:0031176 endo-1,4-beta-xylanase activity
IEA
GO_REF:0000003
ACCEPT
Summary: This annotation derives from EC number mapping (EC:3.2.1.8). The enzyme is classified as EC 3.2.1.8 (endo-1,4-beta-xylanase) in UniProt based on experimental characterization. PMID:7717969 demonstrated that recombinant XynY rapidly hydrolyzed oat spelt, wheat and rye arabinoxylans and displayed features characteristic of an endo-beta1,4-xylanase. This is the core molecular function of the enzyme.
Reason: This is the correct and most informative molecular function term for XynY. The enzyme's endo-1,4-beta-xylanase activity is well-characterized biochemically. The GH10 catalytic domain contains the conserved catalytic glutamate residues (Glu337 proton donor, Glu460 nucleophile per UniProt feature annotation).
Supporting Evidence:
PMID:7717969
The encoded enzyme, xylanase Y (XYLY), displayed features characteristic of an endo-beta1,4-xylanase: the enzyme rapidly hydrolysed oat spelt, wheat and rye arabinoxylans and was active against methyl-umbelliferyl-beta-D-cellobioside, but did not hydrolyse any cellulosic substrates.
file:ACET2/P51584/P51584-deep-research-falcon.md
See deep research file for comprehensive analysis
GO:0045493 xylan catabolic process
IEA
GO_REF:0000043
ACCEPT
Summary: This annotation derives from UniProtKB keyword mapping (KW-0858 Xylan degradation). XynY catalyzes the breakdown of xylan, making this the appropriate specific biological process term. The enzyme is part of the cellulosome complex that degrades plant cell wall hemicellulose including xylan.
Reason: This is the correct specific biological process term for XynY. The enzyme's primary function is xylan catabolism, cleaving beta-1,4 glycosidic bonds in the xylan backbone to produce xylo-oligosaccharides. PMID:7717969 demonstrated xylanase activity on various xylan substrates.
Supporting Evidence:
PMID:7717969
the enzyme rapidly hydrolysed oat spelt, wheat and rye arabinoxylans and was active against methyl-umbelliferyl-beta-D-cellobioside, but did not hydrolyse any cellulosic substrates.
GO:0005515 protein binding
IPI
PMID:14623971
Cellulosome assembly revealed by the crystal structure of th...
MODIFY
Summary: This IPI annotation indicates XynY binds to CipA (Q06851), the scaffoldin protein of the cellulosome. The interaction is mediated by the type-I dockerin domain of XynY binding to cohesin domains of CipA. PMID:14623971 reports the crystal structure of the cohesin-dockerin complex from C. thermocellum at 2.2 Angstrom resolution, revealing the structural basis of cellulosome assembly. However, protein binding (GO:0005515) is too general and uninformative. A more specific term exists: GO:1990311 (type-I cohesin domain binding).
Reason: The protein binding annotation should be replaced with the more specific term GO:1990311 (type-I cohesin domain binding). XynY contains a type-I dockerin domain (residues 728-796 per UniProt) that specifically binds type-I cohesin domains of CipA. PMID:14623971 determined the crystal structure of the C. thermocellum cohesin-dockerin complex, directly demonstrating this specific interaction mechanism.
Proposed replacements: type-I cohesin domain binding
Supporting Evidence:
PMID:14623971
Here we report the structure of the cohesin-dockerin complex from Clostridium thermocellum at 2.2-A resolution. The data show that the beta-sheet cohesin domain interacts predominantly with one of the helices of the dockerin.
GO:0005515 protein binding
IPI
PMID:17360613
Evidence for a dual binding mode of dockerin modules to cohe...
MODIFY
Summary: This is a second IPI annotation for protein binding with CipA (Q06851), based on PMID:17360613 which demonstrated the dual binding mode of dockerin modules to cohesins. The study showed that the internal sequence duplication within the dockerin allows it to bind cohesins in two orientations. As with the previous annotation, protein binding is too general; GO:1990311 (type-I cohesin domain binding) is more appropriate.
Reason: Should be replaced with GO:1990311 (type-I cohesin domain binding). PMID:17360613 provides structural evidence for the dual binding mode of type-I dockerin to type-I cohesin, showing that both halves of the duplicated dockerin sequence can interact with cohesins. This is a well-characterized, specific protein-protein interaction.
Proposed replacements: type-I cohesin domain binding
Supporting Evidence:
PMID:17360613
The crystal structure of a C. thermocellum type I cohesin-dockerin complex showed that cohesin recognition was predominantly through helix-3 of the dockerin. The sequence duplication is reflected in near-perfect 2-fold structural symmetry, suggesting that both repeats could interact with cohesins by a common mechanism
GO:0043263 cellulosome
IDA
PMID:7717969
Evidence for a general role for non-catalytic thermostabiliz...
ACCEPT
Summary: This IDA annotation indicates XynY is located in the cellulosome, based on experimental evidence from PMID:7717969. The abstract states that Western blot analysis using antiserum raised against XYLY showed that the xylanase was located in the cellulosome. This is direct experimental evidence for cellulosome localization.
Reason: This is a well-supported IDA annotation. The original publication (PMID:7717969) provided direct experimental evidence via Western blot analysis demonstrating that XynY is a component of the cellulosome. The presence of a functional dockerin domain (residues 728-796) that binds CipA cohesins provides the mechanistic basis for this localization.
Supporting Evidence:
PMID:7717969
The C-terminal portion of XYLY comprised the 23-residue duplicated docking sequence found in all other C. thermocellum plant cell wall hydrolases that are constituents of the bacterium's multienzyme complex, termed the cellulosome
PMID:7717969
Western blot analysis using antiserum raised against XYLY showed that the xylanase was located in the cellulosome and did not appear to be extensively glycosylated.
GO:0030600 feruloyl esterase activity
IEA
UniProt:P51584
NEW
Summary: XynY contains a C-terminal esterase domain classified as CE1 (carbohydrate esterase family 1) by CAZy. The UniProt record shows this domain (Pfam:PF00756 Esterase) and the ESTHER database classifies it as A85-Feruloyl-Esterase (clotm-xyny). Feruloyl esterases cleave ester bonds between ferulic acid and arabinose side chains of arabinoxylans, facilitating complete hemicellulose degradation. This activity complements the endo-xylanase activity.
Reason: The CE1 feruloyl esterase domain is a well-characterized accessory domain in XynY. While not annotated in the current GOA, domain architecture analysis and CAZy/ESTHER database classification strongly support this function. Feruloyl esterases are important accessory enzymes that remove ferulic acid decorations from xylans, enhancing access for xylanases.
Supporting Evidence:
file:ACET2/P51584/P51584-deep-research-falcon.md
See deep research file for comprehensive analysis of XynY domain architecture
GO:1990311 type-I cohesin domain binding
IPI
PMID:14623971
Cellulosome assembly revealed by the crystal structure of th...
NEW
Summary: This is the more specific term that should replace the generic protein binding annotations for the XynY-CipA interaction. The type-I dockerin domain of XynY (residues 728-796) binds specifically to type-I cohesin domains of CipA.
Reason: Crystal structures (PMID:14623971, PMID:17360613) have determined the molecular details of the type-I cohesin-dockerin interaction. This specific term accurately describes the XynY dockerin binding to CipA cohesins and should replace the generic GO:0005515 annotations.
Supporting Evidence:
PMID:14623971
Here we report the structure of the cohesin-dockerin complex from Clostridium thermocellum at 2.2-A resolution.
PMID:17360613
The crystal structure of a C. thermocellum type I cohesin-dockerin complex showed that cohesin recognition was predominantly through helix-3 of the dockerin.
GO:0005576 extracellular region
IDA
PMID:7717969
Evidence for a general role for non-catalytic thermostabiliz...
NEW
Summary: XynY is secreted (contains signal peptide, residues 1-26) and functions extracellularly as part of the cellulosome complex. The cellulosome is an extracellular multienzyme complex attached to the cell surface via anchoring proteins.
Reason: XynY contains a signal peptide and is secreted to function in the extracellular cellulosome. While GO:0043263 (cellulosome) is already annotated, the broader GO:0005576 (extracellular region) could also be appropriate as the cellulosome operates in the extracellular space. The signal peptide and cellulosome localization support this.
Supporting Evidence:
PMID:7717969
The encoded enzyme contained a typical N-terminal 26-residue signal peptide, followed by a 164 amino acid sequence, designated domain A, that was not essential for catalytic activity.

Core Functions

XynY is an endo-xylanase that cleaves internal beta-1,4-xylosidic bonds in xylan, releasing xylo-oligosaccharides. This is the primary catalytic function of the enzyme.

Directly Involved In:
Cellular Locations:

XynY binds to the CipA scaffoldin protein via its C-terminal type I dockerin domain, enabling its incorporation into the cellulosome complex for coordinated plant cell wall degradation.

Cellular Locations:

References

Gene Ontology annotation through association of InterPro records with GO terms
Gene Ontology annotation based on Enzyme Commission mapping
Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
Combined Automated Annotation using Multiple IEA Methods
Evidence for a general role for non-catalytic thermostabilizing domains in xylanases from thermophilic bacteria.
  • Cloned and characterized xynY gene from C. thermocellum
    "A genomic library of Clostridium thermocellum DNA constructed in lambda ZAPII was screened for xylanase-expressing clones. Cross-hybridization experiments revealed a new xylanase gene isolated from the gene library, which was designated xyn Y."
  • Demonstrated endo-beta-1,4-xylanase activity on oat spelt, wheat and rye arabinoxylans
    "The encoded enzyme, xylanase Y (XYLY), displayed features characteristic of an endo-beta1,4-xylanase: the enzyme rapidly hydrolysed oat spelt, wheat and rye arabinoxylans and was active against methyl-umbelliferyl-beta-D-cellobioside, but did not hydrolyse any cellulosic substrates."
  • Identified signal peptide, catalytic domain, thermostable domain, and dockerin docking sequence
    "The encoded enzyme contained a typical N-terminal 26-residue signal peptide, followed by a 164 amino acid sequence, designated domain A, that was not essential for catalytic activity."
  • Western blot confirmed localization of XynY to the cellulosome
    "Western blot analysis using antiserum raised against XYLY showed that the xylanase was located in the cellulosome and did not appear to be extensively glycosylated."
  • Determined pH optimum (6.8) and temperature optimum (75 degrees C)
    "The pH and temperature optima of the enzyme were 6.8 and 75 degrees C respectively"
Cellulosome assembly revealed by the crystal structure of the cohesin-dockerin complex.
  • Determined 2.2 Angstrom crystal structure of C. thermocellum cohesin-dockerin complex
    "Here we report the structure of the cohesin-dockerin complex from Clostridium thermocellum at 2.2-A resolution."
  • Revealed structural basis for cellulosome assembly via cohesin-dockerin interaction
    "The data show that the beta-sheet cohesin domain interacts predominantly with one of the helices of the dockerin."
  • Showed dockerin contains two EF-hand calcium-binding sites
    "the classical 12-residue EF-hand coordination to two calcium ions is maintained."
  • Internal sequence duplication in dockerin creates near-perfect twofold symmetry
    "internal sequence duplication within the dockerin is manifested in near-perfect internal twofold symmetry"
Evidence for a dual binding mode of dockerin modules to cohesins.
  • Demonstrated dual binding mode of type-I dockerin to type-I cohesin
    "The dual binding mode is predicted to impart significant plasticity into the orientation of the catalytic subunits within this supramolecular assembly"
  • Both halves of the duplicated dockerin sequence can interact with cohesins
    "The sequence duplication is reflected in near-perfect 2-fold structural symmetry, suggesting that both repeats could interact with cohesins by a common mechanism"
  • Dual binding mode provides plasticity in catalytic subunit orientation within cellulosome
    "The dual binding mode is predicted to impart significant plasticity into the orientation of the catalytic subunits within this supramolecular assembly"
UniProt:P51584
UniProtKB entry for XynY (XYNY_ACETH)
file:ACET2/P51584/P51584-deep-research-falcon.md
Deep research on XynY function and annotations

Suggested Questions for Experts

Q: What is the catalytic activity and substrate specificity of the CE1 feruloyl esterase domain? Has this domain been biochemically characterized independently?

Q: Do the two CBM22 modules (CBM22-1 and CBM22-2) have distinct binding specificities or functions? What is their contribution to XynY activity on natural substrates?

Q: What is the relative contribution of XynY versus other cellulosomal xylanases (e.g., XynA, XynB, XynC, XynZ) to xylan degradation in vivo?

Suggested Experiments

Experiment: Biochemical characterization of the isolated CE1 domain to confirm feruloyl esterase activity and determine substrate specificity

Experiment: Binding studies with the individual CBM22 modules to determine polysaccharide binding specificity

Experiment: Gene knockout/knockdown studies to determine the contribution of XynY to xylan degradation in the context of other cellulosomal xylanases

📚 Additional Documentation

Deep Research Falcon

(P51584-deep-research-falcon.md)

provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2025-12-26T17:03:25.142207'
end_time: '2025-12-26T17:09:14.154231'
duration_seconds: 349.01
template_file: templates/gene_research_go_focused.md
template_variables:
organism: ACET2
gene_id: P51584
gene_symbol: xynY
uniprot_accession: P51584
protein_description: 'RecName: Full=Endo-1,4-beta-xylanase Y; Short=Xylanase Y;
EC=3.2.1.8; AltName: Full=1,4-beta-D-xylan xylanohydrolase Y; Short=XylY; Flags:
Precursor;'
gene_info: Name=xynY;
organism_full: Acetivibrio thermocellus (Hungateiclostridium thermocellum) (Clostridium
thermocellum).
protein_family: Belongs to the glycosyl hydrolase 10 (cellulase F) family.
protein_domains: AB_hydrolase_fold. (IPR029058); CenC_carb-bd. (IPR003305); Dockerin_1_rpt.
(IPR002105); Dockerin_dom. (IPR016134); Dockerin_dom_sf. (IPR036439)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 15


Question

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: P51584
  • Protein Description: RecName: Full=Endo-1,4-beta-xylanase Y; Short=Xylanase Y; EC=3.2.1.8; AltName: Full=1,4-beta-D-xylan xylanohydrolase Y; Short=XylY; Flags: Precursor;
  • Gene Information: Name=xynY;
  • Organism (full): Acetivibrio thermocellus (Hungateiclostridium thermocellum) (Clostridium thermocellum).
  • Protein Family: Belongs to the glycosyl hydrolase 10 (cellulase F) family.
  • Key Domains: AB_hydrolase_fold. (IPR029058); CenC_carb-bd. (IPR003305); Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134); Dockerin_dom_sf. (IPR036439)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "xynY" matches the protein description above
  2. Verify the organism is correct: Acetivibrio thermocellus (Hungateiclostridium thermocellum) (Clostridium thermocellum).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'xynY' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene xynY (gene ID: P51584, UniProt: P51584) in ACET2.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output

Question: You are an expert researcher providing comprehensive, well-cited information.

Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies

Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: P51584
  • Protein Description: RecName: Full=Endo-1,4-beta-xylanase Y; Short=Xylanase Y; EC=3.2.1.8; AltName: Full=1,4-beta-D-xylan xylanohydrolase Y; Short=XylY; Flags: Precursor;
  • Gene Information: Name=xynY;
  • Organism (full): Acetivibrio thermocellus (Hungateiclostridium thermocellum) (Clostridium thermocellum).
  • Protein Family: Belongs to the glycosyl hydrolase 10 (cellulase F) family.
  • Key Domains: AB_hydrolase_fold. (IPR029058); CenC_carb-bd. (IPR003305); Dockerin_1_rpt. (IPR002105); Dockerin_dom. (IPR016134); Dockerin_dom_sf. (IPR036439)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "xynY" matches the protein description above
  2. Verify the organism is correct: Acetivibrio thermocellus (Hungateiclostridium thermocellum) (Clostridium thermocellum).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'xynY' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene xynY (gene ID: P51584, UniProt: P51584) in ACET2.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Plan
- Verify identity and resolve symbol/organism ambiguity for xynY (UniProt P51584) from Acetivibrio thermocellus (syn. Clostridium thermocellum), including family/domain architecture and cellulosome context.
- Assemble primary evidence on function, catalytic mechanism and specificity, domain architecture, secretion and localization, and role in hemicellulose deconstruction.
- Incorporate recent developments (2023–2024) on cellulosome regulation and designer cellulosomes, and summarize applications and quantitative performance data.
- Synthesize into a comprehensive, well-cited research report with URLs and dates where available.

Executive verification of target identity
- Gene/protein symbol and organism: Multiple cellulosomal xylanases have been cloned from Clostridium (Acetivibrio) thermocellum, including XynY; Demain et al. explicitly list XynY among the cloned xylanases of C. thermocellum, confirming symbol–organism linkage (Mar 2005; https://doi.org/10.1128/MMBR.69.1.124-154.2005) (demain2005cellulaseclostridiaand pages 12-13). A structural crystallography report on Thermocellum Xyn10B (formerly XynY; UniProt P51584) documents the same enzyme under the Xyn10B/XynY nomenclature, substantiating the correspondence of P51584 with the cellulosomal xylanase Y from C. thermocellum (2008) (najmudin2008purificationcrystallizationand pages 1-2).
- Family/domains: XynY belongs to glycoside hydrolase family 10 (GH10) and is a modular, cellulosomal protein bearing a C‑terminal dockerin domain for incorporation onto the CipA scaffoldin; this is supported by structural/domain mapping and prior alignments that place XynY with GH10 catalytic domains and dockerins of cellulosomal enzymes (2004; https://doi.org/10.1128/JB.186.24.8347-8355.2004) (han2004isolationandexpression pages 5-8). The crystallographic analysis of Xyn10B/XynY directly assigns the internal catalytic domain to GH10 and maps the dockerin (residues ~730–791), with flanking CBM22 modules and a CE1 esterase (2008) (najmudin2008purificationcrystallizationand pages 1-2). These features match the UniProt summary (GH10, dockerin, carbohydrate-binding module) and the InterPro-style annotations provided by the user.
- Conclusion: The gene symbol xynY, organism C. thermocellum (A. thermocellus), and GH10/dockerin/CBM cellulosomal architecture are coherent across authoritative literature; no conflicting xynY usage in other organisms was identified in the gathered evidence. Proceeding with this target is appropriate (demain2005cellulaseclostridiaand pages 12-13, najmudin2008purificationcrystallizationand pages 1-2, han2004isolationandexpression pages 5-8).

1) Key concepts and definitions
- Cellulosome and dockerin–cohesin system: The cellulosome is a multienzyme extracellular complex assembled on scaffoldin(s) (e.g., CipA) via high-affinity cohesin–dockerin interactions; many C. thermocellum glycoside hydrolases, including xylanases, are secreted dockerin-bearing subunits targeted to this complex (reviewed Mar 2005; https://doi.org/10.1128/MMBR.69.1.124-154.2005) (demain2005cellulaseclostridiaand pages 12-13). The cohesin–dockerin interaction is species- and module‑specific and supports dynamic assembly and substrate targeting (2018; https://doi.org/10.1186/s13068-018-1220-y) (leis2018optimizingthecomposition pages 8-11).
- GH10 endo‑1,4‑β‑xylanases: Family‑10 xylanases are endo‑acting enzymes that hydrolyze β‑1,4 linkages in the xylan backbone via a retaining mechanism mediated by two catalytic glutamates; conserved catalytic residues have been mapped across GH10 members, including Thermocellum xylanases (Dec 2004; https://doi.org/10.1128/JB.186.24.8347-8355.2004) (han2004isolationandexpression pages 5-8). XynY/Xyn10B exemplifies this family and is a hemicellulase component of the Thermocellum cellulosome (2008) (najmudin2008purificationcrystallizationand pages 1-2).

2) Detailed functional annotation of xynY (UniProt P51584)
- Primary function and catalytic mechanism: XynY is an endo‑1,4‑β‑xylanase of GH10 that cleaves internal β‑1,4 glycosidic bonds within xylan to generate xylo‑oligosaccharides. Structural work on Thermocellum Xyn10B (formerly XynY; P51584) identifies GH10 catalytic residues (e.g., Glu337 proton donor and Glu480 nucleophile in the Xyn10B numbering) and confirms endo‑xylanase activity via complexation with xylohexaose (2008) (najmudin2008purificationcrystallizationand pages 1-2). Sequence alignments across GH10 place XynY within the canonical GH10 clade with the two conserved catalytic glutamates (Dec 2004; https://doi.org/10.1128/JB.186.24.8347-8355.2004) (han2004isolationandexpression pages 5-8).
- Substrate specificity and products: GH10 xylanases display broader substrate tolerance than GH11 and commonly produce xylobiose and xylotriose as major products from glucuronoxylan/arabinoxylan substrates; this product profile and hemicellulolytic role are consistent for Thermocellum cellulosomal xylanases (Dec 2004; https://doi.org/10.1128/JB.186.24.8347-8355.2004) (han2004isolationandexpression pages 5-8). Demain et al. note multiple xylanases in C. thermocellum, highlighting xylanase activity despite the organism’s limited growth on xylan/xylose per se, reflecting a role in complex biomass deconstruction rather than sole carbon sourcing (Mar 2005; https://doi.org/10.1128/MMBR.69.1.124-154.2005) (demain2005cellulaseclostridiaand pages 12-13).
- Domain architecture: XynY/Xyn10B is modular, comprising two N‑terminal CBM22 modules (CBM22‑1, CBM22‑2), a central GH10 catalytic domain, a C‑terminal CE1 esterase, and a C‑terminal dockerin that docks to CipA cohesins; the dockerin is mapped between residues ~730–791 (2008) (najmudin2008purificationcrystallizationand pages 1-2). Earlier comparative analyses of cellulosomal xylanases show XynY clustering within GH10 catalytic domains and within dockerin-domain alignments, reinforcing a canonical cellulosomal layout (Dec 2004; https://doi.org/10.1128/JB.186.24.8347-8355.2004) (han2004isolationandexpression pages 5-8).
- Subcellular localization: XynY is synthesized with a signal peptide for secretion and is localized extracellularly as a cellulosomal component through its dockerin–cohesin interaction with the CipA scaffoldin. Synthetic cellulosome assemblies that include Thermocellum Xyn10Y further confirm its dockerin-mediated incorporation into CipA-based complexes (Aug 2018; https://doi.org/10.1186/s13068-018-1220-y) (leis2018optimizingthecomposition pages 8-11). Demain et al. review the dockerin–cohesin architecture and provide biophysical detail on dockerin Ca2+-binding and cohesin recognition supporting extracellular complex assembly (Mar 2005; https://doi.org/10.1128/MMBR.69.1.124-154.2005) (demain2005cellulaseclostridiaand pages 12-13).
- Biological pathway context: Within the plant cell wall deconstruction pathway, XynY acts on hemicellulose (xylan) in concert with cellulases and accessory hemicellulases/esterases in the cellulosome to expose and liberate cellulose/hemicellulose sugars. Designer/synthetic cellulosomes demonstrate that inclusion of xylanase activities increases hydrolysis of lignocellulosic substrates, reflecting XynY’s role as part of a minimal functional set for biomass saccharification (Aug 2018; https://doi.org/10.1186/s13068-018-1220-y) (leis2018optimizingthecomposition pages 8-11).

3) Recent developments and latest research (with emphasis on 2023–2024)
- Regulation of cellulosome/xylanase expression via RsgI/SigI systems: Contemporary studies consolidate that C. thermocellum employs membrane-associated anti‑σ factors (RsgI-like) and alternative σI factors to sense extracellular polysaccharides and regulate cellulosomal gene expression; recent mechanistic work has uncovered autoproteolysis within RsgI periplasmic domains as an activation step for transmembrane signaling to SigI, thereby modulating cellulosomal enzymes including xylanases (Science Advances, Jul 2023; DOI available in source) and recent syntheses that overview SigI/RsgI regulation in Thermocellum (2025 synthesis referencing prior work). These advances place xylanases such as XynY under substrate-responsive transcriptional control (2025 overview citing regulatory systems) (takayesu2025expandingcellulosomicsinvestigationa pages 204-208, takayesu2025expandingcellulosomicsinvestigation pages 204-208).
- Designer/synthetic cellulosomes and integration of xylanases: Recent reviews highlight progress and persistent challenges in designer cellulosomes, noting that properly balanced inclusion of endoglucanases, exoglucanases, and hemicellulases (xylanases) is essential; bioRxiv 2024 synthesizes advances across bacteria and fungi and emphasizes designer complex stability and component optimization for industrial use (Nov 2024; https://doi.org/10.1101/2024.11.06.622210) (hsin2024lignocellulosedegradationin pages 32-34). Prior experimental optimization on softwood pulp explicitly included Thermocellum Xyn10Y in an engineered CipA8-based complex and found that xylanase class members were part of a minimal high-performance enzyme set (Aug 2018; https://doi.org/10.1186/s13068-018-1220-y) (leis2018optimizingthecomposition pages 8-11).

4) Current applications and real‑world implementations
- Synthetic bacterial cellulosomes for biomass hydrolysis: An engineered Thermocellum-derived complex containing 47 cellulosomal components exhibited activity on softwood pulp comparable to a commercial fungal enzyme cocktail, demonstrating real-world viability of bacterial multi-enzyme complexes; xylanase inclusion was among the classes required for optimal hydrolysis (Aug 2018; https://doi.org/10.1186/s13068-018-1220-y) (leis2018optimizingthecomposition pages 8-11). The 2024 review emphasizes opportunities to integrate bacterial and fungal cellulosomal principles and to optimize multi-enzyme assemblies for industrial biomass conversion (Nov 2024; https://doi.org/10.1101/2024.11.06.622210) (hsin2024lignocellulosedegradationin pages 32-34).

5) Expert opinions and authoritative syntheses
- Demain, Newcomb, and Wu provide a high‑impact, foundational synthesis of Thermocellum cellulosome biology, dockerin–cohesin biophysics, and the multiplicity of xylanases (including XynY), establishing the canonical framework still used in modern engineering efforts (Mar 2005; https://doi.org/10.1128/MMBR.69.1.124-154.2005) (demain2005cellulaseclostridiaand pages 12-13).
- Contemporary engineering work shows the expert consensus that inclusion of xylanase activities is necessary for robust plant biomass hydrolysis, and that modular, dockerin‑bearing xylanases such as XynY can be rationally incorporated into CipA‑scaffolded designer complexes (Aug 2018; https://doi.org/10.1186/s13068-018-1220-y) (leis2018optimizingthecomposition pages 8-11). The 2024 review consolidates views on stability and performance gaps in designer cellulosomes relevant to industrial deployment (Nov 2024; https://doi.org/10.1101/2024.11.06.622210) (hsin2024lignocellulosedegradationin pages 32-34).

6) Quantitative and structural data supporting annotation
- Modular map and catalytic residues: XynY/Xyn10B contains CBM22‑1, CBM22‑2, GH10 catalytic domain, CE1 module, and a C‑terminal dockerin; GH10 catalytic residues mapped as Glu337 (acid/base) and Glu480 (nucleophile). Crystals of the CBM22‑1–GH10(E337A) construct with xylohexaose diffracted beyond 2.0 Å (2008) (najmudin2008purificationcrystallizationand pages 1-2).
- Synthetic complex performance: A seven‑component synthetic complex optimized for softwood pulp, and a larger 47‑component designer cellulosome, displayed high thermostability (~60–65 °C optimum) and activity approaching commercial fungal preparations (Aug 2018; https://doi.org/10.1186/s13068-018-1220-y) (leis2018optimizingthecomposition pages 8-11).

Conclusion and knowledge gaps
The preponderance of evidence supports that xynY (UniProt P51584) from Acetivibrio (Clostridium) thermocellum encodes a modular GH10 endo‑1,4‑β‑xylanase integrated into the cellulosome via a C‑terminal dockerin. The enzyme harbors CBM22 modules and an accessory CE1 esterase, consistent with a role in binding/processing decorated xylans in plant cell walls. Structural, biochemical, and engineering studies corroborate its function and localization, while recent regulatory insights (RsgI/SigI signaling) reinforce substrate‑responsive control over cellulosomal xylanases. Ongoing designer‑cellulosome optimization indicates that xylanase activities such as XynY’s are integral to high‑performance lignocellulose hydrolysis in industrially relevant settings (najmudin2008purificationcrystallizationand pages 1-2, han2004isolationandexpression pages 5-8, demain2005cellulaseclostridiaand pages 12-13, leis2018optimizingthecomposition pages 8-11, hsin2024lignocellulosedegradationin pages 32-34, takayesu2025expandingcellulosomicsinvestigationa pages 204-208, takayesu2025expandingcellulosomicsinvestigation pages 204-208).

References

  1. (demain2005cellulaseclostridiaand pages 12-13): Arnold L. Demain, Michael Newcomb, and J. H. David Wu. Cellulase, clostridia, and ethanol. Microbiology and Molecular Biology Reviews, 69:124-154, Mar 2005. URL: https://doi.org/10.1128/mmbr.69.1.124-154.2005, doi:10.1128/mmbr.69.1.124-154.2005. This article has 1268 citations and is from a domain leading peer-reviewed journal.

  2. (najmudin2008purificationcrystallizationand pages 1-2): S Najmudin, BA Pinheiro, and MJ Romão. Purification, crystallization and crystallographic analysis of clostridium thermocellum endo-1, 4-β-d-xylanase 10b in complex with xylohexaose. Unknown journal, 2008.

  3. (han2004isolationandexpression pages 5-8): Sung Ok Han, Hideaki Yukawa, Masayuki Inui, and Roy H. Doi. Isolation and expression of the xynb gene and its product, xynb, a consistent component of the clostridium cellulovorans cellulosome. Journal of Bacteriology, 186:8347-8355, Dec 2004. URL: https://doi.org/10.1128/jb.186.24.8347-8355.2004, doi:10.1128/jb.186.24.8347-8355.2004. This article has 31 citations and is from a peer-reviewed journal.

  4. (leis2018optimizingthecomposition pages 8-11): Benedikt Leis, Claudia Held, Björn Andreeßen, Wolfgang Liebl, Sigrid Graubner, Louis-Philipp Schulte, Wolfgang H. Schwarz, and Vladimir V. Zverlov. Optimizing the composition of a synthetic cellulosome complex for the hydrolysis of softwood pulp: identification of the enzymatic core functions and biochemical complex characterization. Biotechnology for Biofuels, Aug 2018. URL: https://doi.org/10.1186/s13068-018-1220-y, doi:10.1186/s13068-018-1220-y. This article has 34 citations.

  5. (takayesu2025expandingcellulosomicsinvestigationa pages 204-208): AT Takayesu. Expanding cellulosomics: investigation of novel cellulolytic species and transcriptional regulation mechanisms in thermophilic anaerobes. Unknown journal, 2025.

  6. (takayesu2025expandingcellulosomicsinvestigation pages 204-208): AT Takayesu. Expanding cellulosomics: investigation of novel cellulolytic species and transcriptional regulation mechanisms in thermophilic anaerobes. Unknown journal, 2025.

  7. (hsin2024lignocellulosedegradationin pages 32-34): Kuan-Ting Hsin, HueyTyng Lee, Ying-Chung Jimmy Lin, and Pao-Yang Chen. Lignocellulose degradation in bacteria and fungi for biomass conversion. bioRxiv, Nov 2024. URL: https://doi.org/10.1101/2024.11.06.622210, doi:10.1101/2024.11.06.622210. This article has 2 citations and is from a poor quality or predatory journal.

Citations

  1. demain2005cellulaseclostridiaand pages 12-13
  2. najmudin2008purificationcrystallizationand pages 1-2
  3. han2004isolationandexpression pages 5-8
  4. leis2018optimizingthecomposition pages 8-11
  5. hsin2024lignocellulosedegradationin pages 32-34
  6. takayesu2025expandingcellulosomicsinvestigationa pages 204-208
  7. takayesu2025expandingcellulosomicsinvestigation pages 204-208
  8. https://doi.org/10.1128/MMBR.69.1.124-154.2005
  9. https://doi.org/10.1128/JB.186.24.8347-8355.2004
  10. https://doi.org/10.1186/s13068-018-1220-y
  11. https://doi.org/10.1101/2024.11.06.622210
  12. https://doi.org/10.1128/mmbr.69.1.124-154.2005,
  13. https://doi.org/10.1128/jb.186.24.8347-8355.2004,
  14. https://doi.org/10.1186/s13068-018-1220-y,
  15. https://doi.org/10.1101/2024.11.06.622210,

📄 View Raw YAML

id: P51584
gene_symbol: xynY
product_type: PROTEIN
status: DRAFT
taxon:
  id: NCBITaxon:203119
  label: Acetivibrio thermocellus
description: >-
  XynY (Endo-1,4-beta-xylanase Y) is a modular cellulosomal hemicellulase from Acetivibrio thermocellum
  (formerly Clostridium thermocellum). The enzyme belongs to glycoside hydrolase family 10 (GH10) and
  catalyzes the endohydrolysis of 1,4-beta-D-xylosidic linkages in xylans, producing xylo-oligosaccharides.
  XynY has a complex multi-domain architecture comprising two N-terminal CBM22 carbohydrate-binding modules,
  a central GH10 catalytic domain, a CE1 feruloyl esterase domain, and a C-terminal type-I dockerin domain
  that mediates integration into the cellulosome via interaction with cohesin domains of the CipA scaffoldin.
  The enzyme functions extracellularly as part of the cellulosome complex, contributing to the degradation
  of plant cell wall hemicellulose. XynY displays optimal activity at pH 6.8 and 75 degrees Celsius,
  consistent with the thermophilic lifestyle of the organism.
existing_annotations:
- term:
    id: GO:0000272
    label: polysaccharide catabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >-
      This annotation is derived from InterPro domain mappings (dockerin domains IPR002105, IPR016134, IPR036439
      and polysaccharide degradation keyword KW-0624). While polysaccharide catabolic process is accurate,
      XynY specifically functions in xylan catabolism, which is a more specific child term. The annotation
      is not incorrect but is less informative than the more specific GO:0045493 (xylan catabolic process)
      which is also annotated.
    action: ACCEPT
    reason: >-
      The annotation is accurate as xylan is a polysaccharide, and XynY contributes to plant cell wall
      polysaccharide degradation. However, this is a parent term of the more specific xylan catabolic
      process (GO:0045493). Both can be retained as the broader term captures the enzyme's role in
      cellulosome-mediated polysaccharide degradation.
    supported_by:
      - reference_id: file:ACET2/P51584/P51584-deep-research-falcon.md
        supporting_text: See deep research file for comprehensive analysis of XynY function in polysaccharide degradation
- term:
    id: GO:0004553
    label: hydrolase activity, hydrolyzing O-glycosyl compounds
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >-
      This annotation derives from InterPro domain matches (IPR001000 GH10 domain, IPR002105 dockerin,
      IPR044846 GH10 family). The GH10 catalytic domain of XynY is indeed a glycoside hydrolase that
      hydrolyzes O-glycosyl compounds. However, this is a very general parent term; the more specific
      endo-1,4-beta-xylanase activity (GO:0031176) is also annotated and is more informative.
    action: ACCEPT
    reason: >-
      This is a correct parent term for endo-1,4-beta-xylanase activity. While not as informative as
      the more specific term GO:0031176, retaining parent terms in IEA annotations is acceptable as
      they capture the broader enzymatic class.
- term:
    id: GO:0005975
    label: carbohydrate metabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >-
      This annotation derives from InterPro domain matches (IPR001000 GH10 domain, IPR044846 GH10 family).
      Carbohydrate metabolic process is a very broad parent term. XynY specifically participates in
      xylan catabolic process (GO:0045493), which is the appropriate specific biological process term.
    action: ACCEPT
    reason: >-
      This is a correct but very general parent term. The annotation accurately captures that XynY
      is involved in carbohydrate metabolism (specifically catabolism of xylan). IEA mappings to
      broad parent terms are acceptable alongside more specific annotations.
- term:
    id: GO:0016787
    label: hydrolase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: >-
      This annotation derives from UniProtKB keyword mapping (KW-0378 Hydrolase). Hydrolase activity
      is the most general molecular function term for enzymes catalyzing hydrolysis reactions. XynY
      has much more specific molecular functions including endo-1,4-beta-xylanase activity (GO:0031176).
    action: ACCEPT
    reason: >-
      This is a correct but very general parent term. While the more specific GO:0031176
      (endo-1,4-beta-xylanase activity) is more informative, the broad hydrolase activity
      term from keyword mapping is acceptable for IEA annotations.
- term:
    id: GO:0016798
    label: hydrolase activity, acting on glycosyl bonds
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >-
      This annotation derives from InterPro CBM-CenC domain (IPR003305) and glycosidase keyword
      (KW-0326). The term is accurate as XynY hydrolyzes glycosyl bonds in xylan. This is an
      intermediate term between the general GO:0016787 (hydrolase activity) and the specific
      GO:0031176 (endo-1,4-beta-xylanase activity).
    action: ACCEPT
    reason: >-
      This is a correct intermediate-level molecular function term. XynY's GH10 domain hydrolyzes
      beta-1,4-xylosidic bonds, which are glycosyl bonds. The annotation is accurate.
- term:
    id: GO:0031176
    label: endo-1,4-beta-xylanase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000003
  review:
    summary: >-
      This annotation derives from EC number mapping (EC:3.2.1.8). The enzyme is classified as
      EC 3.2.1.8 (endo-1,4-beta-xylanase) in UniProt based on experimental characterization.
      PMID:7717969 demonstrated that recombinant XynY rapidly hydrolyzed oat spelt, wheat and
      rye arabinoxylans and displayed features characteristic of an endo-beta1,4-xylanase.
      This is the core molecular function of the enzyme.
    action: ACCEPT
    reason: >-
      This is the correct and most informative molecular function term for XynY. The enzyme's
      endo-1,4-beta-xylanase activity is well-characterized biochemically. The GH10 catalytic
      domain contains the conserved catalytic glutamate residues (Glu337 proton donor, Glu460
      nucleophile per UniProt feature annotation).
    supported_by:
      - reference_id: PMID:7717969
        supporting_text: >-
          The encoded enzyme, xylanase Y (XYLY), displayed features characteristic of an endo-beta1,4-xylanase: the enzyme rapidly hydrolysed oat spelt, wheat and rye arabinoxylans and was active against methyl-umbelliferyl-beta-D-cellobioside, but did not hydrolyse any cellulosic substrates.
      - reference_id: file:ACET2/P51584/P51584-deep-research-falcon.md
        supporting_text: See deep research file for comprehensive analysis
- term:
    id: GO:0045493
    label: xylan catabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: >-
      This annotation derives from UniProtKB keyword mapping (KW-0858 Xylan degradation). XynY
      catalyzes the breakdown of xylan, making this the appropriate specific biological process
      term. The enzyme is part of the cellulosome complex that degrades plant cell wall
      hemicellulose including xylan.
    action: ACCEPT
    reason: >-
      This is the correct specific biological process term for XynY. The enzyme's primary
      function is xylan catabolism, cleaving beta-1,4 glycosidic bonds in the xylan backbone
      to produce xylo-oligosaccharides. PMID:7717969 demonstrated xylanase activity on various
      xylan substrates.
    supported_by:
      - reference_id: PMID:7717969
        supporting_text: >-
          the enzyme rapidly hydrolysed oat spelt, wheat and rye arabinoxylans and was active against methyl-umbelliferyl-beta-D-cellobioside, but did not hydrolyse any cellulosic substrates.
- term:
    id: GO:0005515
    label: protein binding
  evidence_type: IPI
  original_reference_id: PMID:14623971
  review:
    summary: >-
      This IPI annotation indicates XynY binds to CipA (Q06851), the scaffoldin protein of the
      cellulosome. The interaction is mediated by the type-I dockerin domain of XynY binding
      to cohesin domains of CipA. PMID:14623971 reports the crystal structure of the cohesin-dockerin
      complex from C. thermocellum at 2.2 Angstrom resolution, revealing the structural basis
      of cellulosome assembly. However, protein binding (GO:0005515) is too general and
      uninformative. A more specific term exists: GO:1990311 (type-I cohesin domain binding).
    action: MODIFY
    reason: >-
      The protein binding annotation should be replaced with the more specific term
      GO:1990311 (type-I cohesin domain binding). XynY contains a type-I dockerin domain
      (residues 728-796 per UniProt) that specifically binds type-I cohesin domains of CipA.
      PMID:14623971 determined the crystal structure of the C. thermocellum cohesin-dockerin
      complex, directly demonstrating this specific interaction mechanism.
    proposed_replacement_terms:
      - id: GO:1990311
        label: type-I cohesin domain binding
    supported_by:
      - reference_id: PMID:14623971
        supporting_text: >-
          Here we report the structure of the cohesin-dockerin complex from Clostridium thermocellum at 2.2-A resolution. The data show that the beta-sheet cohesin domain interacts predominantly with one of the helices of the dockerin.
- term:
    id: GO:0005515
    label: protein binding
  evidence_type: IPI
  original_reference_id: PMID:17360613
  review:
    summary: >-
      This is a second IPI annotation for protein binding with CipA (Q06851), based on
      PMID:17360613 which demonstrated the dual binding mode of dockerin modules to cohesins.
      The study showed that the internal sequence duplication within the dockerin allows
      it to bind cohesins in two orientations. As with the previous annotation, protein
      binding is too general; GO:1990311 (type-I cohesin domain binding) is more appropriate.
    action: MODIFY
    reason: >-
      Should be replaced with GO:1990311 (type-I cohesin domain binding). PMID:17360613
      provides structural evidence for the dual binding mode of type-I dockerin to type-I
      cohesin, showing that both halves of the duplicated dockerin sequence can interact
      with cohesins. This is a well-characterized, specific protein-protein interaction.
    proposed_replacement_terms:
      - id: GO:1990311
        label: type-I cohesin domain binding
    supported_by:
      - reference_id: PMID:17360613
        supporting_text: >-
          The crystal structure of a C. thermocellum type I cohesin-dockerin complex showed that cohesin recognition was predominantly through helix-3 of the dockerin. The sequence duplication is reflected in near-perfect 2-fold structural symmetry, suggesting that both repeats could interact with cohesins by a common mechanism
- term:
    id: GO:0043263
    label: cellulosome
  evidence_type: IDA
  original_reference_id: PMID:7717969
  review:
    summary: >-
      This IDA annotation indicates XynY is located in the cellulosome, based on experimental
      evidence from PMID:7717969. The abstract states that Western blot analysis using antiserum
      raised against XYLY showed that the xylanase was located in the cellulosome. This is
      direct experimental evidence for cellulosome localization.
    action: ACCEPT
    reason: >-
      This is a well-supported IDA annotation. The original publication (PMID:7717969)
      provided direct experimental evidence via Western blot analysis demonstrating that
      XynY is a component of the cellulosome. The presence of a functional dockerin domain
      (residues 728-796) that binds CipA cohesins provides the mechanistic basis for this
      localization.
    supported_by:
      - reference_id: PMID:7717969
        supporting_text: >-
          The C-terminal portion of XYLY comprised the 23-residue duplicated docking sequence found in all other C. thermocellum plant cell wall hydrolases that are constituents of the bacterium's multienzyme complex, termed the cellulosome
      - reference_id: PMID:7717969
        supporting_text: >-
          Western blot analysis using antiserum raised against XYLY showed that the xylanase was located in the cellulosome and did not appear to be extensively glycosylated.
# Suggested new annotations based on domain analysis and literature evidence
- term:
    id: GO:0030600
    label: feruloyl esterase activity
  evidence_type: IEA
  original_reference_id: UniProt:P51584
  review:
    summary: >-
      XynY contains a C-terminal esterase domain classified as CE1 (carbohydrate esterase family 1)
      by CAZy. The UniProt record shows this domain (Pfam:PF00756 Esterase) and the ESTHER database
      classifies it as A85-Feruloyl-Esterase (clotm-xyny). Feruloyl esterases cleave ester bonds
      between ferulic acid and arabinose side chains of arabinoxylans, facilitating complete
      hemicellulose degradation. This activity complements the endo-xylanase activity.
    action: NEW
    reason: >-
      The CE1 feruloyl esterase domain is a well-characterized accessory domain in XynY.
      While not annotated in the current GOA, domain architecture analysis and CAZy/ESTHER
      database classification strongly support this function. Feruloyl esterases are important
      accessory enzymes that remove ferulic acid decorations from xylans, enhancing access
      for xylanases.
    additional_reference_ids:
      - CAZy:CE1
      - ESTHER:clotm-xyny
    supported_by:
      - reference_id: file:ACET2/P51584/P51584-deep-research-falcon.md
        supporting_text: See deep research file for comprehensive analysis of XynY domain architecture
- term:
    id: GO:1990311
    label: type-I cohesin domain binding
  evidence_type: IPI
  original_reference_id: PMID:14623971
  review:
    summary: >-
      This is the more specific term that should replace the generic protein binding
      annotations for the XynY-CipA interaction. The type-I dockerin domain of XynY
      (residues 728-796) binds specifically to type-I cohesin domains of CipA.
    action: NEW
    reason: >-
      Crystal structures (PMID:14623971, PMID:17360613) have determined the molecular
      details of the type-I cohesin-dockerin interaction. This specific term accurately
      describes the XynY dockerin binding to CipA cohesins and should replace the
      generic GO:0005515 annotations.
    supported_by:
      - reference_id: PMID:14623971
        supporting_text: >-
          Here we report the structure of the cohesin-dockerin complex from Clostridium thermocellum at 2.2-A resolution.
      - reference_id: PMID:17360613
        supporting_text: >-
          The crystal structure of a C. thermocellum type I cohesin-dockerin complex showed that cohesin recognition was predominantly through helix-3 of the dockerin.
- term:
    id: GO:0005576
    label: extracellular region
  evidence_type: IDA
  original_reference_id: PMID:7717969
  review:
    summary: >-
      XynY is secreted (contains signal peptide, residues 1-26) and functions extracellularly
      as part of the cellulosome complex. The cellulosome is an extracellular multienzyme
      complex attached to the cell surface via anchoring proteins.
    action: NEW
    reason: >-
      XynY contains a signal peptide and is secreted to function in the extracellular
      cellulosome. While GO:0043263 (cellulosome) is already annotated, the broader
      GO:0005576 (extracellular region) could also be appropriate as the cellulosome
      operates in the extracellular space. The signal peptide and cellulosome localization
      support this.
    supported_by:
      - reference_id: PMID:7717969
        supporting_text: >-
          The encoded enzyme contained a typical N-terminal 26-residue signal peptide, followed by a 164 amino acid sequence, designated domain A, that was not essential for catalytic activity.
references:
- id: GO_REF:0000002
  title: Gene Ontology annotation through association of InterPro records with GO terms
  findings: []
- id: GO_REF:0000003
  title: Gene Ontology annotation based on Enzyme Commission mapping
  findings: []
- id: GO_REF:0000043
  title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
  findings: []
- id: GO_REF:0000120
  title: Combined Automated Annotation using Multiple IEA Methods
  findings: []
- id: PMID:7717969
  title: Evidence for a general role for non-catalytic thermostabilizing domains in xylanases from thermophilic bacteria.
  findings:
    - statement: Cloned and characterized xynY gene from C. thermocellum
      supporting_text: >-
        A genomic library of Clostridium thermocellum DNA constructed in lambda ZAPII was screened for xylanase-expressing clones. Cross-hybridization experiments revealed a new xylanase gene isolated from the gene library, which was designated xyn Y.
    - statement: Demonstrated endo-beta-1,4-xylanase activity on oat spelt, wheat and rye arabinoxylans
      supporting_text: >-
        The encoded enzyme, xylanase Y (XYLY), displayed features characteristic of an endo-beta1,4-xylanase: the enzyme rapidly hydrolysed oat spelt, wheat and rye arabinoxylans and was active against methyl-umbelliferyl-beta-D-cellobioside, but did not hydrolyse any cellulosic substrates.
    - statement: Identified signal peptide, catalytic domain, thermostable domain, and dockerin docking sequence
      supporting_text: >-
        The encoded enzyme contained a typical N-terminal 26-residue signal peptide, followed by a 164 amino acid sequence, designated domain A, that was not essential for catalytic activity.
    - statement: Western blot confirmed localization of XynY to the cellulosome
      supporting_text: >-
        Western blot analysis using antiserum raised against XYLY showed that the xylanase was located in the cellulosome and did not appear to be extensively glycosylated.
    - statement: Determined pH optimum (6.8) and temperature optimum (75 degrees C)
      supporting_text: >-
        The pH and temperature optima of the enzyme were 6.8 and 75 degrees C respectively
- id: PMID:14623971
  title: Cellulosome assembly revealed by the crystal structure of the cohesin-dockerin complex.
  findings:
    - statement: Determined 2.2 Angstrom crystal structure of C. thermocellum cohesin-dockerin complex
      supporting_text: >-
        Here we report the structure of the cohesin-dockerin complex from Clostridium thermocellum at 2.2-A resolution.
    - statement: Revealed structural basis for cellulosome assembly via cohesin-dockerin interaction
      supporting_text: >-
        The data show that the beta-sheet cohesin domain interacts predominantly with one of the helices of the dockerin.
    - statement: Showed dockerin contains two EF-hand calcium-binding sites
      supporting_text: >-
        the classical 12-residue EF-hand coordination to two calcium ions is maintained.
    - statement: Internal sequence duplication in dockerin creates near-perfect twofold symmetry
      supporting_text: >-
        internal sequence duplication within the dockerin is manifested in near-perfect internal twofold symmetry
- id: PMID:17360613
  title: Evidence for a dual binding mode of dockerin modules to cohesins.
  findings:
    - statement: Demonstrated dual binding mode of type-I dockerin to type-I cohesin
      supporting_text: >-
        The dual binding mode is predicted to impart significant plasticity into the orientation of the catalytic subunits within this supramolecular assembly
    - statement: Both halves of the duplicated dockerin sequence can interact with cohesins
      supporting_text: >-
        The sequence duplication is reflected in near-perfect 2-fold structural symmetry, suggesting that both repeats could interact with cohesins by a common mechanism
    - statement: Dual binding mode provides plasticity in catalytic subunit orientation within cellulosome
      supporting_text: >-
        The dual binding mode is predicted to impart significant plasticity into the orientation of the catalytic subunits within this supramolecular assembly
- id: UniProt:P51584
  title: UniProtKB entry for XynY (XYNY_ACETH)
  findings: []
- id: file:ACET2/P51584/P51584-deep-research-falcon.md
  title: Deep research on XynY function and annotations
  findings: []
core_functions:
- description: >-
    XynY is an endo-xylanase that cleaves internal beta-1,4-xylosidic bonds in xylan,
    releasing xylo-oligosaccharides. This is the primary catalytic function of the enzyme.
  molecular_function:
    id: GO:0031176
    label: endo-1,4-beta-xylanase activity
  directly_involved_in:
    - id: GO:0045493
      label: xylan catabolic process
  locations:
    - id: GO:0043263
      label: cellulosome
- description: >-
    XynY binds to the CipA scaffoldin protein via its C-terminal type I dockerin domain,
    enabling its incorporation into the cellulosome complex for coordinated plant cell wall
    degradation.
  molecular_function:
    id: GO:1990311
    label: type-I cohesin domain binding
  locations:
    - id: GO:0043263
      label: cellulosome
proposed_new_terms: []
suggested_questions:
  - question: What is the catalytic activity and substrate specificity of the CE1 feruloyl esterase domain? Has this domain been biochemically characterized independently?
  - question: Do the two CBM22 modules (CBM22-1 and CBM22-2) have distinct binding specificities or functions? What is their contribution to XynY activity on natural substrates?
  - question: What is the relative contribution of XynY versus other cellulosomal xylanases (e.g., XynA, XynB, XynC, XynZ) to xylan degradation in vivo?
suggested_experiments:
  - description: >-
      Biochemical characterization of the isolated CE1 domain to confirm feruloyl esterase
      activity and determine substrate specificity
  - description: >-
      Binding studies with the individual CBM22 modules to determine polysaccharide binding
      specificity
  - description: >-
      Gene knockout/knockdown studies to determine the contribution of XynY to xylan
      degradation in the context of other cellulosomal xylanases