cipA

UniProt ID: Q06851
Organism: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372)
Review Status: DRAFT
Aliases:
CipA Cellulose-integrating protein A Cellulosomal glycoprotein S1/SL Cthe_3077
📝 Provide Detailed Feedback

Gene Description

CipA is the primary cellulosomal scaffoldin protein in Acetivibrio thermocellus (formerly Clostridium thermocellum). It is a large (1853 aa) non-catalytic structural protein that serves as the central organizing scaffold of the cellulosome - a supramolecular multi-enzyme complex that efficiently degrades plant cell wall polysaccharides. CipA contains nine type I cohesin domains that bind dockerin-bearing catalytic enzymes (cellulases, hemicellulases), a central CBM3a carbohydrate-binding module that targets the complex to crystalline cellulose, and a C-terminal type II dockerin (with associated X module) that anchors the cellulosome to cell-surface proteins bearing type II cohesins (SdbA, OlpB, Orf2p). CipA does NOT possess any catalytic/enzymatic activity - its function is purely structural and organizational, bringing multiple enzymes into proximity for synergistic cellulose degradation.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0000272 polysaccharide catabolic process
IEA
GO_REF:0000120
KEEP AS NON CORE
Summary: CipA is involved in polysaccharide catabolism but only indirectly through its scaffolding role. CipA itself has no catalytic activity; it organizes dockerin-bearing enzymes that perform the actual hydrolysis. The annotation captures participation in the process but overstates direct involvement.
Reason: While CipA is essential for efficient polysaccharide degradation by the cellulosome, it is not directly involved in the catabolic process itself. CipA is a non-catalytic structural organizer that assembles multiple cellulases and hemicellulases into a multienzyme complex. The scaffoldin promotes enzyme synergy but does not catalyze any reactions. This annotation is acceptable as a secondary/non-core function reflecting its organizational role in the process.
Supporting Evidence:
file:ACET2/cipA/cipA-deep-research-falcon.md
CipA is a non-catalytic structural organizer that assembles multiple cellulases and hemicellulases into a multienzyme complex through high-affinity type I cohesin-dockerin interactions
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds
IEA
GO_REF:0000002
REMOVE
Summary: This annotation is INCORRECT. CipA has no hydrolase activity whatsoever. It is a non-catalytic scaffoldin protein that organizes enzymes but does not itself possess any enzymatic function. This appears to be an erroneous transfer from the dockerin domain (IPR002105), which is also present in catalytic cellulosomal enzymes.
Reason: CipA is explicitly described in the literature as a non-catalytic scaffolding glycoprotein (UniProt CC-FUNCTION). The UniProt function annotation states that CipA acts as a scaffolding protein in the cellulosome and promotes binding of cellulose to the catalytic domains of the cellulolytic enzymes - note it promotes binding TO catalytic domains, it does not have catalytic activity itself. The deep research confirms CipA is a non-catalytic structural organizer (cipA-deep-research-falcon.md). The InterPro domain IPR002105 (Dockerin_1_rpt) is present in both scaffoldins and catalytic enzymes; the hydrolase activity annotation was incorrectly transferred because dockerins are commonly found in hydrolases, but the scaffoldin CipA only uses its dockerin (type II) for anchoring to cell-surface proteins, not for catalysis. This is a clear over-annotation error.
Supporting Evidence:
file:ACET2/cipA/cipA-deep-research-falcon.md
CipA is a non-catalytic structural organizer that assembles multiple cellulases and hemicellulases into a multienzyme complex
GO:0005576 extracellular region
IEA
GO_REF:0000044
ACCEPT
Summary: This annotation is correct. CipA is a secreted protein that functions in the extracellular space, either cell-surface anchored or as part of cell-free cellulosomes.
Reason: UniProt explicitly states that CipA is secreted and remains at the cell surface. The deep research confirms that CipA-enabled cellulosomes are predominantly cell surface-associated through anchoring scaffoldins bearing SLH domains. C. thermocellum also produces cell-free (diffusible) cellulosomes that operate at a distance from the bacterial surface. The protein has a signal peptide (residues 1-28) and functions extracellularly.
Supporting Evidence:
file:ACET2/cipA/cipA-deep-research-falcon.md
CipA-enabled cellulosomes are predominantly cell surface-associated through anchoring scaffoldins bearing SLH domains. C. thermocellum also produces cell-free (diffusible) cellulosomes that operate at a distance from the bacterial surface
GO:0005975 carbohydrate metabolic process
IEA
GO_REF:0000002
MODIFY
Summary: This annotation is too general and misrepresents CipA's role. CipA does not metabolize carbohydrates; it organizes enzymes that do. This is an overly broad annotation derived from the CBM3 domain.
Reason: CipA's CBM3a domain binds cellulose but does not metabolize it. The protein's role is structural - organizing the cellulosome for efficient carbohydrate degradation. A more appropriate annotation would be GO:0044575 (cellulosome assembly) which captures the actual biological process CipA participates in. The annotation to carbohydrate metabolic process incorrectly implies direct metabolic activity.
Proposed replacements: cellulosome assembly
Supporting Evidence:
file:ACET2/cipA/cipA-deep-research-falcon.md
CBM3a function: targets crystalline cellulose to increase local enzyme concentration and synergize hydrolysis; CBM3a is the prevalent cellulosomal CBM in C. thermocellum
GO:0030245 cellulose catabolic process
IEA
GO_REF:0000043
KEEP AS NON CORE
Summary: Similar to GO:0000272, this annotation captures CipA's indirect involvement in cellulose degradation through its scaffolding function but overstates direct participation in the catabolic process.
Reason: CipA is essential for efficient cellulose catabolism via the cellulosome, but it does not itself degrade cellulose. Its CBM3a module targets the complex to cellulose substrate, and its cohesins organize the catalytic enzymes that perform hydrolysis. The annotation reflects involvement in the process but should be considered non-core since CipA's primary function is structural organization (cellulosome assembly) rather than direct catalysis.
Supporting Evidence:
file:ACET2/cipA/cipA-deep-research-falcon.md
CipA is essential for high-efficiency cellulose deconstruction, functioning at the interface of enzyme synergy, substrate targeting, and cell-surface attachment
GO:0030246 carbohydrate binding
IEA
GO_REF:0000002
ACCEPT
Summary: This annotation is correct but could be made more specific. CipA binds carbohydrates via its CBM3a domain, specifically binding crystalline cellulose.
Reason: CipA contains a well-characterized CBM3a (carbohydrate-binding module family 3a) domain at positions 365-523. The deep research confirms that CBM3a targets crystalline cellulose to increase local enzyme concentration and synergize hydrolysis. CBM3a is the prevalent cellulosomal CBM in C. thermocellum (cipA-deep-research-falcon.md). The crystal structure of this domain has been solved (PDB: 1NBC at 1.75 A resolution). This is a core molecular function.
Supporting Evidence:
file:ACET2/cipA/cipA-deep-research-falcon.md
CBM3a function: targets crystalline cellulose to increase local enzyme concentration and synergize hydrolysis
GO:0030248 cellulose binding
IEA
GO_REF:0000002
ACCEPT
Summary: This annotation is correct and represents a core molecular function of CipA. The CBM3a domain specifically binds crystalline cellulose.
Reason: CipA's CBM3a domain (residues 365-523) specifically binds crystalline cellulose, targeting the cellulosome to its substrate. This is experimentally verified through structural studies - the crystal structure of the CBM3a domain was solved at 1.75 A resolution (PDB: 1NBC; Tormo et al., EMBO J 1996). The deep research states that CBM3a function targets crystalline cellulose to increase local enzyme concentration and synergize hydrolysis (cipA-deep-research-falcon.md). This is a core function of CipA - without cellulose binding, the cellulosome cannot target its substrate.
Supporting Evidence:
file:ACET2/cipA/cipA-deep-research-falcon.md
CBM3a function: targets crystalline cellulose to increase local enzyme concentration and synergize hydrolysis
GO:0071555 cell wall organization
IEA
GO_REF:0000043
REMOVE
Summary: This annotation is misleading. CipA is involved in PLANT cell wall DEGRADATION (as a target substrate), not in bacterial cell wall organization. The UniProt keyword mapping appears to have conflated cell wall degradation with cell wall organization.
Reason: CipA functions in the degradation of plant cell walls (cellulose, hemicellulose), not in organizing the bacterial cell wall. While CipA is anchored at the cell surface, its function is to organize the cellulosome for degrading extracellular plant material, not to organize the bacterium's own cell wall. The UniProt keywords include Cell wall biogenesis/degradation which likely triggered this annotation, but the context is substrate degradation, not endogenous cell wall organization. This is a semantic error in the automated annotation pipeline.
Supporting Evidence:
file:ACET2/cipA/cipA-deep-research-falcon.md
CipA's modular architecture orchestrates assembly of numerous dockerin-bearing CAZymes and anchors the complex to the cell surface through SLH-bearing secondary scaffoldins
GO:0005515 protein binding
IPI
PMID:14623971
Cellulosome assembly revealed by the crystal structure of th...
MODIFY
Summary: This annotation is experimentally supported but should be made more specific. PMID:14623971 reports the crystal structure of the cohesin-dockerin complex, demonstrating that CipA's cohesin domains bind dockerin domains on enzymes.
Reason: The IPI annotation from PMID:14623971 is based on the crystal structure of the type I cohesin-dockerin complex from C. thermocellum (WITH: UniProtKB:P51584, XynY xylanase). The study demonstrates that the beta-sheet cohesin domain interacts predominantly with one of the helices of the dockerin (PMID:14623971). However, protein binding is too vague. CipA specifically binds type I dockerin domains through its nine type I cohesin modules. The annotation should be changed to GO:1990308 (type-I dockerin domain binding) which precisely describes this molecular function.
Proposed replacements: type-I dockerin domain binding
Supporting Evidence:
PMID:14623971
The data show that the beta-sheet cohesin domain interacts predominantly with one of the helices of the dockerin.
GO:0005515 protein binding
IPI
PMID:16384918
Mechanism of bacterial cell-surface attachment revealed by t...
MODIFY
Summary: This annotation documents CipA's C-terminal type II dockerin binding to type II cohesins on cell-surface anchoring proteins. The interaction demonstrated is between CipA's X-DocII module and a type II cohesin from Cthe_1307.
Reason: PMID:16384918 reports the structure of the type II cohesin-dockerin complex involving CipA's C-terminal X-dockerin module (WITH: UniProtKB:A3DF10, Cthe_1307). The study describes an ultra-high-affinity complex between type II Doc together with its neighboring X module from the cellulosome scaffold of Clostridium thermocellum and a type II Coh module associated with the bacterial cell surface. The annotation should be more specific: GO:1990312 (type-II cohesin domain binding) captures that CipA's dockerin binds to type II cohesins.
Proposed replacements: type-II cohesin domain binding
Supporting Evidence:
PMID:16384918
Here, we report the structure of an ultra-high-affinity (K(a) = 1.44 x 10(10) M(-1)) complex between type II Doc, together with its neighboring X module from the cellulosome scaffold of Clostridium thermocellum, and a type II Coh module associated with the bacterial cell surface.
GO:0005515 protein binding
IPI
PMID:17360613
Evidence for a dual binding mode of dockerin modules to cohe...
MODIFY
Summary: This annotation provides additional evidence for CipA's cohesin-dockerin interactions, demonstrating a dual binding mode where dockerins can interact with cohesins in two orientations.
Reason: PMID:17360613 provides structural evidence for the dual binding mode of type I dockerins to type I cohesins. The study shows that both repeats could interact with cohesins by a common mechanism and that dockerins can bind in a 180-degree rotated orientation (PMID:17360613). This further characterizes the type I cohesin-dockerin interaction. As with PMID:14623971, the annotation should be made more specific to GO:1990308 (type-I dockerin domain binding).
Proposed replacements: type-I dockerin domain binding
Supporting Evidence:
PMID:17360613
The dual binding mode is predicted to impart significant plasticity into the orientation of the catalytic subunits within this supramolecular assembly
GO:0043263 cellulosome
IDA
PMID:14623971
Cellulosome assembly revealed by the crystal structure of th...
NEW
Summary: CipA is the primary structural component of the cellulosome. This cellular component annotation is essential and missing from the current annotation set.
Reason: CipA is THE primary scaffoldin of the C. thermocellum cellulosome - it IS the cellulosome scaffold. UniProt names it Cellulosomal-scaffolding protein A and states it acts as a scaffolding protein in the cellulosome. The deep research confirms that cipA encodes the primary cellulosomal scaffoldin in Clostridium thermocellum (cipA-deep-research-falcon.md). This CC annotation is critical for understanding CipA's identity and should be added with IDA evidence based on direct characterization studies.
Supporting Evidence:
PMID:14623971
This megadalton catalytic machine organizes an enzymatic consortium on a multifaceted molecular scaffold
GO:0044575 cellulosome assembly
IDA
PMID:14623971
Cellulosome assembly revealed by the crystal structure of th...
NEW
Summary: CipA's primary biological process is assembling the cellulosome by recruiting dockerin-bearing enzymes. This is a core function that should be annotated.
Reason: CipA's nine type I cohesin domains bind dockerin-bearing enzymes, directly assembling the cellulosome complex. PMID:14623971 reveals the structure of the cohesin-dockerin complex from Clostridium thermocellum showing how cellulosome assembly occurs. The deep research states that CipA is a non-catalytic structural organizer that assembles multiple cellulases and hemicellulases into a multienzyme complex through high-affinity type I cohesin-dockerin interactions (cipA-deep-research-falcon.md). This is CipA's primary biological process.
Supporting Evidence:
PMID:14623971
This megadalton catalytic machine organizes an enzymatic consortium on a multifaceted molecular scaffold whose "cohesin" domains interact with corresponding "dockerin" domains of the enzymes.
GO:0005198 structural molecule activity
IDA
PMID:14623971
Cellulosome assembly revealed by the crystal structure of th...
NEW
Summary: CipA provides the structural framework of the cellulosome. This molecular function annotation captures its non-catalytic scaffolding role.
Reason: CipA contributes to the structural integrity of the cellulosome complex. It is explicitly described as a non-catalytic scaffolding glycoprotein that provides the structural backbone for enzyme assembly. Unlike catalytic cellulosomal components, CipA's molecular function is purely structural - organizing and maintaining the multi-enzyme complex. This annotation appropriately captures that CipA's molecular function is structural organization rather than catalysis.
Supporting Evidence:
PMID:14623971
This megadalton catalytic machine organizes an enzymatic consortium on a multifaceted molecular scaffold

Core Functions

CipA's nine type I cohesin domains bind type I dockerin domains on catalytic enzymes (cellulases, hemicellulases), recruiting them to the cellulosome complex. This is the primary mechanism by which CipA organizes the multi-enzyme complex.

Directly Involved In:
Cellular Locations:
In Complex:
cellulosome
Supporting Evidence:
  • PMID:14623971
    The data show that the beta-sheet cohesin domain interacts predominantly with one of the helices of the dockerin.
  • PMID:17360613
    The dual binding mode is predicted to impart significant plasticity into the orientation of the catalytic subunits within this supramolecular assembly

CipA's CBM3a domain (residues 365-523) specifically binds crystalline cellulose, targeting the entire cellulosome complex to its substrate. This concentrates the enzymatic machinery at the site of cellulose degradation.

Molecular Function:
cellulose binding
Cellular Locations:
In Complex:
cellulosome

CipA's C-terminal type II dockerin (with X module) binds type II cohesins on cell-surface anchoring proteins (SdbA, OlpB, Orf2p), tethering the cellulosome to the bacterial cell surface.

Cellular Locations:
In Complex:
cellulosome
Supporting Evidence:
  • PMID:16384918
    Here, we report the structure of an ultra-high-affinity (K(a) = 1.44 x 10(10) M(-1)) complex between type II Doc, together with its neighboring X module from the cellulosome scaffold of Clostridium thermocellum, and a type II Coh module associated with the bacterial cell surface.

CipA provides structural molecule activity as the primary scaffoldin that assembles the cellulosome by integrating dockerin-bearing enzymes, binding substrate via CBM3a, and anchoring to the cell surface.

Molecular Function:
structural molecule activity
Directly Involved In:
Cellular Locations:
In Complex:
cellulosome

References

Gene Ontology annotation through association of InterPro records with GO terms
Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
Gene Ontology annotation based on UniProtKB/Swiss-Prot Subcellular Location vocabulary mapping
Combined Automated Annotation using Multiple IEA Methods
Cellulosome assembly revealed by the crystal structure of the cohesin-dockerin complex
  • First crystal structure of type I cohesin-dockerin complex at 2.2 A resolution
    "Here we report the structure of the cohesin-dockerin complex from Clostridium thermocellum at 2.2-A resolution."
  • Demonstrates structural basis for cellulosome assembly via cohesin-dockerin interaction
    "This megadalton catalytic machine organizes an enzymatic consortium on a multifaceted molecular scaffold whose "cohesin" domains interact with corresponding "dockerin" domains of the enzymes."
  • Shows dockerin internal symmetry suggesting dual binding mode
    "Significantly, internal sequence duplication within the dockerin is manifested in near-perfect internal twofold symmetry, suggesting that both "halves" of the dockerin may interact with cohesins in a similar manner"
Mechanism of bacterial cell-surface attachment revealed by the structure of cellulosomal type II cohesin-dockerin complex
  • Crystal structure of type II cohesin-dockerin complex with ultra-high affinity
    "Here, we report the structure of an ultra-high-affinity (K(a) = 1.44 x 10(10) M(-1)) complex between type II Doc, together with its neighboring X module from the cellulosome scaffold of Clostridium thermocellum, and a type II Coh module associated with the bacterial cell surface."
  • X module enhances dockerin stability and cohesin recognition
    "Identification of X module-Doc and X module-Coh contacts reveal roles for the X module in Doc stability and enhanced Coh recognition."
Evidence for a dual binding mode of dockerin modules to cohesins
  • Demonstrates dual binding mode imparts plasticity to cellulosome structure
    "The dual binding mode is predicted to impart significant plasticity into the orientation of the catalytic subunits within this supramolecular assembly"
cipA-deep-research-falcon.md
Deep research summary of CipA function
  • Comprehensive literature synthesis confirms CipA is a non-catalytic scaffoldin
    "CipA is a non-catalytic structural organizer that assembles multiple cellulases and hemicellulases into a multienzyme complex through high-affinity type I cohesin-dockerin interactions"

Suggested Questions for Experts

Q: Are there conditions under which CipA might directly participate in carbohydrate modification or processing beyond its scaffolding role?

Suggested Experiments

Experiment: Comprehensive interactome analysis of CipA cohesin domains to quantify the full repertoire of dockerin-bearing proteins recruited to the cellulosome in vivo.

Hypothesis: While the cohesin-dockerin interaction is well-characterized structurally, the complete set of CipA interaction partners in vivo remains incompletely defined.

Tags

cellulosome

📚 Additional Documentation

Deep Research Falcon

(cipA-deep-research-falcon.md)

provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2025-12-26T17:02:56.112205'
end_time: '2025-12-26T17:09:11.500770'
duration_seconds: 375.39
template_file: templates/gene_research_go_focused.md
template_variables:
organism: ACET2
gene_id: cipA
gene_symbol: cipA
uniprot_accession: Q06851
protein_description: 'RecName: Full=Cellulosomal-scaffolding protein A; AltName:
Full=Cellulose-integrating protein A; AltName: Full=Cellulosomal glycoprotein
S1/SL; AltName: Full=Cohesin; Flags: Precursor;'
gene_info: Name=cipA; OrderedLocusNames=Cthe_3077;
organism_full: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322
/ NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
protein_family: Not specified in UniProt
protein_domains: CarboxyPept-like_regulatory. (IPR008969); CBM2/CBM3_carb-bd_dom_sf.
(IPR008965); CBM3. (IPR001956); CBM3_sf. (IPR036966); Cohesin_dom. (IPR002102)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 18


Question

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: Q06851
  • Protein Description: RecName: Full=Cellulosomal-scaffolding protein A; AltName: Full=Cellulose-integrating protein A; AltName: Full=Cellulosomal glycoprotein S1/SL; AltName: Full=Cohesin; Flags: Precursor;
  • Gene Information: Name=cipA; OrderedLocusNames=Cthe_3077;
  • Organism (full): Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  • Protein Family: Not specified in UniProt
  • Key Domains: CarboxyPept-like_regulatory. (IPR008969); CBM2/CBM3_carb-bd_dom_sf. (IPR008965); CBM3. (IPR001956); CBM3_sf. (IPR036966); Cohesin_dom. (IPR002102)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "cipA" matches the protein description above
  2. Verify the organism is correct: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'cipA' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene cipA (gene ID: cipA, UniProt: Q06851) in ACET2.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output

Question: You are an expert researcher providing comprehensive, well-cited information.

Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies

Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: Q06851
  • Protein Description: RecName: Full=Cellulosomal-scaffolding protein A; AltName: Full=Cellulose-integrating protein A; AltName: Full=Cellulosomal glycoprotein S1/SL; AltName: Full=Cohesin; Flags: Precursor;
  • Gene Information: Name=cipA; OrderedLocusNames=Cthe_3077;
  • Organism (full): Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  • Protein Family: Not specified in UniProt
  • Key Domains: CarboxyPept-like_regulatory. (IPR008969); CBM2/CBM3_carb-bd_dom_sf. (IPR008965); CBM3. (IPR001956); CBM3_sf. (IPR036966); Cohesin_dom. (IPR002102)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "cipA" matches the protein description above
  2. Verify the organism is correct: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) (Clostridium thermocellum).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'cipA' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene cipA (gene ID: cipA, UniProt: Q06851) in ACET2.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Plan status: Objectives verified and evidence gathered from primary and recent literature. Below is a comprehensive, cited research report on cipA (Cthe_3077; UniProt Q06851) from Acetivibrio thermocellus (Clostridium thermocellum), including identity verification, functional annotation, domain architecture, localization, roles in cellulose degradation, recent developments, applications, expert analysis, and quantitative data.

Comprehensive research report: cipA (Cthe_3077; UniProt Q06851) in Acetivibrio thermocellus

Identity verification and nomenclature
- Gene/protein identity: cipA encodes the primary cellulosomal scaffoldin (often termed “CipA”) in Clostridium thermocellum (synonym: Acetivibrio thermocellus), locus Cthe_3077. Classical and mechanistic literature consistently identifies CipA as the main non-catalytic scaffoldin of the C. thermocellum cellulosome, organizing dockerin-bearing enzymes into a multienzyme complex and linking the complex to the cell surface via anchoring scaffoldins (SdbA/OlpB/Orf2p). This aligns with UniProt Q06851 annotations. (pinheiro2010novelinsightinto pages 47-50, pinheiro2009functionalinsightsinto pages 1-4)

Key concepts, definitions, and current understanding
- Cellulosome: a supramolecular multi-enzyme complex that deconstructs plant cell wall polysaccharides. In C. thermocellum, the primary scaffoldin CipA contains multiple type I cohesins that bind enzymatic subunits carrying type I dockerins, a cellulose-targeting CBM3a, and a C-terminal X–dockerin module (type II) that binds anchoring scaffoldins bearing type II cohesins. (pinheiro2010novelinsightinto pages 47-50, pinheiro2010novelinsightinto pages 37-40)
- Cohesin–dockerin specificity: Type I cohesins on CipA bind type I dockerins on catalytic subunits; the C-terminal type II dockerin (on CipA’s X–Doc dyad) binds type II cohesins on anchoring scaffoldins (no cross-specificity between types I and II). Coh–Doc interactions are Ca2+-dependent and among the strongest non-covalent protein–protein interactions; dockerins carry duplicated EF-hand-like Ca2+-binding motifs and can support dual-binding modes, though stoichiometry is 1:1. (pinheiro2010novelinsightinto pages 47-50, santos2025unconventionalcohesindockerinbinding pages 46-49, pinheiro2010novelinsightinto pages 40-42)
- Central role of CipA: Genetic and systems studies demonstrate that the primary scaffoldin CipA is essential for high-efficiency cellulose degradation in C. thermocellum; secondary scaffoldins contribute but are less critical. C. thermocellum deploys both cell-bound and cell-free cellulosomes. Science Advances (2016) provides a detailed systems analysis with functional deletion evidence. URL: https://doi.org/10.1126/sciadv.1501254 (Feb 2016). (xu2016dramaticperformanceof pages 1-2)

Protein architecture and domains (alignment with UniProt/IPR domains)
- Domain map: 2 CohI – CBM3a – 7 CohI – X–DocII, totaling nine type I cohesins with a central family 3a CBM and C-terminal X–dockerin (type II). This corroborates the reported modular architecture and the presence of CBM3/CBM3_sf and cohesin domains. (pinheiro2010novelinsightinto pages 37-40, pinheiro2010novelinsightinto pages 47-50)
- CBM3a function: targets crystalline cellulose to increase local enzyme concentration and synergize hydrolysis; CBM3a is the prevalent cellulosomal CBM in C. thermocellum. (pinheiro2010novelinsightinto pages 47-50, xu2016dramaticperformanceof pages 1-2)
- Cohesins and dockerins: nine type I cohesins on CipA integrate numerous dockerin-bearing CAZymes; dockerins are ~70 aa with duplicated segments yielding two potential binding interfaces and Ca2+ dependence. (pinheiro2010novelinsightinto pages 47-50, pinheiro2009functionalinsightsinto pages 7-9, santos2025unconventionalcohesindockerinbinding pages 46-49, pinheiro2010novelinsightinto pages 40-42)
- Anchoring via X–DocII and SLH-bearing partners: CipA’s X–DocII binds type II cohesins on anchoring scaffoldins (e.g., SdbA, OlpB, Orf2p), which tether the complex to the cell wall through surface-layer homology (SLH) modules. (pinheiro2010novelinsightinto pages 37-40, pinheiro2010novelinsightinto pages 47-50, pinheiro2009functionalinsightsinto pages 7-9)

Feature Description Evidence (citation IDs)
Protein identity / organism CipA (gene cipA; locus Cthe_3077) — primary scaffoldin from Clostridium thermocellum (syn. Acetivibrio thermocellus). (pinheiro2010novelinsightinto pages 47-50, pinheiro2009functionalinsightsinto pages 1-4)
Primary role Organizes the cellulosome by anchoring dockerin-bearing enzymatic subunits (type I Coh–Doc) and linking to anchoring scaffoldins via C-terminal X–Doc/type II interactions. (xu2016dramaticperformanceof pages 1-2, pinheiro2010novelinsightinto pages 47-50)
Domain map (reported) Modular architecture reported as 2 CohI – CBM3a – 7 CohI – X–DocII (total nine type I cohesins, central CBM3a, C-terminal X–Doc/type II). (pinheiro2010novelinsightinto pages 37-40, pinheiro2010novelinsightinto pages 47-50)
Number and type of cohesins Nine type I cohesin modules (type I Coh) distributed along the scaffoldin. (pinheiro2010novelinsightinto pages 47-50, pinheiro2009functionalinsightsinto pages 7-9)
CBM3a function Family 3a carbohydrate-binding module that binds crystalline cellulose, targeting the cellulosome to substrate and increasing local enzyme concentration. (pinheiro2010novelinsightinto pages 47-50, xu2016dramaticperformanceof pages 1-2)
X–DocII function & anchoring partners C-terminal X–dockerin (type II) mediates binding to type II cohesins on anchoring scaffoldins (e.g., SdbA, OlpB, Orf2p) which are cell-surface associated (SLH-bearing). (pinheiro2010novelinsightinto pages 47-50, pinheiro2010novelinsightinto pages 37-40)
CohI–DocI binding properties Cohesin–dockerin interactions are very strong noncovalent bonds, Ca2+-dependent; dockerins have duplicated Ca2+-binding motifs and can permit dual-binding (180°) modes though stoichiometry is 1:1. (santos2025unconventionalcohesindockerinbinding pages 46-49, pinheiro2010novelinsightinto pages 40-42)
Localization Primarily cell-surface anchored via SLH-bearing secondary scaffoldins (OlpA/OlpB/SdbA/Orf2p) but also participates in cell-free (diffusible) cellulosomes. (pinheiro2009functionalinsightsinto pages 7-9, xu2016dramaticperformanceof pages 1-2)
Quantitative system context C. thermocellum encodes ~72 dockerin-containing cellulosomal proteins (plus additional free enzymes); CipA is central to assembly and activity. (xu2016dramaticperformanceof pages 1-2, pinheiro2009functionalinsightsinto pages 1-4)
Recent (2023–2024) applications & engineering Recent reviews and studies describe designer cellulosomes, scaffold/cohesin engineering, CBM-enabled material functionalization, and modular scaffold approaches for immobilization and enhanced hydrolysis. (li2024harnessingcellulosebindingprotein pages 10-10, lindic2025structuralandfunctional pages 4-5, lindic2025structuralandfunctional pages 19-20)

Table: Concise table summarizing CipA (Cthe_3077) domain architecture, roles, localization, system context, and recent 2023–2024 application-relevant literature with evidence citations.

Primary function, pathway context, and localization
- Function: CipA is a non-catalytic structural organizer that assembles multiple cellulases and hemicellulases into a multienzyme complex through high-affinity type I cohesin–dockerin interactions, thereby enhancing synergistic cellulose and plant cell wall deconstruction. Its CBM3a targets the complex to cellulose, and its C-terminal X–DocII connects the complex to the cell via anchoring scaffoldins. (pinheiro2010novelinsightinto pages 47-50, xu2016dramaticperformanceof pages 1-2)
- Localization: CipA-enabled cellulosomes are predominantly cell surface-associated through anchoring scaffoldins bearing SLH domains. C. thermocellum also produces cell-free (diffusible) cellulosomes that operate at a distance from the bacterial surface, consistent with secondary scaffoldins lacking SLH modules. (pinheiro2009functionalinsightsinto pages 7-9, xu2016dramaticperformanceof pages 1-2)
- Pathway context: CipA is central to the exoproteome that deconstructs cellulose to cellodextrins for uptake and metabolism. Disruption of the primary scaffoldin alters expression of CAZymes and transport systems, underscoring its systems-level role in lignocellulose conversion. (xu2016dramaticperformanceof pages 1-2)

Recent developments and latest research (priority to 2023–2024)
- 2024 review on cellulose-binding domains for materials engineering: Detailed coverage of CBM-based functionalization strategies—including family 3 CBMs—demonstrates active 2024 work leveraging cellulosomal CBMs for material binding and enzyme immobilization. URL: https://doi.org/10.1186/s40643-024-00790-4 (Jul 2024). (li2024harnessingcellulosebindingprotein pages 10-10)
- 2023–2024 synthesis and engineering trends summarized in recent reviews: Emerging strategies to engineer cohesin–dockerin affinity and modularity and to apply CBMs/cohesins in designer assemblies are highlighted, informing CipA-like scaffoldin engineering and applications. While these summaries appear in a 2025 review, they draw on 2023–2024 primary sources on cohesin–dockerin tuning and CBM-enhanced activity/stability. (lindic2025structuralandfunctional pages 19-20, lindic2025structuralandfunctional pages 4-5)
- Genomic and ecological context (2024): Large-scale genomic analyses underscore the diversity and complexity of cellulosome-producing bacteria, with Acetivibrio/Clostridium thermocellum exemplifying complex cellulosomes and diverse enzyme complements, supporting CipA’s central role in such architectures. (lindic2025structuralandfunctional pages 19-20)

Current applications and real-world implementations
- Designer/synthetic cellulosomes: Modular scaffoldins and cohesin–dockerin pairs are widely used to build designer cellulosomes that recapitulate CipA-like architecture to enhance biomass deconstruction; design considerations include dockerin/cohesin selection, linker composition, and CBM choice. Results consistently show enhanced hydrolysis versus equivalent free-enzyme mixtures, though stability and context-dependence remain challenges in industrial settings. (lindic2025structuralandfunctional pages 4-5, li2024harnessingcellulosebindingprotein pages 10-10)
- Immobilization and materials functionalization: CBM-driven binding (including CBM3a derivatives) enables enzyme immobilization on cellulose and cellulose-like supports for biocatalysis and biosensing, aligning with CipA’s CBM3a targeting principle. 2024 examples include use of CBMs to functionalize cellulose materials and immobilize fusion proteins for improved performance. (li2024harnessingcellulosebindingprotein pages 10-10)
- Consolidated bioprocessing and heterologous expression: Prior implementations have expressed CipA variants and cellulosomal components in other clostridia to secrete self-assembling mini-cellulosomes, illustrating cross-species deployment of CipA-like scaffoldins and informing current CBP strategies. URL: https://doi.org/10.1186/1754-6834-6-117 (Aug 2013). ()

Expert opinions and authoritative analyses
- Mechanistic consensus: Primary scaffoldin CipA integrates enzymes via type I cohesins, targets cellulose via CBM3a, and anchors to the cell via type II interactions—yielding a highly synergistic macromolecular machine. Expert reviews emphasize the unmatched strength and specificity of cohesin–dockerin interactions and the functional necessity of Ca2+. (pinheiro2010novelinsightinto pages 47-50, santos2025unconventionalcohesindockerinbinding pages 46-49)
- Systems-level importance of CipA: Experimental deletions and exoproteome analyses support that CipA is the dominant determinant of high cellulose solubilization efficiency in C. thermocellum; intact CipA-mediated assembly affects gene regulation, transport, and particle-scale deconstruction mechanisms. URL: https://doi.org/10.1126/sciadv.1501254 (Feb 2016). (xu2016dramaticperformanceof pages 1-2)

Relevant quantitative statistics and data
- Domain and module counts: CipA carries nine type I cohesins and a single CBM3a, with a C-terminal X–DocII dyad. (pinheiro2010novelinsightinto pages 47-50, pinheiro2010novelinsightinto pages 37-40)
- System enzyme complement: The C. thermocellum genome encodes approximately 70+ dockerin-containing cellulosomal proteins in addition to free CAZymes, underscoring the potential multivalency of CipA-centered complexes. (pinheiro2009functionalinsightsinto pages 1-4)
- Binding properties: Cohesin–dockerin complexes are Ca2+-dependent, with dockerins containing duplicated EF-hand-like loops and capable of dual-binding orientations; the interaction is among the strongest non-covalent protein–protein interactions reported for modular assemblies. (santos2025unconventionalcohesindockerinbinding pages 46-49, pinheiro2010novelinsightinto pages 40-42)

Ambiguity check and domain alignment
- Ambiguity: The symbol “cipA” can be ambiguous across organisms; the organismal context here is explicitly C. thermocellum (A. thermocellus). All cited sources that name CipA refer to the C. thermocellum primary scaffoldin Cthe_3077, matching UniProt Q06851. (pinheiro2010novelinsightinto pages 47-50, pinheiro2009functionalinsightsinto pages 1-4)
- Domain/family alignment: Literature-confirmed domains (cohesins, CBM3a, X–DocII; anchoring via SLH-bearing secondary scaffoldins) align with UniProt/IPR annotations for cohesin domains (IPR002102) and CBM3-related domains (IPR001956; IPR036966). (pinheiro2010novelinsightinto pages 37-40, pinheiro2010novelinsightinto pages 47-50)

Data and source details with URLs and dates (where available)
- Xu et al., Science Advances, 2016—systems and genetic insights on CipA’s centrality and cell-free cellulosomes. URL: https://doi.org/10.1126/sciadv.1501254 (Feb 2016). (xu2016dramaticperformanceof pages 1-2)
- Li & Liu, Bioresources and Bioprocessing, 2024—2024 review on CBM-based material functionalization relevant to CBM3a uses in designer systems. URL: https://doi.org/10.1186/s40643-024-00790-4 (Jul 2024). (li2024harnessingcellulosebindingprotein pages 10-10)
- Pinheiro et al., Biochemical Journal, 2009—CipA architecture (nine type I cohesins, CBM3a, X–DocII) and cohesin/dockerin specificity mapping. URL: https://doi.org/10.1042/bj20091152 (Dec 2009). (pinheiro2009functionalinsightsinto pages 1-4)
- Pinheiro (2010) review—mechanistic overview of type I vs. type II cohesin–dockerin interactions; CipA’s 2 CohI–CBM3a–7 CohI–X–DocII map and anchoring partners with SLH modules. (pinheiro2010novelinsightinto pages 47-50, pinheiro2010novelinsightinto pages 37-40)
- Coh–Doc biophysics and dual-binding modes—general principles and strength/stoichiometry. (santos2025unconventionalcohesindockerinbinding pages 46-49, pinheiro2010novelinsightinto pages 40-42)

Conclusions
CipA (cipA; Cthe_3077; UniProt Q06851) is the primary scaffoldin of A. thermocellus/C. thermocellum cellulosomes. Its modular architecture—nine type I cohesins, a central CBM3a, and a C-terminal X–dockerin (type II)—orchestrates assembly of numerous dockerin-bearing CAZymes and anchors the complex to the cell surface through SLH-bearing secondary scaffoldins. CipA is essential for high-efficiency cellulose deconstruction, functioning at the interface of enzyme synergy, substrate targeting, and cell-surface attachment, with additional evidence for cell-free complexes in the exoproteome. Recent 2023–2024 advances emphasize modular engineering of CBMs/cohesins for designer cellulosomes and materials functionalization, reinforcing the translational value of the CipA paradigm. (pinheiro2010novelinsightinto pages 47-50, pinheiro2010novelinsightinto pages 37-40, xu2016dramaticperformanceof pages 1-2, li2024harnessingcellulosebindingprotein pages 10-10, santos2025unconventionalcohesindockerinbinding pages 46-49, pinheiro2009functionalinsightsinto pages 1-4)

References

  1. (pinheiro2010novelinsightinto pages 47-50): BA Pinheiro. Novel insight into the mechanism of cellulosome assembly and plant cell wall hydrolysis in anaerobic bacteria. Unknown journal, 2010.

  2. (pinheiro2009functionalinsightsinto pages 1-4): Benedita A. Pinheiro, Harry J. Gilbert, Kazutaka Sakka, Kazuo Sakka, Vânia O. Fernandes, José A. M. Prates, Victor D. Alves, David N. Bolam, Luís M. A. Ferreira, and Carlos M. G. A Fontes. Functional insights into the role of novel type i cohesin and dockerin domains from clostridium thermocellum. The Biochemical journal, 424 3:375-84, Dec 2009. URL: https://doi.org/10.1042/bj20091152, doi:10.1042/bj20091152. This article has 56 citations.

  3. (pinheiro2010novelinsightinto pages 37-40): BA Pinheiro. Novel insight into the mechanism of cellulosome assembly and plant cell wall hydrolysis in anaerobic bacteria. Unknown journal, 2010.

  4. (santos2025unconventionalcohesindockerinbinding pages 46-49): MRCD Santos. Unconventional cohesin-dockerin binding mechanisms reveal the complexity of cellulosome assembly. Unknown journal, 2025.

  5. (pinheiro2010novelinsightinto pages 40-42): BA Pinheiro. Novel insight into the mechanism of cellulosome assembly and plant cell wall hydrolysis in anaerobic bacteria. Unknown journal, 2010.

  6. (xu2016dramaticperformanceof pages 1-2): Qi Xu, Michael G. Resch, Kara Podkaminer, Shihui Yang, John O. Baker, Bryon S. Donohoe, Charlotte Wilson, Dawn M. Klingeman, Daniel G. Olson, Stephen R. Decker, Richard J. Giannone, Robert L. Hettich, Steven D. Brown, Lee R. Lynd, Edward A. Bayer, Michael E. Himmel, and Yannick J. Bomble. Dramatic performance of clostridium thermocellum explained by its wide range of cellulase modalities. Science Advances, Feb 2016. URL: https://doi.org/10.1126/sciadv.1501254, doi:10.1126/sciadv.1501254. This article has 166 citations and is from a highest quality peer-reviewed journal.

  7. (pinheiro2009functionalinsightsinto pages 7-9): Benedita A. Pinheiro, Harry J. Gilbert, Kazutaka Sakka, Kazuo Sakka, Vânia O. Fernandes, José A. M. Prates, Victor D. Alves, David N. Bolam, Luís M. A. Ferreira, and Carlos M. G. A Fontes. Functional insights into the role of novel type i cohesin and dockerin domains from clostridium thermocellum. The Biochemical journal, 424 3:375-84, Dec 2009. URL: https://doi.org/10.1042/bj20091152, doi:10.1042/bj20091152. This article has 56 citations.

  8. (li2024harnessingcellulosebindingprotein pages 10-10): Shaowei Li and Guodong Liu. Harnessing cellulose-binding protein domains for the development of functionalized cellulose materials. Bioresources and Bioprocessing, Jul 2024. URL: https://doi.org/10.1186/s40643-024-00790-4, doi:10.1186/s40643-024-00790-4. This article has 6 citations and is from a peer-reviewed journal.

  9. (lindic2025structuralandfunctional pages 4-5): Nataša Lindič and Maša Vodovnik. Structural and functional insights into cellulosomes: masters of plant cell wall degradation. Frontiers in Microbiology, Sep 2025. URL: https://doi.org/10.3389/fmicb.2025.1638551, doi:10.3389/fmicb.2025.1638551. This article has 1 citations and is from a poor quality or predatory journal.

  10. (lindic2025structuralandfunctional pages 19-20): Nataša Lindič and Maša Vodovnik. Structural and functional insights into cellulosomes: masters of plant cell wall degradation. Frontiers in Microbiology, Sep 2025. URL: https://doi.org/10.3389/fmicb.2025.1638551, doi:10.3389/fmicb.2025.1638551. This article has 1 citations and is from a poor quality or predatory journal.

Citations

  1. xu2016dramaticperformanceof pages 1-2
  2. li2024harnessingcellulosebindingprotein pages 10-10
  3. lindic2025structuralandfunctional pages 19-20
  4. pinheiro2009functionalinsightsinto pages 1-4
  5. pinheiro2010novelinsightinto pages 47-50
  6. pinheiro2010novelinsightinto pages 37-40
  7. santos2025unconventionalcohesindockerinbinding pages 46-49
  8. pinheiro2010novelinsightinto pages 40-42
  9. pinheiro2009functionalinsightsinto pages 7-9
  10. lindic2025structuralandfunctional pages 4-5
  11. https://doi.org/10.1126/sciadv.1501254
  12. https://doi.org/10.1186/s40643-024-00790-4
  13. https://doi.org/10.1186/1754-6834-6-117
  14. https://doi.org/10.1042/bj20091152
  15. https://doi.org/10.1042/bj20091152,
  16. https://doi.org/10.1126/sciadv.1501254,
  17. https://doi.org/10.1186/s40643-024-00790-4,
  18. https://doi.org/10.3389/fmicb.2025.1638551,

📄 View Raw YAML

id: Q06851
gene_symbol: cipA
aliases:
- CipA
- Cellulose-integrating protein A
- Cellulosomal glycoprotein S1/SL
- Cthe_3077
product_type: PROTEIN
status: DRAFT
taxon:
  id: NCBITaxon:203119
  label: Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC
    103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372)
description: CipA is the primary cellulosomal scaffoldin protein in Acetivibrio thermocellus
  (formerly Clostridium thermocellum). It is a large (1853 aa) non-catalytic structural
  protein that serves as the central organizing scaffold of the cellulosome - a supramolecular
  multi-enzyme complex that efficiently degrades plant cell wall polysaccharides.
  CipA contains nine type I cohesin domains that bind dockerin-bearing catalytic enzymes
  (cellulases, hemicellulases), a central CBM3a carbohydrate-binding module that targets
  the complex to crystalline cellulose, and a C-terminal type II dockerin (with associated
  X module) that anchors the cellulosome to cell-surface proteins bearing type II
  cohesins (SdbA, OlpB, Orf2p). CipA does NOT possess any catalytic/enzymatic activity
  - its function is purely structural and organizational, bringing multiple enzymes
  into proximity for synergistic cellulose degradation.
existing_annotations:
- term:
    id: GO:0000272
    label: polysaccharide catabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: CipA is involved in polysaccharide catabolism but only indirectly through
      its scaffolding role. CipA itself has no catalytic activity; it organizes dockerin-bearing
      enzymes that perform the actual hydrolysis. The annotation captures participation
      in the process but overstates direct involvement.
    action: KEEP_AS_NON_CORE
    reason: While CipA is essential for efficient polysaccharide degradation by the
      cellulosome, it is not directly involved in the catabolic process itself. CipA
      is a non-catalytic structural organizer that assembles multiple cellulases and
      hemicellulases into a multienzyme complex. The scaffoldin promotes enzyme synergy
      but does not catalyze any reactions. This annotation is acceptable as a secondary/non-core
      function reflecting its organizational role in the process.
    supported_by:
    - reference_id: file:ACET2/cipA/cipA-deep-research-falcon.md
      supporting_text: CipA is a non-catalytic structural organizer that assembles
        multiple cellulases and hemicellulases into a multienzyme complex through
        high-affinity type I cohesin-dockerin interactions
- term:
    id: GO:0004553
    label: hydrolase activity, hydrolyzing O-glycosyl compounds
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: This annotation is INCORRECT. CipA has no hydrolase activity whatsoever.
      It is a non-catalytic scaffoldin protein that organizes enzymes but does not
      itself possess any enzymatic function. This appears to be an erroneous transfer
      from the dockerin domain (IPR002105), which is also present in catalytic cellulosomal
      enzymes.
    action: REMOVE
    reason: CipA is explicitly described in the literature as a non-catalytic scaffolding
      glycoprotein (UniProt CC-FUNCTION). The UniProt function annotation states that
      CipA acts as a scaffolding protein in the cellulosome and promotes binding of
      cellulose to the catalytic domains of the cellulolytic enzymes - note it promotes
      binding TO catalytic domains, it does not have catalytic activity itself. The
      deep research confirms CipA is a non-catalytic structural organizer (cipA-deep-research-falcon.md).
      The InterPro domain IPR002105 (Dockerin_1_rpt) is present in both scaffoldins
      and catalytic enzymes; the hydrolase activity annotation was incorrectly transferred
      because dockerins are commonly found in hydrolases, but the scaffoldin CipA
      only uses its dockerin (type II) for anchoring to cell-surface proteins, not
      for catalysis. This is a clear over-annotation error.
    supported_by:
    - reference_id: file:ACET2/cipA/cipA-deep-research-falcon.md
      supporting_text: CipA is a non-catalytic structural organizer that assembles
        multiple cellulases and hemicellulases into a multienzyme complex
- term:
    id: GO:0005576
    label: extracellular region
  evidence_type: IEA
  original_reference_id: GO_REF:0000044
  review:
    summary: This annotation is correct. CipA is a secreted protein that functions
      in the extracellular space, either cell-surface anchored or as part of cell-free
      cellulosomes.
    action: ACCEPT
    reason: UniProt explicitly states that CipA is secreted and remains at the cell
      surface. The deep research confirms that CipA-enabled cellulosomes are predominantly
      cell surface-associated through anchoring scaffoldins bearing SLH domains. C.
      thermocellum also produces cell-free (diffusible) cellulosomes that operate
      at a distance from the bacterial surface. The protein has a signal peptide (residues
      1-28) and functions extracellularly.
    supported_by:
    - reference_id: file:ACET2/cipA/cipA-deep-research-falcon.md
      supporting_text: CipA-enabled cellulosomes are predominantly cell surface-associated
        through anchoring scaffoldins bearing SLH domains. C. thermocellum also produces
        cell-free (diffusible) cellulosomes that operate at a distance from the bacterial
        surface
- term:
    id: GO:0005975
    label: carbohydrate metabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: This annotation is too general and misrepresents CipA's role. CipA does
      not metabolize carbohydrates; it organizes enzymes that do. This is an overly
      broad annotation derived from the CBM3 domain.
    action: MODIFY
    reason: CipA's CBM3a domain binds cellulose but does not metabolize it. The protein's
      role is structural - organizing the cellulosome for efficient carbohydrate degradation.
      A more appropriate annotation would be GO:0044575 (cellulosome assembly) which
      captures the actual biological process CipA participates in. The annotation
      to carbohydrate metabolic process incorrectly implies direct metabolic activity.
    proposed_replacement_terms:
    - id: GO:0044575
      label: cellulosome assembly
    supported_by:
    - reference_id: file:ACET2/cipA/cipA-deep-research-falcon.md
      supporting_text: 'CBM3a function: targets crystalline cellulose to increase
        local enzyme concentration and synergize hydrolysis; CBM3a is the prevalent
        cellulosomal CBM in C. thermocellum'
- term:
    id: GO:0030245
    label: cellulose catabolic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: Similar to GO:0000272, this annotation captures CipA's indirect involvement
      in cellulose degradation through its scaffolding function but overstates direct
      participation in the catabolic process.
    action: KEEP_AS_NON_CORE
    reason: CipA is essential for efficient cellulose catabolism via the cellulosome,
      but it does not itself degrade cellulose. Its CBM3a module targets the complex
      to cellulose substrate, and its cohesins organize the catalytic enzymes that
      perform hydrolysis. The annotation reflects involvement in the process but should
      be considered non-core since CipA's primary function is structural organization
      (cellulosome assembly) rather than direct catalysis.
    supported_by:
    - reference_id: file:ACET2/cipA/cipA-deep-research-falcon.md
      supporting_text: CipA is essential for high-efficiency cellulose deconstruction,
        functioning at the interface of enzyme synergy, substrate targeting, and cell-surface
        attachment
- term:
    id: GO:0030246
    label: carbohydrate binding
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: This annotation is correct but could be made more specific. CipA binds
      carbohydrates via its CBM3a domain, specifically binding crystalline cellulose.
    action: ACCEPT
    reason: 'CipA contains a well-characterized CBM3a (carbohydrate-binding module
      family 3a) domain at positions 365-523. The deep research confirms that CBM3a
      targets crystalline cellulose to increase local enzyme concentration and synergize
      hydrolysis. CBM3a is the prevalent cellulosomal CBM in C. thermocellum (cipA-deep-research-falcon.md).
      The crystal structure of this domain has been solved (PDB: 1NBC at 1.75 A resolution).
      This is a core molecular function.'
    supported_by:
    - reference_id: file:ACET2/cipA/cipA-deep-research-falcon.md
      supporting_text: 'CBM3a function: targets crystalline cellulose to increase
        local enzyme concentration and synergize hydrolysis'
- term:
    id: GO:0030248
    label: cellulose binding
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: This annotation is correct and represents a core molecular function of
      CipA. The CBM3a domain specifically binds crystalline cellulose.
    action: ACCEPT
    reason: 'CipA''s CBM3a domain (residues 365-523) specifically binds crystalline
      cellulose, targeting the cellulosome to its substrate. This is experimentally
      verified through structural studies - the crystal structure of the CBM3a domain
      was solved at 1.75 A resolution (PDB: 1NBC; Tormo et al., EMBO J 1996). The
      deep research states that CBM3a function targets crystalline cellulose to increase
      local enzyme concentration and synergize hydrolysis (cipA-deep-research-falcon.md).
      This is a core function of CipA - without cellulose binding, the cellulosome
      cannot target its substrate.'
    supported_by:
    - reference_id: file:ACET2/cipA/cipA-deep-research-falcon.md
      supporting_text: 'CBM3a function: targets crystalline cellulose to increase
        local enzyme concentration and synergize hydrolysis'
- term:
    id: GO:0071555
    label: cell wall organization
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: This annotation is misleading. CipA is involved in PLANT cell wall DEGRADATION
      (as a target substrate), not in bacterial cell wall organization. The UniProt
      keyword mapping appears to have conflated cell wall degradation with cell wall
      organization.
    action: REMOVE
    reason: CipA functions in the degradation of plant cell walls (cellulose, hemicellulose),
      not in organizing the bacterial cell wall. While CipA is anchored at the cell
      surface, its function is to organize the cellulosome for degrading extracellular
      plant material, not to organize the bacterium's own cell wall. The UniProt keywords
      include Cell wall biogenesis/degradation which likely triggered this annotation,
      but the context is substrate degradation, not endogenous cell wall organization.
      This is a semantic error in the automated annotation pipeline.
    supported_by:
    - reference_id: file:ACET2/cipA/cipA-deep-research-falcon.md
      supporting_text: CipA's modular architecture orchestrates assembly of numerous
        dockerin-bearing CAZymes and anchors the complex to the cell surface through
        SLH-bearing secondary scaffoldins
- term:
    id: GO:0005515
    label: protein binding
  evidence_type: IPI
  original_reference_id: PMID:14623971
  review:
    summary: This annotation is experimentally supported but should be made more specific.
      PMID:14623971 reports the crystal structure of the cohesin-dockerin complex,
      demonstrating that CipA's cohesin domains bind dockerin domains on enzymes.
    action: MODIFY
    reason: 'The IPI annotation from PMID:14623971 is based on the crystal structure
      of the type I cohesin-dockerin complex from C. thermocellum (WITH: UniProtKB:P51584,
      XynY xylanase). The study demonstrates that the beta-sheet cohesin domain interacts
      predominantly with one of the helices of the dockerin (PMID:14623971). However,
      protein binding is too vague. CipA specifically binds type I dockerin domains
      through its nine type I cohesin modules. The annotation should be changed to
      GO:1990308 (type-I dockerin domain binding) which precisely describes this molecular
      function.'
    proposed_replacement_terms:
    - id: GO:1990308
      label: type-I dockerin domain binding
    supported_by:
    - reference_id: PMID:14623971
      supporting_text: The data show that the beta-sheet cohesin domain interacts
        predominantly with one of the helices of the dockerin.
    additional_reference_ids:
    - PMID:14623971
- term:
    id: GO:0005515
    label: protein binding
  evidence_type: IPI
  original_reference_id: PMID:16384918
  review:
    summary: This annotation documents CipA's C-terminal type II dockerin binding
      to type II cohesins on cell-surface anchoring proteins. The interaction demonstrated
      is between CipA's X-DocII module and a type II cohesin from Cthe_1307.
    action: MODIFY
    reason: 'PMID:16384918 reports the structure of the type II cohesin-dockerin complex
      involving CipA''s C-terminal X-dockerin module (WITH: UniProtKB:A3DF10, Cthe_1307).
      The study describes an ultra-high-affinity complex between type II Doc together
      with its neighboring X module from the cellulosome scaffold of Clostridium thermocellum
      and a type II Coh module associated with the bacterial cell surface. The annotation
      should be more specific: GO:1990312 (type-II cohesin domain binding) captures
      that CipA''s dockerin binds to type II cohesins.'
    proposed_replacement_terms:
    - id: GO:1990312
      label: type-II cohesin domain binding
    supported_by:
    - reference_id: PMID:16384918
      supporting_text: Here, we report the structure of an ultra-high-affinity (K(a)
        = 1.44 x 10(10) M(-1)) complex between type II Doc, together with its neighboring
        X module from the cellulosome scaffold of Clostridium thermocellum, and a
        type II Coh module associated with the bacterial cell surface.
    additional_reference_ids:
    - PMID:16384918
- term:
    id: GO:0005515
    label: protein binding
  evidence_type: IPI
  original_reference_id: PMID:17360613
  review:
    summary: This annotation provides additional evidence for CipA's cohesin-dockerin
      interactions, demonstrating a dual binding mode where dockerins can interact
      with cohesins in two orientations.
    action: MODIFY
    reason: PMID:17360613 provides structural evidence for the dual binding mode of
      type I dockerins to type I cohesins. The study shows that both repeats could
      interact with cohesins by a common mechanism and that dockerins can bind in
      a 180-degree rotated orientation (PMID:17360613). This further characterizes
      the type I cohesin-dockerin interaction. As with PMID:14623971, the annotation
      should be made more specific to GO:1990308 (type-I dockerin domain binding).
    proposed_replacement_terms:
    - id: GO:1990308
      label: type-I dockerin domain binding
    supported_by:
    - reference_id: PMID:17360613
      supporting_text: The dual binding mode is predicted to impart significant plasticity
        into the orientation of the catalytic subunits within this supramolecular
        assembly
    additional_reference_ids:
    - PMID:17360613
- term:
    id: GO:0043263
    label: cellulosome
  evidence_type: IDA
  original_reference_id: PMID:14623971
  review:
    summary: CipA is the primary structural component of the cellulosome. This cellular
      component annotation is essential and missing from the current annotation set.
    action: NEW
    reason: CipA is THE primary scaffoldin of the C. thermocellum cellulosome - it
      IS the cellulosome scaffold. UniProt names it Cellulosomal-scaffolding protein
      A and states it acts as a scaffolding protein in the cellulosome. The deep research
      confirms that cipA encodes the primary cellulosomal scaffoldin in Clostridium
      thermocellum (cipA-deep-research-falcon.md). This CC annotation is critical
      for understanding CipA's identity and should be added with IDA evidence based
      on direct characterization studies.
    supported_by:
    - reference_id: PMID:14623971
      supporting_text: This megadalton catalytic machine organizes an enzymatic consortium
        on a multifaceted molecular scaffold
    additional_reference_ids:
    - cipA-deep-research-falcon.md
- term:
    id: GO:0044575
    label: cellulosome assembly
  evidence_type: IDA
  original_reference_id: PMID:14623971
  review:
    summary: CipA's primary biological process is assembling the cellulosome by recruiting
      dockerin-bearing enzymes. This is a core function that should be annotated.
    action: NEW
    reason: CipA's nine type I cohesin domains bind dockerin-bearing enzymes, directly
      assembling the cellulosome complex. PMID:14623971 reveals the structure of the
      cohesin-dockerin complex from Clostridium thermocellum showing how cellulosome
      assembly occurs. The deep research states that CipA is a non-catalytic structural
      organizer that assembles multiple cellulases and hemicellulases into a multienzyme
      complex through high-affinity type I cohesin-dockerin interactions (cipA-deep-research-falcon.md).
      This is CipA's primary biological process.
    supported_by:
    - reference_id: PMID:14623971
      supporting_text: This megadalton catalytic machine organizes an enzymatic consortium
        on a multifaceted molecular scaffold whose "cohesin" domains interact with
        corresponding "dockerin" domains of the enzymes.
    additional_reference_ids:
    - cipA-deep-research-falcon.md
- term:
    id: GO:0005198
    label: structural molecule activity
  evidence_type: IDA
  original_reference_id: PMID:14623971
  review:
    summary: CipA provides the structural framework of the cellulosome. This molecular
      function annotation captures its non-catalytic scaffolding role.
    action: NEW
    reason: CipA contributes to the structural integrity of the cellulosome complex.
      It is explicitly described as a non-catalytic scaffolding glycoprotein that
      provides the structural backbone for enzyme assembly. Unlike catalytic cellulosomal
      components, CipA's molecular function is purely structural - organizing and
      maintaining the multi-enzyme complex. This annotation appropriately captures
      that CipA's molecular function is structural organization rather than catalysis.
    supported_by:
    - reference_id: PMID:14623971
      supporting_text: This megadalton catalytic machine organizes an enzymatic consortium
        on a multifaceted molecular scaffold
    additional_reference_ids:
    - cipA-deep-research-falcon.md
core_functions:
- description: CipA's nine type I cohesin domains bind type I dockerin domains on
    catalytic enzymes (cellulases, hemicellulases), recruiting them to the cellulosome
    complex. This is the primary mechanism by which CipA organizes the multi-enzyme
    complex.
  molecular_function:
    id: GO:1990308
    label: type-I dockerin domain binding
  directly_involved_in:
  - id: GO:0044575
    label: cellulosome assembly
  locations:
  - id: GO:0005576
    label: extracellular region
  in_complex:
    id: GO:0043263
    label: cellulosome
  supported_by:
  - reference_id: PMID:14623971
    supporting_text: The data show that the beta-sheet cohesin domain interacts predominantly
      with one of the helices of the dockerin.
  - reference_id: PMID:17360613
    supporting_text: The dual binding mode is predicted to impart significant plasticity
      into the orientation of the catalytic subunits within this supramolecular assembly
- description: CipA's CBM3a domain (residues 365-523) specifically binds crystalline
    cellulose, targeting the entire cellulosome complex to its substrate. This concentrates
    the enzymatic machinery at the site of cellulose degradation.
  molecular_function:
    id: GO:0030248
    label: cellulose binding
  locations:
  - id: GO:0005576
    label: extracellular region
  in_complex:
    id: GO:0043263
    label: cellulosome
- description: CipA's C-terminal type II dockerin (with X module) binds type II cohesins
    on cell-surface anchoring proteins (SdbA, OlpB, Orf2p), tethering the cellulosome
    to the bacterial cell surface.
  molecular_function:
    id: GO:1990312
    label: type-II cohesin domain binding
  locations:
  - id: GO:0005576
    label: extracellular region
  in_complex:
    id: GO:0043263
    label: cellulosome
  supported_by:
  - reference_id: PMID:16384918
    supporting_text: Here, we report the structure of an ultra-high-affinity (K(a)
      = 1.44 x 10(10) M(-1)) complex between type II Doc, together with its neighboring
      X module from the cellulosome scaffold of Clostridium thermocellum, and a type
      II Coh module associated with the bacterial cell surface.
- description: CipA provides structural molecule activity as the primary scaffoldin
    that assembles the cellulosome by integrating dockerin-bearing enzymes, binding
    substrate via CBM3a, and anchoring to the cell surface.
  molecular_function:
    id: GO:0005198
    label: structural molecule activity
  directly_involved_in:
  - id: GO:0044575
    label: cellulosome assembly
  locations:
  - id: GO:0005576
    label: extracellular region
  in_complex:
    id: GO:0043263
    label: cellulosome
references:
- id: GO_REF:0000002
  title: Gene Ontology annotation through association of InterPro records with GO
    terms
  findings: []
- id: GO_REF:0000043
  title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
  findings: []
- id: GO_REF:0000044
  title: Gene Ontology annotation based on UniProtKB/Swiss-Prot Subcellular Location
    vocabulary mapping
  findings: []
- id: GO_REF:0000120
  title: Combined Automated Annotation using Multiple IEA Methods
  findings: []
- id: PMID:14623971
  title: Cellulosome assembly revealed by the crystal structure of the cohesin-dockerin
    complex
  findings:
  - statement: First crystal structure of type I cohesin-dockerin complex at 2.2 A
      resolution
    supporting_text: Here we report the structure of the cohesin-dockerin complex
      from Clostridium thermocellum at 2.2-A resolution.
  - statement: Demonstrates structural basis for cellulosome assembly via cohesin-dockerin
      interaction
    supporting_text: This megadalton catalytic machine organizes an enzymatic consortium
      on a multifaceted molecular scaffold whose "cohesin" domains interact with corresponding
      "dockerin" domains of the enzymes.
  - statement: Shows dockerin internal symmetry suggesting dual binding mode
    supporting_text: Significantly, internal sequence duplication within the dockerin
      is manifested in near-perfect internal twofold symmetry, suggesting that both
      "halves" of the dockerin may interact with cohesins in a similar manner
- id: PMID:16384918
  title: Mechanism of bacterial cell-surface attachment revealed by the structure
    of cellulosomal type II cohesin-dockerin complex
  findings:
  - statement: Crystal structure of type II cohesin-dockerin complex with ultra-high
      affinity
    supporting_text: Here, we report the structure of an ultra-high-affinity (K(a)
      = 1.44 x 10(10) M(-1)) complex between type II Doc, together with its neighboring
      X module from the cellulosome scaffold of Clostridium thermocellum, and a type
      II Coh module associated with the bacterial cell surface.
  - statement: X module enhances dockerin stability and cohesin recognition
    supporting_text: Identification of X module-Doc and X module-Coh contacts reveal
      roles for the X module in Doc stability and enhanced Coh recognition.
- id: PMID:17360613
  title: Evidence for a dual binding mode of dockerin modules to cohesins
  findings:
  - statement: Demonstrates dual binding mode imparts plasticity to cellulosome structure
    supporting_text: The dual binding mode is predicted to impart significant plasticity
      into the orientation of the catalytic subunits within this supramolecular assembly
- id: cipA-deep-research-falcon.md
  title: Deep research summary of CipA function
  findings:
  - statement: Comprehensive literature synthesis confirms CipA is a non-catalytic
      scaffoldin
    supporting_text: CipA is a non-catalytic structural organizer that assembles multiple
      cellulases and hemicellulases into a multienzyme complex through high-affinity
      type I cohesin-dockerin interactions
proposed_new_terms: []
suggested_questions:
- question: Are there conditions under which CipA might directly participate in carbohydrate
    modification or processing beyond its scaffolding role?
  experts: []
suggested_experiments:
- description: Comprehensive interactome analysis of CipA cohesin domains to quantify
    the full repertoire of dockerin-bearing proteins recruited to the cellulosome
    in vivo.
  hypothesis: While the cohesin-dockerin interaction is well-characterized structurally,
    the complete set of CipA interaction partners in vivo remains incompletely defined.
tags:
- cellulosome