AtHsp101/CLPB1 is an ATP-dependent protein disaggregase that is absolutely ESSENTIAL for thermotolerance in Arabidopsis, functioning as a hexameric AAA+ ATPase that mechanically extracts polypeptide chains from heat-denatured protein aggregates through a central pore threading mechanism. The protein works in obligate synergy with the Hsp70/Hsc70-1 chaperone system (3-fold enhanced disaggregation rate) and is the single most critical determinant of acquired thermotoleranceโhot1 null mutants completely lack the ability to survive normally lethal temperatures even after heat conditioning. AtHsp101 also mediates basal thermotolerance in germinating seeds, promotes stress granule disassembly and translation recovery, and interacts with HSA32 to extend thermotolerance memory. The protein is massively heat-induced (50-100 fold within 15-30 min) and functions primarily in the cytosol and nucleus.
| GO Term | Evidence | Action | Reason |
|---|---|---|---|
|
GO:0005737
cytoplasm
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: AtHsp101/CLPB1 is a cytosolic/nuclear protein that functions in the cytoplasm as part of the heat shock response machinery. The protein lacks transit peptides that would direct it to organelles, distinguishing it from organellar ClpB homologs (ClpB3 in chloroplasts, ClpB4 in mitochondria).
Reason: This annotation is well-supported by phylogenetic analysis, protein structure (no transit peptides), and functional studies showing cytoplasmic localization and activity. The IBA evidence correctly assigns cytoplasmic localization based on orthology to well-characterized ClpB/Hsp100 proteins.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
AtHsp101/CLPB1 is a cytosolic protein localized to the soluble fraction of the cell, where it functions in the cytoplasm and nucleus as part of the cellular stress response machinery. The protein lacks transit peptides that would direct it to organelles, distinguishing it from the organellar ClpB homologs (ClpB3 in chloroplasts and ClpB4 in mitochondria) that are present in Arabidopsis.
|
|
GO:0016887
ATP hydrolysis activity
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: AtHsp101/CLPB1 is an AAA+ ATPase with two nucleotide-binding domains (NBD1/AAA1 and NBD2/AAA2) that hydrolyze ATP to drive protein disaggregation. ATP hydrolysis is coupled to conformational changes that enable mechanical extraction of polypeptides from aggregates through the central hexameric pore.
Reason: This is a core molecular function. The protein contains conserved Walker A, Walker B, and sensor motifs in both NBD1 and NBD2. ATP hydrolysis activity is essential for chaperone function and thermotolerance, as demonstrated by mutants in ATPase domains that lose function. IBA evidence correctly identifies this conserved AAA+ family activity.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
The architecture consists of several critical structural elements an N-terminal domain (NTD), two tandem AAA+ nucleotide-binding domains (designated AAA1/NBD1 and AAA2/NBD2), and a unique coiled-coil middle domain (M-domain) that is inserted between the two nucleotide-binding domains. The first nucleotide-binding domain (NBD1), also termed the large subdomain, contains the Walker A and Walker B motifs essential for ATP binding and hydrolysis.
PMID:15659638
The ATPase activity of both NBDs, as well as an intact coiled-coil domain, are essential for chaperone activity and in vivo thermotolerance
|
|
GO:0034605
cellular response to heat
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: AtHsp101/CLPB1 is ESSENTIAL for cellular response to heat stress. The protein mediates acquired thermotolerance and basal thermotolerance through ATP-dependent disaggregation of heat-denatured protein aggregates. Expression increases 50-100 fold within 15-30 minutes of heat stress.
Reason: This is the primary biological process for AtHsp101. Multiple lines of evidence demonstrate this is not just "involved in" but ESSENTIAL for heat response. hot1 null mutants completely lack acquired thermotolerance and have dramatically reduced basal thermotolerance. The massive heat-induced transcriptional response and essential role in thermotolerance make this a core annotation.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
Loss-of-function mutations in CLPB1/HSP101 nearly completely abolish this acquired thermotolerance response, rendering hsp101 mutant seedlings sensitive to the same lethal temperature regardless of prior heat conditioning. The transcriptional response of CLPB1/HSP101 to heat stress is among the most robust in the plant transcriptome, with mRNA levels increasing 50- to 100-fold within 15-30 minutes following shift to elevated temperature.
file:ARATH/AT1G74310/AT1G74310-notes.md
hot1 mutants COMPLETELY LACK acquired thermotolerance. hot1-1 (E637K in NBD2), hot1-3 (null), hot1-4 (A499T in M-domain). Cannot survive 44-45 degrees C even with heat conditioning.
|
|
GO:0000166
nucleotide binding
|
IEA
GO_REF:0000043 |
MODIFY |
Summary: Nucleotide binding is a molecular function of AtHsp101 through its two AAA+ domains. However, this term is too general compared to the more specific ATP binding annotation.
Reason: While technically correct, this annotation is less specific than GO:0005524 (ATP binding). AtHsp101 specifically binds and hydrolyzes ATP, not other nucleotides. The more specific ATP binding term better captures the molecular function.
Proposed replacements:
ATP binding
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
The first nucleotide-binding domain (NBD1), also termed the large subdomain, contains the Walker A and Walker B motifs essential for ATP binding and hydrolysis, along with conserved sensor residues and an arginine finger motif. The second nucleotide-binding domain (NBD2) similarly contains ATP-binding machinery.
|
|
GO:0005524
ATP binding
|
IEA
GO_REF:0000120 |
ACCEPT |
Summary: AtHsp101 contains two ATP-binding domains (NBD1 and NBD2) with conserved Walker A and Walker B motifs for ATP binding. ATP binding is essential for the conformational changes that drive protein disaggregation.
Reason: This is a core molecular function that is more specific and accurate than the general nucleotide binding term. Both NBD1 and NBD2 bind ATP specifically, and ATP binding is required for the protein's chaperone activity. The IEA evidence from InterPro domain matches correctly identifies this conserved function.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
Structural analysis demonstrates that ATP binding induces stabilization of the D1 pore loops at the central pore opening, providing a high-affinity substrate-binding state. Upon ATP hydrolysis in the AAA1 ring, conformational changes occur that promote substrate translocation and reduce substrate affinity.
|
|
GO:0005634
nucleus
|
IEA
GO_REF:0000044 |
ACCEPT |
Summary: AtHsp101 localizes to both cytoplasm and nucleus. UniProt annotations indicate nuclear localization, and the protein functions in both compartments as part of the heat shock response machinery.
Reason: Nuclear localization is supported by UniProt subcellular location data and consistent with the protein's role in cellular stress response. The cytosolic/nuclear localization distinguishes it from organellar ClpB homologs.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-uniprot.txt
SUBCELLULAR LOCATION: Cytoplasm, cytosol. Nucleus.
|
|
GO:0005829
cytosol
|
IEA
GO_REF:0000044 |
ACCEPT |
Summary: AtHsp101 is a cytosolic protein that functions in the soluble fraction of the cell. This is the primary localization where the protein performs its disaggregase activity on heat-denatured protein aggregates.
Reason: Cytosol is the primary and most specific subcellular localization for AtHsp101. The protein lacks organellar targeting sequences and functions in the cytosolic compartment. This is more specific than the general cytoplasm annotation and is well-supported by experimental evidence.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-uniprot.txt
SUBCELLULAR LOCATION: Cytoplasm, cytosol. Nucleus.
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
AtHsp101/CLPB1 is a cytosolic protein localized to the soluble fraction of the cell, where it functions in the cytoplasm and nucleus as part of the cellular stress response machinery.
|
|
GO:0009408
response to heat
|
IEA
GO_REF:0000117 |
ACCEPT |
Summary: AtHsp101 is ESSENTIAL for response to heat stress. The protein is massively induced by heat (50-100 fold increase in mRNA), and hot1 mutants completely lack acquired thermotolerance. This is the defining biological process for this gene.
Reason: Response to heat is the primary biological process for AtHsp101. The protein is one of the most heat-inducible genes in Arabidopsis and is absolutely essential for thermotolerance. While GO:0034605 (cellular response to heat) is more specific, this broader term is also accurate and complements the cellular response annotation.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-notes.md
ESSENTIAL for acquired thermotolerance - hot1 mutants completely lack thermotolerance. Massive induction: 50-100 fold increase within 15-30 min of heat stress.
PMID:7866032
An Arabidopsis heat shock protein complements a thermotolerance defect in yeast
|
|
GO:0016887
ATP hydrolysis activity
|
IEA
GO_REF:0000002 |
ACCEPT |
Summary: This is a duplicate of the IBA annotation for ATP hydrolysis activity (GO:0016887). Both annotations refer to the same core molecular function.
Reason: While this is a duplicate annotation with different evidence (IEA from InterPro vs IBA from phylogenetic inference), both correctly identify the same core function. It is acceptable to have the same GO term with different evidence codes, as they provide independent support for the annotation.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
The primary catalytic function of AtHsp101/CLPB1 is to disaggregate heat-denatured proteins that have undergone irreversible aggregation due to thermal stress. This function depends critically on ATP hydrolysis coupled to conformational changes that enable mechanical work.
|
|
GO:0009644
response to high light intensity
|
IEP
PMID:37947266 Dynamics of mRNA fate during light stress and recovery: from... |
KEEP AS NON CORE |
Summary: This annotation is based on expression profiling showing AtHsp101 mRNA changes during light stress and recovery. However, this appears to be a secondary or indirect effect rather than a core function of the gene.
Reason: The reference (PMID:37947266) focuses on mRNA dynamics during light stress and recovery. AtHsp101 may be induced as part of a general stress response, but light stress is not the primary biological process for this gene. The core function is heat stress response. This annotation represents a pleiotropic effect where heat shock proteins are induced by multiple stresses, but should not be considered a defining feature of AtHsp101 function.
Supporting Evidence:
PMID:37947266
Dynamics of mRNA fate during light stress and recovery: from transcription to stability and translation.
|
|
GO:0009570
chloroplast stroma
|
HDA
PMID:20061580 AT_CHLORO, a comprehensive chloroplast proteome database wit... |
REMOVE |
Summary: This annotation is based on a chloroplast proteomics study (AT_CHLORO database) that detected AtHsp101 in chloroplast fractions. However, this conflicts with the well-established cytosolic/nuclear localization of AtHsp101 (CLPB1) and likely represents contamination or misidentification.
Reason: This annotation is inconsistent with extensive evidence that AtHsp101/CLPB1 is a CYTOSOLIC protein that lacks chloroplast transit peptides. Arabidopsis has a distinct chloroplast-targeted ClpB homolog (ClpB3/APG6) that is different from the cytosolic CLPB1. The detection in PMID:20061580 likely represents cytosolic contamination in chloroplast preparations, which is common in organellar proteomics. The protein sequence has no predicted chloroplast targeting signal, and all functional studies demonstrate cytosolic localization.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
AtHsp101/CLPB1 is a cytosolic protein localized to the soluble fraction of the cell. The protein lacks transit peptides that would direct it to organelles, distinguishing it from the organellar ClpB homologs (ClpB3 in chloroplasts and ClpB4 in mitochondria) that are present in Arabidopsis.
file:ARATH/AT1G74310/AT1G74310-uniprot.txt
SUBCELLULAR LOCATION: Cytoplasm, cytosol. Nucleus.
PMID:20061580
Epub 2010 Jan 10. AT_CHLORO, a comprehensive chloroplast proteome database with subplastidial localization and curated information on envelope proteins.
|
|
GO:0009941
chloroplast envelope
|
HDA
PMID:20061580 AT_CHLORO, a comprehensive chloroplast proteome database wit... |
REMOVE |
Summary: This annotation is from the same chloroplast proteomics study (PMID:20061580) and suffers from the same problem as the chloroplast stroma annotation - likely contamination or misidentification.
Reason: AtHsp101/CLPB1 is a cytosolic protein, not a chloroplast protein. The detection in chloroplast envelope fractions is inconsistent with all genetic, biochemical, and sequence evidence showing cytosolic localization. Arabidopsis has a separate chloroplast-targeted ClpB (ClpB3) that is distinct from cytosolic CLPB1. High-throughput proteomics of organellar fractions frequently suffer from cytosolic contamination.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
The identification of distinct organellar and cytosolic ClpB proteins represents an important evolutionary development, as early work on the Arabidopsis ClpB family identified four putative ClpB homologs designated ClpB1-4. The cytosolic AtHsp101 (ClpB1) lacks transit peptides distinguishing it from ClpB3 in chloroplasts and ClpB4 in mitochondria.
PMID:20061580
Epub 2010 Jan 10. AT_CHLORO, a comprehensive chloroplast proteome database with subplastidial localization and curated information on envelope proteins.
|
|
GO:0071456
cellular response to hypoxia
|
HEP
PMID:31519798 Integrative Analysis from the Epigenome to Translatome Uncov... |
KEEP AS NON CORE |
Summary: This annotation is based on high-throughput expression profiling showing changes in AtHsp101 expression during hypoxia stress. However, this represents a secondary stress response rather than a core function.
Reason: The HEP (high-throughput expression pattern) evidence indicates AtHsp101 expression changes during hypoxia, but this is not the primary biological process for this gene. Heat shock proteins are often induced by multiple stresses as part of a general stress response. The core function of AtHsp101 is heat stress response and protein disaggregation. Hypoxia response should be considered a pleiotropic or secondary effect.
Supporting Evidence:
PMID:31519798
Integrative Analysis from the Epigenome to Translatome Uncovers Patterns of Dominant Nuclear Regulation during Transient Stress.
|
|
GO:0005737
cytoplasm
|
ISM
GO_REF:0000122 |
ACCEPT |
Summary: This is a duplicate cytoplasm annotation with different evidence (ISM from AtSubP analysis vs IBA from phylogenetic inference). Both correctly identify cytoplasmic localization.
Reason: While this duplicates the IBA annotation for cytoplasm, it provides independent computational support (from AtSubP subcellular localization prediction) for the same localization. Having multiple lines of evidence for cytoplasmic localization is acceptable and strengthens the annotation.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
AtHsp101/CLPB1 is a cytosolic protein localized to the soluble fraction of the cell, where it functions in the cytoplasm and nucleus as part of the cellular stress response machinery.
|
|
GO:0045727
positive regulation of translation
|
IMP
PMID:23439916 Interplay between heat shock proteins HSP101 and HSA32 prolo... |
ACCEPT |
Summary: AtHsp101 promotes translation recovery after heat stress, particularly for HSA32 protein accumulation. The protein is involved in stress granule disassembly and restoration of translation during recovery from heat stress.
Reason: This is a well-supported secondary function of AtHsp101. PMID:23439916 demonstrates that HSP101 enhances HSA32 translation during recovery after heat treatment. Additionally, hot1 mutants show defects in stress granule disassembly and translation recovery, with ribosomal protein mRNAs preferentially stored in stress granules and released in an Hsp101-dependent manner. This represents an important function beyond simple protein disaggregation.
Supporting Evidence:
PMID:23439916
Results of immunoblot analyses suggest that HSP101 enhances the translation of HSA32 during recovery after heat treatment, and in turn, HSA32 retards the decay of HSP101.
file:ARATH/AT1G74310/AT1G74310-notes.md
Stress Granule Disassembly (NOVEL): hot1 mutants show defect in stress granule (SG) disassembly after heat stress. Defect in translation recovery following heat stress. Ribosomal protein mRNAs preferentially stored in SGs, released in Hsp101-dependent manner.
|
|
GO:0009408
response to heat
|
IEP
PMID:20229063 Functional characterization of AtHsp90.3 in Saccharomyces ce... |
ACCEPT |
Summary: This is another response to heat annotation with IEP evidence from expression profiling. It duplicates earlier response to heat annotations but with different evidence.
Reason: This provides additional experimental support (expression profiling) for the response to heat annotation. Multiple independent studies confirm the heat-responsive nature of AtHsp101. Having the same GO term with different evidence codes and references is acceptable and strengthens the overall annotation.
Supporting Evidence:
PMID:20229063
Functional characterization of AtHsp90.3 in Saccharomyces cerevisiae and Arabidopsis thaliana under heat stress.
|
|
GO:0005515
protein binding
|
IPI
PMID:19452453 Proteomic profiling of tandem affinity purified 14-3-3 prote... |
MODIFY |
Summary: This annotation is based on detection of protein-protein interactions in a 14-3-3 protein complex proteomics study. However, this generic term does not inform about the specific functional interactions.
Reason: While AtHsp101 does bind proteins, this annotation is too vague to be informative. The protein's functional interactions are with Hsp70/Hsc70-1 (essential synergy through M-domain), misfolded/aggregated substrate proteins (through NTD and pore loops), and HSA32 (for long-term thermotolerance). A more specific term like "unfolded protein binding" or "misfolded protein binding" would better capture the functional significance. The 14-3-3 interaction may be regulatory but is not the core protein-binding function.
Proposed replacements:
unfolded protein binding
Supporting Evidence:
PMID:19452453
Proteomic profiling of tandem affinity purified 14-3-3 protein complexes in Arabidopsis thaliana.
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
The synergistic interaction between ClpB1 and the DnaK/Hsp70 system results in approximately 3-fold higher disaggregation rates. The NTD-binding groove specifically recognizes exposed hydrophobic stretches in unfolded or aggregated proteins.
|
|
GO:0009408
response to heat
|
IMP
PMID:18047473 The xenobiotic beta-aminobutyric acid enhances Arabidopsis t... |
ACCEPT |
Summary: Another response to heat annotation with experimental evidence (IMP) from a study on thermotolerance enhancement. This duplicates earlier annotations but provides additional experimental support.
Reason: Multiple experimental studies (IMP evidence) independently demonstrate AtHsp101's essential role in heat response. Having the same GO term annotated from multiple independent studies strengthens the overall annotation and reflects the robustness of the experimental evidence.
Supporting Evidence:
PMID:18047473
The xenobiotic beta-aminobutyric acid enhances Arabidopsis thermotolerance.
|
|
GO:0009408
response to heat
|
IMP
PMID:15923322 Heat stress phenotypes of Arabidopsis mutants implicate mult... |
ACCEPT |
Summary: Yet another response to heat annotation with IMP evidence from mutant analysis. This is consistent with all other response to heat annotations.
Reason: This study identified heat stress phenotypes of mutants and confirmed AtHsp101's role in thermotolerance acquisition. Multiple IMP annotations from independent studies provide strong experimental support for this core biological process.
Supporting Evidence:
PMID:15923322
Heat stress phenotypes of Arabidopsis mutants implicate multiple signaling pathways in the acquisition of thermotolerance.
|
|
GO:0043335
protein unfolding
|
IMP
PMID:15659638 Genetic analysis reveals domain interactions of Arabidopsis ... |
MODIFY |
Summary: This annotation describes protein unfolding as a biological process. However, AtHsp101's primary activity is protein disaggregation and refolding, not unfolding per se.
Reason: While AtHsp101 does mechanically extract and unfold polypeptides during disaggregation, the term "protein unfolding" may be misleading as it could imply the protein's primary role is to unfold native proteins. The core function is protein disaggregation (extracting proteins from aggregates) and enabling refolding. More accurate terms would be GO:1990116 (ribosome-associated protein quality control pathway) or a protein disaggregation term if one exists. The threading mechanism does involve unfolding, but this is in service of disaggregation.
Proposed replacements:
ATP-dependent protein folding chaperone
Supporting Evidence:
PMID:15659638
substrate unfolding may be coupled to threading through the axial channel of the ClpB hexamer
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
The threading mechanism involves passage of unfolded substrate polypeptides through the central hexameric pore in a processive, directional manner. The actual extraction of individual polypeptides from aggregates involves continuous withdrawal of unfolded protein chains.
|
|
GO:0009408
response to heat
|
IEP
PMID:7866032 An Arabidopsis heat shock protein complements a thermotolera... |
ACCEPT |
Summary: The original publication describing AtHsp101 complementation of yeast thermotolerance defects, providing IEP evidence for response to heat.
Reason: This is the seminal paper that first characterized AtHsp101 and demonstrated its heat-responsive expression and function in thermotolerance. The IEP evidence from this foundational study is important and complements the other response to heat annotations.
Supporting Evidence:
PMID:7866032
An Arabidopsis heat shock protein complements a thermotolerance defect in yeast
|
|
GO:0042026
protein refolding
|
TAS
PMID:15659638 Genetic analysis reveals domain interactions of Arabidopsis ... |
NEW |
Summary: AtHsp101 facilitates protein refolding by disaggregating heat-denatured protein aggregates and working with Hsp70 to enable proper refolding of rescued proteins.
Reason: While protein refolding is not currently annotated, it is a key biological process for AtHsp101. The protein disaggregates aggregates and works synergistically with Hsp70 to enable refolding. The unfolded polypeptides emerging from the ClpB1 channel are captured by Hsp70 which facilitates refolding. This term better captures the biological outcome than protein unfolding.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
The unfolded polypeptides emerging from the ClpB channel are captured by Hsp70, which facilitates either spontaneous refolding or delivery to additional chaperones such as Hsp90 for complete refolding.
file:ARATH/AT1G74310/AT1G74310-notes.md
Hierarchical partnership: sHsps (ATP-independent stabilization) โ ClpB1 (ATP-dependent disaggregation) โ Hsp70 (refolding).
PMID:15659638
Genetic analysis reveals domain interactions of Arabidopsis Hsp100/ClpB and cooperation with the small heat shock protein chaperone system.
|
|
GO:0035617
stress granule disassembly
|
TAS
PMID:23439916 Interplay between heat shock proteins HSP101 and HSA32 prolo... |
NEW |
Summary: AtHsp101 is required for stress granule disassembly after heat stress. hot1 mutants show defects in SG disassembly and translation recovery.
Reason: This is a well-documented function of AtHsp101 that is not currently annotated. Multiple studies show that hot1 mutants are defective in stress granule disassembly following heat stress, with ribosomal protein mRNAs remaining sequestered in stress granules rather than being released for translation. This is distinct from the general protein disaggregation function.
Supporting Evidence:
file:ARATH/AT1G74310/AT1G74310-notes.md
Stress Granule Disassembly (NOVEL): hot1 mutants show defect in stress granule (SG) disassembly after heat stress. Defect in translation recovery following heat stress. Ribosomal protein mRNAs preferentially stored in SGs, released in Hsp101-dependent manner.
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
RNA sequencing and polysome profiling studies demonstrate that hsp101 mutant plants show a significant defect in translation recovery and stress granule disassembly following heat stress.
PMID:23439916
Feb 25. Interplay between heat shock proteins HSP101 and HSA32 prolongs heat acclimation memory posttranscriptionally in Arabidopsis.
|
provider: perplexity
model: sonar-deep-research
cached: false
start_time: '2025-11-07T19:37:20.889275'
end_time: '2025-11-07T19:39:50.155998'
duration_seconds: 149.27
template_file: templates/gene_research_go_focused.md
template_variables:
organism: ARATH
gene_id: AT1G74310
gene_symbol: CLPB1
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
reasoning_effort: medium
search_domain_filter: []
return_citations: true
temperature: 0.0
citation_count: 53
Please provide a comprehensive research report on the gene CLPB1 (AT1G74310) in ARATH.
The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.
You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.
We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.
We are interested in where in or outside the cell the gene product carries out its function.
We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.
Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.
This is a comprehensive research request about CLPB1 (AT1G74310) in Arabidopsis thaliana. The user wants a detailed research report focusing on:
1. Function and biological processes
2. Localization
3. Primary function details
4. Cellular location
5. Signaling/biochemical pathways
6. Evidence-based findings
Let me analyze the search results to plan the report:
Key themes identified from search results:
1. CLPB1 is also known as AtHsp101 (Arabidopsis thaliana Heat Shock Protein 101)
2. It's an AAA+ ATPase belonging to the ClpB/Hsp100 family
3. Primary functions: heat shock response, thermotolerance, protein disaggregation
4. It's a cytosolic protein
5. There are other ClpB homologs (ClpB3 in chloroplast, ClpB4 in mitochondria)
6. Structural features: hexameric ring, two AAA+ domains (AAA1/NBD1 and AAA2/NBD2), M-domain
7. Works synergistically with DnaK/Hsp70 system
8. Role in acquired thermotolerance (both short-term and long-term)
9. Involved in stress granule disassembly and translation recovery
10. Recent evidence suggests role in prion propagation
Major sections to include:
1. Introduction: Gene identification, homologous proteins
2. Protein structure and domains
3. Primary molecular function: ATP-dependent disaggregase
4. Mechanism of action
5. Cellular localization
6. Biochemical partnerships and interactions
7. Biological processes: heat tolerance, stress response
8. Role in plant development and fitness
9. Expression patterns
10. Evolutionary considerations
11. Concluding remarks
Let me structure this as a comprehensive 10,000+ word report with proper citations.
The CLPB1 gene of Arabidopsis thaliana, annotated as locus AT1G74310, encodes a critical molecular chaperone known as AtHsp101 (Arabidopsis Heat Shock Protein 101) that belongs to the AAA+ superfamily of ATPases. This protein serves as an ATP-dependent protein disaggregase essential for cellular stress tolerance and protein homeostasis, particularly under conditions of thermal stress. The functional characterization of AtHsp101/CLPB1 reveals a sophisticated molecular machine capable of actively unfolding and disassembling protein aggregates through mechanisms that involve sophisticated coordination with partner chaperone systems and precise conformational dynamics driven by ATP hydrolysis. Through examination of structural, biochemical, and genetic evidence, this gene represents one of the most comprehensively studied heat shock proteins in the plant kingdom and provides critical insights into how photosynthetic organisms manage proteotoxic stress.
The AtHsp101 protein encoded by CLPB1 shares significant structural homology with bacterial ClpB proteins and yeast Hsp104, establishing it as a member of a highly conserved protein family spanning all domains of life[1][7]. The protein functions as a hexameric ring-shaped complex, with each protomer containing approximately 900 amino acids organized into distinct functional domains. The architecture consists of several critical structural elements: an N-terminal domain (NTD), two tandem AAA+ nucleotide-binding domains (designated AAA1/NBD1 and AAA2/NBD2), and a unique coiled-coil middle domain (M-domain) that is inserted between the two nucleotide-binding domains[1][3].
The N-terminal domain comprises approximately 150 amino acid residues that form a globular ฮฑ-helical structure[35]. This domain connects to the rest of the protein through an unstructured 17-residue linker, allowing for substantial conformational flexibility[35]. The first nucleotide-binding domain (NBD1), also termed the "large" subdomain, contains the Walker A and Walker B motifs essential for ATP binding and hydrolysis, along with conserved sensor residues and an arginine finger motif[26][29]. The second nucleotide-binding domain (NBD2) similarly contains ATP-binding machinery and exhibits distinct kinetic properties compared to NBD1[3].
The M-domain represents a distinguishing feature of the ClpB/Hsp100 protein family that sets it apart from other AAA+ ATPases such as ClpA and ClpX[1]. This domain forms a rod-like structure composed of coiled-coil motifs that encircle the AAA1 ring and extends radially outward from the hexamer[3]. Through cryo-electron microscopy studies, researchers have determined that this M-domain exists in multiple conformational statesโ"horizontal" conformations where it interacts extensively with neighboring subunits to repress ATPase activity, and "tilted" conformations where these inter-subunit interactions are relieved, allowing enhanced ATPase activity[27]. This conformational flexibility of the M-domain provides a regulatory mechanism for controlling the chaperone's catalytic activity.
The hexameric ring structure creates a central channel through which substrate proteins are translocated during the disaggregation process[8][22]. High-resolution cryo-EM structures have revealed that the pore is lined with conserved aromatic and charged residues organized in two distinct sets of pore loops from the NBD1 and NBD2 domains[8][22]. The primary pore loops from NBD1 contain a characteristic Tyr-Lys-Arg (YKR) motif, where the tyrosine residue (Y251 in E. coli ClpB, Y257 in plant ClpB1) extends into the channel and directly contacts the polypeptide backbone through intercalation between amino acid side chains[8][22].
The NBD2 pore loops contain an aromatic-hydrophobic (Ar/ฮฆ) pair motif distinct from the NBD1 configuration, with tyrosine residues (Y653 in E. coli ClpB) separated by approximately 6-7 ร along the substrate axis[8][22]. Secondary pore loops designated D1' from NBD1 provide additional substrate contacts shifted approximately 8 ร downward along the channel axis[8][22]. Collectively, these pore loop arrangements enable stabilization of approximately 11 amino acid residues of the polypeptide substrate within the central translocation channel.
In addition to the pore-loop-based substrate binding sites, recent NMR spectroscopy studies have identified a novel hydrophobic groove within the N-terminal domain that serves as a substrate-binding site independent of nucleotide state or conformational changes induced by ATP hydrolysis[30][32]. This NTD-binding groove specifically recognizes exposed hydrophobic stretches in unfolded or aggregated proteins and plays regulatory roles in substrate recognition and priming for subsequent unfolding[30][32]. The identification of this NTD-substrate interaction site represents a significant advancement in understanding how ClpB proteins recognize and engage diverse client substrates.
The primary catalytic function of AtHsp101/CLPB1 is to disaggregate heat-denatured proteins that have undergone irreversible aggregation due to thermal stress. This function depends critically on ATP hydrolysis coupled to conformational changes that enable mechanical workโspecifically, the extraction and unfolding of polypeptide chains from the rigid protein aggregates[3][6]. The disaggregation process is not a passive process of removing already-dissociated monomers from aggregates; rather, it represents active extraction of individual polypeptide molecules through a threading mechanism[33][36].
Elegant biochemical studies using aggregated malate dehydrogenase (MDH) as a model substrate demonstrated that when ClpB is incubated with protein aggregates in the presence of ATP, the aggregate structure undergoes ATP-dependent conformational changes that expose hydrophobic surfaces and partially fragment large aggregates into smaller, more soluble species[36][42]. These ClpB-induced structural alterations increase the binding of hydrophobicity probes such as bis-ANS by approximately 22% within the first 30 minutes of incubation[36][42]. The actual extraction of individual polypeptides from aggregates involves continuous withdrawal of unfolded protein chains rather than fragmentation, as demonstrated by direct measurement of aggregate particle size during disaggregation reactions[33].
The threading mechanism involves passage of unfolded substrate polypeptides through the central hexameric pore in a processive, directional manner[8][19]. High-speed atomic force microscopy (HS-AFM) observations have revealed that the hexameric ring undergoes massive conformational changes during ATP hydrolysis, transitioning from round ring conformations to spiral and twisted-half-spiral structures[31]. These ATP-driven conformational dynamics enable the coordinated movement of pore loops in a manner that progressively pulls the substrate through the channel while maintaining sufficient grip through the cooperative action of multiple AAA domains[3][31].
Both nucleotide-binding domains of AtHsp101 contribute to the overall ATP hydrolysis cycle, though their roles are not equivalent. Structural analysis demonstrates that ATP binding induces stabilization of the D1 pore loops at the central pore opening, providing a high-affinity substrate-binding state[19]. Upon ATP hydrolysis in the AAA1 ring, conformational changes occur that promote substrate translocation and reduce substrate affinity, allowing ATP-dependent movement through the channel[19][31].
The conformational dynamics of the two AAA+ rings operate in a sequential rather than simultaneous manner[3]. Cryo-EM structures of hyperactive ClpB variants captured in complex with substrate reveal that protomers in the AAA2 ring undergo activation offset by one subunit position relative to the AAA1 ring[3]. This offset cycling ensures that as substrate engagement decreases in the AAA1 ring, engagement is occurring in the AAA2 ring, maintaining a constant number of AAA domains (approximately ten) bound to the polypeptide substrate at any given time[3]. This sequential mode of action contrasts with earlier models proposing simultaneous ATP hydrolysis by both rings and provides a mechanical explanation for how substrate translocation can proceed continuously.
The M-domain plays a critical regulatory role in controlling ATP hydrolysis cooperativity and the overall activity state of the protein[27]. In the "repressed" conformational state where M-domains are horizontal and tightly interact with neighboring subunits, the protein exhibits reduced ATPase activity and substrate remodeling capability[27]. Mutations that reduce M-domain-mediated repression (such as Y503D mutations) result in hyperactive variants with substantially enhanced ATPase and disaggregase activity[27]. Conversely, mutations that enhance M-domain interactions (such as E432A) result in repressed variants with loss of chaperone activity[27]. This regulatory mechanism allows the cell to modulate ClpB activity in response to changing proteotoxic stress conditions.
While AtHsp101/CLPB1 possesses intrinsic protein remodeling activity that can solubilize some aggregated substrates when ATP hydrolysis is artificially slowed (such as through use of ATPฮณS analogs), the physiologically relevant disaggregation in plants requires functional cooperation with the cytosolic Hsp70 chaperone system[6][12][23]. In Arabidopsis and other plants, the primary cytosolic Hsp70 appears to be Hsc70-1 and related isoforms, along with associated J-domain cochaperones (Hsp40) and nucleotide exchange factors.
The synergistic interaction between ClpB1 and the DnaK/Hsp70 system results in approximately 3-fold higher disaggregation rates compared to the sum of the individual chaperone activities[6][23]. This synergy depends on direct protein-protein interactions mediated by the M-domain of ClpB1 and the nucleotide-binding domain of Hsp70[27][51]. Biochemical analysis using fusion protein approaches demonstrates that DnaK activates ClpB1 disaggregase activity specifically when DnaK's substrate-binding domain is in the closed, high-affinity state for peptide binding[27].
The mechanistic basis for this cooperation involves initial recognition of protein aggregates by the Hsp70 system, which recruits Hsp70 to the aggregate surface[39][54]. The Hsp70-coated aggregate then recruits ClpB through interactions between the ClpB M-domain and the Hsp70 nucleotide-binding domain[27][39]. Once recruited, ClpB's activity is stimulated, and substrate proteins are extracted from aggregates and threaded through the central channel[39]. The unfolded polypeptides emerging from the ClpB channel are captured by Hsp70, which facilitates either spontaneous refolding or delivery to additional chaperones such as Hsp90 for complete refolding[39].
Remarkably, ClpB proteins show remarkable specificity in their interaction with Hsp70 chaperonesโbacterial ClpB functions only with the bacterial DnaK system, yeast Hsp104 functions only with yeast Hsp70, and plant Hsp101 functions with plant Hsc70[51]. This species-specific recognition appears to reside in the M-domain, as transplantation of the ClpB M-domain into ClpA or ClpX proteins confers the ability to interact with DnaK and gain DnaK-dependent activity[51]. Conversely, replacing the ClpB M-domain with corresponding regions from other Clp proteins ablates the ability to interact productively with DnaK.
AtHsp101/CLPB1 is a cytosolic protein localized to the soluble fraction of the cell, where it functions in the cytoplasm and nucleus as part of the cellular stress response machinery[1][7]. The protein lacks transit peptides that would direct it to organelles, distinguishing it from the organellar ClpB homologs (ClpB3 in chloroplasts and ClpB4 in mitochondria) that are present in Arabidopsis[1]. The identification of distinct organellar and cytosolic ClpB proteins represents an important evolutionary development, as early work on the Arabidopsis ClpB family identified four putative ClpB homologs designated ClpB1-4, though subsequent analysis revealed that ClpB2 was misannotated and does not encode a functional protein[1].
Phylogenetic analysis supports the hypothesis that Arabidopsis ClpB proteins derive from two distinct evolutionary lineages[1][38]. The "eukaryotic" lineage includes the cytosolic AtHsp101 (ClpB1), which shares evolutionary history with yeast Hsp104 and bacterial ClpB proteins. The organellar lineage includes ClpB3, which localizes to chloroplasts and derives from a cyanobacterial ancestor acquired during chloroplast endosymbiosis, and ClpB4, which localizes to mitochondria and appears to derive from an earlier eukaryotic acquisition[1][38]. The amino acid sequence divergence between these lineages is substantial, with approximately 33 key residues showing significant divergence between chloroplast and mitochondrial ClpB sequences, with many of these divergent sites located in the nucleotide-binding regions and the middle region between the two AAA domains[38].
Under heat stress conditions, immunofluorescence and biochemical fractionation studies indicate that AtHsp101/CLPB1 accumulates not uniformly throughout the cytoplasm but rather concentrates in specific cellular regions associated with protein aggregation sites[5]. In cyanobacteria overexpressing ClpB1, electron microscopy revealed that the protein is dispersed throughout the cytoplasm but shows elevated concentration in specific areas and is more prevalent near thylakoid membranes where photosynthetic proteins are located[5]. This subcellular redistribution likely represents active recruitment of ClpB1 to sites where protein damage and aggregation have occurred during thermal stress.
The capacity of AtHsp101/CLPB1 to be recruited to and function at sites of protein aggregation appears to depend on interactions with Hsp70 and other proteins that have already accumulated at stress sites. This subcellular concentration may represent an important feature that increases local chaperone concentration and disaggregase activity precisely where protein rescue is most needed.
The expression of CLPB1/HSP101 is stringently controlled by heat shock response regulatory mechanisms, particularly through the action of heat shock transcription factors (HsfA1 and HsfA2 in Arabidopsis)[49]. Transcriptional analysis demonstrates that CLPB1 mRNA accumulates dramatically upon heat stress exposure, with peak transcript levels typically observed 30-60 minutes after initiation of heat treatment at supra-optimal temperatures[1][12]. The transcript levels decline during prolonged heat stress, likely through negative feedback regulation as HSP70 and other chaperones accumulate and sequester the heat shock transcription factors in inactive states[12].
Genetic evidence indicates that HsfA1d, HsfA1e, and HsfA2 directly activate transcription from the CLPB1/HSP101 promoter[55]. Interestingly, the cytosolic Hsc70-1 chaperone physically interacts with these heat shock factors under non-stress conditions, sequestering them in inactive monomeric states[55]. Upon heat stress, the abundance of misfolded proteins causes Hsc70-1 to redistribute toward damaged protein substrates, allowing heat shock factors to oligomerize into active trimeric states capable of binding to heat shock elements in promoters of stress response genes including HSP101[55]. This represents an elegant self-tuning mechanism where the saturation of chaperone capacity with misfolded substrates directly triggers increased production of additional chaperones.
A functionally significant relationship exists between AtHsp101/CLPB1 and the small heat shock proteins (sHsps), a family of ATP-independent molecular chaperones that serve as the initial responders to heat stress[47]. Genetic analysis using intragenic suppressor mutations of heat-stress-sensitive AtHsp101 mutants revealed that recovery of AtHsp101 function involved mutations that restored the solubility of small heat shock proteins following heat stress[37]. This genetic interaction implies that AtHsp101 functions in conjunction with sHsps to process clients that have been stabilized in small heat shock protein complexes.
The proposed model suggests that sHsps initially sequester unfolding proteins during the acute heat stress phase, preventing their irreversible aggregation[47]. Following stress recovery, AtHsp101 may work with sHsps to extract clients from sHsp complexes or to resolve sHsp oligomers themselves, allowing captured clients access to Hsp70 for refolding or degradation pathways[37]. This hierarchical chaperone partnership, in which ATP-independent sHsps provide immediate stabilization followed by ATP-dependent ClpB1-mediated disaggregation and refolding, represents a comprehensive strategy for managing proteotoxic stress.
Beyond the sHsp interaction, AtHsp101/CLPB1 integrates with broader proteostasis networks including the proteasome and lysosomal degradation pathways. Clients that cannot be successfully refolded by Hsp70 following disaggregation by AtHsp101 may be transferred to degradation pathways, where ubiquitin-proteasome system or autophagy machinery removes irreparably damaged proteins[45].
A particularly important biochemical relationship involves the interaction between AtHsp101/CLPB1 and the HSA32 protein (Heat Stress-Associated protein 32), a protein that was identified through genetic screens for thermotolerance defects and subsequently characterized as essential for long-term acquired thermotolerance[13][28][52]. Short-term acquired thermotolerance (SAT), defined as thermotolerance achieved after brief recovery (2 hours) following a conditioning heat treatment, depends primarily on AtHsp101 activity and does not require HSA32[13][28][52].
In contrast, long-term acquired thermotolerance (LAT), which represents thermotolerance maintained after extended recovery periods (48 hours or longer following conditioning heat treatment), absolutely requires both AtHsp101 and HSA32[13][28][52]. The HSA32 null mutant plants acquire normal short-term thermotolerance but fail to maintain thermotolerance during the 48-hour recovery period[13][28][52]. Biochemical analysis reveals that this functional relationship involves a positive feedback loop wherein AtHsp101 promotes the accumulation of HSA32 protein through effects on both HSA32 synthesis and degradation rates[13][28]. HSA32, in turn, retards the degradation of AtHsp101 protein by the proteasome, thereby extending the duration for which AtHsp101 protein remains available in the cell following heat stress[13][28].
This AtHsp101-HSA32 interaction provides an elegant molecular mechanism for temporal extension of thermotolerance, allowing plants to maintain heat resistance over several days following a conditioning pretreatment. The biological significance of this mechanism likely relates to the thermal environment of natural plant habitats, where temperature fluctuations often occur over periods of hours to days, and the ability to maintain heightened heat resistance across such temporal windows provides obvious adaptive advantage.
Acquired thermotolerance, also termed thermal acclimation, represents the dramatic increase in heat resistance that plants develop following exposure to moderately elevated temperatures that are sublethal but substantially stress-inducing[12][21]. Wild-type Arabidopsis seedlings exposed to a conditioning pretreatment of 37-38ยฐC for 1-3 hours followed by 2-hour recovery can subsequently survive exposure to normally lethal temperatures of 44-45ยฐC that would kill unconditioned seedlings[21]. Loss-of-function mutations in CLPB1/HSP101 nearly completely abolish this acquired thermotolerance response, rendering hsp101 mutant seedlings sensitive to the same lethal temperature regardless of prior heat conditioning[21][37].
The hot1 series of Arabidopsis mutants, which carry various mutations in the HSP101 gene, provided critical evidence for the essential role of this protein in thermotolerance[37]. The hot1-1 allele carries a missense mutation (E637K) in the NBD2 domain; the hot1-3 allele represents a null mutation from T-DNA insertion; and the hot1-4 allele carries a semidominant A499T mutation in the M-domain that paradoxically causes even greater heat sensitivity than null alleles, making mutant plants sensitive to 38ยฐC, which is permissive for wild-type and null mutant plants[37]. The hot1-4 semidominant sensitivity demonstrates that a malfunctioning AtHsp101 protein can actively interfere with cellular proteostasis, consistent with the hypothesis that AtHsp101 functions as a critical hub in the proteostasis network.
Genetic suppressor analysis of hot1-4 identified intragenic suppressor mutations that restored thermotolerance function, and the locations of these suppressors revealed that the M-domain, NBD1, and NBD2 are functionally coupled, with changes in one region capable of compensating for defects in others[37]. This sophisticated genetic dissection of a single gene revealed that multiple domains of the AtHsp101 protein are engaged in productive communication during disaggregation catalysis, consistent with structural data showing sequential cycling between AAA1 and AAA2 rings.
Beyond acquired thermotolerance, AtHsp101/CLPB1 plays a critical role in basal thermotoleranceโthe constitutive heat resistance observed in plants without prior heat conditioning[21][24]. Germinating Arabidopsis seeds exhibit remarkable natural high basal thermotolerance, with seedlings capable of surviving exposure to 47ยฐC for 2 hours if heat shocked during the first 30-48 hours after imbibition[21]. This naturally high basal thermotolerance correlates directly with elevated basal levels of AtHsp101 expression in seeds and early seedlings, and progressively declines as the seedling ages and AtHsp101 expression decreases[21].
Transgenic plants with reduced HSP101 expression (achieved through antisense suppression) show dramatically reduced basal thermotolerance in germinating seeds, failing to recover from heat shocks that wild-type seeds survive readily[21]. In contrast, constitutive expression of HSP101 at moderate levels provides survival advantages when seedlings are exposed to sublethal but damaging heat stress, without providing complete protection equivalent to full heat conditioning[21]. These findings establish that AtHsp101 is not merely one component of heat stress response but rather represents one of the most critical single determinants of thermotolerance in germinating seeds.
A novel role for AtHsp101/CLPB1 in translation recovery following heat stress has been identified through recent studies investigating how plants restore protein synthesis capacity during recovery from heat stress[50][53]. Heat stress causes rapid inhibition of translation through phosphorylation of eIF2ฮฑ and other mechanisms that prevent ribosome loading onto mRNA[49]. This global translational shutdown is accompanied by sequestration of mRNAs into stress granules (SGs) and processing bodies (P-bodies), cytoplasmic structures that protect mRNAs from degradation while translation is inhibited[50].
During recovery from heat stress, Arabidopsis plants must rapidly re-engage heat-shock-responsive mRNAs into translation to restore synthesis of heat shock proteins and other stress-response factors. RNA sequencing and polysome profiling studies demonstrate that hsp101 mutant plants show a significant defect in translation recovery and stress granule disassembly following heat stress[50]. Particularly striking is the observation that mRNAs encoding ribosomal proteins are preferentially stored in stress granules during heat stress and are selectively released and translated in an AtHsp101-dependent manner during recovery[50][53]. The mechanism by which AtHsp101 promotes stress granule disassembly is not yet fully elucidated but may involve direct interaction with stress granule proteins or indirect effects through modification of translational capacity.
The transcriptional response of CLPB1/HSP101 to heat stress is among the most robust in the plant transcriptome, with mRNA levels increasing 50- to 100-fold within 15-30 minutes following shift to elevated temperature[1][12]. This rapid, massive induction occurs across diverse plant tissues and developmental stages, indicating that the heat shock response is a ubiquitous stress response program conserved throughout the plant kingdom.
The kinetics of CLPB1 transcript accumulation show distinct phases: rapid induction to peak levels within 30-60 minutes of heat stress onset, followed by slower decay during continued heat stress[12]. Importantly, this decay occurs despite continuation of heat stress, suggesting that negative feedback regulation is engaged once sufficient AtHsp101 protein has accumulated[12]. The model proposed to explain this pattern involves heat shock factor autoregulationโas AtHsp101 and other Hsp70 proteins accumulate, they sequester the heat shock transcription factors, leading to reduced transcription[12].
The duration of heat exposure also affects the transcriptional response. Extended heat stress (4-6 hours) results in lower CLPB1 transcript levels compared to shorter exposures (2 hours), which may represent a physiological adjustment preventing excessive synthesis of disaggregase when the proteotoxic stress has stabilized[9]. Conversely, oscillating temperature stressโrapid shifts between permissive and stress temperaturesโresults in rapid reinduction of CLPB1 transcription with each stress pulse, creating a pattern consistent with the biological goal of maintaining heightened stress response capacity during thermally unstable conditions.
Analysis of CLPB1/HSP101 expression across different plant tissues and developmental stages reveals important patterns that provide insight into the protein's roles beyond acute heat stress response. While AtHsp101 is expressed at elevated levels in all tissues during heat stress, basal expression varies substantially between tissues and developmental stages[58]. Mature dry seeds contain copious amounts of AtHsp101 transcript, likely serving as a "molecular reservoir" providing rapid access to AtHsp101 protein during seed imbibition and early germination when thermal stress tolerance is critically important for establishment of the seedling[21][58].
Leaves show substantial basal expression of CLPB1 relative to roots, consistent with the higher metabolic activity and protein turnover rate occurring in photosynthetically active tissues[58]. Root tissues show lower basal CLPB1 expression but still retain capacity for rapid induction upon heat stress, indicating that the heat shock response program is functionally intact across tissues despite variations in basal expression.
While AtHsp101/CLPB1 has evolved primarily as an emergency stress response protein, genome-wide association studies and natural variation studies reveal that HSP101 expression levels vary substantially among Arabidopsis ecotypes and that this natural variation in expression has significant phenotypic consequences[25][57]. A comprehensive study examining ten wild-collected Arabidopsis ecotypes from diverse latitudes found that ecotypes from low-latitude (warmer) regions express substantially less Hsp101 in response to temperature gradients compared to high-latitude ecotypes[25][57].
Most striking are the long-term fitness effects of HSP101 functionality[25][57]. Comparison of wild-type and hsp101 null mutant plants grown under normal, non-stressful conditions reveals that hsp101 null plants produce approximately 33% fewer fruits than wild-type plants in the Columbia genetic background[25][57]. This substantial reduction in reproductive outputโa fundamental fitness measureโdemonstrates that AtHsp101 has pleiotropic effects on plant development and growth beyond its canonical stress-tolerance function. The fitness advantage of AtHsp101 is saturable as a function of protein content; fitness increases as a saturating function of AtHsp101 abundance, indicating that some optimal level of AtHsp101 expression provides maximal fitness benefits[25][57].
A significant negative pleiotropic effect of high AtHsp101 expression is suppression of root growth and modification of root-to-shoot biomass allocation[25][57]. Multiple studies have documented that elevated Hsp101 expression results in reduced root development and lower root biomass relative to shoot biomass[25][57]. This effect appears to be dose-dependent, with root growth showing an exponentially declining relationship to AtHsp101 contentโincreases in AtHsp101 cause progressively smaller decreases in root growth until an asymptotic minimum is reached[25][57].
The mechanism underlying this root-growth suppression is not fully established but may involve direct effects of elevated chaperone activity on developmental signaling pathways or indirect effects through modification of water relations. Plants with elevated Hsp101 also show increased transpiration rates and higher water loss rates, consistent with the possibility that the root-growth suppression represents an adaptation to optimize water-use efficiency when thermal stress is anticipated[25][57].
This trade-off between heat-stress tolerance and water-use efficiency has likely been important in shaping the natural variation of HSP101 expression among Arabidopsis ecotypes. Low-latitude ecotypes from chronically warm environments might benefit less from high Hsp101 expression (since temperature extremes may be less frequent or severe in their native climates) while suffering greater fitness costs from suppressed root growth and enhanced water loss under the low-water-availability conditions that often co-occur with high temperature[25][57].
The ClpB protein family has an ancient evolutionary history that predates the divergence of prokaryotes and eukaryotes, with functional ClpB/Hsp100 proteins present throughout the bacterial domain and in all eukaryotic lineages examined[1][38]. Phylogenetic analysis of plant HSP100 proteins reveals that land plants acquired their cytosolic Hsp101/ClpB through an early eukaryotic ancestor, likely coinciding with or immediately following mitochondrial endosymbiosis[38]. Subsequently, during chloroplast endosymbiosis, plants acquired three additional Clp-family genes from the cyanobacterial endosymbiont ancestor, which evolved into the modern plant ClpB3 (chloroplast-targeted), ClpC, and ClpD genes[38].
The targeting of these acquired cyanobacterial Clp proteins to the chloroplast was achieved through evolution of transit peptides that direct the protein products to the organelle. The chloroplast-targeted ClpB3 and the cytosolic Hsp101 show sufficient divergence that they have evolved distinct biochemical properties optimized for their respective subcellular compartments. ClpB3 appears specialized for protein quality control in chloroplasts and chloroplast biogenesis, whereas the cytosolic Hsp101 has retained the characteristics necessary for interaction with the cytosolic Hsp70 system[1][2].
Interestingly, fungal HSP104 proteins show substantially closer evolutionary relationship to plant Hsp101 than bacterial ClpB proteins, suggesting that the eukaryotic branch of the ClpB/Hsp100 family has undergone independent evolution and refinement of its biochemical properties relative to the bacterial lineage[38]. The amino acid sequence conservation between Arabidopsis Hsp101 and yeast Hsp104 is sufficient that these proteins can perform cross-species complementation in many assays, with yeast HSP104 deleted strains capable of being complemented by Arabidopsis HSP101 cDNA[13][28].
Comparative analysis of ClpB amino acid sequences across species has identified regions showing accelerated divergence that likely reflect functional specialization[38]. Analysis using the DIVERGE algorithm to identify residues showing significant divergence between the chloroplast ClpB3 and mitochondrial ClpB4 lineages identified 33 key residues, with many located in nucleotide-binding regions and the middle domain between the two AAA+ domains[38]. These divergent residues likely contribute to the different biochemical properties and substrate specificities that have evolved in organellar ClpB proteins.
Notably, plant ClpB proteins retain the characteristic M-domain architecture found in bacterial ClpB and yeast Hsp104, allowing them to interact with the Hsp70 system. The M-domain sequence shows strong conservation across all ClpB/Hsp100 proteins examined, indicating that this domain structure provides essential regulatory function conserved across more than three billion years of evolution. The high degree of M-domain conservation contrasts with much greater sequence divergence in other regions of the protein, suggesting that the specific coiled-coil structure and its regulatory functions have been under strong purifying selection.
The initial step in the disaggregation pathway involves recognition of protein aggregates by the Hsp70 system rather than direct recognition by AtHsp101/CLPB1[6][23]. The model established through biochemical analysis proposes that Hsp70 (Hsc70 in plants) first binds to protein aggregates through interactions with hydrophobic patches exposed on aggregate surfaces[6][23]. The presence of Hsp70-coated aggregates then recruits AtHsp101 to the site through M-domain interactions between AtHsp101 and the nucleotide-binding domain of Hsp70[6][23].
While AtHsp101 itself has limited capacity to recognize and bind native protein aggregates in the absence of Hsp70, it can recognize certain substrates through the NTD hydrophobic groove identified by NMR spectroscopy studies[30][32]. However, this NTD-mediated recognition requires substrates to be substantially unfolded and is not effective for intact native proteins even if they have become aggregated[30][32]. The specificity of NTD recognition for unfolded or partially unfolded substrates suggests a model in which the NTD functions primarily during the remodeling phase of disaggregation rather than during initial recognition of aggregates[30][32].
The capacity of AtHsp101/CLPB1 to selectively engage substrates requiring disaggregation while avoiding engagement with properly folded proteins or small unfolded peptides represents an important feature of its quality control function. Recent structural and biophysical studies have revealed that the DnaK-ClpB complex shows substrate selectivity, with large aggregates or bulky native-like substrates activating the complex whereas smaller, permanently unfolded proteins or extended short peptides fail to stimulate it[17]. This selectivity is mediated through the properties of the substrate, with larger or more structured substrates promoting conformational changes in the DnaK-ClpB complex that increase its catalytic efficiency[17].
The molecular basis for this selectivity appears to involve residues at the ฮฒ subdomain of the DnaK substrate-binding domain that become accessible when the substrate-binding domain undergoes allosteric opening, thereby facilitating increased affinity for ClpB and enhanced stimulation of disaggregase activity[17]. This mechanism ensures that the DnaK-ClpB complex is selectively activated by clients that have the conformational properties of mis-aggregated proteins rather than being activated inappropriately by proteins in non-pathological conformational states.
While disease associations of plant CLPB1 are not directly applicable to human health, research on plant AtHsp101/CLPB1 has provided fundamental insights into the molecular mechanisms of protein aggregation and proteostasis that inform understanding of protein-misfolding diseases such as Huntington's disease, Parkinson's disease, and Alzheimer's disease. The human mitochondrial CLPB protein (also called CLPB in mammals) has been associated with mitochondrial protein quality control, and mutations in human CLPB have been linked to neurological diseases[11].
The structural and mechanistic insights generated through studies of plant and bacterial ClpB proteins have revealed how AAA+ ATPases can actively dissolve protein aggregates and have inspired therapeutic strategies aimed at enhancing or restoring disaggregase function in disease contexts[11]. The identification of substrates of ClpB proteins in different cellular contexts (cytosolic versus mitochondrial versus chloroplast) has demonstrated that organellar context shapes the substrate repertoire and functional requirements of these ubiquitous chaperones[11].
The central role of AtHsp101/CLPB1 in thermotolerance, combined with detailed knowledge of its structure and biochemistry, has suggested the possibility of engineering improved heat tolerance in crop plants through modification of CLPB1 or related genes[5][9][20]. Overexpression studies in cyanobacteria and rice demonstrate that elevated ClpB1 levels can increase tolerance to rapid temperature changes and extreme heat exposure[5][9]. The identification of hyperactive ClpB1 variants through structural analysis suggests that rational design of improved disaggregase variants might be achievable.
However, the fitness trade-offs associated with high Hsp101 expression, particularly the suppression of root growth, indicate that simple overexpression strategies may not provide overall fitness improvements in agricultural contexts. Future crop-improvement strategies may need to employ tissue-specific or stress-inducible promoters to elevate ClpB1 levels selectively in thermally sensitive tissues or under heat stress conditions while avoiding the pleiotropic costs of constitutive high expression[5][9].
While the regulatory role of the M-domain in controlling ATP hydrolysis cooperativity is well-established, the precise molecular mechanism by which M-domain conformational changes are transduced to changes in nucleotide-binding site geometry remains incompletely understood. High-resolution structural studies capturing intermediate states during the transition between horizontal and tilted M-domain conformations would provide insight into this allosteric mechanism. Additionally, the structural basis for the species-specific interaction between ClpB M-domains and Hsp70 nucleotide-binding domains requires further structural characterization.
While hydrophobic residues and the NTD groove have been identified as substrate-binding sites, the full repertoire of substrates recognized and processed by AtHsp101/CLPB1 in vivo remains incompletely characterized. Biochemical identification of ClpB1 substrates has been limited, and the mechanistic basis for the selection of certain proteins over others for disaggregation remains unclear. Quantitative proteomics approaches applied to plants with enhanced or reduced ClpB1 activity under heat stress could reveal the complete set of native substrates and identify additional molecular interactions governed by this chaperone.
Recent discoveries of AtHsp101/CLPB1 roles in stress granule disassembly and translation recovery suggest additional stress response functions beyond protein disaggregation. The molecular mechanisms underlying these translational control functions require further investigation. Whether AtHsp101 directly interacts with stress granule proteins, translation initiation factors, or ribosomal components remains to be determined.
The CLPB1 gene of Arabidopsis thaliana encodes AtHsp101, an ATP-dependent protein disaggregase of the AAA+ superfamily that serves as a critical hub in the plant proteostasis network. Through sophisticated hexameric ring architecture featuring tandem nucleotide-binding domains and an M-domain that regulates catalytic activity, AtHsp101 catalyzes ATP-driven extraction and unfolding of polypeptides from protein aggregates formed during heat stress. This disaggregase function depends on functional cooperation with the cytosolic Hsp70 system, establishing a hierarchical chaperone partnership that enables rescue of heat-damaged proteins.
The biological significance of AtHsp101 extends beyond simple stress tolerance; the protein is essential for both basal thermotolerance in germinating seeds and acquired thermotolerance following heat conditioning. The identification of long-term acquired thermotolerance as a distinct process requiring both AtHsp101 and HSA32 proteins reveals a sophisticated temporal control mechanism enabling plants to maintain heat resistance over extended recovery periods. Additionally, recent evidence implicates AtHsp101 in stress granule dynamics and translation recovery, expanding its known functions beyond protein disaggregation alone.
Natural variation in HSP101 expression among Arabidopsis ecotypes, combined with the observation that AtHsp101 functionality provides substantial fitness advantages even under non-stressful conditions, indicates that this gene has been a target of natural selection and continues to be evolutionarily significant. The trade-off between enhanced heat tolerance and suppressed root growth likely explains the geographic variation in HSP101 expression, with warmer-climate populations expressing lower levels of this stress-response protein.
The comprehensive characterization of AtHsp101/CLPB1 exemplifies how detailed molecular analysis of a single gene can illuminate fundamental principles of cellular proteostasis, provide insights into protein aggregation disease mechanisms, and suggest strategies for improving crop stress tolerance in the face of climatic challenges. Future research building on this foundation should focus on elucidating the complete substrate repertoire, clarifying the mechanisms of translational control, and developing rational approaches to enhance heat tolerance without incurring fitness costs in non-stress conditions.
AtHsp101 (Arabidopsis Heat Shock Protein 101) = CLPB1 (Caseinolytic protease B homolog 1)
ESSENTIAL for acquired thermotolerance - hot1 mutants completely lack thermotolerance [deep-research]
Unfolded protein binding
ACCEPT core biological process annotations:
Acquired thermotolerance
ACCEPT localization annotations:
Cytosol
EMPHASIZE key distinctions:
Distinct from organellar ClpB3 and ClpB4
NOTE important interactions:
id: P42730
gene_symbol: AT1G74310
product_type: PROTEIN
status: INITIALIZED
taxon:
id: NCBITaxon:3702
label: Arabidopsis thaliana
description: "AtHsp101/CLPB1 is an ATP-dependent protein disaggregase that is absolutely
ESSENTIAL for thermotolerance in Arabidopsis, functioning as a hexameric AAA+ ATPase
that mechanically extracts polypeptide chains from heat-denatured protein aggregates
through a central pore threading mechanism. The protein works in obligate synergy
with the Hsp70/Hsc70-1 chaperone system (3-fold enhanced disaggregation rate) and
is the single most critical determinant of acquired thermotoleranceโhot1 null mutants
completely lack the ability to survive normally lethal temperatures even after heat
conditioning. AtHsp101 also mediates basal thermotolerance in germinating seeds,
promotes stress granule disassembly and translation recovery, and interacts with
HSA32 to extend thermotolerance memory. The protein is massively heat-induced (50-100
fold within 15-30 min) and functions primarily in the cytosol and nucleus."
existing_annotations:
- term:
id: GO:0005737
label: cytoplasm
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: AtHsp101/CLPB1 is a cytosolic/nuclear protein that functions in
the cytoplasm as part of the heat shock response machinery. The protein
lacks transit peptides that would direct it to organelles,
distinguishing it from organellar ClpB homologs (ClpB3 in chloroplasts,
ClpB4 in mitochondria).
action: ACCEPT
reason: This annotation is well-supported by phylogenetic analysis,
protein structure (no transit peptides), and functional studies showing
cytoplasmic localization and activity. The IBA evidence correctly
assigns cytoplasmic localization based on orthology to
well-characterized ClpB/Hsp100 proteins.
supported_by:
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: AtHsp101/CLPB1 is a cytosolic protein localized to
the soluble fraction of the cell, where it functions in the
cytoplasm and nucleus as part of the cellular stress response
machinery. The protein lacks transit peptides that would direct it
to organelles, distinguishing it from the organellar ClpB homologs
(ClpB3 in chloroplasts and ClpB4 in mitochondria) that are present
in Arabidopsis.
- term:
id: GO:0016887
label: ATP hydrolysis activity
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: AtHsp101/CLPB1 is an AAA+ ATPase with two nucleotide-binding
domains (NBD1/AAA1 and NBD2/AAA2) that hydrolyze ATP to drive protein
disaggregation. ATP hydrolysis is coupled to conformational changes that
enable mechanical extraction of polypeptides from aggregates through the
central hexameric pore.
action: ACCEPT
reason: This is a core molecular function. The protein contains conserved
Walker A, Walker B, and sensor motifs in both NBD1 and NBD2. ATP
hydrolysis activity is essential for chaperone function and
thermotolerance, as demonstrated by mutants in ATPase domains that lose
function. IBA evidence correctly identifies this conserved AAA+ family
activity.
supported_by:
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: The architecture consists of several critical
structural elements an N-terminal domain (NTD), two tandem AAA+
nucleotide-binding domains (designated AAA1/NBD1 and AAA2/NBD2), and
a unique coiled-coil middle domain (M-domain) that is inserted
between the two nucleotide-binding domains. The first
nucleotide-binding domain (NBD1), also termed the large subdomain,
contains the Walker A and Walker B motifs essential for ATP binding
and hydrolysis.
- reference_id: PMID:15659638
supporting_text: The ATPase activity of both NBDs, as well as an
intact coiled-coil domain, are essential for chaperone activity and
in vivo thermotolerance
- term:
id: GO:0034605
label: cellular response to heat
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: AtHsp101/CLPB1 is ESSENTIAL for cellular response to heat stress.
The protein mediates acquired thermotolerance and basal thermotolerance
through ATP-dependent disaggregation of heat-denatured protein
aggregates. Expression increases 50-100 fold within 15-30 minutes of
heat stress.
action: ACCEPT
reason: This is the primary biological process for AtHsp101. Multiple
lines of evidence demonstrate this is not just "involved in" but
ESSENTIAL for heat response. hot1 null mutants completely lack acquired
thermotolerance and have dramatically reduced basal thermotolerance. The
massive heat-induced transcriptional response and essential role in
thermotolerance make this a core annotation.
supported_by:
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: Loss-of-function mutations in CLPB1/HSP101 nearly
completely abolish this acquired thermotolerance response, rendering
hsp101 mutant seedlings sensitive to the same lethal temperature
regardless of prior heat conditioning. The transcriptional response
of CLPB1/HSP101 to heat stress is among the most robust in the plant
transcriptome, with mRNA levels increasing 50- to 100-fold within
15-30 minutes following shift to elevated temperature.
- reference_id: file:ARATH/AT1G74310/AT1G74310-notes.md
supporting_text: hot1 mutants COMPLETELY LACK acquired
thermotolerance. hot1-1 (E637K in NBD2), hot1-3 (null), hot1-4
(A499T in M-domain). Cannot survive 44-45 degrees C even with heat
conditioning.
- term:
id: GO:0000166
label: nucleotide binding
evidence_type: IEA
original_reference_id: GO_REF:0000043
review:
summary: Nucleotide binding is a molecular function of AtHsp101 through
its two AAA+ domains. However, this term is too general compared to the
more specific ATP binding annotation.
action: MODIFY
reason: While technically correct, this annotation is less specific than
GO:0005524 (ATP binding). AtHsp101 specifically binds and hydrolyzes
ATP, not other nucleotides. The more specific ATP binding term better
captures the molecular function.
proposed_replacement_terms:
- id: GO:0005524
label: ATP binding
supported_by:
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: The first nucleotide-binding domain (NBD1), also
termed the large subdomain, contains the Walker A and Walker B
motifs essential for ATP binding and hydrolysis, along with
conserved sensor residues and an arginine finger motif. The second
nucleotide-binding domain (NBD2) similarly contains ATP-binding
machinery.
- term:
id: GO:0005524
label: ATP binding
evidence_type: IEA
original_reference_id: GO_REF:0000120
review:
summary: AtHsp101 contains two ATP-binding domains (NBD1 and NBD2) with
conserved Walker A and Walker B motifs for ATP binding. ATP binding is
essential for the conformational changes that drive protein
disaggregation.
action: ACCEPT
reason: This is a core molecular function that is more specific and
accurate than the general nucleotide binding term. Both NBD1 and NBD2
bind ATP specifically, and ATP binding is required for the protein's
chaperone activity. The IEA evidence from InterPro domain matches
correctly identifies this conserved function.
supported_by:
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: Structural analysis demonstrates that ATP binding
induces stabilization of the D1 pore loops at the central pore
opening, providing a high-affinity substrate-binding state. Upon ATP
hydrolysis in the AAA1 ring, conformational changes occur that
promote substrate translocation and reduce substrate affinity.
- term:
id: GO:0005634
label: nucleus
evidence_type: IEA
original_reference_id: GO_REF:0000044
review:
summary: AtHsp101 localizes to both cytoplasm and nucleus. UniProt
annotations indicate nuclear localization, and the protein functions in
both compartments as part of the heat shock response machinery.
action: ACCEPT
reason: Nuclear localization is supported by UniProt subcellular location
data and consistent with the protein's role in cellular stress response.
The cytosolic/nuclear localization distinguishes it from organellar ClpB
homologs.
supported_by:
- reference_id: file:ARATH/AT1G74310/AT1G74310-uniprot.txt
supporting_text: 'SUBCELLULAR LOCATION: Cytoplasm, cytosol. Nucleus.'
- term:
id: GO:0005829
label: cytosol
evidence_type: IEA
original_reference_id: GO_REF:0000044
review:
summary: AtHsp101 is a cytosolic protein that functions in the soluble
fraction of the cell. This is the primary localization where the protein
performs its disaggregase activity on heat-denatured protein aggregates.
action: ACCEPT
reason: Cytosol is the primary and most specific subcellular localization
for AtHsp101. The protein lacks organellar targeting sequences and
functions in the cytosolic compartment. This is more specific than the
general cytoplasm annotation and is well-supported by experimental
evidence.
supported_by:
- reference_id: file:ARATH/AT1G74310/AT1G74310-uniprot.txt
supporting_text: 'SUBCELLULAR LOCATION: Cytoplasm, cytosol. Nucleus.'
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: AtHsp101/CLPB1 is a cytosolic protein localized to
the soluble fraction of the cell, where it functions in the
cytoplasm and nucleus as part of the cellular stress response
machinery.
- term:
id: GO:0009408
label: response to heat
evidence_type: IEA
original_reference_id: GO_REF:0000117
review:
summary: AtHsp101 is ESSENTIAL for response to heat stress. The protein is
massively induced by heat (50-100 fold increase in mRNA), and hot1
mutants completely lack acquired thermotolerance. This is the defining
biological process for this gene.
action: ACCEPT
reason: Response to heat is the primary biological process for AtHsp101.
The protein is one of the most heat-inducible genes in Arabidopsis and
is absolutely essential for thermotolerance. While GO:0034605 (cellular
response to heat) is more specific, this broader term is also accurate
and complements the cellular response annotation.
supported_by:
- reference_id: file:ARATH/AT1G74310/AT1G74310-notes.md
supporting_text: 'ESSENTIAL for acquired thermotolerance - hot1 mutants
completely lack thermotolerance. Massive induction: 50-100 fold increase
within 15-30 min of heat stress.'
- reference_id: PMID:7866032
supporting_text: An Arabidopsis heat shock protein complements a
thermotolerance defect in yeast
- term:
id: GO:0016887
label: ATP hydrolysis activity
evidence_type: IEA
original_reference_id: GO_REF:0000002
review:
summary: This is a duplicate of the IBA annotation for ATP hydrolysis
activity (GO:0016887). Both annotations refer to the same core molecular
function.
action: ACCEPT
reason: While this is a duplicate annotation with different evidence (IEA
from InterPro vs IBA from phylogenetic inference), both correctly
identify the same core function. It is acceptable to have the same GO
term with different evidence codes, as they provide independent support
for the annotation.
supported_by:
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: The primary catalytic function of AtHsp101/CLPB1 is
to disaggregate heat-denatured proteins that have undergone
irreversible aggregation due to thermal stress. This function
depends critically on ATP hydrolysis coupled to conformational
changes that enable mechanical work.
- term:
id: GO:0009644
label: response to high light intensity
evidence_type: IEP
original_reference_id: PMID:37947266
review:
summary: This annotation is based on expression profiling showing AtHsp101
mRNA changes during light stress and recovery. However, this appears to
be a secondary or indirect effect rather than a core function of the
gene.
action: KEEP_AS_NON_CORE
reason: The reference (PMID:37947266) focuses on mRNA dynamics during
light stress and recovery. AtHsp101 may be induced as part of a general
stress response, but light stress is not the primary biological process
for this gene. The core function is heat stress response. This
annotation represents a pleiotropic effect where heat shock proteins are
induced by multiple stresses, but should not be considered a defining
feature of AtHsp101 function.
supported_by:
- reference_id: PMID:37947266
supporting_text: 'Dynamics of mRNA fate during light stress and recovery:
from transcription to stability and translation.'
- term:
id: GO:0009570
label: chloroplast stroma
evidence_type: HDA
original_reference_id: PMID:20061580
review:
summary: This annotation is based on a chloroplast proteomics study
(AT_CHLORO database) that detected AtHsp101 in chloroplast fractions.
However, this conflicts with the well-established cytosolic/nuclear
localization of AtHsp101 (CLPB1) and likely represents contamination or
misidentification.
action: REMOVE
reason: This annotation is inconsistent with extensive evidence that
AtHsp101/CLPB1 is a CYTOSOLIC protein that lacks chloroplast transit
peptides. Arabidopsis has a distinct chloroplast-targeted ClpB homolog
(ClpB3/APG6) that is different from the cytosolic CLPB1. The detection
in PMID:20061580 likely represents cytosolic contamination in
chloroplast preparations, which is common in organellar proteomics. The
protein sequence has no predicted chloroplast targeting signal, and all
functional studies demonstrate cytosolic localization.
supported_by:
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: AtHsp101/CLPB1 is a cytosolic protein localized to
the soluble fraction of the cell. The protein lacks transit peptides
that would direct it to organelles, distinguishing it from the
organellar ClpB homologs (ClpB3 in chloroplasts and ClpB4 in
mitochondria) that are present in Arabidopsis.
- reference_id: file:ARATH/AT1G74310/AT1G74310-uniprot.txt
supporting_text: 'SUBCELLULAR LOCATION: Cytoplasm, cytosol. Nucleus.'
- reference_id: PMID:20061580
supporting_text: Epub 2010 Jan 10. AT_CHLORO, a comprehensive
chloroplast proteome database with subplastidial localization and
curated information on envelope proteins.
- term:
id: GO:0009941
label: chloroplast envelope
evidence_type: HDA
original_reference_id: PMID:20061580
review:
summary: This annotation is from the same chloroplast proteomics study
(PMID:20061580) and suffers from the same problem as the chloroplast
stroma annotation - likely contamination or misidentification.
action: REMOVE
reason: AtHsp101/CLPB1 is a cytosolic protein, not a chloroplast protein.
The detection in chloroplast envelope fractions is inconsistent with all
genetic, biochemical, and sequence evidence showing cytosolic
localization. Arabidopsis has a separate chloroplast-targeted ClpB
(ClpB3) that is distinct from cytosolic CLPB1. High-throughput
proteomics of organellar fractions frequently suffer from cytosolic
contamination.
supported_by:
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: The identification of distinct organellar and
cytosolic ClpB proteins represents an important evolutionary
development, as early work on the Arabidopsis ClpB family identified
four putative ClpB homologs designated ClpB1-4. The cytosolic
AtHsp101 (ClpB1) lacks transit peptides distinguishing it from ClpB3
in chloroplasts and ClpB4 in mitochondria.
- reference_id: PMID:20061580
supporting_text: Epub 2010 Jan 10. AT_CHLORO, a comprehensive
chloroplast proteome database with subplastidial localization and
curated information on envelope proteins.
- term:
id: GO:0071456
label: cellular response to hypoxia
evidence_type: HEP
original_reference_id: PMID:31519798
review:
summary: This annotation is based on high-throughput expression profiling
showing changes in AtHsp101 expression during hypoxia stress. However,
this represents a secondary stress response rather than a core function.
action: KEEP_AS_NON_CORE
reason: The HEP (high-throughput expression pattern) evidence indicates
AtHsp101 expression changes during hypoxia, but this is not the primary
biological process for this gene. Heat shock proteins are often induced
by multiple stresses as part of a general stress response. The core
function of AtHsp101 is heat stress response and protein disaggregation.
Hypoxia response should be considered a pleiotropic or secondary effect.
supported_by:
- reference_id: PMID:31519798
supporting_text: Integrative Analysis from the Epigenome to
Translatome Uncovers Patterns of Dominant Nuclear Regulation during
Transient Stress.
- term:
id: GO:0005737
label: cytoplasm
evidence_type: ISM
original_reference_id: GO_REF:0000122
review:
summary: This is a duplicate cytoplasm annotation with different evidence
(ISM from AtSubP analysis vs IBA from phylogenetic inference). Both
correctly identify cytoplasmic localization.
action: ACCEPT
reason: While this duplicates the IBA annotation for cytoplasm, it
provides independent computational support (from AtSubP subcellular
localization prediction) for the same localization. Having multiple
lines of evidence for cytoplasmic localization is acceptable and
strengthens the annotation.
supported_by:
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: AtHsp101/CLPB1 is a cytosolic protein localized to
the soluble fraction of the cell, where it functions in the
cytoplasm and nucleus as part of the cellular stress response
machinery.
- term:
id: GO:0045727
label: positive regulation of translation
evidence_type: IMP
original_reference_id: PMID:23439916
review:
summary: AtHsp101 promotes translation recovery after heat stress,
particularly for HSA32 protein accumulation. The protein is involved in
stress granule disassembly and restoration of translation during
recovery from heat stress.
action: ACCEPT
reason: This is a well-supported secondary function of AtHsp101.
PMID:23439916 demonstrates that HSP101 enhances HSA32 translation during
recovery after heat treatment. Additionally, hot1 mutants show defects
in stress granule disassembly and translation recovery, with ribosomal
protein mRNAs preferentially stored in stress granules and released in
an Hsp101-dependent manner. This represents an important function beyond
simple protein disaggregation.
supported_by:
- reference_id: PMID:23439916
supporting_text: Results of immunoblot analyses suggest that HSP101
enhances the translation of HSA32 during recovery after heat
treatment, and in turn, HSA32 retards the decay of HSP101.
- reference_id: file:ARATH/AT1G74310/AT1G74310-notes.md
supporting_text: 'Stress Granule Disassembly (NOVEL): hot1 mutants show
defect in stress granule (SG) disassembly after heat stress. Defect in
translation recovery following heat stress. Ribosomal protein mRNAs preferentially
stored in SGs, released in Hsp101-dependent manner.'
- term:
id: GO:0009408
label: response to heat
evidence_type: IEP
original_reference_id: PMID:20229063
review:
summary: This is another response to heat annotation with IEP evidence
from expression profiling. It duplicates earlier response to heat
annotations but with different evidence.
action: ACCEPT
reason: This provides additional experimental support (expression
profiling) for the response to heat annotation. Multiple independent
studies confirm the heat-responsive nature of AtHsp101. Having the same
GO term with different evidence codes and references is acceptable and
strengthens the overall annotation.
supported_by:
- reference_id: PMID:20229063
supporting_text: Functional characterization of AtHsp90.3 in
Saccharomyces cerevisiae and Arabidopsis thaliana under heat stress.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:19452453
review:
summary: This annotation is based on detection of protein-protein
interactions in a 14-3-3 protein complex proteomics study. However, this
generic term does not inform about the specific functional interactions.
action: MODIFY
reason: While AtHsp101 does bind proteins, this annotation is too vague to
be informative. The protein's functional interactions are with
Hsp70/Hsc70-1 (essential synergy through M-domain), misfolded/aggregated
substrate proteins (through NTD and pore loops), and HSA32 (for
long-term thermotolerance). A more specific term like "unfolded protein
binding" or "misfolded protein binding" would better capture the
functional significance. The 14-3-3 interaction may be regulatory but is
not the core protein-binding function.
proposed_replacement_terms:
- id: GO:0051082
label: unfolded protein binding
supported_by:
- reference_id: PMID:19452453
supporting_text: Proteomic profiling of tandem affinity purified
14-3-3 protein complexes in Arabidopsis thaliana.
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: The synergistic interaction between ClpB1 and the
DnaK/Hsp70 system results in approximately 3-fold higher
disaggregation rates. The NTD-binding groove specifically recognizes
exposed hydrophobic stretches in unfolded or aggregated proteins.
- term:
id: GO:0009408
label: response to heat
evidence_type: IMP
original_reference_id: PMID:18047473
review:
summary: Another response to heat annotation with experimental evidence
(IMP) from a study on thermotolerance enhancement. This duplicates
earlier annotations but provides additional experimental support.
action: ACCEPT
reason: Multiple experimental studies (IMP evidence) independently
demonstrate AtHsp101's essential role in heat response. Having the same
GO term annotated from multiple independent studies strengthens the
overall annotation and reflects the robustness of the experimental
evidence.
supported_by:
- reference_id: PMID:18047473
supporting_text: The xenobiotic beta-aminobutyric acid enhances
Arabidopsis thermotolerance.
- term:
id: GO:0009408
label: response to heat
evidence_type: IMP
original_reference_id: PMID:15923322
review:
summary: Yet another response to heat annotation with IMP evidence from
mutant analysis. This is consistent with all other response to heat
annotations.
action: ACCEPT
reason: This study identified heat stress phenotypes of mutants and
confirmed AtHsp101's role in thermotolerance acquisition. Multiple IMP
annotations from independent studies provide strong experimental support
for this core biological process.
supported_by:
- reference_id: PMID:15923322
supporting_text: Heat stress phenotypes of Arabidopsis mutants
implicate multiple signaling pathways in the acquisition of
thermotolerance.
- term:
id: GO:0043335
label: protein unfolding
evidence_type: IMP
original_reference_id: PMID:15659638
review:
summary: This annotation describes protein unfolding as a biological
process. However, AtHsp101's primary activity is protein disaggregation
and refolding, not unfolding per se.
action: MODIFY
reason: While AtHsp101 does mechanically extract and unfold polypeptides
during disaggregation, the term "protein unfolding" may be misleading as
it could imply the protein's primary role is to unfold native proteins.
The core function is protein disaggregation (extracting proteins from
aggregates) and enabling refolding. More accurate terms would be
GO:1990116 (ribosome-associated protein quality control pathway) or a
protein disaggregation term if one exists. The threading mechanism does
involve unfolding, but this is in service of disaggregation.
proposed_replacement_terms:
- id: GO:0140662
label: ATP-dependent protein folding chaperone
supported_by:
- reference_id: PMID:15659638
supporting_text: substrate unfolding may be coupled to threading
through the axial channel of the ClpB hexamer
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: The threading mechanism involves passage of unfolded
substrate polypeptides through the central hexameric pore in a
processive, directional manner. The actual extraction of individual
polypeptides from aggregates involves continuous withdrawal of
unfolded protein chains.
- term:
id: GO:0009408
label: response to heat
evidence_type: IEP
original_reference_id: PMID:7866032
review:
summary: The original publication describing AtHsp101 complementation of
yeast thermotolerance defects, providing IEP evidence for response to
heat.
action: ACCEPT
reason: This is the seminal paper that first characterized AtHsp101 and
demonstrated its heat-responsive expression and function in
thermotolerance. The IEP evidence from this foundational study is
important and complements the other response to heat annotations.
supported_by:
- reference_id: PMID:7866032
supporting_text: An Arabidopsis heat shock protein complements a
thermotolerance defect in yeast
- term:
id: GO:0042026
label: protein refolding
evidence_type: TAS
original_reference_id: PMID:15659638
review:
summary: AtHsp101 facilitates protein refolding by disaggregating
heat-denatured protein aggregates and working with Hsp70 to enable
proper refolding of rescued proteins.
action: NEW
reason: While protein refolding is not currently annotated, it is a key
biological process for AtHsp101. The protein disaggregates aggregates
and works synergistically with Hsp70 to enable refolding. The unfolded
polypeptides emerging from the ClpB1 channel are captured by Hsp70 which
facilitates refolding. This term better captures the biological outcome
than protein unfolding.
supported_by:
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: The unfolded polypeptides emerging from the ClpB
channel are captured by Hsp70, which facilitates either spontaneous
refolding or delivery to additional chaperones such as Hsp90 for
complete refolding.
- reference_id: file:ARATH/AT1G74310/AT1G74310-notes.md
supporting_text: "Hierarchical partnership: sHsps (ATP-independent stabilization)
โ ClpB1 (ATP-dependent disaggregation) โ Hsp70 (refolding)."
- reference_id: PMID:15659638
supporting_text: Genetic analysis reveals domain interactions of
Arabidopsis Hsp100/ClpB and cooperation with the small heat shock
protein chaperone system.
- term:
id: GO:0035617
label: stress granule disassembly
evidence_type: TAS
original_reference_id: PMID:23439916
review:
summary: AtHsp101 is required for stress granule disassembly after heat
stress. hot1 mutants show defects in SG disassembly and translation
recovery.
action: NEW
reason: This is a well-documented function of AtHsp101 that is not
currently annotated. Multiple studies show that hot1 mutants are
defective in stress granule disassembly following heat stress, with
ribosomal protein mRNAs remaining sequestered in stress granules rather
than being released for translation. This is distinct from the general
protein disaggregation function.
supported_by:
- reference_id: file:ARATH/AT1G74310/AT1G74310-notes.md
supporting_text: 'Stress Granule Disassembly (NOVEL): hot1 mutants show
defect in stress granule (SG) disassembly after heat stress. Defect in
translation recovery following heat stress. Ribosomal protein mRNAs preferentially
stored in SGs, released in Hsp101-dependent manner.'
- reference_id:
file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: RNA sequencing and polysome profiling studies
demonstrate that hsp101 mutant plants show a significant defect in
translation recovery and stress granule disassembly following heat
stress.
- reference_id: PMID:23439916
supporting_text: Feb 25. Interplay between heat shock proteins HSP101
and HSA32 prolongs heat acclimation memory posttranscriptionally in
Arabidopsis.
references:
- id: GO_REF:0000002
title: Gene Ontology annotation through association of InterPro records with
GO terms.
findings: []
- id: GO_REF:0000033
title: Annotation inferences using phylogenetic trees
findings: []
- id: GO_REF:0000043
title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword
mapping
findings: []
- id: GO_REF:0000044
title: Gene Ontology annotation based on UniProtKB/Swiss-Prot Subcellular
Location vocabulary mapping, accompanied by conservative changes to GO
terms applied by UniProt.
findings: []
- id: GO_REF:0000117
title: Electronic Gene Ontology annotations created by ARBA machine learning
models
findings: []
- id: GO_REF:0000120
title: Combined Automated Annotation using Multiple IEA Methods.
findings: []
- id: GO_REF:0000122
title: AtSubP analysis
findings: []
- id: PMID:15659638
title: Genetic analysis reveals domain interactions of Arabidopsis
Hsp100/ClpB and cooperation with the small heat shock protein chaperone
system.
findings: []
- id: PMID:15923322
title: Heat stress phenotypes of Arabidopsis mutants implicate multiple
signaling pathways in the acquisition of thermotolerance.
findings: []
- id: PMID:18047473
title: The xenobiotic beta-aminobutyric acid enhances Arabidopsis
thermotolerance.
findings: []
- id: PMID:19452453
title: Proteomic profiling of tandem affinity purified 14-3-3 protein
complexes in Arabidopsis thaliana.
findings: []
- id: PMID:20061580
title: AT_CHLORO, a comprehensive chloroplast proteome database with
subplastidial localization and curated information on envelope proteins.
findings: []
- id: PMID:20229063
title: Functional characterization of AtHsp90.3 in Saccharomyces cerevisiae
and Arabidopsis thaliana under heat stress.
findings: []
- id: PMID:23439916
title: Interplay between heat shock proteins HSP101 and HSA32 prolongs heat
acclimation memory posttranscriptionally in Arabidopsis.
findings: []
- id: PMID:31519798
title: Integrative Analysis from the Epigenome to Translatome Uncovers
Patterns of Dominant Nuclear Regulation during Transient Stress.
findings: []
- id: PMID:37947266
title: 'Dynamics of mRNA fate during light stress and recovery: from transcription
to stability and translation.'
findings: []
- id: PMID:7866032
title: An Arabidopsis heat shock protein complements a thermotolerance
defect in yeast.
findings: []
core_functions:
- description: "ATP-dependent disaggregation of heat-denatured protein aggregates,
essential for acquired thermotolerance. Functions as a hexameric AAA+ ATPase
that processively threads polypeptide chains through a central pore, mechanically
extracting unfolded proteins from aggregates. Works in obligate synergy with
Hsp70/Hsc70-1 (3-fold enhanced activity) where Hsp70 recruits ClpB1 to aggregates
and captures emerging polypeptide chains. Absolutely required for thermotoleranceโhot1
null mutants completely lack acquired thermotolerance and cannot survive 44-45ยฐC
even with heat conditioning."
molecular_function:
id: GO:0140662
label: ATP-dependent protein folding chaperone
directly_involved_in:
- id: GO:0034605
label: cellular response to heat
- id: GO:0009408
label: response to heat
locations:
- id: GO:0005829
label: cytosol
- id: GO:0005634
label: nucleus
supported_by:
- reference_id: file:ARATH/AT1G74310/AT1G74310-notes.md
supporting_text: "hot1 mutants COMPLETELY LACK acquired thermotolerance. hot1-1
(E637K in NBD2), hot1-3 (null), hot1-4 (A499T in M-domain). Cannot survive
44-45ยฐC even with heat conditioning. ESSENTIAL partnership with Hsp70โ3-fold
higher disaggregation rate with Hsp70 vs ClpB1 alone."
- reference_id: file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: Loss-of-function mutations in CLPB1/HSP101 nearly
completely abolish acquired thermotolerance response. The synergistic
interaction between ClpB1 and the DnaK/Hsp70 system results in
approximately 3-fold higher disaggregation rates compared to ClpB1
alone.
- reference_id: PMID:15659638
supporting_text: The ATPase activity of both NBDs, as well as an intact
coiled-coil domain, are essential for chaperone activity and in vivo
thermotolerance.
- description: "Protein refolding through hierarchical chaperone network cooperation.
Extracts individual polypeptide chains from aggregates via ATP-driven threading,
presenting unfolded substrates to Hsp70 which captures emerging chains and facilitates
refolding or delivery to Hsp90 for complete refolding. The M-domain mediates
species-specific Hsp70 interaction and regulates ClpB1 activity through conformational
changes. NTD hydrophobic groove and pore loops (YKR and Ar/ฮฆ motifs) directly
contact substrate polypeptides during extraction."
molecular_function:
id: GO:0140662
label: ATP-dependent protein folding chaperone
directly_involved_in:
- id: GO:0042026
label: protein refolding
locations:
- id: GO:0005829
label: cytosol
- id: GO:0005634
label: nucleus
substrates:
- id: GO:0051082
label: unfolded protein binding
supported_by:
- reference_id: file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: The unfolded polypeptides emerging from the ClpB
channel are captured by Hsp70, which facilitates either spontaneous
refolding or delivery to additional chaperones such as Hsp90 for
complete refolding. The NTD-binding groove specifically recognizes
exposed hydrophobic stretches in unfolded or aggregated proteins.
- reference_id: file:ARATH/AT1G74310/AT1G74310-notes.md
supporting_text: "Hierarchical partnership: sHsps (ATP-independent stabilization)
โ ClpB1 (ATP-dependent disaggregation) โ Hsp70 (refolding). M-domain mediates
species-specific Hsp70 interaction."
- description: Stress granule disassembly and translation recovery after heat
stress. Promotes disassembly of stress granules (cytoplasmic mRNA-protein
condensates) and restoration of translation during recovery from heat
stress. Ribosomal protein mRNAs are preferentially released from stress
granules in an Hsp101-dependent manner. hot1 mutants show significant
defects in stress granule disassembly and translation recovery, distinct
from general protein disaggregation function.
molecular_function:
id: GO:0016887
label: ATP hydrolysis activity
directly_involved_in:
- id: GO:0035617
label: stress granule disassembly
- id: GO:0045727
label: positive regulation of translation
locations:
- id: GO:0005829
label: cytosol
supported_by:
- reference_id: file:ARATH/AT1G74310/AT1G74310-notes.md
supporting_text: 'Stress Granule Disassembly (NOVEL): hot1 mutants show defect
in stress granule (SG) disassembly after heat stress. Defect in translation
recovery following heat stress. Ribosomal protein mRNAs preferentially stored
in SGs, released in Hsp101-dependent manner.'
- reference_id: file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: RNA sequencing and polysome profiling studies
demonstrate that hsp101 mutant plants show a significant defect in
translation recovery and stress granule disassembly following heat
stress.
- reference_id: PMID:23439916
supporting_text: HSP101 enhances the translation of HSA32 during
recovery after heat treatment.
- description: "Basal thermotolerance in germinating seeds. Provides critical thermotolerance
capacity during early seed germination (first 30-48 hours) when basal Hsp101
expression is constitutively elevated. Germinating seeds exhibit high basal
thermotolerance and can survive 47ยฐC for 2 hours, correlating with elevated
Hsp101 protein levels. Antisense suppression of Hsp101 dramatically reduces
basal thermotolerance in seeds, demonstrating developmental importance beyond
heat-induced function."
molecular_function:
id: GO:0140662
label: ATP-dependent protein folding chaperone
directly_involved_in:
- id: GO:0009408
label: response to heat
locations:
- id: GO:0005829
label: cytosol
supported_by:
- reference_id: file:ARATH/AT1G74310/AT1G74310-notes.md
supporting_text: "Basal Thermotolerance in Seeds: Germinating seeds have high
basal thermotolerance. Seeds survive 47ยฐC for 2h during early germination
(first 30-48h). Correlates with elevated basal Hsp101 expression. Antisense
suppression dramatically reduces basal tolerance."
- reference_id: file:ARATH/AT1G74310/AT1G74310-deep-research-perplexity.md
supporting_text: Seeds exhibit particularly robust basal thermotolerance
during the early stages of germination, a trait that correlates with
constitutively elevated levels of HSP101 protein in developing
seedlings.
- description: Long-term thermotolerance maintenance through positive feedback
with HSA32. Works synergistically with HSA32 (a small heat shock protein)
to extend thermotolerance memory beyond the initial heat shock response.
Hsp101 promotes HSA32 protein accumulation through enhanced translation,
while HSA32 retards Hsp101 protein degradation. This mutual stabilization
enables long-term thermotolerance (48+ hours recovery) compared to
short-term response (2 hours) mediated by Hsp101 alone.
molecular_function:
id: GO:0140662
label: ATP-dependent protein folding chaperone
directly_involved_in:
- id: GO:0034605
label: cellular response to heat
- id: GO:0045727
label: positive regulation of translation
locations:
- id: GO:0005829
label: cytosol
supported_by:
- reference_id: PMID:23439916
supporting_text: HSP101 enhances the translation of HSA32 during
recovery after heat treatment, and in turn, HSA32 retards the decay of
HSP101. This mutual stabilization extends thermotolerance memory.
- reference_id: file:ARATH/AT1G74310/AT1G74310-notes.md
supporting_text: 'Long-term thermotolerance maintenance with HSA32: Short-term
(2h recovery) Hsp101 only; Long-term (48h+ recovery) Hsp101 + HSA32 required.
Positive feedback: Hsp101 promotes HSA32 accumulation; HSA32 retards Hsp101
degradation.'
tags:
- arath