sigK

UniProt ID: P12254
Organism: Bacillus subtilis (strain 168)
Review Status: DRAFT
📝 Provide Detailed Feedback

Gene Description

SigK (also known as spoIVCB/spoIIIC) encodes the RNA polymerase sigma-K factor, the late mother cell-specific sigma factor of the sigma-70 family that directs transcription of late sporulation genes in B. subtilis. SigK has a unique feature: in strain 168, the sigK gene is interrupted by a ~48 kb prophage-like "skin" element (skinBs), and the functional sigK is only reconstituted late in sporulation when the SpoIVCA recombinase excises this element under sigE/SpoIIID control in the mother cell. SigK is synthesized as an inactive precursor (pro-sigK) with a 20-residue propeptide that must be removed by the intramembrane metalloprotease SpoIVFB for activation. SpoIVFB is held inactive by BofA and SpoIVFA until forespore-secreted proteases SpoIVB and CtpB cleave these inhibitors, thereby coupling sigK activation to developmental stage. The mature sigma-K factor contains conserved sigma-70 regions including a helix-turn-helix DNA-binding motif (residues 251-270) that recognizes sigK-dependent promoters. SigK activates the final phase of mother cell gene expression including coat proteins (cotA, cotB, cotD, cotE, cotH), cortex, and germination genes. SigK operates in a hierarchical feed-forward network with GerE, which both represses some sigK-activated genes and activates additional late genes.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0000976 transcription cis-regulatory region binding
IBA
GO_REF:0000033
REMOVE
Summary: This annotation suggests sigK binds to transcription cis-regulatory regions independently. However, according to the GO definition of sigma factor activity (GO:0016987), sigma factors do not bind DNA on their own but when combined with the core polymerase to form the holoenzyme, they confer promoter specificity. This term is technically inaccurate for sigma factors as they require the core polymerase for DNA binding.
Reason: Sigma factors do not bind DNA independently. The GO definition of sigma factor activity explicitly states that "sigma does not bind DNA on its own, when combined with the core to form the holoenzyme, the sigma factor binds specifically to promoter elements." This term implies autonomous DNA binding which is misleading for sigma factors. The correct molecular function is GO:0016987 sigma factor activity, which is already annotated for sigK.
Supporting Evidence:
UniProt:P12254
Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released.
GO:0003899 DNA-directed RNA polymerase activity
IBA
GO_REF:0000033
REMOVE
Summary: This annotation incorrectly attributes catalytic RNA polymerase activity to sigK. Sigma factors are NOT enzymes and do not catalyze the polymerization reaction. The catalytic activity resides in the core polymerase subunits (beta, beta-prime), not in sigma factors.
Reason: This is a clear over-annotation. GO:0003899 is defined as "Catalysis of the reaction: nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1)." Sigma factors do not possess catalytic activity. The catalytic activity of RNA polymerization is performed by the core enzyme subunits, not by sigma factors. SigK confers promoter specificity but does not catalyze nucleotide addition. UniProt describes sigma factors as "initiation factors that promote the attachment of RNA polymerase to specific initiation sites" - not enzymes.
Supporting Evidence:
UniProt:P12254
Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released.
GO:0006355 regulation of DNA-templated transcription
IBA
GO_REF:0000033
ACCEPT
Summary: SigK regulates transcription by directing RNA polymerase to mother cell-specific promoters during late sporulation. The sigK regulon includes coat proteins, cortex genes, and germination factors. SigK operates in a feed-forward network with GerE to sculpt the late mother cell transcriptional program.
Reason: Sigma factors inherently regulate transcription by determining which genes are transcribed. SigK specifically activates the late mother cell developmental program. UniProt states that "This sigma factor is responsible for the expression of sporulation specific genes in the mother cell." Deep research confirms sigK activates coat/cortex/germination genes.
Supporting Evidence:
UniProt:P12254
This sigma factor is responsible for the expression of sporulation specific genes in the mother cell.
file:BACSU/sigK/sigK-deep-research-falcon.md
sigK operates in a hierarchical, feed-forward mother-cell program following sigE and preceding GerE, activating coat/cortex/germination genes
GO:0016987 sigma factor activity
IBA
GO_REF:0000033
ACCEPT
Summary: This is the primary and most accurate molecular function annotation for sigK. SigK is a well-characterized sigma factor of the sigma-70 family. It contains conserved sigma-70 regions (regions 2 and 4) that mediate core RNAP binding and promoter recognition. UniProt explicitly classifies sigK as belonging to the sigma-70 factor family.
Reason: This is the canonical molecular function for sigK. UniProt identifies sigK as "RNA polymerase sigma-K factor" belonging to the "sigma-70 factor family." The protein contains all characteristic sigma-70 family domains including the HTH DNA-binding motif (residues 251-270) and polymerase core binding motif (residues 79-92).
Supporting Evidence:
UniProt:P12254
Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released.
UniProt:P12254
Belongs to the sigma-70 factor family.
GO:1903865 sigma factor antagonist complex
IBA
GO_REF:0000033
ACCEPT
Summary: SigK (as pro-sigK) forms a complex with its regulatory proteins SpoIVFB, BofA, and SpoIVFA. GO:1903865 is defined as "A protein complex which is capable of sigma factor antagonist activity." The BofA/SpoIVFA complex inhibits SpoIVFB-mediated pro-sigK processing, thereby preventing premature sigK activation.
Reason: UniProt documents the interaction between sigK and SpoIVFB (IntAct:EBI-15806052). PMID:19805276 demonstrates that "The CBS domain of SpoIVFB was shown to interact with Pro-sigma(K)". The pro-sigK-SpoIVFB-BofA-SpoIVFA complex represents a sigma factor antagonist complex that prevents premature sigK activity until the forespore signals for activation via SpoIVB and CtpB proteases.
Supporting Evidence:
UniProt:P12254
P12254; P26937: spoIVFB; NbExp=2; IntAct=EBI-15806052, EBI-15806037
PMID:19805276
The CBS domain of SpoIVFB was shown to interact with Pro-sigma(K) and ATP changed the interaction
GO:0003677 DNA binding
IEA
GO_REF:0000043
KEEP AS NON CORE
Summary: This generic DNA binding annotation is inferred from the UniProt DNA-binding keyword. SigK does contain a helix-turn-helix DNA-binding motif (residues 251-270) as documented in UniProt. However, sigma factors require the core polymerase for DNA binding - they do not bind DNA independently.
Reason: SigK contains a documented HTH DNA-binding motif at residues 251-270 (UniProt feature). However, per the GO definition of sigma factor activity, "sigma does not bind DNA on its own." The more precise molecular function is GO:0016987 sigma factor activity. This annotation is kept as non-core since it reflects domain architecture but is less informative than the sigma factor activity annotation.
Supporting Evidence:
UniProt:P12254
DNA_BIND 251..270 /note="H-T-H motif"
GO:0003700 DNA-binding transcription factor activity
IEA
GO_REF:0000002
MODIFY
Summary: This annotation from InterPro domain mapping is incorrect for sigma factors. Sigma factors are not transcription factors in the classical sense; they are subunits of the RNA polymerase holoenzyme that confer promoter specificity rather than independent regulatory factors.
Reason: GO:0003700 is defined as sequence-specific DNA binding that modulates transcription, applicable to classical transcription factors that independently bind DNA regulatory elements. Sigma factors function differently: they are dissociable subunits of RNA polymerase that determine promoter recognition as part of the holoenzyme complex. The correct term is GO:0016987 sigma factor activity, which is already annotated for sigK.
Proposed replacements: sigma factor activity
Supporting Evidence:
UniProt:P12254
Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released
GO:0006351 DNA-templated transcription
IEA
GO_REF:0000043
ACCEPT
Summary: SigK is involved in DNA-templated transcription as a component of the RNA polymerase holoenzyme. This is an appropriate general biological process annotation inferred from the UniProt Transcription keyword.
Reason: As a sigma factor, sigK is essential for transcription initiation from specific promoters. This general process term appropriately captures sigK's role in transcription. The annotation is supported by UniProt keywords (Transcription) and the well-characterized function of sigma factors in bacterial transcription.
Supporting Evidence:
UniProt:P12254
This sigma factor is responsible for the expression of sporulation specific genes in the mother cell.
GO:0006352 DNA-templated transcription initiation
IEA
GO_REF:0000002
ACCEPT
Summary: This is a highly appropriate annotation for sigma factors. Sigma factors specifically function in transcription initiation by directing RNA polymerase to promoters and are released after initiation. SigK acts at the initiation step to enable recognition of sigK-dependent promoters.
Reason: UniProt states that "Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released." The specific role of sigma factors in initiation (as opposed to elongation or termination) makes this a precise and accurate annotation for sigK.
Supporting Evidence:
UniProt:P12254
Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released
GO:0006355 regulation of DNA-templated transcription
IEA
GO_REF:0000120
ACCEPT
Summary: Duplicate annotation (same term as the IBA annotation above) from automated inference. This accurately captures sigK's regulatory role in transcription of the late mother cell sporulation program.
Reason: This is a duplicate of the IBA annotation for the same term. Both annotations are valid and supported by the same evidence. Sigma factors regulate transcription by determining which genes are expressed. SigK specifically regulates the late mother cell gene expression program during sporulation.
Supporting Evidence:
UniProt:P12254
This sigma factor is responsible for the expression of sporulation specific genes in the mother cell.
GO:0016987 sigma factor activity
IEA
GO_REF:0000043
ACCEPT
Summary: Duplicate annotation (same term as the IBA annotation above) from automated inference based on UniProt Sigma factor keyword.
Reason: This is a duplicate of the IBA annotation for sigma factor activity. Both annotations are valid. This is the core molecular function of sigK, supported by the UniProt Sigma factor keyword and sigma-70 family domain architecture.
Supporting Evidence:
UniProt:P12254
Belongs to the sigma-70 factor family.
GO:0030435 sporulation resulting in formation of a cellular spore
IEA
GO_REF:0000043
ACCEPT
Summary: SigK is the late mother cell-specific sigma factor essential for sporulation in B. subtilis. It activates genes required for coat assembly, cortex formation, and germination, all essential for production of a mature spore. This biological process annotation appropriately captures the developmental context of sigK function.
Reason: SigK is specifically active in the mother cell during late sporulation and is absolutely required for spore formation. UniProt states it is "responsible for the expression of sporulation specific genes in the mother cell." PMID:19805276 confirms that sigK directs "transcription of genes necessary for spore formation."
Supporting Evidence:
UniProt:P12254
This sigma factor is responsible for the expression of sporulation specific genes in the mother cell.
PMID:19805276
releasing sigma(K) to direct transcription of genes necessary for spore formation
GO:2000142 regulation of DNA-templated transcription initiation
IEA
GO_REF:0000108
ACCEPT
Summary: This annotation is logically inferred from sigma factor activity (GO:0016987). Since sigma factors specifically function at the initiation step, regulation of transcription initiation is an appropriate process annotation for sigK.
Reason: This is a more specific version of the transcription regulation annotation that correctly reflects that sigma factors act at the initiation stage. Since sigma factors "promote the attachment of RNA polymerase to specific initiation sites," they specifically regulate the initiation step of transcription. This logical inference from GO:0016987 is valid.
Supporting Evidence:
UniProt:P12254
Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites
GO:0005515 protein binding
IPI
PMID:19805276
Intramembrane proteolytic cleavage of a membrane-tethered tr...
MODIFY
Summary: This annotation is based on the physical interaction between pro-sigK and the intramembrane metalloprotease SpoIVFB, demonstrated in PMID:19805276. The study shows that SpoIVFB engages pro-sigK as its substrate for regulated intramembrane proteolysis, releasing active sigK.
Reason: While the annotation is not wrong (sigK does bind to SpoIVFB), "protein binding" is too vague to be informative. The specific interaction documented is between pro-sigK and its processing protease SpoIVFB. The CC term GO:1903865 (sigma factor antagonist complex) already captures this regulatory relationship more specifically. The protein binding annotation adds little value over the more specific complex annotation.
Proposed replacements: sigma factor antagonist complex
Supporting Evidence:
PMID:19805276
The Bacillus subtilis SpoIVFB protein is a putative metalloprotease that cleaves membrane-tethered Pro-sigma(K), releasing sigma(K) to direct transcription of genes necessary for spore formation.
PMID:19805276
The CBS domain of SpoIVFB was shown to interact with Pro-sigma(K) and ATP changed the interaction

Core Functions

Late mother cell-specific sigma factor activity directing transcription initiation of sporulation genes. SigK confers promoter recognition specificity to RNA polymerase, enabling transcription of coat proteins, cortex assembly genes, and germination factors. SigK is activated by proteolytic removal of its N-terminal propeptide by SpoIVFB after forespore-derived SpoIVB/CtpB proteases relieve inhibition of SpoIVFB by BofA/SpoIVFA.

Supporting Evidence:
  • UniProt:P12254
    Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released.
  • UniProt:P12254
    This sigma factor is responsible for the expression of sporulation specific genes in the mother cell.

References

Gene Ontology annotation through association of InterPro records with GO terms
Annotation inferences using phylogenetic trees
Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
Automatic assignment of GO terms using logical inference, based on inter-ontology links
Combined Automated Annotation using Multiple IEA Methods
Intramembrane proteolytic cleavage of a membrane-tethered transcription factor by a metalloprotease depends on ATP.
  • SpoIVFB cleaves pro-sigK to release active sigK
    "The Bacillus subtilis SpoIVFB protein is a putative metalloprotease that cleaves membrane-tethered Pro-sigma(K), releasing sigma(K) to direct transcription of genes necessary for spore formation."
  • SpoIVFB CBS domain interacts with pro-sigK and ATP modulates this interaction
    "The CBS domain of SpoIVFB was shown to interact with Pro-sigma(K) and ATP changed the interaction"
UniProt:P12254
UniProt entry for RNA polymerase sigma-K factor (sigK/spoIVCB/spoIIIC)
  • SigK is synthesized as a precursor with a propeptide that is removed
    "PROPEP 1..20 /id="PRO_0000032588""
  • SigK contains a helix-turn-helix DNA-binding motif at residues 251-270
    "DNA_BIND 251..270 /note="H-T-H motif""
  • SigK belongs to the sigma-70 factor family
    "Belongs to the sigma-70 factor family."
  • Sigma factors are initiation factors that promote RNAP attachment to specific sites
    "Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released."
  • The sigK gene is produced by chromosomal rearrangement - excision of skin element
    "The spoIVCB gene is an incomplete structural gene that is capable of encoding only the N-terminal half of sigma-K."
  • SigK interacts with SpoIVFB
    "P12254; P26937: spoIVFB; NbExp=2; IntAct=EBI-15806052, EBI-15806037"
file:BACSU/sigK/sigK-deep-research-falcon.md
Deep research report on sigK function and regulation
  • SigK operates in a hierarchical feed-forward mother-cell program
    "sigK operates in a hierarchical, feed-forward mother-cell program following sigE and preceding GerE, activating coat/cortex/germination genes"

Suggested Questions for Experts

Q: What is the complete SigK regulon and how does the feed-forward regulation with GerE sculpt the temporal expression pattern?

Q: How does SigK promoter recognition specificity differ from that of the early mother cell sigma factor SigE?

Q: Is skin element excision absolutely required for sigK function, or can strains without the skin element sporulate normally?

Suggested Experiments

Experiment: ChIP-seq analysis of SigK binding during late sporulation would enable genome-wide identification of SigK binding sites, comprehensively defining the SigK regulon and validating the regulatory relationships with GerE.

Hypothesis: SigK binds to specific promoter sequences genome-wide to activate a defined late mother cell transcriptional program

Type: ChIP-seq

Experiment: Cryo-EM structural analysis of the full pro-sigK-SpoIVFB-BofA-SpoIVFA inhibitory complex would reveal the molecular mechanism of sigK sequestration prior to activation and how SpoIVB/CtpB-mediated cleavage relieves this inhibition.

Hypothesis: BofA and SpoIVFA form a steric block preventing pro-sigK from accessing the SpoIVFB active site

Type: Structural biology

📚 Additional Documentation

Deep Research Bioreason

(sigK-deep-research-bioreason.md)

BioReason Chat Export

Exported on March 22, 2026 at 02:27 AM


Organism: Bacillus subtilis

Sequence:

MVTGVFAALGFVVKELVFLVSYVKNNAFPQPLSSSEEKKYLELMAKGDEHARNMLIEHNLRLVAHIVKKFENTGEDAEDLISIGTIGLIKGIESYSAGKGTKLATYAARCIENEIVITKGGCIHPSLIRFNIYGVRIHNGNFFHDKVNNCFFIFKSMPPLFVMNNEILMHLRALKKTKKDVSLHDPIGQDKEGNEISLIDVLKSENEDVIDTIQLNMELEKVKQYIDILDDREKEVIVGRFGLDLKKEKTQREIAKELGISRSYVSRIEKRALMKMFHEFYRAEKEKRKKAKGK

Thinking Trace

I begin by dissecting the InterPro architecture and its order along the polypeptide. The N-terminal half is defined by IPR000943 (RNA polymerase sigma-70, residues 5–294) and reinforced by IPR014284 (RNA polymerase sigma-70-like domain, residues 50–276), establishing a canonical σ70-type initiation factor scaffold. Within this scaffold, region 2 is specified by IPR013325 (sigma factor region 2 homologous superfamily, residues 10–116), IPR007627 (sigma-70 region 2, residues 55–115), and the family signature IPR050813 (Bacterial Sigma-70 Factor, residues 17–115). Region 2 mediates high-affinity binding to the RNA polymerase core and recognizes the −10 promoter element via the σ2 helix-turn-helix and acidic tail that stabilizes open-complex formation. The C-terminal half transitions into region 3/4: IPR013324 (sigma factor region 3/4-like homologous superfamily, residues 192–278) and IPR007630 (sigma-70 region 4, residues 226–276) together encode the promoter-recognition and initiation-control apparatus. Region 4 carries IPR001387 (Cro/C1-type helix-turn-helix domain, residues 250–270), which forms the helix-turn-helix that docks onto the −35 element and couples to the β-flap of the core enzyme. The contiguous arrangement—region 2 followed by region 3/4 capped by the HTH-bearing region 4—creates a promoter-targeting module that binds the bacterial RNA polymerase core and enforces promoter specificity.

This domain architecture causes a specific molecular function: sigma-70-type factors are the intrinsic initiation determinants of bacterial RNA polymerases. The composite of σ2 and σ4 modules confers sequence-specific promoter recognition and assembly of the holoenzyme, matching GO:0005515 (the molecular function label provided). Mechanistically, σ2 nucleates core-enzyme binding and −10 recognition, while σ4’s HTH engages −35; together they drive isomerization to the open complex and trigger transcription initiation.

From this function flows the biological process. By imposing promoter selectivity and catalyzing open-complex formation, the protein initiates transcription of the chromosome-encoded transcriptome—an archetypal bacterial transcriptional initiation pathway. This aligns with the biological process labeled BP: GO:0019222, which I interpret as the transcriptional initiation axis that seeds RNA synthesis from DNA templates in bacteria.

Cellular localization follows from the soluble, non-membranous architecture and the bacterial context. Sigma-70-like factors operate with the cytosolic RNA polymerase at nucleoid-associated transcription sites. Thus, the protein resides in the cytoplasm, consistent with the cellular component label CC: GO:0005575 and the absence of transmembrane segments or secretion signals.

Integrating these lines, I hypothesize a mechanism in which the σ2 module (residues 10–116) binds the core polymerase and stabilizes the melted promoter region, while the σ4 HTH (residues 226–276; Cro/C1-type HTH 250–270) anchors to −35 and coordinates β-flap contacts. Together with region 3/4 (192–278), this architecture tunes promoter choice and initiation efficiency. Likely interaction partners include the core RNA polymerase subunits and nucleoid-associated transcription assemblies; transient contacts with other sigma factors may occur during holoenzyme exchange and promoter competition. This cytosolic initiation factor thereby orchestrates promoter-specific assembly of the transcription machinery in Bacillus subtilis.

Functional Summary

A cytosolic bacterial transcription initiation factor that associates with the core RNA polymerase to confer promoter specificity. Its modular architecture binds the polymerase through an N-terminal recognition module and uses a C-terminal helix–turn–helix to engage −35 promoter elements, thereby assembling the initiation-competent holoenzyme, stabilizing the open complex, and launching RNA synthesis from chromosome-embedded promoters in Bacillus subtilis.

UniProt Summary

Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released.

InterPro Domains

  • RNA polymerase sigma-70 (IPR000943, domain) — residues 5-294
  • RNA polymerase sigma factor, region 2 (IPR013325, homologous_superfamily) — residues 10-116
  • Bacterial Sigma-70 Factor (IPR050813, family) — residues 17-115
  • RNA polymerase sigma-70-like domain (IPR014284, domain) — residues 50-276
  • RNA polymerase sigma-70 region 2 (IPR007627, domain) — residues 55-115
  • RNA polymerase sigma factor, region 3/4-like (IPR013324, homologous_superfamily) — residues 192-278
  • RNA polymerase sigma-70 region 4 (IPR007630, domain) — residues 226-276
  • Cro/C1-type, helix-turn-helix domain (IPR001387, domain) — residues 250-270

GO Terms

Molecular Function: molecular_function (GO:0003674), binding (GO:0005488), protein binding (GO:0005515)

Biological Process: biological_process (GO:0008150), biological regulation (GO:0065007), regulation of biological process (GO:0050789), negative regulation of biological process (GO:0048519), regulation of metabolic process (GO:0019222), regulation of cellular process (GO:0050794), negative regulation of cellular process (GO:0048523), negative regulation of metabolic process (GO:0009892), negative regulation of nitrogen compound metabolic process (GO:0051172), negative regulation of macromolecule metabolic process (GO:0010605), regulation of biosynthetic process (GO:0009889), regulation of nitrogen compound metabolic process (GO:0051171), regulation of macromolecule metabolic process (GO:0060255), regulation of cellular metabolic process (GO:0031323), negative regulation of biosynthetic process (GO:0009890), regulation of primary metabolic process (GO:0080090), negative regulation of cellular metabolic process (GO:0031324), regulation of macromolecule biosynthetic process (GO:0010556), regulation of cellular biosynthetic process (GO:0031326), regulation of nucleobase-containing compound metabolic process (GO:0019219), negative regulation of macromolecule biosynthetic process (GO:0010558), negative regulation of cellular biosynthetic process (GO:0031327), regulation of RNA metabolic process (GO:0051252), negative regulation of RNA metabolic process (GO:0051253), negative regulation of nucleobase-containing compound metabolic process (GO:0045934), regulation of gene expression (GO:0010468), regulation of RNA biosynthetic process (GO:2001141), negative regulation of RNA biosynthetic process (GO:1902679), regulation of DNA-templated transcription (GO:0006355), negative regulation of nucleic acid-templated transcription (GO:1903507), regulation of nucleic acid-templated transcription (GO:1903506), negative regulation of DNA-templated transcription (GO:0045892)

Cellular Component: cellular_component (GO:0005575), cellular anatomical entity (GO:0110165), protein-containing complex (GO:0032991), host cellular component (GO:0018995), catalytic complex (GO:1902494), intracellular anatomical structure (GO:0005622), organelle (GO:0043226), intracellular protein-containing complex (GO:0140535), intracellular organelle (GO:0043229), endonuclease complex (GO:1905348), membrane-bounded organelle (GO:0043227), intracellular membrane-bounded organelle (GO:0043231), nucleus (GO:0005634)


Generated by BioReason

Deep Research Bioreason Rl

(sigK-deep-research-bioreason-rl.md)

BioReason Chat Export

Exported on March 22, 2026 at 02:27 AM


Organism: Bacillus subtilis

Sequence:

MVTGVFAALGFVVKELVFLVSYVKNNAFPQPLSSSEEKKYLELMAKGDEHARNMLIEHNLRLVAHIVKKFENTGEDAEDLISIGTIGLIKGIESYSAGKGTKLATYAARCIENEIVITKGGCIHPSLIRFNIYGVRIHNGNFFHDKVNNCFFIFKSMPPLFVMNNEILMHLRALKKTKKDVSLHDPIGQDKEGNEISLIDVLKSENEDVIDTIQLNMELEKVKQYIDILDDREKEVIVGRFGLDLKKEKTQREIAKELGISRSYVSRIEKRALMKMFHEFYRAEKEKRKKAKGK

Thinking Trace

I begin by dissecting the InterPro architecture and its order along the polypeptide. The N-terminal half is defined by IPR000943 (RNA polymerase sigma-70, residues 5–294) and reinforced by IPR014284 (RNA polymerase sigma-70-like domain, residues 50–276), establishing a canonical σ70-type initiation factor scaffold. Within this scaffold, region 2 is specified by IPR013325 (sigma factor region 2 homologous superfamily, residues 10–116), IPR007627 (sigma-70 region 2, residues 55–115), and the family signature IPR050813 (Bacterial Sigma-70 Factor, residues 17–115). Region 2 mediates high-affinity binding to the RNA polymerase core and recognizes the −10 promoter element via the σ2 helix-turn-helix and acidic tail that stabilizes open-complex formation. The C-terminal half transitions into region 3/4: IPR013324 (sigma factor region 3/4-like homologous superfamily, residues 192–278) and IPR007630 (sigma-70 region 4, residues 226–276) together encode the promoter-recognition and initiation-control apparatus. Region 4 carries IPR001387 (Cro/C1-type helix-turn-helix domain, residues 250–270), which forms the helix-turn-helix that docks onto the −35 element and couples to the β-flap of the core enzyme. The contiguous arrangement—region 2 followed by region 3/4 capped by the HTH-bearing region 4—creates a promoter-targeting module that binds the bacterial RNA polymerase core and enforces promoter specificity.

This domain architecture causes a specific molecular function: sigma-70-type factors are the intrinsic initiation determinants of bacterial RNA polymerases. The composite of σ2 and σ4 modules confers sequence-specific promoter recognition and assembly of the holoenzyme, matching GO:0005515 (the molecular function label provided). Mechanistically, σ2 nucleates core-enzyme binding and −10 recognition, while σ4’s HTH engages −35; together they drive isomerization to the open complex and trigger transcription initiation.

From this function flows the biological process. By imposing promoter selectivity and catalyzing open-complex formation, the protein initiates transcription of the chromosome-encoded transcriptome—an archetypal bacterial transcriptional initiation pathway. This aligns with the biological process labeled BP: GO:0019222, which I interpret as the transcriptional initiation axis that seeds RNA synthesis from DNA templates in bacteria.

Cellular localization follows from the soluble, non-membranous architecture and the bacterial context. Sigma-70-like factors operate with the cytosolic RNA polymerase at nucleoid-associated transcription sites. Thus, the protein resides in the cytoplasm, consistent with the cellular component label CC: GO:0005575 and the absence of transmembrane segments or secretion signals.

Integrating these lines, I hypothesize a mechanism in which the σ2 module (residues 10–116) binds the core polymerase and stabilizes the melted promoter region, while the σ4 HTH (residues 226–276; Cro/C1-type HTH 250–270) anchors to −35 and coordinates β-flap contacts. Together with region 3/4 (192–278), this architecture tunes promoter choice and initiation efficiency. Likely interaction partners include the core RNA polymerase subunits and nucleoid-associated transcription assemblies; transient contacts with other sigma factors may occur during holoenzyme exchange and promoter competition. This cytosolic initiation factor thereby orchestrates promoter-specific assembly of the transcription machinery in Bacillus subtilis.

Functional Summary

A cytosolic bacterial transcription initiation factor that associates with the core RNA polymerase to confer promoter specificity. Its modular architecture binds the polymerase through an N-terminal recognition module and uses a C-terminal helix–turn–helix to engage −35 promoter elements, thereby assembling the initiation-competent holoenzyme, stabilizing the open complex, and launching RNA synthesis from chromosome-embedded promoters in Bacillus subtilis.

UniProt Summary

Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released.

InterPro Domains

  • RNA polymerase sigma-70 (IPR000943, domain) — residues 5-294
  • RNA polymerase sigma factor, region 2 (IPR013325, homologous_superfamily) — residues 10-116
  • Bacterial Sigma-70 Factor (IPR050813, family) — residues 17-115
  • RNA polymerase sigma-70-like domain (IPR014284, domain) — residues 50-276
  • RNA polymerase sigma-70 region 2 (IPR007627, domain) — residues 55-115
  • RNA polymerase sigma factor, region 3/4-like (IPR013324, homologous_superfamily) — residues 192-278
  • RNA polymerase sigma-70 region 4 (IPR007630, domain) — residues 226-276
  • Cro/C1-type, helix-turn-helix domain (IPR001387, domain) — residues 250-270

GO Terms

Molecular Function: molecular_function (GO:0003674), binding (GO:0005488), protein binding (GO:0005515)

Biological Process: biological_process (GO:0008150), biological regulation (GO:0065007), regulation of biological process (GO:0050789), negative regulation of biological process (GO:0048519), regulation of metabolic process (GO:0019222), regulation of cellular process (GO:0050794), negative regulation of cellular process (GO:0048523), negative regulation of metabolic process (GO:0009892), negative regulation of nitrogen compound metabolic process (GO:0051172), negative regulation of macromolecule metabolic process (GO:0010605), regulation of biosynthetic process (GO:0009889), regulation of nitrogen compound metabolic process (GO:0051171), regulation of macromolecule metabolic process (GO:0060255), regulation of cellular metabolic process (GO:0031323), negative regulation of biosynthetic process (GO:0009890), regulation of primary metabolic process (GO:0080090), negative regulation of cellular metabolic process (GO:0031324), regulation of macromolecule biosynthetic process (GO:0010556), regulation of cellular biosynthetic process (GO:0031326), regulation of nucleobase-containing compound metabolic process (GO:0019219), negative regulation of macromolecule biosynthetic process (GO:0010558), negative regulation of cellular biosynthetic process (GO:0031327), regulation of RNA metabolic process (GO:0051252), negative regulation of RNA metabolic process (GO:0051253), negative regulation of nucleobase-containing compound metabolic process (GO:0045934), regulation of gene expression (GO:0010468), regulation of RNA biosynthetic process (GO:2001141), negative regulation of RNA biosynthetic process (GO:1902679), regulation of DNA-templated transcription (GO:0006355), negative regulation of nucleic acid-templated transcription (GO:1903507), regulation of nucleic acid-templated transcription (GO:1903506), negative regulation of DNA-templated transcription (GO:0045892)

Cellular Component: cellular_component (GO:0005575), cellular anatomical entity (GO:0110165), protein-containing complex (GO:0032991), host cellular component (GO:0018995), catalytic complex (GO:1902494), intracellular anatomical structure (GO:0005622), organelle (GO:0043226), intracellular protein-containing complex (GO:0140535), intracellular organelle (GO:0043229), endonuclease complex (GO:1905348), membrane-bounded organelle (GO:0043227), intracellular membrane-bounded organelle (GO:0043231), nucleus (GO:0005634)


Generated by BioReason

Deep Research Falcon

(sigK-deep-research-falcon.md)

provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2025-12-18T08:26:33.950142'
end_time: '2025-12-18T08:36:54.320784'
duration_seconds: 620.37
template_file: templates/gene_research_go_focused.md
template_variables:
organism: BACSU
gene_id: sigK
gene_symbol: sigK
uniprot_accession: P12254
protein_description: 'RecName: Full=RNA polymerase sigma-K factor; AltName: Full=Sigma-27;
AltName: Full=Stage IV sporulation protein CB; Flags: Precursor;'
gene_info: Name=sigK; Synonyms=cisB, spoIIIC, spoIVCB; OrderedLocusNames=BSU25760/BSU26390;
organism_full: Bacillus subtilis (strain 168).
protein_family: Belongs to the sigma-70 factor family. .
protein_domains: Cro/C1-type_HTH. (IPR001387); RNA_pol_sigma-70_dom. (IPR014284);
RNA_pol_sigma70. (IPR000943); RNA_pol_sigma70_r2. (IPR007627); RNA_pol_sigma70_r4.
(IPR007630)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 40


Question

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: P12254
  • Protein Description: RecName: Full=RNA polymerase sigma-K factor; AltName: Full=Sigma-27; AltName: Full=Stage IV sporulation protein CB; Flags: Precursor;
  • Gene Information: Name=sigK; Synonyms=cisB, spoIIIC, spoIVCB; OrderedLocusNames=BSU25760/BSU26390;
  • Organism (full): Bacillus subtilis (strain 168).
  • Protein Family: Belongs to the sigma-70 factor family. .
  • Key Domains: Cro/C1-type_HTH. (IPR001387); RNA_pol_sigma-70_dom. (IPR014284); RNA_pol_sigma70. (IPR000943); RNA_pol_sigma70_r2. (IPR007627); RNA_pol_sigma70_r4. (IPR007630)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "sigK" matches the protein description above
  2. Verify the organism is correct: Bacillus subtilis (strain 168).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'sigK' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene sigK (gene ID: sigK, UniProt: P12254) in BACSU.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output

Question: You are an expert researcher providing comprehensive, well-cited information.

Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies

Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: P12254
  • Protein Description: RecName: Full=RNA polymerase sigma-K factor; AltName: Full=Sigma-27; AltName: Full=Stage IV sporulation protein CB; Flags: Precursor;
  • Gene Information: Name=sigK; Synonyms=cisB, spoIIIC, spoIVCB; OrderedLocusNames=BSU25760/BSU26390;
  • Organism (full): Bacillus subtilis (strain 168).
  • Protein Family: Belongs to the sigma-70 factor family. .
  • Key Domains: Cro/C1-type_HTH. (IPR001387); RNA_pol_sigma-70_dom. (IPR014284); RNA_pol_sigma70. (IPR000943); RNA_pol_sigma70_r2. (IPR007627); RNA_pol_sigma70_r4. (IPR007630)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "sigK" matches the protein description above
  2. Verify the organism is correct: Bacillus subtilis (strain 168).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'sigK' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene sigK (gene ID: sigK, UniProt: P12254) in BACSU.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Research plan and verification: We verified the target is Bacillus subtilis (strain 168) sigK (UniProt P12254), encoding the RNA polymerase sigma-K factor (σK), a σ70-family transcription factor produced as an inactive pro-σK precursor and activated late in sporulation within the mother cell. All evidence below pertains to this protein/organism and is consistent with σ70-family domain architecture and σK’s role in the sporulation sigma cascade (σE/σK in the mother cell and σF/σG in the forespore) (eichenberger2004theprogramof pages 1-2, galperin2022conservationandevolution pages 7-9, phillips2002bacillussubtilissporulation pages 7-9).

Module Key concept Mechanistic highlights Timing / localization Top sources (year)
Identity / family sigK encodes pro-σK, a member of the σ70-family of transcription factors; produced as an inactive precursor (pro-σK) requiring proteolytic activation Pro-σK is synthesized as a proprotein with N-terminal inhibitory segment that must be removed for σK to bind RNAP and direct late mother-cell transcription (σ-classified as rK). Active in the mother cell after engulfment completion; σK functions in cytoplasm after pro-domain removal and release from membrane vicinity (mother-cell compartment). (eichenberger2004theprogramof pages 1-2, galperin2022conservationandevolution pages 7-9, ramirezguadiana2018evidencethatregulation pages 12-14)
Activation (SpoIVFB/BofA/SpoIVFA; SpoIVB/CtpB) Regulated intramembrane proteolysis (RIP): SpoIVFB (S2P-family intramembrane metalloprotease) cleaves Pro-σK when inhibition relieved SpoIVFB is inhibited by SpoIVFA and BofA which sterically block substrate access; forespore-produced proteases SpoIVB and CtpB cleave inhibitory partners (site-1 signalling), triggering SpoIVFB lateral-gate opening and Pro-σK processing. Activation requires intercompartmental signalling from forespore (SpoIVB/CtpB) to mother-cell SpoIVFB complex; spatially restricted to forespore-surrounding/mother-cell membranes during late engulfment. (olenic2022inhibitoryproteinsblock pages 19-21, orlando2024substrateengagementby pages 1-2, ramirezguadiana2018evidencethatregulation pages 18-19, ramirezguadiana2018evidencethatregulation pages 9-12)
Structural insights (SpoIVFB–Pro-σK cryo-EM) Cryo-EM/mutational data reveal substrate positioning and putative water access path to membrane-buried active site SpoIVFB engages Pro-σK via β-sheet augmentation; interdomain linker and membrane lipids facilitate substrate capture; supports substrate-gating/lateral-gate model for regulated intramembrane proteolysis. Structural complex solved (cryo-EM) of SpoIVFB bound to Pro-σK shows membrane-embedded active site and substrate path relevant during mother-cell processing. (orlando2024substrateengagementby pages 1-2, olenic2022inhibitoryproteinsblock pages 19-21)
Skin element excision (SpoIVCA) sigK is interrupted by a prophage-like 'skin' element in B. subtilis 168; SpoIVCA (site-specific recombinase) excises skin to restore an intact sigK spoIVCA (large serine recombinase) mediates precise excision of skin; expression of spoIVCA and timing of excision are controlled by mother-cell regulators (σE and SpoIIID) to restrict excision to the mother cell. Excision occurs toward end of engulfment in the mother cell, enabling later transcription/translation of pro-σK. (galperin2022conservationandevolution pages 7-9, serrano2016arecombinationdirectionality pages 2-3, serrano2016arecombinationdirectionality pages 5-7)
Sigma cascade hierarchy (σF/σE/σG → σK) Sequential, compartmentalized sigma cascade controls progressive sporulation programs across forespore and mother cell σF→σG in forespore and σE→(SpoIIID/GerR)→σK in mother cell; multiple feed-forward and repressive interactions ensure correct temporal order and prevent premature σK activity. Cascade is spatially compartmentalized (forespore vs mother cell); σK activation restricted to mother cell post-engulfment. (phillips2002bacillussubtilissporulation pages 7-9, eichenberger2004theprogramof pages 1-2)
σK regulon & GerE feed-forward control σK activates late mother-cell genes (coat/cortex/germination-related); GerE acts downstream to refine the late program σK directly activates a core set of late genes (many coat proteins); GerE both represses a large subset of σK-activated genes and activates a final cohort, forming linked feed-forward loops that sculpt late transcription. σK-driven transcription occurs after engulfment completion and pro-σK processing; GerE-dependent remodeling defines final gene expression stage for coat/cortex assembly. (eichenberger2004theprogramof pages 1-2, eichenberger2004theprogramof pages 4-5, eichenberger2004theprogramof pages 9-10)
Quantitative mother-cell program metrics Sizes and temporal structure of mother-cell regulons and program duration Mother-cell program activates ~383 genes (≈242 transcription units); σE controls ~262 genes (≈163 TUs); σK newly activates ~75 genes (44 TUs) (core rK=103 genes; up to ~144 when GerE-dependent genes included); expression occurs over ~5 h window post-asymmetric division. Program spans ~5 hours of mother-cell differentiation after asymmetric septation, with sampling/timepoints typically at ~2.5, 3.5, 4.5, 5.5, 6.5 h in transcriptional studies. (eichenberger2004theprogramof pages 1-2, eichenberger2004theprogramof pages 4-5, eichenberger2004theprogramof pages 2-3)

Table: Compact reference table summarizing sigK (pro-σK/σK) identity, activation mechanism, structural insights, skin excision, sigma cascade position, regulon control by GerE, and quantitative mother-cell program metrics with primary sources for each claim.

1) Key concepts and definitions (current understanding)
- Identity and compartment: sigK encodes pro-σK, a σ70-family sigma factor that, after proteolytic removal of its N-terminal pro-sequence, functions with RNA polymerase in the mother-cell cytoplasm to drive late sporulation gene expression (e.g., coat/cortex/germination genes) (PLoS Biol, 2004; https://doi.org/10.1371/journal.pbio.0020328; Journal of Bacteriology, 2022; https://doi.org/10.1128/jb.00079-22) (eichenberger2004theprogramof pages 1-2, galperin2022conservationandevolution pages 7-9).
- Activation by regulated intramembrane proteolysis (RIP): The intramembrane metalloprotease SpoIVFB (site-2 protease, S2P family) cleaves pro-σK within the mother-cell membrane system, converting it to active σK. SpoIVFB is held inactive by SpoIVFA and BofA, and activation is triggered by intercompartmental signaling proteases made in the forespore (SpoIVB and CtpB) that relieve inhibition and permit substrate access (“substrate gating”) (eLife, 2022; https://doi.org/10.7554/eLife.74275; PLoS Genet, 2018; https://doi.org/10.1371/journal.pgen.1007753) (olenic2022inhibitoryproteinsblock pages 19-21, ramirezguadiana2018evidencethatregulation pages 18-19, ramirezguadiana2018evidencethatregulation pages 9-12, ramirezguadiana2018evidencethatregulation pages 12-14).
- Developmental gene rearrangement (“skin” excision) required for sigK expression: In B. subtilis 168, sigK is split by a prophage-like element (skinBs) into spoIVCB and spoIIIC; SpoIVCA, a large serine recombinase, excises skinBs late in engulfment to reconstitute sigK in the mother cell. Skin elements vary widely across Firmicutes but consistently encode SpoIVCA-like recombinases (Journal of Bacteriology, 2022; https://doi.org/10.1128/jb.00079-22; CMLS, 2002; https://doi.org/10.1007/s00018-002-8431-9) (galperin2022conservationandevolution pages 7-9, phillips2002bacillussubtilissporulation pages 7-9).
- Position in the sigma cascade: σF (forespore) activates σE (mother cell) via SpoIIR/SpoIIGA, σE enables σG (forespore) and sets the stage for σK activation in the mother cell; σK then drives late transcription and is further sculpted by GerE. Feed-forward and repressive interactions enforce temporal order and compartment specificity (CMLS, 2002; https://doi.org/10.1007/s00018-002-8431-9; PLoS Biol, 2004; https://doi.org/10.1371/journal.pbio.0020328) (phillips2002bacillussubtilissporulation pages 7-9, eichenberger2004theprogramof pages 1-2, eichenberger2004theprogramof pages 2-3).

2) Recent developments and latest research (2023–2024 prioritized)
- Cryo-EM structure of SpoIVFB–pro-σK (2024): A high-resolution cryo-EM study resolved SpoIVFB bound to pro-σK, showing β-sheet augmentation for substrate engagement, critical contacts with the interdomain linker, and a plausible membrane water-access path to the buried active site; MD suggests lipid participation in substrate capture. These data substantiate the lateral/substrate gating model for regulated intramembrane proteolysis of pro-σK (Nat Commun, 2024; https://doi.org/10.1038/s41467-024-52634-6) (orlando2024substrateengagementby pages 1-2).
- Inhibitory complex mechanism (2022 update): Cross-linking and modeling show that BofA (with SpoIVFA) occupies SpoIVFB’s active-site cleft to block substrate access; SpoIVB/CtpB cleavage of inhibitory partners relieves inhibition, enabling pro-σK processing. This provides a concrete steric mechanism for RIP control (eLife, 2022; https://doi.org/10.7554/eLife.74275) (olenic2022inhibitoryproteinsblock pages 19-21, olenic2022inhibitoryproteinsblock pages 30-31).
- Substrate-gating and conformational control (2018): In vivo genetics, imaging, and covariation-guided modeling support SpoIVFB closed/open states gated by F66 and IVB-dependent signaling; ATP-binding to the SpoIVFB CBS domain modulates efficiency but is not strictly required (PLOS Genet, 2018; https://doi.org/10.1371/journal.pgen.1007753) (ramirezguadiana2018evidencethatregulation pages 9-12, ramirezguadiana2018evidencethatregulation pages 4-5, ramirezguadiana2018evidencethatregulation pages 12-14, ramirezguadiana2018evidencethatregulation pages 14-15).
- Skin element diversity and conservation (2022): Comparative genomics across Firmicutes shows variable skin element sizes (2.6–48 kb), conserved SpoIVCA recombinase placement, and universal presence of the sporulation sigma set, including sigK, among spore formers (Journal of Bacteriology, 2022; https://doi.org/10.1128/jb.00079-22) (galperin2022conservationandevolution pages 7-9).

3) Current applications and real-world implementations
- Mechanistic insights from σK activation have broader relevance for regulated intramembrane proteolysis (RIP) across domains of life, informing inhibitor design and signal-transduction logic in membrane-embedded proteases. The SpoIVFB–BofA–SpoIVFA inhibitory interface and steric occlusion of the active-site cleft suggest design principles for modulators of S2P-family proteases (eLife, 2022; https://doi.org/10.7554/eLife.74275). The cryo-EM structure of SpoIVFB–pro-σK further enables structure-guided hypotheses for lipid-dependent substrate capture within membranes (Nat Commun, 2024; https://doi.org/10.1038/s41467-024-52634-6) (olenic2022inhibitoryproteinsblock pages 19-21, orlando2024substrateengagementby pages 1-2).
- Conservation analyses connecting σK and skin recombinases across spore-forming pathogens underscore translational relevance for controlling sporulation traits (e.g., cortex/coat assembly and germination), with implications for persistence/interventions in Bacillota pathogens (Journal of Bacteriology, 2022; https://doi.org/10.1128/jb.00079-22) (galperin2022conservationandevolution pages 7-9).

4) Expert opinions and analysis from authoritative sources
- Mother-cell program architecture: A hierarchical series of coherent and incoherent feed-forward loops (σE→SpoIIID/GerR→σK→GerE) produces successive transcriptional pulses, ensuring timely shutoff of earlier genes and activation of late genes for spore maturation (PLoS Biol, 2004; https://doi.org/10.1371/journal.pbio.0020328) (eichenberger2004theprogramof pages 1-2, eichenberger2004theprogramof pages 2-3, eichenberger2004theprogramof pages 9-10).
- Intercompartmental signaling and spatial control: Activation of pro-σK requires forespore-secreted proteases SpoIVB and CtpB that act in the intermembrane space to cleave inhibitory partners, thereby coupling σK activation to developmental stage and membrane topology late in engulfment (CMLS, 2002; https://doi.org/10.1007/s00018-002-8431-9; PLoS Genet, 2018; https://doi.org/10.1371/journal.pgen.1007753) (phillips2002bacillussubtilissporulation pages 7-9, ramirezguadiana2018evidencethatregulation pages 18-19, ramirezguadiana2018evidencethatregulation pages 9-12).
- Skin excision logic: SpoIVCA-dependent excision is controlled by σE and SpoIIID to ensure that functional sigK is reconstituted only in the mother cell late in development, preventing premature σK activity (Journal of Bacteriology, 2022; https://doi.org/10.1128/jb.00079-22) (galperin2022conservationandevolution pages 7-9, serrano2016arecombinationdirectionality pages 5-7).

5) Relevant statistics and data from recent/authoritative studies
- Quantitative mother-cell program (B. subtilis 168): The mother-cell lineage activates approximately 383 genes (~242 transcription units) over about 5 hours following asymmetric division. σE activates ~262 genes (163 TUs), many of which are then repressed by SpoIIID/GerR. σK newly activates ~75 genes (44 TUs) among a core σK regulon of 103 genes (63 TUs), expanding to ~144 genes (94 TUs) when strongly GerE-dependent σK targets are included. GerE represses roughly half of σK-activated genes while activating a final set of 36 genes (27 TUs) (PLoS Biol, 2004; https://doi.org/10.1371/journal.pbio.0020328) (eichenberger2004theprogramof pages 1-2, eichenberger2004theprogramof pages 4-5, eichenberger2004theprogramof pages 8-9, eichenberger2004theprogramof pages 9-10, eichenberger2004theprogramof pages 2-3).
- σK regulon content: The σK regulon includes many coat genes (e.g., cotA, cotB, cotD, cotE, cotH) and other maturation factors; GerE downstream further refines coat/cortex/germination gene expression through activation and repression in a feed-forward motif (PLoS Biol, 2004; https://doi.org/10.1371/journal.pbio.0020328) (eichenberger2004theprogramof pages 3-4, eichenberger2004theprogramof pages 4-5, eichenberger2004theprogramof pages 10-11).
- Skin element properties: In B. subtilis 168, the sigK gene is interrupted by a ~48 kb skinBs element; SpoIVCA recombinase excises skinBs towards the end of engulfment to reconstitute sigK. Across Firmicutes, skin elements vary from ~2.6–48 kb and consistently encode SpoIVCA-like recombinases near spoIIIC, reflecting likely phage-derived ancestry (Journal of Bacteriology, 2022; https://doi.org/10.1128/jb.00079-22; CMLS, 2002; https://doi.org/10.1007/s00018-002-8431-9) (galperin2022conservationandevolution pages 7-9, phillips2002bacillussubtilissporulation pages 7-9).
- Localization and kinetics of activation: Imaging and time-course assays show pro-σK associates with SpoIVFB at forespore-surrounding mother-cell membranes, and its processing/σK activation correlate with SpoIVB-dependent signaling onset; σK-responsive reporters (e.g., PgerE-lacZ) and immunoblots quantify processing during sporulation. Mutations such as SpoIVFB(F66A) support gating models (PLOS Genet, 2018; https://doi.org/10.1371/journal.pgen.1007753) (ramirezguadiana2018evidencethatregulation pages 18-19, ramirezguadiana2018evidencethatregulation pages 9-12, ramirezguadiana2018evidencethatregulation pages 12-14).

Focused functional narrative for sigK (UniProt P12254)
- Primary function: σK is a transcription initiation specificity factor (σ70 family) that reprograms RNA polymerase to transcribe late sporulation genes in the mother cell. It does not catalyze a metabolic reaction; rather, it recognizes σK-specific promoter motifs to activate target genes (PLoS Biol, 2004; https://doi.org/10.1371/journal.pbio.0020328) (eichenberger2004theprogramof pages 1-2, eichenberger2004theprogramof pages 8-9).
- Activation mechanism: pro-σK is processed by SpoIVFB, an S2P intramembrane metalloprotease, once inhibition by SpoIVFA/BofA is relieved by forespore-derived SpoIVB/CtpB proteolysis. Structural and biochemical data reveal substrate β-sheet augmentation and steric blocking of the active-site cleft by inhibitors, with signaling converting SpoIVFB to an open, substrate-accessible state (Nat Commun, 2024; https://doi.org/10.1038/s41467-024-52634-6; eLife, 2022; https://doi.org/10.7554/eLife.74275; PLoS Genet, 2018; https://doi.org/10.1371/journal.pgen.1007753) (orlando2024substrateengagementby pages 1-2, olenic2022inhibitoryproteinsblock pages 19-21, ramirezguadiana2018evidencethatregulation pages 9-12, ramirezguadiana2018evidencethatregulation pages 12-14).
- Gene rearrangement prerequisite: In strain 168, skinBs excision by SpoIVCA reconstitutes intact sigK late in engulfment under σE/SpoIIID control, ensuring σK availability only in the mother cell (Journal of Bacteriology, 2022; https://doi.org/10.1128/jb.00079-22) (galperin2022conservationandevolution pages 7-9, serrano2016arecombinationdirectionality pages 5-7).
- Cellular localization: σK activity occurs in the mother-cell cytoplasm after pro-σK processing; pro-σK engagement occurs at forespore-surrounding mother-cell membranes where SpoIVFB and its inhibitors localize (PLOS Genet, 2018; https://doi.org/10.1371/journal.pgen.1007753) (ramirezguadiana2018evidencethatregulation pages 18-19, ramirezguadiana2018evidencethatregulation pages 9-12, ramirezguadiana2018evidencethatregulation pages 14-15).
- Pathway position and outputs: σK operates in a hierarchical, feed-forward mother-cell program following σE and preceding GerE, activating coat/cortex/germination genes and orchestrating mother-cell lysis for spore release. Quantitatively, ~75 newly activated genes (of a 103-gene core σK regulon; up to ~144 with GerE-dependent targets) characterize this phase (PLoS Biol, 2004; https://doi.org/10.1371/journal.pbio.0020328) (eichenberger2004theprogramof pages 1-2, eichenberger2004theprogramof pages 4-5, eichenberger2004theprogramof pages 8-9, eichenberger2004theprogramof pages 9-10).

Notes on verification and ambiguity
- The symbol “sigK” is used across spore-forming Firmicutes, occasionally with distinct features (e.g., σK lacking a pro-sequence in C. difficile), but all core claims above are restricted to B. subtilis 168 and its σK (pro-σK processing by SpoIVFB and skinBs excision by SpoIVCA) (PLOS Genet, 2016; https://doi.org/10.1371/journal.pgen.1006312) (serrano2016arecombinationdirectionality pages 2-3, serrano2016arecombinationdirectionality pages 19-20, serrano2016arecombinationdirectionality pages 20-21).

References

  1. (eichenberger2004theprogramof pages 1-2): Patrick Eichenberger, Masaya Fujita, Shane T Jensen, Erin M Conlon, David Z Rudner, Stephanie T Wang, Caitlin Ferguson, Koki Haga, Tsutomu Sato, Jun S Liu, and Richard Losick. The program of gene transcription for a single differentiating cell type during sporulation in bacillus subtilis. PLoS Biology, 2:e328, Sep 2004. URL: https://doi.org/10.1371/journal.pbio.0020328, doi:10.1371/journal.pbio.0020328. This article has 474 citations and is from a highest quality peer-reviewed journal.

  2. (galperin2022conservationandevolution pages 7-9): Michael Y. Galperin, Natalya Yutin, Yuri I. Wolf, Roberto Vera Alvarez, and Eugene V. Koonin. Conservation and evolution of the sporulation gene set in diverse members of the firmicutes. Journal of Bacteriology, Jun 2022. URL: https://doi.org/10.1128/jb.00079-22, doi:10.1128/jb.00079-22. This article has 88 citations and is from a peer-reviewed journal.

  3. (phillips2002bacillussubtilissporulation pages 7-9): Z. E. V. Phillips and M. A. Strauch. Bacillus subtilis sporulation and stationary phase gene expression. Cellular and Molecular Life Sciences CMLS, 59:392-402, Mar 2002. URL: https://doi.org/10.1007/s00018-002-8431-9, doi:10.1007/s00018-002-8431-9. This article has 275 citations.

  4. (ramirezguadiana2018evidencethatregulation pages 12-14): Fernando H. Ramírez-Guadiana, Christopher D. A. Rodrigues, Kathleen A. Marquis, Nathalie Campo, Rocío del Carmen Barajas-Ornelas, Kelly Brock, Debora S. Marks, Andrew C. Kruse, and David Z. Rudner. Evidence that regulation of intramembrane proteolysis is mediated by substrate gating during sporulation in bacillus subtilis. PLOS Genetics, 14:e1007753, Nov 2018. URL: https://doi.org/10.1371/journal.pgen.1007753, doi:10.1371/journal.pgen.1007753. This article has 24 citations and is from a domain leading peer-reviewed journal.

  5. (olenic2022inhibitoryproteinsblock pages 19-21): Sandra Olenic, Lim Heo, Michael Feig, and Lee Kroos. Inhibitory proteins block substrate access by occupying the active site cleft of bacillus subtilis intramembrane protease spoivfb. eLife, Apr 2022. URL: https://doi.org/10.7554/elife.74275, doi:10.7554/elife.74275. This article has 8 citations and is from a domain leading peer-reviewed journal.

  6. (orlando2024substrateengagementby pages 1-2): Melanie A. Orlando, Hunter J. T. Pouillon, Saikat Mandal, Lee Kroos, and Benjamin J. Orlando. Substrate engagement by the intramembrane metalloprotease spoivfb. Nature Communications, Oct 2024. URL: https://doi.org/10.1038/s41467-024-52634-6, doi:10.1038/s41467-024-52634-6. This article has 3 citations and is from a highest quality peer-reviewed journal.

  7. (ramirezguadiana2018evidencethatregulation pages 18-19): Fernando H. Ramírez-Guadiana, Christopher D. A. Rodrigues, Kathleen A. Marquis, Nathalie Campo, Rocío del Carmen Barajas-Ornelas, Kelly Brock, Debora S. Marks, Andrew C. Kruse, and David Z. Rudner. Evidence that regulation of intramembrane proteolysis is mediated by substrate gating during sporulation in bacillus subtilis. PLOS Genetics, 14:e1007753, Nov 2018. URL: https://doi.org/10.1371/journal.pgen.1007753, doi:10.1371/journal.pgen.1007753. This article has 24 citations and is from a domain leading peer-reviewed journal.

  8. (ramirezguadiana2018evidencethatregulation pages 9-12): Fernando H. Ramírez-Guadiana, Christopher D. A. Rodrigues, Kathleen A. Marquis, Nathalie Campo, Rocío del Carmen Barajas-Ornelas, Kelly Brock, Debora S. Marks, Andrew C. Kruse, and David Z. Rudner. Evidence that regulation of intramembrane proteolysis is mediated by substrate gating during sporulation in bacillus subtilis. PLOS Genetics, 14:e1007753, Nov 2018. URL: https://doi.org/10.1371/journal.pgen.1007753, doi:10.1371/journal.pgen.1007753. This article has 24 citations and is from a domain leading peer-reviewed journal.

  9. (serrano2016arecombinationdirectionality pages 2-3): Mónica Serrano, Nicolas Kint, Fátima C. Pereira, Laure Saujet, Pierre Boudry, Bruno Dupuy, Adriano O. Henriques, and Isabelle Martin-Verstraete. A recombination directionality factor controls the cell type-specific activation of σk and the fidelity of spore development in clostridium difficile. PLOS Genetics, 12:e1006312, Sep 2016. URL: https://doi.org/10.1371/journal.pgen.1006312, doi:10.1371/journal.pgen.1006312. This article has 42 citations and is from a domain leading peer-reviewed journal.

  10. (serrano2016arecombinationdirectionality pages 5-7): Mónica Serrano, Nicolas Kint, Fátima C. Pereira, Laure Saujet, Pierre Boudry, Bruno Dupuy, Adriano O. Henriques, and Isabelle Martin-Verstraete. A recombination directionality factor controls the cell type-specific activation of σk and the fidelity of spore development in clostridium difficile. PLOS Genetics, 12:e1006312, Sep 2016. URL: https://doi.org/10.1371/journal.pgen.1006312, doi:10.1371/journal.pgen.1006312. This article has 42 citations and is from a domain leading peer-reviewed journal.

  11. (eichenberger2004theprogramof pages 4-5): Patrick Eichenberger, Masaya Fujita, Shane T Jensen, Erin M Conlon, David Z Rudner, Stephanie T Wang, Caitlin Ferguson, Koki Haga, Tsutomu Sato, Jun S Liu, and Richard Losick. The program of gene transcription for a single differentiating cell type during sporulation in bacillus subtilis. PLoS Biology, 2:e328, Sep 2004. URL: https://doi.org/10.1371/journal.pbio.0020328, doi:10.1371/journal.pbio.0020328. This article has 474 citations and is from a highest quality peer-reviewed journal.

  12. (eichenberger2004theprogramof pages 9-10): Patrick Eichenberger, Masaya Fujita, Shane T Jensen, Erin M Conlon, David Z Rudner, Stephanie T Wang, Caitlin Ferguson, Koki Haga, Tsutomu Sato, Jun S Liu, and Richard Losick. The program of gene transcription for a single differentiating cell type during sporulation in bacillus subtilis. PLoS Biology, 2:e328, Sep 2004. URL: https://doi.org/10.1371/journal.pbio.0020328, doi:10.1371/journal.pbio.0020328. This article has 474 citations and is from a highest quality peer-reviewed journal.

  13. (eichenberger2004theprogramof pages 2-3): Patrick Eichenberger, Masaya Fujita, Shane T Jensen, Erin M Conlon, David Z Rudner, Stephanie T Wang, Caitlin Ferguson, Koki Haga, Tsutomu Sato, Jun S Liu, and Richard Losick. The program of gene transcription for a single differentiating cell type during sporulation in bacillus subtilis. PLoS Biology, 2:e328, Sep 2004. URL: https://doi.org/10.1371/journal.pbio.0020328, doi:10.1371/journal.pbio.0020328. This article has 474 citations and is from a highest quality peer-reviewed journal.

  14. (olenic2022inhibitoryproteinsblock pages 30-31): Sandra Olenic, Lim Heo, Michael Feig, and Lee Kroos. Inhibitory proteins block substrate access by occupying the active site cleft of bacillus subtilis intramembrane protease spoivfb. eLife, Apr 2022. URL: https://doi.org/10.7554/elife.74275, doi:10.7554/elife.74275. This article has 8 citations and is from a domain leading peer-reviewed journal.

  15. (ramirezguadiana2018evidencethatregulation pages 4-5): Fernando H. Ramírez-Guadiana, Christopher D. A. Rodrigues, Kathleen A. Marquis, Nathalie Campo, Rocío del Carmen Barajas-Ornelas, Kelly Brock, Debora S. Marks, Andrew C. Kruse, and David Z. Rudner. Evidence that regulation of intramembrane proteolysis is mediated by substrate gating during sporulation in bacillus subtilis. PLOS Genetics, 14:e1007753, Nov 2018. URL: https://doi.org/10.1371/journal.pgen.1007753, doi:10.1371/journal.pgen.1007753. This article has 24 citations and is from a domain leading peer-reviewed journal.

  16. (ramirezguadiana2018evidencethatregulation pages 14-15): Fernando H. Ramírez-Guadiana, Christopher D. A. Rodrigues, Kathleen A. Marquis, Nathalie Campo, Rocío del Carmen Barajas-Ornelas, Kelly Brock, Debora S. Marks, Andrew C. Kruse, and David Z. Rudner. Evidence that regulation of intramembrane proteolysis is mediated by substrate gating during sporulation in bacillus subtilis. PLOS Genetics, 14:e1007753, Nov 2018. URL: https://doi.org/10.1371/journal.pgen.1007753, doi:10.1371/journal.pgen.1007753. This article has 24 citations and is from a domain leading peer-reviewed journal.

  17. (eichenberger2004theprogramof pages 8-9): Patrick Eichenberger, Masaya Fujita, Shane T Jensen, Erin M Conlon, David Z Rudner, Stephanie T Wang, Caitlin Ferguson, Koki Haga, Tsutomu Sato, Jun S Liu, and Richard Losick. The program of gene transcription for a single differentiating cell type during sporulation in bacillus subtilis. PLoS Biology, 2:e328, Sep 2004. URL: https://doi.org/10.1371/journal.pbio.0020328, doi:10.1371/journal.pbio.0020328. This article has 474 citations and is from a highest quality peer-reviewed journal.

  18. (eichenberger2004theprogramof pages 3-4): Patrick Eichenberger, Masaya Fujita, Shane T Jensen, Erin M Conlon, David Z Rudner, Stephanie T Wang, Caitlin Ferguson, Koki Haga, Tsutomu Sato, Jun S Liu, and Richard Losick. The program of gene transcription for a single differentiating cell type during sporulation in bacillus subtilis. PLoS Biology, 2:e328, Sep 2004. URL: https://doi.org/10.1371/journal.pbio.0020328, doi:10.1371/journal.pbio.0020328. This article has 474 citations and is from a highest quality peer-reviewed journal.

  19. (eichenberger2004theprogramof pages 10-11): Patrick Eichenberger, Masaya Fujita, Shane T Jensen, Erin M Conlon, David Z Rudner, Stephanie T Wang, Caitlin Ferguson, Koki Haga, Tsutomu Sato, Jun S Liu, and Richard Losick. The program of gene transcription for a single differentiating cell type during sporulation in bacillus subtilis. PLoS Biology, 2:e328, Sep 2004. URL: https://doi.org/10.1371/journal.pbio.0020328, doi:10.1371/journal.pbio.0020328. This article has 474 citations and is from a highest quality peer-reviewed journal.

  20. (serrano2016arecombinationdirectionality pages 19-20): Mónica Serrano, Nicolas Kint, Fátima C. Pereira, Laure Saujet, Pierre Boudry, Bruno Dupuy, Adriano O. Henriques, and Isabelle Martin-Verstraete. A recombination directionality factor controls the cell type-specific activation of σk and the fidelity of spore development in clostridium difficile. PLOS Genetics, 12:e1006312, Sep 2016. URL: https://doi.org/10.1371/journal.pgen.1006312, doi:10.1371/journal.pgen.1006312. This article has 42 citations and is from a domain leading peer-reviewed journal.

  21. (serrano2016arecombinationdirectionality pages 20-21): Mónica Serrano, Nicolas Kint, Fátima C. Pereira, Laure Saujet, Pierre Boudry, Bruno Dupuy, Adriano O. Henriques, and Isabelle Martin-Verstraete. A recombination directionality factor controls the cell type-specific activation of σk and the fidelity of spore development in clostridium difficile. PLOS Genetics, 12:e1006312, Sep 2016. URL: https://doi.org/10.1371/journal.pgen.1006312, doi:10.1371/journal.pgen.1006312. This article has 42 citations and is from a domain leading peer-reviewed journal.

Citations

  1. orlando2024substrateengagementby pages 1-2
  2. galperin2022conservationandevolution pages 7-9
  3. eichenberger2004theprogramof pages 1-2
  4. phillips2002bacillussubtilissporulation pages 7-9
  5. ramirezguadiana2018evidencethatregulation pages 12-14
  6. olenic2022inhibitoryproteinsblock pages 19-21
  7. ramirezguadiana2018evidencethatregulation pages 18-19
  8. ramirezguadiana2018evidencethatregulation pages 9-12
  9. serrano2016arecombinationdirectionality pages 2-3
  10. serrano2016arecombinationdirectionality pages 5-7
  11. eichenberger2004theprogramof pages 4-5
  12. eichenberger2004theprogramof pages 9-10
  13. eichenberger2004theprogramof pages 2-3
  14. olenic2022inhibitoryproteinsblock pages 30-31
  15. ramirezguadiana2018evidencethatregulation pages 4-5
  16. ramirezguadiana2018evidencethatregulation pages 14-15
  17. eichenberger2004theprogramof pages 8-9
  18. eichenberger2004theprogramof pages 3-4
  19. eichenberger2004theprogramof pages 10-11
  20. serrano2016arecombinationdirectionality pages 19-20
  21. serrano2016arecombinationdirectionality pages 20-21
  22. https://doi.org/10.1371/journal.pbio.0020328;
  23. https://doi.org/10.1128/jb.00079-22
  24. https://doi.org/10.7554/eLife.74275;
  25. https://doi.org/10.1371/journal.pgen.1007753
  26. https://doi.org/10.1128/jb.00079-22;
  27. https://doi.org/10.1007/s00018-002-8431-9
  28. https://doi.org/10.1007/s00018-002-8431-9;
  29. https://doi.org/10.1371/journal.pbio.0020328
  30. https://doi.org/10.1038/s41467-024-52634-6
  31. https://doi.org/10.7554/eLife.74275
  32. https://doi.org/10.1038/s41467-024-52634-6;
  33. https://doi.org/10.1371/journal.pgen.1006312
  34. https://doi.org/10.1371/journal.pbio.0020328,
  35. https://doi.org/10.1128/jb.00079-22,
  36. https://doi.org/10.1007/s00018-002-8431-9,
  37. https://doi.org/10.1371/journal.pgen.1007753,
  38. https://doi.org/10.7554/elife.74275,
  39. https://doi.org/10.1038/s41467-024-52634-6,
  40. https://doi.org/10.1371/journal.pgen.1006312,

Bioreason Rl Review

(sigK-bioreason-rl-review.md)

BioReason-Pro RL Review: sigK (B. subtilis)

Source: sigK-deep-research-bioreason-rl.md

  • Correctness: 3/5
  • Completeness: 2/5

Functional Summary Review

The BioReason functional summary reads:

A cytosolic bacterial transcription initiation factor that associates with the core RNA polymerase to confer promoter specificity. Its modular architecture binds the polymerase through an N-terminal recognition module and uses a C-terminal helix-turn-helix to engage -35 promoter elements, thereby assembling the initiation-competent holoenzyme, stabilizing the open complex, and launching RNA synthesis from chromosome-embedded promoters in Bacillus subtilis.

This is a generic description of a sigma-70 family sigma factor that could apply to essentially any bacterial sigma factor. While technically correct in describing sigma factor biochemistry (GO:0016987), it fails to capture anything specific about SigK.

Key omissions and errors:

  1. Missing identity as late mother cell sigma factor: SigK is specifically the late mother cell-specific sigma factor during sporulation. The curated review emphasizes that SigK activates coat proteins (cotA, cotB, cotD, cotE, cotH), cortex genes, and germination genes. BioReason's summary contains no hint of this.

  2. Missing unique regulatory features: SigK has a remarkable biogenesis -- the sigK gene is interrupted by a ~48 kb skin element that must be excised by SpoIVCA recombinase before the gene is reconstituted. SigK is synthesized as an inactive pro-sigK precursor with a 20-residue propeptide cleaved by the intramembrane metalloprotease SpoIVFB. This regulated intramembrane proteolysis mechanism, coupled to forespore-derived signals (SpoIVB/CtpB), is entirely absent.

  3. No sporulation context: The summary does not mention sporulation at all, despite sporulation (GO:0030435) being a core biological process for SigK.

  4. Erroneous GO predictions: BioReason predicts negative regulation of transcription terms (GO:0045892, GO:1903507) and nucleus (GO:0005634), host cell cytoplasm (GO:0030430), and endonuclease complex -- all incorrect for this bacterial sigma factor.

  5. Missing feed-forward regulation: SigK operates in a hierarchical feed-forward network with GerE, which the curated review identifies as important regulatory context.

Comparison with interpro2go:

The interpro2go annotations for sigK include DNA-binding transcription factor activity (GO:0003700), which the curated review flags as incorrect for sigma factors. BioReason's GO predictions include negative regulation of transcription terms that are likely inherited from the Bacterial Sigma-70 Factor family IPR050813 mappings. Neither interpro2go nor BioReason captures the mother cell specificity, pro-sigK processing, or sporulation context. BioReason provides a more articulate but equally uninformative narrative compared to interpro2go.

Notes on thinking trace

The trace identifies the sigma-70 domain architecture correctly but does not leverage the specific family-level signatures to distinguish SigK from other sigma factors. The Bacterial Sigma-70 Factor (IPR050813) and region 2/4 analysis is generic and could apply to any sigma factor in B. subtilis.

📄 View Raw YAML

id: P12254
gene_symbol: sigK
product_type: PROTEIN
status: DRAFT
taxon:
  id: NCBITaxon:224308
  label: Bacillus subtilis (strain 168)
description: >-
  SigK (also known as spoIVCB/spoIIIC) encodes the RNA polymerase sigma-K factor, the late
  mother cell-specific sigma factor of the sigma-70 family that directs transcription of
  late sporulation genes in B. subtilis. SigK has a unique feature: in strain 168, the
  sigK gene is interrupted by a ~48 kb prophage-like "skin" element (skinBs), and the
  functional sigK is only reconstituted late in sporulation when the SpoIVCA recombinase
  excises this element under sigE/SpoIIID control in the mother cell. SigK is synthesized
  as an inactive precursor (pro-sigK) with a 20-residue propeptide that must be removed by
  the intramembrane metalloprotease SpoIVFB for activation. SpoIVFB is held inactive by
  BofA and SpoIVFA until forespore-secreted proteases SpoIVB and CtpB cleave these inhibitors,
  thereby coupling sigK activation to developmental stage. The mature sigma-K factor contains
  conserved sigma-70 regions including a helix-turn-helix DNA-binding motif (residues 251-270)
  that recognizes sigK-dependent promoters. SigK activates the final phase of mother cell
  gene expression including coat proteins (cotA, cotB, cotD, cotE, cotH), cortex, and
  germination genes. SigK operates in a hierarchical feed-forward network with GerE, which
  both represses some sigK-activated genes and activates additional late genes.

existing_annotations:
- term:
    id: GO:0000976
    label: transcription cis-regulatory region binding
  evidence_type: IBA
  original_reference_id: GO_REF:0000033
  review:
    summary: >-
      This annotation suggests sigK binds to transcription cis-regulatory regions independently.
      However, according to the GO definition of sigma factor activity (GO:0016987), sigma factors
      do not bind DNA on their own but when combined with the core polymerase to form the
      holoenzyme, they confer promoter specificity. This term is technically inaccurate for
      sigma factors as they require the core polymerase for DNA binding.
    action: REMOVE
    reason: >-
      Sigma factors do not bind DNA independently. The GO definition of sigma factor activity
      explicitly states that "sigma does not bind DNA on its own, when combined with the core
      to form the holoenzyme, the sigma factor binds specifically to promoter elements."
      This term implies autonomous DNA binding which is misleading for sigma factors.
      The correct molecular function is GO:0016987 sigma factor activity, which is already
      annotated for sigK.
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released."

- term:
    id: GO:0003899
    label: DNA-directed RNA polymerase activity
  evidence_type: IBA
  original_reference_id: GO_REF:0000033
  review:
    summary: >-
      This annotation incorrectly attributes catalytic RNA polymerase activity to sigK.
      Sigma factors are NOT enzymes and do not catalyze the polymerization reaction.
      The catalytic activity resides in the core polymerase subunits (beta, beta-prime),
      not in sigma factors.
    action: REMOVE
    reason: >-
      This is a clear over-annotation. GO:0003899 is defined as "Catalysis of the reaction:
      nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1)." Sigma factors do not possess
      catalytic activity. The catalytic activity of RNA polymerization is performed by the
      core enzyme subunits, not by sigma factors. SigK confers promoter specificity but does
      not catalyze nucleotide addition. UniProt describes sigma factors as "initiation factors
      that promote the attachment of RNA polymerase to specific initiation sites" - not enzymes.
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released."

- term:
    id: GO:0006355
    label: regulation of DNA-templated transcription
  evidence_type: IBA
  original_reference_id: GO_REF:0000033
  review:
    summary: >-
      SigK regulates transcription by directing RNA polymerase to mother cell-specific promoters
      during late sporulation. The sigK regulon includes coat proteins, cortex genes, and
      germination factors. SigK operates in a feed-forward network with GerE to sculpt the
      late mother cell transcriptional program.
    action: ACCEPT
    reason: >-
      Sigma factors inherently regulate transcription by determining which genes are transcribed.
      SigK specifically activates the late mother cell developmental program. UniProt states
      that "This sigma factor is responsible for the expression of sporulation specific genes
      in the mother cell." Deep research confirms sigK activates coat/cortex/germination genes.
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "This sigma factor is responsible for the expression of sporulation specific genes in the mother cell."
      - reference_id: file:BACSU/sigK/sigK-deep-research-falcon.md
        supporting_text: "sigK operates in a hierarchical, feed-forward mother-cell program following sigE and preceding GerE, activating coat/cortex/germination genes"

- term:
    id: GO:0016987
    label: sigma factor activity
  evidence_type: IBA
  original_reference_id: GO_REF:0000033
  review:
    summary: >-
      This is the primary and most accurate molecular function annotation for sigK. SigK is
      a well-characterized sigma factor of the sigma-70 family. It contains conserved sigma-70
      regions (regions 2 and 4) that mediate core RNAP binding and promoter recognition.
      UniProt explicitly classifies sigK as belonging to the sigma-70 factor family.
    action: ACCEPT
    reason: >-
      This is the canonical molecular function for sigK. UniProt identifies sigK as
      "RNA polymerase sigma-K factor" belonging to the "sigma-70 factor family." The protein
      contains all characteristic sigma-70 family domains including the HTH DNA-binding motif
      (residues 251-270) and polymerase core binding motif (residues 79-92).
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released."
      - reference_id: UniProt:P12254
        supporting_text: "Belongs to the sigma-70 factor family."

- term:
    id: GO:1903865
    label: sigma factor antagonist complex
  evidence_type: IBA
  original_reference_id: GO_REF:0000033
  review:
    summary: >-
      SigK (as pro-sigK) forms a complex with its regulatory proteins SpoIVFB, BofA, and SpoIVFA.
      GO:1903865 is defined as "A protein complex which is capable of sigma factor antagonist
      activity." The BofA/SpoIVFA complex inhibits SpoIVFB-mediated pro-sigK processing, thereby
      preventing premature sigK activation.
    action: ACCEPT
    reason: >-
      UniProt documents the interaction between sigK and SpoIVFB (IntAct:EBI-15806052). PMID:19805276
      demonstrates that "The CBS domain of SpoIVFB was shown to interact with Pro-sigma(K)".
      The pro-sigK-SpoIVFB-BofA-SpoIVFA complex represents a sigma factor antagonist complex
      that prevents premature sigK activity until the forespore signals for activation via
      SpoIVB and CtpB proteases.
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "P12254; P26937: spoIVFB; NbExp=2; IntAct=EBI-15806052, EBI-15806037"
      - reference_id: PMID:19805276
        supporting_text: "The CBS domain of SpoIVFB was shown to interact with Pro-sigma(K) and ATP changed the interaction"

- term:
    id: GO:0003677
    label: DNA binding
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: >-
      This generic DNA binding annotation is inferred from the UniProt DNA-binding keyword.
      SigK does contain a helix-turn-helix DNA-binding motif (residues 251-270) as documented
      in UniProt. However, sigma factors require the core polymerase for DNA binding - they
      do not bind DNA independently.
    action: KEEP_AS_NON_CORE
    reason: >-
      SigK contains a documented HTH DNA-binding motif at residues 251-270 (UniProt feature).
      However, per the GO definition of sigma factor activity, "sigma does not bind DNA on its
      own." The more precise molecular function is GO:0016987 sigma factor activity. This
      annotation is kept as non-core since it reflects domain architecture but is less
      informative than the sigma factor activity annotation.
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "DNA_BIND        251..270 /note=\"H-T-H motif\""

- term:
    id: GO:0003700
    label: DNA-binding transcription factor activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >-
      This annotation from InterPro domain mapping is incorrect for sigma factors. Sigma factors
      are not transcription factors in the classical sense; they are subunits of the RNA
      polymerase holoenzyme that confer promoter specificity rather than independent regulatory
      factors.
    action: MODIFY
    reason: >-
      GO:0003700 is defined as sequence-specific DNA binding that modulates transcription,
      applicable to classical transcription factors that independently bind DNA regulatory
      elements. Sigma factors function differently: they are dissociable subunits of RNA
      polymerase that determine promoter recognition as part of the holoenzyme complex.
      The correct term is GO:0016987 sigma factor activity, which is already annotated for sigK.
    proposed_replacement_terms:
      - id: GO:0016987
        label: sigma factor activity
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released"

- term:
    id: GO:0006351
    label: DNA-templated transcription
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: >-
      SigK is involved in DNA-templated transcription as a component of the RNA polymerase
      holoenzyme. This is an appropriate general biological process annotation inferred from
      the UniProt Transcription keyword.
    action: ACCEPT
    reason: >-
      As a sigma factor, sigK is essential for transcription initiation from specific promoters.
      This general process term appropriately captures sigK's role in transcription. The
      annotation is supported by UniProt keywords (Transcription) and the well-characterized
      function of sigma factors in bacterial transcription.
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "This sigma factor is responsible for the expression of sporulation specific genes in the mother cell."

- term:
    id: GO:0006352
    label: DNA-templated transcription initiation
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >-
      This is a highly appropriate annotation for sigma factors. Sigma factors specifically
      function in transcription initiation by directing RNA polymerase to promoters and are
      released after initiation. SigK acts at the initiation step to enable recognition of
      sigK-dependent promoters.
    action: ACCEPT
    reason: >-
      UniProt states that "Sigma factors are initiation factors that promote the attachment of
      RNA polymerase to specific initiation sites and are then released." The specific role of
      sigma factors in initiation (as opposed to elongation or termination) makes this a precise
      and accurate annotation for sigK.
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released"

- term:
    id: GO:0006355
    label: regulation of DNA-templated transcription
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >-
      Duplicate annotation (same term as the IBA annotation above) from automated inference.
      This accurately captures sigK's regulatory role in transcription of the late mother
      cell sporulation program.
    action: ACCEPT
    reason: >-
      This is a duplicate of the IBA annotation for the same term. Both annotations are valid
      and supported by the same evidence. Sigma factors regulate transcription by determining
      which genes are expressed. SigK specifically regulates the late mother cell gene
      expression program during sporulation.
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "This sigma factor is responsible for the expression of sporulation specific genes in the mother cell."

- term:
    id: GO:0016987
    label: sigma factor activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: >-
      Duplicate annotation (same term as the IBA annotation above) from automated inference
      based on UniProt Sigma factor keyword.
    action: ACCEPT
    reason: >-
      This is a duplicate of the IBA annotation for sigma factor activity. Both annotations are
      valid. This is the core molecular function of sigK, supported by the UniProt Sigma factor
      keyword and sigma-70 family domain architecture.
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "Belongs to the sigma-70 factor family."

- term:
    id: GO:0030435
    label: sporulation resulting in formation of a cellular spore
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: >-
      SigK is the late mother cell-specific sigma factor essential for sporulation in B. subtilis.
      It activates genes required for coat assembly, cortex formation, and germination, all
      essential for production of a mature spore. This biological process annotation appropriately
      captures the developmental context of sigK function.
    action: ACCEPT
    reason: >-
      SigK is specifically active in the mother cell during late sporulation and is absolutely
      required for spore formation. UniProt states it is "responsible for the expression of
      sporulation specific genes in the mother cell." PMID:19805276 confirms that sigK directs
      "transcription of genes necessary for spore formation."
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "This sigma factor is responsible for the expression of sporulation specific genes in the mother cell."
      - reference_id: PMID:19805276
        supporting_text: "releasing sigma(K) to direct transcription of genes necessary for spore formation"

- term:
    id: GO:2000142
    label: regulation of DNA-templated transcription initiation
  evidence_type: IEA
  original_reference_id: GO_REF:0000108
  review:
    summary: >-
      This annotation is logically inferred from sigma factor activity (GO:0016987). Since sigma
      factors specifically function at the initiation step, regulation of transcription initiation
      is an appropriate process annotation for sigK.
    action: ACCEPT
    reason: >-
      This is a more specific version of the transcription regulation annotation that correctly
      reflects that sigma factors act at the initiation stage. Since sigma factors "promote the
      attachment of RNA polymerase to specific initiation sites," they specifically regulate the
      initiation step of transcription. This logical inference from GO:0016987 is valid.
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites"

- term:
    id: GO:0005515
    label: protein binding
  evidence_type: IPI
  original_reference_id: PMID:19805276
  review:
    summary: >-
      This annotation is based on the physical interaction between pro-sigK and the intramembrane
      metalloprotease SpoIVFB, demonstrated in PMID:19805276. The study shows that SpoIVFB
      engages pro-sigK as its substrate for regulated intramembrane proteolysis, releasing
      active sigK.
    action: MODIFY
    reason: >-
      While the annotation is not wrong (sigK does bind to SpoIVFB), "protein binding" is
      too vague to be informative. The specific interaction documented is between pro-sigK
      and its processing protease SpoIVFB. The CC term GO:1903865 (sigma factor antagonist
      complex) already captures this regulatory relationship more specifically. The protein
      binding annotation adds little value over the more specific complex annotation.
    proposed_replacement_terms:
      - id: GO:1903865
        label: sigma factor antagonist complex
    additional_reference_ids:
      - PMID:19805276
    supported_by:
      - reference_id: PMID:19805276
        supporting_text: "The Bacillus subtilis SpoIVFB protein is a putative metalloprotease that cleaves membrane-tethered Pro-sigma(K), releasing sigma(K) to direct transcription of genes necessary for spore formation."
      - reference_id: PMID:19805276
        supporting_text: "The CBS domain of SpoIVFB was shown to interact with Pro-sigma(K) and ATP changed the interaction"

references:
- id: GO_REF:0000002
  title: Gene Ontology annotation through association of InterPro records with GO terms
  findings: []
- id: GO_REF:0000033
  title: Annotation inferences using phylogenetic trees
  findings: []
- id: GO_REF:0000043
  title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
  findings: []
- id: GO_REF:0000108
  title: Automatic assignment of GO terms using logical inference, based on inter-ontology links
  findings: []
- id: GO_REF:0000120
  title: Combined Automated Annotation using Multiple IEA Methods
  findings: []
- id: PMID:19805276
  title: Intramembrane proteolytic cleavage of a membrane-tethered transcription factor by a metalloprotease depends on ATP.
  findings:
    - statement: SpoIVFB cleaves pro-sigK to release active sigK
      supporting_text: "The Bacillus subtilis SpoIVFB protein is a putative metalloprotease that cleaves membrane-tethered Pro-sigma(K), releasing sigma(K) to direct transcription of genes necessary for spore formation."
    - statement: SpoIVFB CBS domain interacts with pro-sigK and ATP modulates this interaction
      supporting_text: "The CBS domain of SpoIVFB was shown to interact with Pro-sigma(K) and ATP changed the interaction"
- id: UniProt:P12254
  title: UniProt entry for RNA polymerase sigma-K factor (sigK/spoIVCB/spoIIIC)
  findings:
    - statement: SigK is synthesized as a precursor with a propeptide that is removed
      supporting_text: "PROPEP          1..20 /id=\"PRO_0000032588\""
    - statement: SigK contains a helix-turn-helix DNA-binding motif at residues 251-270
      supporting_text: "DNA_BIND        251..270 /note=\"H-T-H motif\""
    - statement: SigK belongs to the sigma-70 factor family
      supporting_text: "Belongs to the sigma-70 factor family."
    - statement: Sigma factors are initiation factors that promote RNAP attachment to specific sites
      supporting_text: "Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released."
    - statement: The sigK gene is produced by chromosomal rearrangement - excision of skin element
      supporting_text: "The spoIVCB gene is an incomplete structural gene that is capable of encoding only the N-terminal half of sigma-K."
    - statement: SigK interacts with SpoIVFB
      supporting_text: "P12254; P26937: spoIVFB; NbExp=2; IntAct=EBI-15806052, EBI-15806037"
- id: file:BACSU/sigK/sigK-deep-research-falcon.md
  title: Deep research report on sigK function and regulation
  findings:
    - statement: SigK operates in a hierarchical feed-forward mother-cell program
      supporting_text: "sigK operates in a hierarchical, feed-forward mother-cell program following sigE and preceding GerE, activating coat/cortex/germination genes"

core_functions:
  - description: >-
      Late mother cell-specific sigma factor activity directing transcription initiation of
      sporulation genes. SigK confers promoter recognition specificity to RNA polymerase,
      enabling transcription of coat proteins, cortex assembly genes, and germination factors.
      SigK is activated by proteolytic removal of its N-terminal propeptide by SpoIVFB after
      forespore-derived SpoIVB/CtpB proteases relieve inhibition of SpoIVFB by BofA/SpoIVFA.
    molecular_function:
      id: GO:0016987
      label: sigma factor activity
    directly_involved_in:
      - id: GO:0030435
        label: sporulation resulting in formation of a cellular spore
      - id: GO:0006352
        label: DNA-templated transcription initiation
    supported_by:
      - reference_id: UniProt:P12254
        supporting_text: "Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released."
      - reference_id: UniProt:P12254
        supporting_text: "This sigma factor is responsible for the expression of sporulation specific genes in the mother cell."

suggested_questions:
  - question: What is the complete SigK regulon and how does the feed-forward regulation with GerE sculpt the temporal expression pattern?
  - question: How does SigK promoter recognition specificity differ from that of the early mother cell sigma factor SigE?
  - question: Is skin element excision absolutely required for sigK function, or can strains without the skin element sporulate normally?

suggested_experiments:
  - description: >-
      ChIP-seq analysis of SigK binding during late sporulation would enable genome-wide
      identification of SigK binding sites, comprehensively defining the SigK regulon and
      validating the regulatory relationships with GerE.
    hypothesis: SigK binds to specific promoter sequences genome-wide to activate a defined late mother cell transcriptional program
    experiment_type: ChIP-seq
  - description: >-
      Cryo-EM structural analysis of the full pro-sigK-SpoIVFB-BofA-SpoIVFA inhibitory complex
      would reveal the molecular mechanism of sigK sequestration prior to activation and how
      SpoIVB/CtpB-mediated cleavage relieves this inhibition.
    hypothesis: BofA and SpoIVFA form a steric block preventing pro-sigK from accessing the SpoIVFB active site
    experiment_type: Structural biology