sigG

UniProt ID: P19940
Organism: Bacillus subtilis (strain 168)
Review Status: DRAFT
📝 Provide Detailed Feedback

Gene Description

RNA polymerase sigma-G factor (also known as SpoIIIG) is the late forespore-specific sigma factor in Bacillus subtilis sporulation. Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released after transcription begins. SigG becomes active only in the forespore compartment approximately 2 hours after sporulation starts, following the earlier action of sigF (the early forespore sigma factor). SigG is responsible for the expression of late sporulation-specific genes in the forespore and auto-stimulates its own transcription. The activity of sigG is tightly regulated during early forespore development by the anti-sigma-G factor Gin (CsfB) and by Lon protease. SigG interacts with Gin via its N-terminal region (residues 1-71), and this interaction inhibits sigG activity until the appropriate developmental stage. The protein contains the characteristic sigma-70 family domains including a helix-turn-helix DNA-binding motif (residues 229-248) and a polymerase core binding motif (residues 67-80).

Existing Annotations Review

GO Term Evidence Action Reason
GO:0000976 transcription cis-regulatory region binding
IBA
GO_REF:0000033
REMOVE
Summary: This annotation suggests sigG binds to transcription cis-regulatory regions. According to the GO definition of sigma factor activity (GO:0016987), sigma factors do not bind DNA on their own but when combined with the core polymerase to form the holoenzyme, they confer promoter specificity. This term is technically inaccurate for sigma factors as they require the core polymerase for DNA binding.
Reason: Sigma factors do not bind DNA independently. The GO definition of sigma factor activity explicitly states that "sigma does not bind DNA on its own, when combined with the core to form the holoenzyme, the sigma factor binds specifically to promoter elements." This term implies autonomous DNA binding which is misleading for sigma factors. The correct molecular function is GO:0016987 sigma factor activity.
GO:0003899 DNA-directed RNA polymerase activity
IBA
GO_REF:0000033
REMOVE
Summary: This annotation incorrectly attributes catalytic RNA polymerase activity to sigG. Sigma factors are NOT enzymes and do not catalyze the polymerization reaction. The catalytic activity resides in the core polymerase subunits (beta, beta-prime).
Reason: This is a clear over-annotation. GO:0003899 is defined as "Catalysis of the reaction: nucleoside triphosphate + RNA(n) = diphosphate + RNA(n+1)." Sigma factors do not possess catalytic activity. The catalytic activity of RNA polymerization is performed by the core enzyme subunits, not by sigma factors. SigG confers promoter specificity but does not catalyze nucleotide addition. UniProt describes sigG as an "initiation factor that promote[s] the attachment of RNA polymerase to specific initiation sites."
GO:0006355 regulation of DNA-templated transcription
IBA
GO_REF:0000033
ACCEPT
Summary: SigG regulates transcription by directing RNA polymerase to forespore-specific promoters and by auto-stimulating its own transcription. This is an appropriate biological process annotation for a sigma factor.
Reason: Sigma factors inherently regulate transcription by determining which genes are transcribed. SigG specifically regulates the expression of late sporulation genes in the forespore and auto-stimulates its own transcription (per UniProt PMID:18208527). This is a core function of sigG.
Supporting Evidence:
file:BACSU/sigG/sigG-deep-research-falcon.md
See deep research file for comprehensive analysis
GO:0016987 sigma factor activity
IBA
GO_REF:0000033
ACCEPT
Summary: This is the primary and most accurate molecular function annotation for sigG. GO:0016987 is defined as the activity of sigma factors that act as promoter specificity subunits of bacterial RNA polymerase.
Reason: This is the canonical molecular function for sigG. UniProt explicitly identifies sigG as "RNA polymerase sigma-G factor" belonging to the "sigma-70 factor family." The protein contains all characteristic sigma-70 family domains including the HTH DNA-binding motif (residues 229-248) and polymerase core binding motif (residues 67-80).
GO:1903865 sigma factor antagonist complex
IBA
GO_REF:0000033
ACCEPT
Summary: SigG forms a complex with its anti-sigma factor Gin (CsfB). GO:1903865 is defined as "A protein complex which is capable of sigma factor antagonist activity." When sigG is bound by Gin, this complex inhibits sigG transcriptional activity.
Reason: UniProt clearly documents that sigG interacts with anti-sigma-G factor Gin (CsfB) via its N-terminal region (residues 1-71). This interaction inhibits sigG activity (per PMID:18208527, PMID:19497328). This sigG-Gin complex represents a sigma factor antagonist complex that prevents premature sigG activity during early forespore development.
GO:0003677 DNA binding
IEA
GO_REF:0000120
KEEP AS NON CORE
Summary: This generic DNA binding annotation is inferred from the presence of sigma-70 family domains. While sigG does contain a helix-turn-helix DNA-binding motif (residues 229-248), sigma factors require the core polymerase for DNA binding.
Reason: SigG contains a HTH DNA-binding motif at residues 229-248 (UniProt feature). However, per the GO definition of sigma factor activity, "sigma does not bind DNA on its own." The more precise molecular function is GO:0016987 sigma factor activity. This annotation is kept as non-core since it reflects domain architecture but is less informative than the sigma factor activity annotation.
GO:0003700 DNA-binding transcription factor activity
IEA
GO_REF:0000002
MODIFY
Summary: This annotation from InterPro domain mapping is incorrect for sigma factors. Sigma factors are not transcription factors in the classical sense; they are subunits of the RNA polymerase holoenzyme that confer promoter specificity.
Reason: GO:0003700 is defined as sequence-specific DNA binding that modulates transcription. Sigma factors function differently: they are dissociable subunits of RNA polymerase that determine promoter recognition. The correct term is GO:0016987 sigma factor activity. While both involve promoter recognition, sigma factors act as part of the polymerase holoenzyme rather than as independent transcription factors.
Proposed replacements: sigma factor activity
GO:0006351 DNA-templated transcription
IEA
GO_REF:0000043
ACCEPT
Summary: SigG is involved in DNA-templated transcription as a component of the RNA polymerase holoenzyme. This is an appropriate general biological process annotation inferred from the UniProt Transcription keyword.
Reason: As a sigma factor, sigG is essential for transcription initiation from specific promoters. This general process term appropriately captures sigG's role in transcription. The annotation is supported by UniProt keywords (Transcription) and the well-characterized function of sigma factors in bacterial transcription.
GO:0006352 DNA-templated transcription initiation
IEA
GO_REF:0000002
ACCEPT
Summary: This is a highly appropriate annotation for sigma factors. Sigma factors specifically function in transcription initiation by directing RNA polymerase to promoters and are released after initiation.
Reason: UniProt states that "Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released." The specific role of sigma factors in initiation (as opposed to elongation or termination) makes this a precise and accurate annotation.
GO:0006355 regulation of DNA-templated transcription
IEA
GO_REF:0000120
ACCEPT
Summary: Duplicate annotation (same term as the IBA annotation above) from automated inference. This accurately captures sigG's regulatory role in transcription.
Reason: This is a duplicate of the IBA annotation for the same term. Both annotations are valid. Sigma factors regulate transcription by determining which genes are expressed. SigG specifically regulates late forespore gene expression and auto-stimulates its own transcription.
GO:0016987 sigma factor activity
IEA
GO_REF:0000120
ACCEPT
Summary: Duplicate annotation (same term as the IBA annotation above) from automated inference based on InterPro domains and UniProt keywords.
Reason: This is a duplicate of the IBA annotation for sigma factor activity. Both annotations are valid. This is the core molecular function of sigG, supported by the UniProt Sigma factor keyword and sigma-70 family domain architecture.
GO:0030435 sporulation resulting in formation of a cellular spore
IEA
GO_REF:0000043
ACCEPT
Summary: SigG is the late forespore-specific sigma factor essential for sporulation in B. subtilis. This biological process annotation appropriately captures the developmental context of sigG function.
Reason: SigG is specifically active in the forespore during sporulation. UniProt states it is "responsible for the expression of sporulation specific genes in the forespore" and is "Active only in the forespore" approximately "2 hours after sporulation starts." The Sporulation UniProt keyword correctly maps to this GO term.
GO:2000142 regulation of DNA-templated transcription initiation
IEA
GO_REF:0000108
ACCEPT
Summary: This annotation is logically inferred from sigma factor activity (GO:0016987). Since sigma factors specifically function at the initiation step, regulation of transcription initiation is an appropriate process annotation.
Reason: This is a more specific version of the transcription regulation annotation that correctly reflects that sigma factors act at the initiation stage. Since sigma factors "promote the attachment of RNA polymerase to specific initiation sites," they specifically regulate the initiation step of transcription. This logical inference from GO:0016987 is valid.
GO:0045152 antisigma factor binding
IC
PMID:18208527
How the early sporulation sigma factor sigmaF delays the swi...
NEW
Summary: SigG binds to its anti-sigma factor Gin (CsfB) via residues 1-71. This interaction is critical for temporal regulation of sigG activity during sporulation.
Reason: UniProt documents direct interaction between sigG and anti-sigma-G factor Gin (PMID:18208527, PMID:19497328). The N-terminal region (residues 1-71) is annotated as the recognition region for Gin binding. This molecular function annotation would accurately capture sigG's ability to bind its cognate anti-sigma factor.
Supporting Evidence:
PMID:18208527
Gin interacts strongly with sigma(G) but not with sigma(F) in a yeast two-hybrid experiment
PMID:19497328
The Gin protein, encoded by csfB, is able to strongly inhibit the activity of one of these factors, sigma(G), in vivo
GO:0042601 endospore-forming forespore
IC
PMID:18208527
How the early sporulation sigma factor sigmaF delays the swi...
NEW
Summary: SigG is specifically localized and active in the forespore compartment during sporulation. This cellular component annotation would capture the spatial specificity of sigG function.
Reason: UniProt clearly states that sigG is "Active only in the forespore" (PMID:18208527). The forespore is the compartment that will ultimately become the mature spore. This cellular component annotation would accurately capture the subcellular localization and activity context of sigG.
Supporting Evidence:
PMID:18208527
Sporulation in Bacillus subtilis is a primitive differentiation process involving two cell types, the forespore and the mother cell

Core Functions

Late forespore-specific sigma factor activity directing transcription initiation of sporulation genes. SigG is the canonical sigma factor for late forespore development, promoting attachment of RNA polymerase to specific promoters.

References

How the early sporulation sigma factor sigmaF delays the switch to late development in Bacillus subtilis
  • sigG is the late forespore-specific sigma factor
    "We report that activity of sigma(G), the late forespore sigma factor, is kept in check by Gin, the product of csfB"
  • Interacts with anti-sigma-G factor Gin
    "Gin interacts strongly with sigma(G) but not with sigma(F) in a yeast two-hybrid experiment"
  • Active only in the forespore
    "Sporulation in Bacillus subtilis is a primitive differentiation process involving two cell types, the forespore and the mother cell"
  • Gin is the anti-sigma-G factor that prevents premature activation
    "Gin allows sigma(F) to delay the switch to the late forespore transcription programme by preventing sigma(G) to take over before the cell has reached a critical stage of development"
  • Premature sigma-G activity leads to slow-germinating spores
    "Premature sigma(G) activity leads to the formation of slow-germinating spores"
Expression of the sigmaF-directed csfB locus prevents premature appearance of sigmaG activity during sporulation of Bacillus subtilis
  • Activity repressed by anti-sigma-G factor Gin (csfB) and Lon protease during early forespore development
    "CsfB exerted control distinct from but overlapping with that exerted by LonA to prevent inappropriate sigma(G) activation"
Genetic dissection of an inhibitor of the sporulation sigma factor sigma(G)
  • Gin inhibits sigma-G activity by direct physical interaction
    "The Gin protein, encoded by csfB, is able to strongly inhibit the activity of one of these factors, sigma(G), in vivo"
  • Gin binding to sigma-G may involve more than simple sequestration
    "some mutants have kept most of their ability to bind sigma(G) but are completely unable to inhibit sigma(G) transcriptional activity, raising the possibility that Gin might act by a mechanism more complex than just sequestration of sigma(G)"
Gene Ontology annotation through association of InterPro records with GO terms
Annotation inferences using phylogenetic trees
Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
Automatic assignment of GO terms using logical inference, based on inter-ontology links
Combined Automated Annotation using Multiple IEA Methods
file:BACSU/sigG/sigG-deep-research-falcon.md
Deep research on sigG function

Suggested Questions for Experts

Q: What is the complete set of genes whose expression is directly controlled by sigG in the forespore, and how does this differ from sigF-controlled genes?

Q: What is the mechanism by which Lon protease contributes to sigG regulation, and does it act on sigG directly or on Gin?

Suggested Experiments

Experiment: ChIP-seq analysis of sigG binding sites to define the complete sigG regulon in sporulating B. subtilis cells. This would provide a comprehensive view of direct sigG target genes and validate the biological process annotations.

Experiment: Time-course transcriptomics comparing wild-type and sigG mutant strains during sporulation to identify sigG-dependent gene expression changes. This would help distinguish direct from indirect effects of sigG on sporulation gene expression.

📚 Additional Documentation

Deep Research Bioreason

(sigG-deep-research-bioreason.md)

BioReason Chat Export

Exported on March 22, 2026 at 03:18 AM


Organism: Bacillus subtilis

Sequence:

MSRNKVEICGVDTSKLPVLKNEEMRKLFRQLQDEGDDSAREKLVNGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLMKSIDNFDLSHNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRDIAYKALQVRERLISETSKEPTAEDIAKVLEVPHEEIVFALDAIQDPVSLFEPIYNDGGDPIYVMDQISDERNTDSQWIEELALKEGMRRLNDREKMILRKRFFQGKTQMEVAEEIGISQAQVSRLEKAAIKQMNKNIHQ

Thinking Trace

I begin with the InterPro architecture. The N-terminal half is defined by IPR014212 (RNA polymerase sigma-G type, residues 4–258) and IPR014322 (RNA polymerase sigma-B/F/G type, residues 37–258), which together place the protein in the sporulation-associated sigma-B/G clade. This clade-level identity is reinforced by two family-wide envelopes that span almost the entire factor: IPR000943 (RNA polymerase sigma-70, residues 21–257) and IPR050239 (Sigma-70 factor family, RNA polymerase initiation factors, residues 25–256). Within this sigma-70 framework, the core functional modules are arranged in canonical order. Region 2 is captured by IPR013325 (RNA polymerase sigma factor, region 2 homologous superfamily, residues 12–111) and IPR007627 (sigma-70 region 2, residues 43–111), followed by region 3 and then region 4: IPR007624 (sigma-70 region 3, residues 125–177) and IPR007630 (sigma-70 region 4, residues 208–254). The C-terminal half is additionally encompassed by IPR014284 (RNA polymerase sigma-70-like domain, residues 38–257) and IPR013324 (sigma factor region 3/4-like homologous superfamily, residues 115–258), which together specify the DNA-recognition and RNA polymerase-binding surfaces that drive promoter selection.

This ordered arrangement causes sigma factor activity: region 2 binds and stabilizes the open complex and recognizes -10 elements; region 3 modulates -10 recognition and promotes isomerization; and region 4 binds the -35 element and anchors the holoenzyme to promoter DNA. These modules specify the molecular function formalized as GO:0016987 sigma factor activity. Because sigma factors act as initiation determinants for bacterial-type RNA polymerase, this activity directly instantiates the biological process of transcription initiation from bacterial-type promoters, aligning with GO:0006352 transcription, DNA-templated. The sigma-G/B/F/G clade annotation indicates specialization in sporulation regulons in Bacillus, implying promoter selectivity that coordinates developmental transcription programs.

Cellular localization follows from function and architecture. Sigma-70 family proteins operate with the core bacterial RNA polymerase in the soluble fraction of the bacterial cytoplasm. The absence of transmembrane signatures and the soluble sigma-70 fold point to a cytoplasmic residency, consistent with the functional cycle of transcription initiation and diffusion-limited promoter search in the bacterial cytoplasm.

Mechanistically, the protein forms a holoenzyme by docking onto core RNA polymerase, then uses region 2/3/4 surfaces to nucleate promoter commitment and melt promoter DNA to form the open complex. I hypothesize that it engages core subunits (RpoA/RpoB/RpoC) and transiently cooperates with initiation cofactors and nucleoid-associated proteins typical of sporulation pathways in Bacillus. Partners likely include other sigma-family members that compete for core polymerase, as well as promoter-bound assemblies that bias transcription toward developmental and stress-responsive regulons characteristic of sigma-G-like factors.

Functional Summary

A soluble bacterial transcription initiation factor that binds core RNA polymerase to confer promoter specificity during transcription initiation. Its modular architecture organizes the recognition and melting surfaces that select -10 and -35 promoter elements, assemble the open complex, and drive start-site commitment. In Bacillus, this factor likely directs developmental and stress-responsive transcription programs characteristic of sporulation-associated regulons and operates in the cytoplasm where it dynamically associates with the transcription machinery.

UniProt Summary

Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released.

InterPro Domains

  • RNA polymerase sigma-G type (IPR014212, family) — residues 4-258
  • RNA polymerase sigma factor, region 2 (IPR013325, homologous_superfamily) — residues 12-111
  • RNA polymerase sigma-70 (IPR000943, domain) — residues 21-257
  • Sigma-70 factor family, RNA polymerase initiation factors (IPR050239, family) — residues 25-256
  • RNA polymerase sigma-B/F/G type (IPR014322, family) — residues 37-258
  • RNA polymerase sigma-70-like domain (IPR014284, domain) — residues 38-257
  • RNA polymerase sigma-70 region 2 (IPR007627, domain) — residues 43-111
  • RNA polymerase sigma factor, region 3/4-like (IPR013324, homologous_superfamily) — residues 115-258
  • RNA polymerase sigma-70 region 3 (IPR007624, domain) — residues 125-177
  • RNA polymerase sigma-70 region 4 (IPR007630, domain) — residues 208-254

GO Terms

Molecular Function: molecular_function (GO:0003674), binding (GO:0005488), protein binding (GO:0005515)

Biological Process: biological_process (GO:0008150), cellular process (GO:0009987), biological regulation (GO:0065007), response to stimulus (GO:0050896), regulation of biological process (GO:0050789), cellular response to stimulus (GO:0051716), response to stress (GO:0006950), regulation of metabolic process (GO:0019222), regulation of cellular process (GO:0050794), regulation of macromolecule metabolic process (GO:0060255), regulation of cellular metabolic process (GO:0031323), regulation of primary metabolic process (GO:0080090), cellular response to stress (GO:0033554), regulation of biosynthetic process (GO:0009889), regulation of nitrogen compound metabolic process (GO:0051171), regulation of nucleobase-containing compound metabolic process (GO:0019219), regulation of RNA metabolic process (GO:0051252), regulation of macromolecule biosynthetic process (GO:0010556), regulation of gene expression (GO:0010468), regulation of DNA-templated transcription in response to stress (GO:0043620), regulation of cellular biosynthetic process (GO:0031326), regulation of RNA biosynthetic process (GO:2001141), regulation of DNA-templated transcription (GO:0006355), regulation of nucleic acid-templated transcription (GO:1903506)

Cellular Component: cellular_component (GO:0005575), protein-containing complex (GO:0032991), transcription regulator complex (GO:0005667)


Generated by BioReason

Deep Research Bioreason Rl

(sigG-deep-research-bioreason-rl.md)

BioReason Chat Export

Exported on March 22, 2026 at 03:18 AM


Organism: Bacillus subtilis

Sequence:

MSRNKVEICGVDTSKLPVLKNEEMRKLFRQLQDEGDDSAREKLVNGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLMKSIDNFDLSHNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRDIAYKALQVRERLISETSKEPTAEDIAKVLEVPHEEIVFALDAIQDPVSLFEPIYNDGGDPIYVMDQISDERNTDSQWIEELALKEGMRRLNDREKMILRKRFFQGKTQMEVAEEIGISQAQVSRLEKAAIKQMNKNIHQ

Thinking Trace

I begin with the InterPro architecture. The N-terminal half is defined by IPR014212 (RNA polymerase sigma-G type, residues 4–258) and IPR014322 (RNA polymerase sigma-B/F/G type, residues 37–258), which together place the protein in the sporulation-associated sigma-B/G clade. This clade-level identity is reinforced by two family-wide envelopes that span almost the entire factor: IPR000943 (RNA polymerase sigma-70, residues 21–257) and IPR050239 (Sigma-70 factor family, RNA polymerase initiation factors, residues 25–256). Within this sigma-70 framework, the core functional modules are arranged in canonical order. Region 2 is captured by IPR013325 (RNA polymerase sigma factor, region 2 homologous superfamily, residues 12–111) and IPR007627 (sigma-70 region 2, residues 43–111), followed by region 3 and then region 4: IPR007624 (sigma-70 region 3, residues 125–177) and IPR007630 (sigma-70 region 4, residues 208–254). The C-terminal half is additionally encompassed by IPR014284 (RNA polymerase sigma-70-like domain, residues 38–257) and IPR013324 (sigma factor region 3/4-like homologous superfamily, residues 115–258), which together specify the DNA-recognition and RNA polymerase-binding surfaces that drive promoter selection.

This ordered arrangement causes sigma factor activity: region 2 binds and stabilizes the open complex and recognizes -10 elements; region 3 modulates -10 recognition and promotes isomerization; and region 4 binds the -35 element and anchors the holoenzyme to promoter DNA. These modules specify the molecular function formalized as GO:0016987 sigma factor activity. Because sigma factors act as initiation determinants for bacterial-type RNA polymerase, this activity directly instantiates the biological process of transcription initiation from bacterial-type promoters, aligning with GO:0006352 transcription, DNA-templated. The sigma-G/B/F/G clade annotation indicates specialization in sporulation regulons in Bacillus, implying promoter selectivity that coordinates developmental transcription programs.

Cellular localization follows from function and architecture. Sigma-70 family proteins operate with the core bacterial RNA polymerase in the soluble fraction of the bacterial cytoplasm. The absence of transmembrane signatures and the soluble sigma-70 fold point to a cytoplasmic residency, consistent with the functional cycle of transcription initiation and diffusion-limited promoter search in the bacterial cytoplasm.

Mechanistically, the protein forms a holoenzyme by docking onto core RNA polymerase, then uses region 2/3/4 surfaces to nucleate promoter commitment and melt promoter DNA to form the open complex. I hypothesize that it engages core subunits (RpoA/RpoB/RpoC) and transiently cooperates with initiation cofactors and nucleoid-associated proteins typical of sporulation pathways in Bacillus. Partners likely include other sigma-family members that compete for core polymerase, as well as promoter-bound assemblies that bias transcription toward developmental and stress-responsive regulons characteristic of sigma-G-like factors.

Functional Summary

A soluble bacterial transcription initiation factor that binds core RNA polymerase to confer promoter specificity during transcription initiation. Its modular architecture organizes the recognition and melting surfaces that select -10 and -35 promoter elements, assemble the open complex, and drive start-site commitment. In Bacillus, this factor likely directs developmental and stress-responsive transcription programs characteristic of sporulation-associated regulons and operates in the cytoplasm where it dynamically associates with the transcription machinery.

UniProt Summary

Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released.

InterPro Domains

  • RNA polymerase sigma-G type (IPR014212, family) — residues 4-258
  • RNA polymerase sigma factor, region 2 (IPR013325, homologous_superfamily) — residues 12-111
  • RNA polymerase sigma-70 (IPR000943, domain) — residues 21-257
  • Sigma-70 factor family, RNA polymerase initiation factors (IPR050239, family) — residues 25-256
  • RNA polymerase sigma-B/F/G type (IPR014322, family) — residues 37-258
  • RNA polymerase sigma-70-like domain (IPR014284, domain) — residues 38-257
  • RNA polymerase sigma-70 region 2 (IPR007627, domain) — residues 43-111
  • RNA polymerase sigma factor, region 3/4-like (IPR013324, homologous_superfamily) — residues 115-258
  • RNA polymerase sigma-70 region 3 (IPR007624, domain) — residues 125-177
  • RNA polymerase sigma-70 region 4 (IPR007630, domain) — residues 208-254

GO Terms

Molecular Function: molecular_function (GO:0003674), binding (GO:0005488), protein binding (GO:0005515)

Biological Process: biological_process (GO:0008150), cellular process (GO:0009987), biological regulation (GO:0065007), response to stimulus (GO:0050896), regulation of biological process (GO:0050789), cellular response to stimulus (GO:0051716), response to stress (GO:0006950), regulation of metabolic process (GO:0019222), regulation of cellular process (GO:0050794), regulation of macromolecule metabolic process (GO:0060255), regulation of cellular metabolic process (GO:0031323), regulation of primary metabolic process (GO:0080090), cellular response to stress (GO:0033554), regulation of biosynthetic process (GO:0009889), regulation of nitrogen compound metabolic process (GO:0051171), regulation of nucleobase-containing compound metabolic process (GO:0019219), regulation of RNA metabolic process (GO:0051252), regulation of macromolecule biosynthetic process (GO:0010556), regulation of gene expression (GO:0010468), regulation of DNA-templated transcription in response to stress (GO:0043620), regulation of cellular biosynthetic process (GO:0031326), regulation of RNA biosynthetic process (GO:2001141), regulation of DNA-templated transcription (GO:0006355), regulation of nucleic acid-templated transcription (GO:1903506)

Cellular Component: cellular_component (GO:0005575), protein-containing complex (GO:0032991), transcription regulator complex (GO:0005667)


Generated by BioReason

Deep Research Falcon

(sigG-deep-research-falcon.md)

provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2025-12-18T08:26:33.950145'
end_time: '2025-12-18T08:33:17.673820'
duration_seconds: 403.72
template_file: templates/gene_research_go_focused.md
template_variables:
organism: BACSU
gene_id: sigG
gene_symbol: sigG
uniprot_accession: P19940
protein_description: 'RecName: Full=RNA polymerase sigma-G factor; AltName: Full=Stage
III sporulation protein G;'
gene_info: Name=sigG; Synonyms=spoIIIG; OrderedLocusNames=BSU15330;
organism_full: Bacillus subtilis (strain 168).
protein_family: Belongs to the sigma-70 factor family. .
protein_domains: RNA_pol_sigma-70_dom. (IPR014284); RNA_pol_sigma-B/F/G. (IPR014322);
RNA_pol_sigma-G. (IPR014212); RNA_pol_sigma70. (IPR000943); RNA_pol_sigma70_r2.
(IPR007627)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 28


Question

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: P19940
  • Protein Description: RecName: Full=RNA polymerase sigma-G factor; AltName: Full=Stage III sporulation protein G;
  • Gene Information: Name=sigG; Synonyms=spoIIIG; OrderedLocusNames=BSU15330;
  • Organism (full): Bacillus subtilis (strain 168).
  • Protein Family: Belongs to the sigma-70 factor family. .
  • Key Domains: RNA_pol_sigma-70_dom. (IPR014284); RNA_pol_sigma-B/F/G. (IPR014322); RNA_pol_sigma-G. (IPR014212); RNA_pol_sigma70. (IPR000943); RNA_pol_sigma70_r2. (IPR007627)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "sigG" matches the protein description above
  2. Verify the organism is correct: Bacillus subtilis (strain 168).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'sigG' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene sigG (gene ID: sigG, UniProt: P19940) in BACSU.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output

Question: You are an expert researcher providing comprehensive, well-cited information.

Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies

Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: P19940
  • Protein Description: RecName: Full=RNA polymerase sigma-G factor; AltName: Full=Stage III sporulation protein G;
  • Gene Information: Name=sigG; Synonyms=spoIIIG; OrderedLocusNames=BSU15330;
  • Organism (full): Bacillus subtilis (strain 168).
  • Protein Family: Belongs to the sigma-70 factor family. .
  • Key Domains: RNA_pol_sigma-70_dom. (IPR014284); RNA_pol_sigma-B/F/G. (IPR014322); RNA_pol_sigma-G. (IPR014212); RNA_pol_sigma70. (IPR000943); RNA_pol_sigma70_r2. (IPR007627)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "sigG" matches the protein description above
  2. Verify the organism is correct: Bacillus subtilis (strain 168).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'sigG' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene sigG (gene ID: sigG, UniProt: P19940) in BACSU.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Comprehensive research report: Bacillus subtilis sigG (SpoIIIG), UniProt P19940 — forespore sigma-G

Identity verification and scope
- Gene/protein: sigG (formerly spoIIIG) encodes sigma-G (σG), a sporulation-specific RNA polymerase sigma factor in Bacillus subtilis 168 that directs late forespore gene expression and autoregulates its own transcription (forespore) (Mearls 2018, PLOS Genetics, Apr 2018; Khanna et al. 2020, Annu Rev Microbiol, Sep 2020). URLs: https://doi.org/10.1371/journal.pgen.1007350; https://doi.org/10.1146/annurev-micro-022520-074650 (mearls2018transcriptionandtranslation pages 1-2, khanna2020shapinganendospore pages 20-21).
- Biochemical class and family: Alternative sigma factor of the bacterial RNA polymerase sigma-70 family that binds RNAP core to confer promoter specificity during sporulation; active specifically in the forespore after engulfment (Khanna et al. 2020; Stragier 2022, J Bacteriol, Sep 2022). URLs: https://doi.org/10.1146/annurev-micro-022520-074650; https://doi.org/10.1128/jb.00187-22 (khanna2020shapinganendospore pages 20-21, stragier2022tofeedor pages 6-8).

1) Key concepts and definitions with current understanding
- Functional role: σG is the late-acting forespore sigma factor, replacing σF after engulfment to drive late developmental genes required for spore core maturation, cortex/coat assembly inputs from the forespore, and germination readiness. It also activates its own promoter (PsigG) creating a positive feedback loop (Mearls 2018; Khanna et al. 2020). URLs: https://doi.org/10.1371/journal.pgen.1007350; https://doi.org/10.1146/annurev-micro-022520-074650 (mearls2018transcriptionandtranslation pages 1-2, khanna2020shapinganendospore pages 20-21).
- Cellular compartment and timing: σG activity is forespore-specific and becomes detectable after engulfment; the σF→σG switch occurs approximately 2.5–3 h into sporulation under the studied conditions, with σG activity from ~3–6 h (Mearls 2018; Khanna et al. 2020). URL: https://doi.org/10.1371/journal.pgen.1007350; https://doi.org/10.1146/annurev-micro-022520-074650 (mearls2018transcriptionandtranslation pages 4-5, khanna2020shapinganendospore pages 20-21).
- Activation dependencies: Late forespore gene expression depends on the intercellular SpoIIIA–SpoIIQ (A–Q) channel connecting mother cell and forespore; this complex supports forespore physiology and is broadly required for late forespore transcription programs (Stragier 2022; Mearls 2018). URLs: https://doi.org/10.1128/jb.00187-22; https://doi.org/10.1371/journal.pgen.1007350 (stragier2022tofeedor pages 6-8, mearls2018transcriptionandtranslation pages 2-4, mearls2018transcriptionandtranslation pages 7-9).
- Inhibitory controls and checkpointing: Multiple layers prevent premature or ectopic σG activity. Early σG is inhibited by anti-sigma factors and proteolysis: CsfB (Gin) binds and inhibits σG (also reported to inhibit σE), SpoIIAB can antagonize σG in inappropriate compartments, and LonA protease counters σG outside the forespore. Translational dampening via a conserved 5′-leader hairpin that occludes the RBS, a GTG start codon, suboptimal -35/−10 spacing, and a suboptimal 5′ coding region keep early σG levels low; removal of these features triggers premature σG activity (Mearls 2018). URL: https://doi.org/10.1371/journal.pgen.1007350 (mearls2018transcriptionandtranslation pages 20-21, mearls2018transcriptionandtranslation pages 19-20).

2) Regulation, pathway context, and mechanistic details
- Sigma cascade and intercellular coupling: Sporulation proceeds via compartment-specific sigma factors: early σF (forespore) and σE (mother cell) precede late σG (forespore) and σK (mother cell). The A–Q channel (SpoIIIAA–AH in mother; SpoIIQ in forespore) is conserved and links compartments; σE drives expression of SpoIIIAA–AH, while σF drives spoIIQ. The channel is essential for sustained late forespore gene expression and supports the σF→σG program switch post-engulfment (Stragier 2022; Mearls 2018). URLs: https://doi.org/10.1128/jb.00187-22; https://doi.org/10.1371/journal.pgen.1007350 (stragier2022tofeedor pages 6-8, mearls2018transcriptionandtranslation pages 7-9, mearls2018transcriptionandtranslation pages 2-4).
- Timing and PsigG transcription: Contrary to earlier “delayed transcription” models, PsigG is transcribed at normal early timing by σF but at very low levels due to intrinsic sequence features; later, σG autoregulates PsigG to amplify expression. Apparent dependencies on σE/SpoIIQ seen in mutants can reflect the general requirement of the A–Q channel for late forespore gene expression rather than a σG-specific promoter delay (Mearls 2018). URL: https://doi.org/10.1371/journal.pgen.1007350 (mearls2018transcriptionandtranslation pages 4-5, mearls2018transcriptionandtranslation pages 19-20, mearls2018transcriptionandtranslation pages 5-7).
- Compartment-specific inhibition: CsfB (Gin), produced under σF, inhibits σG early in the forespore and contributes to preventing σG activity in mother/vegetative cells; SpoIIAB and LonA further limit σG outside the forespore. Misexpression of sigG without its dampening features causes vegetative or premature forespore σG activity (Mearls 2018). URL: https://doi.org/10.1371/journal.pgen.1007350 (mearls2018transcriptionandtranslation pages 20-21).

3) Transcriptional targets and regulon (representative)
- Autoregulation and late forespore program: σG activates its own gene (sigG) and drives late-stage forespore genes, including the transcription factor SpoVT that modulates the σG regulon, genes facilitating σK activation in the mother cell (e.g., spoIVB and related factors), small acid-soluble spore proteins (SASPs) for DNA protection, germination receptor components, and forespore-contributed enzymes relevant to cortex/core maturation. These assignments are consistent with classical forespore regulomics and were summarized in recent text describing σG’s downstream suite (Doğan 2024, text excerpt). While the excerpt is not a primary peer-reviewed article, the content aligns with the established σG regulon architecture in B. subtilis (URL unavailable) (dogan2024…ofbacilysin pages 70-74).

4) Recent developments and latest research (priority 2023–2024)
- Proteomics advances for spores: A 2024 shotgun proteomics workflow using ionic liquid-assisted sample preparation (pTRUST) enabled high-sensitivity analysis of highly pure B. subtilis spore preparations, identifying 445 proteins, including 52 of 79 previously localized spore proteins in SubtiWiki, and nominating 393 additional candidates. GFP fusions validated localization for 20 newly identified candidates. This methodological advance supports more complete mapping of spore proteomes, aiding studies of σG-controlled developmental outputs. Scientific Reports, Jul 2024; URL: https://doi.org/10.1038/s41598-024-67010-z (taoka2024ionicliquidassistedsample pages 1-2, taoka2024ionicliquidassistedsample pages 5-7).
- Global growth-to-sporulation transitions: A 2023 time-course RNA-seq in B. subtilis DB104 delineated gene expression changes across the growth-to-stationary transition when sporulation initiates, and examined sigma-factor-linked expression patterns across clusters. This supports the timing of global regulatory switches that include activation of sporulation cascades culminating in σG activity. Microorganisms, Jul 2023; URL: https://doi.org/10.3390/microorganisms11081928 (jun2023timecoursetranscriptomeanalysis pages 14-16).
- Evolving model of the A–Q channel: A 2022 minireview synthesizes comparative genomics showing conservation and variation in the SpoIIIA–SpoIIQ machinery among Firmicutes, refining views of channel composition and function during engulfment and late forespore transcription maintenance. J Bacteriol, Sep 2022; URL: https://doi.org/10.1128/jb.00187-22 (stragier2022tofeedor pages 6-8).

5) Current applications and real-world implementations
- Spore-based technologies: Bacterial spores are explored as robust delivery vehicles for probiotics, vaccines, and therapeutics due to resistance and ability to germinate in vivo. Mechanistic understanding of sporulation, including σG-directed late programs that equip spores with protective components and germination competence, underpins these applications. International Journal of Molecular Sciences, Mar 2022; URL: https://doi.org/10.3390/ijms23063405 (koopman2022mechanismsandapplications pages 10-12).
- Analytical platforms: High-sensitivity proteomics (pTRUST) enhances detection/characterization of spore components, potentially improving environmental/food safety monitoring and biodefense analytics for spore-formers by better defining protein markers originating from late sporulation programs influenced by σG. Scientific Reports, Jul 2024; URL: https://doi.org/10.1038/s41598-024-67010-z (taoka2024ionicliquidassistedsample pages 1-2, taoka2024ionicliquidassistedsample pages 5-7).

6) Expert opinions and authoritative analysis
- Sigma cascade and morphology reviews emphasize that σG activation awaits completion of engulfment and depends on intercellular coupling, arguing for tightly segregated forespore vs mother-cell programs to ensure orderly development. Annual Review of Microbiology, Sep 2020; J Bacteriol minireview, Sep 2022. URLs: https://doi.org/10.1146/annurev-micro-022520-074650; https://doi.org/10.1128/jb.00187-22 (khanna2020shapinganendospore pages 20-21, stragier2022tofeedor pages 6-8).
- Mechanistic tuning of sigG: PLOS Genetics (2018) demonstrated that intrinsic promoter/leader features, rather than an external checkpoint delaying transcription onset, explain the low early expression of sigG and enforce the temporal separation of σF and σG programs, providing a precise and experimentally grounded update to the regulatory model. URL: https://doi.org/10.1371/journal.pgen.1007350 (mearls2018transcriptionandtranslation pages 4-5, mearls2018transcriptionandtranslation pages 19-20, mearls2018transcriptionandtranslation pages 5-7).

7) Relevant statistics and data points from recent studies
- Timing metrics: σF activity ~1.5–2.5 h; σG activity ~3–6 h; σF→σG switch ~2.5–3 h under the studied sporulation regimen (Mearls 2018, PLOS Genetics; Apr 2018; URL: https://doi.org/10.1371/journal.pgen.1007350) (mearls2018transcriptionandtranslation pages 4-5).
- Translational control magnitudes: The conserved 5′-leader hairpin can reduce translation up to 4-fold; changing GTG to ATG increases translation ~1.3×; optimizing 5′ coding region and promoter architecture further elevates early expression, but misexpression triggers premature σG activity (Mearls 2018; URL: https://doi.org/10.1371/journal.pgen.1007350) (mearls2018transcriptionandtranslation pages 1-2, mearls2018transcriptionandtranslation pages 5-7).
- Spore proteome coverage: 445 total proteins identified; 52/79 known spore-localized proteins recovered (≈70%); 393 additional candidates; 20 novel candidates confirmed by GFP localization (Taoka et al. 2024; Scientific Reports; Jul 2024; URL: https://doi.org/10.1038/s41598-024-67010-z) (taoka2024ionicliquidassistedsample pages 1-2, taoka2024ionicliquidassistedsample pages 5-7).

Conclusions — functional annotation summary for sigG (P19940)
- Primary function: σG is a forespore-specific alternative sigma factor (sigma-70 family) that reprograms RNA polymerase to transcribe late forespore genes after engulfment; it autoregulates sigG and activates genes necessary for spore core maturation, protective SASPs, germination apparatus components, and intercellular signaling back to the mother cell (via factors that promote σK activation). Activity is gated by intercellular A–Q channel function and multiple inhibitory/dampening layers (CsfB, SpoIIAB, LonA; 5′ leader hairpin; start codon; promoter spacing) to prevent premature or ectopic expression (Mearls 2018; Khanna 2020; Stragier 2022). URLs: https://doi.org/10.1371/journal.pgen.1007350; https://doi.org/10.1146/annurev-micro-022520-074650; https://doi.org/10.1128/jb.00187-22 (mearls2018transcriptionandtranslation pages 1-2, khanna2020shapinganendospore pages 20-21, stragier2022tofeedor pages 6-8).
- Localization: Acts in the forespore cytoplasm after engulfment; late gene expression requires the SpoIIIA–SpoIIQ channel that bridges to the mother cell (Khanna 2020; Stragier 2022; Mearls 2018). URLs: https://doi.org/10.1146/annurev-micro-022520-074650; https://doi.org/10.1128/jb.00187-22; https://doi.org/10.1371/journal.pgen.1007350 (khanna2020shapinganendospore pages 20-21, stragier2022tofeedor pages 6-8, mearls2018transcriptionandtranslation pages 7-9).

Notes on gene symbol disambiguation
- The gene symbol “sigG” here refers explicitly to Bacillus subtilis 168 σG (formerly spoIIIG), as corroborated by the cited B. subtilis sporulation literature; no conflicting gene usage is used in this report (Mearls 2018; Khanna 2020). URLs: https://doi.org/10.1371/journal.pgen.1007350; https://doi.org/10.1146/annurev-micro-022520-074650 (mearls2018transcriptionandtranslation pages 1-2, khanna2020shapinganendospore pages 20-21).

References

  1. (mearls2018transcriptionandtranslation pages 1-2): Elizabeth B. Mearls, Jacquelin Jackter, Jennifer M. Colquhoun, Veronica Farmer, Allison J. Matthews, Laura S. Murphy, Colleen Fenton, and Amy H. Camp. Transcription and translation of the sigg gene is tuned for proper execution of the switch from early to late gene expression in the developing bacillus subtilis spore. PLOS Genetics, 14:e1007350, Apr 2018. URL: https://doi.org/10.1371/journal.pgen.1007350, doi:10.1371/journal.pgen.1007350. This article has 19 citations and is from a domain leading peer-reviewed journal.

  2. (khanna2020shapinganendospore pages 20-21): Kanika Khanna, Javier Lopez-Garrido, and Kit Pogliano. Shaping an endospore: architectural transformations during bacillus subtilis sporulation. Annual Review of Microbiology, 74:361-386, Sep 2020. URL: https://doi.org/10.1146/annurev-micro-022520-074650, doi:10.1146/annurev-micro-022520-074650. This article has 92 citations and is from a peer-reviewed journal.

  3. (stragier2022tofeedor pages 6-8): Patrick Stragier. To feed or to stick? genomic analysis offers clues for the role of a molecular machine in endospore formers. Journal of Bacteriology, Sep 2022. URL: https://doi.org/10.1128/jb.00187-22, doi:10.1128/jb.00187-22. This article has 6 citations and is from a peer-reviewed journal.

  4. (mearls2018transcriptionandtranslation pages 4-5): Elizabeth B. Mearls, Jacquelin Jackter, Jennifer M. Colquhoun, Veronica Farmer, Allison J. Matthews, Laura S. Murphy, Colleen Fenton, and Amy H. Camp. Transcription and translation of the sigg gene is tuned for proper execution of the switch from early to late gene expression in the developing bacillus subtilis spore. PLOS Genetics, 14:e1007350, Apr 2018. URL: https://doi.org/10.1371/journal.pgen.1007350, doi:10.1371/journal.pgen.1007350. This article has 19 citations and is from a domain leading peer-reviewed journal.

  5. (mearls2018transcriptionandtranslation pages 2-4): Elizabeth B. Mearls, Jacquelin Jackter, Jennifer M. Colquhoun, Veronica Farmer, Allison J. Matthews, Laura S. Murphy, Colleen Fenton, and Amy H. Camp. Transcription and translation of the sigg gene is tuned for proper execution of the switch from early to late gene expression in the developing bacillus subtilis spore. PLOS Genetics, 14:e1007350, Apr 2018. URL: https://doi.org/10.1371/journal.pgen.1007350, doi:10.1371/journal.pgen.1007350. This article has 19 citations and is from a domain leading peer-reviewed journal.

  6. (mearls2018transcriptionandtranslation pages 7-9): Elizabeth B. Mearls, Jacquelin Jackter, Jennifer M. Colquhoun, Veronica Farmer, Allison J. Matthews, Laura S. Murphy, Colleen Fenton, and Amy H. Camp. Transcription and translation of the sigg gene is tuned for proper execution of the switch from early to late gene expression in the developing bacillus subtilis spore. PLOS Genetics, 14:e1007350, Apr 2018. URL: https://doi.org/10.1371/journal.pgen.1007350, doi:10.1371/journal.pgen.1007350. This article has 19 citations and is from a domain leading peer-reviewed journal.

  7. (mearls2018transcriptionandtranslation pages 20-21): Elizabeth B. Mearls, Jacquelin Jackter, Jennifer M. Colquhoun, Veronica Farmer, Allison J. Matthews, Laura S. Murphy, Colleen Fenton, and Amy H. Camp. Transcription and translation of the sigg gene is tuned for proper execution of the switch from early to late gene expression in the developing bacillus subtilis spore. PLOS Genetics, 14:e1007350, Apr 2018. URL: https://doi.org/10.1371/journal.pgen.1007350, doi:10.1371/journal.pgen.1007350. This article has 19 citations and is from a domain leading peer-reviewed journal.

  8. (mearls2018transcriptionandtranslation pages 19-20): Elizabeth B. Mearls, Jacquelin Jackter, Jennifer M. Colquhoun, Veronica Farmer, Allison J. Matthews, Laura S. Murphy, Colleen Fenton, and Amy H. Camp. Transcription and translation of the sigg gene is tuned for proper execution of the switch from early to late gene expression in the developing bacillus subtilis spore. PLOS Genetics, 14:e1007350, Apr 2018. URL: https://doi.org/10.1371/journal.pgen.1007350, doi:10.1371/journal.pgen.1007350. This article has 19 citations and is from a domain leading peer-reviewed journal.

  9. (mearls2018transcriptionandtranslation pages 5-7): Elizabeth B. Mearls, Jacquelin Jackter, Jennifer M. Colquhoun, Veronica Farmer, Allison J. Matthews, Laura S. Murphy, Colleen Fenton, and Amy H. Camp. Transcription and translation of the sigg gene is tuned for proper execution of the switch from early to late gene expression in the developing bacillus subtilis spore. PLOS Genetics, 14:e1007350, Apr 2018. URL: https://doi.org/10.1371/journal.pgen.1007350, doi:10.1371/journal.pgen.1007350. This article has 19 citations and is from a domain leading peer-reviewed journal.

  10. (dogan2024…ofbacilysin pages 70-74): BA Doğan. … of bacilysin production and rescue of sporulation defects in a bacilysin negative strain by the sporulation transcription factors gerr and spovt in bacillus subtilis. Unknown journal, 2024.

  11. (taoka2024ionicliquidassistedsample pages 1-2): Masato Taoka, Ritsuko Kuwana, Tatsumi Fukube, Akiko Kashima, Yuko Nobe, Takamasa Uekita, Tohru Ichimura, and Hiromu Takamatsu. Ionic liquid-assisted sample preparation mediates sensitive proteomic analysis of bacillus subtilis spores. Scientific Reports, Jul 2024. URL: https://doi.org/10.1038/s41598-024-67010-z, doi:10.1038/s41598-024-67010-z. This article has 4 citations and is from a peer-reviewed journal.

  12. (taoka2024ionicliquidassistedsample pages 5-7): Masato Taoka, Ritsuko Kuwana, Tatsumi Fukube, Akiko Kashima, Yuko Nobe, Takamasa Uekita, Tohru Ichimura, and Hiromu Takamatsu. Ionic liquid-assisted sample preparation mediates sensitive proteomic analysis of bacillus subtilis spores. Scientific Reports, Jul 2024. URL: https://doi.org/10.1038/s41598-024-67010-z, doi:10.1038/s41598-024-67010-z. This article has 4 citations and is from a peer-reviewed journal.

  13. (jun2023timecoursetranscriptomeanalysis pages 14-16): Ji-Su Jun, Hyang-Eun Jeong, Su-Yeong Moon, Se-Hee Shin, and Kwang-Won Hong. Time-course transcriptome analysis of bacillus subtilis db104 during growth. Microorganisms, 11:1928, Jul 2023. URL: https://doi.org/10.3390/microorganisms11081928, doi:10.3390/microorganisms11081928. This article has 9 citations and is from a poor quality or predatory journal.

  14. (koopman2022mechanismsandapplications pages 10-12): Nienke Koopman, Lauren Remijas, Jurgen Seppen, Peter Setlow, and Stanley Brul. Mechanisms and applications of bacterial sporulation and germination in the intestine. International Journal of Molecular Sciences, 23:3405, Mar 2022. URL: https://doi.org/10.3390/ijms23063405, doi:10.3390/ijms23063405. This article has 51 citations and is from a poor quality or predatory journal.

Citations

  1. mearls2018transcriptionandtranslation pages 20-21
  2. jun2023timecoursetranscriptomeanalysis pages 14-16
  3. stragier2022tofeedor pages 6-8
  4. koopman2022mechanismsandapplications pages 10-12
  5. mearls2018transcriptionandtranslation pages 4-5
  6. mearls2018transcriptionandtranslation pages 1-2
  7. khanna2020shapinganendospore pages 20-21
  8. mearls2018transcriptionandtranslation pages 2-4
  9. mearls2018transcriptionandtranslation pages 7-9
  10. mearls2018transcriptionandtranslation pages 19-20
  11. mearls2018transcriptionandtranslation pages 5-7
  12. taoka2024ionicliquidassistedsample pages 1-2
  13. taoka2024ionicliquidassistedsample pages 5-7
  14. https://doi.org/10.1371/journal.pgen.1007350;
  15. https://doi.org/10.1146/annurev-micro-022520-074650
  16. https://doi.org/10.1146/annurev-micro-022520-074650;
  17. https://doi.org/10.1128/jb.00187-22
  18. https://doi.org/10.1128/jb.00187-22;
  19. https://doi.org/10.1371/journal.pgen.1007350
  20. https://doi.org/10.1038/s41598-024-67010-z
  21. https://doi.org/10.3390/microorganisms11081928
  22. https://doi.org/10.3390/ijms23063405
  23. https://doi.org/10.1371/journal.pgen.1007350,
  24. https://doi.org/10.1146/annurev-micro-022520-074650,
  25. https://doi.org/10.1128/jb.00187-22,
  26. https://doi.org/10.1038/s41598-024-67010-z,
  27. https://doi.org/10.3390/microorganisms11081928,
  28. https://doi.org/10.3390/ijms23063405,

Bioreason Rl Review

(sigG-bioreason-rl-review.md)

BioReason-Pro RL Review: sigG (B. subtilis)

Source: sigG-deep-research-bioreason-rl.md

  • Correctness: 4/5
  • Completeness: 2/5

Functional Summary Review

The BioReason functional summary reads:

A soluble bacterial transcription initiation factor that binds core RNA polymerase to confer promoter specificity during transcription initiation. Its modular architecture organizes the recognition and melting surfaces that select -10 and -35 promoter elements, assemble the open complex, and drive start-site commitment. In Bacillus, this factor likely directs developmental and stress-responsive transcription programs characteristic of sporulation-associated regulons and operates in the cytoplasm where it dynamically associates with the transcription machinery.

The core description of sigma factor function is correct: SigG is a sigma-70 family transcription initiation factor that confers promoter specificity (GO:0016987, sigma factor activity) and participates in transcription initiation (GO:0006352). The cytoplasmic localization is reasonable.

However, the summary is notably vague and generic:

  1. Missing specificity: SigG is the late forespore-specific sigma factor, the second in the forespore lineage after SigF. BioReason says it "likely directs developmental and stress-responsive transcription programs" -- the hedging "likely" is inappropriate for a well-characterized sigma factor whose function is thoroughly established. The curated review explicitly states SigG is "Active only in the forespore" approximately 2 hours after sporulation starts.

  2. Missing anti-sigma regulation: SigG activity is controlled by the anti-sigma-G factor Gin (CsfB), which inhibits SigG via direct binding to its N-terminal region (residues 1-71). The curated review identifies antisigma factor binding (GO:0045152) and sigma factor antagonist complex (GO:1903865) as important annotations. BioReason misses this entirely.

  3. Missing temporal context: SigG auto-stimulates its own transcription and is regulated by Lon protease. The sequential sigma factor cascade (SigF -> SigG in forespore; SigE -> SigK in mother cell) is not captured.

  4. Forespore localization missing: The curated review proposes endospore-forming forespore (GO:0042601) as the appropriate CC term. BioReason assigns generic cytoplasm.

  5. Erroneous GO predictions: BioReason's GO terms include nucleus (GO:0005634) and endonuclease complex (GO:1905348) -- both completely wrong for a bacterial sigma factor.

Comparison with interpro2go:

The interpro2go annotations for sigG include transcription regulation terms and the sigma-B/F/G type family classification. BioReason adds little beyond interpro2go -- both correctly identify sigma factor function but neither captures forespore specificity, anti-sigma regulation, or the sporulation context. BioReason's narrative is more articulate but the GO term predictions include errors (nucleus, endonuclease complex) that interpro2go would not produce.

Notes on thinking trace

The trace identifies the sigma-G type classification (IPR014212) but fails to leverage this specific family assignment to infer the late forespore role. The reasoning about regions 2/3/4 and HTH domain function is standard and correct. The trace mentions "sporulation-associated regulons" but does not commit to the specific forespore identity that the domain classification clearly indicates.

📄 View Raw YAML

id: P19940
gene_symbol: sigG
product_type: PROTEIN
status: DRAFT
taxon:
  id: NCBITaxon:224308
  label: Bacillus subtilis (strain 168)
description: RNA polymerase sigma-G factor (also known as SpoIIIG) is the late forespore-specific sigma
  factor in Bacillus subtilis sporulation. Sigma factors are initiation factors that promote the attachment
  of RNA polymerase to specific initiation sites and are then released after transcription begins. SigG
  becomes active only in the forespore compartment approximately 2 hours after sporulation starts, following
  the earlier action of sigF (the early forespore sigma factor). SigG is responsible for the expression
  of late sporulation-specific genes in the forespore and auto-stimulates its own transcription. The activity
  of sigG is tightly regulated during early forespore development by the anti-sigma-G factor Gin (CsfB)
  and by Lon protease. SigG interacts with Gin via its N-terminal region (residues 1-71), and this interaction
  inhibits sigG activity until the appropriate developmental stage. The protein contains the characteristic
  sigma-70 family domains including a helix-turn-helix DNA-binding motif (residues 229-248) and a polymerase
  core binding motif (residues 67-80).
existing_annotations:
- term:
    id: GO:0000976
    label: transcription cis-regulatory region binding
  evidence_type: IBA
  original_reference_id: GO_REF:0000033
  review:
    summary: This annotation suggests sigG binds to transcription cis-regulatory regions. According to
      the GO definition of sigma factor activity (GO:0016987), sigma factors do not bind DNA on their
      own but when combined with the core polymerase to form the holoenzyme, they confer promoter specificity.
      This term is technically inaccurate for sigma factors as they require the core polymerase for DNA
      binding.
    action: REMOVE
    reason: Sigma factors do not bind DNA independently. The GO definition of sigma factor activity explicitly
      states that "sigma does not bind DNA on its own, when combined with the core to form the holoenzyme,
      the sigma factor binds specifically to promoter elements." This term implies autonomous DNA binding
      which is misleading for sigma factors. The correct molecular function is GO:0016987 sigma factor
      activity.
- term:
    id: GO:0003899
    label: DNA-directed RNA polymerase activity
  evidence_type: IBA
  original_reference_id: GO_REF:0000033
  review:
    summary: This annotation incorrectly attributes catalytic RNA polymerase activity to sigG. Sigma factors
      are NOT enzymes and do not catalyze the polymerization reaction. The catalytic activity resides
      in the core polymerase subunits (beta, beta-prime).
    action: REMOVE
    reason: 'This is a clear over-annotation. GO:0003899 is defined as "Catalysis of the reaction: nucleoside
      triphosphate + RNA(n) = diphosphate + RNA(n+1)." Sigma factors do not possess catalytic activity.
      The catalytic activity of RNA polymerization is performed by the core enzyme subunits, not by sigma
      factors. SigG confers promoter specificity but does not catalyze nucleotide addition. UniProt describes
      sigG as an "initiation factor that promote[s] the attachment of RNA polymerase to specific initiation
      sites."'
- term:
    id: GO:0006355
    label: regulation of DNA-templated transcription
  evidence_type: IBA
  original_reference_id: GO_REF:0000033
  review:
    summary: SigG regulates transcription by directing RNA polymerase to forespore-specific promoters
      and by auto-stimulating its own transcription. This is an appropriate biological process annotation
      for a sigma factor.
    action: ACCEPT
    reason: Sigma factors inherently regulate transcription by determining which genes are transcribed.
      SigG specifically regulates the expression of late sporulation genes in the forespore and auto-stimulates
      its own transcription (per UniProt PMID:18208527). This is a core function of sigG.
    supported_by:
    - reference_id: file:BACSU/sigG/sigG-deep-research-falcon.md
      supporting_text: See deep research file for comprehensive analysis
- term:
    id: GO:0016987
    label: sigma factor activity
  evidence_type: IBA
  original_reference_id: GO_REF:0000033
  review:
    summary: This is the primary and most accurate molecular function annotation for sigG. GO:0016987
      is defined as the activity of sigma factors that act as promoter specificity subunits of bacterial
      RNA polymerase.
    action: ACCEPT
    reason: This is the canonical molecular function for sigG. UniProt explicitly identifies sigG as "RNA
      polymerase sigma-G factor" belonging to the "sigma-70 factor family." The protein contains all characteristic
      sigma-70 family domains including the HTH DNA-binding motif (residues 229-248) and polymerase core
      binding motif (residues 67-80).
- term:
    id: GO:1903865
    label: sigma factor antagonist complex
  evidence_type: IBA
  original_reference_id: GO_REF:0000033
  review:
    summary: SigG forms a complex with its anti-sigma factor Gin (CsfB). GO:1903865 is defined as "A protein
      complex which is capable of sigma factor antagonist activity." When sigG is bound by Gin, this complex
      inhibits sigG transcriptional activity.
    action: ACCEPT
    reason: UniProt clearly documents that sigG interacts with anti-sigma-G factor Gin (CsfB) via its
      N-terminal region (residues 1-71). This interaction inhibits sigG activity (per PMID:18208527, PMID:19497328).
      This sigG-Gin complex represents a sigma factor antagonist complex that prevents premature sigG
      activity during early forespore development.
- term:
    id: GO:0003677
    label: DNA binding
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: This generic DNA binding annotation is inferred from the presence of sigma-70 family domains.
      While sigG does contain a helix-turn-helix DNA-binding motif (residues 229-248), sigma factors require
      the core polymerase for DNA binding.
    action: KEEP_AS_NON_CORE
    reason: SigG contains a HTH DNA-binding motif at residues 229-248 (UniProt feature). However, per
      the GO definition of sigma factor activity, "sigma does not bind DNA on its own." The more precise
      molecular function is GO:0016987 sigma factor activity. This annotation is kept as non-core since
      it reflects domain architecture but is less informative than the sigma factor activity annotation.
- term:
    id: GO:0003700
    label: DNA-binding transcription factor activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: This annotation from InterPro domain mapping is incorrect for sigma factors. Sigma factors
      are not transcription factors in the classical sense; they are subunits of the RNA polymerase holoenzyme
      that confer promoter specificity.
    action: MODIFY
    reason: 'GO:0003700 is defined as sequence-specific DNA binding that modulates transcription. Sigma
      factors function differently: they are dissociable subunits of RNA polymerase that determine promoter
      recognition. The correct term is GO:0016987 sigma factor activity. While both involve promoter recognition,
      sigma factors act as part of the polymerase holoenzyme rather than as independent transcription
      factors.'
    proposed_replacement_terms:
    - id: GO:0016987
      label: sigma factor activity
- term:
    id: GO:0006351
    label: DNA-templated transcription
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: SigG is involved in DNA-templated transcription as a component of the RNA polymerase holoenzyme.
      This is an appropriate general biological process annotation inferred from the UniProt Transcription
      keyword.
    action: ACCEPT
    reason: As a sigma factor, sigG is essential for transcription initiation from specific promoters.
      This general process term appropriately captures sigG's role in transcription. The annotation is
      supported by UniProt keywords (Transcription) and the well-characterized function of sigma factors
      in bacterial transcription.
- term:
    id: GO:0006352
    label: DNA-templated transcription initiation
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: This is a highly appropriate annotation for sigma factors. Sigma factors specifically function
      in transcription initiation by directing RNA polymerase to promoters and are released after initiation.
    action: ACCEPT
    reason: UniProt states that "Sigma factors are initiation factors that promote the attachment of RNA
      polymerase to specific initiation sites and are then released." The specific role of sigma factors
      in initiation (as opposed to elongation or termination) makes this a precise and accurate annotation.
- term:
    id: GO:0006355
    label: regulation of DNA-templated transcription
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: Duplicate annotation (same term as the IBA annotation above) from automated inference. This
      accurately captures sigG's regulatory role in transcription.
    action: ACCEPT
    reason: This is a duplicate of the IBA annotation for the same term. Both annotations are valid. Sigma
      factors regulate transcription by determining which genes are expressed. SigG specifically regulates
      late forespore gene expression and auto-stimulates its own transcription.
- term:
    id: GO:0016987
    label: sigma factor activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: Duplicate annotation (same term as the IBA annotation above) from automated inference based
      on InterPro domains and UniProt keywords.
    action: ACCEPT
    reason: This is a duplicate of the IBA annotation for sigma factor activity. Both annotations are
      valid. This is the core molecular function of sigG, supported by the UniProt Sigma factor keyword
      and sigma-70 family domain architecture.
- term:
    id: GO:0030435
    label: sporulation resulting in formation of a cellular spore
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: SigG is the late forespore-specific sigma factor essential for sporulation in B. subtilis.
      This biological process annotation appropriately captures the developmental context of sigG function.
    action: ACCEPT
    reason: SigG is specifically active in the forespore during sporulation. UniProt states it is "responsible
      for the expression of sporulation specific genes in the forespore" and is "Active only in the forespore"
      approximately "2 hours after sporulation starts." The Sporulation UniProt keyword correctly maps
      to this GO term.
- term:
    id: GO:2000142
    label: regulation of DNA-templated transcription initiation
  evidence_type: IEA
  original_reference_id: GO_REF:0000108
  review:
    summary: This annotation is logically inferred from sigma factor activity (GO:0016987). Since sigma
      factors specifically function at the initiation step, regulation of transcription initiation is
      an appropriate process annotation.
    action: ACCEPT
    reason: This is a more specific version of the transcription regulation annotation that correctly
      reflects that sigma factors act at the initiation stage. Since sigma factors "promote the attachment
      of RNA polymerase to specific initiation sites," they specifically regulate the initiation step
      of transcription. This logical inference from GO:0016987 is valid.
- term:
    id: GO:0045152
    label: antisigma factor binding
  evidence_type: IC
  original_reference_id: PMID:18208527
  review:
    summary: SigG binds to its anti-sigma factor Gin (CsfB) via residues 1-71. This interaction is critical
      for temporal regulation of sigG activity during sporulation.
    action: NEW
    reason: UniProt documents direct interaction between sigG and anti-sigma-G factor Gin (PMID:18208527,
      PMID:19497328). The N-terminal region (residues 1-71) is annotated as the recognition region for
      Gin binding. This molecular function annotation would accurately capture sigG's ability to bind
      its cognate anti-sigma factor.
    supported_by:
    - reference_id: PMID:18208527
      supporting_text: Gin interacts strongly with sigma(G) but not with sigma(F) in a yeast two-hybrid
        experiment
    - reference_id: PMID:19497328
      supporting_text: The Gin protein, encoded by csfB, is able to strongly inhibit the activity of one
        of these factors, sigma(G), in vivo
- term:
    id: GO:0042601
    label: endospore-forming forespore
  evidence_type: IC
  original_reference_id: PMID:18208527
  review:
    summary: SigG is specifically localized and active in the forespore compartment during sporulation.
      This cellular component annotation would capture the spatial specificity of sigG function.
    action: NEW
    reason: UniProt clearly states that sigG is "Active only in the forespore" (PMID:18208527). The forespore
      is the compartment that will ultimately become the mature spore. This cellular component annotation
      would accurately capture the subcellular localization and activity context of sigG.
    supported_by:
    - reference_id: PMID:18208527
      supporting_text: Sporulation in Bacillus subtilis is a primitive differentiation process involving
        two cell types, the forespore and the mother cell
core_functions:
- description: Late forespore-specific sigma factor activity directing transcription initiation of sporulation
    genes. SigG is the canonical sigma factor for late forespore development, promoting attachment of
    RNA polymerase to specific promoters.
  molecular_function:
    id: GO:0016987
    label: sigma factor activity
  directly_involved_in:
  - id: GO:0030435
    label: sporulation resulting in formation of a cellular spore
  - id: GO:0006352
    label: DNA-templated transcription initiation
  locations:
  - id: GO:0042601
    label: endospore-forming forespore
references:
- id: PMID:18208527
  title: How the early sporulation sigma factor sigmaF delays the switch to late development in Bacillus
    subtilis
  full_text_unavailable: true
  findings:
  - statement: sigG is the late forespore-specific sigma factor
    supporting_text: We report that activity of sigma(G), the late forespore sigma factor, is kept in
      check by Gin, the product of csfB
  - statement: Interacts with anti-sigma-G factor Gin
    supporting_text: Gin interacts strongly with sigma(G) but not with sigma(F) in a yeast two-hybrid
      experiment
  - statement: Active only in the forespore
    supporting_text: Sporulation in Bacillus subtilis is a primitive differentiation process involving
      two cell types, the forespore and the mother cell
  - statement: Gin is the anti-sigma-G factor that prevents premature activation
    supporting_text: Gin allows sigma(F) to delay the switch to the late forespore transcription programme
      by preventing sigma(G) to take over before the cell has reached a critical stage of development
  - statement: Premature sigma-G activity leads to slow-germinating spores
    supporting_text: Premature sigma(G) activity leads to the formation of slow-germinating spores
- id: PMID:17921305
  title: Expression of the sigmaF-directed csfB locus prevents premature appearance of sigmaG activity
    during sporulation of Bacillus subtilis
  full_text_unavailable: true
  findings:
  - statement: Activity repressed by anti-sigma-G factor Gin (csfB) and Lon protease during early forespore
      development
    supporting_text: CsfB exerted control distinct from but overlapping with that exerted by LonA to prevent
      inappropriate sigma(G) activation
- id: PMID:19497328
  title: Genetic dissection of an inhibitor of the sporulation sigma factor sigma(G)
  full_text_unavailable: true
  findings:
  - statement: Gin inhibits sigma-G activity by direct physical interaction
    supporting_text: The Gin protein, encoded by csfB, is able to strongly inhibit the activity of one
      of these factors, sigma(G), in vivo
  - statement: Gin binding to sigma-G may involve more than simple sequestration
    supporting_text: some mutants have kept most of their ability to bind sigma(G) but are completely
      unable to inhibit sigma(G) transcriptional activity, raising the possibility that Gin might act
      by a mechanism more complex than just sequestration of sigma(G)
- id: GO_REF:0000002
  title: Gene Ontology annotation through association of InterPro records with GO terms
  findings: []
- id: GO_REF:0000033
  title: Annotation inferences using phylogenetic trees
  findings: []
- id: GO_REF:0000043
  title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
  findings: []
- id: GO_REF:0000108
  title: Automatic assignment of GO terms using logical inference, based on inter-ontology links
  findings: []
- id: GO_REF:0000120
  title: Combined Automated Annotation using Multiple IEA Methods
  findings: []
- id: file:BACSU/sigG/sigG-deep-research-falcon.md
  title: Deep research on sigG function
  findings: []
suggested_questions:
- question: What is the complete set of genes whose expression is directly controlled by sigG in the forespore,
    and how does this differ from sigF-controlled genes?
- question: What is the mechanism by which Lon protease contributes to sigG regulation, and does it act
    on sigG directly or on Gin?
suggested_experiments:
- description: ChIP-seq analysis of sigG binding sites to define the complete sigG regulon in sporulating
    B. subtilis cells. This would provide a comprehensive view of direct sigG target genes and validate
    the biological process annotations.
- description: Time-course transcriptomics comparing wild-type and sigG mutant strains during sporulation
    to identify sigG-dependent gene expression changes. This would help distinguish direct from indirect
    effects of sigG on sporulation gene expression.