SCO7099 (Q9KZ33) is a predicted extracytoplasmic function (ECF) sigma factor in Streptomyces coelicolor A3(2). It contains ECF Sigma-70, RNA polymerase sigma r3/r4-like, and winged helix-like DNA-binding domains, consistent with the minimal two-domain architecture (sigma2 and sigma4) characteristic of ECF sigma factors. As a sigma factor, it is expected to associate with the RNA polymerase core enzyme and direct transcription initiation at a specific but currently unknown set of promoters, likely in response to extracytoplasmic signals. S. coelicolor encodes 64-65 sigma factors, the majority belonging to the ECF family. SCO7099 remains functionally uncharacterized: no regulon, promoter specificity, inducing signal, anti-sigma factor partner, or knockout phenotype has been determined. A 2024 systematic overexpression study failed to recover transformants for SCO7099, suggesting potential toxicity from uncontrolled expression, but no follow-up experiments resolved this observation.
| GO Term | Evidence | Action | Reason |
|---|---|---|---|
|
GO:0016987
sigma factor activity
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: Sigma factor activity is well-supported by domain architecture. Q9KZ33 contains the ECF_Sigma-70_Domain (IPR052704), RNA_pol_sigma_r3/r4-like (IPR013324), and WH-like_DNA-bd_sf (IPR036388) domains, which constitute the minimal sigma2 and sigma4 domain architecture required for promoter recognition and RNA polymerase interaction. The IBA evidence is based on PANTHER phylogenetic inference (PTN001249270) with L0TCG5 (SigJ from Mycobacterium tuberculosis) as the with/from reference, a genuine ECF sigma factor ortholog in actinobacteria.
Reason: The domain architecture unambiguously supports classification as an ECF sigma factor. The PANTHER phylogenetic inference is well-grounded, with a bona fide ECF sigma factor ortholog as the reference. Although no direct experimental evidence exists for Q9KZ33 itself, the structural and phylogenetic evidence for sigma factor activity is strong.
Supporting Evidence:
file:STRCO/Q9KZ33/Q9KZ33-deep-research-falcon.md
The presence of ECF_Sigma-70_Domain and RNA_pol_sigma_r3/r4-like domains strongly supports classification as a bona fide sigma factor capable of directing transcription initiation
|
|
GO:2000142
regulation of DNA-templated transcription initiation
|
IEA
GO_REF:0000108 |
ACCEPT |
Summary: This annotation is a correct logical inference from the sigma factor activity (GO:0016987) annotation via GO_REF:0000108 (inter-ontology logical links). Sigma factors by definition modulate transcription initiation by directing RNA polymerase to specific promoters, so involvement in regulation of DNA-templated transcription initiation follows directly from the molecular function annotation.
Reason: This BP annotation is automatically derived from the MF sigma factor activity annotation using inter-ontology inference rules. Since sigma factors regulate which promoters RNA polymerase recognizes, they inherently regulate transcription initiation. The inference is biologically sound.
Supporting Evidence:
file:STRCO/Q9KZ33/Q9KZ33-deep-research-falcon.md
sigma factors serve as dissociable regulatory subunits of bacterial RNA polymerase...the sigma factor binds specifically to promoter elements
|
Q: What is the regulon of SCO7099 and under what conditions is it active?
Q: Does SCO7099 have a cognate anti-sigma factor encoded in its genomic neighborhood?
Q: Is the failure to recover overexpression transformants due to toxicity from transcriptional dysregulation, or a technical artifact?
Experiment: Construct a clean deletion mutant of SCO7099 and assess phenotypes under diverse stress conditions, developmental stages, and nutrient limitations.
Hypothesis: SCO7099 deletion may reveal condition-specific phenotypes related to extracytoplasmic stress response.
Experiment: Use an inducible promoter system (e.g., tipA-thiostrepton) to express SCO7099 at controlled levels and profile transcriptome changes by RNA-seq.
Hypothesis: Controlled expression will identify the SCO7099 regulon without the toxicity observed during constitutive overexpression.
Experiment: ChIP-seq with epitope-tagged SCO7099 expressed from its native locus to map genome-wide DNA-binding sites.
Hypothesis: SCO7099 binds a specific set of promoters with a characteristic ECF sigma factor -35/-10 motif.
The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.
You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.
We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.
We are interested in where in or outside the cell the gene product carries out its function.
We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.
Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.
SCO7099 (UniProt accession Q9KZ33) encodes a predicted extracytoplasmic function (ECF) sigma factor in Streptomyces coelicolor strain ATCC BAA-471/A3(2)/M145. Despite belonging to a well-characterized protein family, SCO7099 remains functionally uncharacterized, with no experimental data defining its regulon, promoter specificity, inducing conditions, or physiological role. The only direct experimental evidence comes from a 2024 study that failed to obtain transformants during systematic overexpression attempts, suggesting potential toxicity or regulatory constraints (lee2024machine‐learninganalysisof pages 10-11). This report synthesizes current knowledge based on domain structure predictions and comparative analysis with characterized ECF sigma factors in Streptomyces.
SCO7099 is the ordered locus name for this gene in S. coelicolor (ORF name: SC3A4.25c). The encoded protein is annotated as an RNA polymerase sigma factor with UniProt accession Q9KZ33 (lee2024machine‐learninganalysisof pages 10-11). Streptomyces coelicolor is a model actinobacterium notable for producing diverse secondary metabolites and exhibiting complex developmental differentiation. The organism possesses an exceptionally large sigma factor repertoire, encoding 64-65 sigma factors in its genome, far exceeding the number found in other bacteria such as Escherichia coli (7 sigma factors) or Bacillus subtilis (19 sigma factors) (lee2024machine‐learninganalysisof pages 10-11, pospisil2024σeofstreptomyces pages 1-2). The majority of these sigma factors belong to the ECF (extracytoplasmic function) family, reflecting the complex regulatory demands of Streptomyces physiology and environmental adaptation (lee2024machine‐learninganalysisof pages 10-11).
The gene symbol Q9KZ33 unambiguously refers to the SCO7099 locus encoding an RNA polymerase sigma factor in S. coelicolor. This identity is consistent across UniProt annotations and genomic databases (accession NC003888.3) (lee2024machine‐learninganalysisof pages 10-11). No evidence was found for ambiguity with genes from other organisms or alternative gene products at this locus.
According to InterPro annotations, Q9KZ33 contains the following conserved domains:
- ECF_Sigma-70_Domain (IPR052704): The signature domain defining ECF sigma factors
- NTF2-like_dom_sf (IPR032710): A structural fold superfamily
- RNA_pol_sigma_r3/r4-like (IPR013324): Regions responsible for promoter recognition
- WH-like_DNA-bd_sf (IPR036388): Winged helix-like DNA-binding structural superfamily (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2)
Based on its domain architecture, Q9KZ33 is classified as a member of the ECF (extracytoplasmic function) sigma factor family, also known as Group IV of the σ70 family (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2). ECF sigma factors are the largest and most diverse group of alternative sigma factors in bacteria, with 157 phylogenetically defined groups identified across bacterial genomes (dios2021extracytoplasmicfunctionσ pages 1-2).
ECF sigma factors are structurally distinguished by containing only the essential domains for RNA polymerase (RNAP) interaction and promoter recognition: the σ2 domain (responsible for -10 promoter element recognition and DNA melting) and the σ4 domain (responsible for -35 promoter element recognition), connected by a short linker of fewer than 50 residues (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2). This minimal architecture contrasts with the primary sigma factor (σ70/HrdB in Streptomyces), which contains additional regulatory domains including σ1.1 and σ1.2 (dios2021extracytoplasmicfunctionσ pages 2-4).
As a sigma factor, Q9KZ33 functions as a transcription initiation specificity determinant rather than a catalytic enzyme. Sigma factors do not possess enzymatic activity themselves; instead, they serve as dissociable regulatory subunits of bacterial RNA polymerase (helmann2019wheretobegin? pages 1-3, dios2021extracytoplasmicfunctionσ pages 2-4). The core RNAP enzyme (composed of α2ββ'ω subunits) is catalytically competent for RNA synthesis but lacks promoter specificity. Upon binding a sigma factor, the RNAP holoenzyme acquires the ability to recognize specific promoter sequences, form stable promoter-RNAP complexes, and initiate transcription at defined genomic locations (helmann2019wheretobegin? pages 1-3, dios2021extracytoplasmicfunctionσ pages 2-4).
For ECF sigma factors, the σ2 domain recognizes the -10 promoter element (typically with sequence motif around TANNNT) and facilitates DNA melting to form the transcription bubble, while the σ4 domain recognizes the -35 promoter element upstream (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2). The spacing between these elements is typically 16-19 base pairs, with some ECF sigma factors showing preference for 18 bp spacers (pospisil2024σeofstreptomyces pages 2-4, pospisil2024σeofstreptomyces pages 1-2). Recent structural and biochemical studies of S. coelicolor σE (another ECF sigma factor, SCO3356) revealed that its -35 motif matches the E. coli σE consensus (GGAACTT), demonstrating conservation of promoter recognition mechanisms despite limited protein sequence similarity (pospisil2024σeofstreptomyces pages 2-4, pospisil2024σeofstreptomyces pages 1-2).
For sigma factors, "substrate specificity" refers to promoter selectivity rather than enzymatic substrate preference. Each sigma factor recognizes a distinct set of promoter sequences, thereby controlling transcription of a specific gene set called a regulon (helmann2019wheretobegin? pages 1-3, dios2021extracytoplasmicfunctionσ pages 2-4). For SCO7099, the actual promoter consensus sequence, target genes, and regulon composition remain completely unknown (lee2024machine‐learninganalysisof pages 10-11). No ChIP-seq, transcriptomic profiling, or promoter mapping studies have identified the DNA-binding specificity or downstream targets of this sigma factor.
Based on domain structure and ECF sigma factor biology, Q9KZ33 is predicted to function as a cytoplasmic protein (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2). ECF sigma factors lack transmembrane domains and are not secreted; they interact with the cytoplasmic RNA polymerase core enzyme to regulate transcription of chromosomal DNA. The name "extracytoplasmic function" refers to their typical role in responding to signals originating outside the cytoplasm (cell envelope stress, extracellular signals, periplasmic events), not to their physical location (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2).
No experimental localization data (e.g., fluorescence microscopy, subcellular fractionation) have been reported for SCO7099. However, sigma factors are inherently cytoplasmic proteins that must access the RNA polymerase machinery associated with the nucleoid region where bacterial chromosomal DNA resides (helmann2019wheretobegin? pages 1-3, dios2021extracytoplasmicfunctionσ pages 2-4).
ECF sigma factors in Streptomyces and other actinobacteria participate in diverse stress responses and regulatory networks (pospisil2024σeofstreptomyces pages 1-2, sekurova2024deletionsofconserved pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2). Well-characterized examples include:
A 2024 machine learning analysis of 454 transcriptome profiles from S. coelicolor identified independently modulated gene groups (iModulons) and predicted putative regulons for 30 sigma factors, revealing complex cross-talk between sigma factor-mediated regulation, secondary metabolism, and stress responses involving iron and phosphate homeostasis (lee2024machine‐learninganalysisof pages 4-8).
No specific pathways, stress responses, or developmental processes have been experimentally linked to SCO7099 (lee2024machine‐learninganalysisof pages 10-11). The sigma factor was not among those with defined iModulons in the comprehensive transcriptomic analysis by Lee et al. (2024), suggesting either low expression levels, conditional activity under untested conditions, or functional redundancy with other sigma factors (lee2024machine‐learninganalysisof pages 4-8).
ECF sigma factors are typically regulated at multiple levels (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2):
For SCO7099, no regulatory mechanism, anti-sigma factor partner, or inducing signal has been identified (lee2024machine‐learninganalysisof pages 10-11).
The only direct experimental study of SCO7099 comes from Lee et al. (2024), who conducted systematic overexpression of sigma factors in S. coelicolor to map their regulons using transcriptomics (lee2024machine‐learninganalysisof pages 10-11). In this study, 64 sigma factors were targeted for overexpression using MBP (maltose-binding protein) fusion constructs driven by the strong ermE promoter in plasmids pIBR25 (high-copy) and pSET152 (integrative, low-copy). Transformants were not obtained for three sigma factors: SCO4908 (SigQ), SCO5243 (SigH), and SCO7099* (lee2024machine‐learninganalysisof pages 10-11, lee2024machine‐learninganalysisof pages 4-8).
The authors noted that "S. coelicolor seemed to be sensitive to increased expression of sigma factors, resulting in growth failure" (lee2024machine‐learninganalysisof pages 10-11). The failure to recover transformants suggests several possible interpretations:
Importantly, these interpretations remain speculative, as the study did not include follow-up experiments (e.g., inducible expression systems, deletion mutants, or dose-response analyses) to distinguish among these possibilities (lee2024machine‐learninganalysisof pages 10-11, lee2024machine‐learninganalysisof pages 4-8).
No published studies have reported:
- Gene deletion or knockout phenotypes for SCO7099
- ChIP-seq or ChIP-chip mapping of DNA-binding sites
- Transcriptomic profiling of SCO7099-dependent genes
- In vitro promoter binding or transcription assays
- Protein structure determination (X-ray, cryo-EM, NMR)
- Protein-protein interaction studies to identify anti-sigma factors or other partners
- Expression pattern analysis under diverse growth conditions
This absence of data places SCO7099 among the many uncharacterized ECF sigma factors in bacteria. A recent review noted that 102 of 157 ECF phylogenetic groups still lack any mechanistic or functional insight (dios2021extracytoplasmicfunctionσ pages 1-2), highlighting the broader challenge of characterizing this diverse protein family.
For context, well-characterized S. coelicolor ECF sigma factors have been studied using multiple complementary approaches. For example, σE (SCO3356):
- ChIP-seq identified 193 binding sites across the genome (pospisil2024σeofstreptomyces pages 2-4, pospisil2024σeofstreptomyces pages 1-2)
- The regulon includes 127 genes, with enrichment for cell envelope and membrane proteins (pospisil2024σeofstreptomyces pages 2-4)
- Promoter motif analysis defined -35 (GGAACTT-like) and -10 consensus sequences (pospisil2024σeofstreptomyces pages 2-4, pospisil2024σeofstreptomyces pages 1-2)
- Kinetic modeling and RT-qPCR revealed both activator and repressor functions (pospisil2024σeofstreptomyces pages 2-4, pospisil2024σeofstreptomyces pages 1-2)
- Stress response was characterized under ethanol, osmotic, oxidative, and heat stress conditions (pospisil2024σeofstreptomyces pages 1-2)
None of these approaches have been applied to SCO7099, leaving its function entirely speculative (lee2024machine‐learninganalysisof pages 10-11).
The presence of ECF_Sigma-70_Domain and RNA_pol_sigma_r3/r4-like domains strongly supports classification as a bona fide sigma factor capable of directing transcription initiation (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2). The WH-like_DNA-bd_sf (winged helix-like DNA-binding) domain is characteristic of the σ4 region that makes sequence-specific contacts with the -35 promoter element (dios2021extracytoplasmicfunctionσ pages 2-4). The NTF2-like_dom_sf may contribute to protein-protein interactions, either with RNAP core or with regulatory partners.
No phylogenetic analysis or comparative genomics study specifically addressing SCO7099 was identified in the retrieved literature. However, ECF sigma factors are broadly distributed across bacterial phyla, with particular expansion in soil-dwelling actinobacteria that face diverse environmental stresses (dios2021extracytoplasmicfunctionσ pages 1-2). A 2024 study examining conserved ECF sigma factors across Streptomyces species did not specifically highlight SCO7099, suggesting it may be less highly conserved than other family members (sekurova2024deletionsofconserved pages 2-4).
Based on available evidence, the most accurate functional annotation for SCO7099 (Q9KZ33) is:
"Uncharacterized extracytoplasmic function (ECF) sigma factor; predicted transcription initiation factor capable of directing RNA polymerase to a specific but unknown set of promoters in Streptomyces coelicolor"
This annotation reflects:
- Strong structural evidence for sigma factor function (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2)
- Classification within the ECF sigma factor family (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2)
- Complete absence of experimental characterization of biological role (lee2024machine‐learninganalysisof pages 10-11)
- Failed experimental attempt at functional investigation through overexpression (lee2024machine‐learninganalysisof pages 10-11)
To functionally annotate SCO7099, the following experimental approaches would be most informative:
SCO7099 exemplifies a broader challenge in bacterial genomics: the gap between sequence-based annotation and functional understanding. Despite powerful bioinformatic predictions, experimental validation remains essential for assigning biological roles. The expansion of ECF sigma factors in Streptomyces genomes—with 64 paralogs in S. coelicolor alone—suggests extensive regulatory specialization, but also presents a daunting task for comprehensive functional characterization (lee2024machine‐learninganalysisof pages 10-11, dios2021extracytoplasmicfunctionσ pages 1-2).
Recent advances in machine learning approaches to transcriptome analysis, high-throughput ChIP-seq, and synthetic biology tools for Streptomyces genetics provide new opportunities to systematically characterize these orphan regulators (lee2024machine‐learninganalysisof pages 4-8, sekurova2024deletionsofconserved pages 2-4). Until such studies are performed for SCO7099, its function remains a matter of inference rather than empirical knowledge.
| Feature | Details | Evidence Type | Citations |
|---|---|---|---|
| Protein identification | UniProt accession: Q9KZ33; ordered locus name: SCO7099; ORF name: SC3A4.25c; protein description: RNA polymerase sigma factor; organism: Streptomyces coelicolor strain ATCC BAA-471 / A3(2) / M145 | Database annotation provided in prompt; organism-level literature context | (lee2024machine‐learninganalysisof pages 10-11, pospisil2024σeofstreptomyces pages 1-2) |
| Gene/protein identity confidence | Identity is consistent with a Streptomyces sigma factor and specifically with an ECF sigma factor-like protein based on domain architecture; no evidence was found that Q9KZ33 refers to a different protein in S. coelicolor | Inference from domain composition and sigma-factor literature | (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2) |
| Protein family | Predicted member of the extracytoplasmic function (ECF) sigma factor family, a compact subgroup of the σ70 family that typically contains only the core promoter-recognition modules | Domain-based inference supported by ECF reviews | (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2) |
| Key domains | UniProt/InterPro domains supplied for Q9KZ33: ECF_Sigma-70_Domain (IPR052704), NTF2-like_dom_sf (IPR032710), RNA_pol_sigma_r3/r4-like (IPR013324), WH-like_DNA-bd_sf (IPR036388) | Database/domain annotation and structural family inference | (dios2021extracytoplasmicfunctionσ pages 2-4) |
| Structural interpretation of domains | ECF sigma factors generally comprise σ2 and σ4 domains separated by a short linker; σ2 contributes promoter melting and -10 recognition, while σ4 binds the -35 element and contacts RNAP/core DNA-binding interfaces | General mechanistic inference from ECF/sigma-factor literature | (helmann2019wheretobegin? pages 1-3, dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2) |
| Predicted molecular function | Likely functions as a transcription initiation specificity factor that associates with core RNA polymerase to redirect transcription toward a specific promoter subset rather than catalyzing a small-molecule reaction | Strong family-based functional inference | (helmann2019wheretobegin? pages 1-3, dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2) |
| Predicted biochemical activity | Expected to bind RNAP core enzyme and recognize target promoter DNA motifs, thereby promoting open-complex formation and transcription initiation at its cognate regulon | General sigma-factor mechanism; no SCO7099-specific assay | (helmann2019wheretobegin? pages 1-3, dios2021extracytoplasmicfunctionσ pages 2-4) |
| Substrate specificity / promoter specificity | For sigma factors, “specificity” refers to promoter selectivity rather than a chemical substrate. For SCO7099, the actual promoter motif, target operons, and regulon remain unknown | Absence of direct SCO7099 data; inference from sigma-factor biology | (helmann2019wheretobegin? pages 1-3, dios2021extracytoplasmicfunctionσ pages 2-4) |
| Cellular localization | Predicted cytoplasmic protein acting on chromosomal DNA through interaction with cytoplasmic RNAP; ECF sigma factors sense extracytoplasmic stress indirectly but function in the cytoplasm | Family-based inference from ECF biology | (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2) |
| Biological context in S. coelicolor | S. coelicolor encodes an unusually large sigma-factor repertoire: 64–65 sigma factors have been reported, with many belonging to the ECF class; this supports assignment of SCO7099 to a large stress-responsive regulatory landscape | Organism-level literature evidence | (lee2024machine‐learninganalysisof pages 10-11, pospisil2024σeofstreptomyces pages 1-2) |
| Experimental evidence directly on SCO7099 | In a 2024 large-scale sigma-factor overexpression study, transformants were not obtained for SCO7099, unlike many other sigma factors; the authors listed SCO7099 as having unknown function | Direct experimental observation | (lee2024machine‐learninganalysisof pages 10-11) |
| Interpretation of failed overexpression | Failure to recover transformants suggests that elevated SCO7099 expression may be toxic, impose severe transcriptional burden, or disrupt essential regulatory balance, but this was not resolved experimentally | Experimental observation plus cautious interpretation by context | (lee2024machine‐learninganalysisof pages 10-11, lee2024machine‐learninganalysisof pages 4-8) |
| Known regulon | No regulon has been defined for SCO7099 by ChIP-seq, transcriptomics, promoter mapping, or genetic analysis in the retrieved literature | Negative evidence / knowledge gap | (lee2024machine‐learninganalysisof pages 10-11) |
| Known inducing conditions | Unknown. No published stress condition, nutrient cue, developmental stage, or envelope signal was identified specifically for SCO7099 | Negative evidence / knowledge gap | (lee2024machine‐learninganalysisof pages 10-11) |
| Known pathway involvement | Unknown. By analogy to other Streptomyces ECF sigma factors, SCO7099 could participate in stress signaling or cell-envelope-associated transcriptional control, but there is no direct pathway assignment | Comparative inference only | (pospisil2024σeofstreptomyces pages 1-2, sekurova2024deletionsofconserved pages 2-4, dios2021extracytoplasmicfunctionσ pages 2-4) |
| Comparison to characterized ECF sigma factors | Other S. coelicolor ECF sigma factors such as σE have defined stress-responsive regulons and promoter motifs, illustrating what is currently missing for SCO7099: promoter map, regulon, activation signal, and phenotype | Comparative literature context | (pospisil2024σeofstreptomyces pages 2-4, pospisil2024σeofstreptomyces pages 1-2) |
| Current functional annotation status | Best current annotation is “uncharacterized ECF sigma factor / RNA polymerase sigma factor”; a more specific biological-process assignment is not justified by current evidence | Synthesis of direct and indirect evidence | (lee2024machine‐learninganalysisof pages 10-11, dios2021extracytoplasmicfunctionσ pages 2-4) |
| Highest-priority knowledge gaps | Missing data include: native expression pattern, anti-sigma partner (if any), activating signal, promoter consensus, direct DNA-binding sites, regulon composition, knockout phenotype, and role in development or secondary metabolism | Research-gap synthesis | (lee2024machine‐learninganalysisof pages 10-11, sekurova2024deletionsofconserved pages 2-4, dios2021extracytoplasmicfunctionσ pages 2-4) |
Table: This table summarizes what is currently known and not known about SCO7099 (UniProt Q9KZ33) in Streptomyces coelicolor. It highlights that the protein is strongly predicted to be an ECF sigma factor, but remains experimentally uncharacterized beyond a reported failure to recover overexpression transformants.
SCO7099 (UniProt Q9KZ33) encodes a predicted ECF sigma factor in Streptomyces coelicolor with a domain architecture consistent with transcription initiation function. Based on structural features, the protein is expected to function as a cytoplasmic regulator that directs RNA polymerase to a specific subset of promoters, likely in response to environmental or developmental signals. However, no experimental data define its regulon, promoter specificity, inducing conditions, regulatory partners, or physiological role.
The only experimental study directly addressing SCO7099 reported failure to obtain overexpression transformants, suggesting potential toxicity or regulatory constraints but providing no functional insights (lee2024machine‐learninganalysisof pages 10-11). Functional annotation of SCO7099 remains a priority for understanding the regulatory complexity of Streptomyces coelicolor, particularly given the organism's medical and biotechnological importance.
Primary sources cited throughout this report are indicated by context IDs (lee2024machine‐learninganalysisof pages 11-12, dios2021extracytoplasmicfunctionσ pages 1-2), corresponding to:
- Lee et al. (2024) - Machine learning analysis of S. coelicolor transcriptomes (lee2024machine‐learninganalysisof pages 10-11, lee2024machine‐learninganalysisof pages 4-8)
- Pospíšil et al. (2024) - σE functional characterization in S. coelicolor (pospisil2024σeofstreptomyces pages 2-4, pospisil2024σeofstreptomyces pages 1-2)
- Sekurova et al. (2024) - ECF sigma factor deletions and secondary metabolism (sekurova2024deletionsofconserved pages 2-4)
- Helmann (2019) - Sigma factors and transcription initiation (helmann2019wheretobegin? pages 1-3)
- de Dios et al. (2021) - ECF sigma factors as stress response coordinators (dios2021extracytoplasmicfunctionσ pages 2-4, dios2021extracytoplasmicfunctionσ pages 1-2)
References
(lee2024machine‐learninganalysisof pages 10-11): Yongjae Lee, Donghui Choe, Bernhard O. Palsson, and Byung‐Kwan Cho. Machine‐learning analysis of streptomyces coelicolor transcriptomes reveals a transcription regulatory network encompassing biosynthetic gene clusters. Advanced Science, Sep 2024. URL: https://doi.org/10.1002/advs.202403912, doi:10.1002/advs.202403912. This article has 12 citations and is from a peer-reviewed journal.
(pospisil2024σeofstreptomyces pages 1-2): Jiří Pospíšil, Marek Schwarz, Alice Ziková, Dragana Vítovská, Miluše Hradilová, Michal Kolář, Alena Křenková, Martin Hubálek, Libor Krásný, and Jiří Vohradský. Σe of streptomyces coelicolor can function both as a direct activator or repressor of transcription. Communications Biology, Jan 2024. URL: https://doi.org/10.1038/s42003-023-05716-y, doi:10.1038/s42003-023-05716-y. This article has 7 citations and is from a peer-reviewed journal.
(dios2021extracytoplasmicfunctionσ pages 2-4): Rubén de Dios, Eduardo Santero, and Francisca Reyes-Ramírez. Extracytoplasmic function σ factors as tools for coordinating stress responses. International Journal of Molecular Sciences, 22:3900, Apr 2021. URL: https://doi.org/10.3390/ijms22083900, doi:10.3390/ijms22083900. This article has 29 citations.
(dios2021extracytoplasmicfunctionσ pages 1-2): Rubén de Dios, Eduardo Santero, and Francisca Reyes-Ramírez. Extracytoplasmic function σ factors as tools for coordinating stress responses. International Journal of Molecular Sciences, 22:3900, Apr 2021. URL: https://doi.org/10.3390/ijms22083900, doi:10.3390/ijms22083900. This article has 29 citations.
(helmann2019wheretobegin? pages 1-3): John D. Helmann. Where to begin? sigma factors and the selectivity of transcription initiation in bacteria. Molecular Microbiology, 112:335-347, Jun 2019. URL: https://doi.org/10.1111/mmi.14309, doi:10.1111/mmi.14309. This article has 105 citations and is from a domain leading peer-reviewed journal.
(pospisil2024σeofstreptomyces pages 2-4): Jiří Pospíšil, Marek Schwarz, Alice Ziková, Dragana Vítovská, Miluše Hradilová, Michal Kolář, Alena Křenková, Martin Hubálek, Libor Krásný, and Jiří Vohradský. Σe of streptomyces coelicolor can function both as a direct activator or repressor of transcription. Communications Biology, Jan 2024. URL: https://doi.org/10.1038/s42003-023-05716-y, doi:10.1038/s42003-023-05716-y. This article has 7 citations and is from a peer-reviewed journal.
(sekurova2024deletionsofconserved pages 2-4): Olga N. Sekurova, Martin Zehl, Michael Predl, Peter Hunyadi, Thomas Rattei, and Sergey B. Zotchev. Deletions of conserved extracytoplasmic function sigma factors-encoding genes in streptomyces have a major impact on secondary metabolism. Microbial Cell Factories, Jul 2024. URL: https://doi.org/10.1186/s12934-024-02479-x, doi:10.1186/s12934-024-02479-x. This article has 1 citations and is from a peer-reviewed journal.
(lee2024machine‐learninganalysisof pages 4-8): Yongjae Lee, Donghui Choe, Bernhard O. Palsson, and Byung‐Kwan Cho. Machine‐learning analysis of streptomyces coelicolor transcriptomes reveals a transcription regulatory network encompassing biosynthetic gene clusters. Advanced Science, Sep 2024. URL: https://doi.org/10.1002/advs.202403912, doi:10.1002/advs.202403912. This article has 12 citations and is from a peer-reviewed journal.
(lee2024machine‐learninganalysisof pages 11-12): Yongjae Lee, Donghui Choe, Bernhard O. Palsson, and Byung‐Kwan Cho. Machine‐learning analysis of streptomyces coelicolor transcriptomes reveals a transcription regulatory network encompassing biosynthetic gene clusters. Advanced Science, Sep 2024. URL: https://doi.org/10.1002/advs.202403912, doi:10.1002/advs.202403912. This article has 12 citations and is from a peer-reviewed journal.
id: Q9KZ33
gene_symbol: Q9KZ33
product_type: PROTEIN
status: COMPLETE
taxon:
id: NCBITaxon:100226
label: Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145)
description: >-
SCO7099 (Q9KZ33) is a predicted extracytoplasmic function (ECF) sigma factor
in Streptomyces coelicolor A3(2). It contains ECF Sigma-70, RNA polymerase
sigma r3/r4-like, and winged helix-like DNA-binding domains, consistent with
the minimal two-domain architecture (sigma2 and sigma4) characteristic of ECF
sigma factors. As a sigma factor, it is expected to associate with the RNA
polymerase core enzyme and direct transcription initiation at a specific but
currently unknown set of promoters, likely in response to extracytoplasmic
signals. S. coelicolor encodes 64-65 sigma factors, the majority belonging to
the ECF family. SCO7099 remains functionally uncharacterized: no regulon,
promoter specificity, inducing signal, anti-sigma factor partner, or knockout
phenotype has been determined. A 2024 systematic overexpression study failed
to recover transformants for SCO7099, suggesting potential toxicity from
uncontrolled expression, but no follow-up experiments resolved this
observation.
existing_annotations:
- term:
id: GO:0016987
label: sigma factor activity
evidence_type: IBA
original_reference_id: GO_REF:0000033
qualifier: enables
review:
summary: >-
Sigma factor activity is well-supported by domain architecture. Q9KZ33
contains the ECF_Sigma-70_Domain (IPR052704), RNA_pol_sigma_r3/r4-like
(IPR013324), and WH-like_DNA-bd_sf (IPR036388) domains, which constitute
the minimal sigma2 and sigma4 domain architecture required for promoter
recognition and RNA polymerase interaction. The IBA evidence is based on
PANTHER phylogenetic inference (PTN001249270) with L0TCG5 (SigJ from
Mycobacterium tuberculosis) as the with/from reference, a genuine ECF
sigma factor ortholog in actinobacteria.
action: ACCEPT
reason: >-
The domain architecture unambiguously supports classification as an ECF
sigma factor. The PANTHER phylogenetic inference is well-grounded, with
a bona fide ECF sigma factor ortholog as the reference. Although no
direct experimental evidence exists for Q9KZ33 itself, the structural
and phylogenetic evidence for sigma factor activity is strong.
supported_by:
- reference_id: file:STRCO/Q9KZ33/Q9KZ33-deep-research-falcon.md
supporting_text: >-
The presence of ECF_Sigma-70_Domain and RNA_pol_sigma_r3/r4-like
domains strongly supports classification as a bona fide sigma factor
capable of directing transcription initiation
- term:
id: GO:2000142
label: regulation of DNA-templated transcription initiation
evidence_type: IEA
original_reference_id: GO_REF:0000108
qualifier: involved_in
review:
summary: >-
This annotation is a correct logical inference from the sigma factor
activity (GO:0016987) annotation via GO_REF:0000108 (inter-ontology
logical links). Sigma factors by definition modulate transcription
initiation by directing RNA polymerase to specific promoters, so
involvement in regulation of DNA-templated transcription initiation
follows directly from the molecular function annotation.
action: ACCEPT
reason: >-
This BP annotation is automatically derived from the MF sigma factor
activity annotation using inter-ontology inference rules. Since sigma
factors regulate which promoters RNA polymerase recognizes, they
inherently regulate transcription initiation. The inference is
biologically sound.
supported_by:
- reference_id: file:STRCO/Q9KZ33/Q9KZ33-deep-research-falcon.md
supporting_text: >-
sigma factors serve as dissociable regulatory subunits of bacterial
RNA polymerase...the sigma factor binds specifically to promoter
elements
references:
- id: GO_REF:0000033
title: Annotation inferences using phylogenetic trees
findings:
- statement: >-
Q9KZ33 is phylogenetically related to characterized ECF sigma factors
including L0TCG5 (SigJ) from Mycobacterium tuberculosis
- id: GO_REF:0000108
title: Automatic assignment of GO terms using logical inference, based on inter-ontology
links
findings:
- statement: >-
Regulation of transcription initiation is logically inferred from
sigma factor activity via inter-ontology links
- id: file:STRCO/Q9KZ33/Q9KZ33-deep-research-falcon.md
title: Deep research summary for Q9KZ33/SCO7099
findings:
- statement: >-
SCO7099 is a predicted ECF sigma factor with sigma2 and sigma4
domain architecture
supporting_text: >-
The presence of ECF_Sigma-70_Domain and RNA_pol_sigma_r3/r4-like
domains strongly supports classification as a bona fide sigma factor
capable of directing transcription initiation
- statement: >-
SCO7099 remains functionally uncharacterized with no defined regulon
supporting_text: >-
For SCO7099, the actual promoter consensus sequence, target genes,
and regulon composition remain completely unknown
- statement: >-
Overexpression of SCO7099 failed in a systematic study
supporting_text: >-
Transformants were not obtained for three sigma factors: SCO4908
(SigQ), SCO5243 (SigH), and SCO7099
core_functions:
- description: >-
SCO7099 is a predicted ECF sigma factor that associates with RNA
polymerase core enzyme to direct transcription initiation at specific
promoters. Its regulon, promoter specificity, and inducing signal
remain unknown.
molecular_function:
id: GO:0016987
label: sigma factor activity
directly_involved_in:
- id: GO:2000142
label: regulation of DNA-templated transcription initiation
locations:
- id: GO:0005737
label: cytoplasm
supported_by:
- reference_id: file:STRCO/Q9KZ33/Q9KZ33-deep-research-falcon.md
supporting_text: >-
sigma factors serve as dissociable regulatory subunits of bacterial
RNA polymerase...the sigma factor binds specifically to promoter
elements
proposed_new_terms: []
suggested_questions:
- question: >-
What is the regulon of SCO7099 and under what conditions is it active?
experts: []
- question: >-
Does SCO7099 have a cognate anti-sigma factor encoded in its genomic
neighborhood?
experts: []
- question: >-
Is the failure to recover overexpression transformants due to toxicity
from transcriptional dysregulation, or a technical artifact?
experts: []
suggested_experiments:
- description: >-
Construct a clean deletion mutant of SCO7099 and assess phenotypes
under diverse stress conditions, developmental stages, and nutrient
limitations.
hypothesis: >-
SCO7099 deletion may reveal condition-specific phenotypes related to
extracytoplasmic stress response.
- description: >-
Use an inducible promoter system (e.g., tipA-thiostrepton) to express
SCO7099 at controlled levels and profile transcriptome changes by
RNA-seq.
hypothesis: >-
Controlled expression will identify the SCO7099 regulon without the
toxicity observed during constitutive overexpression.
- description: >-
ChIP-seq with epitope-tagged SCO7099 expressed from its native locus
to map genome-wide DNA-binding sites.
hypothesis: >-
SCO7099 binds a specific set of promoters with a characteristic ECF
sigma factor -35/-10 motif.