ComK is the master transcriptional regulator of genetic competence in Bacillus subtilis, functioning as a DNA-binding transcription activator that controls the expression of late competence genes required for DNA uptake, processing, and recombination. ComK binds to specific AT-rich motifs called K-boxes (consensus AAAA-N5-TTTT) in promoter regions as a tetramer (dimer-of-dimers) to activate transcription of target genes including the comC, comE, comF, comG operons encoding DNA uptake machinery, as well as recA and nucA involved in DNA processing. A critical feature of ComK regulation is its positive autoregulation at its own promoter (PcomK), which creates a bistable switch leading to bimodal population behavior where only 5-10% of cells enter the competent K-state. ComK protein levels are tightly controlled through post-translational degradation by the MecA-ClpC-ClpP protease system; the adaptor protein MecA targets ComK to the ClpC-ClpP protease for degradation. This proteolytic control is relieved by ComS, a small protein produced from the quorum-sensing regulated srf operon, which competes with ComK for MecA binding and thereby stabilizes ComK. ComK expression is also negatively regulated at the transcriptional level by multiple repressors including AbrB, Rok, and CodY, with Spo0A-P relieving AbrB repression during transition to stationary phase. The ComK regulon encompasses at least 165 genes, making it a central hub connecting quorum sensing, nutritional status, and stress responses to the developmental decision to enter the competent state.
| GO Term | Evidence | Action | Reason |
|---|---|---|---|
|
GO:0003677
DNA binding
|
IEA
GO_REF:0000043 |
MODIFY |
Summary: ComK is definitively a DNA-binding protein. It binds to specific AT-rich K-box motifs (AAAA-N5-TTTT) in promoter regions to activate transcription of competence genes. However, the term GO:0003677 "DNA binding" is too general - it does not capture the specific role of ComK as a sequence-specific DNA-binding transcription activator.
Reason: While ComK does bind DNA, the generic "DNA binding" term is insufficiently informative. ComK is specifically a sequence-specific DNA-binding transcription factor that binds K-box regulatory elements to activate gene expression. The more specific term GO:0001216 "DNA-binding transcription activator activity" more accurately captures ComK's molecular function as it includes both the DNA binding and transcriptional activation aspects of its role.
Proposed replacements:
DNA-binding transcription activator activity
Supporting Evidence:
file:BACSU/comK/comK-deep-research-falcon.md
ComK is a DNA-binding transcription activator that triggers the K-state (competent state) by upregulating late competence genes required for DNA uptake, processing, and recombination
file:BACSU/comK/comK-deep-research-falcon.md
ComK binds promoter K-boxes, typically recognized as a symmetric AT-rich motif AAAA–N5–TTTT; ComK forms a dimer-of-dimers (tetramer) to bind K-box elements and activate transcription at competence promoters including comG and its own comK promoter
|
|
GO:0006351
DNA-templated transcription
|
IEA
GO_REF:0000043 |
MODIFY |
Summary: ComK is involved in transcription as a transcription factor, but GO:0006351 "DNA-templated transcription" refers to the actual synthesis of RNA from a DNA template (the catalytic process performed by RNA polymerase). ComK is a regulatory transcription factor that modulates transcription but does not directly catalyze RNA synthesis. A more appropriate annotation would be a regulatory process term.
Reason: GO:0006351 is the core transcription process (RNA synthesis from DNA template), which is carried out by RNA polymerase, not by transcription factors. ComK's role is to regulate transcription by binding to promoter elements and recruiting/activating the transcription machinery. The appropriate biological process term would be GO:0045893 "positive regulation of DNA-templated transcription" which accurately describes ComK's role in activating transcription of target genes.
Proposed replacements:
positive regulation of DNA-templated transcription
Supporting Evidence:
file:BACSU/comK/comK-deep-research-falcon.md
Genome-scale analyses identified at least ~165 ComK-activated genes, many in operons preceding identifiable ComK-box motifs, including DNA uptake and recombination loci (e.g., comCDEFG, comFA, comEA, nucA, recA)
file:BACSU/comK/comK-deep-research-falcon.md
ComK is a DNA-binding transcription activator that triggers the K-state (competent state) by upregulating late competence genes required for DNA uptake, processing, and recombination
|
|
GO:0030420
establishment of competence for transformation
|
IEA
GO_REF:0000120 |
ACCEPT |
Summary: This is the most appropriate biological process term for ComK. ComK is the master regulator that directly controls the establishment of competence for transformation in B. subtilis. ComK activates all the late competence genes required for DNA uptake and processing.
Reason: ComK is definitively required for the establishment of genetic competence. ComK directly activates genes encoding DNA uptake machinery (comC, comE, comF, comG operons) and DNA processing enzymes (nucA, recA). The term accurately captures the biological process in which ComK participates. The deep research confirms ComK triggers the K-state (competent state) by upregulating late competence genes.
Supporting Evidence:
file:BACSU/comK/comK-deep-research-falcon.md
ComK is a DNA-binding transcription activator that triggers the K-state (competent state) by upregulating late competence genes required for DNA uptake, processing, and recombination
file:BACSU/comK/comK-deep-research-falcon.md
Genome-scale analyses identified at least ~165 ComK-activated genes, many in operons preceding identifiable ComK-box motifs, including DNA uptake and recombination loci (e.g., comCDEFG, comFA, comEA, nucA, recA)
|
|
GO:0045809
positive regulation of establishment of competence for transformation
|
TAS
file:BACSU/comK/comK-deep-research-falcon.md |
NEW |
Summary: ComK is the master positive regulator of competence establishment. It activates transcription of all late competence genes and also positively regulates its own expression through an autoregulatory feedback loop.
Reason: This term should be added as it specifically captures ComK's regulatory role in competence. ComK positively regulates the establishment of competence by activating transcription of late competence genes. The positive autoregulation at the comK promoter creates a bistable switch that controls entry into the competent state. This term complements GO:0030420 by specifying the regulatory relationship.
Supporting Evidence:
file:BACSU/comK/comK-deep-research-falcon.md
ComK is a DNA-binding transcription activator that triggers the K-state (competent state) by upregulating late competence genes required for DNA uptake, processing, and recombination; it also activates its own transcription, forming a positive feedback loop (autoregulation)
file:BACSU/comK/comK-deep-research-falcon.md
Positive feedback at PcomK creates a switch-like, bimodal population in which only a subpopulation becomes competent (K-state)
|
|
GO:0000976
transcription cis-regulatory region binding
|
TAS
file:BACSU/comK/comK-deep-research-falcon.md |
NEW |
Summary: ComK binds to specific cis-regulatory regions (K-boxes) in promoters of competence genes. The K-box is a well-characterized AT-rich motif (AAAA-N5-TTTT) that ComK recognizes as a tetramer.
Reason: This molecular function term captures the specific DNA-binding activity of ComK at promoter regulatory regions. ComK binds K-box elements which are cis-regulatory regions in promoters of competence genes. This term provides more specific information about the nature of ComK's DNA binding activity compared to the generic GO:0003677.
Supporting Evidence:
file:BACSU/comK/comK-deep-research-falcon.md
ComK binds promoter K-boxes, typically recognized as a symmetric AT-rich motif AAAA–N5–TTTT; ComK forms a dimer-of-dimers (tetramer) to bind K-box elements and activate transcription at competence promoters including comG and its own comK promoter
|
Q: What is the precise molecular mechanism by which ComK tetramer binding to K-boxes recruits RNA polymerase and activates transcription? While ComK binding to K-boxes is established, the detailed mechanism of transcription activation (interaction with sigma factors, RNAP recruitment) is less clear.
Q: Are there ComK-repressed genes beyond rok, and what is the mechanism of repression? UniProt notes ComK may repress some genes, but the extent and mechanism is unclear.
Experiment: ChIP-seq of ComK in competent B. subtilis cells to comprehensively map K-box binding sites genome-wide. This would provide definitive map of direct ComK targets versus indirect effects.
Experiment: Structural studies of ComK tetramer bound to K-box DNA. This would reveal molecular basis for K-box recognition and tetramer assembly.
Exported on March 22, 2026 at 02:23 AM
Organism: Bacillus subtilis
Sequence:
MSQKTDAPLESYEVNGATIAVLPEEIDGKICSKIIEKDCVFYVNMKPLQIVDRSCRFFGSSYAGRKAGTYEVTKISHKPPIMVDPSNQIFLFPTLSSTRPQCGWISHVHVKEFKATEFDDTEVTFSNGKTMELPISYNSFENQVYRTAWLRTKFQDRIDHRVPKRQEFMLYPKEERTKMIYDFILRELGERY
I begin with the InterPro architecture: a single, nearly full-length competence factor signature, IPR010461 (Competence protein ComK family) spanning residues 5–191. This solitary, lineage-defining domain occupies essentially the entire polypeptide, which indicates a dedicated, non-modular regulator rather than a multi-domain enzyme. ComK-family architectures in Gram-positive bacteria are characterized by sequence features that favor nucleic-acid-associated transcriptional regulation and assembly of competence-specific complexes. The absence of additional catalytic or transmembrane modules implies a soluble regulator that exerts function through macromolecular binding and complex formation rather than through enzymatic turnover or membrane transport.
From this architecture, the molecular function is best explained by multivalent macromolecular binding that enables transcriptional control. A full-length competence regulator typically binds both DNA and partner proteins to switch on the competence regulon. This supports a core functional profile centered on protein binding (GO:0005515) as the minimal, well-supported activity that enables higher-order regulation and complex assembly. The ComK-family signature causally points to a transcriptional switch behavior: the domain’s prevalence in competence systems indicates that the protein’s primary biochemical role is to assemble and stabilize regulatory assemblies that activate competence gene expression.
These molecular activities feed directly into the biological process of genetic competence. By orchestrating the transcriptional state that enables uptake and processing of extracellular DNA, the regulator drives the broader competence program. Thus, the process aligns with genetic competence (GO:0009294), where the regulatory cascade licenses DNA acquisition and downstream recombination/repair pathways.
The cellular context follows from the soluble, non-membranous architecture and competence-specific transcriptional control. With no signal peptides or transmembrane segments and a regulator-like domain, the protein operates in the cytoplasm, consistent with a soluble assembly hub coordinating transcriptional machinery and competence-specific factors; this matches a cytoplasmic localization (GO:0005737).
Mechanistically, I infer that the ComK-family core forms a cytoplasmic regulatory scaffold that nucleates competence assemblies. By binding partner proteins and likely engaging promoter-proximal DNA, it elevates transcription of competence genes and couples them to downstream uptake pathways. This model predicts transient interactions with competence transcriptional modules and uptake machinery: cytoplasmic competence factors that drive DNA acquisition, along with general transcriptional regulators and RNA-associated assemblies. Such partnerships would allow the protein to gate the competence regulon, synchronize transcriptional activation with cytoplasmic uptake readiness, and ensure spatiotemporal fidelity of the competence state.
A cytoplasmic competence regulator in Bacillus subtilis that assembles and stabilizes macromolecular complexes to activate the competence program. By using a competence-specific regulatory core, it binds partner proteins to orchestrate transcriptional switching of the competence regulon and coordinate downstream DNA uptake readiness. Its soluble architecture and regulatory role position it as a cytoplasmic hub that couples transcriptional control with the broader machinery of genetic competence.
Involved in genetic competence.
IPR010461, family) — residues 5-191Molecular Function: molecular_function (GO:0003674), binding (GO:0005488), protein binding (GO:0005515)
Biological Process: biological_process (GO:0008150), cellular process (GO:0009987), cellular component organization or biogenesis (GO:0071840), cellular component organization (GO:0016043), cellular component biogenesis (GO:0044085), protein-containing complex organization (GO:0043933), cellular component assembly (GO:0022607), protein-containing complex assembly (GO:0065003), protein complex oligomerization (GO:0051259), protein homooligomerization (GO:0051260)
Cellular Component: cellular_component (GO:0005575), cellular anatomical entity (GO:0110165), intracellular immature spore (GO:0042763)
Generated by BioReason
Exported on March 22, 2026 at 02:23 AM
Organism: Bacillus subtilis
Sequence:
MSQKTDAPLESYEVNGATIAVLPEEIDGKICSKIIEKDCVFYVNMKPLQIVDRSCRFFGSSYAGRKAGTYEVTKISHKPPIMVDPSNQIFLFPTLSSTRPQCGWISHVHVKEFKATEFDDTEVTFSNGKTMELPISYNSFENQVYRTAWLRTKFQDRIDHRVPKRQEFMLYPKEERTKMIYDFILRELGERY
I begin with the InterPro architecture: a single, nearly full-length competence factor signature, IPR010461 (Competence protein ComK family) spanning residues 5–191. This solitary, lineage-defining domain occupies essentially the entire polypeptide, which indicates a dedicated, non-modular regulator rather than a multi-domain enzyme. ComK-family architectures in Gram-positive bacteria are characterized by sequence features that favor nucleic-acid-associated transcriptional regulation and assembly of competence-specific complexes. The absence of additional catalytic or transmembrane modules implies a soluble regulator that exerts function through macromolecular binding and complex formation rather than through enzymatic turnover or membrane transport.
From this architecture, the molecular function is best explained by multivalent macromolecular binding that enables transcriptional control. A full-length competence regulator typically binds both DNA and partner proteins to switch on the competence regulon. This supports a core functional profile centered on protein binding (GO:0005515) as the minimal, well-supported activity that enables higher-order regulation and complex assembly. The ComK-family signature causally points to a transcriptional switch behavior: the domain’s prevalence in competence systems indicates that the protein’s primary biochemical role is to assemble and stabilize regulatory assemblies that activate competence gene expression.
These molecular activities feed directly into the biological process of genetic competence. By orchestrating the transcriptional state that enables uptake and processing of extracellular DNA, the regulator drives the broader competence program. Thus, the process aligns with genetic competence (GO:0009294), where the regulatory cascade licenses DNA acquisition and downstream recombination/repair pathways.
The cellular context follows from the soluble, non-membranous architecture and competence-specific transcriptional control. With no signal peptides or transmembrane segments and a regulator-like domain, the protein operates in the cytoplasm, consistent with a soluble assembly hub coordinating transcriptional machinery and competence-specific factors; this matches a cytoplasmic localization (GO:0005737).
Mechanistically, I infer that the ComK-family core forms a cytoplasmic regulatory scaffold that nucleates competence assemblies. By binding partner proteins and likely engaging promoter-proximal DNA, it elevates transcription of competence genes and couples them to downstream uptake pathways. This model predicts transient interactions with competence transcriptional modules and uptake machinery: cytoplasmic competence factors that drive DNA acquisition, along with general transcriptional regulators and RNA-associated assemblies. Such partnerships would allow the protein to gate the competence regulon, synchronize transcriptional activation with cytoplasmic uptake readiness, and ensure spatiotemporal fidelity of the competence state.
A cytoplasmic competence regulator in Bacillus subtilis that assembles and stabilizes macromolecular complexes to activate the competence program. By using a competence-specific regulatory core, it binds partner proteins to orchestrate transcriptional switching of the competence regulon and coordinate downstream DNA uptake readiness. Its soluble architecture and regulatory role position it as a cytoplasmic hub that couples transcriptional control with the broader machinery of genetic competence.
Involved in genetic competence.
IPR010461, family) — residues 5-191Molecular Function: molecular_function (GO:0003674), binding (GO:0005488), protein binding (GO:0005515)
Biological Process: biological_process (GO:0008150), cellular process (GO:0009987), cellular component organization or biogenesis (GO:0071840), cellular component organization (GO:0016043), cellular component biogenesis (GO:0044085), protein-containing complex organization (GO:0043933), cellular component assembly (GO:0022607), protein-containing complex assembly (GO:0065003), protein complex oligomerization (GO:0051259), protein homooligomerization (GO:0051260)
Cellular Component: cellular_component (GO:0005575), cellular anatomical entity (GO:0110165), intracellular immature spore (GO:0042763)
Generated by BioReason
provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2025-12-17T23:25:13.079380'
end_time: '2025-12-17T23:31:15.032269'
duration_seconds: 361.95
template_file: templates/gene_research_go_focused.md
template_variables:
organism: BACSU
gene_id: comK
gene_symbol: comK
uniprot_accession: P40396
protein_description: 'RecName: Full=Competence transcription factor; Short=CTF;
AltName: Full=Competence protein K;'
gene_info: Name=comK; OrderedLocusNames=BSU10420;
organism_full: Bacillus subtilis (strain 168).
protein_family: Not specified in UniProt
protein_domains: ComK. (IPR010461); ComK (PF06338)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 37
BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.
DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'comK' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information
Please provide a comprehensive research report on the gene comK (gene ID: comK, UniProt: P40396) in BACSU.
The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.
You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.
We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.
We are interested in where in or outside the cell the gene product carries out its function.
We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.
Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.
Question: You are an expert researcher providing comprehensive, well-cited information.
Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies
Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.
BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.
DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'comK' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information
Please provide a comprehensive research report on the gene comK (gene ID: comK, UniProt: P40396) in BACSU.
The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.
You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.
We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.
We are interested in where in or outside the cell the gene product carries out its function.
We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.
Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.
Plan and identity verification
- Identity: The target is comK (UniProt P40396), Bacillus subtilis strain 168, encoding the competence transcription factor ComK; literature consistently refers to B. subtilis ComK as the master regulator of genetic competence, matching the UniProt description and the ComK family/domain annotations (PF06338/IPR010461) (maier2020competenceandtransformation pages 3-5, berka2002microarrayanalysisof pages 1-2).
Comprehensive research report
1) Key concepts and definitions with current understanding
- Primary function: ComK is a DNA-binding transcription activator that triggers the K-state (competent state) by upregulating late competence genes required for DNA uptake, processing, and recombination; it also activates its own transcription, forming a positive feedback loop (autoregulation) (berka2002microarrayanalysisof pages 1-2, maier2020competenceandtransformation pages 1-3). URL and date: Berka et al., Mol Microbiol, 2002-03, https://doi.org/10.1046/j.1365-2958.2002.02833.x; Maier, Current Issues Mol Biol, 2020-01, https://doi.org/10.21775/cimb.037.057.
- DNA-binding mechanism (K-box): ComK binds promoter K-boxes, typically recognized as a symmetric AT-rich motif AAAA–N5–TTTT; ComK forms a dimer-of-dimers (tetramer) to bind K-box elements and activate transcription at competence promoters including comG and its own comK promoter (maier2020competenceandtransformation pages 3-5). URL and date: Maier, 2020-01, https://doi.org/10.21775/cimb.037.057.
- Regulon: Genome-scale analyses identified at least ~165 ComK-activated genes, many in operons preceding identifiable ComK-box motifs, including DNA uptake and recombination loci (e.g., comCDEFG, comFA, comEA, nucA, recA) (berka2002microarrayanalysisof pages 1-2). URL and date: Berka et al., 2002-03, https://doi.org/10.1046/j.1365-2958.2002.02833.x.
- Cellular localization of function: ComK acts in the cytoplasm as a transcription factor, binding DNA at promoter regions to recruit/assist RNA polymerase at competence gene promoters; downstream competence machinery spans the envelope (ComEA receptor, DNA translocation complexes), but ComK’s action is nuclear equivalent—i.e., on chromosomal promoters (harms2024molecularmechanismsofa pages 51-54, harms2024molecularmechanismsof pages 51-54). URL and date: Harms, 2024, unknown venue.
2) Regulatory network and pathway role
- Autoregulation and bistability: Positive feedback at PcomK creates a switch-like, bimodal population in which only a subpopulation becomes competent (K-state) (berka2002microarrayanalysisof pages 1-2, maier2020competenceandtransformation pages 1-3). URL and dates: Berka 2002-03, https://doi.org/10.1046/j.1365-2958.2002.02833.x; Maier 2020-01, https://doi.org/10.21775/cimb.037.057.
- Post-translational control by MecA–ClpC/ClpP and ComS: MecA binds ComK and targets it to ClpC/ClpP for ATP-dependent proteolysis; ComS, produced from the quorum-sensing srf operon, competes for MecA and thereby protects/stabilizes ComK. This MecA–ClpC/ClpP–ComS axis constitutes a molecular switch controlling ComK stability and competence entry (berka2002microarrayanalysisof pages 1-2, maier2020competenceandtransformation pages 3-5, harms2024molecularmechanismsof pages 51-54). URLs and dates: Berka 2002-03, https://doi.org/10.1046/j.1365-2958.2002.02833.x; Maier 2020-01, https://doi.org/10.21775/cimb.037.057.
- Quorum sensing and ComA→ComS coupling: The ComX pheromone activates ComP→ComA~P, inducing the srf operon that encodes ComS; higher ComS reduces ComK proteolysis and permits competence (rahmer2015constructionofa pages 1-2). URL and date: Rahmer et al., Front Microbiol, 2015-12, https://doi.org/10.3389/fmicb.2015.01431.
- Transcriptional regulators upstream of comK: AbrB, Rok, and CodY repress PcomK (and competence), while Spo0A~P relieves AbrB repression in early transition states. DegU can prime ComK action at low ComK levels; global integration links competence, stress, and developmental programs (rahmer2015constructionofa pages 1-2, harms2024molecularmechanismsofa pages 51-54). URL and date: Rahmer 2015-12, https://doi.org/10.3389/fmicb.2015.01431; Harms 2024.
- Stress sigma SigB and antisense regulation: Activation of SigB induces an antisense transcript as-comK that forms duplex with comK mRNA, leading to RNase-mediated degradation and suppression of ComK protein and transformability, thereby prioritizing stress protection over competence (harms2024molecularmechanismsof pages 75-78). URL and date: Harms, 2024.
- Additional layers and chromosomal control: Rho termination factor levels can reprogram stationary-phase transcription and indirectly block competence by disrupting comK’s positive autoregulatory loop; nucleoid structuring and RNAP-associated RNAs influence competence gene expression architecture (harms2024molecularmechanismsofa pages 51-54, harms2024molecularmechanismsof pages 51-54). URLs and dates: Harms, 2024; Bidnenko et al., PLoS Genet, 2023-02, https://doi.org/10.1371/journal.pgen.1010618.
3) Recent developments and latest research (priority 2023–2024)
- Quorum sensing policing via DegQ influences competence inputs: Work on ComQXPA signaling showed DegQ differentially tunes srfAA and aprE promoters, acting as a “policing” link to prevent overinvestment in costly processes; these srf outputs impact ComS and thereby ComK stability and competence propensity (harms2024molecularmechanismsofa pages 51-54). URL and date: Danevčič et al., Microbiol Spectrum, 2023-10, https://doi.org/10.1128/spectrum.00908-23.
- Stress–competence crosstalk via SigB-as-comK: 2024 synthesis indicates SigB-driven antisense transcription of as-comK potently represses competence under stress (e.g., light/ethanol), adding an RNA-based layer onto ComK regulation (harms2024molecularmechanismsof pages 75-78). URL and date: Harms, 2024.
- Transcription termination and competence: Stabilized Rho globally alters stationary-phase gene expression and blocks competence and sporulation, partly via lowering Spo0A and perturbing comK autoregulation (harms2024molecularmechanismsofa pages 51-54). URL and date: Bidnenko et al., PLoS Genet, 2023-02, https://doi.org/10.1371/journal.pgen.1010618.
- Proteostasis and Clp system updates: Contemporary reviews and analyses of Clp systems reaffirm MecA–ClpC/ClpP targeting of ComK and expand understanding of adaptor networks and proteolysis, highlighting broad proteostasis–competence coupling in Gram-positives (harms2024molecularmechanismsof pages 51-54). URL and date: Ishikawa et al., Genes to Cells, 2024-07, https://doi.org/10.1111/gtc.13141.
- RNAP-interacting RNAs and transcriptional heterogeneity: RIP-seq maps RNAs associated with RNAP and sigma factors in B. subtilis; single-cell transcriptomics reviews emphasize competence as a hallmark of phenotypic heterogeneity in bacterial populations, underscoring the importance of bimodal comK expression (harms2024molecularmechanismsofa pages 51-54). URLs and dates: Hausnerová et al., Nucleic Acids Res, 2024-02, https://doi.org/10.1093/nar/gkae081; Walls & Rosenthal, Transcription, 2024-03, https://doi.org/10.1080/21541264.2024.2334110.
4) Current applications and real-world implementations
- Super-competent laboratory strains via comK/comS induction: An inducible PmtlA-comKS cassette (strain REG19) yielded a 6.7-fold increase in transformation efficiency versus wild type; transformation peaked ~1.5 h after mannitol induction and saturated near ~100 ng plasmid DNA or ~3 µg chromosomal DNA per reaction, illustrating engineered control of competence in B. subtilis 168 (rahmer2015constructionofa pages 1-2). URL and date: Rahmer et al., Front Microbiol, 2015-12, https://doi.org/10.3389/fmicb.2015.01431.
- Process/strain engineering levers connected to comK: Manipulating QS (ComX/ComA→ComS), relieving repression (Rok, AbrB, CodY), tuning DegU/DegQ, or modulating Clp proteostasis (e.g., MecA or ClpC/P activity) are actionable to bias competence for transformation workflows in industrial and research settings (rahmer2015constructionofa pages 1-2, harms2024molecularmechanismsofa pages 51-54). URLs and dates: Rahmer 2015-12, https://doi.org/10.3389/fmicb.2015.01431; Harms 2024; Danevčič 2023-10, https://doi.org/10.1128/spectrum.00908-23.
- Constraints in environmental isolates and biofilms: Environmental B. subtilis strains often form robust biofilms that sharply reduce the frequency of competent cells due to cross-pathway repression (e.g., ComK directly repressing sinI; biofilm regulator SlrR repressing competence), impacting transformability in situ (boonstra2020analysesofcompetent pages 1-2). URL and date: She et al., mSystems, 2020-10, https://doi.org/10.1128/msystems.00539-20.
5) Expert opinions and analyses from authoritative sources
- Systems perspective on competence: Competence is a stochastic differentiation program shaped by positive feedback at comK, QS control over ComS, and proteolytic gating by MecA–ClpC/ClpP, ensuring only a fraction of cells engage in costly DNA uptake; single-cell and modeling work underpin this view (maier2020competenceandtransformation pages 1-3). URL and date: Maier, 2020-01, https://doi.org/10.21775/cimb.037.057.
- Integration with stress and global regulation: Reviews and syntheses emphasize that competence is deprioritized under acute stress via SigB and reshaped by global transcription termination (Rho), reflecting hierarchical resource allocation across differentiation programs (harms2024molecularmechanismsof pages 75-78, harms2024molecularmechanismsofa pages 51-54). URLs and dates: Harms, 2024; Bidnenko et al., 2023-02, https://doi.org/10.1371/journal.pgen.1010618.
6) Relevant statistics and quantitative data
- Fraction of competent cells: Approximately 5–10% of cells express comK and enter the K-state under standard competence-inducing conditions; depending on conditions/population structure, studies report 5% to as high as ~50% competent subpopulation ranges (berka2002microarrayanalysisof pages 1-2, boonstra2020analysesofcompetent pages 1-2). URLs and dates: Berka 2002-03, https://doi.org/10.1046/j.1365-2958.2002.02833.x; Boonstra et al., Environ Microbiol, 2020-04, https://doi.org/10.1111/1462-2920.15005.
- Timing of expression and sampling: Differential expression between competent and non-competent subpopulations was mapped at 5.5 h and 6.5 h after induction in B. subtilis 168; dozens to hundreds of genes showed altered expression at each timepoint, confirming ComK regulon activation and broader transcriptome remodeling (boonstra2020analysesofcompetent pages 1-2). URL and date: Boonstra et al., 2020-04, https://doi.org/10.1111/1462-2920.15005.
- Regulon breadth: At least 165 genes upregulated by ComK, with 23 clusters preceded by ComK-box motifs, including key DNA uptake and recombination genes; competent cells exhibit growth arrest (“K-state”) distinct from sporulation (berka2002microarrayanalysisof pages 1-2). URL and date: Berka 2002-03, https://doi.org/10.1046/j.1365-2958.2002.02833.x.
- Engineered super-competence performance: Mannitol-induced PmtlA-comKS increased transformation 6.7-fold, with peak at ~1.5 h induction and DNA saturation values as above, providing practical benchmarks for transformation optimization (rahmer2015constructionofa pages 1-2). URL and date: Rahmer 2015-12, https://doi.org/10.3389/fmicb.2015.01431.
7) Clarifications on family/domains and organism specificity
- Domain/family alignment: Literature on ComK in B. subtilis 168 consistently refers to the ComK competence transcription factor family and aligns with the UniProt PF06338/IPR010461 domain annotations, confirming we are analyzing the correct gene/protein in the specified organism (maier2020competenceandtransformation pages 3-5, berka2002microarrayanalysisof pages 1-2). URLs and dates: Maier 2020-01, https://doi.org/10.21775/cimb.037.057; Berka 2002-03, https://doi.org/10.1046/j.1365-2958.2002.02833.x.
Focused mechanistic summary
- ComK binds K-boxes as a tetramer (dimer-of-dimers) and directly activates promoters of late competence genes (e.g., comG) and PcomK, producing positive feedback and bistability (maier2020competenceandtransformation pages 3-5, berka2002microarrayanalysisof pages 1-2). URL and dates: Maier 2020-01, https://doi.org/10.21775/cimb.037.057; Berka 2002-03, https://doi.org/10.1046/j.1365-2958.2002.02833.x.
- Upstream signals flow through ComX/ComP/ComA to induce ComS (srf), which titrates MecA and reduces ComK degradation by ClpC/ClpP, enabling competence entry; repression by Rok, CodY, AbrB is relieved in specific states (e.g., Spo0A~P lowering AbrB), with DegU priming effects at low ComK; stress-activated SigB antisense as-comK shuts competence under acute stress (rahmer2015constructionofa pages 1-2, harms2024molecularmechanismsofa pages 51-54, harms2024molecularmechanismsof pages 75-78). URLs and dates: Rahmer 2015-12, https://doi.org/10.3389/fmicb.2015.01431; Harms 2024.
- The regulon encompasses DNA uptake (comCDEFG, comFA, comEA), DNA processing (nucA, recA), and additional genes revealed by high-resolution transcriptomics and proteomics, while competence imposes a growth-arrested physiological state (berka2002microarrayanalysisof pages 1-2, boonstra2020analysesofcompetent pages 1-2). URLs and dates: Berka 2002-03, https://doi.org/10.1046/j.1365-2958.2002.02833.x; Boonstra 2020-04, https://doi.org/10.1111/1462-2920.15005.
Limitations and open directions
- While foundational mechanisms of ComK and competence are established, newer layers involving antisense RNAs (SigB→as-comK), transcription termination (Rho), and proteostasis remodeling during stress continue to refine models of when and how competence is permitted or repressed in dynamic environments (harms2024molecularmechanismsof pages 75-78, harms2024molecularmechanismsofa pages 51-54, harms2024molecularmechanismsof pages 51-54). URLs and dates: Harms 2024; Bidnenko 2023-02, https://doi.org/10.1371/journal.pgen.1010618; Ishikawa 2024-07, https://doi.org/10.1111/gtc.13141.
References (URLs and publication dates)
- Berka RM et al. Microarray analysis of the Bacillus subtilis K-state… Mol Microbiol. 2002-03. https://doi.org/10.1046/j.1365-2958.2002.02833.x (berka2002microarrayanalysisof pages 1-2).
- Ogura M et al. Whole-genome analysis of genes regulated by ComK. J Bacteriol. 2002-05. https://doi.org/10.1128/jb.184.9.2344-2351.2002 (supports regulon scope and K-box context) (berka2002microarrayanalysisof pages 1-2).
- Maier B. Competence and Transformation in Bacillus subtilis. Current Issues Mol Biol. 2020-01. https://doi.org/10.21775/cimb.037.057 (K-box, tetramer model, network) (maier2020competenceandtransformation pages 3-5, maier2020competenceandtransformation pages 1-3).
- Boonstra M et al. Competent vs non-competent subpopulations… Environ Microbiol. 2020-04. https://doi.org/10.1111/1462-2920.15005 (quantitative subpopulation data, new players) (boonstra2020analysesofcompetent pages 1-2).
- She Q et al. Negative interplay between biofilm and competence. mSystems. 2020-10. https://doi.org/10.1128/msystems.00539-20 (biofilm-competence antagonism) (boonstra2020analysesofcompetent pages 1-2).
- Rahmer R et al. Super-competent B. subtilis 168 using PmtlA-comKS. Front Microbiol. 2015-12. https://doi.org/10.3389/fmicb.2015.01431 (applications; mechanistic summary) (rahmer2015constructionofa pages 1-2).
- Danevčič T et al. DegQ policing link for QS-regulated traits. Microbiol Spectrum. 2023-10. https://doi.org/10.1128/spectrum.00908-23 (QS–competence context) (harms2024molecularmechanismsofa pages 51-54).
- Bidnenko V et al. Rho-mediated reprogramming blocks competence/sporulation. PLoS Genet. 2023-02. https://doi.org/10.1371/journal.pgen.1010618 (global transcription effects) (harms2024molecularmechanismsofa pages 51-54).
- Hausnerová VV et al. RIP-seq RNAs interacting with RNAP. Nucleic Acids Res. 2024-02. https://doi.org/10.1093/nar/gkae081 (RNAP–RNA landscape) (harms2024molecularmechanismsofa pages 51-54).
- Walls AW & Rosenthal AZ. Bacterial phenotypic heterogeneity and scRNA-seq. Transcription. 2024-03. https://doi.org/10.1080/21541264.2024.2334110 (heterogeneity; competence context) (harms2024molecularmechanismsofa pages 51-54).
- Ishikawa F et al. ClpP/ClpC proteolysis review with MecA–ComK context. Genes to Cells. 2024-07. https://doi.org/10.1111/gtc.13141 (proteostasis link) (harms2024molecularmechanismsof pages 51-54).
- Harms M. Molecular mechanisms of Sig-dependent gene regulation in B. subtilis. 2024. (SigB→as-comK repression of competence; network synthesis) (harms2024molecularmechanismsof pages 75-78, harms2024molecularmechanismsofa pages 51-54, harms2024molecularmechanismsof pages 51-54).
References
(maier2020competenceandtransformation pages 3-5): Berenike Maier. Competence and transformation in bacillus subtilis. Current issues in molecular biology, 37:57-76, Jan 2020. URL: https://doi.org/10.21775/cimb.037.057, doi:10.21775/cimb.037.057. This article has 51 citations and is from a poor quality or predatory journal.
(berka2002microarrayanalysisof pages 1-2): Randy M. Berka, Jeanette Hahn, Mark Albano, Irena Draskovic, Marjan Persuh, Xianju Cui, Alan Sloma, William Widner, and David Dubnau. Microarray analysis of the bacillus subtilis k‐state: genome‐wide expression changes dependent on comk. Molecular Microbiology, 43:1331-1345, Mar 2002. URL: https://doi.org/10.1046/j.1365-2958.2002.02833.x, doi:10.1046/j.1365-2958.2002.02833.x. This article has 298 citations and is from a domain leading peer-reviewed journal.
(maier2020competenceandtransformation pages 1-3): Berenike Maier. Competence and transformation in bacillus subtilis. Current issues in molecular biology, 37:57-76, Jan 2020. URL: https://doi.org/10.21775/cimb.037.057, doi:10.21775/cimb.037.057. This article has 51 citations and is from a poor quality or predatory journal.
(harms2024molecularmechanismsofa pages 51-54): M Harms. Molecular mechanisms of sig-dependent gene regulation in bacillus subtilis. Unknown journal, 2024.
(harms2024molecularmechanismsof pages 51-54): M Harms. Molecular mechanisms of sig-dependent gene regulation in bacillus subtilis. Unknown journal, 2024.
(rahmer2015constructionofa pages 1-2): Regine Rahmer, Kambiz Morabbi Heravi, and Josef Altenbuchner. Construction of a super-competent bacillus subtilis 168 using the pmtla-comks inducible cassette. Frontiers in Microbiology, Dec 2015. URL: https://doi.org/10.3389/fmicb.2015.01431, doi:10.3389/fmicb.2015.01431. This article has 98 citations and is from a poor quality or predatory journal.
(harms2024molecularmechanismsof pages 75-78): M Harms. Molecular mechanisms of sig-dependent gene regulation in bacillus subtilis. Unknown journal, 2024.
(boonstra2020analysesofcompetent pages 1-2): Mirjam Boonstra, Marc Schaffer, Joana Sousa, Luiza Morawska, Siger Holsappel, Petra Hildebrandt, Praveen Kumar Sappa, Hermann Rath, Anne de Jong, Michael Lalk, Ulrike Mäder, Uwe Völker, and Oscar P. Kuipers. Analyses of competent and non‐competent subpopulations of bacillus subtilis reveal
Source: comK-deep-research-bioreason-rl.md
The BioReason functional summary reads:
A cytoplasmic competence regulator in Bacillus subtilis that assembles and stabilizes macromolecular complexes to activate the competence program. By using a competence-specific regulatory core, it binds partner proteins to orchestrate transcriptional switching of the competence regulon and coordinate downstream DNA uptake readiness. Its soluble architecture and regulatory role position it as a cytoplasmic hub that couples transcriptional control with the broader machinery of genetic competence.
The summary correctly identifies ComK as a cytoplasmic competence regulator and links it to genetic competence (GO:0009294 / GO:0030420). However, it fundamentally mischaracterizes the molecular function. The curated review establishes that ComK is a DNA-binding transcription activator (GO:0001216) that binds to specific K-box motifs (AAAA-N5-TTTT) as a tetramer (dimer-of-dimers) to activate transcription of late competence genes. BioReason instead describes ComK as a "macromolecular complex assembler" with "protein binding" (GO:0005515) as its primary molecular function, and the predicted GO terms list only protein binding and protein complex assembly terms -- completely missing DNA binding, transcription factor activity, or transcription regulation.
This is a significant error. The curated review proposes DNA-binding transcription activator activity as the core MF and positive regulation of establishment of competence for transformation (GO:0045809) as the core BP. BioReason's narrative vaguely alludes to "transcriptional switching" but assigns the wrong molecular function, treating ComK more like a scaffold than the sequence-specific DNA-binding transcription factor it is.
The summary also omits ComK's positive autoregulatory feedback loop, the bistable switch generating bimodal population behavior (5-10% K-state cells), and the MecA-ClpC-ClpP proteolytic control mechanism -- all central to understanding ComK biology.
Comparison with interpro2go:
The interpro2go annotations for comK are minimal. The only InterPro domain is IPR010461 (Competence protein ComK), which maps to generic protein binding and protein homooligomerization terms in GO. BioReason closely recapitulates these interpro2go-level predictions without adding meaningful insight. Both miss the DNA-binding transcription factor activity that is ComK's actual function. BioReason provides no advantage over interpro2go here.
The trace acknowledges the single-domain architecture and correctly infers a regulatory rather than enzymatic role. However, it defaults to "protein binding" as the molecular function, likely because the InterPro entry for the ComK family does not explicitly annotate DNA-binding activity. The trace does mention "nucleic-acid-associated transcriptional regulation" early on but does not carry this through to the GO term predictions.
id: P40396
gene_symbol: comK
product_type: PROTEIN
status: DRAFT
taxon:
id: NCBITaxon:224308
label: Bacillus subtilis (strain 168)
description: ComK is the master transcriptional regulator of genetic competence in
Bacillus subtilis, functioning as a DNA-binding transcription activator that controls
the expression of late competence genes required for DNA uptake, processing, and
recombination. ComK binds to specific AT-rich motifs called K-boxes (consensus AAAA-N5-TTTT)
in promoter regions as a tetramer (dimer-of-dimers) to activate transcription of
target genes including the comC, comE, comF, comG operons encoding DNA uptake machinery,
as well as recA and nucA involved in DNA processing. A critical feature of ComK
regulation is its positive autoregulation at its own promoter (PcomK), which creates
a bistable switch leading to bimodal population behavior where only 5-10% of cells
enter the competent K-state. ComK protein levels are tightly controlled through
post-translational degradation by the MecA-ClpC-ClpP protease system; the adaptor
protein MecA targets ComK to the ClpC-ClpP protease for degradation. This proteolytic
control is relieved by ComS, a small protein produced from the quorum-sensing regulated
srf operon, which competes with ComK for MecA binding and thereby stabilizes ComK.
ComK expression is also negatively regulated at the transcriptional level by multiple
repressors including AbrB, Rok, and CodY, with Spo0A-P relieving AbrB repression
during transition to stationary phase. The ComK regulon encompasses at least 165
genes, making it a central hub connecting quorum sensing, nutritional status, and
stress responses to the developmental decision to enter the competent state.
references:
- id: GO_REF:0000043
title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
findings: []
- id: GO_REF:0000120
title: Combined Automated Annotation using Multiple IEA Methods
findings: []
- id: file:BACSU/comK/comK-deep-research-falcon.md
title: Deep research report on comK competence transcription factor
findings:
- statement: ComK is a DNA-binding transcription activator that triggers the K-state
supporting_text: ComK is a DNA-binding transcription activator that triggers the
K-state (competent state) by upregulating late competence genes required for
DNA uptake, processing, and recombination; it also activates its own transcription,
forming a positive feedback loop (autoregulation)
- statement: ComK binds K-boxes as tetramer to activate transcription
supporting_text: "ComK binds promoter K-boxes, typically recognized as a symmetric\
\ AT-rich motif AAAA\u2013N5\u2013TTTT; ComK forms a dimer-of-dimers (tetramer)\
\ to bind K-box elements and activate transcription at competence promoters\
\ including comG and its own comK promoter"
- statement: ComK regulon encompasses at least 165 genes
supporting_text: Genome-scale analyses identified at least ~165 ComK-activated
genes, many in operons preceding identifiable ComK-box motifs, including DNA
uptake and recombination loci (e.g., comCDEFG, comFA, comEA, nucA, recA)
- statement: ComK protein stability is controlled by MecA-ClpC/ClpP proteolysis
supporting_text: MecA binds ComK and targets it to ClpC/ClpP for ATP-dependent
proteolysis; ComS, produced from the quorum-sensing srf operon, competes for
MecA and thereby protects/stabilizes ComK
- statement: Positive feedback creates bistability and bimodal competence
supporting_text: Positive feedback at PcomK creates a switch-like, bimodal population
in which only a subpopulation becomes competent (K-state)
- statement: Only 5-10% of cells enter the competent K-state
supporting_text: "Approximately 5\u201310% of cells express comK and enter the\
\ K-state under standard competence-inducing conditions"
existing_annotations:
- term:
id: GO:0003677
label: DNA binding
evidence_type: IEA
original_reference_id: GO_REF:0000043
review:
summary: ComK is definitively a DNA-binding protein. It binds to specific AT-rich
K-box motifs (AAAA-N5-TTTT) in promoter regions to activate transcription of
competence genes. However, the term GO:0003677 "DNA binding" is too general
- it does not capture the specific role of ComK as a sequence-specific DNA-binding
transcription activator.
action: MODIFY
reason: While ComK does bind DNA, the generic "DNA binding" term is insufficiently
informative. ComK is specifically a sequence-specific DNA-binding transcription
factor that binds K-box regulatory elements to activate gene expression. The
more specific term GO:0001216 "DNA-binding transcription activator activity"
more accurately captures ComK's molecular function as it includes both the DNA
binding and transcriptional activation aspects of its role.
proposed_replacement_terms:
- id: GO:0001216
label: DNA-binding transcription activator activity
supported_by:
- reference_id: file:BACSU/comK/comK-deep-research-falcon.md
supporting_text: ComK is a DNA-binding transcription activator that triggers
the K-state (competent state) by upregulating late competence genes required
for DNA uptake, processing, and recombination
- reference_id: file:BACSU/comK/comK-deep-research-falcon.md
supporting_text: "ComK binds promoter K-boxes, typically recognized as a symmetric\
\ AT-rich motif AAAA\u2013N5\u2013TTTT; ComK forms a dimer-of-dimers (tetramer)\
\ to bind K-box elements and activate transcription at competence promoters\
\ including comG and its own comK promoter"
- term:
id: GO:0006351
label: DNA-templated transcription
evidence_type: IEA
original_reference_id: GO_REF:0000043
review:
summary: ComK is involved in transcription as a transcription factor, but GO:0006351
"DNA-templated transcription" refers to the actual synthesis of RNA from a DNA
template (the catalytic process performed by RNA polymerase). ComK is a regulatory
transcription factor that modulates transcription but does not directly catalyze
RNA synthesis. A more appropriate annotation would be a regulatory process term.
action: MODIFY
reason: GO:0006351 is the core transcription process (RNA synthesis from DNA template),
which is carried out by RNA polymerase, not by transcription factors. ComK's
role is to regulate transcription by binding to promoter elements and recruiting/activating
the transcription machinery. The appropriate biological process term would be
GO:0045893 "positive regulation of DNA-templated transcription" which accurately
describes ComK's role in activating transcription of target genes.
proposed_replacement_terms:
- id: GO:0045893
label: positive regulation of DNA-templated transcription
supported_by:
- reference_id: file:BACSU/comK/comK-deep-research-falcon.md
supporting_text: Genome-scale analyses identified at least ~165 ComK-activated
genes, many in operons preceding identifiable ComK-box motifs, including DNA
uptake and recombination loci (e.g., comCDEFG, comFA, comEA, nucA, recA)
- reference_id: file:BACSU/comK/comK-deep-research-falcon.md
supporting_text: ComK is a DNA-binding transcription activator that triggers
the K-state (competent state) by upregulating late competence genes required
for DNA uptake, processing, and recombination
- term:
id: GO:0030420
label: establishment of competence for transformation
evidence_type: IEA
original_reference_id: GO_REF:0000120
review:
summary: This is the most appropriate biological process term for ComK. ComK is
the master regulator that directly controls the establishment of competence
for transformation in B. subtilis. ComK activates all the late competence genes
required for DNA uptake and processing.
action: ACCEPT
reason: ComK is definitively required for the establishment of genetic competence.
ComK directly activates genes encoding DNA uptake machinery (comC, comE, comF,
comG operons) and DNA processing enzymes (nucA, recA). The term accurately captures
the biological process in which ComK participates. The deep research confirms
ComK triggers the K-state (competent state) by upregulating late competence
genes.
supported_by:
- reference_id: file:BACSU/comK/comK-deep-research-falcon.md
supporting_text: ComK is a DNA-binding transcription activator that triggers
the K-state (competent state) by upregulating late competence genes required
for DNA uptake, processing, and recombination
- reference_id: file:BACSU/comK/comK-deep-research-falcon.md
supporting_text: Genome-scale analyses identified at least ~165 ComK-activated
genes, many in operons preceding identifiable ComK-box motifs, including DNA
uptake and recombination loci (e.g., comCDEFG, comFA, comEA, nucA, recA)
- term:
id: GO:0045809
label: positive regulation of establishment of competence for transformation
evidence_type: TAS
original_reference_id: file:BACSU/comK/comK-deep-research-falcon.md
review:
summary: ComK is the master positive regulator of competence establishment. It
activates transcription of all late competence genes and also positively regulates
its own expression through an autoregulatory feedback loop.
action: NEW
reason: This term should be added as it specifically captures ComK's regulatory
role in competence. ComK positively regulates the establishment of competence
by activating transcription of late competence genes. The positive autoregulation
at the comK promoter creates a bistable switch that controls entry into the
competent state. This term complements GO:0030420 by specifying the regulatory
relationship.
supported_by:
- reference_id: file:BACSU/comK/comK-deep-research-falcon.md
supporting_text: ComK is a DNA-binding transcription activator that triggers
the K-state (competent state) by upregulating late competence genes required
for DNA uptake, processing, and recombination; it also activates its own transcription,
forming a positive feedback loop (autoregulation)
- reference_id: file:BACSU/comK/comK-deep-research-falcon.md
supporting_text: Positive feedback at PcomK creates a switch-like, bimodal population
in which only a subpopulation becomes competent (K-state)
- term:
id: GO:0000976
label: transcription cis-regulatory region binding
evidence_type: TAS
original_reference_id: file:BACSU/comK/comK-deep-research-falcon.md
review:
summary: ComK binds to specific cis-regulatory regions (K-boxes) in promoters
of competence genes. The K-box is a well-characterized AT-rich motif (AAAA-N5-TTTT)
that ComK recognizes as a tetramer.
action: NEW
reason: This molecular function term captures the specific DNA-binding activity
of ComK at promoter regulatory regions. ComK binds K-box elements which are
cis-regulatory regions in promoters of competence genes. This term provides
more specific information about the nature of ComK's DNA binding activity compared
to the generic GO:0003677.
supported_by:
- reference_id: file:BACSU/comK/comK-deep-research-falcon.md
supporting_text: "ComK binds promoter K-boxes, typically recognized as a symmetric\
\ AT-rich motif AAAA\u2013N5\u2013TTTT; ComK forms a dimer-of-dimers (tetramer)\
\ to bind K-box elements and activate transcription at competence promoters\
\ including comG and its own comK promoter"
core_functions:
- molecular_function:
id: GO:0001216
label: DNA-binding transcription activator activity
description: Core molecular function - ComK binds K-box motifs as a tetramer to
activate transcription of late competence genes
directly_involved_in:
- id: GO:0045809
label: positive regulation of establishment of competence for transformation
- id: GO:0030420
label: establishment of competence for transformation
proposed_new_terms: []
suggested_questions:
- question: What is the precise molecular mechanism by which ComK tetramer binding
to K-boxes recruits RNA polymerase and activates transcription? While ComK binding
to K-boxes is established, the detailed mechanism of transcription activation
(interaction with sigma factors, RNAP recruitment) is less clear.
- question: Are there ComK-repressed genes beyond rok, and what is the mechanism of
repression? UniProt notes ComK may repress some genes, but the extent and mechanism
is unclear.
suggested_experiments:
- description: ChIP-seq of ComK in competent B. subtilis cells to comprehensively
map K-box binding sites genome-wide. This would provide definitive map of direct
ComK targets versus indirect effects.
- description: Structural studies of ComK tetramer bound to K-box DNA. This would
reveal molecular basis for K-box recognition and tetramer assembly.
tags:
- bacsu