rpoD

UniProt ID: Q72B50
Organism: Nitratidesulfovibrio vulgaris Hildenborough
Review Status: COMPLETE
πŸ“ Provide Detailed Feedback

Gene Description

RNA polymerase sigma factor SigA (RpoD) is the primary/housekeeping sigma-70 factor in Nitratidesulfovibrio vulgaris (formerly Desulfovibrio vulgaris) Hildenborough. Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. RpoD is the principal sigma factor during exponential growth, directing RNA polymerase to housekeeping gene promoters. The protein contains conserved sigma-70 domains: region 2 (recognizes -10 promoter element and promotes DNA melting), region 4 (contains helix-turn-helix motif that recognizes -35 promoter element), region 1.2 (contributes to promoter escape), and region 1.1 (modulates promoter complex formation). RpoD interacts transiently with the RNA polymerase catalytic core to form the holoenzyme and functions in the cytoplasm during transcription initiation.

Existing Annotations Review

GO Term Evidence Action Reason
GO:2000142 regulation of DNA-templated transcription initiation
IEA
GO_REF:0000108
KEEP AS NON CORE
Summary: This annotation is appropriate but represents an indirect consequence of sigma factor function rather than the core activity. Sigma factors directly participate in transcription initiation rather than regulating it. The term describes modulation of the frequency/rate of transcription initiation, whereas sigma factors are essential components of the transcription initiation machinery itself.
Reason: While RpoD does influence transcription initiation rates by directing RNAP to promoters, this is a downstream effect of its primary sigma factor activity (GO:0016987). The core function is sigma factor activity enabling promoter recognition and transcription initiation, not regulation of that process. This annotation captures a true aspect of sigma factor biology but at a less precise level than the more specific terms.
Supporting Evidence:
file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
Primary sigma factor (RpoD/SigA): The sigma-70-type primary sigma factor enables bacterial RNA polymerase (RNAP) to initiate transcription at housekeeping promoters under normal growth conditions.
GO:0003677 DNA binding
IEA
GO_REF:0000120
ACCEPT
Summary: This annotation is correct but very general. RpoD binds DNA specifically at promoter sequences through its sigma-70 domains. Region 2 recognizes the -10 element and region 4 recognizes the -35 element via a helix-turn-helix motif (residues 550-569 in Q72B50). However, sigma factors do not bind DNA independently - they only recognize promoter DNA when complexed with RNAP core as part of the holoenzyme.
Reason: DNA binding is a fundamental activity of sigma factors, although it occurs in the context of the holoenzyme complex. The UniProt entry documents a DNA_BIND region (H-T-H motif at residues 550-569) which directly supports this annotation. This is appropriately general for an IEA annotation.
Supporting Evidence:
UniProt:Q72B50
DNA_BIND 550..569 /note="H-T-H motif" /evidence="ECO:0000256|HAMAP-Rule:MF_00963"
file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
region 2 recognizes the -10 element and promotes DNA melting; region 4 contains a helix-turn-helix that recognizes the -35 element
GO:0003700 DNA-binding transcription factor activity
IEA
GO_REF:0000002
MODIFY
Summary: This annotation is INCORRECT for sigma factors. The GO term definition states that DNA-binding transcription factors modulate transcription via "selective and non-covalent binding to a specific double-stranded genomic DNA sequence within a cis-regulatory region." While sigma factors do recognize promoter elements, they do not function as transcription factors that independently bind DNA to regulate transcription. Sigma factors are integral components of the transcription machinery, not regulatory factors.
Reason: Sigma factors have a distinct molecular function (GO:0016987 sigma factor activity) that is fundamentally different from transcription factor activity. Sigma factors do not bind DNA independently - they only bind promoter DNA when part of the RNAP holoenzyme. The GO guidelines explicitly note that sigma factors should be annotated to sigma factor activity, not transcription factor activity. This InterPro-derived annotation is an over-generalization.
Proposed replacements: sigma factor activity
Supporting Evidence:
GO:0016987
Sigma factors act as the promoter specificity subunit of eubacterial and plant plastid multisubunit RNA polymerases... Although sigma does not bind DNA on its own, when combined with the core to form the holoenzyme, the sigma factor binds specifically to promoter elements.
file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
Encodes the primary sigma factor (SigA/RpoD) that binds RNAP core to form the holoenzyme and directs transcription initiation at housekeeping promoters
GO:0005737 cytoplasm
IEA
GO_REF:0000120
ACCEPT
Summary: This cellular component annotation is correct. RpoD functions in the bacterial cytoplasm as part of the RNAP holoenzyme, binding core RNAP to engage promoter DNA during transcription initiation. This is consistent with UniProt subcellular location annotation and structural studies.
Reason: Cytoplasmic localization is well-established for bacterial sigma factors. The UniProt entry explicitly states "SUBCELLULAR LOCATION: Cytoplasm" based on HAMAP rule MF_00963. The deep research confirms "Cellular localization: Cytoplasmic; functions within the RNAP holoenzyme engaging cytoplasmic DNA during transcription initiation."
Supporting Evidence:
UniProt:Q72B50
SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|HAMAP-Rule:MF_00963}
file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
Cellular localization: Cytoplasmic; functions within the RNAP holoenzyme engaging cytoplasmic DNA during transcription initiation
GO:0006352 DNA-templated transcription initiation
IEA
GO_REF:0000120
ACCEPT
Summary: This is an excellent and appropriate biological process annotation for sigma factors. The primary function of RpoD is to enable transcription initiation by directing RNAP to promoters. The GO term describes "The initial step of transcription, consisting of the assembly of the RNA polymerase preinitiation complex (PIC) at a gene promoter, as well as the formation of the first few bonds of the RNA transcript."
Reason: This is a core biological process annotation that accurately captures the essential role of sigma factors in transcription. RpoD directly participates in transcription initiation by forming the holoenzyme with RNAP core and recognizing promoter elements, which is precisely what this term describes. UniProt function annotation states sigma factors "promote the attachment of RNA polymerase to specific initiation sites."
Supporting Evidence:
UniProt:Q72B50
FUNCTION: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released.
file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
Primary sigma factor (RpoD/SigA): The sigma-70-type primary sigma factor enables bacterial RNA polymerase (RNAP) to initiate transcription at housekeeping promoters under normal growth conditions.
GO:0006355 regulation of DNA-templated transcription
IEA
GO_REF:0000120
KEEP AS NON CORE
Summary: This annotation is redundant and less specific than GO:0006352 (DNA-templated transcription initiation). While sigma factors do influence transcription, they are more precisely involved in the initiation process itself rather than regulation of transcription broadly. This term is an ancestor of more specific transcription-related terms.
Reason: This is a broader parent term that is technically true but not the most informative annotation for sigma factors. The more specific GO:0006352 (DNA-templated transcription initiation) better captures the precise role of sigma factors. This annotation is acceptable for completeness but represents an over-generalization of sigma factor function.
Supporting Evidence:
file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
Central to transcription initiation of core cellular processes (housekeeping gene expression); thereby indirectly influences growth, metabolism, and stress-responsive regulons by setting basal transcriptional regimes
GO:0010468 regulation of gene expression
IEA
GO_REF:0000104
KEEP AS NON CORE
Summary: This is a very high-level annotation that is technically correct but not very informative. All sigma factors influence gene expression by directing RNAP to specific promoters, but this term is too general to be useful for understanding sigma factor function.
Reason: While sigma factors do affect gene expression by determining which promoters are recognized by RNAP, this annotation is at too high a level of abstraction to be informative. More specific terms (GO:0006352, GO:0016987) better capture the actual molecular role of sigma factors. This annotation adds little value beyond the more specific terms.
Supporting Evidence:
file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
directs transcription initiation at housekeeping promoters during normal growth
GO:0016987 sigma factor activity
IEA
GO_REF:0000120
ACCEPT
Summary: This is the most precise and appropriate molecular function annotation for RpoD. Sigma factor activity is defined as the promoter specificity subunit of bacterial RNA polymerases that, when combined with the core enzyme to form the holoenzyme, binds specifically to promoter elements. This exactly describes the function of RpoD.
Reason: This annotation is the core molecular function of RpoD. The GO term definition states "Sigma factors act as the promoter specificity subunit of eubacterial and plant plastid multisubunit RNA polymerases... Although sigma does not bind DNA on its own, when combined with the core to form the holoenzyme, the sigma factor binds specifically to promoter elements." This precisely matches the known function of RpoD as the primary sigma factor.
Supporting Evidence:
UniProt:Q72B50
FUNCTION: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is the primary sigma factor during exponential growth.
UniProt:Q72B50
SIMILARITY: Belongs to the sigma-70 factor family. RpoD/SigA subfamily.
file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
Encodes the primary sigma factor (SigA/RpoD) that binds RNAP core to form the holoenzyme and directs transcription initiation at housekeeping promoters during normal growth
GO:0005515 protein binding
IPI
PMID:26873250
Bacterial Interactomes: Interacting Protein Partners Share S...
MODIFY
Summary: This annotation is based on high-throughput AP-MS (affinity purification-mass spectrometry) data from the Butland et al. D. vulgaris interactome study. The UniProt entry shows RpoD interacts with DVU_1368 (Q72CB5) with 2 experiments in IntAct. However, "protein binding" is an uninformative term that should be replaced with a more specific binding term if possible.
Reason: While the protein-protein interaction is experimentally supported, "protein binding" (GO:0005515) is explicitly discouraged in GO annotation guidelines as uninformative. For sigma factors, the known protein interactions are with RNAP core subunits (RpoA, RpoB, RpoC). The UniProt entry notes "Interacts transiently with the RNA polymerase catalytic core" and shows a motif for "Interaction with polymerase core subunit RpoC" at residues 380-383. A more appropriate term would be RNA polymerase binding or a related term.
Supporting Evidence:
UniProt:Q72B50
SUBUNIT: Interacts transiently with the RNA polymerase catalytic core.
UniProt:Q72B50
MOTIF 380..383 /note="Interaction with polymerase core subunit RpoC" /evidence="ECO:0000256|HAMAP-Rule:MF_00963"
PMID:26873250
Numerous affinity purification-mass spectrometry (AP-MS) and yeast two-hybrid screens have each defined thousands of pairwise protein-protein interactions (PPIs)... We have identified 459 high confidence PPIs from D. vulgaris

Core Functions

RpoD (SigA) is the primary housekeeping sigma factor in N. vulgaris, belonging to the sigma-70 factor family RpoD/SigA subfamily. It contains all conserved sigma-70 domains (regions 1.1, 1.2, 2, 3, and 4) with documented helix-turn-helix DNA binding motif (residues 550-569). UniProt annotation and HAMAP rule MF_00963 confirm sigma factor function. The protein promotes RNAP attachment to specific promoter initiation sites and is the primary sigma factor during exponential growth.

Molecular Function:
sigma factor activity
Cellular Locations:
Supporting Evidence:
  • UniProt:Q72B50
    FUNCTION: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is the primary sigma factor during exponential growth.
  • file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
    Encodes the primary sigma factor (SigA/RpoD) that binds RNAP core to form the holoenzyme and directs transcription initiation at housekeeping promoters during normal growth

References

Gene Ontology annotation through association of InterPro records with GO terms
Electronic Gene Ontology annotations created by transferring manual GO annotations between related proteins based on shared sequence features
Automatic assignment of GO terms using logical inference, based on inter-ontology links
Combined Automated Annotation using Multiple IEA Methods
Bacterial Interactomes: Interacting Protein Partners Share Similar Function and Are Validated in Independent Assays More Frequently Than Previously Reported.
  • High-throughput AP-MS study identifying protein-protein interactions in D. vulgaris
    "Here, we present an AP-MS survey of the bacterium Desulfovibrio vulgaris together with a critical reanalysis of nine published bacterial yeast two-hybrid and AP-MS screens"
  • Identified 459 high confidence PPIs from D. vulgaris
    "We have identified 459 high confidence PPIs from D. vulgaris and 391 from Escherichia coli"
The genome sequence of the anaerobic, sulfate-reducing bacterium Desulfovibrio vulgaris Hildenborough
  • Genome sequence establishing DVU_1788 locus for rpoD
    "Desulfovibrio vulgaris Hildenborough is a model organism for studying the energy metabolism of sulfate-reducing bacteria"
DOI:10.1186/s12864-024-10415-x
The modular architecture of sigma factors in cyanobacteria
  • Confirms essential domain set for primary sigma-70 factors (r1_2, r2, r3, r4)
  • Documents conserved roles in promoter recognition
  • Organism-specific mapping of rpoD to DVU_1788 in D. vulgaris
DOI:10.3390/microorganisms11041077
Structural analysis of Bacillus subtilis sigma factors
  • Structural insights into sigma-70 family domain architecture
  • Confirms sigma-2 and sigma-4 as primary DNA-recognition modules

Suggested Questions for Experts

Q: What are the specific promoter sequence motifs recognized by RpoD in N. vulgaris?

Q: Are there alternative sigma factors in N. vulgaris that compete with RpoD under stress conditions?

Q: What is the identity and function of DVU_1368 (Q72CB5) that interacts with RpoD?

Suggested Experiments

Experiment: ChIP-seq analysis to map RpoD binding sites genome-wide and identify the RpoD regulon

Hypothesis: RpoD binds to canonical -10/-35 promoter elements in N. vulgaris housekeeping genes

Type: ChIP-seq

Experiment: In vitro transcription assays with purified components to confirm promoter recognition specificity

Hypothesis: Purified RpoD enables RNAP to initiate transcription at predicted housekeeping promoters

Type: In vitro transcription assay

Experiment: Protein interaction studies to characterize the RpoD-RNAP core interaction interface

Hypothesis: RpoD interacts with RNAP core via the conserved sigma-70 region 2 and region 4 domains

Type: Co-immunoprecipitation or crosslinking mass spectrometry

πŸ“š Additional Documentation

Deep Research Falcon

(Q72B50-deep-research-falcon.md)

provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2026-01-15T10:26:29.384461'
end_time: '2026-01-15T10:30:37.395119'
duration_seconds: 248.01
template_file: templates/gene_research_go_focused.md
template_variables:
organism: DESVH
gene_id: Q72B50
gene_symbol: rpoD
uniprot_accession: Q72B50
protein_description: 'RecName: Full=RNA polymerase sigma factor SigA {ECO:0000256|HAMAP-Rule:MF_00963};'
gene_info: Name=rpoD {ECO:0000313|EMBL:AAS96265.1}; Synonyms=sigA {ECO:0000256|HAMAP-Rule:MF_00963};
OrderedLocusNames=DVU_1788 {ECO:0000313|EMBL:AAS96265.1};
organism_full: Nitratidesulfovibrio vulgaris (strain ATCC 29579 / DSM 644 / CCUG
34227 / NCIMB 8303 / VKM B-1760 / Hildenborough) (Desulfovibrio vulgaris).
protein_family: Belongs to the sigma-70 factor family. RpoD/SigA subfamily.
protein_domains: RNA_pol_sigma-70_dom. (IPR014284); RNA_pol_sigma70. (IPR000943);
RNA_pol_sigma70_r1_2. (IPR009042); RNA_pol_sigma70_r2. (IPR007627); RNA_pol_sigma70_r3.
(IPR007624)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 12


Question

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: Q72B50
  • Protein Description: RecName: Full=RNA polymerase sigma factor SigA {ECO:0000256|HAMAP-Rule:MF_00963};
  • Gene Information: Name=rpoD {ECO:0000313|EMBL:AAS96265.1}; Synonyms=sigA {ECO:0000256|HAMAP-Rule:MF_00963}; OrderedLocusNames=DVU_1788 {ECO:0000313|EMBL:AAS96265.1};
  • Organism (full): Nitratidesulfovibrio vulgaris (strain ATCC 29579 / DSM 644 / CCUG 34227 / NCIMB 8303 / VKM B-1760 / Hildenborough) (Desulfovibrio vulgaris).
  • Protein Family: Belongs to the sigma-70 factor family. RpoD/SigA subfamily.
  • Key Domains: RNA_pol_sigma-70_dom. (IPR014284); RNA_pol_sigma70. (IPR000943); RNA_pol_sigma70_r1_2. (IPR009042); RNA_pol_sigma70_r2. (IPR007627); RNA_pol_sigma70_r3. (IPR007624)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "rpoD" matches the protein description above
  2. Verify the organism is correct: Nitratidesulfovibrio vulgaris (strain ATCC 29579 / DSM 644 / CCUG 34227 / NCIMB 8303 / VKM B-1760 / Hildenborough) (Desulfovibrio vulgaris).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'rpoD' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene rpoD (gene ID: Q72B50, UniProt: Q72B50) in DESVH.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output

Question: You are an expert researcher providing comprehensive, well-cited information.

Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies

Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: Q72B50
  • Protein Description: RecName: Full=RNA polymerase sigma factor SigA {ECO:0000256|HAMAP-Rule:MF_00963};
  • Gene Information: Name=rpoD {ECO:0000313|EMBL:AAS96265.1}; Synonyms=sigA {ECO:0000256|HAMAP-Rule:MF_00963}; OrderedLocusNames=DVU_1788 {ECO:0000313|EMBL:AAS96265.1};
  • Organism (full): Nitratidesulfovibrio vulgaris (strain ATCC 29579 / DSM 644 / CCUG 34227 / NCIMB 8303 / VKM B-1760 / Hildenborough) (Desulfovibrio vulgaris).
  • Protein Family: Belongs to the sigma-70 factor family. RpoD/SigA subfamily.
  • Key Domains: RNA_pol_sigma-70_dom. (IPR014284); RNA_pol_sigma70. (IPR000943); RNA_pol_sigma70_r1_2. (IPR009042); RNA_pol_sigma70_r2. (IPR007627); RNA_pol_sigma70_r3. (IPR007624)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "rpoD" matches the protein description above
  2. Verify the organism is correct: Nitratidesulfovibrio vulgaris (strain ATCC 29579 / DSM 644 / CCUG 34227 / NCIMB 8303 / VKM B-1760 / Hildenborough) (Desulfovibrio vulgaris).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'rpoD' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene rpoD (gene ID: Q72B50, UniProt: Q72B50) in DESVH.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Plan and verification
- Identity check: The target is rpoD (DVU_1788) encoding the primary sigma factor (SigA/RpoD) in Desulfovibrio vulgaris Hildenborough; organism-specific mapping of rpoD to locus DVU_1788 is reported in primary literature (D. vulgaris Hildenborough) (gevin2024themodulararchitecture pages 17-17). Family and domains: housekeeping Οƒ70 (RpoD/SigA) with conserved regions 1.2, 2, 3, and 4 characteristic of the RpoD/SigA subfamily; domains 2 and 4 recognize the βˆ’10 and βˆ’35 promoter elements, respectively, consistent with sigma-70 family annotations (gevin2024themodulararchitecture pages 1-2, gevin2024themodulararchitecture pages 2-4). No conflicting usage of the symbol rpoD in a different organism is used as a surrogate here; all subsequent functional inferences are restricted to Οƒ70 biology and the D. vulgaris mapping cited above (gevin2024themodulararchitecture pages 17-17).

Source (authors, title, citation) Year Focus / Organism Key relevance to RpoD/SigA (function/domains/promoter recognition/organism mapping) URL / DOI Publication month
Rajeev et al., "Regulation of Nitrite Stress Response in Desulfovibrio vulgaris Hildenborough" (rajeev2015) (gevin2024themodulararchitecture pages 17-17) 2015 Desulfovibrio vulgaris Hildenborough Reports rpoD (DVU1788) in D. vulgaris Hildenborough; supports organism-specific gene mapping and use of rpoD as a reference/housekeeping sigma factor. https://doi.org/10.1128/jb.00319-15 Nov
Collins et al., "Structural Analysis of Bacillus subtilis Sigma Factors" (collins2023) (collins2023structuralanalysisof pages 11-12, collins2023structuralanalysisof pages 9-11, collins2023structuralanalysisof pages 7-9) 2023 Bacillus subtilis (structural review) Provides experimental and AlphaFold-based structural insights into sigma-70 family domains (region 1.1, 2, 3, 4), DNA-binding (–10/–35) interactions, and methodological context for interpreting RpoD/SigA structure–function. https://doi.org/10.3390/microorganisms11041077 Apr
Gevin et al., "The modular architecture of sigma factors in cyanobacteria" (gevin2024) (gevin2024themodulararchitecture pages 17-17, gevin2024themodulararchitecture pages 2-4, gevin2024themodulararchitecture pages 1-2) 2024 Cyanobacteria (comparative genomics) Large-scale classification of sigma-70 homologs; identifies essential domains (r1_2, r2, r3, r4), reports modular patterns and evolutionary scenarios relevant for annotating primary sigma factors like RpoD/SigA. https://doi.org/10.1186/s12864-024-10415-x May

Table: Compact reference table of primary sources used to annotate Desulfovibrio vulgaris rpoD (DVU_1788) and general sigma‑70 (RpoD/SigA) domain–function relationships, with context IDs for cited evidence.

Comprehensive research report
1. Key concepts and definitions
- Primary sigma factor (RpoD/SigA): The Οƒ70-type primary sigma factor enables bacterial RNA polymerase (RNAP) to initiate transcription at housekeeping promoters under normal growth conditions. It recognizes specific promoter elements and recruits/positions RNAP for open complex formation and start-site selection (gevin2024themodulararchitecture pages 1-2). In D. vulgaris Hildenborough, rpoD is mapped to locus DVU_1788 (gevin2024themodulararchitecture pages 17-17).
- Conserved domains and their roles: Canonical Οƒ70 proteins contain conserved regions that enable RNAP binding and promoter recognition: region 2 recognizes the βˆ’10 element and promotes DNA melting; region 4 contains a helix–turn–helix that recognizes the βˆ’35 element; region 1.2 contributes to promoter escape and DNA unwinding downstream of the start site; region 1.1 modulates promoter complex formation (where present) (gevin2024themodulararchitecture pages 1-2, gevin2024themodulararchitecture pages 17-17). Structural studies and reviews further establish Οƒ2 and Οƒ4 as the primary DNA-recognition modules and highlight how these domains are organized in the sigma-70 fold (collins2023structuralanalysisof pages 7-9, collins2023structuralanalysisof pages 9-11).
- Cellular localization: RpoD functions in the bacterial cytoplasm as part of the RNAP holoenzyme, binding core RNAP to form Eσ70 and engaging promoter DNA during transcription initiation (collins2023structuralanalysisof pages 9-11).

  1. Recent developments and latest research (emphasis 2023–2024)
  2. Structural insights and modeling: Recent analyses combining experimental structures with AlphaFold models clarify full-length and domain-level architectures of sigma factors. For Bacillus sigma factors (extrapolatable to Οƒ70 family), NMR and crystallography of Οƒ1.1, Οƒ2, and Οƒ4 domains and cryo-EM complexes underscore reliable modeling of structured cores, confirm Οƒ4–DNA (βˆ’35) interactions, and contextualize dynamic loops; AlphaFold is generally accurate for core helices but may overbuild in flexible regions (Apr 2023; URL: https://doi.org/10.3390/microorganisms11041077) (collins2023structuralanalysisof pages 9-11, collins2023structuralanalysisof pages 7-9, collins2023structuralanalysisof pages 11-12).
  3. Modular architecture and evolution of Οƒ70: A 2024 comparative genomic analysis across cyanobacteria (a broad bacterial reference for Οƒ70 diversity) proposes a robust modular classification of Οƒ70 proteins and catalogs extensive diversity: 4,193 Οƒ70 homologs, 59 modular patterns, with essential domains including r1_2, r2, r3, r4. The study reinforces the conserved functional roles of r2 (βˆ’10 recognition/melting) and r4 (βˆ’35 recognition) and emphasizes accessory domains that may tune regulatory interactions (May 2024; URL: https://doi.org/10.1186/s12864-024-10415-x) (gevin2024themodulararchitecture pages 1-2, gevin2024themodulararchitecture pages 2-4, gevin2024themodulararchitecture pages 17-17).
  4. Mechanistic updates: Contemporary reviews and structural summaries highlighted in 2023 work synthesize how Οƒ70 domains coordinate promoter binding, DNA melting, and interactions with transcription activators. They point to cryo-EM as pivotal for visualizing functional complexes and to computational advances for building testable structural hypotheses for primary sigma factors such as RpoD/SigA (collins2023structuralanalysisof pages 9-11, collins2023structuralanalysisof pages 7-9, collins2023structuralanalysisof pages 11-12).

  5. Current applications and real-world implementations

  6. Housekeeping reference and regulon anchor: In D. vulgaris Hildenborough, rpoD (DVU_1788) features in systems-level studies as a core transcriptional component; organism-specific work on stress responses lists rpoD (DVU_1788) together with essential housekeeping genes, reflecting its central role as the primary sigma factor (Nov 2015; URL: https://doi.org/10.1128/jb.00319-15) (gevin2024themodulararchitecture pages 17-17).
  7. Structure-guided annotation and modeling: Recent structural and AlphaFold-based analyses enable improved annotation of Οƒ70 domains in diverse bacteria and guide hypotheses about promoter recognition and regulatory interfaces for primary sigma factors like RpoD/SigA, which can be leveraged in synthetic biology and systems biology to interpret promoter usage and transcriptional program structure (Apr 2023; URL: https://doi.org/10.3390/microorganisms11041077; May 2024; URL: https://doi.org/10.1186/s12864-024-10415-x) (collins2023structuralanalysisof pages 9-11, collins2023structuralanalysisof pages 7-9, gevin2024themodulararchitecture pages 1-2, gevin2024themodulararchitecture pages 2-4).

  8. Expert opinions and analysis from authoritative sources

  9. Structural reviewers emphasize that Οƒ70 domains are partially disordered outside core helices and that partner-bound structures (RNAP, DNA, regulators) are essential for accurate mechanistic interpretation; AlphaFold is a strong complement but must be validated, especially for loops and flexible linkers (collins2023structuralanalysisof pages 9-11, collins2023structuralanalysisof pages 7-9, collins2023structuralanalysisof pages 11-12).
  10. Comparative genomics experts propose that the conserved domain core of Οƒ70 underlies universal promoter recognition logic, while accessory domains and modular rearrangements contribute to lineage-specific regulation; this supports confident transfer of functional annotation to uncharacterized primary sigma factors like D. vulgaris RpoD (gevin2024themodulararchitecture pages 1-2, gevin2024themodulararchitecture pages 2-4).

  11. Relevant statistics and data from recent studies

  12. Sigma-70 modular diversity: 4,193 homologs examined across 361 genomes; 59 modular domain organizations; essential domains include r1_2, r2, r3, r4; accessory domains such as DUF6596 are frequent in some lineages (May 2024; URL: https://doi.org/10.1186/s12864-024-10415-x) (gevin2024themodulararchitecture pages 2-4, gevin2024themodulararchitecture pages 1-2).
  13. Structural concordance metrics: Cross-structure overlays of Οƒ2 domains show tight alignment (e.g., RMSD ~0.99 over 72 residues in representative comparisons), supporting conserved architecture of DNA-recognition cores (Apr 2023; URL: https://doi.org/10.3390/microorganisms11041077) (collins2023structuralanalysisof pages 7-9).

Functional annotation for rpoD (DVU_1788) in D. vulgaris Hildenborough
- Primary function: Encodes the primary sigma factor (SigA/RpoD) that binds RNAP core to form the holoenzyme and directs transcription initiation at housekeeping promoters during normal growth (gevin2024themodulararchitecture pages 1-2, gevin2024themodulararchitecture pages 17-17).
- Substrate/interaction specificity: Not an enzyme; its specificity is for promoter DNA elements: Οƒ region 2 recognizes the βˆ’10 element and facilitates promoter melting; Οƒ region 4 recognizes the βˆ’35 element via an HTH motif; region 1.2 contributes to promoter escape; region 1.1 modulates promoter complex formation (where present) (gevin2024themodulararchitecture pages 1-2, collins2023structuralanalysisof pages 7-9, collins2023structuralanalysisof pages 9-11).
- Biological processes and pathways: Central to transcription initiation of core cellular processes (housekeeping gene expression); thereby indirectly influences growth, metabolism, and stress-responsive regulons by setting basal transcriptional regimes (gevin2024themodulararchitecture pages 1-2, gevin2024themodulararchitecture pages 17-17).
- Cellular localization: Cytoplasmic; functions within the RNAP holoenzyme engaging cytoplasmic DNA during transcription initiation (collins2023structuralanalysisof pages 9-11).
- Domain architecture support: Recent comparative work affirms the essential domain set for primary Οƒ70 factors (r1_2, r2, r3, r4) and their conserved roles in promoter recognition and initiation across bacteria, supporting inference for DVU_1788 (May 2024; URL: https://doi.org/10.1186/s12864-024-10415-x) (gevin2024themodulararchitecture pages 1-2, gevin2024themodulararchitecture pages 2-4).

Organism-specific note and limitations
- Desulfovibrio-specific promoter motifs and a full RpoD regulon have not been directly delineated in the cited recent literature. However, the organism-specific locus mapping of rpoD to DVU_1788 is documented (Nov 2015; URL: https://doi.org/10.1128/jb.00319-15). Therefore, functional details are inferred from conserved Οƒ70 mechanisms observed broadly in bacteria and recent structural/genomic syntheses (2023–2024) (gevin2024themodulararchitecture pages 17-17, collins2023structuralanalysisof pages 9-11, gevin2024themodulararchitecture pages 1-2).

Conclusion
The D. vulgaris Hildenborough rpoD gene (DVU_1788) encodes the primary Οƒ70 family sigma factor (SigA/RpoD) responsible for directing RNAP to housekeeping promoters. Its conserved domains (r2 and r4) recognize the βˆ’10 and βˆ’35 elements, with additional roles for r1.2 (escape) and r1.1 (initiation modulation), consistent with modern structural and comparative analyses. Recent advances in structural biology and large-scale modular classification reinforce these assignments and support confident functional annotation, while highlighting the need for organism-specific promoter and regulon mapping to refine DVU_1788’s detailed regulatory scope (gevin2024themodulararchitecture pages 17-17, gevin2024themodulararchitecture pages 1-2, collins2023structuralanalysisof pages 9-11, collins2023structuralanalysisof pages 7-9, gevin2024themodulararchitecture pages 2-4).

References

  1. (gevin2024themodulararchitecture pages 17-17): Marine Gevin, Amel Latifi, and Emmanuel Talla. The modular architecture of sigma factors in cyanobacteria: a framework to assess their diversity and understand their evolution. BMC Genomics, May 2024. URL: https://doi.org/10.1186/s12864-024-10415-x, doi:10.1186/s12864-024-10415-x. This article has 1 citations and is from a peer-reviewed journal.

  2. (gevin2024themodulararchitecture pages 1-2): Marine Gevin, Amel Latifi, and Emmanuel Talla. The modular architecture of sigma factors in cyanobacteria: a framework to assess their diversity and understand their evolution. BMC Genomics, May 2024. URL: https://doi.org/10.1186/s12864-024-10415-x, doi:10.1186/s12864-024-10415-x. This article has 1 citations and is from a peer-reviewed journal.

  3. (gevin2024themodulararchitecture pages 2-4): Marine Gevin, Amel Latifi, and Emmanuel Talla. The modular architecture of sigma factors in cyanobacteria: a framework to assess their diversity and understand their evolution. BMC Genomics, May 2024. URL: https://doi.org/10.1186/s12864-024-10415-x, doi:10.1186/s12864-024-10415-x. This article has 1 citations and is from a peer-reviewed journal.

  4. (collins2023structuralanalysisof pages 11-12): Katherine M. Collins, Nicola J. Evans, James H. Torpey, Jonathon M. Harris, Bethany A. Haynes, Amy H. Camp, and Rivka L. Isaacson. Structural analysis of bacillus subtilis sigma factors. Microorganisms, 11:1077, Apr 2023. URL: https://doi.org/10.3390/microorganisms11041077, doi:10.3390/microorganisms11041077. This article has 8 citations and is from a poor quality or predatory journal.

  5. (collins2023structuralanalysisof pages 9-11): Katherine M. Collins, Nicola J. Evans, James H. Torpey, Jonathon M. Harris, Bethany A. Haynes, Amy H. Camp, and Rivka L. Isaacson. Structural analysis of bacillus subtilis sigma factors. Microorganisms, 11:1077, Apr 2023. URL: https://doi.org/10.3390/microorganisms11041077, doi:10.3390/microorganisms11041077. This article has 8 citations and is from a poor quality or predatory journal.

  6. (collins2023structuralanalysisof pages 7-9): Katherine M. Collins, Nicola J. Evans, James H. Torpey, Jonathon M. Harris, Bethany A. Haynes, Amy H. Camp, and Rivka L. Isaacson. Structural analysis of bacillus subtilis sigma factors. Microorganisms, 11:1077, Apr 2023. URL: https://doi.org/10.3390/microorganisms11041077, doi:10.3390/microorganisms11041077. This article has 8 citations and is from a poor quality or predatory journal.

Citations

  1. gevin2024themodulararchitecture pages 17-17
  2. gevin2024themodulararchitecture pages 1-2
  3. collins2023structuralanalysisof pages 9-11
  4. collins2023structuralanalysisof pages 7-9
  5. gevin2024themodulararchitecture pages 2-4
  6. collins2023structuralanalysisof pages 11-12
  7. https://doi.org/10.1128/jb.00319-15
  8. https://doi.org/10.3390/microorganisms11041077
  9. https://doi.org/10.1186/s12864-024-10415-x
  10. https://doi.org/10.3390/microorganisms11041077;
  11. https://doi.org/10.1186/s12864-024-10415-x,
  12. https://doi.org/10.3390/microorganisms11041077,

πŸ“„ View Raw YAML

id: Q72B50
gene_symbol: rpoD
product_type: PROTEIN
status: COMPLETE
taxon:
  id: NCBITaxon:882
  label: Nitratidesulfovibrio vulgaris Hildenborough
description: >
  RNA polymerase sigma factor SigA (RpoD) is the primary/housekeeping sigma-70 factor in
  Nitratidesulfovibrio vulgaris (formerly Desulfovibrio vulgaris) Hildenborough. Sigma factors
  are initiation factors that promote the attachment of RNA polymerase to specific initiation
  sites and are then released. RpoD is the principal sigma factor during exponential growth,
  directing RNA polymerase to housekeeping gene promoters. The protein contains conserved
  sigma-70 domains: region 2 (recognizes -10 promoter element and promotes DNA melting),
  region 4 (contains helix-turn-helix motif that recognizes -35 promoter element), region 1.2
  (contributes to promoter escape), and region 1.1 (modulates promoter complex formation).
  RpoD interacts transiently with the RNA polymerase catalytic core to form the holoenzyme
  and functions in the cytoplasm during transcription initiation.
existing_annotations:
- term:
    id: GO:2000142
    label: regulation of DNA-templated transcription initiation
  evidence_type: IEA
  original_reference_id: GO_REF:0000108
  review:
    summary: >
      This annotation is appropriate but represents an indirect consequence of sigma factor
      function rather than the core activity. Sigma factors directly participate in transcription
      initiation rather than regulating it. The term describes modulation of the frequency/rate
      of transcription initiation, whereas sigma factors are essential components of the
      transcription initiation machinery itself.
    action: KEEP_AS_NON_CORE
    reason: >
      While RpoD does influence transcription initiation rates by directing RNAP to promoters,
      this is a downstream effect of its primary sigma factor activity (GO:0016987). The core
      function is sigma factor activity enabling promoter recognition and transcription initiation,
      not regulation of that process. This annotation captures a true aspect of sigma factor
      biology but at a less precise level than the more specific terms.
    supported_by:
      - reference_id: file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
        supporting_text: "Primary sigma factor (RpoD/SigA): The sigma-70-type primary sigma factor enables bacterial RNA polymerase (RNAP) to initiate transcription at housekeeping promoters under normal growth conditions."

- term:
    id: GO:0003677
    label: DNA binding
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >
      This annotation is correct but very general. RpoD binds DNA specifically at promoter
      sequences through its sigma-70 domains. Region 2 recognizes the -10 element and region 4
      recognizes the -35 element via a helix-turn-helix motif (residues 550-569 in Q72B50).
      However, sigma factors do not bind DNA independently - they only recognize promoter DNA
      when complexed with RNAP core as part of the holoenzyme.
    action: ACCEPT
    reason: >
      DNA binding is a fundamental activity of sigma factors, although it occurs in the context
      of the holoenzyme complex. The UniProt entry documents a DNA_BIND region (H-T-H motif at
      residues 550-569) which directly supports this annotation. This is appropriately general
      for an IEA annotation.
    supported_by:
      - reference_id: UniProt:Q72B50
        supporting_text: "DNA_BIND 550..569 /note=\"H-T-H motif\" /evidence=\"ECO:0000256|HAMAP-Rule:MF_00963\""
      - reference_id: file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
        supporting_text: "region 2 recognizes the -10 element and promotes DNA melting; region 4 contains a helix-turn-helix that recognizes the -35 element"

- term:
    id: GO:0003700
    label: DNA-binding transcription factor activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >
      This annotation is INCORRECT for sigma factors. The GO term definition states that
      DNA-binding transcription factors modulate transcription via "selective and non-covalent
      binding to a specific double-stranded genomic DNA sequence within a cis-regulatory region."
      While sigma factors do recognize promoter elements, they do not function as transcription
      factors that independently bind DNA to regulate transcription. Sigma factors are integral
      components of the transcription machinery, not regulatory factors.
    action: MODIFY
    reason: >
      Sigma factors have a distinct molecular function (GO:0016987 sigma factor activity) that
      is fundamentally different from transcription factor activity. Sigma factors do not bind
      DNA independently - they only bind promoter DNA when part of the RNAP holoenzyme. The
      GO guidelines explicitly note that sigma factors should be annotated to sigma factor
      activity, not transcription factor activity. This InterPro-derived annotation is an
      over-generalization.
    proposed_replacement_terms:
      - id: GO:0016987
        label: sigma factor activity
    supported_by:
      - reference_id: GO:0016987
        supporting_text: "Sigma factors act as the promoter specificity subunit of eubacterial and plant plastid multisubunit RNA polymerases... Although sigma does not bind DNA on its own, when combined with the core to form the holoenzyme, the sigma factor binds specifically to promoter elements."
      - reference_id: file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
        supporting_text: "Encodes the primary sigma factor (SigA/RpoD) that binds RNAP core to form the holoenzyme and directs transcription initiation at housekeeping promoters"

- term:
    id: GO:0005737
    label: cytoplasm
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >
      This cellular component annotation is correct. RpoD functions in the bacterial cytoplasm
      as part of the RNAP holoenzyme, binding core RNAP to engage promoter DNA during
      transcription initiation. This is consistent with UniProt subcellular location annotation
      and structural studies.
    action: ACCEPT
    reason: >
      Cytoplasmic localization is well-established for bacterial sigma factors. The UniProt
      entry explicitly states "SUBCELLULAR LOCATION: Cytoplasm" based on HAMAP rule MF_00963.
      The deep research confirms "Cellular localization: Cytoplasmic; functions within the
      RNAP holoenzyme engaging cytoplasmic DNA during transcription initiation."
    supported_by:
      - reference_id: UniProt:Q72B50
        supporting_text: "SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|HAMAP-Rule:MF_00963}"
      - reference_id: file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
        supporting_text: "Cellular localization: Cytoplasmic; functions within the RNAP holoenzyme engaging cytoplasmic DNA during transcription initiation"

- term:
    id: GO:0006352
    label: DNA-templated transcription initiation
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >
      This is an excellent and appropriate biological process annotation for sigma factors.
      The primary function of RpoD is to enable transcription initiation by directing RNAP
      to promoters. The GO term describes "The initial step of transcription, consisting of
      the assembly of the RNA polymerase preinitiation complex (PIC) at a gene promoter, as
      well as the formation of the first few bonds of the RNA transcript."
    action: ACCEPT
    reason: >
      This is a core biological process annotation that accurately captures the essential
      role of sigma factors in transcription. RpoD directly participates in transcription
      initiation by forming the holoenzyme with RNAP core and recognizing promoter elements,
      which is precisely what this term describes. UniProt function annotation states sigma
      factors "promote the attachment of RNA polymerase to specific initiation sites."
    supported_by:
      - reference_id: UniProt:Q72B50
        supporting_text: "FUNCTION: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released."
      - reference_id: file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
        supporting_text: "Primary sigma factor (RpoD/SigA): The sigma-70-type primary sigma factor enables bacterial RNA polymerase (RNAP) to initiate transcription at housekeeping promoters under normal growth conditions."

- term:
    id: GO:0006355
    label: regulation of DNA-templated transcription
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >
      This annotation is redundant and less specific than GO:0006352 (DNA-templated transcription
      initiation). While sigma factors do influence transcription, they are more precisely
      involved in the initiation process itself rather than regulation of transcription broadly.
      This term is an ancestor of more specific transcription-related terms.
    action: KEEP_AS_NON_CORE
    reason: >
      This is a broader parent term that is technically true but not the most informative
      annotation for sigma factors. The more specific GO:0006352 (DNA-templated transcription
      initiation) better captures the precise role of sigma factors. This annotation is
      acceptable for completeness but represents an over-generalization of sigma factor function.
    supported_by:
      - reference_id: file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
        supporting_text: "Central to transcription initiation of core cellular processes (housekeeping gene expression); thereby indirectly influences growth, metabolism, and stress-responsive regulons by setting basal transcriptional regimes"

- term:
    id: GO:0010468
    label: regulation of gene expression
  evidence_type: IEA
  original_reference_id: GO_REF:0000104
  review:
    summary: >
      This is a very high-level annotation that is technically correct but not very informative.
      All sigma factors influence gene expression by directing RNAP to specific promoters,
      but this term is too general to be useful for understanding sigma factor function.
    action: KEEP_AS_NON_CORE
    reason: >
      While sigma factors do affect gene expression by determining which promoters are
      recognized by RNAP, this annotation is at too high a level of abstraction to be
      informative. More specific terms (GO:0006352, GO:0016987) better capture the actual
      molecular role of sigma factors. This annotation adds little value beyond the more
      specific terms.
    supported_by:
      - reference_id: file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
        supporting_text: "directs transcription initiation at housekeeping promoters during normal growth"

- term:
    id: GO:0016987
    label: sigma factor activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >
      This is the most precise and appropriate molecular function annotation for RpoD.
      Sigma factor activity is defined as the promoter specificity subunit of bacterial
      RNA polymerases that, when combined with the core enzyme to form the holoenzyme,
      binds specifically to promoter elements. This exactly describes the function of RpoD.
    action: ACCEPT
    reason: >
      This annotation is the core molecular function of RpoD. The GO term definition states
      "Sigma factors act as the promoter specificity subunit of eubacterial and plant plastid
      multisubunit RNA polymerases... Although sigma does not bind DNA on its own, when
      combined with the core to form the holoenzyme, the sigma factor binds specifically to
      promoter elements." This precisely matches the known function of RpoD as the primary
      sigma factor.
    supported_by:
      - reference_id: UniProt:Q72B50
        supporting_text: "FUNCTION: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is the primary sigma factor during exponential growth."
      - reference_id: UniProt:Q72B50
        supporting_text: "SIMILARITY: Belongs to the sigma-70 factor family. RpoD/SigA subfamily."
      - reference_id: file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
        supporting_text: "Encodes the primary sigma factor (SigA/RpoD) that binds RNAP core to form the holoenzyme and directs transcription initiation at housekeeping promoters during normal growth"

- term:
    id: GO:0005515
    label: protein binding
  evidence_type: IPI
  original_reference_id: PMID:26873250
  review:
    summary: >
      This annotation is based on high-throughput AP-MS (affinity purification-mass spectrometry)
      data from the Butland et al. D. vulgaris interactome study. The UniProt entry shows
      RpoD interacts with DVU_1368 (Q72CB5) with 2 experiments in IntAct. However, "protein
      binding" is an uninformative term that should be replaced with a more specific binding
      term if possible.
    action: MODIFY
    reason: >
      While the protein-protein interaction is experimentally supported, "protein binding"
      (GO:0005515) is explicitly discouraged in GO annotation guidelines as uninformative.
      For sigma factors, the known protein interactions are with RNAP core subunits (RpoA,
      RpoB, RpoC). The UniProt entry notes "Interacts transiently with the RNA polymerase
      catalytic core" and shows a motif for "Interaction with polymerase core subunit RpoC"
      at residues 380-383. A more appropriate term would be RNA polymerase binding or a
      related term.
    proposed_replacement_terms:
      - id: GO:0001000
        label: bacterial-type RNA polymerase core enzyme binding
    additional_reference_ids:
      - PMID:26873250
    supported_by:
      - reference_id: UniProt:Q72B50
        supporting_text: "SUBUNIT: Interacts transiently with the RNA polymerase catalytic core."
      - reference_id: UniProt:Q72B50
        supporting_text: "MOTIF 380..383 /note=\"Interaction with polymerase core subunit RpoC\" /evidence=\"ECO:0000256|HAMAP-Rule:MF_00963\""
      - reference_id: PMID:26873250
        supporting_text: "Numerous affinity purification-mass spectrometry (AP-MS) and yeast two-hybrid screens have each defined thousands of pairwise protein-protein interactions (PPIs)... We have identified 459 high confidence PPIs from D. vulgaris"

references:
- id: GO_REF:0000002
  title: Gene Ontology annotation through association of InterPro records with GO terms
  findings: []
- id: GO_REF:0000104
  title: Electronic Gene Ontology annotations created by transferring manual GO annotations
    between related proteins based on shared sequence features
  findings: []
- id: GO_REF:0000108
  title: Automatic assignment of GO terms using logical inference, based on inter-ontology links
  findings: []
- id: GO_REF:0000120
  title: Combined Automated Annotation using Multiple IEA Methods
  findings: []
- id: PMID:26873250
  title: 'Bacterial Interactomes: Interacting Protein Partners Share Similar Function
    and Are Validated in Independent Assays More Frequently Than Previously Reported.'
  findings:
    - statement: High-throughput AP-MS study identifying protein-protein interactions in D. vulgaris
      supporting_text: "Here, we present an AP-MS survey of the bacterium Desulfovibrio vulgaris together with a critical reanalysis of nine published bacterial yeast two-hybrid and AP-MS screens"
    - statement: Identified 459 high confidence PPIs from D. vulgaris
      supporting_text: "We have identified 459 high confidence PPIs from D. vulgaris and 391 from Escherichia coli"
- id: PMID:15077118
  title: The genome sequence of the anaerobic, sulfate-reducing bacterium Desulfovibrio
    vulgaris Hildenborough
  findings:
    - statement: Genome sequence establishing DVU_1788 locus for rpoD
      supporting_text: "Desulfovibrio vulgaris Hildenborough is a model organism for studying the energy metabolism of sulfate-reducing bacteria"
- id: DOI:10.1186/s12864-024-10415-x
  title: The modular architecture of sigma factors in cyanobacteria
  findings:
    - statement: Confirms essential domain set for primary sigma-70 factors (r1_2, r2, r3, r4)
    - statement: Documents conserved roles in promoter recognition
    - statement: Organism-specific mapping of rpoD to DVU_1788 in D. vulgaris
- id: DOI:10.3390/microorganisms11041077
  title: Structural analysis of Bacillus subtilis sigma factors
  findings:
    - statement: Structural insights into sigma-70 family domain architecture
    - statement: Confirms sigma-2 and sigma-4 as primary DNA-recognition modules

core_functions:
- molecular_function:
    id: GO:0016987
    label: sigma factor activity
  description: >
    RpoD (SigA) is the primary housekeeping sigma factor in N. vulgaris, belonging to the
    sigma-70 factor family RpoD/SigA subfamily. It contains all conserved sigma-70 domains
    (regions 1.1, 1.2, 2, 3, and 4) with documented helix-turn-helix DNA binding motif
    (residues 550-569). UniProt annotation and HAMAP rule MF_00963 confirm sigma factor
    function. The protein promotes RNAP attachment to specific promoter initiation sites
    and is the primary sigma factor during exponential growth.
  directly_involved_in:
    - id: GO:0006352
      label: DNA-templated transcription initiation
  locations:
    - id: GO:0005737
      label: cytoplasm
  supported_by:
    - reference_id: UniProt:Q72B50
      supporting_text: "FUNCTION: Sigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released. This sigma factor is the primary sigma factor during exponential growth."
    - reference_id: file:DESVH/Q72B50/Q72B50-deep-research-falcon.md
      supporting_text: "Encodes the primary sigma factor (SigA/RpoD) that binds RNAP core to form the holoenzyme and directs transcription initiation at housekeeping promoters during normal growth"

proposed_new_terms: []

suggested_questions:
- question: What are the specific promoter sequence motifs recognized by RpoD in N. vulgaris?
- question: Are there alternative sigma factors in N. vulgaris that compete with RpoD under stress conditions?
- question: What is the identity and function of DVU_1368 (Q72CB5) that interacts with RpoD?

suggested_experiments:
- description: ChIP-seq analysis to map RpoD binding sites genome-wide and identify the RpoD regulon
  hypothesis: RpoD binds to canonical -10/-35 promoter elements in N. vulgaris housekeeping genes
  experiment_type: ChIP-seq
- description: In vitro transcription assays with purified components to confirm promoter recognition specificity
  hypothesis: Purified RpoD enables RNAP to initiate transcription at predicted housekeeping promoters
  experiment_type: In vitro transcription assay
- description: Protein interaction studies to characterize the RpoD-RNAP core interaction interface
  hypothesis: RpoD interacts with RNAP core via the conserved sigma-70 region 2 and region 4 domains
  experiment_type: Co-immunoprecipitation or crosslinking mass spectrometry