DVU_3335

UniProt ID: Q725U0
Organism: Nitratidesulfovibrio vulgaris Hildenborough
Review Status: COMPLETE
📝 Provide Detailed Feedback

Gene Description

DVU_3335 (Q725U0) is a membrane-associated two-component system (TCS) sensor histidine kinase in Desulfovibrio vulgaris Hildenborough (now Nitratidesulfovibrio vulgaris). The protein contains two predicted transmembrane helices (residues 14-36 and 148-170), a PAS sensory domain (residues 238-283), and a C-terminal histidine kinase catalytic core comprising a dimerization/histidine phosphotransfer (DHp/HisKA) domain and a HATPase_C domain (residues 363-578). It catalyzes ATP-dependent autophosphorylation on a conserved histidine residue and subsequent phosphotransfer to a cognate response regulator. The specific environmental signal detected and the cognate response regulator remain uncharacterized. Based on domain architecture and organism-level TCS studies, it likely participates in adaptive regulation tied to nutrient availability, stress sensing, or biofilm-related behaviors.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0000160 phosphorelay signal transduction system
IEA
GO_REF:0000108
ACCEPT
Summary: This biological process annotation is appropriate for a sensor histidine kinase that functions within a two-component phosphorelay system. The annotation was inferred from GO:0000155 (phosphorelay sensor kinase activity) via logical inference. DVU_3335 contains the canonical HisKA and HATPase_C domains characteristic of TCS sensor kinases, supporting participation in phosphorelay signaling.
Reason: The domain architecture (HisKA/DHp + HATPase_C) is diagnostic of two-component system sensor histidine kinases that function via autophosphorylation and phosphotransfer to response regulators. This is the core biological process for TCS sensor kinases. UniProt annotation confirms EC 2.7.13.3 (protein histidine kinase) activity. Genome-wide studies in D. vulgaris Hildenborough have demonstrated TCS roles in regulating substrate-specific functions and stress responses.
Supporting Evidence:
UniProt:Q725U0
PROSITE; PS50109; HIS_KIN; 1.
file:DESVH/Q725U0/Q725U0-deep-research-falcon.md
Large-scale RB-TnSeq links specific phenotypes to genes in DvH, identifying conditional phenotypes for >1,100 non-essential genes and showing examples where nearby two-component systems regulate substrate-specific transporters.
GO:0000155 phosphorelay sensor kinase activity
IEA
GO_REF:0000120
ACCEPT
Summary: This is the most specific and appropriate molecular function term for DVU_3335. The protein contains the HisK_dim/P (HisKA/DHp) and HATPase_C domains that define phosphorelay sensor kinase activity. This term specifically describes catalysis of histidine autophosphorylation in response to an extracellular signal, followed by phosphotransfer to a response regulator.
Reason: GO:0000155 is the correct specific MF term for two-component sensor histidine kinases. The protein has InterPro domains IPR003661 (HisK_dim/P_dom) and IPR003594 (HATPase_dom) which are the structural basis for this activity. The PAS domain (IPR000014) provides the sensory input module typical of environmental signal-responsive sensor kinases. This term captures both the kinase activity and its functional context in phosphorelay signaling better than more generic kinase terms.
Supporting Evidence:
UniProt:Q725U0
InterPro; IPR003661; HisK_dim/P_dom.
file:DESVH/Q725U0/Q725U0-deep-research-falcon.md
The UniProt-listed domains (HATPase_C/HATPase_dom; His_kinase_dom; HisK_dim/P_dom; superfamily annotations HATPase_C_sf, HisK_dim/P_sf) are consistent with a bacterial two-component system (TCS) histidine kinase catalytic core.
GO:0004673 protein histidine kinase activity
IEA
GO_REF:0000003
ACCEPT
Summary: This annotation is based on EC 2.7.13.3 mapping and describes the core catalytic activity (ATP + protein L-histidine = ADP + protein phospho-L-histidine). While technically correct, it is less informative than GO:0000155 which captures the same reaction in the context of two-component signaling.
Reason: The annotation is accurate based on EC number mapping. UniProt explicitly annotates this protein with EC 2.7.13.3. The term describes the biochemical reaction without the functional context. Having both GO:0000155 (more specific, context-aware) and GO:0004673 (based on EC classification) is acceptable as they represent different evidence sources and annotation pipelines.
Supporting Evidence:
UniProt:Q725U0
EC=2.7.13.3 {ECO:0000256|ARBA:ARBA00012438};
UniProt:Q725U0
Reaction=ATP + protein L-histidine = ADP + protein N-phospho-L-
GO:0006355 regulation of DNA-templated transcription
IEA
GO_REF:0000002
REMOVE
Summary: This annotation was inferred from InterPro:IPR013767 (PAS_fold). While many TCS sensor kinases ultimately regulate transcription through their cognate response regulators, DVU_3335 is a sensor kinase, not a transcriptional regulator itself. The transcriptional regulation is performed by the downstream response regulator, not the sensor kinase.
Reason: This annotation represents an over-annotation based on the PAS domain. The PAS domain in DVU_3335 functions as a signal sensor, not as a DNA-binding or transcriptional regulatory domain. The protein lacks any DNA-binding domains and is a membrane-associated sensor with two transmembrane helices. While the phosphorelay cascade may ultimately affect transcription via the response regulator, the sensor kinase itself does not directly regulate transcription. This annotation conflates the downstream effect of phosphorelay signaling with the direct function of the sensor kinase.
Supporting Evidence:
UniProt:Q725U0
Membrane {ECO:0000256|SAM:Phobius};
file:DESVH/Q725U0/Q725U0-deep-research-falcon.md
The phosphate is then transferred to an aspartate residue in a downstream response regulator, to trigger a response.
GO:0007165 signal transduction
IEA
GO_REF:0000002
ACCEPT
Summary: This is a broad biological process term that accurately describes the general function of DVU_3335 as a signal transduction component. However, GO:0000160 (phosphorelay signal transduction system) is more specific and already present, making this annotation redundant.
Reason: While redundant with the more specific GO:0000160, this general annotation is not incorrect. It was inferred from the HisK_dim/P domains (IPR003661, IPR036097) which are hallmarks of signal transducing histidine kinases. Retaining both allows for different levels of annotation granularity in queries. The IEA annotation is valid and consistent with domain-based inference.
Supporting Evidence:
UniProt:Q725U0
InterPro; IPR036097; HisK_dim/P_sf.
GO:0016301 kinase activity
IEA
GO_REF:0000043
MARK AS OVER ANNOTATED
Summary: This is a very general molecular function term inferred from the UniProt keyword "Kinase". It is true but uninformative given that more specific terms (GO:0000155, GO:0004673) are already annotated.
Reason: While technically correct, this term is too general to be informative. GO:0000155 (phosphorelay sensor kinase activity) and GO:0004673 (protein histidine kinase activity) already provide the specific kinase function. "Kinase activity" alone does not distinguish this protein from serine/threonine kinases, tyrosine kinases, or small molecule kinases. The annotation is not wrong but adds no information beyond what is captured by the more specific annotations.
Supporting Evidence:
UniProt:Q725U0
Kinase {ECO:0000313|EMBL:AAS97803.1};
GO:0016740 transferase activity
IEA
GO_REF:0000043
MARK AS OVER ANNOTATED
Summary: This is the most general transferase term, inferred from the UniProt keyword "Transferase". It is an ancestor of all more specific kinase/transferase annotations and provides no useful information.
Reason: GO:0016740 is a root-level molecular function term that encompasses all transferases. While accurate, it provides no discriminatory information. The more specific annotations (GO:0000155, GO:0004673, GO:0016772) already capture the transferase activity in meaningful detail. This annotation is redundant and uninformative for understanding the specific function of DVU_3335.
Supporting Evidence:
UniProt:Q725U0
Transferase {ECO:0000313|EMBL:AAS97803.1};
GO:0016772 transferase activity, transferring phosphorus-containing groups
IEA
GO_REF:0000002
ACCEPT
Summary: This is an intermediate-specificity term describing the transfer of phosphorus groups, inferred from InterPro:IPR004358 (Sig_transdc_His_kin-like_C). It is more specific than GO:0016740 but less specific than the kinase annotations.
Reason: This annotation bridges the GO hierarchy between general transferase activity and specific kinase activity. It was correctly inferred from the histidine kinase-like C-terminal domain (IPR004358). While less informative than GO:0000155 or GO:0004673, it is a valid IEA annotation that accurately describes the biochemical class of reaction catalyzed. The annotation is acceptable as part of the annotation hierarchy.
Supporting Evidence:
UniProt:Q725U0
InterPro; IPR004358; Sig_transdc_His_kin-like_C.

Core Functions

DVU_3335 is a two-component system sensor histidine kinase with canonical domain architecture (HisKA/DHp + HATPase_C + PAS sensory domain). It functions via ATP-dependent autophosphorylation on a conserved histidine residue followed by phosphotransfer to a cognate response regulator. This is the core molecular function of the protein.

References

Gene Ontology annotation through association of InterPro records with GO terms
Gene Ontology annotation based on Enzyme Commission mapping
Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
Automatic assignment of GO terms using logical inference, based on on inter-ontology links
Combined Automated Annotation using Multiple IEA Methods

📚 Additional Documentation

Deep Research Falcon

(Q725U0-deep-research-falcon.md)

provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2026-01-15T10:40:05.484179'
end_time: '2026-01-15T10:45:45.994319'
duration_seconds: 340.51
template_file: templates/gene_research_go_focused.md
template_variables:
organism: DESVH
gene_id: Q725U0
gene_symbol: Q725U0
uniprot_accession: Q725U0
protein_description: 'RecName: Full=histidine kinase {ECO:0000256|ARBA:ARBA00012438};
EC=2.7.13.3 {ECO:0000256|ARBA:ARBA00012438};'
gene_info: OrderedLocusNames=DVU_3335 {ECO:0000313|EMBL:AAS97803.1};
organism_full: Nitratidesulfovibrio vulgaris (strain ATCC 29579 / DSM 644 / CCUG
34227 / NCIMB 8303 / VKM B-1760 / Hildenborough) (Desulfovibrio vulgaris).
protein_family: Not specified in UniProt
protein_domains: HATPase_C_sf. (IPR036890); HATPase_dom. (IPR003594); His_kinase_dom.
(IPR005467); HisK_dim/P_dom. (IPR003661); HisK_dim/P_sf. (IPR036097)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 9


Question

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: Q725U0
  • Protein Description: RecName: Full=histidine kinase {ECO:0000256|ARBA:ARBA00012438}; EC=2.7.13.3 {ECO:0000256|ARBA:ARBA00012438};
  • Gene Information: OrderedLocusNames=DVU_3335 {ECO:0000313|EMBL:AAS97803.1};
  • Organism (full): Nitratidesulfovibrio vulgaris (strain ATCC 29579 / DSM 644 / CCUG 34227 / NCIMB 8303 / VKM B-1760 / Hildenborough) (Desulfovibrio vulgaris).
  • Protein Family: Not specified in UniProt
  • Key Domains: HATPase_C_sf. (IPR036890); HATPase_dom. (IPR003594); His_kinase_dom. (IPR005467); HisK_dim/P_dom. (IPR003661); HisK_dim/P_sf. (IPR036097)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "Q725U0" matches the protein description above
  2. Verify the organism is correct: Nitratidesulfovibrio vulgaris (strain ATCC 29579 / DSM 644 / CCUG 34227 / NCIMB 8303 / VKM B-1760 / Hildenborough) (Desulfovibrio vulgaris).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'Q725U0' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene Q725U0 (gene ID: Q725U0, UniProt: Q725U0) in DESVH.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output

Question: You are an expert researcher providing comprehensive, well-cited information.

Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies

Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: Q725U0
  • Protein Description: RecName: Full=histidine kinase {ECO:0000256|ARBA:ARBA00012438}; EC=2.7.13.3 {ECO:0000256|ARBA:ARBA00012438};
  • Gene Information: OrderedLocusNames=DVU_3335 {ECO:0000313|EMBL:AAS97803.1};
  • Organism (full): Nitratidesulfovibrio vulgaris (strain ATCC 29579 / DSM 644 / CCUG 34227 / NCIMB 8303 / VKM B-1760 / Hildenborough) (Desulfovibrio vulgaris).
  • Protein Family: Not specified in UniProt
  • Key Domains: HATPase_C_sf. (IPR036890); HATPase_dom. (IPR003594); His_kinase_dom. (IPR005467); HisK_dim/P_dom. (IPR003661); HisK_dim/P_sf. (IPR036097)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "Q725U0" matches the protein description above
  2. Verify the organism is correct: Nitratidesulfovibrio vulgaris (strain ATCC 29579 / DSM 644 / CCUG 34227 / NCIMB 8303 / VKM B-1760 / Hildenborough) (Desulfovibrio vulgaris).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'Q725U0' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene Q725U0 (gene ID: Q725U0, UniProt: Q725U0) in DESVH.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Plan and verification
- Target verified: UniProt Q725U0 corresponds to DVU_3335 in Desulfovibrio vulgaris Hildenborough (now Nitratidesulfovibrio vulgaris Hildenborough), annotated as a sensor histidine kinase (EC 2.7.13.3). The UniProt-listed domains (HATPase_C/HATPase_dom; His_kinase_dom; HisK_dim/P_dom; superfamily annotations HATPase_C_sf, HisK_dim/P_sf) are consistent with a bacterial two-component system (TCS) histidine kinase catalytic core. No conflicting gene symbol usage was found in recent literature. Literature specific to DVU_3335 is limited; functional inference is therefore made from domain architecture and organism-level TCS studies.

Key concepts and definitions
- Two-component histidine kinase (HK): membrane-associated or cytosolic sensor enzymes that autophosphorylate a conserved histidine residue using ATP and transfer the phosphoryl group to an aspartate on a cognate response regulator (RR), modulating transcription or behavior. The catalytic core typically comprises a dimerization and histidine phosphotransfer (DHp/HisKA) domain and a C-terminal HATPase (H-box/N/G motifs) domain. Some HK-like systems show domain variations (e.g., GHKL-family HATPase_c_3 fusions) that broaden architecture diversity (bellienyrabelo2021noveltwocomponentsystemlike pages 10-11).
- Reaction chemistry (EC 2.7.13.3): ATP-dependent autophosphorylation on histidine followed by phosphotransfer to RR Asp; regulates adaptive responses to environmental cues.

Identity and domain architecture of DVU_3335 (Q725U0)
- Identity: DVU_3335 encodes a histidine kinase by UniProt annotation; locus/organism match confirmed.
- Domains: Presence of HATPase_C and HisKA/His_kinase_dom implies a canonical cytosolic catalytic core. The “HisK_dim/P” annotations indicate a dimerization/phosphotransfer module typical of HKs. These align with known HK architectures; comparative analyses describe additional HK-like architectures (e.g., HEF_HK associated with HATPase_c_3) that emphasize the diversity of kinase scaffolds, reinforcing the plausibility of Q725U0’s HK assignment (bellienyrabelo2021noveltwocomponentsystemlike pages 10-11).
- Predicted localization: The catalytic HATPase/DHp core is cytosolic. Many HKs possess N-terminal sensory/transmembrane segments to detect periplasmic/extracellular signals; UniProt’s catalytic domain set supports a membrane-associated sensor HK model for DVU_3335, although a precise transmembrane topology and periplasmic domain have not been resolved in the sources consulted (bellienyrabelo2021noveltwocomponentsystemlike pages 10-11).

Gene-specific evidence and limitations
- No primary publications were found that experimentally characterize DVU_3335/Q725U0. Consequently, cognate RR identity, specific stimulus, and direct phenotypes remain unresolved in current accessible sources. Inference is based on conserved HK domains and organism-level TCS studies in D. vulgaris Hildenborough (trotter2023largescalegeneticcharacterization pages 4-6, thakur2023textminingtoidentify pages 17-18).

Broader context: two-component systems in Desulfovibrio vulgaris Hildenborough and SRB
- Genome-wide functional genetics in DvH (RB-TnSeq, 2023): A randomly barcoded transposon mutant library across DvH yielded 757 genome-wide fitness assays covering 2,741 protein-coding genes and conditional phenotypes for 1,137 non-essential genes. The dataset links TCS modules to regulation of substrate-specific functions, exemplified by a two-component system (DVU0596:DVU0597) showing pyruvate-specific phenotypes and regulation of a nearby transporter (DVU0599/cstA). This directly evidences TCS roles in coordinating metabolic uptake in DvH and provides a resource to test additional HKs like DVU_3335 under diverse conditions (Frontiers in Microbiology, March 2023; https://doi.org/10.3389/fmicb.2023.1095191) (trotter2023largescalegeneticcharacterization pages 4-6).
- SRB TCS in stress, biofilms, and metabolism (text-mining synthesis, 2023): A semi-automated workflow compiling SRB literature identifies numerous Desulfovibrio TCS genes implicated in stress sensing, biofilm formation, electron transfer, and biocorrosion contexts. The study integrates GO enrichment and network analysis, supporting the involvement of histidine kinases in adaptive responses relevant to SRB ecology and engineering challenges (Microorganisms, January 2023; https://doi.org/10.3390/microorganisms11010119) (thakur2023textminingtoidentify pages 17-18).
- HK domain diversity (comparative domain analysis, 2021): Genome-wide analysis of novel TCS-like elements across bacteria highlights variant HK-like architectures (e.g., HEF_HK paired with a Response_reg_2 domain and HATPase_c_3), often linked to restriction–modification loci. This underscores the architectural and functional breadth of HKs and informs interpretation of kinase domain annotations when precise sensor modules are unannotated (Genome Biology and Evolution, February 2021; https://doi.org/10.1093/gbe/evab024) (bellienyrabelo2021noveltwocomponentsystemlike pages 10-11).

Current understanding of DVU_3335 function and pathway context (inferred)
- Primary biochemical function: Based on EC 2.7.13.3 and domain composition, DVU_3335 is predicted to function as a sensor HK that autophosphorylates and transfers phosphate to a RR, thereby regulating gene expression or behavior in response to an environmental or intracellular cue. The specific substrate/signal is unknown; in DvH, TCS commonly regulate nutrient uptake, stress responses, and biofilm/electron-transfer adaptations (trotter2023largescalegeneticcharacterization pages 4-6, thakur2023textminingtoidentify pages 17-18).
- Likely cellular location: Inner membrane-associated sensor with a cytosolic DHp/HATPase catalytic core; exact transmembrane and periplasmic sensory domain architecture is unresolved in sources used (bellienyrabelo2021noveltwocomponentsystemlike pages 10-11).
- Pathways and processes: By analogy to experimentally supported DvH TCS roles, DVU_3335 plausibly participates in adaptive regulation tied to nutrient availability (e.g., uptake/transport systems), stress sensing (e.g., redox, metals, osmotic), or community behaviors (e.g., biofilm), but gene-specific pathway assignment awaits experimental mapping (trotter2023largescalegeneticcharacterization pages 4-6, thakur2023textminingtoidentify pages 17-18).

Recent developments and latest research (2023–2024 emphasis)
- DvH RB-TnSeq resource enables direct testing of DVU_3335: The 2023 dataset provides condition-specific fitness measurements and TCS examples (DVU0596:DVU0597→DVU0599), highlighting an actionable path to interrogate DVU_3335 under targeted stresses or substrates (March 2023; https://doi.org/10.3389/fmicb.2023.1095191) (trotter2023largescalegeneticcharacterization pages 4-6).
- SRB TCS roles synthesized for applications: 2023 text-mining analysis synthesizes evidence linking Desulfovibrio HKs to stress and biocorrosion phenotypes, prioritizing gene sets for further validation in industrially relevant settings (January 2023; https://doi.org/10.3390/microorganisms11010119) (thakur2023textminingtoidentify pages 17-18).
- Domain-level advances: Novel HK-like systems illuminate architectural breadth relevant to hidden/atypical domains that might occur in poorly annotated SRB HKs; useful when interpreting Q725U0 in absence of experimentally defined sensor modules (February 2021; https://doi.org/10.1093/gbe/evab024) (bellienyrabelo2021noveltwocomponentsystemlike pages 10-11).

Current applications and real-world implementations
- Biocorrosion and biofilms: Desulfovibrio spp. are central to sulfate-reducing biofilms and pipeline/corrosion contexts. Compiled SRB literature identifies histidine kinases/TCS among gene sets implicated in biofilm formation, stress response, and electron transfer, marking these regulators as potential targets for mitigation/monitoring strategies (thakur2023textminingtoidentify pages 17-18).
- Systems-level functional genomics: The DvH RB-TnSeq resource can be applied to define condition-specific roles for uncharacterized HKs (including DVU_3335), accelerating hypothesis-driven intervention points in SRB management (trotter2023largescalegeneticcharacterization pages 4-6).

Expert opinions and analyses from authoritative sources
- The RB-TnSeq authors emphasize that specific phenotypes in DvH can be mapped to nearby TCS modules, revealing regulatory wiring for substrate uptake and stress adaptation, thus advocating the use of high-throughput genetics to resolve functions of uncharacterized regulators (trotter2023largescalegeneticcharacterization pages 4-6).
- Comparative domain work argues for broad HK architectural diversity beyond canonical HisKA/HATPase_C, advising caution and comprehensive domain evaluation in annotations, which is particularly pertinent for SRB HKs with sparse experimental data (bellienyrabelo2021noveltwocomponentsystemlike pages 10-11).

Relevant statistics and datasets
- DvH RB-TnSeq: 757 assays; 2,741 protein-coding genes; conditional phenotypes for 1,137 non-essential genes; example: DVU0596:DVU0597 TCS regulating DVU0599/cstA in pyruvate conditions (trotter2023largescalegeneticcharacterization pages 4-6).
- SRB text-mining: 227 SRB genes compiled with enrichment of functions in metal binding, electron transfer, and sulfur metabolism; Desulfovibrio histidine kinases included among implicated regulators in biocorrosion-relevant contexts (thakur2023textminingtoidentify pages 17-18).

Outstanding gaps and recommendations for DVU_3335 (Q725U0)
- Cognate RR and operon context: Not resolved in available sources; recommend mining the DvH genome neighborhood and RB-TnSeq cofitness/conditional phenotypes to identify candidate RR(s) and operonic links (trotter2023largescalegeneticcharacterization pages 4-6).
- Stimulus and phenotype: Test DVU_3335 mutants (or barcoded transposon insertions) across substrate and stress panels where SRB TCS are implicated (nutrient shifts, metal stress, redox perturbations) to detect condition-specific fitness signatures, following the strategy that revealed DVU0596:DVU0597→DVU0599 regulation (trotter2023largescalegeneticcharacterization pages 4-6, thakur2023textminingtoidentify pages 17-18).
- Domain mapping: Verify transmembrane segments and potential periplasmic sensor domain via updated predictive tools; consider atypical HK-like domain architectures described for bacterial HKs (bellienyrabelo2021noveltwocomponentsystemlike pages 10-11).

Embedded evidence summary
| Source (short citation) | Year | URL | DvH/DVU locus relevance (yes/no and details) | Key findings relevant to histidine kinases/TCS | Notes |
|---|---:|---|---|---|---|
| Trotter et al., Front. Microbiol. (trotter2023largescalegeneticcharacterization pages 4-6) | 2023 | https://doi.org/10.3389/fmicb.2023.1095191 | Yes — DvH RB‑TnSeq resource; reports TCS-related phenotypes (example: DVU0596:DVU0597); DVU_3335 not specifically characterized. | Large-scale RB‑TnSeq links specific phenotypes to genes in DvH, identifying conditional phenotypes for >1,100 non-essential genes and showing examples where nearby two-component systems regulate substrate-specific transporters. | Genome-wide fitness data and RB‑TnSeq mutant library available (figshare/SRA); valuable for hypothesis-driven functional tests of TCS. |
| Thakur et al., Microorganisms (thakur2023textminingtoidentify pages 17-18) | 2023 | https://doi.org/10.3390/microorganisms11010119 | Yes — Text‑mining retrieved Desulfovibrio vulgaris Hildenborough and highlights numerous histidine kinases/TCS involved in biocorrosion and stress response; no DVU_3335-specific evidence. | Semi-automated literature mining implicates TCS in stress sensing, biofilm regulation, and electron-transfer processes relevant to SRB biocorrosion, compiling a gene list that includes many Desulfovibrio histidine kinases. | Identified 227 SRB genes linked to biocorrosion processes; supports organism-level inference when gene-specific data are lacking. |
| Bellieny-Rabelo et al., Genome Biol. Evol. (bellienyrabelo2021noveltwocomponentsystemlike pages 10-11) | 2021 | https://doi.org/10.1093/gbe/evab024 | No — Pan-bacterial analysis of novel TCS-like elements; not DvH-specific but informs HK domain diversity and atypical architectures. | Describes novel domains (HEF_HK, Response_reg_2) and frequent association with HATPase_c_3 (GHKL family), revealing alternative histidine-kinase–like architectures and genomic linkages useful to interpret variant domain compositions in bacterial HKs. | Provides comparative domain context helpful for interpreting UniProt domain annotations (HATPase_C, HisKA, HATPase_c_3) in proteins such as Q725U0. |

Table: A concise table summarizing three key sources (RB‑TnSeq resource, literature text‑mining, and domain‑architecture analysis) that inform histidine kinase/TCS function and domain context in Desulfovibrio vulgaris Hildenborough; includes URLs and notes for follow‑up.

Mandatory ambiguity statement
- The gene symbol “Q725U0” itself is a UniProt accession, not a conventional gene symbol; gene-specific literature for DVU_3335 is currently limited. The functional description above relies on UniProt-provided domain annotations and inference from recent organism-level and domain-level studies of Desulfovibrio TCS. Further experimental validation in DvH is needed to establish DVU_3335’s precise signal, cognate RR, and regulatory targets (trotter2023largescalegeneticcharacterization pages 4-6, thakur2023textminingtoidentify pages 17-18, bellienyrabelo2021noveltwocomponentsystemlike pages 10-11).

References

  1. (bellienyrabelo2021noveltwocomponentsystemlike pages 10-11): Daniel Bellieny-Rabelo, Willem J S Pretorius, and Lucy N Moleleki. Novel two-component system-like elements reveal functional domains associated with restriction–modification systems and paramorc atpases in bacteria. Genome Biology and Evolution, Feb 2021. URL: https://doi.org/10.1093/gbe/evab024, doi:10.1093/gbe/evab024. This article has 6 citations and is from a domain leading peer-reviewed journal.

  2. (trotter2023largescalegeneticcharacterization pages 4-6): Valentine V. Trotter, Maxim Shatsky, Morgan N. Price, Thomas R. Juba, Grant M. Zane, Kara B. De León, Erica L.-W. Majumder, Qin Gui, Rida Ali, Kelly M. Wetmore, Jennifer V. Kuehl, Adam P. Arkin, Judy D. Wall, Adam M. Deutschbauer, John-Marc Chandonia, and Gareth P. Butland. Large-scale genetic characterization of the model sulfate-reducing bacterium, desulfovibrio vulgaris hildenborough. Frontiers in Microbiology, Mar 2023. URL: https://doi.org/10.3389/fmicb.2023.1095191, doi:10.3389/fmicb.2023.1095191. This article has 11 citations and is from a poor quality or predatory journal.

  3. (thakur2023textminingtoidentify pages 17-18): Payal Thakur, Mathew O. Alaba, Shailabh Rauniyar, Ram Nageena Singh, Priya Saxena, Alain Bomgni, Etienne Z. Gnimpieba, Carol Lushbough, Kian Mau Goh, and Rajesh Kumar Sani. Text-mining to identify gene sets involved in biocorrosion by sulfate-reducing bacteria: a semi-automated workflow. Microorganisms, 11:119, Jan 2023. URL: https://doi.org/10.3390/microorganisms11010119, doi:10.3390/microorganisms11010119. This article has 17 citations and is from a poor quality or predatory journal.

Citations

  1. bellienyrabelo2021noveltwocomponentsystemlike pages 10-11
  2. trotter2023largescalegeneticcharacterization pages 4-6
  3. thakur2023textminingtoidentify pages 17-18
  4. https://doi.org/10.3389/fmicb.2023.1095191
  5. https://doi.org/10.3390/microorganisms11010119
  6. https://doi.org/10.1093/gbe/evab024
  7. https://doi.org/10.1093/gbe/evab024,
  8. https://doi.org/10.3389/fmicb.2023.1095191,
  9. https://doi.org/10.3390/microorganisms11010119,

📄 View Raw YAML

id: Q725U0
gene_symbol: DVU_3335
product_type: PROTEIN
status: COMPLETE
taxon:
  id: NCBITaxon:882
  label: Nitratidesulfovibrio vulgaris Hildenborough
description: >
  DVU_3335 (Q725U0) is a membrane-associated two-component system (TCS) sensor histidine kinase
  in Desulfovibrio vulgaris Hildenborough (now Nitratidesulfovibrio vulgaris). The protein contains
  two predicted transmembrane helices (residues 14-36 and 148-170), a PAS sensory domain (residues 238-283),
  and a C-terminal histidine kinase catalytic core comprising a dimerization/histidine phosphotransfer
  (DHp/HisKA) domain and a HATPase_C domain (residues 363-578). It catalyzes ATP-dependent autophosphorylation
  on a conserved histidine residue and subsequent phosphotransfer to a cognate response regulator.
  The specific environmental signal detected and the cognate response regulator remain uncharacterized.
  Based on domain architecture and organism-level TCS studies, it likely participates in adaptive
  regulation tied to nutrient availability, stress sensing, or biofilm-related behaviors.
existing_annotations:
- term:
    id: GO:0000160
    label: phosphorelay signal transduction system
  evidence_type: IEA
  original_reference_id: GO_REF:0000108
  review:
    summary: >
      This biological process annotation is appropriate for a sensor histidine kinase that functions
      within a two-component phosphorelay system. The annotation was inferred from GO:0000155
      (phosphorelay sensor kinase activity) via logical inference. DVU_3335 contains the canonical
      HisKA and HATPase_C domains characteristic of TCS sensor kinases, supporting participation
      in phosphorelay signaling.
    action: ACCEPT
    reason: >
      The domain architecture (HisKA/DHp + HATPase_C) is diagnostic of two-component system sensor
      histidine kinases that function via autophosphorylation and phosphotransfer to response regulators.
      This is the core biological process for TCS sensor kinases. UniProt annotation confirms EC 2.7.13.3
      (protein histidine kinase) activity. Genome-wide studies in D. vulgaris Hildenborough have
      demonstrated TCS roles in regulating substrate-specific functions and stress responses.
    supported_by:
      - reference_id: UniProt:Q725U0
        supporting_text: "PROSITE; PS50109; HIS_KIN; 1."
      - reference_id: file:DESVH/Q725U0/Q725U0-deep-research-falcon.md
        supporting_text: "Large-scale RB-TnSeq links specific phenotypes to genes in DvH, identifying conditional phenotypes for >1,100 non-essential genes and showing examples where nearby two-component systems regulate substrate-specific transporters."
- term:
    id: GO:0000155
    label: phosphorelay sensor kinase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >
      This is the most specific and appropriate molecular function term for DVU_3335. The protein
      contains the HisK_dim/P (HisKA/DHp) and HATPase_C domains that define phosphorelay sensor
      kinase activity. This term specifically describes catalysis of histidine autophosphorylation
      in response to an extracellular signal, followed by phosphotransfer to a response regulator.
    action: ACCEPT
    reason: >
      GO:0000155 is the correct specific MF term for two-component sensor histidine kinases.
      The protein has InterPro domains IPR003661 (HisK_dim/P_dom) and IPR003594 (HATPase_dom)
      which are the structural basis for this activity. The PAS domain (IPR000014) provides
      the sensory input module typical of environmental signal-responsive sensor kinases.
      This term captures both the kinase activity and its functional context in phosphorelay
      signaling better than more generic kinase terms.
    supported_by:
      - reference_id: UniProt:Q725U0
        supporting_text: "InterPro; IPR003661; HisK_dim/P_dom."
      - reference_id: file:DESVH/Q725U0/Q725U0-deep-research-falcon.md
        supporting_text: "The UniProt-listed domains (HATPase_C/HATPase_dom; His_kinase_dom; HisK_dim/P_dom; superfamily annotations HATPase_C_sf, HisK_dim/P_sf) are consistent with a bacterial two-component system (TCS) histidine kinase catalytic core."
- term:
    id: GO:0004673
    label: protein histidine kinase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000003
  review:
    summary: >
      This annotation is based on EC 2.7.13.3 mapping and describes the core catalytic activity
      (ATP + protein L-histidine = ADP + protein phospho-L-histidine). While technically correct,
      it is less informative than GO:0000155 which captures the same reaction in the context
      of two-component signaling.
    action: ACCEPT
    reason: >
      The annotation is accurate based on EC number mapping. UniProt explicitly annotates
      this protein with EC 2.7.13.3. The term describes the biochemical reaction without
      the functional context. Having both GO:0000155 (more specific, context-aware) and
      GO:0004673 (based on EC classification) is acceptable as they represent different
      evidence sources and annotation pipelines.
    supported_by:
      - reference_id: UniProt:Q725U0
        supporting_text: "EC=2.7.13.3 {ECO:0000256|ARBA:ARBA00012438};"
      - reference_id: UniProt:Q725U0
        supporting_text: "Reaction=ATP + protein L-histidine = ADP + protein N-phospho-L-"
- term:
    id: GO:0006355
    label: regulation of DNA-templated transcription
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >
      This annotation was inferred from InterPro:IPR013767 (PAS_fold). While many TCS sensor
      kinases ultimately regulate transcription through their cognate response regulators,
      DVU_3335 is a sensor kinase, not a transcriptional regulator itself. The transcriptional
      regulation is performed by the downstream response regulator, not the sensor kinase.
    action: REMOVE
    reason: >
      This annotation represents an over-annotation based on the PAS domain. The PAS domain
      in DVU_3335 functions as a signal sensor, not as a DNA-binding or transcriptional
      regulatory domain. The protein lacks any DNA-binding domains and is a membrane-associated
      sensor with two transmembrane helices. While the phosphorelay cascade may ultimately
      affect transcription via the response regulator, the sensor kinase itself does not
      directly regulate transcription. This annotation conflates the downstream effect of
      phosphorelay signaling with the direct function of the sensor kinase.
    supported_by:
      - reference_id: UniProt:Q725U0
        supporting_text: "Membrane {ECO:0000256|SAM:Phobius};"
      - reference_id: file:DESVH/Q725U0/Q725U0-deep-research-falcon.md
        supporting_text: "The phosphate is then transferred to an aspartate residue in a downstream response regulator, to trigger a response."
- term:
    id: GO:0007165
    label: signal transduction
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >
      This is a broad biological process term that accurately describes the general function
      of DVU_3335 as a signal transduction component. However, GO:0000160 (phosphorelay signal
      transduction system) is more specific and already present, making this annotation redundant.
    action: ACCEPT
    reason: >
      While redundant with the more specific GO:0000160, this general annotation is not
      incorrect. It was inferred from the HisK_dim/P domains (IPR003661, IPR036097) which
      are hallmarks of signal transducing histidine kinases. Retaining both allows for
      different levels of annotation granularity in queries. The IEA annotation is valid
      and consistent with domain-based inference.
    supported_by:
      - reference_id: UniProt:Q725U0
        supporting_text: "InterPro; IPR036097; HisK_dim/P_sf."
- term:
    id: GO:0016301
    label: kinase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: >
      This is a very general molecular function term inferred from the UniProt keyword "Kinase".
      It is true but uninformative given that more specific terms (GO:0000155, GO:0004673)
      are already annotated.
    action: MARK_AS_OVER_ANNOTATED
    reason: >
      While technically correct, this term is too general to be informative. GO:0000155
      (phosphorelay sensor kinase activity) and GO:0004673 (protein histidine kinase activity)
      already provide the specific kinase function. "Kinase activity" alone does not distinguish
      this protein from serine/threonine kinases, tyrosine kinases, or small molecule kinases.
      The annotation is not wrong but adds no information beyond what is captured by the
      more specific annotations.
    supported_by:
      - reference_id: UniProt:Q725U0
        supporting_text: "Kinase {ECO:0000313|EMBL:AAS97803.1};"
- term:
    id: GO:0016740
    label: transferase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000043
  review:
    summary: >
      This is the most general transferase term, inferred from the UniProt keyword "Transferase".
      It is an ancestor of all more specific kinase/transferase annotations and provides
      no useful information.
    action: MARK_AS_OVER_ANNOTATED
    reason: >
      GO:0016740 is a root-level molecular function term that encompasses all transferases.
      While accurate, it provides no discriminatory information. The more specific annotations
      (GO:0000155, GO:0004673, GO:0016772) already capture the transferase activity in
      meaningful detail. This annotation is redundant and uninformative for understanding
      the specific function of DVU_3335.
    supported_by:
      - reference_id: UniProt:Q725U0
        supporting_text: "Transferase {ECO:0000313|EMBL:AAS97803.1};"
- term:
    id: GO:0016772
    label: transferase activity, transferring phosphorus-containing groups
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >
      This is an intermediate-specificity term describing the transfer of phosphorus groups,
      inferred from InterPro:IPR004358 (Sig_transdc_His_kin-like_C). It is more specific
      than GO:0016740 but less specific than the kinase annotations.
    action: ACCEPT
    reason: >
      This annotation bridges the GO hierarchy between general transferase activity and
      specific kinase activity. It was correctly inferred from the histidine kinase-like
      C-terminal domain (IPR004358). While less informative than GO:0000155 or GO:0004673,
      it is a valid IEA annotation that accurately describes the biochemical class of
      reaction catalyzed. The annotation is acceptable as part of the annotation hierarchy.
    supported_by:
      - reference_id: UniProt:Q725U0
        supporting_text: "InterPro; IPR004358; Sig_transdc_His_kin-like_C."
core_functions:
  - molecular_function:
      id: GO:0000155
      label: phosphorelay sensor kinase activity
    description: >
      DVU_3335 is a two-component system sensor histidine kinase with canonical domain architecture
      (HisKA/DHp + HATPase_C + PAS sensory domain). It functions via ATP-dependent autophosphorylation
      on a conserved histidine residue followed by phosphotransfer to a cognate response regulator.
      This is the core molecular function of the protein.
references:
- id: GO_REF:0000002
  title: Gene Ontology annotation through association of InterPro records with GO terms
  findings: []
- id: GO_REF:0000003
  title: Gene Ontology annotation based on Enzyme Commission mapping
  findings: []
- id: GO_REF:0000043
  title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
  findings: []
- id: GO_REF:0000108
  title: Automatic assignment of GO terms using logical inference, based on on inter-ontology links
  findings: []
- id: GO_REF:0000120
  title: Combined Automated Annotation using Multiple IEA Methods
  findings: []