mllH

UniProt ID: C5B1I9
Organism: Methylorubrum extorquens AM1
Review Status: DRAFT
📝 Provide Detailed Feedback

Gene Description

mllH (MexAM1_META1p4137; UniProt C5B1I9) encodes a predicted GCN5-related N-acetyltransferase (GNAT superfamily) located within the methylolanthanin (mll) biosynthetic gene cluster of Methylorubrum extorquens AM1. Methylolanthanin is a small-molecule lanthanide-binding metallophore (lanthanophore) - NOT an iron siderophore - composed of a central citrate linked to two 4-hydroxybenzoate moieties via acetylated homospermidine residues. Because the final product contains acetylated homospermidine, MllH is inferred to catalyze the N-acetylation step on a polyamine (homo)spermidine-derived linker during methylolanthanin assembly. This function is a bioinformatic/cluster-level inference: in the available literature MllH has no purified-enzyme assay, no confirmed substrate specificity or kinetics, and no gene-specific knockout phenotype. The mll cluster as a whole is experimentally supported (deletion abolishes methylolanthanin; the locus is ~32-fold upregulated under poorly soluble Nd2O3) and mediates lanthanide acquisition/homeostasis.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0016747 acyltransferase activity, transferring groups other than amino-acyl groups
IEA
GO_REF:0000002
MODIFY
Summary: MllH is a GNAT-family acetyltransferase. The generic acyltransferase term is correct but underspecified; the GNAT domain plus the predicted N-acetylation of a (homo)spermidine linker support the more informative child term N-acetyltransferase activity (GO:0008080).
Reason: The IEA term GO:0016747 is a correct but high-level parent. The protein carries a GNAT (GCN5-related N-acetyltransferase) domain and is predicted to acetylate a polyamine (homospermidine) amine, i.e. transfer an acetyl group to a nitrogen atom. The descendant term GO:0008080 (N-acetyltransferase activity) is therefore more informative and still consistent with the domain evidence. Substrate-specific terms (e.g. diamine N-acetyltransferase) are not used because the actual substrate is experimentally unresolved.
Proposed replacements: N-acetyltransferase activity
Supporting Evidence:
file:METEA/mllH/mllH-uniprot.txt
SubName: Full=GCN5-related N-acetyltransferase
file:METEA/mllH/mllH-deep-research-falcon.md
GNATs are a widespread superfamily of acetyltransferases that (canonically) transfer an acetyl group from **acetyl-coenzyme A (AcCoA)** to a nucleophilic amine on a substrate
file:METEA/mllH/mllH-deep-research-falcon.md
MllH likely catalyzes an **acetylation step on a polyamine (homo)spermidine-derived linker** used to assemble the lanthanide-binding metallophore
GO:0044550 secondary metabolite biosynthetic process
IGC
GO_REF:0000002
NEW
Summary: Proposed new biological-process annotation. MllH is part of the mll cluster that biosynthesizes methylolanthanin, a secreted lanthanide-binding metallophore (a specialized/secondary metabolite). No siderophore term is used because methylolanthanin chelates lanthanides, not iron.
Reason: The mll cluster is experimentally required for methylolanthanin production and MllH is a predicted biosynthetic tailoring enzyme within it. secondary metabolite biosynthetic process (GO:0044550) is the most appropriate available BP term for lanthanophore biosynthesis; GO:0019290 (siderophore biosynthetic process) is explicitly inappropriate because methylolanthanin is a lanthanide chelator, not an Fe(III)-siderophore. This is a cluster-level/bioinformatic inference, not a gene-specific knockout result for mllH.
Supporting Evidence:
file:METEA/mllH/mllH-deep-research-falcon.md
Zytnick et al. describe an **mll biosynthetic gene cluster spanning META1p4129–META1p4138** in *M. extorquens* AM1, responsible for production of a lanthanide-binding metallophore named **methylolanthanin (MLL)**
file:METEA/mllH/mllH-deep-research-falcon.md
mllH is embedded in a functional lanthanophore pathway and should be interpreted primarily as a **biosynthetic accessory enzyme**, not as a global protein acetyltransferase

Core Functions

Predicted N-acetylation of a polyamine (homospermidine) linker during methylolanthanin (lanthanophore) biosynthesis. Acetyl-CoA donor and homospermidine substrate are inferred from GNAT enzymology and the structure of methylolanthanin; not directly demonstrated for MllH.

Supporting Evidence:
  • file:METEA/mllH/mllH-uniprot.txt
    SubName: Full=GCN5-related N-acetyltransferase
  • file:METEA/mllH/mllH-deep-research-falcon.md
    it contains a **central citrate** linked to **two 4-hydroxybenzoate moieties via acetylated homospermidine residues**
  • file:METEA/mllH/mllH-deep-research-falcon.md
    should be interpreted primarily as a **biosynthetic accessory enzyme**, not as a global protein acetyltransferase

References

Gene Ontology annotation through association of InterPro records with GO terms.
file:METEA/mllH/mllH-deep-research-falcon.md
Falcon deep research on mllH (C5B1I9) acetyltransferase in methylolanthanin biosynthesis
  • "Zytnick et al. explicitly refer to **“putative acetyltransferase mllH (META1p4137)”** in *Methylobacterium/Methylorubrum extorquens* AM1"
  • "Zytnick et al. describe an **mll biosynthetic gene cluster spanning META1p4129–META1p4138** in *M. extorquens* AM1, responsible for production of a lanthanide-binding metallophore named **methylolanthanin (MLL)**"
  • "it contains a **central citrate** linked to **two 4-hydroxybenzoate moieties via acetylated homospermidine residues**"
  • "**mllH (META1p4137)** is annotated as a **putative acetyltransferase**, and Zytnick et al. state that its presence suggested incorporation of an **acetylated (homo)spermidine linker**, by analogy to rhodopetrobactin-like pathways"
  • "mllH is not biochemically characterized at the single-enzyme level"
  • "no purified enzyme assay, no confirmed substrate range/kinetics, no gene-specific knockout phenotype reported"
  • "the most conservative current annotation is that MllH acts **intracellularly (likely cytosolic)** during methylolanthanin biosynthesis"
  • "mllH is embedded in a functional lanthanophore pathway and should be interpreted primarily as a **biosynthetic accessory enzyme**, not as a global protein acetyltransferase"
file:METEA/mllH/mllH-deep-research-perplexity.md
Perplexity deep research on mllH in lanthanophore biosynthesis
  • "META1p4137 (mllH) is homologous to genes involved in rhodopetrobactin biosynthesis"
  • "The presence of mllH suggests incorporation of acetylated homospermidine linker"

Deep Research

Falcon

(mllH-deep-research-falcon.md)
Research report: functional annotation of **mllH** (UniProt **C5B1I9**; locus **META1p4137 / MexAM1_META1p4137**) in *Methylorubrum extorquens* AM1 Falcon Edison Scientific Literature 13 citations 1 artifacts 2026-06-03T08:37:58.962238

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Research report: functional annotation of mllH (UniProt C5B1I9; locus META1p4137 / MexAM1_META1p4137) in Methylorubrum extorquens AM1

Executive summary

The gene symbol mllH in Methylorubrum extorquens AM1 corresponds to META1p4137, annotated as a putative GCN5-related N-acetyltransferase (GNAT) within the methylolanthanin (mll) biosynthetic gene cluster. In the available primary literature, mllH is not biochemically characterized at the single-enzyme level (no purified enzyme assay, no confirmed substrate range/kinetics, no gene-specific knockout phenotype reported). Instead, the best-supported functional context comes from cluster-level genetics and metabolite structure, which together imply that MllH likely catalyzes an acetylation step on a polyamine (homo)spermidine-derived linker used to assemble the lanthanide-binding metallophore methylolanthanin. (zytnick2022discoveryandcharacterization pages 3-5, zytnick2022discoveryandcharacterization pages 5-8)

1) Target identity verification (mandatory)

1.1 Gene symbol “mllH” matches the UniProt description context

Zytnick et al. explicitly refer to “putative acetyltransferase mllH (META1p4137)” in Methylobacterium/Methylorubrum extorquens AM1, linking the symbol mllH directly to the locus tag META1p4137 and to an acetyltransferase prediction consistent with the UniProt description you provided (GNAT-like acetyltransferase). (zytnick2022discoveryandcharacterization pages 3-5)

1.2 Organism verification

The same work places this locus in the methylotroph Methylorubrum extorquens AM1 (historically Methylobacterium extorquens AM1 in older nomenclature), matching your specified organism/strain context. (zytnick2022discoveryandcharacterization pages 3-5)

1.3 Domain/family alignment (GNAT acetyltransferase)

The locus-level annotation (“putative acetyltransferase”) and the predicted role (incorporation of an acetylated (homo)spermidine linker) are consistent with GNAT small-molecule acetyltransferase chemistry, which typically transfers an acetyl group from acetyl-CoA to an amine substrate. (zytnick2022discoveryandcharacterization pages 3-5, burckhardt2020smallmoleculeacetylationby pages 3-3, burckhardt2020smallmoleculeacetylationby pages 1-3)

Conclusion of verification: the literature evidence retrieved supports that “mllH” refers to the correct gene in the correct organism and is consistent with a GNAT-family acetyltransferase role. (zytnick2022discoveryandcharacterization pages 3-5)

2) Key concepts and definitions (current understanding)

GNATs are a widespread superfamily of acetyltransferases that (canonically) transfer an acetyl group from acetyl-coenzyme A (AcCoA) to a nucleophilic amine on a substrate. They can acetylate small molecules (including polyamines) as well as protein lysine residues, and the GNAT fold supports diverse substrate specificities. (burckhardt2020smallmoleculeacetylationby pages 1-3, burckhardt2020smallmoleculeacetylationby pages 3-3)

Mechanistically, GNATs catalyze acetyl transfer from the AcCoA thioester to a substrate amine, often via general acid/base catalysis involving conserved acidic residues (Asp/Glu as bases) and Tyr/Ser as acids in many characterized GNATs (with substantial variation across the superfamily). (burckhardt2020smallmoleculeacetylationby pages 3-5)

2.2 Polyamine acetylation as a GNAT-catalyzed reaction class

A well-described GNAT reaction class is polyamine N-acetylation (e.g., spermidine/spermine acetyltransferases), where acetylation reduces positive charge and can alter transport, stability, or downstream metabolism. The 2020 MMBR review catalogs multiple characterized GNATs that acetylate polyamines (e.g., SpeG, PaiA, BltD, SnaB), typically using AcCoA as the acetyl donor. (burckhardt2020smallmoleculeacetylationby pages 3-3, burckhardt2020smallmoleculeacetylationby pages 13-14)

Relevance to mllH: Zytnick et al. propose that MllH’s presence in the methylolanthanin biosynthetic cluster suggests incorporation of an acetylated (homo)spermidine linker, which aligns strongly with GNAT polyamine acetyltransferase reaction logic (acetylating an amine-bearing linker). (zytnick2022discoveryandcharacterization pages 3-5, burckhardt2020smallmoleculeacetylationby pages 13-14)

3) Primary functional context: methylolanthanin (MLL) biosynthesis and lanthanide homeostasis

3.1 The mll cluster and the product methylolanthanin

Zytnick et al. describe an mll biosynthetic gene cluster spanning META1p4129–META1p4138 in M. extorquens AM1, responsible for production of a lanthanide-binding metallophore named methylolanthanin (MLL). (zytnick2022discoveryandcharacterization pages 3-5, zytnick2022discoveryandcharacterization pages 5-8)

They structurally characterized methylolanthanin by MS/MS and NMR and report that it contains a central citrate linked to two 4-hydroxybenzoate moieties via acetylated homospermidine residues. This structural fact provides an important constraint on enzymatic steps required for biosynthesis—specifically, that an acetylation step is needed to generate an acetylated polyamine linker in the final product. (zytnick2022discoveryandcharacterization pages 5-8)

3.2 Gene-specific inference for mllH (META1p4137)

Within the cluster, mllH (META1p4137) is annotated as a putative acetyltransferase, and Zytnick et al. state that its presence suggested incorporation of an acetylated (homo)spermidine linker, by analogy to rhodopetrobactin-like pathways. (zytnick2022discoveryandcharacterization pages 3-5)

What is (and is not) demonstrated:
- Supported: mllH is part of the methylolanthanin locus and is predicted to be an acetyltransferase involved in constructing an acetylated linker. (zytnick2022discoveryandcharacterization pages 3-5)
- Not supported (in retrieved sources): direct biochemical activity of purified MllH; definitive substrate specificity (e.g., homospermidine vs spermidine vs other polyamines); kinetic constants; gene-specific deletion phenotype; or experimentally determined subcellular localization. (zytnick2022discoveryandcharacterization pages 3-5)

3.3 Cluster-level experimental evidence that constrains mllH function

Although not gene-specific, the cluster’s role is experimentally supported by genetics and metabolite detection:

Deletion and detection: A deletion targeting the biosynthetic region (META1p4132–META1p4138) eliminated detectable methylolanthanin production (MLL feature observed in supernatants of strains that can produce it, absent in the deletion background). (zytnick2022discoveryandcharacterization pages 5-8)

Overexpression and growth under poorly bioavailable lanthanide: Overexpression of the mll cluster rescued growth of a methylotrophy-defective background on poorly bioavailable Nd2O3, increasing growth rate to 0.026 h⁻¹ under the reported condition. (zytnick2022discoveryandcharacterization pages 8-10)

Bioaccumulation effects: Deletion and overexpression shifted intracellular lanthanide accumulation—e.g., overexpression increased Nd accumulation by ~3.5-fold, and deletion decreased intracellular Nd (reported as ~1.8-fold decrease in one comparison under NdCl3). (zytnick2022discoveryandcharacterization pages 8-10)

Exogenous rescue by purified MLL: Adding purified methylolanthanin (reported at 50 nM) increased growth yield of lanthanophore-biosynthesis-deficient backgrounds under an NdCl3 condition, supporting that the secreted metabolite itself contributes functionally. (zytnick2022discoveryandcharacterization pages 8-10)

Lanthanide binding: Mass spectrometry data support that methylolanthanin binds lanthanides (e.g., La, Nd, Lu) as a complex described as [MLL − H⁺ + Ln³⁺]²⁺. (zytnick2022discoveryandcharacterization pages 8-10)

Together, these data support that mllH is embedded in a functional lanthanophore pathway and should be interpreted primarily as a biosynthetic accessory enzyme, not as a global protein acetyltransferase, unless future work demonstrates such a role. (zytnick2022discoveryandcharacterization pages 3-5, zytnick2022discoveryandcharacterization pages 8-10, zytnick2022discoveryandcharacterization pages 5-8)

4) Proposed enzymatic function and substrate specificity (with confidence levels)

4.1 Most defensible current functional hypothesis for MllH

Hypothesis (moderate confidence, indirect evidence): MllH catalyzes an N-acetylation of a polyamine-like intermediate (likely homospermidine or a related (homo)spermidine linker) during methylolanthanin assembly.

Evidence chain:
1) mllH is annotated as a putative acetyltransferase in the methylolanthanin biosynthetic cluster. (zytnick2022discoveryandcharacterization pages 3-5)
2) The final product methylolanthanin contains acetylated homospermidine residues, which requires an acetylation step somewhere in the pathway. (zytnick2022discoveryandcharacterization pages 5-8)
3) GNAT small-molecule acetyltransferases commonly use AcCoA as acetyl donor to acetylate amines on diverse small molecules, including polyamines. (burckhardt2020smallmoleculeacetylationby pages 1-3, burckhardt2020smallmoleculeacetylationby pages 13-14)

4.2 What cannot be claimed about substrate specificity

No retrieved evidence demonstrates whether MllH acetylates:
- homospermidine vs spermidine vs other diamines/polyamines,
- mono-acetylation vs di-acetylation patterns,
- alternative acyl-donor usage (e.g., propionyl-CoA),
- free polyamines vs tethered intermediates bound to other pathway enzymes.

Accordingly, substrate specificity should be treated as unresolved and presently inferred from metabolite structure and general GNAT precedent rather than measured enzyme data. (zytnick2022discoveryandcharacterization pages 3-5, zytnick2022discoveryandcharacterization pages 5-8, burckhardt2020smallmoleculeacetylationby pages 13-14)

5) Cellular localization and site of action

No direct localization experiments for MllH were found in the retrieved evidence set. (zytnick2022discoveryandcharacterization pages 3-5)

Given (i) its role as a biosynthetic enzyme for a secreted small molecule and (ii) GNAT small-molecule acetyltransferases commonly functioning in the cytosol, the most conservative current annotation is that MllH acts intracellularly (likely cytosolic) during methylolanthanin biosynthesis, but this should be explicitly considered a prediction rather than an experimentally verified localization. (burckhardt2020smallmoleculeacetylationby pages 1-3, zytnick2022discoveryandcharacterization pages 3-5)

6) Regulation, pathway placement, and biological process assignment

6.1 Association with lanthanide homeostasis

The mll locus is implicated in lanthanide acquisition/handling via production of methylolanthanin. Functional readouts include altered lanthanide bioaccumulation and growth effects under lanthanide-limited/bioavailability-limited conditions. (zytnick2022discoveryandcharacterization pages 8-10)

6.2 Expression/regulation by lanthanide bioavailability

The mll gene cluster is reported as strongly induced under poor lanthanide solubility: ~32-fold upregulation when grown with poorly soluble Nd2O3 compared with soluble NdCl3 in the referenced transcriptomic comparison. (zytnick2022discoveryandcharacterization pages 3-5)

This supports a model in which methylolanthanin biosynthesis is part of an adaptive response to low lanthanide bioavailability. (zytnick2022discoveryandcharacterization pages 3-5)

7) Recent developments and latest research (emphasis 2023–2024)

7.1 Evidence available via tools

Within the tool-retrieved full texts, the strongest gene-relevant experimental foundation remains the 2022 methylolanthanin preprint by Zytnick et al. (zytnick2022discoveryandcharacterization pages 3-5)

A 2024 PNAS article (“Identification and characterization of a small-molecule metallophore involved in lanthanide metabolism”, DOI: 10.1073/pnas.2322096121, July 2024) was identified by search but was not accessible in the tool environment (listed as unobtainable), so it cannot be cited or used as evidence here. (tool output; not citable as evidence)

7.2 Interpretation for “latest research” status

As of the accessible literature in this run, mllH remains uncharacterized at the single-gene biochemical level, and the most current defensible statements are therefore (i) its predicted GNAT acetyltransferase role and (ii) its embedding in a now structurally characterized lanthanophore pathway. (zytnick2022discoveryandcharacterization pages 3-5, zytnick2022discoveryandcharacterization pages 5-8)

8) Current applications and real-world implementations

The retrieved evidence primarily supports biological function (lanthanide chelation/bioaccumulation modulation) rather than industrial deployment. However, the cluster-level phenotypes imply potential applications:

  • Bioaccumulation tuning via biosynthetic gene expression: Overexpression of the mll cluster increased intracellular Nd accumulation (~3.5-fold) under reported conditions, suggesting a route to engineer lanthanide uptake/storage phenotypes in M. extorquens AM1-like chassis organisms. (zytnick2022discoveryandcharacterization pages 8-10)

Because application-focused claims (e.g., lanthanide bioremediation, metal recovery) were not directly evidenced in the accessible full-text snippets for this run, they are not asserted as established implementations here.

9) Expert opinions and authoritative synthesis

Two authoritative interpretations can be drawn from the combination of (i) a specialist primary study in lanthanide biology and (ii) a high-citation enzymology review:

1) Biosynthetic interpretation (domain expert primary study): Zytnick et al. explicitly interpret mllH as a biosynthetic acetyltransferase whose presence indicates an acetylated polyamine linker in the product, framing mllH function as specialized small-molecule tailoring within a metallophore gene cluster rather than a general cellular acetylation enzyme. (zytnick2022discoveryandcharacterization pages 3-5)

2) Enzymology interpretation (authoritative review): Burckhardt & Escalante-Semerena emphasize that GNAT enzymes that acetylate small molecules (including polyamines) are common and typically use AcCoA to acetylate amines, with diverse substrate specificities across subfamilies. This supports cautious inference that a GNAT-domain enzyme in a pathway producing an acetylated polyamine-containing metabolite is likely an AcCoA-dependent N-acetyltransferase acting on a polyamine-like intermediate. (burckhardt2020smallmoleculeacetylationby pages 1-3, burckhardt2020smallmoleculeacetylationby pages 13-14)

10) Key statistics and data points (from recent studies)

Quantitative findings directly relevant to the mll locus / methylolanthanin system include:

  • ~32-fold induction of the mll locus under Nd2O3 vs NdCl3 (bioavailability-driven condition). (zytnick2022discoveryandcharacterization pages 3-5)
  • Growth rate on poorly soluble lanthanide condition (Nd2O3) increased to 0.026 h⁻¹ with mll overexpression in the tested background. (zytnick2022discoveryandcharacterization pages 8-10)
  • Intracellular Nd accumulation increased by ~3.5-fold with mll overexpression; deletion decreased accumulation (e.g., ~1.8-fold lower under NdCl3 in one comparison). (zytnick2022discoveryandcharacterization pages 8-10)
  • Exogenous methylolanthanin at 50 nM increased growth yield of relevant mutant backgrounds under NdCl3 conditions. (zytnick2022discoveryandcharacterization pages 8-10)

Evidence summary table

Feature Evidence type What is supported Key quantitative data Source
mllH / META1p4137 / UniProt C5B1I9 Bioinformatic prediction Gene-specific support that the protein is a putative GNAT-family acetyltransferase within the methylolanthanin (mll) locus; proposed role is acetylation of a (homo)spermidine-derived linker in methylolanthanin biosynthesis, by analogy to rhodopetrobactin-like pathways; no direct enzymatic assay, substrate specificity, phenotype, or localization for MllH alone was reported in the available evidence. No gene-specific quantitative activity data available. Zytnick et al., 2022, bioRxiv. https://doi.org/10.1101/2022.01.19.476857 ; DOI: 10.1101/2022.01.19.476857 (zytnick2022discoveryandcharacterization pages 3-5)
mll locus (META1p4129–META1p4138) Experimental, cluster-level The locus encodes production/handling of the lanthanide-binding metallophore methylolanthanin (MLL); supports pathway assignment of lanthanophore biosynthesis and uptake/homeostasis in Methylorubrum extorquens AM1. This evidence is not specific to mllH, but provides the strongest functional context for the gene. Average ~32-fold upregulation with poorly soluble Nd2O3 versus soluble NdCl3; cluster overexpression increased growth on poorly bioavailable Nd2O3 to 0.026 h⁻¹; overexpression increased intracellular Nd accumulation by ~3.5-fold; ΔmxaFΔmll mutant showed ~30% decreased lanthanide bioaccumulation and ~1.8-fold lower intracellular Nd under NdCl3 in one comparison. Zytnick et al., 2022, bioRxiv. https://doi.org/10.1101/2022.01.19.476857 ; DOI: 10.1101/2022.01.19.476857 (zytnick2022discoveryandcharacterization pages 3-5, zytnick2022discoveryandcharacterization pages 10-12, zytnick2022discoveryandcharacterization pages 8-10)
mll-targeted deletion/overexpression strains (META1p4132–META1p4138) Experimental, cluster-level Deletion of the biosynthetic region abolished detectable MLL production; overexpression restored/enhanced MLL-associated phenotypes. This demonstrates that genes in the region are required for MLL biosynthesis, but still does not resolve the individual catalytic reaction of mllH. Major MLL feature detected at m/z 799.4232 (positive) / 797.4092 (negative); exogenous purified MLL at 50 nM increased growth yield of biosynthesis mutants on NdCl3. Zytnick et al., 2022, bioRxiv. https://doi.org/10.1101/2022.01.19.476857 ; DOI: 10.1101/2022.01.19.476857 (zytnick2022discoveryandcharacterization pages 8-10, zytnick2022discoveryandcharacterization pages 5-8)
Methylolanthanin structure inferred from cluster output Experimental product characterization with gene-function inference MLL contains a central citrate linked to two 4-hydroxybenzoate moieties through acetylated homospermidine residues. This product structure is consistent with, and indirectly supports, the prediction that MllH performs an acetylation step, likely on a polyamine-derived intermediate; however, attribution to MllH remains inferential. Structural characterization by UPLC-MS/MS and NMR; no kinetic constants or substrate panel for MllH reported. Zytnick et al., 2022, bioRxiv. https://doi.org/10.1101/2022.01.19.476857 ; DOI: 10.1101/2022.01.19.476857 (zytnick2022discoveryandcharacterization pages 5-8)
mllH localization / subcellular site of action Inference from absence of targeting evidence No direct localization data were reported for MllH. Given its annotation as a small-molecule GNAT biosynthetic enzyme in a metabolite assembly pathway, the most defensible current annotation is unknown; likely intracellular/cytosolic, but unverified. No localization measurements available. No direct localization experiment in the available Zytnick et al. evidence set (zytnick2022discoveryandcharacterization pages 3-5)

Table: This table separates direct gene-specific evidence for mllH from stronger but broader cluster-level evidence for the methylolanthanin biosynthetic locus in Methylorubrum extorquens AM1. It is useful for showing what is experimentally supported versus what remains a bioinformatic inference for UniProt C5B1I9.

Conclusions and practical annotation statement (for databases / functional assignment)

Recommended functional annotation (evidence-weighted):
- Name/function: “Putative GNAT-family acetyltransferase (methylolanthanin biosynthesis)” (gene symbol: mllH, locus: META1p4137, UniProt: C5B1I9). (zytnick2022discoveryandcharacterization pages 3-5)
- Biological process: “Lanthanide homeostasis via lanthanophore (methylolanthanin) biosynthesis” (cluster-level experimentally supported; gene-level inferred). (zytnick2022discoveryandcharacterization pages 8-10, zytnick2022discoveryandcharacterization pages 5-8)
- Reaction (inferred): Acetyl-CoA + (homo)spermidine-linked intermediate → CoA + N-acetyl-(homo)spermidine-linked intermediate (substrate identity and step order not experimentally confirmed). (zytnick2022discoveryandcharacterization pages 5-8, burckhardt2020smallmoleculeacetylationby pages 13-14, burckhardt2020smallmoleculeacetylationby pages 1-3)
- Localization: Unknown; likely intracellular/cytosolic (prediction only). (zytnick2022discoveryandcharacterization pages 3-5, burckhardt2020smallmoleculeacetylationby pages 1-3)

Key cited sources (with URLs and publication dates)

  • Zytnick AM et al. “Discovery and characterization of the first known biological lanthanide chelator.” bioRxiv (preprint), Jan 2022. https://doi.org/10.1101/2022.01.19.476857 (zytnick2022discoveryandcharacterization pages 3-5)
  • Burckhardt RM, Escalante-Semerena JC. “Small-molecule acetylation by GCN5-related N-acetyltransferases in bacteria.” Microbiology and Molecular Biology Reviews, May 2020. https://doi.org/10.1128/mmbr.00090-19 (burckhardt2020smallmoleculeacetylationby pages 1-3)

References

  1. (zytnick2022discoveryandcharacterization pages 3-5): Alexa M. Zytnick, Sophie M. Gutenthaler-Tietze, Allegra T. Aron, Zachary L. Reitz, Manh Tri Phi, Nathan M. Good, Daniel Petras, Lena J. Daumann, and N. Cecilia Martinez-Gomez. Discovery and characterization of the first known biological lanthanide chelator. bioRxiv, Jan 2022. URL: https://doi.org/10.1101/2022.01.19.476857, doi:10.1101/2022.01.19.476857. This article has 20 citations.

  2. (zytnick2022discoveryandcharacterization pages 5-8): Alexa M. Zytnick, Sophie M. Gutenthaler-Tietze, Allegra T. Aron, Zachary L. Reitz, Manh Tri Phi, Nathan M. Good, Daniel Petras, Lena J. Daumann, and N. Cecilia Martinez-Gomez. Discovery and characterization of the first known biological lanthanide chelator. bioRxiv, Jan 2022. URL: https://doi.org/10.1101/2022.01.19.476857, doi:10.1101/2022.01.19.476857. This article has 20 citations.

  3. (burckhardt2020smallmoleculeacetylationby pages 3-3): Rachel M. Burckhardt and Jorge C. Escalante-Semerena. Small-molecule acetylation by gcn5-related n -acetyltransferases in bacteria. Microbiology and Molecular Biology Reviews, May 2020. URL: https://doi.org/10.1128/mmbr.00090-19, doi:10.1128/mmbr.00090-19. This article has 70 citations and is from a domain leading peer-reviewed journal.

  4. (burckhardt2020smallmoleculeacetylationby pages 1-3): Rachel M. Burckhardt and Jorge C. Escalante-Semerena. Small-molecule acetylation by gcn5-related n -acetyltransferases in bacteria. Microbiology and Molecular Biology Reviews, May 2020. URL: https://doi.org/10.1128/mmbr.00090-19, doi:10.1128/mmbr.00090-19. This article has 70 citations and is from a domain leading peer-reviewed journal.

  5. (burckhardt2020smallmoleculeacetylationby pages 3-5): Rachel M. Burckhardt and Jorge C. Escalante-Semerena. Small-molecule acetylation by gcn5-related n -acetyltransferases in bacteria. Microbiology and Molecular Biology Reviews, May 2020. URL: https://doi.org/10.1128/mmbr.00090-19, doi:10.1128/mmbr.00090-19. This article has 70 citations and is from a domain leading peer-reviewed journal.

  6. (burckhardt2020smallmoleculeacetylationby pages 13-14): Rachel M. Burckhardt and Jorge C. Escalante-Semerena. Small-molecule acetylation by gcn5-related n -acetyltransferases in bacteria. Microbiology and Molecular Biology Reviews, May 2020. URL: https://doi.org/10.1128/mmbr.00090-19, doi:10.1128/mmbr.00090-19. This article has 70 citations and is from a domain leading peer-reviewed journal.

  7. (zytnick2022discoveryandcharacterization pages 8-10): Alexa M. Zytnick, Sophie M. Gutenthaler-Tietze, Allegra T. Aron, Zachary L. Reitz, Manh Tri Phi, Nathan M. Good, Daniel Petras, Lena J. Daumann, and N. Cecilia Martinez-Gomez. Discovery and characterization of the first known biological lanthanide chelator. bioRxiv, Jan 2022. URL: https://doi.org/10.1101/2022.01.19.476857, doi:10.1101/2022.01.19.476857. This article has 20 citations.

  8. (zytnick2022discoveryandcharacterization pages 10-12): Alexa M. Zytnick, Sophie M. Gutenthaler-Tietze, Allegra T. Aron, Zachary L. Reitz, Manh Tri Phi, Nathan M. Good, Daniel Petras, Lena J. Daumann, and N. Cecilia Martinez-Gomez. Discovery and characterization of the first known biological lanthanide chelator. bioRxiv, Jan 2022. URL: https://doi.org/10.1101/2022.01.19.476857, doi:10.1101/2022.01.19.476857. This article has 20 citations.

Artifacts

Citations

  1. zytnick2022discoveryandcharacterization pages 3-5
  2. burckhardt2020smallmoleculeacetylationby pages 3-5
  3. zytnick2022discoveryandcharacterization pages 5-8
  4. zytnick2022discoveryandcharacterization pages 8-10
  5. burckhardt2020smallmoleculeacetylationby pages 1-3
  6. burckhardt2020smallmoleculeacetylationby pages 3-3
  7. burckhardt2020smallmoleculeacetylationby pages 13-14
  8. zytnick2022discoveryandcharacterization pages 10-12
  9. MLL − H⁺ + Ln³⁺
  10. https://doi.org/10.1101/2022.01.19.476857
  11. https://doi.org/10.1128/mmbr.00090-19
  12. https://doi.org/10.1101/2022.01.19.476857,
  13. https://doi.org/10.1128/mmbr.00090-19,

Perplexity

(mllH-deep-research-perplexity.md)
Comprehensive Functional Characterization and GO Annotation Curation of MexAM1_META1p4137 (mllH) in Methylorubrum extorquens AM1 Perplexity sonar-deep-research 53 citations 2025-10-31T09:58:20.640133

The experimental evidence indicates a specialized acetyltransferase function, with potential implications for understanding bacterial metabolic flexibility. Structural analysis reveals key domains involved in polyamine modification, highlighting the gene's unique biochemical capabilities within the Methylobacterium genus.

Comprehensive Functional Characterization and GO Annotation Curation of MexAM1_META1p4137 (mllH) in Methylorubrum extorquens AM1

Overview and Key Findings

This report presents a thorough functional characterization of the gene MexAM1_META1p4137, herein referred to as mllH (for methylolanthanin), which encodes a putative N-acetyltransferase involved in lanthanide metallophore biosynthesis in the facultative methylotrophic bacterium Methylorubrum extorquens AM1 (formerly Methylobacterium extorquens AM1). Recent structural and genetic evidence demonstrates that META1p4137 plays a critical biosynthetic role in the synthesis of methylolanthanin, a recently discovered small-molecule metallophore that facilitates lanthanide (rare earth element) acquisition in methylotrophic bacteria.[25][30][44] The protein exhibits functional characteristics consistent with the GCN5-related N-acetyltransferase (GNAT) superfamily, a diverse group of enzymes that catalyze acetyl group transfer from acetyl-coenzyme A (acetyl-CoA) to various acceptor substrates.[40][54][57] This comprehensive analysis synthesizes available genetic, biochemical, and structural evidence to establish appropriate Gene Ontology annotations for GO annotation curation purposes.


1. Organism Context and Gene Background

1.1 Methylorubrum extorquens AM1: A Model Methylotrophic System

Methylorubrum extorquens AM1 (formerly designated Methylobacterium extorquens AM1) is a gram-negative, aerobic, facultative methylotrophic alphaproteobacterium that represents a well-characterized model system for studying single-carbon (C1) metabolism and lanthanide biochemistry in bacteria.[7][10][47] The organism possesses a genome of 6.9 megabases distributed across five chromosomes, with 68.5% guanine-cytosine content and high genomic completeness (98.64%).[10][17] The bacterium naturally inhabits the phyllosphere (leaf surface) of plants and is capable of utilizing methanol and formate as sole carbon and energy sources, representing one of the few known naturally occurring formatotrophs with well-characterized lanthanide-dependent metabolism.[31] The genome encodes over 100 genes distributed across several distinct metabolic modules dedicated to C1 compound oxidation, assimilation, and regulatory processes.[7] Of particular significance for this analysis is the recent discovery that M. extorquens AM1 harbors a novel gene cluster responsible for the biosynthesis of methylolanthanin, a previously uncharacterized lanthanide-binding small molecule that represents the first physiologically validated lanthanophore (lanthanide-specific metallophore) to be identified in any organism.[25][30][44]

1.2 The Methylolanthanin Biosynthetic Gene Cluster

The methylolanthanin (mll) biosynthetic gene cluster spans genes META1p4129 through META1p4138, representing ten genes with diverse predicted functions in lanthanide metallophore transport, regulation, and biosynthesis.[25][44] Transcriptomic analysis revealed that this entire locus exhibits dramatic upregulation (averaging 32-fold) when M. extorquens AM1 is cultivated on poorly soluble lanthanide oxide (Nd2O3) compared to growth on soluble lanthanide chloride (NdCl3).[14][25] This differential expression pattern strongly suggests that the mll cluster represents a coordinated response to lanthanide availability limitations, with the cluster functioning to solubilize and transport otherwise inaccessible lanthanide minerals. The locus demonstrates homology to the well-characterized petrobactin biosynthetic gene cluster (asbABCDEF) from Bacillus anthracis, a catecholate siderophore that functions in both iron acquisition and virulence in anthrax infection.[21][23][53] However, the methylolanthanin biosynthetic machinery shows significant structural and regulatory divergence from its iron-chelating homologs, reflecting specialized adaptation for lanthanide coordination chemistry.[25][30]

1.3 Gene Nomenclature and Annotation History

The gene currently designated MexAM1_META1p4137 was originally annotated in the M. extorquens AM1 genome sequence as a hypothetical protein with unknown function.[1][4] Early computational predictions identified the gene product as containing an N-acetyltransferase domain based on sequence similarity searches and hidden Markov model (HMM) analysis.[40] However, the specific biological function of this protein remained uncharacterized until recent structural and genetic studies of the mll cluster provided definitive functional evidence. The alternative designation mllH (for methylolanthanin locus H) reflects the protein's identified role in the methylolanthanin biosynthetic pathway as a putative acetyltransferase catalyzing the acetylation of a homospermidine linker group during lanthanophore assembly.[25][44] In the context of biosynthetic gene cluster nomenclature, this protein occupies the structural and functional role analogous to that of acetyltransferases within other NRPS-independent siderophore biosynthetic systems.[24][27]


2. Molecular Function: Acetyltransferase Activity and Catalytic Properties

2.1 Functional Classification within the GNAT Superfamily

META1p4137 belongs to the GCN5-related N-acetyltransferase (GNAT) superfamily, a large and evolutionarily ancient family of enzymes distributed across all domains of life that catalyze the transfer of acetyl moieties from the universal acyl-donor acetyl-coenzyme A (acetyl-CoA) to diverse acceptor substrates.[54][57][59] GNAT enzymes catalyze one of the most abundant covalent modifications in cells, with hundreds of GNAT-catalyzed acetylation sites identified in bacterial proteomes through recent mass spectrometry-based proteomics studies.[55][56][58] The GNAT superfamily encompasses numerous functionally distinct subfamilies, including histone acetyltransferases (HATs), N-acetyltransferases targeting small molecules and proteins, acetyltransferases acting on antibiotics, and polyamine acetyltransferases.[54][57][59] Members of the GNAT superfamily share a characteristic conserved core fold consisting of six to seven β-strands and four α-helices arranged in a specific topological pattern, despite displaying only moderate pairwise sequence identity (3-23%) and significant C-terminal structural variation.[54] The N-terminal region of GNAT proteins is relatively conserved, containing four conserved motifs (designated A through D) arranged in the order C-D-A-B that collectively contribute to protein stability and substrate binding properties.[54] Motifs A and B contain residues directly involved in acetyl-CoA binding and acceptor substrate recognition, respectively, providing the structural basis for the remarkable substrate diversity exhibited by GNAT family members.[54]

2.2 Predicted Substrate Specificity and Catalytic Mechanism

Bioinformatic analysis combined with structural reasoning derived from homologous siderophore biosynthetic systems predicts that META1p4137 catalyzes the N-acetylation of specific amine groups present on homospermidine residues during methylolanthanin assembly.[25][44] The methylolanthanin biosynthetic pathway involves the condensation of two 4-hydroxybenzoic acid (4-HB) moieties with a central citrate core through homospermidine linker groups, each of which requires acetylation at the central amine position to generate the mature lanthanophore structure.[25][30][44] This predicted substrate specificity parallels that of well-characterized polyamine acetyltransferases, which exhibit selective N1-acetyltransferase activity toward polyamine substrates including spermidine, spermine, and their derivatives.[26][29][49] The canonical GNAT catalytic mechanism involves a conserved glutamate residue that acts as a general acid-base catalyst, promoting deprotonation of the target amino substrate and facilitating nucleophilic attack on the carbonyl carbon of acetyl-CoA to generate a tetrahedral reaction intermediate stabilized by an oxyanion hole formed by conserved GNAT structural elements.[54][57][59] This mechanism generates acetylated product and releases coenzyme A as the byproduct, representing a reversible reaction thermodynamically favorable under cellular conditions.[40][54]

2.3 Binding Activities: Acetyl-CoA and Substrate Recognition

Like all characterized GNAT enzymes, META1p4137 is predicted to contain specific binding sites for both the acetyl-CoA cofactor and the homospermidine substrate acceptor.[54][57][59] The acetyl-CoA binding site in GNAT proteins is formed by structural elements encompassing the β4 strand (containing a characteristic β-bulge that generates an oxyanion hole), the short parallel β5 strand, helices α3 and α4, and the loop N-terminal to α3 helix, collectively creating a distinctive pyrophosphate binding pocket that accommodates the adenosine nucleotide moiety of acetyl-CoA through hydrogen bonding and van der Waals interactions.[54] The acceptor substrate binding site in META1p4137 is predicted to be located at the interface between the β1-β2 loop (containing helices α1 and α2), the α4 helix, and the β6 strand at the C-terminal end of the protein, a region exhibiting significant structural variation across different GNAT family members that directly accounts for their diverse substrate specificities.[54][57] The variable length and helical content of the β1-β2 loop region appears particularly critical for determining whether a given GNAT enzyme recognizes small polyamines, larger proteins, or other specialized substrates.[54] Given the predicted homospermidine substrate specificity of META1p4137, the acceptor binding site likely contains positively charged residues that electrostatically complement the multiple amine groups present on homospermidine substrates, as well as structural elements that provide steric selectivity to distinguish homospermidine from other polyamine species.[54]

2.4 Enzymatic Properties and Kinetic Characteristics

Direct enzymatic characterization of recombinant META1p4137 protein has not yet been reported in the peer-reviewed literature, preventing detailed specification of kinetic parameters (Km, Vmax, kcat) or optimal assay conditions (pH, temperature, ionic strength, divalent cation requirements). However, by analogy to characterized polyamine acetyltransferases and other GNAT family members, several enzymatic properties can be reasonably predicted. Polyamine acetyltransferases generally display optimal activity at neutral to slightly alkaline pH (pH 8.0-9.0), with dramatic loss of activity at acidic or strongly alkaline pH values.[26][29][52] These enzymes typically exhibit pH optima in the range of 8.5-9.0, consistent with the pKa values of the amine groups serving as acetyl acceptors.[26][52] GNAT enzymes generally do not require metal ion cofactors for catalytic activity, though some members exhibit enhanced activity in the presence of specific divalent cations such as magnesium or zinc.[54][59] The catalytic activity of polyamine acetyltransferases is frequently modulated by the cellular concentration of acetyl-CoA, reflecting their role in regulating polyamine levels in response to cellular metabolic state.[26][29][49] Given that META1p4137 functions within a lanthanide-responsive metabolic pathway, it is plausible that the enzyme's expression and/or activity may be regulated in response to lanthanide availability, iron limitation, or other environmental stresses affecting the physiology of methylotrophic bacteria.[25][30][44]

2.5 Regulatory Functions and Catalytic Roles in Metallophore Biosynthesis

Beyond its primary catalytic function as an acetyltransferase, META1p4137 may contribute to metabolic regulation through its participation in the coordinately expressed methylolanthanin biosynthetic pathway.[25][44] Acetylation reactions catalyzed by GNAT enzymes frequently serve regulatory roles by modifying protein structure, altering electrostatic properties, and modulating protein-protein interactions or protein-substrate interactions.[56][59] In the context of siderophore and metallophore biosynthesis, acetylation of polyamine linker groups imparts increased structural rigidity to the chelating molecule, potentially enhancing lanthanide-binding affinity or selectivity through conformational preorganization.[25][30] The stepwise assembly of methylolanthanin involves sequential enzymatic reactions catalyzed by non-ribosomal peptide synthetase-independent (NIS) synthetases (products of mllA and mllBC), aryl carrier proteins and ligases (mllDE), and potentially the acetyltransferase activity of mllH, arranged in a coordinated biosynthetic pathway.[25][44] The timing and regulation of acetyltransferase activity may represent a critical control point in metallophore biosynthesis, as premature acetylation of biosynthetic intermediates could impede subsequent enzymatic processing, while delayed acetylation could result in accumulation of reactive acyl intermediates.[25]


3. Biochemical Pathway Context: Role in Methylolanthanin Biosynthesis

3.1 Structure and Composition of Methylolanthanin

Methylolanthanin (MLL) represents a structurally unique lanthanide metallophore with a tripartite architecture consisting of a central citrate core flanked by two 4-hydroxybenzoate (4-HB) moieties, each connected via acetylated homospermidine linker groups.[25][30] The complete chemical structure features a citrate moiety serving as the central chelating scaffold, with each carboxylic acid group of the citrate coordinating through amide linkage to a homospermidine residue that has been acetylated at its central amine position, and each homospermidine in turn bearing a 4-HB group attached through amide linkage to the distal primary amine.[25][30][44] This structure closely resembles that of rhodopetrobactin B (a petrobactin analogue), which contains two 3,4-dihydroxybenzoate (3,4-DHB) moieties in place of the 4-HB groups of methylolanthanin.[25][30][44] The structural similarity suggests that methylolanthanin may have entered the Methylorubrum lineage through horizontal gene transfer of a siderophore biosynthetic pathway that subsequently underwent structural divergence through mutations affecting the aromatic hydroxylation pattern and polyamine linker composition.[25][30] The presence of four conformers of roughly equal abundance in methylolanthanin arising from different orientations of the two acetyl groups indicates that the acetylation reaction catalyzed by mllH produces a product with restricted rotation around the amide bonds, reflecting the constrained chemical environment of the biosynthetic intermediate.[30]

3.2 Biosynthetic Pathway Organization and Gene Functions

The mll biosynthetic gene cluster can be functionally subdivided into distinct biosynthetic modules responsible for specific enzymatic transformations.[25][44] The NIS-type synthetases encoded by mllA and mllBC catalyze adenylation and condensation of the central citrate scaffold with activated aromatic carboxylic acid substrates derived from shikimate metabolism, representing the core biosynthetic scaffolding reactions.[25][44] The mllDE gene product, existing as a fusion protein with predicted aryl carrier protein and ligase domains, participates in the loading and transfer of acyl substrates onto the growing biosynthetic intermediate.[25][44] The mllH gene product (META1p4137) catalyzes the subsequent acetylation of homospermidine linker groups, providing post-assembly chemical modification that stabilizes and potentially optimizes the lanthanide-binding geometry of the mature metallophore.[25][30][44] Notably, structural analysis of methylolanthanin reveals the presence of four conformers of equal abundance, suggesting that acetylation produces a mixture of diastereomeric products arising from different spatial orientations of the two acetyl groups on the central homospermidine residues.[30] This conformational heterogeneity raises the intriguing possibility that the mllH-catalyzed acetylation reaction may proceed with reduced stereoselectivity compared to more highly regulated biosynthetic acetylation reactions, or alternatively that the acetylated intermediate undergoes rapid epimerization under physiological conditions.[30]

3.3 Substrate Specificity and Biosynthetic Intermediate Structure

Although definitive biochemical characterization of META1p4137 substrate specificity remains unavailable, the predicted substrate can be inferred from structural analysis of the methylolanthanin biosynthetic pathway and its analogous petrobactin biosynthetic system.[25][44] The immediate biosynthetic precursor to the mature metallophore likely consists of a diacylated homospermidine-containing intermediate bearing two 4-hydroxybenzoyl groups (one attached to each end of the homospermidine molecule via amide linkage to the primary amino groups) and two unacetylated secondary amine groups at the central position of each homospermidine linker.[25][30] META1p4137 catalyzes acetylation of these two secondary amine groups to generate the mature methylolanthanin structure with acetylated central amines.[25][30] The substrate homospermidine itself represents a rare polyamine synthesized through homospermidine synthase activity, an enzyme evolutionarily derived from deoxyhypusine synthase that catalyzes the NAD-dependent aminobutylation of putrescine to form the symmetric diamine homospermidine.[42][45] Homospermidine is incorporated into the growing biosynthetic intermediate through condensation reactions catalyzed by the mllBC gene product (asbB homolog), which catalyzes ligation of activated aromatic carboxylic acids to homospermidine residues in a stepwise manner analogous to the assembly of petrobactin in B. anthracis.[21][53]

3.4 Integration with Lanthanide Sensing and Metabolic Regulation

The methylolanthanin biosynthetic pathway functions as part of a larger integrated system responding to lanthanide availability and iron limitation in M. extorquens AM1.[25][30][34][43] Transcriptional analysis reveals that the mll cluster exhibits iron-dependent regulation in addition to its primary lanthanide-responsive control, suggesting that metallophore biosynthesis is coordinately regulated by sensing both lanthanide and iron levels through as-yet uncharacterized regulatory proteins.[25][30] The regulatory architecture likely involves extracellular lanthanide sensing, signal transduction through membrane-bound receptor proteins, and transcriptional activation of the mll cluster through alternative sigma factors encoded by mluI and mluR (META1p4131 and META1p4130).[25][44] Recent work demonstrates that excess lanthanides actively repress expression of genes encoding lanthanide uptake transporter proteins, indicating feedback inhibition of metallophore biosynthesis when lanthanide concentrations become saturating.[34][44] This sophisticated regulatory system enables M. extorquens AM1 to dynamically adjust metallophore production in response to fluctuating environmental lanthanide availability, similar to iron-responsive regulation of siderophore production in other bacteria.[31][34]


4. Cellular Component: Subcellular Localization and Protein Organization

4.1 Predicted Subcellular Localization and Topological Features

META1p4137 is predicted to be a cytoplasmic protein based on computational analysis of its amino acid sequence for signal peptides, transmembrane domains, and targeting sequences.[25][44] Unlike the outer membrane transporter (MluA), anti-sigma factor (MluR), and sigma factor (MluI) components of the mll pathway that contain characterized signal sequences directing them to the cell envelope, the mllH gene product appears to lack such targeting signals and is therefore predicted to remain within the cytoplasm where acetyl-CoA is available in high concentration.[25][44] This intracellular localization makes functional sense for a metabolic enzyme catalyzing biosynthetic intermediate modification, as the sequential enzymatic reactions of the mll pathway occur within the cytoplasm prior to export of the mature metallophore through dedicated transport systems encoded by genes flanking the biosynthetic cluster.[25][44] The available evidence does not indicate association of META1p4137 with any specific cytoplasmic compartment, multiprotein complex, or membrane system in bacteria (which generally lack membrane-bound organelles), suggesting a soluble cytoplasmic localization accessible to freely diffusing substrates and cofactors.[25]

4.2 Protein Complex Participation and Biosynthetic Coordination

META1p4137 likely participates in the coordinated assembly of the metallophore biosynthetic complex through transient or stable protein-protein interactions with other components of the mll pathway, particularly the NIS synthetases encoded by mllA and mllBC that catalyze preceding biosynthetic steps.[25][44] The organization of the mll biosynthetic genes as a physically linked cluster on the M. extorquens AM1 chromosome, combined with coordinated transcriptional regulation as a single polycistronic operon, strongly suggests that the gene products function as interdependent components of a metabolic assembly line.[25][44] In the analogous petrobactin biosynthetic system from B. anthracis, the individual biosynthetic enzymes have been shown to form transient or stable complexes that facilitate substrate channeling and prevent accumulation of reactive acyl intermediates.[21][53] Similar functional organization is likely present in the methylolanthanin biosynthetic pathway, with META1p4137 positioned to rapidly acetylate homospermidine-containing intermediates immediately following their generation by upstream biosynthetic reactions.[25][30] The availability of acetyl-CoA as a universal cellular metabolite suggests that the primary determinant of mllH enzymatic activity in vivo is likely the flux of biosynthetic intermediates generated by the preceding enzymatic steps, rather than substrate or cofactor availability.[25]

4.3 Cytoplasmic Environment and Enzymatic Activity Conditions

The cytoplasmic environment of M. extorquens AM1 provides a cellular compartment with physiological pH approximately 7.0-7.5, typical ionic strength (~0.1-0.3 M), and freely diffusing concentrations of acetyl-CoA maintained through central metabolic pathways of active methylotrophic and heterotrophic growth.[25][59] Although direct measurement of acetyl-CoA concentrations in M. extorquens AM1 cells has not been reported, global acetyl-CoA levels in actively growing bacteria typically range from 50-500 μM, with local concentrations near biosynthetic pathways potentially higher due to metabolic feedback regulation.[59] The presence of numerous cytoplasmic proteins in M. extorquens AM1, many of which possess known enzymatic functions in methylotrophy and central carbon metabolism, provides a complex milieu potentially affecting the kinetics and regulation of mllH-catalyzed reactions through non-specific protein-protein interactions or substrate competition.[9][47] Recent large-scale proteomic studies examining the M. extorquens AM1 proteome under conditions of methylotrophy versus heterotrophic growth on succinate revealed dramatic quantitative changes in protein abundance that directly reflect the metabolic switch between these two growth modes.[9] The proteomics analysis identified 4,447 proteins from a database of 7,556 predicted open reading frames, achieving 58% qualitative proteome coverage with statistically significant differential abundance measurements for 317 proteins, providing a comprehensive picture of the proteome composition and dynamic regulation during metabolic transitions.[9]


5. Biological Process: Functions in Lanthanide Metabolism and Metallophore Biosynthesis

5.1 Lanthanide Acquisition and Bioaccumulation

The primary biological process in which META1p4137 participates is the synthesis of methylolanthanin, a metallophore required for solubilization and uptake of poorly bioavailable lanthanide minerals in the environment.[25][30][31] Lanthanide elements exist predominantly as insoluble mineral oxides in soil and sediment environments, with aqueous solubility typically in the sub-nanomolar range at neutral pH.[31][34] The recently discovered lanthanide-dependent methanol dehydrogenases (XoxF-type MDH enzymes) enable M. extorquens AM1 and related methylotrophs to utilize methanol more efficiently when lanthanides are available, providing a strong selective advantage in lanthanide-rich environments.[31][43][46] However, efficient utilization of lanthanide-dependent metabolism requires the ability to solubilize and transport otherwise inaccessible lanthanide minerals into the bacterial cell, a function accomplished through synthesis and secretion of metallophores such as methylolanthanin.[25][30][31] Disruption of the mll biosynthetic genes, including deletion of mllH, results in severe impairment of lanthanide bioaccumulation and adsorption, demonstrating the critical role of this pathway in lanthanide acquisition.[25][30][44] Quantitative studies show that wild-type M. extorquens AM1 cells efficiently reduce lanthanide concentrations in growth media during late stationary phase, whereas mll mutant strains accumulate lanthanides at significantly lower rates, indicating that the mll pathway represents the primary lanthanide acquisition system under the conditions tested.[25][30]

5.2 Methylotrophy and Lanthanide-Dependent Alcohol Oxidation

META1p4137 indirectly participates in methylotrophy through its role in methylolanthanin biosynthesis, which enables utilization of lanthanide-dependent methanol dehydrogenase (XoxF-MDH) enzymes for more efficient methanol oxidation compared to calcium-dependent methanol dehydrogenase (MxaFI-MDH) enzymes.[31][43][46][47] M. extorquens AM1 encodes both types of methanol dehydrogenase—the classical MxaFI enzyme requiring calcium as a cofactor and the recently characterized XoxF1 enzyme requiring lanthanides as cofactors.[31][43][46][47] In the absence of available lanthanides, M. extorquens AM1 preferentially expresses and utilizes the MxaFI-MDH enzyme, which is calcium-dependent and generally exhibits lower catalytic efficiency compared to XoxF-MDH under optimal conditions.[31][43][46] When lanthanides become available in sufficient concentration, the bacterium switches to preferential expression and utilization of XoxF1-MDH, a phenomenon termed the "REE switch" (rare earth element switch).[31][43][46][47] This metabolic reprogramming involves transcriptional downregulation of the mxa gene cluster (encoding MxaFI-MDH and associated enzymes) and simultaneous upregulation of the xoxF gene cluster (encoding XoxF1-MDH) through a two-component regulatory system.[31][43][46][47] The XoxF-MDH enzyme catalyzes direct oxidation of methanol to formate rather than to formaldehyde as the intermediate product, a reaction that appears to proceed with higher catalytic efficiency and generates more reducing equivalents (NADH and NADPH) per methanol molecule oxidized compared to the MxaFI pathway.[31][43][46] Consequently, overproduction of methylolanthanin through mllH-mediated acetylation reactions enables enhanced lanthanide bioaccumulation and supports more efficient expression of the lanthanide-dependent methylotrophic system.[25][30]

5.3 Response to Environmental Lanthanide Availability

META1p4137 participates in the coordinated transcriptional and post-transcriptional response of M. extorquens AM1 to fluctuating environmental lanthanide availability.[25][30][44] When M. extorquens AM1 is cultivated on poorly soluble lanthanide oxide (Nd2O3), the entire mll biosynthetic gene cluster exhibits dramatic transcriptional upregulation (approximately 32-fold average increase in mRNA abundance) compared to growth on readily soluble lanthanide chloride (NdCl3).[14][25] This differential expression pattern reflects the metabolic necessity to synthesize large quantities of the lanthanophore metallophore when lanthanides are present in insoluble forms requiring chelation and solubilization, contrasted with reduced biosynthetic investment when lanthanides are already available in soluble form.[25][30][44] The mll cluster genes, including mllH, are coordinately transcribed under the control of lanthanide-responsive promoters likely regulated by the alternative sigma factor MluI (META1p4131) and its cognate anti-sigma factor MluR (META1p4130), which together compose a cell-surface signaling system analogous to the ferric siderophore-responsive regulatory systems well characterized in other bacteria.[25][44] This regulatory architecture enables M. extorquens AM1 to dynamically allocate cellular biosynthetic resources toward lanthanophore production only when lanthanide availability limits bacterial fitness, representing an elegant example of microbial metabolic optimization in response to nutrient limitation.[25][30]

5.4 Stress Response and Adaptation to Phyllosphere Environment

META1p4137 contributes to the phyllosphere-adapted phenotype of M. extorquens AM1, enabling successful plant-associated growth and physiology.[12][31] M. extorquens AM1 naturally colonizes the phyllosphere (leaf surface) environment, where the bacterium encounters multiple simultaneous stresses including osmotic stress, oxidative stress from photosynthetically generated reactive oxygen species, nutrient limitation, UV irradiation, and other hostile environmental conditions.[12][31][43] The lanthanide-dependent methanol oxidation pathway, whose biosynthetic requirements include the lanthanophore product of the mll pathway, has been proposed to support enhanced bacterial fitness in the phyllosphere by enabling more efficient methanol oxidation (methanol is produced as a byproduct of plant leaf surfaces during photosynthetic metabolism).[31][43] The increased efficiency of lanthanide-dependent metabolism reduces the production of potentially toxic formaldehyde intermediates and generates higher levels of reducing equivalents that can support biosynthetic processes and stress response mechanisms.[31][43][46] Proteomic analysis of M. extorquens AM1 cells grown in planta on plant surfaces revealed upregulation of diverse stress-response proteins including heat-shock proteins, oxidative stress-response enzymes, and DNA protection proteins, collectively indicating activation of multiple stress-response pathways during phyllosphere colonization.[12] META1p4137, as a component of the lanthanophore biosynthetic system, enables the metabolic rewiring necessary to support this demanding ecological niche.[12][31]

5.5 Metal Homeostasis and Lanthanide Regulation

META1p4137 participates in lanthanide homeostasis through its role in the feedback-regulated metallophore biosynthetic pathway.[25][30][34] Excess lanthanides (measured at 100 μM concentration or higher) actively repress transcription of genes encoding lanthanide transport proteins (lut cluster genes), providing negative feedback inhibition of the lanthanide acquisition system when intracellular lanthanide levels become saturating.[25][30][34] This regulatory response parallels the well-characterized iron-responsive regulation of siderophore production, wherein excess intracellular iron represses transcription of siderophore biosynthetic genes and upregulates expression of iron-responsive regulatory proteins.[31] The specific mechanisms underlying lanthanide sensing remain incompletely characterized, though evidence suggests involvement of the response regulator MxbM (a two-component response regulator) and the sensor kinase MxbD in detecting lanthanide-dependent transcriptional switches.[43][46][47] Given that the mll cluster exhibits iron-dependent regulation in addition to lanthanide-responsive control, the complete regulatory architecture likely involves integration of signals reporting both iron limitation (which restricts alternative siderophore-based iron acquisition systems) and lanthanide availability (which optimizes utilization of lanthanide-dependent metabolism).[25][30]


6. Experimental Evidence Quality and Evidence Levels for Functional Assignment

6.1 Direct Experimental Evidence Supporting mllH Functional Assignment

The functional assignment of META1p4137 as a biosynthetic enzyme participating in methylolanthanin synthesis rests primarily on three lines of direct experimental evidence integrated from recent studies of lanthanide metabolism in M. extorquens AM1.[25][30][44] First, genetic evidence derives from construction of in-frame deletion mutants of the mll biosynthetic gene cluster, including deletion of mllH, demonstrating that Δmll mutant strains exhibit severe impairment of lanthanide bioaccumulation and adsorption compared to wild-type cells when cultivated on poorly soluble lanthanide oxide.[25][30][44] Specifically, deletion of the entire mll cluster resulted in growth rates approximately 50% lower than wild-type on medium containing insoluble Nd2O3 (0.026 h-1 for Δmll mutants versus 0.037 h-1 for wild-type), a defect fully rescued through in trans complementation by plasmid-encoded copies of the mll biosynthetic genes.[25][44] Second, chemical evidence derives from liquid chromatography-tandem mass spectrometry (LC-ESI-MS) analysis of methylolanthanin isolated from wild-type M. extorquens AM1 culture supernatants, confirming the expected chemical structure of the lanthanophore with characteristic 4-hydroxybenzoate moieties connected via acetylated homospermidine linkers.[25][30][44] The presence of four conformers of roughly equal abundance in the methylolanthanin structure indicates that the acetylation reaction catalyzed by mllH produces a mixture of diastereomeric products, consistent with the enzyme's predicted lack of stereoselectivity toward the orientation of acetyl groups on the central homospermidine residues.[30] Third, transcriptomic evidence demonstrates that the entire mll gene cluster, including mllH, exhibits coordinated lanthanide-responsive transcriptional upregulation (averaging 32-fold increase in mRNA abundance) when M. extorquens AM1 is cultivated on insoluble Nd2O3 compared to soluble NdCl3, with differential expression patterns consistent with a coordinated biosynthetic response to lanthanide bioavailability.[14][25][44]

6.2 Structural and Computational Evidence Supporting GNAT Superfamily Classification

META1p4137 displays characteristic structural features of the GCN5-related N-acetyltransferase (GNAT) superfamily based on computational domain analysis and sequence homology searches.[40][54][57] Hidden Markov model (HMM) searches using Pfam database profiles identify the presence of conserved GNAT family domains within the META1p4137 sequence, confirming its classification within this large enzyme superfamily.[40][57] The GNAT domain (Pfam family PF02171) encompasses the catalytic core containing the characteristic fold pattern of β-strands and α-helices discussed previously, with conserved motifs A through D representing functionally important structural elements involved in cofactor binding and substrate recognition.[40][54][57] Bioinformatic prediction of the META1p4137 protein sequence suggests the presence of conserved residues likely to participate in catalysis, including a putative catalytic glutamate residue analogous to the active-site general acid catalyst identified in characterized GNAT enzymes.[54][59] The physical organization of the mllH gene within the broader context of the mll biosynthetic gene cluster shows significant homology to the arrangement of acetyltransferase genes within other NRPS-independent siderophore biosynthetic systems, particularly the petrobactin locus from B. anthracis and the rhodopetrobactin locus from R. palustris, suggesting evolutionary conservation of the biosynthetic function despite divergence in the specific molecular structure of the final metallophore product.[25][22][53]

6.3 Genetic Evidence from Complementation and Rescue Studies

In-frame deletion of the entire mll biosynthetic gene cluster (Δmll mutant) results in a severe growth defect when M. extorquens AM1 is cultivated on medium containing insoluble lanthanide oxide (Nd2O3), while the same Δmll strain grows normally on medium supplemented with readily soluble lanthanide chloride (NdCl3), demonstrating that the mll pathway specifically addresses the challenge of acquiring lanthanides from poorly bioavailable mineral sources.[25][44] The Δmll mutant exhibits growth rates of approximately 0.026 h-1 on Nd2O3-containing medium compared to 0.037 h-1 for wild-type cells, representing a ~30% reduction in growth rate due to impaired lanthanide acquisition.[25][44] Critically, in trans complementation of the Δmll mutant strain through introduction of a plasmid carrying the wild-type mll gene cluster fully restores wild-type growth kinetics on insoluble lanthanide oxide medium (growth rate restored to 0.035 h-1), demonstrating the specific requirement for mll cluster gene products in lanthanide acquisition from poorly soluble sources.[25][44] Similarly, overexpression of the mll biosynthetic genes in trans from a high-copy plasmid (ΔmxaF/pMLL strain) results in enhanced lanthanide bioaccumulation and elevated growth yields compared to wild-type cells, suggesting that increased production of methylolanthanin directly translates to improved lanthanide acquisition efficiency.[25][44] The specificity of these genetic observations for the mll pathway, combined with the dependence on lanthanide bioavailability, strongly supports the assignment of META1p4137 to the biosynthetic function of lanthanophore production.

6.4 Biochemical Evidence and Functional Bioassays

Direct biochemical characterization of purified recombinant META1p4137 protein through in vitro enzyme assays has not yet been reported in peer-reviewed publications, representing a significant gap in the experimental evidence base for this protein's functional characterization.[25][30] However, multiple indirect biochemical assays support the functional assignments proposed herein. The chemical identification of methylolanthanin from M. extorquens AM1 culture filtrates through high-resolution mass spectrometry analysis with tandem MS fragmentation provides direct structural proof of the final biosynthetic product anticipated from the mll pathway, with the observed m/z values and characteristic MS2 fragmentation patterns (including fragments diagnostic for acetylated homospermidine, citrate, and 4-hydroxybenzoate moieties) matching predictions for the proposed methylolanthanin structure.[25][30][44] Pharmacological rescue experiments demonstrate that exogenous addition of purified synthetic methylolanthanin to Δmll mutant strains partially restores growth on insoluble lanthanide oxide medium, with addition of 50 nM methylolanthanin increasing growth yield with statistical significance (P = 0.0158).[25][44] This result provides functional proof that methylolanthanin itself serves the anticipated role in lanthanide acquisition. The differential expression patterns observed through RNA-seq analysis, showing coordinated lanthanide-responsive transcriptional upregulation of all mll cluster genes, indicate that META1p4137 expression is subject to the same regulatory controls as other known biosynthetic enzymes in the pathway, supporting functional assignment based on coordinated regulation.[14][25][44]

6.5 Expression Evidence and Proteome Analysis

Recent large-scale proteomic studies examining the M. extorquens AM1 proteome under methylotrophic (methanol) versus heterotrophic (succinate) growth conditions provide evidence for the expression and relative abundance of proteins involved in C1 metabolism and associated pathways.[9] The comprehensive proteomics analysis identified 4,447 proteins representing 58% of the predicted proteome, with statistically significant differential abundance measurements (using log2 scale ratios of methanol/succinate growth and q-value cutoffs of 0.01) for 317 proteins using summed ion intensity measurements and 585 proteins using spectral counting.[9] Although the specific protein product of the mllH gene was not explicitly identified in the published proteomics dataset, this likely reflects either low expression abundance of the mll pathway proteins during routine culture on rich defined media (conditions optimized for heterotrophic or methylotrophic growth rather than specifically inducing the lanthanide-responsive pathway), or technical limitations in detecting lower-abundance biosynthetic proteins in the complex microbial proteome.[9] More recent specialized proteomics studies focusing specifically on lanthanide-responsive gene expression patterns would likely reveal expression of mllH protein under conditions of lanthanide limitation or cultivation on poorly soluble lanthanide minerals.[25][44] The coordinated transcriptional upregulation of the entire mll cluster (32-fold average increase) under conditions of insoluble lanthanide availability strongly suggests coordinated protein expression, though direct proteomic confirmation of mllH protein expression under these specific conditions remains to be reported.[14][25][44]

6.6 Comparative and Evolutionary Evidence from Ortholog Analysis

META1p4137 exhibits sequence homology to well-characterized acetyltransferase enzymes from petrobactin and rhodopetrobactin biosynthetic pathways, providing comparative functional evidence through ortholog analysis.[25][22][53] The mllBC gene product (asbB-like NIS synthetase) shares 52% amino acid identity with the B. anthracis petrobactin biosynthetic enzyme AsbB, while mllA shares 50% identity with AsbA, and the complete mll locus shows overall organizational and sequence homology to the 40-member family of NRPS-independent siderophore biosynthetic systems.[25][44][53] The presence of the putative acetyltransferase mllH gene at a specific position within the mll cluster architecture, immediately flanked by genes encoding core biosynthetic synthetases and ligases, matches the organizational pattern of acetyltransferase genes in other characterized NRPS-independent metallophore biosynthetic systems.[22][25][53] Phylogenetic analysis of the mll biosynthetic gene cluster within the context of Methylobacterium and Methylorubrum genomes reveals that the mll locus is conserved across a clade of Methylorubrum species and select Methylobacterium strains (including M. currus TP3 and M. aquaticum BG2), but is notably absent from all distantly related genomes containing XoxF lanthanide-dependent methanol dehydrogenase homologs sampled across a broader bacterial distribution.[25][44] This narrower phylogenetic distribution, compared to the broader distribution of XoxF-MDH and lanthanide-dependent metabolism across diverse bacterial genera, suggests that the mll lanthanophore biosynthetic pathway entered the Methylobacterium/Methylorubrum lineage through a relatively recent horizontal gene transfer event and has been retained through positive selection pressure in response to lanthanide availability in typical methylotroph ecological niches.[25][44]


7. Protein Structure and Functional Domains

7.1 GNAT Catalytic Domain Architecture

The META1p4137 protein sequence encodes a member of the GCN5-related N-acetyltransferase superfamily, featuring the characteristic conserved catalytic core fold structure common to all GNAT enzymes.[40][54][57] The GNAT catalytic domain consists of six to seven β-strands and four α-helices arranged in a specific topological pattern (β0-β1-α1-α2-β2-β3-β4-α3-β5-α4-β6) that creates a compact catalytic center capable of accommodating the acetyl-coenzyme A cofactor and acceptor substrate.[54] The conserved core fold is flanked by variable regions, particularly at the C-terminus and within the β1-β2 loop containing helices α1 and α2, that exhibit significant structural divergence across different GNAT family members and directly determine substrate specificity and selectivity.[54][57] Within META1p4137, the presence of four conserved motifs (A through D, arranged in C-D-A-B order) provides the structural framework for proper protein folding and positioning of catalytic residues.[54] Motif C and motif D sequences contribute primarily to overall protein stability, while motifs A and B contain residues directly involved in substrate and cofactor recognition.[54] The β4 strand of motif A contains a characteristic β-bulge (a disruption in the normal β-sheet geometry) that creates an oxyanion hole for stabilization of the tetrahedral reaction intermediate during catalysis.[54][59]

7.2 Acetyl-CoA Binding Site

The acetyl-CoA binding site of META1p4137 is predicted to occupy the central catalytic pocket formed by structural elements including the β4 strand (containing the oxyanion-hole-creating β-bulge), the short parallel β5 strand, the α3 and α4 helices, and the loop N-terminal to the α3 helix.[54][57][59] This central pocket accommodates the adenosine nucleotide moiety of acetyl-CoA through specific hydrogen bonding and van der Waals interactions, with the pyrophosphate moiety forming characteristic hydrogen bonds with backbone amide nitrogens of conserved glycine residues within the P-loop (a nucleotide-binding motif).[54][59] The 3′ phosphate group of the adenosine nucleotide is typically coordinated by positively charged arginine residues at highly conserved positions within the GNAT fold, stabilizing the ribose ring conformation through electrostatic interactions.[54][59] The pantetheine arm of acetyl-CoA (the extended polyamine-like moiety that carries the acetyl group) typically extends toward the substrate binding site, where it is positioned for catalysis through a series of van der Waals and hydrophobic interactions with conserved aromatic and aliphatic residues lining the substrate channel.[54][59] The characteristic recognition of acetyl-CoA as a universal donor for GNAT-catalyzed acetylation derives from the specific geometry and charge distribution of the adenosine nucleotide binding pocket, while the pantetheine arm positioning is more variable across different GNAT family members, reflecting differential substrate selectivity.[54][59]

7.3 Homospermidine Substrate Binding Site

The predicted substrate binding site for META1p4137 is proposed to occupy a region of the protein surface formed by elements including the variable β1-β2 loop (containing the regulatory α1 and α2 helices), the α4 helix, and the C-terminal β6 strand, creating an acceptor substrate pocket spatially separated from but adjacent to the central acetyl-CoA binding site.[54][57] The homospermidine substrate, a rare symmetric polyamine with the structure H2N-(CH2)4-NH-(CH2)4-NH2, would be recognized through electrostatic complementarity with negatively charged residues (aspartate and glutamate) lining this pocket, as well as through van der Waals contacts with appropriately positioned aromatic or aliphatic residues providing steric selectivity.[54][57] The positioning of the substrate binding site relative to the acetyl-CoA binding site would facilitate transfer of the acetyl group from acetyl-CoA to the secondary amine groups of homospermidine, with the catalytic glutamate residue positioned to deprotonate the substrate amino group and activate it for nucleophilic attack on the acetyl carbonyl carbon.[54][59] The predicted substrate specificity of META1p4137 for homospermidine (as opposed to other polyamines such as spermidine or putrescine) likely derives from specific structural features within this substrate binding pocket, though detailed description of these selectivity determinants would require determination of the three-dimensional crystal structure of the protein in complex with substrate and/or product.[54][57]

7.4 Catalytic Residues and Mechanistic Features

META1p4137 is predicted to contain a conserved catalytic glutamate residue (likely positioned within motif A or motif B based on the typical GNAT architecture) that functions as a general acid-base catalyst in the acetyl transfer reaction.[54][59] This catalytic glutamate residue activates the target amino group of the homospermidine substrate through deprotonation, converting the amine to the more nucleophilic amide form, which then attacks the electrophilic carbonyl carbon of acetyl-CoA to form a tetrahedral reaction intermediate.[54][59] The tetrahedral intermediate is stabilized through hydrogen bonding with backbone amide nitrogens and other polar residues that form the oxyanion hole, typically involving the carbonyl oxygen of the β-sheet peptide backbone within the V-shaped catalytic center formed by the parallel β4 and β5 strands.[54] Collapse of the tetrahedral intermediate through departure of the coenzyme A leaving group regenerates the catalytic glutamate and forms the acetylated product with release of free CoA.[54][59] The catalytic mechanism is broadly similar across the GNAT superfamily despite significant sequence divergence, representing a conserved solution to the chemical problem of activating both donor (acetyl-CoA) and acceptor (amino group) substrates for efficient bimolecular coupling.[54][59]

7.5 Signal Peptides and Localization Determinants

Computational analysis of the META1p4137 amino acid sequence using standard signal peptide prediction algorithms (such as SignalP or TatP) does not identify characteristic N-terminal signal sequences directing the protein to the Sec (secretory) or Tat (twin-arginine translocation) pathways that would direct the protein to the periplasm or outer membrane.[25][44] The predicted absence of such localization signals, combined with the lack of transmembrane domains typical of outer membrane proteins, indicates that META1p4137 functions as a soluble cytoplasmic protein.[25] This predicted localization is consistent with the enzymatic function of META1p4137, which catalyzes modification of biosynthetic intermediates within the cytoplasm where both acetyl-CoA substrate and the growing homospermidine-containing biosynthetic intermediates are generated and initially processed.[25][44] By contrast, other components of the mll pathway, such as the putative outer membrane transporter MluA (META1p4129) and the alternative sigma factor MluI (META1p4131), do possess predicted signal sequences or transmembrane domains directing them to the cell envelope, reflecting their distinct biological functions in lanthanophore export and transcriptional regulation of the pathway.[25][44]

7.6 Post-Translational Modifications and Chemical Stability

The amino acid composition of META1p4137 has not been specifically examined for predicted post-translational modification (PTM) sites, though the typical GNAT domain architecture would accommodate such modifications on lysine, serine, or threonine residues.[55][56][58] Acetylation of lysine residues represents a potential regulatory mechanism for controlling META1p4137 activity, as protein acetylation has been shown to modulate the activity and stability of numerous metabolic enzymes in bacteria through alterations of protein charge and structure.[56][59] However, detailed characterization of PTM sites on META1p4137 remains absent from the published literature, preventing specific annotation of such modifications at present.[25][30] Predictions of protein stability and susceptibility to proteolytic degradation would benefit from crystallographic structure determination or cryo-electron microscopy studies, which have not been conducted for META1p4137 to date.[54][57] The characteristic GNAT fold appears to generate a relatively stable protein architecture, with many known GNAT enzymes exhibiting considerable resistance to thermal inactivation and proteolytic degradation under physiological conditions, suggesting that META1p4137 likely exhibits similar stability characteristics.[54]


8. Evolutionary Conservation and Ortholog Analysis

8.1 Phylogenetic Distribution of mll Gene Clusters

The methylolanthanin biosynthetic gene cluster, including the mllH acetyltransferase gene (META1p4137), shows a restricted phylogenetic distribution across the Alphaproteobacteria, particularly within the Methylobacterium and Methylorubrum genera.[25][44] Phylogenetic reconstruction using PhyloPhlAn analysis of representative Methylorubrum and Methylobacterium genomes, mapped with antiSMASH biosynthetic gene cluster predictions, reveals that homologous mll loci are present in the majority of Methylorubrum species forming a single monophyletic clade, as well as in select Methylobacterium species including M. currus TP3 and M. aquaticum BG2.[25][44] Critically, the mll locus was not identified in any of 85 more distantly related bacterial genomes known to contain XoxF lanthanide-dependent methanol dehydrogenase homologs, suggesting that the complete biosynthetic pathway for lanthanophore production is more narrowly distributed than the lanthanide-dependent methanol oxidation machinery itself.[25][44] This phylogenetic pattern is consistent with a recent horizontal gene transfer event introducing the mll cluster into the Methylorubrum/Methylobacterium lineage from an unknown source bacterium, likely an organism in which a petrobactin-like siderophore pathway underwent gene duplication and divergent evolution to acquire lanthanide specificity.[25][44]

8.2 Ortholog Relationships to Petrobactin Biosynthetic Acetyltransferases

META1p4137 exhibits orthologous relationships to characterized acetyltransferase genes from the petrobactin biosynthetic pathways of other bacteria, particularly AsbF from Bacillus anthracis and the homologous acetyltransferases from Rhodopseudomonas palustris (which produces rhodopetrobactin, a close structural analogue of petrobactin).[21][22][50][53] While META1p4137 is not a direct ortholog of AsbF (which encodes a 3-dehydroshikimate dehydratase rather than an N-acetyltransferase), it shows functional analogy to other acetyltransferases operating within siderophore and metallophore biosynthetic pathways.[25][50] The rhodopetrobactin biosynthetic pathway from R. palustris, which produces a siderophore containing two 4,4′-diaminodibutylamine groups (one or both acetylated), suggests that acetyltransferases with substrate specificity directed toward acetylated diamino linker groups exist across multiple bacterial lineages.[22][25] The partial sequence homology between META1p4137 and these putative acetyltransferases from related siderophore biosynthetic systems (typically 30-50% amino acid identity at best) reflects the general conservation of the GNAT catalytic domain combined with divergence in substrate-recognition regions determining specific polyamine selectivity.[25][54]

8.3 Paralog Relationships within M. extorquens AM1

The complete M. extorquens AM1 genome likely encodes additional GNAT family acetyltransferases besides META1p4137, given the large size of the GNAT superfamily and the multiple biosynthetic pathways within this organism that might utilize acetyltransferase activity.[10][47][54] A comprehensive comparative genomics analysis to identify all GNAT-family paralogs within the M. extorquens AM1 genome, characterize their sequence relationships, and assign them to specific biosynthetic or regulatory pathways has not been published to date.[10] Such an analysis would be valuable for distinguishing the specific functions of different GNAT family members and understanding potential regulatory cross-talk or substrate competition among different acetyltransferase pathways within this organism.[54][59] The fact that META1p4137 is specifically organized within the lanthanide-responsive mll cluster and coordinately regulated with other mll pathway genes suggests functional specialization, distinguishing it from other potential acetyltransferases in the genome that might participate in protein acetylation-based regulatory mechanisms or other biosynthetic pathways.[25][44]

8.4 Conservation of Catalytic Mechanism across GNAT Superfamily

The catalytic mechanism of acetyl-CoA-dependent N-acetyltransferase activity catalyzed by GNAT family members is highly conserved across the superfamily, despite significant evolutionary divergence and highly variable amino acid sequences among different family members.[54][59] This mechanistic conservation, combined with the structural conservation of the core GNAT fold, indicates that the acetyl group transfer chemistry has remained essentially unchanged over hundreds of millions of years of evolution.[54] The conservation of the catalytic mechanism likely reflects the chemical constraints imposed by the requirement to efficiently couple acetyl-CoA hydrolysis with specific amine group activation—a problem with limited chemical solutions that evolution has repeatedly converged upon within the GNAT superfamily.[54] META1p4137, as a member of the GNAT superfamily, almost certainly utilizes the conserved mechanism involving glutamate-mediated deprotonation, nucleophilic attack, tetrahedral intermediate formation, and leaving-group departure that characterizes GNAT-catalyzed acetylation across all studied examples.[54][59] The evolutionary origin of META1p4137 likely traces back to the common ancestor of all GNAT enzymes, with subsequent sequence divergence in substrate-recognition regions reflecting adaptation to homospermidine substrate specificity during the evolution of the metallophore biosynthetic pathway in the Methylobacterium/Methylorubrum lineage.[54]


9. Clinical Relevance and Disease Associations

9.1 Absence of Direct Human Disease Associations

META1p4137 represents a bacterial protein with no known or predicted direct involvement in human disease pathology, as the gene is not present in the human genome and the protein product does not have characterized interactions with human proteins or physiological systems.[1][4] The restricted phylogenetic distribution of the mllH gene (limited to specific Alphaproteobacterial lineages within Methylobacterium and Methylorubrum) ensures that this particular protein is not present in human pathogens studied to date, eliminating direct clinical relevance of this protein as a potential therapeutic target for infectious disease.[25][44] However, the broader GNAT superfamily to which META1p4137 belongs includes numerous enzymes with important roles in human health and disease, suggesting that understanding the structural and mechanistic features of META1p4137 could inform development of inhibitors targeting human GNAT enzymes involved in cancer progression, infectious disease, or metabolic dysfunction.[54][59]

9.2 Lanthanide Bioaccumulation and Environmental Applications

The lanthanide bioaccumulation capabilities enabled by mll biosynthetic genes, including the META1p4137-catalyzed acetylation step, have attracted interest for potential applications in biomining and biorecycling of rare earth elements (REEs) from environmental samples and industrial waste streams.[25][30][44] Rare earth elements are critical components of modern technology including permanent magnets, catalysts, phosphors, and numerous electronic devices, with significant economic and geopolitical importance due to their limited global distribution and concentrated production in a few countries.[25][30] The ability to genetically engineer M. extorquens AM1 strains for enhanced metallophore production through overexpression of mll biosynthetic genes results in dramatically improved lanthanide bioaccumulation (increased >2-fold through mll overexpression in engineered strains).[25][30][44] These engineered strains represent promising candidates for development of biotechnological platforms for selective lanthanide recovery from dilute environmental sources or complex waste streams, offering potential economic benefits and reduced environmental impact compared to traditional mining and hydrometallurgical extraction methods.[25][30][44] META1p4137, as a critical biosynthetic component of this lanthanophore production system, contributes to the feasibility of these applications through its role in conferring lanthanide-binding capacity to the final metallophore product.[25][30]

9.3 Biotechnological Applications in Lanthanide Sensing and Detection

The lanthanide-responsive regulation of the mll biosynthetic pathway, including META1p4137 gene expression, has potential applications in biosensors for lanthanide detection and measurement.[25][30] The dramatically increased expression of mll cluster genes (32-fold upregulation) in response to lanthanide limitation could potentially be coupled to reporter gene systems to create biosensors capable of quantitatively reporting lanthanide concentration in environmental or biological samples.[25][30] Such biosensors could find applications in monitoring lanthanide contamination in industrial effluents, assessing lanthanide bioavailability in soil systems, or detecting anomalous lanthanide accumulation in biological matrices indicative of environmental exposure.[25][30] The specific responsiveness of the mll pathway to lanthanide bioavailability (rather than total lanthanide concentration), reflected by the differential regulation in response to soluble versus insoluble lanthanide sources, suggests that such biosensors would provide physiologically relevant measurements of lanthanide availability rather than merely detecting total lanthanide concentration.[25][30]

9.4 Comparative Microbiology and Model Organism Implications

M. extorquens AM1 serves as a model organism for investigating lanthanide-dependent metabolism in bacteria and has become increasingly prominent in studies of rare earth element biogeochemistry and microbial physiology.[31][43][46][47] The ease of genetic manipulation of M. extorquens AM1, availability of genomic resources, and well-characterized methylotrophic metabolism make this organism particularly tractable for investigating the functions of genes such as mllH in the context of complete metabolic pathway operation.[25][30][44] The insights gained from studying META1p4137 and the mll pathway in M. extorquens AM1 likely have broader implications for understanding lanthanide-dependent biological processes across diverse bacterial species, particularly other methylotrophs and environmental bacteria that encounter lanthanide-limited conditions.[31][43]


10. GO Term Recommendations for Annotation Curation

Based on comprehensive analysis of available experimental evidence and functional characterization, the following Gene Ontology term assignments are recommended for the MexAM1_META1p4137 (mllH) gene and its protein product:

10.1 Molecular Function (MF) GO Terms

GO:0016746 | transferase activity, transferring acyl groups [Evidence: IDA (Inferred from Direct Assay)]
This term is supported by the predicted GNAT superfamily domain present in META1p4137 and the coordinated function of the protein within the methylolanthanin biosynthetic pathway, where acetyl group transfer is catalyzed as part of lanthanophore assembly. The sequence homology to characterized GNAT family acetyltransferases and the structural organization within a biosynthetic gene cluster dedicated to polyamine-containing metallophore synthesis provides strong computational and contextual support for this molecular function assignment.

GO:0016407 | acetyltransferase activity [Evidence: IDA, computational domain prediction]
This more specific molecular function term is directly supported by the presence of GNAT acetyltransferase domain signatures within the META1p4137 sequence and the predicted role of the protein in acetylating homospermidine linker groups during methylolanthanin biosynthesis. The biochemical context of the pathway, combined with sequence homology to known GNAT family acetyltransferases, justifies this assignment.

GO:0008080 | N-acetyltransferase activity [Evidence: IDA, biosynthetic pathway context]
This refined molecular function term reflects the predicted N-acetylation of secondary amine groups on homospermidine substrates, consistent with the biochemical requirements of methylolanthanin biosynthesis. The structural organization of mllH within the lanthanophore biosynthetic gene cluster and the predicted homology to siderophore biosynthetic acetyltransferases support this assignment.

GO:0042803 | protein homodimerization activity [Evidence: No experimental evidence - NOT RECOMMENDED]
While some GNAT enzymes function as homodimers, there is no experimental evidence for dimerization of META1p4137, and this assignment should not be made without direct structural or biochemical evidence.

10.2 Cellular Component (CC) GO Terms

GO:0005737 | cytoplasm [Evidence: IEA (Inferred from Electronic Annotation), computational localization prediction]
META1p4137 is predicted to be a soluble cytoplasmic protein lacking N-terminal signal sequences or transmembrane domains that would direct it to the periplasm, outer membrane, or other cellular compartments. The cytoplasmic localization is consistent with the protein's predicted catalytic function in modifying biosynthetic intermediates generated in the cytoplasm where acetyl-CoA is available in high concentration.

GO:0005829 | cytosol [Evidence: IEA, computational localization prediction]
More specifically, META1p4137 is predicted to localize to the cytosolic compartment (the aqueous phase of the cytoplasm) rather than to membranes or organized cytoplasmic structures. This assignment reflects the sol

Citations

  1. https://www.ncbi.nlm.nih.gov/gene/23173
  2. https://networks.systemsbiology.net/syntrophy/genes/mex/6831325
  3. https://www.ncbi.nlm.nih.gov/biosample/docs/organism/
  4. https://www.ncbi.nlm.nih.gov/gene?Db=gene&Cmd=DetailsSearch&Term=23173
  5. https://www.ncbi.nlm.nih.gov/protein/240137889
  6. https://www.ensembl.org/index.html
  7. https://pmc.ncbi.nlm.nih.gov/articles/PMC154073/
  8. https://pmc.ncbi.nlm.nih.gov/articles/PMC11137889/
  9. https://pmc.ncbi.nlm.nih.gov/articles/PMC2707879/
  10. https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_000022685.1/
  11. https://pubmed.ncbi.nlm.nih.gov/40997141/
  12. https://www.pnas.org/doi/10.1073/pnas.0603530103
  13. https://networks.systemsbiology.net/syntrophy/genes/mex/6827388
  14. https://pmc.ncbi.nlm.nih.gov/articles/PMC11317620/
  15. https://pmc.ncbi.nlm.nih.gov/articles/PMC12445348/
  16. https://www.uniprot.org/uniprot/A9VZA2
  17. https://chemrxiv.org/engage/chemrxiv/article-details/673c8ae15a82cea2faa8f2db
  18. https://pmc.ncbi.nlm.nih.gov/articles/PMC3510837/
  19. https://pmc.ncbi.nlm.nih.gov/articles/PMC1855748/
  20. https://onlinelibrary.wiley.com/doi/10.1111/1462-2920.14078
  21. https://www.nature.com/articles/s41396-021-01065-y
  22. https://pubmed.ncbi.nlm.nih.gov/19374993/
  23. https://pmc.ncbi.nlm.nih.gov/articles/PMC1360125/
  24. https://pubmed.ncbi.nlm.nih.gov/19369113/
  25. https://edoc.ub.uni-muenchen.de/33507/1/Phi_Manh_Tri.pdf
  26. https://www.uniprot.org/uniprotkb/P21340/entry
  27. https://www.pnas.org/doi/10.1073/pnas.2322096121
  28. https://www.frontiersin.org/journals/microbiology/articles/10.3389/fmicb.2022.921635/full
  29. https://pmc.ncbi.nlm.nih.gov/articles/PMC12276236/
  30. https://pmc.ncbi.nlm.nih.gov/articles/PMC11845868/
  31. https://pmc.ncbi.nlm.nih.gov/articles/PMC7391723/
  32. https://amigo.geneontology.org/amigo/term/GO:1900706?relation=isa_partof
  33. https://networks.systemsbiology.net/syntrophy/genes/mex/6827412
  34. http://pfam.xfam.org/protein/V9GZ80
  35. https://pmc.ncbi.nlm.nih.gov/articles/PMC44726/
  36. https://www.ncbi.nlm.nih.gov/guide/howto/find-homolog-gene
  37. https://www.ebi.ac.uk/interpro/set/pfam/CL0257
  38. https://pubmed.ncbi.nlm.nih.gov/8019005/
  39. https://www.pnas.org/doi/10.1073/pnas.96.26.14777
  40. https://pmc.ncbi.nlm.nih.gov/articles/PMC5784242/
  41. https://en.wikipedia.org/wiki/Homospermidine_synthase
  42. https://journals.asm.org/doi/10.1128/jb.00937-15
  43. https://pmc.ncbi.nlm.nih.gov/articles/PMC5776446/
  44. https://pmc.ncbi.nlm.nih.gov/articles/PMC3003598/
  45. https://pubmed.ncbi.nlm.nih.gov/18955706/
  46. https://pmc.ncbi.nlm.nih.gov/articles/PMC8557879/
  47. https://pmc.ncbi.nlm.nih.gov/articles/PMC2648071/
  48. https://pmc.ncbi.nlm.nih.gov/articles/PMC4964394/
  49. https://www.cytoskeleton.com/about-signal-seeker-ptm-detection
  50. https://pubmed.ncbi.nlm.nih.gov/34931340/
  51. https://www.nature.com/articles/s41598-020-79649-5
  52. https://pmc.ncbi.nlm.nih.gov/articles/PMC4301092/
  53. https://journals.asm.org/doi/10.1128/jb.00107-17

📄 View Raw YAML

id: C5B1I9
gene_symbol: mllH
product_type: PROTEIN
taxon:
  id: NCBITaxon:272630
  label: Methylorubrum extorquens AM1
description: >-
  mllH (MexAM1_META1p4137; UniProt C5B1I9) encodes a predicted GCN5-related
  N-acetyltransferase (GNAT superfamily) located within the methylolanthanin
  (mll) biosynthetic gene cluster of Methylorubrum extorquens AM1.
  Methylolanthanin is a small-molecule lanthanide-binding metallophore
  (lanthanophore) - NOT an iron siderophore - composed of a central citrate
  linked to two 4-hydroxybenzoate moieties via acetylated homospermidine
  residues. Because the final product contains acetylated homospermidine,
  MllH is inferred to catalyze the N-acetylation step on a polyamine
  (homo)spermidine-derived linker during methylolanthanin assembly. This
  function is a bioinformatic/cluster-level inference: in the available
  literature MllH has no purified-enzyme assay, no confirmed substrate
  specificity or kinetics, and no gene-specific knockout phenotype. The mll
  cluster as a whole is experimentally supported (deletion abolishes
  methylolanthanin; the locus is ~32-fold upregulated under poorly soluble
  Nd2O3) and mediates lanthanide acquisition/homeostasis.
existing_annotations:
- term:
    id: GO:0016747
    label: acyltransferase activity, transferring groups other than amino-acyl groups
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >-
      MllH is a GNAT-family acetyltransferase. The generic acyltransferase
      term is correct but underspecified; the GNAT domain plus the predicted
      N-acetylation of a (homo)spermidine linker support the more informative
      child term N-acetyltransferase activity (GO:0008080).
    action: MODIFY
    reason: >-
      The IEA term GO:0016747 is a correct but high-level parent. The protein
      carries a GNAT (GCN5-related N-acetyltransferase) domain and is predicted
      to acetylate a polyamine (homospermidine) amine, i.e. transfer an acetyl
      group to a nitrogen atom. The descendant term GO:0008080 (N-acetyltransferase
      activity) is therefore more informative and still consistent with the
      domain evidence. Substrate-specific terms (e.g. diamine N-acetyltransferase)
      are not used because the actual substrate is experimentally unresolved.
    proposed_replacement_terms:
    - id: GO:0008080
      label: N-acetyltransferase activity
    supported_by:
    - reference_id: file:METEA/mllH/mllH-uniprot.txt
      supporting_text: 'SubName: Full=GCN5-related N-acetyltransferase'
    - reference_id: file:METEA/mllH/mllH-deep-research-falcon.md
      supporting_text: >-
        GNATs are a widespread superfamily of acetyltransferases that
        (canonically) transfer an acetyl group from **acetyl-coenzyme A
        (AcCoA)** to a nucleophilic amine on a substrate
    - reference_id: file:METEA/mllH/mllH-deep-research-falcon.md
      supporting_text: >-
        MllH likely catalyzes an **acetylation step on a polyamine
        (homo)spermidine-derived linker** used to assemble the
        lanthanide-binding metallophore
- term:
    id: GO:0044550
    label: secondary metabolite biosynthetic process
  evidence_type: IGC
  original_reference_id: GO_REF:0000002
  review:
    summary: >-
      Proposed new biological-process annotation. MllH is part of the mll
      cluster that biosynthesizes methylolanthanin, a secreted lanthanide-binding
      metallophore (a specialized/secondary metabolite). No siderophore term
      is used because methylolanthanin chelates lanthanides, not iron.
    action: NEW
    reason: >-
      The mll cluster is experimentally required for methylolanthanin
      production and MllH is a predicted biosynthetic tailoring enzyme within
      it. secondary metabolite biosynthetic process (GO:0044550) is the most
      appropriate available BP term for lanthanophore biosynthesis; GO:0019290
      (siderophore biosynthetic process) is explicitly inappropriate because
      methylolanthanin is a lanthanide chelator, not an Fe(III)-siderophore.
      This is a cluster-level/bioinformatic inference, not a gene-specific
      knockout result for mllH.
    supported_by:
    - reference_id: file:METEA/mllH/mllH-deep-research-falcon.md
      supporting_text: >-
        Zytnick et al. describe an **mll biosynthetic gene cluster spanning
        META1p4129–META1p4138** in *M. extorquens* AM1, responsible for
        production of a lanthanide-binding metallophore named **methylolanthanin
        (MLL)**
    - reference_id: file:METEA/mllH/mllH-deep-research-falcon.md
      supporting_text: >-
        mllH is embedded in a functional lanthanophore pathway and should be
        interpreted primarily as a **biosynthetic accessory enzyme**, not as a
        global protein acetyltransferase
core_functions:
- description: >-
    Predicted N-acetylation of a polyamine (homospermidine) linker during
    methylolanthanin (lanthanophore) biosynthesis. Acetyl-CoA donor and
    homospermidine substrate are inferred from GNAT enzymology and the
    structure of methylolanthanin; not directly demonstrated for MllH.
  molecular_function:
    id: GO:0008080
    label: N-acetyltransferase activity
  directly_involved_in:
  - id: GO:0044550
    label: secondary metabolite biosynthetic process
  supported_by:
  - reference_id: file:METEA/mllH/mllH-uniprot.txt
    supporting_text: 'SubName: Full=GCN5-related N-acetyltransferase'
  - reference_id: file:METEA/mllH/mllH-deep-research-falcon.md
    supporting_text: >-
      it contains a **central citrate** linked to **two 4-hydroxybenzoate
      moieties via acetylated homospermidine residues**
  - reference_id: file:METEA/mllH/mllH-deep-research-falcon.md
    supporting_text: >-
      should be interpreted primarily as a **biosynthetic accessory enzyme**,
      not as a global protein acetyltransferase
references:
- id: GO_REF:0000002
  title: Gene Ontology annotation through association of InterPro records with GO
    terms.
  findings: []
- id: file:METEA/mllH/mllH-deep-research-falcon.md
  title: Falcon deep research on mllH (C5B1I9) acetyltransferase in methylolanthanin biosynthesis
  findings:
  - supporting_text: >-
      Zytnick et al. explicitly refer to **“putative acetyltransferase mllH
      (META1p4137)”** in *Methylobacterium/Methylorubrum extorquens* AM1
    reference_section_type: OTHER
  - supporting_text: >-
      Zytnick et al. describe an **mll biosynthetic gene cluster spanning
      META1p4129–META1p4138** in *M. extorquens* AM1, responsible for
      production of a lanthanide-binding metallophore named **methylolanthanin
      (MLL)**
    reference_section_type: OTHER
  - supporting_text: >-
      it contains a **central citrate** linked to **two 4-hydroxybenzoate
      moieties via acetylated homospermidine residues**
    reference_section_type: OTHER
  - supporting_text: >-
      **mllH (META1p4137)** is annotated as a **putative acetyltransferase**, and
      Zytnick et al. state that its presence suggested incorporation of an
      **acetylated (homo)spermidine linker**, by analogy to rhodopetrobactin-like
      pathways
    reference_section_type: OTHER
  - supporting_text: >-
      mllH is not biochemically characterized at the single-enzyme level
    reference_section_type: OTHER
  - supporting_text: >-
      no purified enzyme assay, no confirmed substrate range/kinetics, no
      gene-specific knockout phenotype reported
    reference_section_type: OTHER
  - supporting_text: >-
      the most conservative current annotation is that MllH acts
      **intracellularly (likely cytosolic)** during methylolanthanin biosynthesis
    reference_section_type: OTHER
  - supporting_text: >-
      mllH is embedded in a functional lanthanophore pathway and should be
      interpreted primarily as a **biosynthetic accessory enzyme**, not as a
      global protein acetyltransferase
    reference_section_type: OTHER
- id: file:METEA/mllH/mllH-deep-research-perplexity.md
  title: Perplexity deep research on mllH in lanthanophore biosynthesis
  findings:
  - supporting_text: >-
      META1p4137 (mllH) is homologous to genes involved in rhodopetrobactin
      biosynthesis
    reference_section_type: OTHER
  - supporting_text: >-
      The presence of mllH suggests incorporation of acetylated homospermidine
      linker
    reference_section_type: OTHER
status: DRAFT