mllDE

UniProt ID: C5B1I6
Organism: Methylorubrum extorquens AM1
Review Status: INITIALIZED
πŸ“ Provide Detailed Feedback

Gene Description

Bifunctional carrier domain-containing protein (mllDE) that functions as a fusion of aryl carrier protein (AsbD-like) and ligase (AsbE-like) domains. The ACP domain requires post-translational modification with 4'-phosphopantetheine to transfer acyl intermediates, while the ligase domain catalyzes adenylation and condensation reactions during methylolanthanin biosynthesis. Part of mll cluster showing 32-fold upregulation in response to poorly soluble lanthanides.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0031177 phosphopantetheine binding
NAS NEW
Summary: Added to align core_functions with existing annotations.
Reason: Core function term not present in existing_annotations.
GO:0016874 ligase activity
NAS NEW
Summary: Added to align core_functions with existing annotations.
Reason: Core function term not present in existing_annotations.
GO:0005737 cytoplasm
NAS NEW
Summary: Added to align core_functions with existing annotations.
Reason: Core function term not present in existing_annotations.

Core Functions

Acyl carrier protein function with phosphopantetheine prosthetic group for substrate transfer

Molecular Function:
phosphopantetheine binding
Supporting Evidence:
  • file:METEA/mllDE/mllDE-deep-research-perplexity.md
    ACP domain requires phosphopantetheinylation for cargo shuttle function during biosynthesis. Contains conserved GXDS motif characteristic of ACP proteins with phosphopantetheine prosthetic group

Adenylation and condensation catalysis for lanthanophore assembly

Molecular Function:
ligase activity
Cellular Locations:
Supporting Evidence:
  • file:METEA/mllDE/mllDE-deep-research-perplexity.md
    Ligase domain catalyzes adenylation and condensation reactions. Predicted cytoplasmic localization based on lack of transmembrane domains or signal peptides

References

file:METEA/mllDE/mllDE-deep-research-perplexity.md
Deep research on mllDE fusion protein in metallophore biosynthesis
  • META1p4134 encodes bifunctional mllDE fusion protein with AsbD-like ACP and AsbE-like ligase domains
  • ACP domain requires phosphopantetheinylation for cargo shuttle function during biosynthesis
  • Ligase domain catalyzes adenylation of citrate and 4-hydroxybenzoate for metallophore assembly
  • Part of mll cluster showing 32-fold upregulation with poorly soluble lanthanides (Nd2O3)

πŸ“š Additional Documentation

Deep Research Perplexity

(mllDE-deep-research-perplexity.md)

provider: perplexity
model: sonar-deep-research
cached: false
start_time: '2025-10-31T09:44:27.315342'
end_time: '2025-10-31T09:47:38.157871'
duration_seconds: 190.84
template_file: templates/gene_research_go_focused.md
template_variables:
organism: METEA
gene_or_uniprot: MexAM1_META1p4134
provider_config:
timeout: 600
max_retries: 3
parameters:
reasoning_effort: medium
search_domain_filter: []
return_citations: true
temperature: 0.0
citation_count: 48


Question

Gene Research Template for GO Annotation Curation

Please provide a comprehensive research report on the gene MexAM1_META1p4134 in METEA.

Research Focus Areas for GO Annotation

I need detailed information covering these areas specifically for GO annotation curation:

1. Molecular Function (MF)

  • Primary molecular function(s) - specific biochemical activities
  • Enzymatic properties - catalytic activities, substrate specificity, cofactor requirements
  • Binding activities - what molecules does the protein bind (DNA, RNA, proteins, small molecules)
  • Regulatory functions - transcriptional regulation, kinase/phosphatase activities
  • Transport functions - ion channels, transporters, carriers

2. Cellular Component (CC)

  • Subcellular localization - where is the protein located within the cell
  • Membrane association - integral membrane, peripheral membrane, or cytosolic
  • Organellar localization - nucleus, mitochondria, ER, Golgi, etc.
  • Protein complexes - what complexes does this protein participate in
  • Cellular structures - cytoskeleton, ribosomes, proteasome components

3. Biological Process (BP)

  • Primary biological pathways - metabolic, signaling, developmental processes
  • Physiological roles - what happens when this gene is mutated or deleted
  • Developmental processes - embryogenesis, organogenesis, differentiation
  • Response to stimuli - environmental, chemical, biological responses
  • Cellular processes - cell cycle, apoptosis, DNA repair, protein folding

4. Experimental Evidence Quality

  • Direct experimental evidence - biochemical assays, functional studies
  • Genetic evidence - knockout/knockdown phenotypes, complementation
  • Physical evidence - protein interactions, localization studies
  • Expression evidence - tissue/stage-specific expression patterns
  • Comparative evidence - ortholog function in other species

5. Disease and Phenotype Associations

  • Human disease associations - genetic disorders, cancer involvement
  • Model organism phenotypes - what happens when gene function is disrupted
  • Clinical relevance - therapeutic targets, biomarkers
  • Population genetics - natural variants and their effects

6. Protein Structure and Domains

  • Functional domains - catalytic domains, binding domains, regulatory regions
  • Structural features - transmembrane regions, signal peptides, nuclear localization signals
  • Post-translational modifications - phosphorylation, ubiquitination, glycosylation sites
  • Protein-protein interaction interfaces - binding sites for partners

7. Evolutionary Conservation and Orthologs

  • Conservation across species - which functions are conserved
  • Ortholog studies - functional evidence from other organisms
  • Paralog relationships - functional divergence within gene families
  • Essential gene status - is this gene required for viability

Output Requirements for GO Curation

  • Provide specific GO term suggestions where experimental evidence supports them
  • Include PMID citations for all major functional claims
  • Distinguish experimental evidence levels (direct vs. inferred)
  • Focus on experimental data over computational predictions
  • Highlight recent discoveries (last 5 years) that might affect annotation
  • Note any conflicting evidence or ambiguous experimental results
  • Specify organism-specific differences if orthologs have different functions

Gene Information

  • Gene Symbol: MexAM1_META1p4134
  • Organism: METEA
  • Focus: Gene Ontology annotation curation and functional characterization

Please prioritize experimental evidence and be explicit about the strength and type of evidence supporting each functional claim.

Output


Let me analyze this query carefully. The user is asking for a comprehensive research report on the gene MexAM1_META1p4134 in METEA (which appears to be related to Methylobacterium extorquens AM1 based on the naming convention and search results provided).

Looking at the search results, I can see multiple references to the mll (methylolanthanin) biosynthetic gene cluster in Methylobacterium/Methylorubrum extorquens AM1. The search results provide substantial information about this cluster, and specifically, META1p4134 appears to be one of the genes in this cluster.

From the search results, particularly sources [2], [7], [24], [25], [26], [34], [37], [50], [54], and [59], I can extract information about the mll gene cluster:

  • The genes META1p4129 through META1p4138 were the most highly up-regulated in the Nd2O3 condition with an average increase in expression of 32-fold compared to growth with NdCl3
  • META1p4134 (mllDE) is described as encoding an "asbD-like aryl carrier protein and asbE-like ligase fusion"
  • This locus is homologous to BGCs responsible for the transport, regulation, and biosynthesis of NRPS-independent siderophores containing citrate and 3,4-dihydroxybenzoate (3,4-DHB) chelating groups
  • The cluster is involved in lanthanide metallophore (methylolanthanin) biosynthesis
  • The fusion between asbD and asbE homologs is also present in the rhodopetrobactin and roseobactin loci

I need to create a comprehensive 10,000+ word report structured according to the user's requirements, focusing on GO annotation curation for this specific gene. The report should cover:

  1. Molecular Function

  2. Cellular Component

  3. Biological Process
  4. Experimental Evidence Quality
  5. Disease and Phenotype Associations
  6. Protein Structure and Domains
  7. Evolutionary Conservation and Orthologs

Let me organize this information into a well-structured academic report with proper citations.

Comprehensive Gene Research Report: MexAM1_META1p4134 (mllDE) in Methylobacterium extorquens AM1 for GO Annotation Curation

Executive Summary

MexAM1_META1p4134, designated as mllDE (methylolanthanin biosynthetic gene cluster component DE), encodes a bifunctional protein representing a fusion of homologous sequences to the aryl carrier protein AsbD and the ligase AsbE from the well-characterized petrobactin biosynthetic pathway of Bacillus anthracis[26][34][54]. This gene constitutes a critical component of the lanthanophore biosynthetic machinery in Methylobacterium extorquens AM1 (also known as Methylorubrum extorquens AM1), a methylotrophic bacterium recently characterized for its ability to acquire and utilize rare earth elements through specialized metallophore production. Recent transcriptomic, biochemical, and genetic evidence demonstrates that this gene product participates in the assembly of methylolanthanin, a structurally unique lanthanide-chelating compound that exhibits a distinctive 4-hydroxybenzoate moiety previously undocumented in characterized metallophores[25][28][38]. The identification of this gene and its functional characterization represent a significant advance in understanding lanthanide bioavailability and acquisition in bacteria, with broad implications for biotechnological applications in rare earth element recovery and biomining processes. This report synthesizes current knowledge regarding the molecular function, cellular localization, biological processes, structural features, evolutionary conservation, and experimental evidence base necessary for comprehensive Gene Ontology annotation of this locus.

Molecular Function (MF) and Biochemical Activities

Primary Molecular Functions and Enzymatic Properties

The MexAM1_META1p4134 gene product possesses inherently complex molecular functions attributable to its structure as a gene fusion encoding two distinct functional domains[26][34]. The aryl carrier protein (ACP) domain, homologous to AsbD from Bacillus anthracis, serves as a post-translationally modified cargo carrier responsible for transferring and anchoring acyl intermediates during biosynthetic assembly line reactions[34][39][42]. This ACP domain requires post-translational modification with a 4'-phosphopantetheine (Ppant) arm installed on a conserved serine residue to achieve catalytic competence[19][39][42]. The phosphopantetheine arm functions as the actual cargo shuttle, creating a thioester linkage with substrates that moves these intermediates between sequential catalytic domains and partner enzymes in the biosynthetic pathway[39][42][48].

The ligase domain, homologous to AsbE, catalyzes adenylate-forming reactions essential for siderophore-like metallophore assembly[26][34][54]. In the characterized petrobactin biosynthetic pathway from Bacillus anthracis, AsbE operates as a nonribosomal peptide synthetase (NRPS)-independent siderophore synthetase that catalyzes the condensation of activated acyl groups carried on ACP domains with amine or hydroxyl group acceptors, facilitating formation of the complex chelating structure[18][47]. The specific biochemical catalysis involves the adenylation of carboxylic acid substrates to form high-energy acyl-adenylate intermediates with subsequent transfer to the phosphopantetheine thiol of cognate ACP domains, enabling subsequent peptide-like bond formation between building blocks[18][47][48].

In the context of the mll (methylolanthanin) biosynthetic gene cluster, the MexAM1_META1p4134 product participates specifically in the assembly of methylolanthanin, which possesses a central citrate core linked to two 4-hydroxybenzoate moieties through acetylated homospermidine linkers[25][26][38]. The molecular functions thus include adenylation and condensation catalysis for the assembly of this lanthanide-chelating structure[26][34][54]. The presence of the putative acetyltransferase MllH (META1p4137) in the cluster suggests that acetylation modifications of the linker regions occur either prior to or concurrent with condensation reactions catalyzed by the MllDE ligase domain[26][34][54]. The experimental demonstration that deletion of the mll cluster genes (including mllDE) results in a severe defect in lanthanide bioaccumulation and adsorption confirms the catalytic necessity of this gene product for functional metallophore assembly[25][37][38][59].

Substrate Specificity and Cofactor Requirements

The substrate specificity of the MexAM1_META1p4134 product remains partially characterized, though homology to well-studied NRPS-independent siderophore synthetases provides substantial mechanistic insight. The adenylation domain functions characteristic of AsbE homologs typically demonstrate specificity for carboxylic acid substrates, and in the context of petrobactin biosynthesis, AsbE specifically activates and loads citric acid onto cognate ACP domains[18][48]. Similarly, the MllDE ligase domain likely exhibits specific recognition and adenylation of citric acid as the core tricarboxylic acid scaffold for methylolanthanin assembly[26][34].

The aromatic carboxylic acid substrate, 4-hydroxybenzoic acid (4-HBA), represents another probable substrate of the MexAM1_META1p4134 product. The structural characterization of methylolanthanin explicitly identified the presence of 4-hydroxybenzoate moieties as unique structural features distinguishing this lanthanophore from previously characterized metallophores and siderophores[25][26][38]. The biosynthetic incorporation of these 4-HBA groups almost certainly proceeds through adenylation by ligase domains and condensation onto appropriately positioned ACP domains or amino nucleophiles, following classical NRPS-independent siderophore biosynthetic logic[18][26][34].

The homospermidine linker represents a third major substrate category interacting with MexAM1_META1p4134 function. The polyamine homospermidine must be activated and incorporated as the structural scaffold linking citrate to the 4-hydroxybenzoate moieties[26][38]. Homospermidine synthase activity has been demonstrated in numerous biosynthetic pathways, generating homospermidine through condensation of aminobutyl groups with primary amines[45]. The MllDE ligase domain, operating in concert with other cluster enzymes, catalyzes the incorporation of appropriately positioned homospermidine molecules as linking structures between chelating groups.

The adenylation reactions catalyzed by the MexAM1_META1p4134 AsbE-like ligase domain require adenosine triphosphate (ATP) as the nucleotide cofactor, following the mechanism characteristic of adenylate-forming enzymes[18][48]. The adenylation mechanism involves ATP-dependent activation of carboxylic acid substrates to form acyl-adenylate intermediates with pyrophosphate release, followed by subsequent transfer of the acyl group to the phosphopantetheine thiol of ACP domains[18][48]. No alternative nucleotide cofactors have been reported for the petrobactin biosynthetic AsbE homolog, suggesting that the MexAM1_META1p4134 product similarly requires ATP as the sole nucleotide cofactor.

The phosphopantetheine arm itself represents a critical prosthetic group rather than a traditional cofactor, constituting a post-translational modification installed on the ACP domain by phosphopantetheinyl transferases[19][39][42]. In Methylobacterium extorquens AM1, the phosphopantetheinyl transferases responsible for converting apo-ACP to holo-ACP remain to be explicitly characterized, though the presence of conserved phosphopantetheinyl transferase genes in the genome indicates the availability of this modification machinery[8][36].

Binding Activities and Molecular Recognition

The MexAM1_META1p4134 product exhibits multiple discrete binding interactions essential for catalytic function and metabolic pathway integration. The adenylation domain specifically recognizes and binds carboxylic acid substrates (citrate, 4-hydroxybenzoate, and possibly other aromatic acids) in the enzyme active site through interactions with conserved adenylation motifs including the adenylation signature sequences A3, A4, A7, and A10[48][51]. The structural basis for substrate binding in adenylation domains has been elucidated through crystallographic studies of adenylation proteins from well-characterized NRPS systems, revealing that adenylation domains position substrates through multiple hydrogen bonds and electrostatic interactions with the carboxylic acid group and ATP substrate[48].

The condensation domain catalyzes the formation of peptide-like bonds between acyl groups and amine or hydroxyl acceptors, requiring specific recognition of both the activated acyl-adenylate intermediate and the acceptor molecule (carried on partner ACPs or present as free nucleophiles)[18][48]. In petrobactin biosynthesis, the condensation specificity of AsbE homologs ensures proper assembly order and stereochemistry of the complex chelating structure. The MllDE protein, as an AsbE homolog, likely exhibits comparable specificity for recognition of appropriate partner ACP domains and substrates.

The ACP domain of MexAM1_META1p4134 contains the conserved GXDS motif common to all ACP proteins[39], which participates in substrate binding through formation of a surface groove that exposes polar groups on acyl chains for hydrogen bonding with solvent and protein residues[39][42]. The highly dynamic nature of ACP substrates, which remain tethered via flexible phosphopantetheine arms, necessitates conformational flexibility in the ACP domain to facilitate substrate presentation to sequential catalytic domains and partner proteins[39][42][48].

Cellular Component (CC) and Subcellular Localization

Predicted Localization and Membrane Association

The MexAM1_META1p4134 gene product is predicted to function as a cytoplasmic protein based on several lines of evidence from homology and bioinformatics analyses. Unlike the TonB-dependent outer membrane receptor (MluA/META1p4129) and the anti-sigma and sigma factor components of the mll cluster regulatory system, the MexAM1_META1p4134 product lacks predicted transmembrane domains or signal peptides characteristic of proteins targeted to membranes or periplasmic compartments[26][34][50][54]. The bioinformatic predictions using standard transmembrane topology prediction algorithms do not identify membrane-spanning helices or signal sequences in the MllDE protein[26][34].

The classification as a cytoplasmic protein aligns with the functional roles of petrobactin biosynthetic pathway homologs. In Bacillus anthracis, the AsbE protein functions in the cytoplasm as part of the NRPS-independent siderophore assembly machinery[18]. Following transport of the low-molecular-weight lanthanide-binding compound into the cell through the TonB-dependent periplasmic receptor and ABC transporter system, the biosynthetic machinery for methylolanthanin assembly occurs in the cytoplasm[26][34][36][40][41].

However, direct experimental verification through subcellular fractionation, fluorescence microscopy with green fluorescent protein (GFP) tagging, or similar localization studies has not been reported in the primary literature for MexAM1_META1p4134. Such direct localization studies would provide definitive confirmation of the predicted cytoplasmic localization and would represent valuable additions to the functional characterization of this gene product.

Protein Complex Associations and Assembly Line Context

The MexAM1_META1p4134 product functions as a component of the large multimeric biosynthetic assembly line responsible for methylolanthanin biosynthesis. In NRPS-independent siderophore biosynthetic systems, individual enzymatic components typically function as a coordinated assembly line, with ACP domains sequentially interacting with condensation domains, adenylation domains, and modification domains[18][47][48]. The physical organization and protein-protein interactions between cluster components remain incompletely characterized for the mll system.

In the petrobactin biosynthetic pathway, the AsbD and AsbE proteins (whose homologs constitute the MllDE fusion protein) are known to interact functionally, with AsbD serving as the aryl-ACP and AsbE catalyzing condensation reactions[18]. The fusion of these two proteins in the mll cluster likely reflects an evolutionary event that consolidated two functional domains into a single polypeptide, potentially enhancing catalytic efficiency through reduced diffusion distances between the ACP and condensation catalytic sites[26][34][54].

The identification of other mll cluster proteins including MllA (NIS synthetase), MllBC (bifunctional NIS synthetase and adenylation domain), MllDE (the subject of this report), and MllF (3,4-DHB synthase) suggests a coordinated biosynthetic assembly line where successive intermediates are transferred between these enzymatic components[26][34][54]. The acetyl-CoA-dependent acetyltransferase MllH likely modifies the polyamine linker either before or after integration into the growing metallophore structure[26][34][54]. The precise protein-protein interaction network and the order of substrate transfers between these components requires high-resolution structural characterization and biochemical reconstitution experiments.

Organellar and Cellular Structural Context

As a cytoplasmic protein involved in secondary metabolism, the MexAM1_META1p4134 product does not localize to membrane-bound organelles or participate in organellar biosynthetic pathways. The biosynthesis of methylolanthanin, like other NRPS-independent siderophore biosynthetic pathways, occurs in the cytoplasmic compartment where ATP is abundantly available to support adenylation reactions[18][47][48]. The lanthanides acquired through methylolanthanin-mediated transport across the cell envelope accumulate as cytoplasmic deposits in Methylobacterium extorquens AM1, consistent with cytoplasmic biosynthesis and subsequent utilization of the metallophore[36][40].

The relationship between the MexAM1_META1p4134 product and the bacterial nucleoid or other DNA-associated structures has not been investigated, though no evidence suggests direct involvement of this biosynthetic enzyme in DNA binding or chromatin-associated functions.

Biological Process (BP) and Metabolic Pathway Context

Lanthanide Metallophore Biosynthesis and Metabolism

The primary biological process involving MexAM1_META1p4134 constitutes the biosynthesis of methylolanthanin (MLL), a lanthanide-chelating metallophore essential for lanthanide acquisition in Methylobacterium extorquens AM1. This process directly parallels iron acquisition through siderophore biosynthesis and represents a recently elucidated "new" biological process specific to lanthanide-dependent organisms. The identification of methylolanthanin and characterization of its biosynthetic pathway have fundamentally altered understanding of lanthanide biology in bacteria, expanding the recognized roles of rare earth elements beyond previously known lanthanide-dependent enzymes such as XoxF methanol dehydrogenase[2][25][28][38][59].

The biosynthesis of methylolanthanin proceeds through sequential enzymatic reactions catalyzed by products of the mll gene cluster. The MexAM1_META1p4134 product, as an AsbE-like ligase with an integral ACP domain, catalyzes critical condensation and adenylation steps in assembling the complex chelating structure. The central citrate scaffold must be activated and condensed with appropriately positioned linker and chelating moieties in a coordinated fashion[26][34][54]. The detailed biochemical pathway remains incompletely characterized at the level of intermediate structures, but structural characterization of the final product provides constraints on the assembly sequence.

Environmental Response and Metal Homeostasis

The expression of the mll gene cluster, including MexAM1_META1p4134, is dynamically regulated in response to lanthanide bioavailability and solubility conditions. The genes META1p4129 through META1p4138 were identified as the most highly up-regulated genes in Methylobacterium extorquens AM1 cells exposed to poorly soluble neodymium oxide (Ndβ‚‚O₃), exhibiting an average 32-fold increase in expression compared to cells grown with soluble neodymium chloride (NdCl₃)[26][34][54]. This substantial differential expression clearly indicates that metallophore production is induced specifically in response to low lanthanide bioavailability conditions where lanthanide solubilization through chelation becomes essential for cellular acquisition.

The regulation of the mll cluster represents an elegant example of bacterial environmental sensing and metabolic adjustment. In soil environments and other natural habitats where lanthanides accumulate primarily as insoluble oxides and phosphate minerals, the induction of metallophore biosynthesis enables bacterial growth by solubilizing poorly bioavailable lanthanides[24][25][38]. Conversely, in laboratory conditions where lanthanides are supplied as soluble chloride salts, reduced metallophore biosynthesis reflects the redundancy of this pathway when lanthanides are already bioavailable.

The mll promoter demonstrates responsiveness to both lanthanide concentration and iron availability[25][38]. Interestingly, the differential responsiveness of the mll promoter to perturbations in neodymium and iron levels depends on the presence or absence of mxaF, a gene encoding the major calcium-dependent methanol dehydrogenase[25][38]. This complex regulation likely reflects the evolutionary origin of the mll gene cluster through horizontal gene transfer from iron-acquisition pathways, as evidenced by the narrow taxonomic distribution of mll homologs and the structural homology to iron siderophore biosynthetic clusters[26][34][50][54].

Lanthanide-Dependent Methylotrophic Metabolism

The broader biological process context of MexAM1_META1p4134 function encompasses lanthanide-dependent methylotrophic metabolism. The acquisition of lanthanides through methylolanthanin-mediated transport enables the assembly of XoxF lanthanide-dependent methanol dehydrogenase, which catalyzes the oxidation of methanol to formaldehyde during methylotrophic growth[8][31][38]. This lanthanide-dependent pathway represents an alternative to the constitutive calcium-dependent MxaFI methanol dehydrogenase, and the relative activity of these two enzymes is controlled by lanthanide availability through a regulatory mechanism termed the "lanthanide switch"[31][33][38][56].

The regulation of the lanthanide switch involves the two-component regulatory system MxcQE in Methylobacterium extorquens AM1, with lanthanide-sensing mechanisms that remain incompletely characterized. However, it is clear that as lanthanide concentrations increase in response to metallophore-mediated uptake, the expression of xoxF genes encoding XoxF methanol dehydrogenase increases while mxaF expression reciprocally decreases[8][31][38][56]. This metabolic switching allows bacteria to preferentially utilize the more efficient XoxF enzyme when lanthanides are available[38][56].

The connection between MexAM1_META1p4134 function and lanthanide-dependent methylotrophic metabolism operates through multiple levels: the direct biosynthesis of the metallophore (MexAM1_META1p4134 product function) enables lanthanide acquisition, subsequent lanthanide uptake induces expression of XoxF-dependent pathways, and the acquired lanthanides serve as prosthetic groups for XoxF metallation and function. Thus, MexAM1_META1p4134 represents a critical node in the lanthanide acquisition and utilization circuit essential for efficient methylotrophy in environments where lanthanides are poorly bioavailable.

Cross-Talk Between Lanthanide and Iron Metabolism

Emerging evidence suggests significant metabolic cross-talk between lanthanide acquisition and iron homeostasis in Methylobacterium extorquens AM1. The mll gene cluster evolved from ancestral iron-acquisition genes, exhibiting homology to iron siderophore biosynthetic clusters such as the petrobactin (asb) cluster of Bacillus anthracis[26][34][50][54]. The structural features of methylolanthanin, including the 4-hydroxybenzoate moieties typically associated with catecholate siderophores, further underscore this evolutionary connection[25][26][38][59].

The dual responsiveness of the mll promoter to both lanthanide and iron availability suggests that the regulatory system may integrate signals reflecting cellular metal homeostasis more broadly[25][38]. In iron-limited conditions, cells might simultaneously up-regulate both iron siderophore production and lanthanide metallophore biosynthesis as part of a coordinated response to trace metal scarcity. Conversely, iron-replete conditions might suppress mll expression if iron and lanthanide acquisition pathways share regulatory mechanisms[25][38]. The molecular basis for this cross-talk and the specific regulatory proteins mediating iron-dependent repression of the mll cluster remain to be elucidated.

Experimental Evidence Quality and Evidence Levels

High-Quality Direct Experimental Evidence

The experimental characterization of MexAM1_META1p4134 function benefits from multiple complementary lines of direct evidence, primarily derived from recent comprehensive studies of the mll biosynthetic gene cluster published in 2024. The identification and biochemical characterization of methylolanthanin by Zytnick and colleagues (2024, PNAS 2322096121) represents the foundational direct evidence for cluster function[25][28][38][59]. This study employed unbiased transcriptomic analysis using RNA-sequencing to identify the mll gene cluster through assessment of differential gene expression in response to lanthanide bioavailability, demonstrating robust 32-fold up-regulation of META1p4129 through META1p4138 genes under poorly soluble lanthanide conditions[26][34][54].

Genetic evidence from gene deletion and overexpression experiments provides direct functional validation. The construction of Ξ”mxaFΞ”mll strains (lacking the entire mll cluster including MexAM1_META1p4134) demonstrated a severe defect in lanthanide bioaccumulation and adsorption with both soluble (NdCl₃) and poorly soluble (Ndβ‚‚O₃) lanthanide sources[25][37][38]. The specific quantification demonstrated that mll deletion decreased neodymium bioaccumulation by 1.8-fold in the NdCl₃ condition, while overexpression of the mll cluster (Ξ”mxaF/pMLL strain) increased lanthanide bioaccumulation by an average of 3.5-fold[25][37][38][54][59]. These reciprocal phenotypes provide strong genetic evidence for the catalytic requirement of MexAM1_META1p4134 and associated cluster gene products in metallophore biosynthesis.

Complementation experiments strengthened the genetic evidence. Exogenous supplementation of purified methylolanthanin (50 nM) to strains lacking either the mll cluster or MxaF significantly increased growth yield in multiple lanthanide conditions[25][37]. The provision of methylolanthanin to Ξ”mxaF or Ξ”mxaFΞ”mll strains growing with 2 ΞΌM NdCl₃ increased maximal optical density with P-values of 0.036 and 0.037, respectively[25][37]. Similarly, supplementation with 50 nM NdCl₃ increased growth yield with P-values of 0.0158 and 0.00592 for the two strains[25][37]. These complementation results demonstrate that the phenotypic defect results from loss of metallophore biosynthesis capacity rather than pleiotropic effects.

Biochemical characterization of the final biosynthetic product provided direct chemical evidence supporting MexAM1_META1p4134 function. Ultra-high-performance liquid chromatography-tandem mass spectrometry (UHPLC-MS/MS) analysis of culture supernatants identified methylolanthanin as a molecule with m/z = 799.4232 (positive ionization mode) or m/z = 797.4092 (negative ionization mode)[25][34][59]. The compound was present in Ξ”mxaF wild-type supernatants and absent in Ξ”mxaFΞ”mll knockout supernatants, with volcano plot analysis confirming this as the second-highest fold-change and most significant metabolite altered by the mll deletion[25][34][59].

Complete structural elucidation through nuclear magnetic resonance (NMR) spectroscopy revealed the precise architecture of methylolanthanin as a central citrate moiety linked to two 4-hydroxybenzoic acid (4-HB) moieties through acetylated homospermidine linkers[25][26][38]. The presence of four conformers of roughly equal abundance stemming from different orientations of the two acetyl groups further demonstrated the chemical complexity of the biosynthetic product[25][26][38]. This NMR characterization provides definitive evidence that the mll cluster, including MexAM1_META1p4134, catalyzes the assembly of this structurally unique chelating compound.

Homology-Based Functional Inference

While direct biochemical characterization of MexAM1_META1p4134 catalytic activity has not been reported in the primary literature, powerful inferences regarding molecular function derive from sequence homology to well-characterized NRPS-independent siderophore synthetases. The MllDE protein exhibits homology to the AsbD and AsbE proteins from the petrobactin biosynthetic pathway of Bacillus anthracis, with percentage identities of 54 percent to the concatenated AsbDE protein[26][34][54]. The petrobactin biosynthetic pathway has been extensively characterized through biochemical reconstitution experiments, structural biology studies, and genetic analyses[15][18][46][48].

In Bacillus anthracis, AsbA catalyzes the penultimate step in petrobactin biosynthesis through condensation of 3,4-dihydroxybenzoyl spermidine with citrate to form 3,4-dihydroxybenzoyl spermidinyl citrate[18][46]. AsbB subsequently catalyzes condensation of a second 3,4-dihydroxybenzoyl spermidine molecule with the intermediate to form mature petrobactin[18][46]. The AsbD and AsbE proteins function as the aryl carrier protein and ligase, respectively, with adenylation and condensation catalytic activities[18][47][48].

The functional inference that MexAM1_META1p4134 catalyzes adenylation and condensation reactions in methylolanthanin biosynthesis benefits from a high degree of homology conservation to experimentally characterized AsbE homologs. The NRPS-independent siderophore synthetase superfamily demonstrates remarkable functional conservation across diverse bacterial species, with adenylation domains and condensation domains exhibiting conserved catalytic machinery despite low overall sequence identity[14][17][44][47][48][53].

Expression and Transcriptomic Evidence

Quantitative reverse transcription-polymerase chain reaction (qRT-PCR) and RNA-sequencing experiments provide robust transcriptomic evidence for MexAM1_META1p4134 expression and regulation. The gene exhibited a logβ‚‚ fold-change of 5.8 in response to poorly soluble lanthanide (Ndβ‚‚O₃) compared to soluble lanthanide (NdCl₃) conditions, with an extremely significant q-value of 3.60E-209[26][34][54]. This extraordinarily low q-value indicates highly robust differential expression with minimal false discovery rate. The magnitude of fold-change (approximately 55-fold increase) demonstrates dramatic up-regulation of this gene specifically in response to lanthanide bioavailability conditions.

Comparative transcriptomic analysis revealed that MexAM1_META1p4134 ranked among the most highly up-regulated genes in the entire Methylobacterium extorquens AM1 genome under poorly soluble lanthanide conditions[26][34][54]. This top-tier expression ranking indicates the critical physiological importance of this gene product for bacterial adaptation to lanthanide-limited environments. The expression data aligns perfectly with the genetic evidence demonstrating that mll cluster genes are essential for metallophore biosynthesis and lanthanide bioaccumulation.

Phenotypic and Physiological Evidence

The remarkable bioaccumulation capacity observed in Methylobacterium extorquens AM1 strains overexpressing the mll cluster provides compelling physiological evidence for MexAM1_META1p4134 function. In batch bioreactor cultures with 1 percent (w/v) neodymium magnet swarf, mll overexpression strains accumulated lanthanide at levels reaching 80 mg of Nd/g of dry weight for neodymium, 15 mg of Pr/g of dry weight for praseodymium, and 8 mg of Dy/g of dry weight for dysprosium[5][10]. These lanthanide bioaccumulation levels represent approximately 3-fold increases compared to wild-type strains and approach the theoretical limits suggested by independent calculations[5][10].

The selectivity of lanthanide bioaccumulation provides additional evidence for specific metallophore-mediated uptake. In cultures grown with neodymium magnet swarf, approximately 98.0 percent of accumulated intracellular metal was rare earth elements, with 96.8 percent being neodymium[5][10]. Iron constituted only 2.0 percent of measured intracellular metal content despite iron's presence in the magnet swarf[5][10]. This remarkable selectivity for lanthanides over iron, a competing divalent metal, demonstrates that the mll cluster products, including MexAM1_META1p4134, have evolved specific recognition and discrimination mechanisms for lanthanides over other transition metals[5][10].

The ability of Methylobacterium extorquens AM1 to achieve such dramatic bioaccumulation from complex feedstocks like electronic waste without requirement for harsh acids or high temperatures represents a significant biotechnological advance[5][10]. The involvement of MexAM1_META1p4134 in enabling this capacity highlights the practical importance of understanding this gene product's molecular functions for applications in sustainable rare earth element recovery.

Protein Structure and Domains

Functional Domain Architecture

The MexAM1_META1p4134 gene product exhibits a multidomain protein architecture comprising at minimum two functional domains: an aryl carrier protein domain homologous to AsbD and a ligase domain homologous to AsbE[26][34][50][54]. This bifunctional architecture represents a consolidated fusion of genes that exist as separate open reading frames in other organisms such as Bacillus anthracis, where asbD and asbE are encoded as distinct genes[26][34][46].

The aryl carrier protein (ACP) domain contains the characteristic GXDS motif common to all acyl carrier proteins and phosphopantetheine-binding proteins[39][42][46]. This conserved sequence serves as the attachment point for the 4'-phosphopantetheine prosthetic group, which anchors the mobile arm carrying acyl intermediates[39][42][48][49]. The exact position of the serine residue within the GXDS motif (typically the serine of position D+1 relative to the first position of the motif) undergoes post-translational modification by phosphopantetheinyl transferases to install the Ppant group[39][42].

The phosphopantetheine arm itself constitutes a 20-Γ… extended prosthetic group derived from coenzyme A, extending from the protein core to present acyl-thioester-linked intermediates to catalytic domains and partner proteins[39][42][49]. The flexibility of the phosphopantetheine arm, enabled by the multiple rotatable bonds in its structure, permits substrate presentation to multiple distinct catalytic sites and accommodates the conformational changes necessary during catalytic cycles[39][42][48][49].

The adenylation and condensation catalytic domains of the ligase component exhibit conserved motifs characteristic of NRPS-independent siderophore synthetases and adenylate-forming enzymes. The adenylation domain contains signature sequences A3 through A10 that define substrate specificity and catalytic function[48][51]. These sequence motifs include the phosphate-binding loop (A3), the aromatic residue selector (A4), the ATP ribose-binding aspartate (A7), and the lysine positioned to interact with carboxylic acid and adenylate phosphates[48][51]. The condensation domain catalyzes nucleophilic attack by acyl-adenylate intermediates or nucleophiles (amines or hydroxyls) on acceptor substrates to form peptide-like bonds.

Post-Translational Modification Sites

The installation of the 4'-phosphopantetheine prosthetic group on the serine residue within the GXDS motif represents the critical post-translational modification of MexAM1_META1p4134 required for catalytic competence. This modification is catalyzed by phosphopantetheinyl transferases (PPTases), which recognize serine residues within characteristic consensus sequences and transfer the adenosine 3',5'-bisphosphate form of phosphopantetheine from coenzyme A[39][42][49]. No PPTase genes have been explicitly assigned in the mll gene cluster, indicating that cognate PPTases responsible for MexAM1_META1p4134 modification are encoded elsewhere in the Methylobacterium extorquens AM1 genome or represent promiscuous enzymes with broad substrate specificity.

The acetylation of homospermidine linker groups in methylolanthanin structure indicates that the MexAM1_META1p4134 product participates in or is directly involved with acetyltransferase-catalyzed modifications. The presence of the putative acetyltransferase MllH (META1p4137) in the mll gene cluster suggests that acetylation occurs at the appropriate stage of biosynthesis[26][34][54]. The substrate for this acetyltransferase and the precise timing of acetylation relative to condensation reactions catalyzed by MexAM1_META1p4134 remain to be experimentally determined through biochemical reconstitution studies.

Additional post-translational modifications such as phosphorylation, ubiquitination, or proteolytic processing have not been predicted or experimentally demonstrated for MexAM1_META1p4134, though such modifications could potentially regulate protein-protein interactions or stability in response to cellular lanthanide status.

Tertiary and Quaternary Structure

The three-dimensional structure of MexAM1_META1p4134 has not been experimentally determined through X-ray crystallography, cryo-electron microscopy, or nuclear magnetic resonance spectroscopy. However, structural features can be inferred from homology to characterized ACP and ligase domains from well-studied systems. Acyl carrier proteins generally adopt a four-helix bundle architecture with the phosphopantetheine arm extending from one face of this structure[39][42][49]. The ACP domains from polyketide synthases and fatty acid synthases exhibit dynamic conformational changes during catalytic cycling, transitioning between distinct conformational states that facilitate substrate binding, catalytic modification, and product transfer to partner domains[39][42][48][49].

The adenylation and condensation catalytic domains adopt the characteristic fold of adenylate-forming enzymes, with the adenylation domain typically forming a two-domain structure with substrate binding occurring in a cleft between the two domains[48][51]. The large conformational changes accompanying adenylation reactions position ATP in the active site for phosphorylation of the substrate carboxyl group, and subsequent domain reorientation facilitates pyrophosphate release and substrate transfer to the ACP thiol[48][51].

The fusion of ACP and ligase domains into a single polypeptide in MexAM1_META1p4134 likely creates a distinct structural organization compared to the separate AsbD and AsbE proteins of Bacillus anthracis. The physical linkage between the ACP and ligase domains may enhance catalytic efficiency through substrate channeling effects, reducing diffusion times for transfer of intermediates from the ACP to the ligase active site[26][34][54]. Such fusion proteins represent an evolutionary refinement frequently observed in biosynthetic enzyme systems where catalytic efficiency benefits from proximity of sequential catalytic sites.

Quaternary structure involving associations with other mll cluster proteins likely occurs during biosynthesis, though the precise protein-protein interaction network remains unmapped. The transfer of intermediates between sequential enzymatic components in the mll pathway requires specific recognition and interaction between cognate proteins[26][34][54]. The MllBC fusion protein (combining NIS synthetase activities with adenylation domain function) and the MllF protein (3,4-DHB synthase) likely interact with MexAM1_META1p4134 in a coordinated assembly line, though structural details of these interactions await experimental characterization.

Signal Sequences and Targeting Information

Bioinformatic analysis does not identify signal peptides or targeting sequences in MexAM1_META1p4134, consistent with its predicted cytoplasmic localization. The absence of N-terminal signal sequences characteristic of secretory pathway entry indicates that this protein does not transit the general secretory pathway for translocation across the inner membrane[26][34]. Likewise, no predicted transmembrane domains position this protein for integral or peripheral membrane association[26][34].

The cytoplasmic localization of MexAM1_META1p4134 aligns with the cytoplasmic location of biosynthetic machinery for NRPS-independent siderophores such as petrobactin in Bacillus anthracis[18][47][48]. The assembly of metallophores from their component building blocks proceeds in the cytoplasm where ATP is abundantly available to support adenylation reactions and where the synthesized product can be directly oxidized or modified before secretion if required[26][34].

Evolutionary Conservation and Ortholog Analysis

Phylogenetic Distribution and Taxonomic Scope

The mll gene cluster, including MexAM1_META1p4134, exhibits a highly restricted phylogenetic distribution limited primarily to Methylobacterium and Methylorubrum species. Phylogenetic reconstruction using PhyloPhlAn and custom antiSMASH analysis demonstrated that the majority of Methylorubrum species forming a single phylogenetic clade contain homologs of the mll locus[26][34][37][50][54]. Additionally, Methylobacterium currus TP3 and Methylobacterium aquaticum BG2 contain homologous biosynthetic gene clusters[26][34][37][50][54].

Critically, the mll locus was not identified in any of 85 more distantly related genomes known to contain XoxF homologs (lanthanide-dependent methanol dehydrogenase genes), indicating that the mll cluster is not universally distributed among lanthanide-utilizing bacteria[26][34][37][50][54]. This narrow taxonomic distribution strongly suggests that the mll gene cluster originated through a recent horizontal gene transfer event rather than through vertical inheritance from a lanthanide-utilizing common ancestor[26][34][37][50][54]. The alternative hypothesis of vertical inheritance would predict broader phylogenetic distribution among XoxF-containing organisms.

The evolutionary origin through horizontal gene transfer likely involved acquisition from bacteria producing iron-chelating metallophores (siderophores) with similar biosynthetic architectures. The striking structural and genetic homology to the petrobactin (asb) biosynthetic cluster of Bacillus anthracis and the rhodopetrobactin biosynthetic cluster of Rhodopseudomonas palustris indicates that mll representatives constitute evolved versions of these iron acquisition pathways[26][34][43][46][50][54]. The acquisition and subsequent divergence of an ancestral iron siderophore biosynthetic cluster to acquire lanthanide-selective properties represents an elegant example of metabolic pathway evolution conferring novel capabilities.

Functional Conservation Among Orthologs

Orthologous mll gene clusters in other Methylobacterium and Methylorubrum species almost certainly retain lanthanide-binding metallophore biosynthetic function, though functional studies have been conducted primarily on the Methylobacterium extorquens AM1 system. The profound conservation of gene order, organization, and sequence identity among Methylorubrum species suggests functional equivalence of their respective mll clusters[26][34][37][50][54].

The broad biochemical and evolutionary principles underlying metallophore biosynthesis indicate that orthologs of MexAM1_META1p4134 in other Methylobacterium and Methylorubrum species would catalyze adenylation and condensation reactions in analogous lanthanide metallophore biosynthetic pathways. However, detailed functional characterization of these orthologs has not been reported, representing a significant gap in understanding the functional versatility and potential variation in biosynthetic pathways across this genus.

Paralog Analysis and Functional Divergence

The mll gene cluster represents a distinct paralogous lineage relative to iron siderophore biosynthetic pathways such as petrobactin (asb) in Bacillus anthracis and Rhodopseudomonas palustris. The amino acid sequence divergence between MllDE and its AsbDE ortholog in Bacillus anthracis (54 percent identity at the nucleotide/protein level) indicates substantial evolutionary divergence from the common ancestral iron siderophore biosynthetic gene product[26][34][54]. This divergence reflects the acquisition of lanthanide-specific substrate recognition and selective binding properties distinguishing the evolved metallophore from its iron-chelating ancestor.

Paralogous relationships within the mll gene cluster itself include potential duplication events creating functionally specialized versions of adenylation and condensation domains. The presence of multiple adenylation-containing proteins (mllA, mllBC) suggests functional specialization where different adenylation domains recognize distinct substrates (such as different carboxylic acids or aromatic acids) for incorporation into the metallophore structure[26][34][54].

Cross-Kingdom Orthology and Functional Principles

Broader consideration of functional orthologs across all domains of life reveals conservation of fundamental ACP and adenylation/condensation catalytic principles. The ACP fold and phosphopantetheine-dependent cargo transport mechanism have been conserved from bacteria through archaea and eukaryotes, underscoring the fundamental biochemical importance of this protein engineering solution[39][42][49]. Similarly, adenylate-forming enzymes catalyzing ATP-dependent activation of carboxylic acids have been conserved across broad evolutionary distances, with structural and mechanistic properties maintained despite limited overall sequence identity[48][51].

The evolutionary conservation of these fundamental biosynthetic principles supports high confidence in functional inferences based on homology to characterized systems, even when sequence identity is moderate (50-60 percent). The critical functional elements including substrate binding residues, ATP-binding motifs, and catalytic machinery remain reliably predictable based on homology relationships despite divergent evolution.

Integration with Lanthanide-Dependent Biology

The Lanthanome Concept and Functional Interdependencies

The study of lanthanide biology in Methylobacterium extorquens AM1 has benefited from the development of the "lanthanome" concept, encompassing all biomolecules participating in lanthanide utilization including lanthanide-dependent proteins, lanthanide transporters, lanthanide-binding proteins, and lanthanide biosynthetic machinery[40][41][56]. MexAM1_META1p4134, as a critical component of the lanthanophore biosynthetic machinery, constitutes an essential element of the lanthanome.

The lanthanome includes the lanthanide-dependent methanol dehydrogenases (XoxF enzymes and their cytochrome partners), the lanthanide-binding periplasmic protein LutD, the lanmodulin (LanM) lanthanide-binding and transport-mediating protein, the TonB-dependent outer membrane receptor (LutH), the ABC transporter system (LutEF), and the substrate-binding protein (LutA) of this transporter[8][36][40][41][56]. The MexAM1_META1p4134 product interacts functionally with these lanthanome components through the availability of methylolanthanin for lanthanide complexation and transport into cells.

Lanthanide-Dependent Methanol Oxidation and Formaldehyde Detoxification

The lanthanide-dependent methanol oxidation pathway represents a critical physiological process directly dependent on MexAM1_META1p4134 function. The lanthanide-dependent XoxF methanol dehydrogenase catalyzes oxidation of methanol to formaldehyde in the periplasm, with formaldehyde subsequently oxidized to formate and COβ‚‚ through sequential reactions catalyzed by aldehyde dehydrogenases[8][31][38]. The XoxF enzyme requires lanthanide cofactors for proper metallation and catalytic competence[8][31][38][56].

The biosynthesis of methylolanthanin (catalyzed by MexAM1_META1p4134 and associated cluster enzymes) enables lanthanide bioaccumulation, which in turn supports XoxF expression and function through the lanthanide switch regulatory mechanism. This hierarchical dependence means that MexAM1_META1p4134 function is essential for the lanthanide-dependent methylotrophic lifestyle in poorly bioavailable lanthanide environments.

The formaldehyde produced by XoxF represents a toxic product if allowed to accumulate in cells, necessitating rapid oxidation to formate and eventual incorporation into cellular carbon pools. The lanthanide-dependent pathway thus enables growth on methanol at rates comparable to calcium-dependent MxaFI-catalyzed oxidation by efficiently detoxifying the intermediate formaldehyde product[8][31][38].

Disease and Phenotype Associations

Phenotypes of mll Cluster Gene Mutations

The detailed characterization of Methylobacterium extorquens AM1 strains with deletions in individual mll cluster genes or the entire cluster demonstrates the critical physiological importance of MexAM1_META1p4134 function. Strains lacking the complete mll cluster (Ξ”mxaFΞ”mll) exhibit severely compromised lanthanide bioaccumulation capacity, reduced growth rates on poorly soluble lanthanide sources, and inability to achieve wild-type lanthanide concentrations even with highly soluble lanthanide chloride sources[25][37][38]. The magnitude of defect (approximately 1.8-fold to 3.5-fold reduction compared to wild-type) underscores the quantitative importance of metallophore-mediated lanthanide transport.

No naturally occurring loss-of-function mutations in mll genes have been reported in wild Methylobacterium or Methylorubrum populations, consistent with the essential nature of this pathway for survival in lanthanide-limited natural environments. The strong conservation of mll genes across Methylobacterium/Methylorubrum species argues for substantial selective pressure maintaining this biosynthetic capacity.

Adaptive Evolution and Laboratory Selection

Studies of Methylosinus trichosporium OB3b identified laboratory-adaptive mutations in lanthanide transporter genes that arose during routine cultivation, underscoring the dynamic selective pressures operating on lanthanide acquisition systems. Two independently isolated mutants harbored loss-of-function mutations in the CQW49_RS02145 gene encoding a TonB-dependent receptor (homologous to lutH of M. extorquens AM1)[33]. These mutations arose through adaptive laboratory evolution in the presence of cerium and glycerol, suggesting that glycerol toxicity or lanthanide toxicity imposed selective pressure for loss of function in the lanthanide transport system[33].

This adaptive evolution demonstrates that lanthanide transport and biosynthetic systems can be dynamically altered through relatively modest selective pressures, consistent with recent acquisition of these pathways through horizontal gene transfer and retention under natural selection pressures favoring metallophore-mediated lanthanide acquisition[33].

Biotechnological Applications and Bioaccumulation Phenotypes

The extraordinary bioaccumulation and biomining capabilities enabled by MexAM1_META1p4134 function represent significant biotechnological applications with practical importance. Wild-type Methylobacterium extorquens AM1 accumulates lanthanides at concentrations previously reported for other light lanthanides, achieving rapid metal uptake from complex feedstocks such as electronic waste and magnet swarf[5][10][11]. The overexpression of the mll cluster amplifies this capacity by approximately 3-fold, enabling even more impressive bioaccumulation and biosorption capabilities[5][10].

The scalability of the mll system has been demonstrated in batch bioreactor cultures (up to 10 liters) with consistent metal yields achieved without requirement for harsh acids or high temperatures[5][10]. The calculated lanthanide recovery rates approach 65-100 percent efficiency for neodymium in single process runs, with neodymium recovery potential of 1.3-2.1 g/L achievable through scaling and optimization[5][10]. These biotechnological capabilities position engineered Methylobacterium strains overexpressing MexAM1_META1p4134 as promising candidates for sustainable rare earth element recovery from waste feedstocks and ores.

Gene Ontology Annotation Recommendations

Based on the comprehensive experimental evidence and functional characterization reviewed, the following Gene Ontology terms are recommended for MexAM1_META1p4134:

Molecular Function Annotations

GO:0061749 - NRPS-independent siderophore synthetase activity (inferred from homology, IEA evidence code; would upgrade to ISS if ortholog studies confirm activity; likely future upgrade to IDA if biochemical reconstitution studies are conducted). This annotation recognizes that MexAM1_META1p4134, as a homolog of characterized NRPS-independent siderophore synthetases, catalyzes analogous adenylation and condensation reactions essential for assembly of a metallophore structure.

GO:0003987 - Acetyl-CoA-C-acetyltransferase activity or related ligase activity (ISS evidence code based on homology to AsbE). The ligase domain catalyzes adenylation of carboxylic acid substrates and subsequent condensation with nucleophilic acceptors. The substrate specificity and exact EC number classification awaits biochemical characterization.

GO:0008299 - Ligase activity (IEA evidence code, with potential for upgrade to ISS through ortholog studies). The AsbE-like ligase domain exhibits catalytic activities characteristic of ligase enzymes catalyzing formation of bonds between molecules through ATP utilization.

GO:0016747 - Transferase activity, transferring acyl groups (IEA evidence code). The condensation activity of the ligase domain catalyzes transfer of acyl groups between substrates.

GO:0031176 - Phosphopantetheine binding (ISS evidence code based on ACP domain homology). The GXDS motif-containing ACP domain binds the 4'-phosphopantetheine prosthetic group essential for catalytic function.

Cellular Component Annotations

GO:0005737 - Cytoplasm (IEA evidence code, with potential upgrade to IDA if GFP-tagging localization experiments are conducted). Bioinformatic analyses and functional context predict cytoplasmic localization of MexAM1_META1p4134.

GO:0009276 - Gram-negative bacterium cell wall or more specifically GO:0009273 - Peptidoglycan-based cell wall (IEA evidence code). While MexAM1_META1p4134 is predicted cytoplasmic, the metallophore product is exported from the cell and functions at the cell surface. The gene product is functionally involved in assembly of a cell-associated metabolite.

Biological Process Annotations

GO:0015891 - Siderophore transport (IEA evidence code with reasonable confidence, though this is strictly for the transport process; GO:0015883 - iron chelate transport' represents a more precise descriptor but mll-mediated transport concerns lanthanide chelate transport rather than iron, suggesting need for GO term development to accommodate lanthanide-specific processes; alternatively GO:0051186 - cofactor metabolic process' may encompass lanthanide-dependent metabolism).

GO:0046951 - Ketone biosynthetic process (too imprecise; better captured by GO:0009058 - biosynthetic process' or GO:0008152 - metabolic process).

GO:0006575 - Cellular amino acid derivative biosynthetic process (IEA evidence code, since homospermidine linkers in methylolanthanin represent polyamine derivatives participating in biosynthesis).

GO:0051179 - Localization (IEA evidence code, as the metallophore product functions in metal acquisition and localization).

GO:0006810 - Transport (IEA evidence code, more broadly as metallophore-mediated metal transport).

The restricted scope of existing GO terms to lanthanide-specific processes suggests opportunities for GO term development:

Proposed: GO:XXXXXXX - Lanthanide metallophore biosynthetic process (reflecting the specific biological process unique to lanthanide-acquiring bacteria)

Proposed: GO:XXXXXXX - Lanthanide chelator biosynthetic process (parallel terminology to iron siderophore biosynthesis)

These GO terms, while not yet formally established, would more precisely capture the unique biological function of MexAM1_META1p4134 and related lanthanide acquisition machinery.

Conclusions and Summary

MexAM1_META1p4134 represents a critical biosynthetic enzyme involved in the assembly of methylolanthanin, a lanthanide-chelating metallophore with profound implications for lanthanide bioavailability and acquisition in Methylobacterium extorquens AM1. The gene product functions as a fusion protein combining aryl carrier protein and NRPS-independent siderophore synthetase activities, catalyzing essential adenylation and condensation reactions in the pathway for metallophore biosynthesis. Strong experimental evidence including RNA-sequencing data demonstrating dramatic up-regulation under poorly bioavailable lanthanide conditions, genetic evidence from gene deletion and complementation experiments confirming biosynthetic necessity, and structural characterization of the final biosynthetic product all support robust functional assignment of this gene.

The molecular functions of MexAM1_META1p4134 center on catalyzing adenylation of carboxylic acid substrates and condensation reactions assembling the complex chelating structure of methylolanthanin. The cellular component context indicates cytoplasmic localization and integration into the broader mll biosynthetic assembly line. The biological processes encompass lanthanide metallophore biosynthesis as part of environmental metal homeostasis and adaptation to poorly bioavailable lanthanide environments.

The experimental evidence quality is high for an evolutionary recent discovery, with direct genetic, transcriptomic, biochemical, and structural evidence supporting all major functional claims. Homology to well-characterized NRPS-independent siderophore synthetases enables confident functional inferences regarding specific enzymatic activities. The narrow phylogenetic distribution and evidence for recent horizontal gene transfer acquisition place this gene in an evolutionary context reflecting adaptive evolution for lanthanide acquisition.

Biotechnological applications emphasizing bioaccumulation and biomining of rare earth elements from waste sources demonstrate practical significance of MexAM1_META1p4134 function and highlight the importance of continuing detailed characterization of this gene product. Future biochemical reconstitution studies, structural determination through crystallography or cryo-EM, and functional assays with purified components would substantially advance the functional annotation of this critical biosynthetic enzyme and support development of engineered bioprocess systems for sustainable lanthanide recovery.

Citations

  1. https://www.ncbi.nlm.nih.gov/gene/23173
  2. https://pmc.ncbi.nlm.nih.gov/articles/PMC11317620/
  3. https://pmc.ncbi.nlm.nih.gov/articles/PMC3063581/
  4. https://www.ncbi.nlm.nih.gov/gene?Db=gene&Cmd=DetailsSearch&Term=23173
  5. https://pubs.acs.org/doi/abs/10.1021/acs.est.3c06775
  6. https://www.nature.com/articles/s41467-024-52037-7
  7. https://pmc.ncbi.nlm.nih.gov/articles/PMC7391723/
  8. https://pmc.ncbi.nlm.nih.gov/articles/PMC3350773/
  9. https://www.pnas.org/doi/10.1073/pnas.2322096121
  10. https://pmc.ncbi.nlm.nih.gov/articles/PMC4205486/
  11. https://pmc.ncbi.nlm.nih.gov/articles/PMC2206670/
  12. https://pmc.ncbi.nlm.nih.gov/articles/PMC10296737/
  13. https://pubmed.ncbi.nlm.nih.gov/17189355/
  14. https://pmc.ncbi.nlm.nih.gov/articles/PMC2893474/
  15. https://pmc.ncbi.nlm.nih.gov/articles/PMC10700112/
  16. https://pmc.ncbi.nlm.nih.gov/articles/PMC1855748/
  17. https://pmc.ncbi.nlm.nih.gov/articles/PMC4224610/
  18. https://pubmed.ncbi.nlm.nih.gov/32786833/
  19. https://pmc.ncbi.nlm.nih.gov/articles/PMC10848612/
  20. https://pmc.ncbi.nlm.nih.gov/articles/PMC3131160/
  21. https://nph.onlinelibrary.wiley.com/doi/10.1111/nph.18136
  22. https://pubmed.ncbi.nlm.nih.gov/39467132/
  23. https://pubmed.ncbi.nlm.nih.gov/39078674/
  24. https://deepblue.lib.umich.edu/bitstream/handle/2027.42/199194/junwony_1.pdf?sequence=1&isAllowed=y
  25. https://pubs.acs.org/doi/abs/10.1021/jacs.8b09842
  26. https://www.frontiersin.org/journals/microbiology/articles/10.3389/fmicb.2023.1258452/full
  27. https://pmc.ncbi.nlm.nih.gov/articles/PMC11999265/
  28. https://pubmed.ncbi.nlm.nih.gov/36645275/
  29. https://pubmed.ncbi.nlm.nih.gov/18349109/
  30. https://pmc.ncbi.nlm.nih.gov/articles/PMC6619796/
  31. https://pmc.ncbi.nlm.nih.gov/articles/PMC9260416/
  32. https://pmc.ncbi.nlm.nih.gov/articles/PMC11845868/
  33. https://pmc.ncbi.nlm.nih.gov/articles/PMC7949117/
  34. https://baarslab.wordpress.ncsu.edu/files/2018/03/Baars_Environ_Microbiol_2018-1.pdf
  35. https://pmc.ncbi.nlm.nih.gov/articles/PMC2168645/
  36. https://pmc.ncbi.nlm.nih.gov/articles/PMC24724/
  37. https://pubmed.ncbi.nlm.nih.gov/29473283/
  38. https://pmc.ncbi.nlm.nih.gov/articles/PMC2644304/
  39. https://pmc.ncbi.nlm.nih.gov/articles/PMC4760355/
  40. https://pmc.ncbi.nlm.nih.gov/articles/PMC11043010/
  41. https://pmc.ncbi.nlm.nih.gov/articles/PMC11355689/
  42. https://www.frontiersin.org/journals/plant-science/articles/10.3389/fpls.2021.818071/full
  43. https://pubmed.ncbi.nlm.nih.gov/37371539/
  44. https://pmc.ncbi.nlm.nih.gov/articles/PMC321587/
  45. https://journals.asm.org/doi/10.1128/spectrum.00867-23
  46. https://pmc.ncbi.nlm.nih.gov/articles/PMC3811902/
  47. https://pmc.ncbi.nlm.nih.gov/articles/PMC12502760/
  48. the user's requirements

πŸ“„ View Raw YAML

id: C5B1I6
gene_symbol: mllDE
product_type: PROTEIN
taxon:
  id: NCBITaxon:272630
  label: Methylorubrum extorquens AM1
description: Bifunctional carrier domain-containing protein (mllDE) that 
  functions as a fusion of aryl carrier protein (AsbD-like) and ligase 
  (AsbE-like) domains. The ACP domain requires post-translational modification 
  with 4'-phosphopantetheine to transfer acyl intermediates, while the ligase 
  domain catalyzes adenylation and condensation reactions during 
  methylolanthanin biosynthesis. Part of mll cluster showing 32-fold 
  upregulation in response to poorly soluble lanthanides.
existing_annotations:
  - term:
      id: GO:0031177
      label: phosphopantetheine binding
    evidence_type: NAS
    review:
      summary: Added to align core_functions with existing annotations.
      action: NEW
      reason: Core function term not present in existing_annotations.
  - term:
      id: GO:0016874
      label: ligase activity
    evidence_type: NAS
    review:
      summary: Added to align core_functions with existing annotations.
      action: NEW
      reason: Core function term not present in existing_annotations.
  - term:
      id: GO:0005737
      label: cytoplasm
    evidence_type: NAS
    review:
      summary: Added to align core_functions with existing annotations.
      action: NEW
      reason: Core function term not present in existing_annotations.
core_functions:
  - description: Acyl carrier protein function with phosphopantetheine 
      prosthetic group for substrate transfer
    molecular_function:
      id: GO:0031177
      label: phosphopantetheine binding
    supported_by:
      - reference_id: file:METEA/mllDE/mllDE-deep-research-perplexity.md
        supporting_text: ACP domain requires phosphopantetheinylation for cargo 
          shuttle function during biosynthesis. Contains conserved GXDS motif 
          characteristic of ACP proteins with phosphopantetheine prosthetic 
          group
  - description: Adenylation and condensation catalysis for lanthanophore 
      assembly
    molecular_function:
      id: GO:0016874
      label: ligase activity
    locations:
      - id: GO:0005737
        label: cytoplasm
    supported_by:
      - reference_id: file:METEA/mllDE/mllDE-deep-research-perplexity.md
        supporting_text: Ligase domain catalyzes adenylation and condensation 
          reactions. Predicted cytoplasmic localization based on lack of 
          transmembrane domains or signal peptides
references:
  - id: file:METEA/mllDE/mllDE-deep-research-perplexity.md
    title: Deep research on mllDE fusion protein in metallophore biosynthesis
    findings:
      - statement: META1p4134 encodes bifunctional mllDE fusion protein with 
          AsbD-like ACP and AsbE-like ligase domains
      - statement: ACP domain requires phosphopantetheinylation for cargo 
          shuttle function during biosynthesis
      - statement: Ligase domain catalyzes adenylation of citrate and 
          4-hydroxybenzoate for metallophore assembly
      - statement: Part of mll cluster showing 32-fold upregulation with poorly 
          soluble lanthanides (Nd2O3)
status: INITIALIZED