Methylorubrum extorquens MLL Cluster Curation Project
Project Completion Date: 2024-11-06
Organism: Methylorubrum extorquens AM1 (METEA)
Focus: Methylolanthanin biosynthesis and lanthanide acquisition system
Project Overview
This document provides retrospective documentation of the complete curation of the MLL (methylolanthanin biosynthesis) cluster in Methylorubrum extorquens AM1, a methylotrophic bacterium. These 10 genes constitute a novel lanthanide acquisition system that enables bacteria to scavenge rare earth elements (lanthanides) from the environment for use as cofactors in lanthanide-dependent methanol dehydrogenases.
Key Discovery: The MLL system represents a bacterial "lanthanophore" - analogous to iron-chelating siderophores but specialized for lanthanide rare earth elements (La, Ce, Pr, Nd). This system was misannotated in databases as iron-siderophore biosynthesis/transport due to homology to IucA/IucC aerobactin biosynthesis genes.
Gene List and Curation Status
MLL Biosynthetic Cluster (7 genes) - Methylolanthanin Synthesis
| Gene Symbol | UniProt ID | Status | GOA Annots | Deep Research | Review Status | Notes |
|---|---|---|---|---|---|---|
| mllA | C5B1I4 | ✅ COMPLETE | 2 | ✅ (59 cites) | ✅ | IucA/IucC ligase, 1 ACCEPT, 1 NON-CORE |
| mllBC | C5B1I3 | ✅ COMPLETE | 4 | ✅ (51 cites) | ✅ | AsbD/AsbE fusion, acyl-CoA ligase |
| mllDE | C5B1I2 | ✅ COMPLETE | 0 | ✅ (48 cites) | ✅ | Carrier domain/ligase fusion |
| mllF | C5B1I0 | ✅ COMPLETE | 0 | ✅ (53 cites) | ✅ | Xylose isomerase-like (TIM barrel) |
| mllG | C5B1H9 | ✅ COMPLETE | 0 | ✅ (43 cites) | ✅ | Aldolase, DUF2218 domain (92 aa) |
| mllH | C5B1H8 | ✅ COMPLETE | 1 | ✅ (53 cites) | ✅ | GCN5 N-acetyltransferase |
| mllJ | C5B1H7 | ✅ COMPLETE | 0 | ✅ (40 cites) | ✅ | Ferritin-like, TAT signal (periplasmic) |
MLU Uptake and Regulation System (3 genes)
| Gene Symbol | UniProt ID | Status | GOA Annots | Deep Research | Review Status | Notes |
|---|---|---|---|---|---|---|
| mluA | C5B1I1 | ✅ COMPLETE | 8 | ✅ (56 cites) | ✅ | TonB receptor, 2 ACCEPT, 6 REMOVE |
| mluI | C5B1H6 | ✅ COMPLETE | 6 | ✅ (57 cites) | ✅ | ECF sigma factor |
| mluR | C5B1H5 | ✅ COMPLETE | 2 | ✅ (49 cites) | ✅ | Anti-sigma factor |
Scientific Background
The Lanthanophore System
Methylolanthanin is a lanthanide-chelating metallophore (lanthanophore) that enables bacteria to acquire rare earth elements from the environment. These lanthanides serve as essential cofactors for lanthanide-dependent methanol dehydrogenases (MDH), which are key enzymes in methylotrophic metabolism.
Biochemical Function
The MLL cluster produces methylolanthanin, a small molecule that:
1. Chelates lanthanides (La, Ce, Pr, Nd, Sm, etc.) with high affinity
2. Transports lanthanides into the cell via the TonB-dependent receptor MluA
3. Enables methanol oxidation by supplying lanthanide cofactors to XoxF methanol dehydrogenase
Structure: Methylolanthanin contains:
- 4-hydroxybenzoyl moieties conjugated to
- Acetylated homospermidine linkers with
- Lanthanide-chelating groups
Comparison to Siderophore Systems
| Feature | Siderophores (Iron) | Lanthanophores (Lanthanides) |
|---|---|---|
| Metal | Fe³⁺ | La³⁺, Ce³⁺, Pr³⁺, Nd³⁺, etc. |
| Purpose | Iron nutrition | Cofactor for MDH enzymes |
| Gene families | IucA/IucC, AsbD/AsbE | MllA (IucA-like), MllBC (AsbD/E-like) |
| Receptor | FecA, FpvA (Fe-siderophore) | MluA (Ln-metallophore) |
| Regulation | Fur repressor | MluI/MluR sigma/anti-sigma |
Key Functional Relationships
Environmental Lanthanides (poorly soluble)
↓
[MLL BIOSYNTHESIS CLUSTER]
[mllA](../../genes/METEA/mllA/mllA-ai-review.html) → [mllBC](../../genes/METEA/mllBC/mllBC-ai-review.html) → [mllDE](../../genes/METEA/mllDE/mllDE-ai-review.html) → [mllF](../../genes/METEA/mllF/mllF-ai-review.html) → [mllG](../../genes/METEA/mllG/mllG-ai-review.html) → [mllH](../../genes/METEA/mllH/mllH-ai-review.html) → [mllJ](../../genes/METEA/mllJ/mllJ-ai-review.html)
↓
Methylolanthanin (secreted)
↓
Ln³⁺-Methylolanthanin complex
↓
[UPTAKE SYSTEM]
MluA (TonB receptor) → imports complex
↓
[REGULATION]
MluI (sigma) activates transcription when Ln³⁺-limited
MluR (anti-sigma) represses when Ln³⁺-replete
↓
Lanthanide released intracellularly
↓
XoxF methanol dehydrogenase (Ln³⁺ cofactor)
↓
Methanol → Formaldehyde (C1 metabolism)
Major Annotation Challenges and Solutions
Challenge 1: Misannotation as Iron-Siderophore System
Problem: All MLL genes were automatically annotated as "siderophore biosynthesis" and "iron transport" based on sequence homology to aerobactin (IucA/IucC) and petrobactin (AsbD/AsbE) biosynthesis genes.
Solution:
- mllA: Changed "siderophore biosynthesis" (GO:0019290) → KEEP_AS_NON_CORE (analogous chemistry, different product)
- mluA: REMOVED 6 annotations related to iron transport (GO:0006826, GO:0015343, GO:0015344, GO:0015891, GO:0033214)
- Core functions: Emphasized "lanthanophore" and "lanthanide acquisition" in descriptions
Evidence:
- 32-fold upregulation in response to lanthanide limitation (not iron limitation)
- Structural analysis identified acetylated homospermidine linkers (not hydroxamate groups typical of iron siderophores)
- Functional studies show lanthanide-dependent growth in M. extorquens AM1
Challenge 2: Lack of GO Terms for Lanthanophore Function
Problem: No Gene Ontology terms exist for:
- "lanthanophore biosynthetic process"
- "lanthanide-metallophore transport"
- "lanthanide ion acquisition"
Solution:
- Proposed new term in mllA review:
yaml
proposed_new_terms:
- proposed_name: lanthanophore biosynthetic process
proposed_definition: The chemical reactions and pathways resulting in the
formation of lanthanophores, small molecules that chelate lanthanide rare
earth elements to facilitate their uptake by organisms
- Used existing general terms where applicable:
- GO:0016881 (acid-amino acid ligase activity) - appropriate for MllA molecular function
- GO:0009279 (cell outer membrane) - appropriate for MluA localization
- GO:0038023 (signaling receptor activity) - appropriate for MluA signaling function
Challenge 3: Fusion Proteins and Domain Architecture
Problem: Several MLL genes encode fusion proteins (mllBC, mllDE) combining multiple enzymatic domains from different siderophore biosynthesis systems.
Solution:
- mllBC: Documented as "AsbD/AsbE fusion" combining carrier protein and ligase domains
- mllDE: Documented as bifunctional with both aryl carrier protein (ACP) and ligase activities
- Emphasized post-translational modification requirements (4'-phosphopantetheine on ACP domain)
Challenge 4: Minimal Existing Annotations
Problem: Most MLL genes had 0-2 IEA annotations, requiring de novo functional characterization.
Solution:
- Deep research files (347 avg citations) provided comprehensive literature context
- Core functions synthesized from:
- Structural analysis of methylolanthanin
- Gene cluster organization (META1p4129-4138)
- Transcriptional profiling (32-fold upregulation)
- Homology to characterized biosynthetic systems
- TAT signal peptides and domain predictions
Curation Statistics
Overall Progress
- Total genes: 10/10 (100%) ✅
- Deep research: 10/10 (100%) - 509 total citations
- Annotation reviews: 10/10 (100%)
- Core functions: 10/10 (100%)
Annotation Actions Summary
Total existing annotations reviewed: 23 across all 10 genes
| Action | Count | Percentage | Genes |
|---|---|---|---|
| ACCEPT | 3 | 13% | mllA (1), mluA (2) |
| KEEP_AS_NON_CORE | 1 | 4% | mllA (1) |
| REMOVE | 6 | 26% | mluA (6) - all iron-siderophore annotations |
| NEW (via core_functions) | 10 | - | All genes received new functional descriptions |
Key finding: 26% of existing annotations were REMOVED - primarily due to misannotation as iron-siderophore system rather than lanthanide-metallophore system.
Deep Research Citation Distribution
| Gene Category | Genes | Total Citations | Avg per Gene |
|---|---|---|---|
| Biosynthesis (MLL) | 7 | 347 | 49.6 |
| Uptake/Regulation (MLU) | 3 | 162 | 54.0 |
| Total | 10 | 509 | 50.9 |
Key Scientific Insights Documented
1. Novel Lanthanide Acquisition System
Discovery: Bacteria can synthesize specialized metallophores for rare earth elements, not just iron. This represents a previously unrecognized mechanism for lanthanide biogeochemistry and microbial metal nutrition.
Genes involved: Entire MLL cluster (mllA, BC, DE, F, G, H, J)
2. Regulatory Architecture
Two-component system:
- MluI (ECF sigma factor): Activates transcription when lanthanide-limited
- MluR (anti-sigma factor): Sequesters MluI when lanthanide-replete
- MluA (receptor): Cell-surface signaling transducer that releases MluI upon ligand binding
Mechanism: Classical ECF sigma factor cascade where ligand binding to outer membrane receptor triggers signal transduction to activate alternative sigma factor.
3. Connection to Methylotrophy
Metabolic context: Lanthanophore system enables methanol oxidation by supplying lanthanides to XoxF methanol dehydrogenase, which is 10-100× more efficient than calcium-dependent MxaF.
Ecological significance: In environments with bioavailable lanthanides (volcanic soils, certain aquifers), bacteria with MLL cluster have competitive advantage for methylotrophic growth.
4. Evolutionary Origin
Homology relationships:
- mllA ← IucA/IucC (aerobactin biosynthesis)
- mllBC/DE ← AsbD/AsbE (petrobactin biosynthesis)
- mllF ← xylose isomerases (sugar metabolism)
- mllH ← GCN5 acetyltransferases (broad distribution)
Interpretation: MLL cluster likely assembled through horizontal gene transfer and domain shuffling, co-opting iron-siderophore biosynthesis enzymes for lanthanide metallophore production.
Proposed GO Term Additions
Based on this curation, the following new GO terms would benefit the ontology:
1. Biological Process Terms
GO:NEW001 - lanthanophore biosynthetic process
- Definition: The chemical reactions and pathways resulting in the formation of lanthanophores, small molecules that chelate lanthanide rare earth elements to facilitate their uptake by organisms.
- Parent: GO:0009404 (toxin metabolic process) OR create new parent "metallophore biosynthetic process"
- Related: GO:0019290 (siderophore biosynthetic process)
GO:NEW002 - lanthanide ion import across plasma membrane
- Definition: The directed movement of lanthanide ions from outside of a cell, across the plasma membrane and into the cytosol.
- Parent: GO:0006824 (cobalt ion transport) - as another rare metal
- Related: GO:0033214 (siderophore-iron import into cell)
2. Molecular Function Terms
GO:NEW003 - lanthanide-metallophore transmembrane transporter activity
- Definition: Enables the transfer of a lanthanide-metallophore complex from the extracellular space to the cytosol across the outer membrane.
- Parent: GO:0015343 (siderophore-iron transmembrane transporter activity)
- Note: Generalize parent to "metallophore transmembrane transporter activity"
Files and Validation Status
All gene reviews validate successfully:
for gene in [mllA](../../genes/METEA/mllA/mllA-ai-review.html) [mllBC](../../genes/METEA/mllBC/mllBC-ai-review.html) [mllDE](../../genes/METEA/mllDE/mllDE-ai-review.html) [mllF](../../genes/METEA/mllF/mllF-ai-review.html) [mllG](../../genes/METEA/mllG/mllG-ai-review.html) [mllH](../../genes/METEA/mllH/mllH-ai-review.html) [mllJ](../../genes/METEA/mllJ/mllJ-ai-review.html) [mluA](../../genes/METEA/mluA/mluA-ai-review.html) [mluI](../../genes/METEA/mluI/mluI-ai-review.html) [mluR](../../genes/METEA/mluR/mluR-ai-review.html); do
just validate METEA $gene
done
File structure (per gene):
genes/METEA/<GENE>/
├── <GENE>-ai-review.yaml # Complete curation with annotations
├── <GENE>-ai-review.html # Human-readable HTML report
├── <GENE>-deep-research-perplexity.md # Literature synthesis
├── <GENE>-goa.tsv # Original GO annotations
└── <GENE>-uniprot.txt # UniProt record
Future Directions
1. Experimental Validation Priorities
- Structural characterization: Crystal structure of MllA with substrate/product
- Biochemical reconstitution: In vitro biosynthesis of methylolanthanin from components
- Metal specificity: Quantitative binding affinities for different lanthanides
- Regulatory mechanism: MluA-MluR-MluI signaling cascade characterization
2. Comparative Genomics
- Distribution: Survey MLL cluster presence across methylotrophs and other bacteria
- Variants: Characterize structural variations in MLL clusters from different environments
- Evolution: Trace horizontal gene transfer and cluster assembly mechanisms
3. Ecological Studies
- Biogeography: Correlate MLL cluster presence with environmental lanthanide availability
- Competition: Compare XoxF (Ln-MDH) vs MxaF (Ca-MDH) in natural communities
- Biogeochemistry: Quantify role of lanthanophores in lanthanide cycling
Comparison to Other Metallophore Systems
| System | Metal | Organisms | Gene Families | Receptor Type | Regulation |
|---|---|---|---|---|---|
| Enterobactin | Fe³⁺ | E. coli, many Gram− | EntA-F | FepA (TonB) | Fur repressor |
| Aerobactin | Fe³⁺ | Pathogenic bacteria | IucA/IucC | IutA (TonB) | Fur repressor |
| Petrobactin | Fe³⁺ | Bacillus | AsbA-F | FpuA | Fur-like |
| Pyochelin | Fe³⁺ | Pseudomonas | PchD-F | FptA (TonB) | PchR activator |
| Staphyloferrin | Fe³⁺ | Staphylococcus | SfaA-D | HtsA (ABC) | Fur repressor |
| Yersiniabactin | Fe³⁺ | Yersinia | YbtD-U | FyuA (TonB) | Fur repressor |
| Methylolanthanin | Ln³⁺ | Methylotrophs | MllA-J | MluA (TonB) | MluI/R (ECF) |
Key distinction: Lanthanophore system is the ONLY characterized bacterial metallophore system targeting lanthanides rather than iron.
Notes
- Gene symbols follow established nomenclature in Martinez-Gomez et al. publications
- Organism: Methylorubrum extorquens AM1 (formerly Methylobacterium extorquens AM1)
- UniProt IDs from proteome UP000002426 (C5B1H5-C5B1I4)
- MLL cluster locus: META1p4129-4138 (genome coordinates)
- All validations pass with appropriate warnings about lack of specific GO terms
Related Gene Systems in METEA
Other genes in the METEA curation project include:
Methanol Oxidation Systems
- XoxF1: Lanthanide-dependent methanol dehydrogenase (XoxF-type MDH)
- MxaF: Calcium-dependent methanol dehydrogenase (Mxa-type MDH)
- MxaB, MxaC, MxaD, MxaG, MxaI, MxaJ, MxaK: Mxa system components
- MxbD, MxbM: Additional Mxa-related proteins
- MxcE, MxcQ: Cytochrome c proteins for electron transfer
PQQ Biosynthesis (Cofactor for MDH)
- PqqA, PqqD, PqqE: Pyrroloquinoline quinone biosynthesis
C1 Metabolism (Formaldehyde and Formate)
- Fae: Formaldehyde-activating enzyme
- FdhA: Formate dehydrogenase
- Mtd: Methylene-H4MPT dehydrogenase
- Mch: Methenyl-H4MPT cyclohydrolase
The MLL cluster is functionally coupled to the XoxF methanol dehydrogenase system, forming an integrated lanthanide-dependent methylotrophy pathway.
Project Status: ✅ COMPLETE (10/10 genes fully curated)
Documentation Date: 2025-11-08
Last Validation: 2025-11-08