NCGR_LOCUS27674 encodes a predicted 714-amino acid protein annotated as UMP-CMP kinase (EC 2.7.4.14) in Miscanthus lutarioriparius, a C4 perennial grass. However, this gene model is almost certainly a chimeric artifact resulting from the incorrect fusion of at least two separate genes during genome annotation: an N-terminal UMP-CMP kinase (adenylate kinase family, Pfam PF00406) and a C-terminal chalcone isomerase-fold fatty acid-binding protein (Pfam PF16035, positions 555-650). At 714 AA, the protein is 2-3.5x larger than any characterized UMP-CMP kinase (e.g. Arabidopsis 202 AA [PMID:9576794], rice YL2 351 AA [PMID:29392476]). The conflicting automated annotations - kinase activities from the ADK domain and fatty acid binding / intramolecular lyase from the CHI-fold domain - are explained by this fusion. The M. lutarioriparius genome has 68,328 predicted gene models [PMID:33911077], unusually high even for an allotetraploid, consistent with gene model artifacts from the EVidenceModeler pipeline. All 13 GO annotations are IEA with no experimental evidence. The annotations pertaining to the UMP-CMP kinase portion (N-terminal domain) are likely correct for that domain, while annotations from the CHI-fold domain (fatty acid binding, intramolecular lyase activity) are artifacts of the chimeric model.
| GO Term | Evidence | Action | Reason |
|---|---|---|---|
|
GO:0005504
fatty acid binding
|
IEA
GO_REF:0000118 |
REMOVE |
Summary: This annotation derives from TreeGrafter matching the C-terminal chalcone isomerase-fold domain (Pfam PF16035, Chalcone_2, positions 555-650) to PANTHER family PTHR47284 (FATTY-ACID-BINDING PROTEIN 2). The CHI-fold superfamily includes non-catalytic fatty acid-binding proteins (FAPs) that localize to plastids and participate in de novo fatty acid biosynthesis [PMID:22622584]. However, this annotation reflects a domain from what is almost certainly a chimeric gene model, not the true function of the UMP-CMP kinase that constitutes the N-terminal portion of this protein. The 714 AA size is 2-3.5x larger than any known UMP-CMP kinase, and no natural ADK-CHI domain fusion has been reported in any organism.
Reason: This annotation is an artifact of a chimeric gene model. The fatty acid binding function derives from the C-terminal CHI-fold/FAP domain (positions 555-650), which is a separate gene incorrectly fused with the N-terminal UMP-CMP kinase during genome annotation of M. lutarioriparius [PMID:33911077]. FAP proteins in the CHI superfamily do bind fatty acids [PMID:22622584], but this activity belongs to a distinct gene product, not a UMP-CMP kinase.
Supporting Evidence:
PMID:22622584
These three CHI-fold proteins localize to plastids, the site of de novo fatty-acid biosynthesis in plant cells. Furthermore, their expression profiles correlate with those of core fatty-acid biosynthetic enzymes
PMID:33911077
A total of 68,328 gene models were predicted, with an average CDS length of 1215 bp
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-notes.md
This protein is almost certainly a chimeric gene model artifact resulting from incorrect fusion of two separate genes during genome annotation.
|
|
GO:0005524
ATP binding
|
IEA
GO_REF:0000002 |
ACCEPT |
Summary: ATP binding is assigned via InterPro match to IPR000850 (Adenylate/UMP-CMP kinase). UMP-CMP kinases catalyze the phosphorylation of pyrimidine nucleoside monophosphates at the expense of ATP, so ATP binding is an inherent part of their catalytic mechanism. The UniProt record confirms ATP binding sites at positions 36-41, 149, and 192 via HAMAP-Rule MF_03172. This annotation is consistent with the N-terminal UMP-CMP kinase domain of this protein.
Reason: ATP binding is a core requirement for UMP-CMP kinase activity. The protein contains a P-loop NTPase domain with conserved ATP binding residues identified by HAMAP and InterPro. All characterized UMP-CMP kinases require ATP as the phosphate donor [PMID:9576794].
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Catalyzes the phosphorylation of pyrimidine nucleoside monophosphates at the expense of ATP.
PMID:9576794
the UMP/CMP kinase preferentially uses ATP
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-deep-research-manual.md
Core function: phosphorylation of UMP, CMP, and dCMP to their diphosphate forms using ATP as the phosphate donor.
|
|
GO:0016776
phosphotransferase activity, phosphate group as acceptor
|
IEA
GO_REF:0000002 |
ACCEPT |
Summary: Assigned via InterPro match to IPR006266 (UMP-CMP kinase). GO:0016776 is an ancestor of GO:0033862 (UMP kinase activity) in the GO hierarchy, so this is a broader but valid annotation for a UMP-CMP kinase. UMP-CMP kinases transfer a phosphate group from ATP to nucleoside monophosphate substrates (UMP, CMP, dCMP), which already carry a phosphate group, making them phosphotransferases with phosphate group as acceptor.
Reason: This is a valid parent term of the more specific UMP/CMP/dCMP kinase activities also annotated to this protein. While broader than the specific kinase activities (GO:0033862, GO:0036430, GO:0036431), it correctly captures the biochemical mechanism. The InterPro match to IPR006266 (UMP-CMP kinase family) is appropriate for the N-terminal domain.
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Reaction=CMP + ATP = CDP + ADP
|
|
GO:0016872
intramolecular lyase activity
|
IEA
GO_REF:0000002 |
REMOVE |
Summary: This annotation derives from InterPro match to IPR036298 (Chalcone isomerase superfamily), which is mapped to GO:0016872 via InterPro2GO. Chalcone isomerases catalyze intramolecular cyclization of chalcones to flavanones. However, this annotation is inappropriate for two reasons: (1) the CHI-fold domain in this protein likely belongs to the non-catalytic FAP (fatty acid-binding) subfamily, which lacks the catalytic residues for isomerase activity [PMID:22622584]; and (2) this domain is part of a chimeric gene model and belongs to a separate gene product. The InterPro2GO mapping from IPR036298 to GO:0016872 is overly broad, applying catalytic function to the entire superfamily including non-catalytic members.
Reason: This is a doubly erroneous annotation: the intramolecular lyase activity derives from the CHI superfamily InterPro match, but the CHI-fold domain in this protein is likely a FAP subfamily member that is non-catalytic. Furthermore, the CHI-fold domain (positions 555-650) belongs to a separate gene that was incorrectly fused with the UMP-CMP kinase in this chimeric gene model. A UMP-CMP kinase has no intramolecular lyase activity.
Supporting Evidence:
PMID:22622584
the FAP discovery defines the adaptive evolution of a stereospecific and catalytically 'perfected' enzyme from a non-enzymatic ancestor over a defined period of plant evolution.
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-notes.md
C-terminal (positions 555-650) Chalcone isomerase domain (Pfam PF16035, Chalcone_2) - functionally unrelated to the N-terminal UMP-CMP kinase domain.
|
|
GO:0019205
nucleobase-containing compound kinase activity
|
IEA
GO_REF:0000002 |
ACCEPT |
Summary: Assigned via InterPro match to IPR000850 (Adenylate/UMP-CMP kinase). GO:0019205 is an ancestor of GO:0033862 (UMP kinase activity) in the GO hierarchy. UMP-CMP kinases phosphorylate nucleoside monophosphates (nucleobase-containing compounds), so this broader annotation is valid.
Reason: This is a valid broader term for UMP-CMP kinase activity. The protein's N-terminal domain belongs to the adenylate kinase family UMP-CMP kinase subfamily, which phosphorylates nucleobase-containing compounds (UMP, CMP, dCMP). While less specific than GO:0033862/GO:0036430/GO:0036431, it is not incorrect.
Supporting Evidence:
PMID:9576794
both UMP (Km = 153 microM) and CMP (Km = 266 microM) were equally acceptable as the phosphate acceptor.
|
|
GO:0033862
UMP kinase activity
|
IEA
GO_REF:0000104 |
ACCEPT |
Summary: Assigned via UniRule UR000111469 based on shared sequence features with characterized UMP-CMP kinases. UMP kinase activity (catalysis of UMP + ATP = UDP + ADP) is a core function of the UMP-CMP kinase family. The UniProt record explicitly lists this catalytic activity with EC 2.7.4.14. This annotation applies to the N-terminal adenylate kinase domain.
Reason: UMP kinase activity is the primary molecular function of UMP-CMP kinases. The protein contains the conserved ADK domain (PF00406) and matches HAMAP rule MF_03172 for the UMP-CMP kinase subfamily. Confirmed in the rice ortholog YL2 [PMID:29392476] and Arabidopsis [PMID:9576794].
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Reaction=UMP + ATP = UDP + ADP
PMID:29392476
Prokaryotic UMP kinase activity was subsequently confirmed, with YL2 deficiency causing a significant reduction in chlorophyll accumulation and photochemical efficiency.
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-deep-research-manual.md
Plant UMP-CMP kinases catalyze the phosphorylation of pyrimidine nucleoside monophosphates (UMP, CMP, dCMP) using ATP.
|
|
GO:0036430
dCMP kinase activity
|
IEA
GO_REF:0000104 |
ACCEPT |
Summary: Assigned via UniRule UR000111469. dCMP kinase activity (catalysis of dCMP + ATP = dCDP + ADP) is a known activity of UMP-CMP kinases. The UniProt record explicitly lists this catalytic reaction.
Reason: dCMP kinase activity is a well-characterized function of the UMP-CMP kinase family. The catalytic reaction is explicitly listed in the UniProt record. This applies to the N-terminal kinase domain.
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Reaction=dCMP + ATP = dCDP + ADP
|
|
GO:0036431
CMP kinase activity
|
IEA
GO_REF:0000104 |
ACCEPT |
Summary: Assigned via UniRule UR000111469. CMP kinase activity (catalysis of CMP + ATP = CDP + ADP) is a core function of UMP-CMP kinases. The UniProt record explicitly lists this catalytic reaction.
Reason: CMP kinase activity is a core function of the UMP-CMP kinase family. Arabidopsis UMP/CMP kinase accepts both UMP and CMP as phosphate acceptors [PMID:9576794].
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Reaction=CMP + ATP = CDP + ADP
PMID:9576794
both UMP (Km = 153 microM) and CMP (Km = 266 microM) were equally acceptable as the phosphate acceptor.
|
|
GO:0006207
'de novo' pyrimidine nucleobase biosynthetic process
|
IEA
GO_REF:0000002 |
MODIFY |
Summary: Assigned via InterPro match to IPR006266 (UMP-CMP kinase). UMP-CMP kinases play an important role in de novo pyrimidine nucleotide biosynthesis by phosphorylating UMP to UDP. However, GO:0006207 specifically refers to pyrimidine nucleobase biosynthesis (the base itself), whereas UMP-CMP kinases act on nucleotides (nucleobase + sugar + phosphate). The more appropriate term is GO:0006221 (pyrimidine nucleotide biosynthetic process), which is already annotated.
Reason: UMP-CMP kinases act on pyrimidine nucleotides, not nucleobases. GO:0006207 is technically not the correct process for a nucleoside monophosphate kinase. GO:0006221 (pyrimidine nucleotide biosynthetic process) is more appropriate and already annotated.
Proposed replacements:
pyrimidine nucleotide biosynthetic process
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Plays an important role in de novo pyrimidine nucleotide biosynthesis. Has preference for UMP and CMP as phosphate acceptors.
|
|
GO:0006221
pyrimidine nucleotide biosynthetic process
|
IEA
GO_REF:0000120 |
ACCEPT |
Summary: Assigned via combined automated annotation (InterPro IPR006266 + UniRule UR000111469). UMP-CMP kinases participate in pyrimidine nucleotide biosynthesis by catalyzing the phosphorylation of pyrimidine nucleoside monophosphates (UMP, CMP) to their diphosphate forms (UDP, CDP). The UniProt record explicitly states the protein plays an important role in de novo pyrimidine nucleotide biosynthesis.
Reason: Pyrimidine nucleotide biosynthesis is a core biological process for UMP-CMP kinases. The phosphorylation of UMP to UDP and CMP to CDP are essential steps in maintaining the pyrimidine nucleotide pool. Confirmed in rice YL2 [PMID:29392476] and Arabidopsis PUMPKIN [PMID:30409856].
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Plays an important role in de novo pyrimidine nucleotide biosynthesis.
PMID:29392476
our results suggest that UMP kinase activity plays an essential role in chloroplast development and regulating cpATPase biogenesis in rice.
|
|
GO:0005634
nucleus
|
IEA
GO_REF:0000120 |
UNDECIDED |
Summary: Assigned via combined automated annotation (UniProtKB-SubCell SL-0191 + UniRule UR000111469). The UniProt record states nuclear localization based on HAMAP rule MF_03172. However, for plant UMP-CMP kinases, characterized orthologs are predominantly cytosolic (Arabidopsis UMK) or plastid-localized (rice YL2, Arabidopsis PUMPKIN) [PMID:29392476, PMID:30409856]. Nuclear localization is not well-established for plant UMKs.
Reason: Nuclear localization is inferred from homology to animal UMP-CMP kinases via the HAMAP rule, but characterized plant orthologs localize to the cytoplasm or plastids rather than the nucleus. Without direct experimental evidence in plants, the nuclear localization annotation should be treated with caution.
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
SUBCELLULAR LOCATION: Cytoplasm. Nucleus.
PMID:30409856
the sole plastid UMP kinase (PUMPKIN) in Arabidopsis (Arabidopsis thaliana)
|
|
GO:0005737
cytoplasm
|
IEA
GO_REF:0000120 |
ACCEPT |
Summary: Assigned via combined automated annotation (UniProtKB-SubCell SL-0086 + UniRule UR000111469). Cytoplasmic localization is consistent with the known localization of several plant UMP-CMP kinase isoforms. The Arabidopsis UMP/CMP kinase characterized by Zhou et al. was expressed and purified as a cytosolic protein [PMID:9576794].
Reason: Cytoplasmic localization is well-supported for plant UMP-CMP kinases. Arabidopsis has cytosolic UMK isoforms alongside the plastid-localized PUMPKIN. Even if this particular gene model is chimeric, the UMP-CMP kinase portion would likely localize to the cytoplasm or plastids.
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
SUBCELLULAR LOCATION: Cytoplasm.
PMID:9576794
A cDNA encoding the Arabidopsis thaliana uridine 5'-monophosphate (UMP)/cytidine 5'-monophosphate (CMP) kinase was isolated by complementation of a Saccharomyces cerevisiae ura6 mutant.
|
|
GO:0009570
chloroplast stroma
|
IEA
GO_REF:0000118 |
UNDECIDED |
Summary: Assigned via TreeGrafter matching to PANTHER PTN009208978. Chloroplast localization is plausible for plant UMP-CMP kinases - rice YL2 and Arabidopsis PUMPKIN are chloroplast-localized UMKs [PMID:29392476, PMID:30409856]. However, the specific sub-compartment is questionable: rice YL2 localizes to thylakoid membranes, not the stroma [PMID:29392476]. Additionally, the TreeGrafter match may be influenced by the chimeric gene model, as CHI-fold FAP proteins also localize to plastids [PMID:22622584].
Reason: While chloroplast localization is plausible for a plant UMP-CMP kinase, the specific annotation to chloroplast stroma (GO:0009570) is uncertain. The closest characterized ortholog, rice YL2, is a thylakoid membrane-localized protein [PMID:29392476]. Furthermore, the TreeGrafter PANTHER match could reflect either the UMK domain or the CHI-fold FAP domain, both of which can be plastid-targeted.
Supporting Evidence:
PMID:29392476
YL2 encodes a thylakoid membrane-localized protein with significant sequence similarity to UMP kinase proteins in prokaryotes and eukaryotes.
PMID:22622584
These three CHI-fold proteins localize to plastids, the site of de novo fatty-acid biosynthesis in plant cells.
|
The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.
You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.
We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.
We are interested in where in or outside the cell the gene product carries out its function.
We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.
Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.
The gene symbol NCGR_LOCUS27674 is currently literature-sparse for Miscanthus lutarioriparius: none of the retrieved Miscanthus-focused papers explicitly mention NCGR_LOCUS27674 or the UniProt accession A0A811PC48. Consequently, the most defensible functional annotation is based on the UniProt-provided identity (UMPโCMP kinase; EC 2.7.4.14; adenylate-kinase family) plus experimental evidence from plant homologs (notably Arabidopsis thaliana and rice) and conserved pathway context. This report does not substitute a different gene with a similar symbol in another organism. (mitros2020genomebiologyof pages 6-7, wang2024physiologicalandtranscriptomic pages 3-5, mitros2020genomebiologyof pages 1-2)
UMPโCMP kinase / uridylateโcytidylate kinase (EC 2.7.4.14) is a nucleoside monophosphate (NMP) kinase that catalyzes phosphorylation of pyrimidine NMPs using ATP (typically MgATP) to produce the corresponding NDPs (e.g., UMPโUDP; CMPโCDP). In plants, this activity is described as a bifunctional UK/CK that phosphorylates UMP and CMP (and generally has lower activity with dCMP), contributing to balancing ribo- and deoxyribonucleotide pools. (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 70-73)
Recent biochemical characterization of three Arabidopsis AMK-like UMP kinases (UMK1/UMK2/UMK3) shows all can use UMP, CMP, and dCMP with distinct efficiencies; UMK3 is the most efficient for UMP, whereas dCMP is a poorer substrate overall and UMK1/UMK2 are relatively better with dCMP than UMK3. (rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree pages 75-78)
Plant UK/CK (UMPโCMP kinase) functions are distributed across multiple subfamilies. A plant-focused analysis describes AK-like UK/CK proteins and also a prokaryotic-type lineage; localization can include cytosolic and mitochondrial AK-like enzymes and a plastid-localized bacterial-type enzyme (PUMPKIN) with additional plastid gene-expression roles. (kleczkowski2025adenylatedrivenequilibrationof pages 1-3)
A 2024 dissertation (with results described as under review at the time) provides a detailed dissection of Arabidopsis UMK paralogs:
- Localization: UMK1 and UMK3 are cytosolic, whereas UMK2 is mitochondrial in transient-expression assays. (rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree media 679728a2)
- Functional specialization: UMK3 is described as the central UMP kinase for phosphorylation of de novoโsynthesized UMP; UMK2 appears specialized for mitochondrial dCMP phosphorylation, supporting mitochondrial DNA precursor supply during germination. (rinne2024characterizationofthree pages 1-6, rinne2024characterizationofthree pages 61-65)
- Quantitative enzyme kinetics (UMP substrate): UMK3 shows markedly higher catalytic efficiency for UMP than UMK1/UMK2 (see Section 5). (rinne2024characterizationofthree pages 20-24)
Publication details: Rinne J. Characterization of three uridine monophosphate kinases from Arabidopsis thaliana. (2024-01). DOI/URL: https://doi.org/10.15488/16818 (rinne2024characterizationofthree pages 20-24)
A 2024 Miscanthus lutarioriparia (syn. lutarioriparius) study used RNA-seq to analyze cadmium stress responses and describes an annotation workflow (StringTie/RSEM, BLAST functional annotation, GO/KEGG enrichment). However, the text examined did not expose identifiers mapping to NCGR_LOCUS27674/A0A811PC48; the authors note deposition in NCBI SRA (PRJNA733881), indicating that locus-to-accession mapping may reside in supplementary or deposited annotation files. (wang2024physiologicalandtranscriptomic pages 3-5)
Publication details: Wang J. et al. PLOS ONE (2024-05). DOI/URL: https://doi.org/10.1371/journal.pone.0302940 (wang2024physiologicalandtranscriptomic pages 3-5)
A 2024 study frames M. lutarioriparia as a โpromising energy cropโ used for phytoremediation of abandoned mine soils; it reports that M. lutarioriparia could reduce Cd content in mining soil by 29.82%, with maximal Cd concentrations in aboveground and belowground tissues of ~185.65 mg/kg and ~186.8 mg/kg, respectively (study-specific conditions). While this does not directly implicate NCGR_LOCUS27674, it provides real-world context in which core metabolic โhousekeepingโ enzymes like pyrimidine kinases could affect biomass accumulation and stress tolerance through nucleotide/energy balance. (wang2024physiologicalandtranscriptomic pages 3-5)
Plant UMP/CMP kinase activity is implicated in fundamental growth and organelle biogenesis:
- In Arabidopsis, UMK3 is reported as essential for normal growth and reproductive/embryonic viability (based on transmission/phenotype analyses described), consistent with a core role in nucleotide supply. (rinne2024characterizationofthree pages 1-6, rinne2024characterizationofthree pages 70-73)
- In rice, mutation of a chloroplast-associated UMP kinase (YL2) causes chloroplast developmental defects and reduced photosynthetic performance, including decreased cpATPase activity (see statistics below). (chen2018umpkinaseactivity pages 5-8)
Given the UniProt-provided identity and strong conservation of UK/CK activities across plants, the most evidence-aligned annotation is:
NCGR_LOCUS27674 likely encodes an intracellular UMPโCMP kinase (EC 2.7.4.14) that phosphorylates UMP and CMP (and likely dCMP to a lesser extent) to generate UDP/CDP/(d)CDP, thereby supporting pyrimidine nucleotide pool homeostasis required for RNA synthesis, sugar-nucleotide formation, membrane lipid metabolism, and DNA precursor supply. This is a functional inference for the Miscanthus protein, supported by direct plant homolog biochemistry, genetics, and compartmentalization evidence. (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree pages 1-6)
A plant UK/CK description emphasizes that phosphorylation of UMP/CMP connects monophosphate pools to diphosphate pools using MgATP, and that plants contain both AK-like and prokaryotic-type UK/CKs with cytosolic/mitochondrial/plastid associations. This supports an interpretation of NCGR_LOCUS27674 as a node in pyrimidine nucleotide interconversion that can influence organellar nucleotide availability (e.g., mitochondrial DNA precursor supply), as directly proposed for Arabidopsis UMK2/UMK3 in germination. (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 61-65)
Measured kinetic parameters for Arabidopsis UMKs using UMP as acceptor (units as reported):
- UMK1: KM โ 0.646 mM, kcat โ 41.1 sโปยน, kcat/KM โ 63.6
- UMK2: KM โ 0.694 mM, kcat โ 95.1 sโปยน, kcat/KM โ 137
- UMK3: KM โ 0.24 mM, kcat โ 252.1 sโปยน, kcat/KM โ 1050
These data indicate UMK3 is much more catalytically efficient for UMP phosphorylation than UMK1/UMK2 in vitro, consistent with UMK3 being described as the โcentralโ cytosolic UMP kinase in vivo. (rinne2024characterizationofthree pages 20-24)
Publication details: Rinne J. (2024-01). DOI/URL: https://doi.org/10.15488/16818 (rinne2024characterizationofthree pages 20-24)
In a rice mutant defective in a chloroplast-associated UMP kinase (YL2), chloroplast function is impaired; the study reports a ~36.3% decrease in cpATPase activity compared with wild type. This supports the broader conclusion that pyrimidine monophosphate kinase activities can be critical for organelle development and photosynthetic capacity, although this specific chloroplast localization is not established for Miscanthus NCGR_LOCUS27674. (chen2018umpkinaseactivity pages 5-8)
Publication details: Chen F. et al. Photosynthesis Research (2018-02). DOI/URL: https://doi.org/10.1007/s11120-017-0477-5 (chen2018umpkinaseactivity pages 5-8)
In M. lutarioriparia cadmium tolerance work, reported performance includes 29.82% reduction of Cd content in mining soil and maximal tissue Cd concentrations around 185.65โ186.8 mg/kg (study-specific). (wang2024physiologicalandtranscriptomic pages 3-5)
Publication details: Wang J. et al. PLOS ONE (2024-05). DOI/URL: https://doi.org/10.1371/journal.pone.0302940 (wang2024physiologicalandtranscriptomic pages 3-5)
Cropped visual extractions from the 2024 Arabidopsis UMK work indicate that kinetic parameters are presented in Tables 1โ3 and that Figure 7 shows cytosolic localization for UMK1/UMK3 and mitochondrial localization for UMK2, supporting the compartmentalization claims used for inference. (rinne2024characterizationofthree media 679728a2, rinne2024characterizationofthree media 32a65f5e, rinne2024characterizationofthree media 618fd8ed)
| Annotation element | Key points | Evidence type | Best supporting citations |
|---|---|---|---|
| Molecular function | NCGR_LOCUS27674 / UniProt A0A811PC48 is annotated in UniProt as UMP-CMP kinase / deoxycytidylate kinase (EC 2.7.4.14), a nucleoside monophosphate kinase in the adenylate kinase family; plant UK/CK enzymes phosphorylate UMP and CMP, and can show lower activity with dCMP. | Inference for Miscanthus from UniProt/domain assignment; experimental from plant homolog biochemistry | (rinne2024characterizationofthree pages 1-6, kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 20-24) |
| Reaction catalyzed | Canonical reaction: ATP + UMP โ ADP + UDP; plant uridylate-cytidylate kinases also catalyze ATP + CMP โ ADP + CDP and can phosphorylate dCMP โ dCDP. | Experimental in Arabidopsis/plant literature; inferred for Miscanthus ortholog | (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 70-73, rinne2024characterizationofthree pages 20-24) |
| Substrates | Primary phosphate acceptors in plants: UMP, CMP; dCMP is accepted but generally poorer than ribonucleotide substrates. Arabidopsis AMK-like UMKs all used UMP/CMP/dCMP with differing efficiencies. | Experimental homolog evidence; inferred for Miscanthus | (rinne2024characterizationofthree pages 1-6, rinne2024characterizationofthree pages 75-78, rinne2024characterizationofthree pages 20-24) |
| Products | UDP, CDP, dCDP plus ADP, depending on substrate. These products feed synthesis of UTP/CTP/dCTP pools needed for RNA, membrane lipid, sugar-nucleotide, and DNA metabolism. | Experimental + pathway inference | (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 70-73, rinne2024characterizationofthree pages 75-78) |
| Cofactors / biochemical requirements | Uses MgATP as phosphate donor; only Mg-free (d)NMP serves as phosphate acceptor in plant UK/CK description. | Experimental/review evidence | (kleczkowski2025adenylatedrivenequilibrationof pages 1-3) |
| Enzyme family / domains | UniProt places A0A811PC48 in the adenylate kinase family, UMP-CMP kinase subgroup. This is consistent with plant AK-like UK/CK enzymes; a distinct prokaryotic/eubacterial-type plastid UMK lineage also exists in plants. | Inference for Miscanthus + experimental/comparative plant evidence | (rinne2024characterizationofthree pages 1-6, kleczkowski2025adenylatedrivenequilibrationof pages 1-3) |
| Subcellular localization | Specific localization of NCGR_LOCUS27674 is not directly shown in retrieved Miscanthus literature. In plants, AK-like UMKs can be cytosolic or mitochondrial; plastid bacterial-type UMKs also exist. Arabidopsis UMK1/UMK3 localize to cytosol, UMK2 to mitochondria. Therefore Miscanthus A0A811PC48 is best annotated cautiously as likely intracellular (cytosol or mitochondrion), unresolved without species-specific targeting evidence. | Direct evidence lacking for Miscanthus; localization inferred from plant homolog families | (rinne2024characterizationofthree pages 1-6, rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree media 679728a2) |
| Biological roles | Central role in pyrimidine nucleotide metabolism, balancing UMP/CMP/dCMP phosphorylation and supporting UTP/CTP/dCTP supply for RNA synthesis, UDP-sugar formation, membrane biogenesis, and DNA replication. | Experimental homolog evidence + pathway inference | (rinne2024characterizationofthree pages 70-73, rinne2024characterizationofthree pages 75-78) |
| Pathway context | Functions in pyrimidine de novo synthesis output utilization and salvage, linking monophosphate pools to diphosphate/triphosphate pools. In Arabidopsis, UMK3 is the main route for phosphorylation of de novo synthesized UMP; UMK2/UMK3 support mitochondrial dCTP supply during germination. | Experimental homolog evidence | (rinne2024characterizationofthree pages 82-85, rinne2024characterizationofthree pages 70-73, rinne2024characterizationofthree pages 61-65) |
| Kinetic parameters: UMP | Arabidopsis UMKs (AMK-like family): UMK1 KM ~0.646 mM, kcat ~41.1 s^-1, kcat/KM ~63.6; UMK2 KM ~0.694 mM, kcat ~95.1 s^-1, kcat/KM ~137; UMK3 KM ~0.24 mM, kcat ~252.1 s^-1, kcat/KM ~1050. UMK3 is the most efficient UMP kinase among the three. | Experimental homolog biochemistry | (rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree media 679728a2) |
| Kinetic parameters: CMP / dCMP | Rinne 2024 reported measurable activity with CMP and dCMP for all three Arabidopsis UMKs; UMK3 showed high efficiency for CMP, while dCMP was a poorer substrate overall and UMK1/UMK2 were relatively better with dCMP than UMK3. | Experimental homolog biochemistry | (rinne2024characterizationofthree pages 1-6, rinne2024characterizationofthree pages 75-78, rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree media 679728a2) |
| Regulation / assay caveats | Apparent strong substrate inhibition of UMK3 in a coupled assay was attributed by LC-MS follow-up to ATP-mediated inhibition of the coupling enzyme (pyruvate kinase) rather than genuine UMK3 inhibition. | Experimental homolog evidence | (rinne2024characterizationofthree pages 20-24) |
| Mutant / phenotype evidence in plants | In Arabidopsis, UMK3 is essential: null alleles were not transmitted, and reduced UMK3 activity caused growth defects and altered pyrimidine pools; UMK2 and UMK3 contribute to mitochondrial DNA replication during germination. This supports a core housekeeping role for plant UMP/CMP kinases. | Experimental homolog genetics | (rinne2024characterizationofthree pages 1-6, rinne2024characterizationofthree pages 70-73, rinne2024characterizationofthree pages 61-65, rinne2024characterizationofthree pages 75-78) |
| Chloroplast-related phenotype evidence | A distinct rice UMP kinase, YL2/LOC_Os01g73450, is a chloroplast/thylakoid-associated UMP kinase required for normal chloroplast development; mutant plants show yellow leaves, reduced chlorophyll, and decreased cpATPase activity (~36.3% reduction). This supports the broader importance of plant UMP kinase activity for organelle function, though it does not prove chloroplast localization of NCGR_LOCUS27674. | Experimental, but from a different plant UMK lineage/context | (chen2018umpkinaseactivity pages 5-8) |
| Plant salvage review context | Plant pyrimidine salvage is closely integrated with organelle biogenesis. Reviews and recent work describe cytosolic, mitochondrial, and plastid nucleotide salvage/kinase functions, with plastid bacterial-type UK/CK (PUMPKIN) coupling nucleotide metabolism to plastid gene-expression processes. | Experimental/review context | (kleczkowski2025adenylatedrivenequilibrationof pages 1-3) |
| Notes for Miscanthus identifier verification | Retrieved Miscanthus papers did not mention NCGR_LOCUS27674 or A0A811PC48 directly. Thus the annotation should be stated as: the gene symbol is literature-sparse for this specific protein; functional annotation is inferred from UniProt and conserved plant UMP/CMP kinase biology. Miscanthus genome papers indicate complex mapping due to polyploidy/homoeology, so locus cross-referencing may require annotation tables/supplementary files. | Direct negative finding + cautious inference | (mitros2020genomebiologyof pages 6-7, wang2024physiologicalandtranscriptomic pages 3-5, mitros2020genomebiologyof pages 1-2, kim2025unravelingphylogeneticrelationships pages 4-6) |
| Best concise functional annotation for NCGR_LOCUS27674 | Probable intracellular UMP/CMP kinase (EC 2.7.4.14) that phosphorylates UMP and CMP, likely also dCMP, to maintain pyrimidine nucleotide pools for RNA/DNA precursor synthesis and related biosynthetic processes; exact subcellular localization in M. lutarioriparius remains unverified. | Integrated inference from UniProt + conserved plant evidence | (rinne2024characterizationofthree pages 1-6, kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 20-24, mitros2020genomebiologyof pages 6-7) |
Table: This table compiles the strongest available evidence for annotating Miscanthus lutarioriparius NCGR_LOCUS27674 (UniProt A0A811PC48) as a UMP-CMP kinase. It distinguishes direct evidence from homolog-based inference and highlights where species-specific uncertainty remains.
Gene product: probable UMPโCMP kinase / uridylateโcytidylate kinase (EC 2.7.4.14) (adenylate kinase family). (kleczkowski2025adenylatedrivenequilibrationof pages 1-3)
Molecular function: ATP-dependent phosphorylation of pyrimidine nucleoside monophosphates (primarily UMP and CMP; likely also dCMP) to corresponding diphosphates. (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 20-24)
Biological process/pathway: pyrimidine nucleotide interconversion and salvage/de novo output utilization; supports production of UDP/CDP used to build UTP/CTP/dCTP pools required for RNA synthesis, sugar-nucleotide production, and DNA precursor supply (including organellar DNA replication). (rinne2024characterizationofthree pages 70-73, rinne2024characterizationofthree pages 61-65)
Cellular localization: intracellular; specific compartment in M. lutarioriparius remains unverified. Plant homologs include cytosolic and mitochondrial isoforms; plastid bacterial-type UK/CKs also exist in plants (not established for A0A811PC48). (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree media 679728a2)
Because Miscanthus-specific locus-level literature was not found in the retrieved set, the key gaps are (i) mapping of NCGR_LOCUS27674 to the M. lutarioriparius reference annotation used by specific studies and (ii) experimental confirmation of subcellular targeting. Miscanthus genomics work emphasizes polyploidy/homoeology, which can complicate mapping of locus IDs across assemblies and annotations. (mitros2020genomebiologyof pages 1-2, mitros2020genomebiologyof pages 6-7)
References
(mitros2020genomebiologyof pages 6-7): Therese Mitros, Adam M. Session, Brandon T. James, Guohong Albert Wu, Mohammad B. Belaffif, Lindsay V. Clark, Shengqiang Shu, Hongxu Dong, Adam Barling, Jessica R. Holmes, Jessica E. Mattick, Jessen V. Bredeson, Siyao Liu, Kerrie Farrar, Katarzyna Gลowacka, Stanisลaw Jeลผowski, Kerrie Barry, Won Byoung Chae, John A. Juvik, Justin Gifford, Adebosola Oladeinde, Toshihiko Yamada, Jane Grimwood, Nicholas H. Putnam, Jose De Vega, Susanne Barth, Manfred Klaas, Trevor Hodkinson, Laigeng Li, Xiaoli Jin, Junhua Peng, Chang Yeon Yu, Kweon Heo, Ji Hye Yoo, Bimal Kumar Ghimire, Iain S. Donnison, Jeremy Schmutz, Matthew E. Hudson, Erik J. Sacks, Stephen P. Moose, Kankshita Swaminathan, and Daniel S. Rokhsar. Genome biology of the paleotetraploid perennial biomass crop miscanthus. Nature Communications, Oct 2020. URL: https://doi.org/10.1038/s41467-020-18923-6, doi:10.1038/s41467-020-18923-6. This article has 118 citations and is from a highest quality peer-reviewed journal.
(wang2024physiologicalandtranscriptomic pages 3-5): Jia Wang, Xinyu Liu, Yiran Chen, Feng lin Zhu, Jiajing Sheng, and Ying Diao. Physiological and transcriptomic analyses reveal the cadmium tolerance mechanism of miscanthus lutarioriparia. PLOS ONE, 19:e0302940, May 2024. URL: https://doi.org/10.1371/journal.pone.0302940, doi:10.1371/journal.pone.0302940. This article has 3 citations and is from a peer-reviewed journal.
(mitros2020genomebiologyof pages 1-2): Therese Mitros, Adam M. Session, Brandon T. James, Guohong Albert Wu, Mohammad B. Belaffif, Lindsay V. Clark, Shengqiang Shu, Hongxu Dong, Adam Barling, Jessica R. Holmes, Jessica E. Mattick, Jessen V. Bredeson, Siyao Liu, Kerrie Farrar, Katarzyna Gลowacka, Stanisลaw Jeลผowski, Kerrie Barry, Won Byoung Chae, John A. Juvik, Justin Gifford, Adebosola Oladeinde, Toshihiko Yamada, Jane Grimwood, Nicholas H. Putnam, Jose De Vega, Susanne Barth, Manfred Klaas, Trevor Hodkinson, Laigeng Li, Xiaoli Jin, Junhua Peng, Chang Yeon Yu, Kweon Heo, Ji Hye Yoo, Bimal Kumar Ghimire, Iain S. Donnison, Jeremy Schmutz, Matthew E. Hudson, Erik J. Sacks, Stephen P. Moose, Kankshita Swaminathan, and Daniel S. Rokhsar. Genome biology of the paleotetraploid perennial biomass crop miscanthus. Nature Communications, Oct 2020. URL: https://doi.org/10.1038/s41467-020-18923-6, doi:10.1038/s41467-020-18923-6. This article has 118 citations and is from a highest quality peer-reviewed journal.
(kleczkowski2025adenylatedrivenequilibrationof pages 1-3): Leszek A. Kleczkowski and Abir U. Igamberdiev. Adenylate-driven equilibration of both ribo- and deoxyribonucleotides is under magnesium control: quantification of the mg2+-signal. Journal of Plant Physiology, 304:154380, Jan 2025. URL: https://doi.org/10.1016/j.jplph.2024.154380, doi:10.1016/j.jplph.2024.154380. This article has 4 citations and is from a peer-reviewed journal.
(rinne2024characterizationofthree pages 70-73): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.
(rinne2024characterizationofthree pages 20-24): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.
(rinne2024characterizationofthree pages 75-78): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.
(rinne2024characterizationofthree media 679728a2): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.
(rinne2024characterizationofthree pages 1-6): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.
(rinne2024characterizationofthree pages 61-65): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.
(chen2018umpkinaseactivity pages 5-8): Fei Chen, Guojun Dong, Xiaohui Ma, Fang Wang, Yanli Zhang, Erhui Xiong, Jiahuan Wu, Huizhong Wang, Qian Qian, Limin Wu, and Yanchun Yu. Ump kinase activity is involved in proper chloroplast development in rice. Photosynthesis Research, 137:53-67, Feb 2018. URL: https://doi.org/10.1007/s11120-017-0477-5, doi:10.1007/s11120-017-0477-5. This article has 28 citations and is from a peer-reviewed journal.
(rinne2024characterizationofthree media 32a65f5e): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.
(rinne2024characterizationofthree media 618fd8ed): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.
(rinne2024characterizationofthree pages 82-85): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.
(kim2025unravelingphylogeneticrelationships pages 4-6): Ji Eun Kim, Yang Su Kim, Gyu Young Chung, Hyeok Jae Choi, Chang-Gee Jang, Hoe Jin Kim, and Chae Sun Na. Unraveling phylogenetic relationships among six miscanthus andersson (poaceae) species through chloroplast genome analysis. Genes, 16:1175, Oct 2025. URL: https://doi.org/10.3390/genes16101175, doi:10.3390/genes16101175. This article has 0 citations.
NCGR_LOCUS27674 encodes a predicted 714-amino acid protein annotated as UMP-CMP kinase (EC 2.7.4.14) in Miscanthus lutarioriparius, a C4 perennial grass. There is no direct experimental literature on this gene. The protein is almost certainly a chimeric gene model artifact, as it contains two unrelated domains that are never found fused together in nature.
The protein contains two distinct functional domains:
Contains P-loop NTPase fold for ATP binding
C-terminal chalcone isomerase-fold domain (Pfam PF16035, Chalcone_2; InterPro IPR036298)
At 714 AA, this protein is vastly larger than any known UMP-CMP kinase:
- Arabidopsis thaliana UMP-CMP kinase: 202 AA PMID:9736767
- Rice YL2 (chloroplastic UMK): 351 AA PMID:29866037
- Human CMPK1: 196-228 AA
- E. coli CMP kinase: 225 AA
No organism has been reported to have a natural fusion of adenylate kinase and chalcone isomerase domains.
The M. lutarioriparius genome (PMID:33931638) was assembled at 2.07 Gb with 68,328 predicted gene models - very high even for an allotetraploid. The ab initio + EVidenceModeler pipeline can produce chimeric models, particularly in polyploid genomes with tandemly duplicated genes.
Plant UMP-CMP kinases (also called UMK or PUMPKIN) catalyze the phosphorylation of pyrimidine nucleoside monophosphates (UMP, CMP, dCMP) using ATP. They are essential for pyrimidine nucleotide biosynthesis.
Key characterized plant UMKs:
- Arabidopsis PUMPKIN (At3g18680): Plastid-localized UMP kinase, essential for chloroplast development PMID:30523175
- Rice YL2: Chloroplast thylakoid membrane-localized UMP kinase. Loss of function causes yellow-green leaves due to impaired chloroplast development PMID:29866037
- Arabidopsis UMK1/UMK2/UMK3: Cytosolic and mitochondrial isoforms PMID:9736767
The chalcone isomerase superfamily includes non-catalytic fatty acid-binding proteins (FAPs):
- Arabidopsis has CHI-fold FAP proteins that localize to plastids PMID:22388820
- FAPs bind fatty acids and may have roles in de novo fatty acid biosynthesis in chloroplasts
- They lack the catalytic residues needed for chalcone isomerase activity
This protein is almost certainly a chimeric gene model artifact resulting from incorrect fusion of two separate genes during genome annotation:
E. coli CMP kinase: 225 AA
Two unrelated domains:
C-terminal (positions 555-650): Chalcone isomerase domain (Pfam PF16035, Chalcone_2) - functionally unrelated
Conflicting automated annotations from different domains:
Intramolecular lyase activity (from CHI-fold superfamily via InterPro)
Genome annotation context: The M. lutarioriparius genome has 68,328 predicted gene models PMID:33931638, which is very high even for an allotetraploid grass. The EVidenceModeler pipeline used is known to sometimes produce chimeric models in polyploid genomes.
Gene 1 (N-terminal ~350 AA): A plastid/chloroplast-targeted UMP-CMP kinase
- Consistent with rice YL2 ortholog PMID:29866037 or Arabidopsis PUMPKIN PMID:30523175
- Plant plastid UMKs have transit peptides and are 300-350 AA total
- Functions in de novo pyrimidine nucleotide biosynthesis in chloroplasts
Gene 2 (C-terminal ~150+ AA): A CHI-fold fatty acid-binding protein (FAP)
- The Chalcone_2 domain (PF16035) belongs to the CHI superfamily
- FAP subfamily members are non-catalytic - they bind fatty acids but lack isomerase activity PMID:22388820
- Plant FAPs typically localize to plastids for de novo fatty acid biosynthesis PMID:22388820
All 13 GO annotations are IEA (electronic). Given the chimeric nature:
id: A0A811PC48
gene_symbol: NCGR_LOCUS27674
product_type: PROTEIN
taxon:
id: NCBITaxon:422564
label: Miscanthus lutarioriparius
description: >-
NCGR_LOCUS27674 encodes a predicted 714-amino acid protein annotated as UMP-CMP kinase
(EC 2.7.4.14) in Miscanthus lutarioriparius, a C4 perennial grass. However, this gene model
is almost certainly a chimeric artifact resulting from the incorrect fusion of at least two
separate genes during genome annotation: an N-terminal UMP-CMP kinase (adenylate kinase family,
Pfam PF00406) and a C-terminal chalcone isomerase-fold fatty acid-binding protein (Pfam PF16035,
positions 555-650). At 714 AA, the protein is 2-3.5x larger than any characterized UMP-CMP
kinase (e.g. Arabidopsis 202 AA [PMID:9576794], rice YL2 351 AA [PMID:29392476]). The
conflicting automated annotations - kinase activities from the ADK domain and fatty acid
binding / intramolecular lyase from the CHI-fold domain - are explained by this fusion. The
M. lutarioriparius genome has 68,328 predicted gene models [PMID:33911077], unusually high
even for an allotetraploid, consistent with gene model artifacts from the EVidenceModeler
pipeline. All 13 GO annotations are IEA with no experimental evidence. The annotations
pertaining to the UMP-CMP kinase portion (N-terminal domain) are likely correct for that
domain, while annotations from the CHI-fold domain (fatty acid binding, intramolecular lyase
activity) are artifacts of the chimeric model.
existing_annotations:
- term:
id: GO:0005504
label: fatty acid binding
evidence_type: IEA
original_reference_id: GO_REF:0000118
review:
summary: >-
This annotation derives from TreeGrafter matching the C-terminal chalcone isomerase-fold
domain (Pfam PF16035, Chalcone_2, positions 555-650) to PANTHER family PTHR47284
(FATTY-ACID-BINDING PROTEIN 2). The CHI-fold superfamily includes non-catalytic fatty
acid-binding proteins (FAPs) that localize to plastids and participate in de novo fatty
acid biosynthesis [PMID:22622584]. However, this annotation reflects a domain from what
is almost certainly a chimeric gene model, not the true function of the UMP-CMP kinase
that constitutes the N-terminal portion of this protein. The 714 AA size is 2-3.5x larger
than any known UMP-CMP kinase, and no natural ADK-CHI domain fusion has been reported
in any organism.
action: REMOVE
reason: >-
This annotation is an artifact of a chimeric gene model. The fatty acid binding function
derives from the C-terminal CHI-fold/FAP domain (positions 555-650), which is a separate
gene incorrectly fused with the N-terminal UMP-CMP kinase during genome annotation of
M. lutarioriparius [PMID:33911077]. FAP proteins in the CHI superfamily do bind fatty
acids [PMID:22622584], but this activity belongs to a distinct gene product, not a
UMP-CMP kinase.
supported_by:
- reference_id: PMID:22622584
supporting_text: >-
These three CHI-fold proteins localize to plastids, the site of de novo
fatty-acid biosynthesis in plant cells. Furthermore, their expression profiles
correlate with those of core fatty-acid biosynthetic enzymes
- reference_id: PMID:33911077
supporting_text: >-
A total of 68,328 gene models were predicted, with an average CDS length of 1215 bp
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-notes.md
supporting_text: >-
This protein is almost certainly a chimeric gene model artifact resulting from incorrect
fusion of two separate genes during genome annotation.
- term:
id: GO:0005524
label: ATP binding
evidence_type: IEA
original_reference_id: GO_REF:0000002
review:
summary: >-
ATP binding is assigned via InterPro match to IPR000850 (Adenylate/UMP-CMP kinase).
UMP-CMP kinases catalyze the phosphorylation of pyrimidine nucleoside monophosphates
at the expense of ATP, so ATP binding is an inherent part of their catalytic mechanism.
The UniProt record confirms ATP binding sites at positions 36-41, 149, and 192 via
HAMAP-Rule MF_03172. This annotation is consistent with the N-terminal UMP-CMP kinase
domain of this protein.
action: ACCEPT
reason: >-
ATP binding is a core requirement for UMP-CMP kinase activity. The protein contains
a P-loop NTPase domain with conserved ATP binding residues identified by HAMAP and
InterPro. All characterized UMP-CMP kinases require ATP as the phosphate donor
[PMID:9576794].
supported_by:
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
supporting_text: >-
Catalyzes the phosphorylation of pyrimidine nucleoside monophosphates at the expense
of ATP.
- reference_id: PMID:9576794
supporting_text: >-
the UMP/CMP kinase preferentially uses ATP
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-deep-research-manual.md
supporting_text: >-
Core function: phosphorylation of UMP, CMP, and dCMP to their diphosphate forms using
ATP as the phosphate donor.
- term:
id: GO:0016776
label: phosphotransferase activity, phosphate group as acceptor
evidence_type: IEA
original_reference_id: GO_REF:0000002
review:
summary: >-
Assigned via InterPro match to IPR006266 (UMP-CMP kinase). GO:0016776 is an ancestor
of GO:0033862 (UMP kinase activity) in the GO hierarchy, so this is a broader but
valid annotation for a UMP-CMP kinase. UMP-CMP kinases transfer a phosphate group
from ATP to nucleoside monophosphate substrates (UMP, CMP, dCMP), which already
carry a phosphate group, making them phosphotransferases with phosphate group as acceptor.
action: ACCEPT
reason: >-
This is a valid parent term of the more specific UMP/CMP/dCMP kinase activities also
annotated to this protein. While broader than the specific kinase activities
(GO:0033862, GO:0036430, GO:0036431), it correctly captures the biochemical mechanism.
The InterPro match to IPR006266 (UMP-CMP kinase family) is appropriate for the
N-terminal domain.
supported_by:
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
supporting_text: >-
Reaction=CMP + ATP = CDP + ADP
- term:
id: GO:0016872
label: intramolecular lyase activity
evidence_type: IEA
original_reference_id: GO_REF:0000002
review:
summary: >-
This annotation derives from InterPro match to IPR036298 (Chalcone isomerase superfamily),
which is mapped to GO:0016872 via InterPro2GO. Chalcone isomerases catalyze intramolecular
cyclization of chalcones to flavanones. However, this annotation is inappropriate for
two reasons: (1) the CHI-fold domain in this protein likely belongs to the non-catalytic
FAP (fatty acid-binding) subfamily, which lacks the catalytic residues for isomerase
activity [PMID:22622584]; and (2) this domain is part of a chimeric gene model and
belongs to a separate gene product. The InterPro2GO mapping from IPR036298 to GO:0016872
is overly broad, applying catalytic function to the entire superfamily including
non-catalytic members.
action: REMOVE
reason: >-
This is a doubly erroneous annotation: the intramolecular lyase activity derives from
the CHI superfamily InterPro match, but the CHI-fold domain in this protein is likely
a FAP subfamily member that is non-catalytic. Furthermore, the CHI-fold domain
(positions 555-650) belongs to a separate gene that was incorrectly fused with the
UMP-CMP kinase in this chimeric gene model. A UMP-CMP kinase has no intramolecular
lyase activity.
supported_by:
- reference_id: PMID:22622584
supporting_text: >-
the FAP discovery defines the adaptive evolution of a stereospecific
and catalytically 'perfected' enzyme from a non-enzymatic ancestor over a
defined period of plant evolution.
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-notes.md
supporting_text: >-
C-terminal (positions 555-650) Chalcone isomerase domain (Pfam PF16035, Chalcone_2)
- functionally unrelated to the N-terminal UMP-CMP kinase domain.
- term:
id: GO:0019205
label: nucleobase-containing compound kinase activity
evidence_type: IEA
original_reference_id: GO_REF:0000002
review:
summary: >-
Assigned via InterPro match to IPR000850 (Adenylate/UMP-CMP kinase). GO:0019205
is an ancestor of GO:0033862 (UMP kinase activity) in the GO hierarchy. UMP-CMP
kinases phosphorylate nucleoside monophosphates (nucleobase-containing compounds),
so this broader annotation is valid.
action: ACCEPT
reason: >-
This is a valid broader term for UMP-CMP kinase activity. The protein's N-terminal
domain belongs to the adenylate kinase family UMP-CMP kinase subfamily, which
phosphorylates nucleobase-containing compounds (UMP, CMP, dCMP). While less specific
than GO:0033862/GO:0036430/GO:0036431, it is not incorrect.
supported_by:
- reference_id: PMID:9576794
supporting_text: >-
both UMP (Km = 153 microM) and CMP (Km = 266 microM) were equally acceptable as
the phosphate acceptor.
- term:
id: GO:0033862
label: UMP kinase activity
evidence_type: IEA
original_reference_id: GO_REF:0000104
review:
summary: >-
Assigned via UniRule UR000111469 based on shared sequence features with characterized
UMP-CMP kinases. UMP kinase activity (catalysis of UMP + ATP = UDP + ADP) is a core
function of the UMP-CMP kinase family. The UniProt record explicitly lists this
catalytic activity with EC 2.7.4.14. This annotation applies to the N-terminal
adenylate kinase domain.
action: ACCEPT
reason: >-
UMP kinase activity is the primary molecular function of UMP-CMP kinases. The protein
contains the conserved ADK domain (PF00406) and matches HAMAP rule MF_03172 for the
UMP-CMP kinase subfamily. Confirmed in the rice ortholog YL2 [PMID:29392476] and
Arabidopsis [PMID:9576794].
supported_by:
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
supporting_text: >-
Reaction=UMP + ATP = UDP + ADP
- reference_id: PMID:29392476
supporting_text: >-
Prokaryotic UMP kinase activity was subsequently confirmed, with YL2 deficiency
causing a significant reduction in chlorophyll accumulation and photochemical
efficiency.
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-deep-research-manual.md
supporting_text: >-
Plant UMP-CMP kinases catalyze the phosphorylation of pyrimidine nucleoside
monophosphates (UMP, CMP, dCMP) using ATP.
- term:
id: GO:0036430
label: dCMP kinase activity
evidence_type: IEA
original_reference_id: GO_REF:0000104
review:
summary: >-
Assigned via UniRule UR000111469. dCMP kinase activity (catalysis of dCMP + ATP =
dCDP + ADP) is a known activity of UMP-CMP kinases. The UniProt record explicitly
lists this catalytic reaction.
action: ACCEPT
reason: >-
dCMP kinase activity is a well-characterized function of the UMP-CMP kinase family.
The catalytic reaction is explicitly listed in the UniProt record. This applies to
the N-terminal kinase domain.
supported_by:
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
supporting_text: >-
Reaction=dCMP + ATP = dCDP + ADP
- term:
id: GO:0036431
label: CMP kinase activity
evidence_type: IEA
original_reference_id: GO_REF:0000104
review:
summary: >-
Assigned via UniRule UR000111469. CMP kinase activity (catalysis of CMP + ATP =
CDP + ADP) is a core function of UMP-CMP kinases. The UniProt record explicitly
lists this catalytic reaction.
action: ACCEPT
reason: >-
CMP kinase activity is a core function of the UMP-CMP kinase family. Arabidopsis
UMP/CMP kinase accepts both UMP and CMP as phosphate acceptors [PMID:9576794].
supported_by:
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
supporting_text: >-
Reaction=CMP + ATP = CDP + ADP
- reference_id: PMID:9576794
supporting_text: >-
both UMP (Km = 153 microM) and CMP (Km = 266 microM) were equally acceptable as
the phosphate acceptor.
- term:
id: GO:0006207
label: "'de novo' pyrimidine nucleobase biosynthetic process"
evidence_type: IEA
original_reference_id: GO_REF:0000002
review:
summary: >-
Assigned via InterPro match to IPR006266 (UMP-CMP kinase). UMP-CMP kinases play an
important role in de novo pyrimidine nucleotide biosynthesis by phosphorylating UMP
to UDP. However, GO:0006207 specifically refers to pyrimidine nucleobase biosynthesis
(the base itself), whereas UMP-CMP kinases act on nucleotides (nucleobase + sugar +
phosphate). The more appropriate term is GO:0006221 (pyrimidine nucleotide biosynthetic
process), which is already annotated.
action: MODIFY
reason: >-
UMP-CMP kinases act on pyrimidine nucleotides, not nucleobases. GO:0006207 is
technically not the correct process for a nucleoside monophosphate kinase. GO:0006221
(pyrimidine nucleotide biosynthetic process) is more appropriate and already annotated.
proposed_replacement_terms:
- id: GO:0006221
label: pyrimidine nucleotide biosynthetic process
supported_by:
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
supporting_text: >-
Plays an important role in de novo pyrimidine nucleotide biosynthesis. Has preference
for UMP and CMP as phosphate acceptors.
- term:
id: GO:0006221
label: pyrimidine nucleotide biosynthetic process
evidence_type: IEA
original_reference_id: GO_REF:0000120
review:
summary: >-
Assigned via combined automated annotation (InterPro IPR006266 + UniRule UR000111469).
UMP-CMP kinases participate in pyrimidine nucleotide biosynthesis by catalyzing the
phosphorylation of pyrimidine nucleoside monophosphates (UMP, CMP) to their diphosphate
forms (UDP, CDP). The UniProt record explicitly states the protein plays an important
role in de novo pyrimidine nucleotide biosynthesis.
action: ACCEPT
reason: >-
Pyrimidine nucleotide biosynthesis is a core biological process for UMP-CMP kinases.
The phosphorylation of UMP to UDP and CMP to CDP are essential steps in maintaining
the pyrimidine nucleotide pool. Confirmed in rice YL2 [PMID:29392476] and Arabidopsis
PUMPKIN [PMID:30409856].
supported_by:
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
supporting_text: >-
Plays an important role in de novo pyrimidine nucleotide biosynthesis.
- reference_id: PMID:29392476
supporting_text: >-
our results suggest that UMP kinase activity plays an essential role in
chloroplast development and regulating cpATPase biogenesis in rice.
- term:
id: GO:0005634
label: nucleus
evidence_type: IEA
original_reference_id: GO_REF:0000120
review:
summary: >-
Assigned via combined automated annotation (UniProtKB-SubCell SL-0191 + UniRule
UR000111469). The UniProt record states nuclear localization based on HAMAP rule
MF_03172. However, for plant UMP-CMP kinases, characterized orthologs are predominantly
cytosolic (Arabidopsis UMK) or plastid-localized (rice YL2, Arabidopsis PUMPKIN)
[PMID:29392476, PMID:30409856]. Nuclear localization is not well-established for
plant UMKs.
action: UNDECIDED
reason: >-
Nuclear localization is inferred from homology to animal UMP-CMP kinases via the
HAMAP rule, but characterized plant orthologs localize to the cytoplasm or plastids
rather than the nucleus. Without direct experimental evidence in plants, the nuclear
localization annotation should be treated with caution.
supported_by:
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
supporting_text: >-
SUBCELLULAR LOCATION: Cytoplasm. Nucleus.
- reference_id: PMID:30409856
supporting_text: >-
the sole plastid UMP kinase (PUMPKIN) in Arabidopsis (Arabidopsis thaliana)
- term:
id: GO:0005737
label: cytoplasm
evidence_type: IEA
original_reference_id: GO_REF:0000120
review:
summary: >-
Assigned via combined automated annotation (UniProtKB-SubCell SL-0086 + UniRule
UR000111469). Cytoplasmic localization is consistent with the known localization of
several plant UMP-CMP kinase isoforms. The Arabidopsis UMP/CMP kinase characterized
by Zhou et al. was expressed and purified as a cytosolic protein [PMID:9576794].
action: ACCEPT
reason: >-
Cytoplasmic localization is well-supported for plant UMP-CMP kinases. Arabidopsis
has cytosolic UMK isoforms alongside the plastid-localized PUMPKIN. Even if this
particular gene model is chimeric, the UMP-CMP kinase portion would likely localize
to the cytoplasm or plastids.
supported_by:
- reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
supporting_text: >-
SUBCELLULAR LOCATION: Cytoplasm.
- reference_id: PMID:9576794
supporting_text: >-
A cDNA encoding the Arabidopsis thaliana uridine 5'-monophosphate (UMP)/cytidine
5'-monophosphate (CMP) kinase was isolated by complementation of a Saccharomyces
cerevisiae ura6 mutant.
- term:
id: GO:0009570
label: chloroplast stroma
evidence_type: IEA
original_reference_id: GO_REF:0000118
review:
summary: >-
Assigned via TreeGrafter matching to PANTHER PTN009208978. Chloroplast localization
is plausible for plant UMP-CMP kinases - rice YL2 and Arabidopsis PUMPKIN are
chloroplast-localized UMKs [PMID:29392476, PMID:30409856]. However, the specific
sub-compartment is questionable: rice YL2 localizes to thylakoid membranes, not the
stroma [PMID:29392476]. Additionally, the TreeGrafter match may be influenced by the
chimeric gene model, as CHI-fold FAP proteins also localize to plastids [PMID:22622584].
action: UNDECIDED
reason: >-
While chloroplast localization is plausible for a plant UMP-CMP kinase, the specific
annotation to chloroplast stroma (GO:0009570) is uncertain. The closest characterized
ortholog, rice YL2, is a thylakoid membrane-localized protein [PMID:29392476].
Furthermore, the TreeGrafter PANTHER match could reflect either the UMK domain or
the CHI-fold FAP domain, both of which can be plastid-targeted.
supported_by:
- reference_id: PMID:29392476
supporting_text: >-
YL2 encodes a thylakoid membrane-localized protein with significant sequence
similarity to UMP kinase proteins in prokaryotes and eukaryotes.
- reference_id: PMID:22622584
supporting_text: >-
These three CHI-fold proteins localize to plastids, the site of de novo
fatty-acid biosynthesis in plant cells.
references:
- id: GO_REF:0000002
title: >-
Gene Ontology annotation through association of InterPro records with GO terms
findings: []
- id: GO_REF:0000104
title: >-
Electronic Gene Ontology annotations created by transferring manual GO annotations
between related proteins based on shared sequence features
findings: []
- id: GO_REF:0000118
title: >-
TreeGrafter-generated GO annotations
findings: []
- id: GO_REF:0000120
title: >-
Combined Automated Annotation using Multiple IEA Methods
findings: []
- id: PMID:9576794
title: >-
Cloning, expression in Escherichia coli, and characterization of Arabidopsis thaliana
UMP/CMP kinase
findings:
- statement: >-
Arabidopsis UMP/CMP kinase preferentially uses ATP as phosphate donor and accepts both
UMP and CMP as substrates
supporting_text: >-
the UMP/CMP kinase preferentially uses ATP
- id: PMID:22622584
title: >-
Evolution of the chalcone-isomerase fold from fatty-acid binding to stereospecific catalysis
findings:
- statement: >-
CHI superfamily includes non-catalytic fatty acid-binding proteins (FAPs) that localize
to plastids
supporting_text: >-
These three CHI-fold proteins localize to plastids, the site of de novo fatty-acid
biosynthesis in plant cells. Furthermore, their expression profiles correlate with those
of core fatty-acid biosynthetic enzymes
- statement: FAPs are non-enzymatic ancestors of catalytically active chalcone isomerases
supporting_text: >-
the FAP discovery defines the adaptive evolution of a stereospecific and catalytically
'perfected' enzyme from a non-enzymatic ancestor over a defined period of plant evolution.
- id: PMID:29392476
title: >-
UMP kinase activity is involved in proper chloroplast development in rice
findings:
- statement: >-
Rice YL2 is a thylakoid membrane-localized UMP kinase essential for chloroplast
development
supporting_text: >-
YL2 encodes a thylakoid membrane-localized protein with significant sequence similarity
to UMP kinase proteins in prokaryotes and eukaryotes. Prokaryotic UMP kinase activity
was subsequently confirmed, with YL2 deficiency causing a significant reduction in
chlorophyll accumulation and photochemical efficiency.
- id: PMID:30409856
title: >-
PUMPKIN, the Sole Plastid UMP Kinase, Associates with Group II Introns and Alters
Their Metabolism
findings:
- statement: PUMPKIN is the sole plastid UMP kinase in Arabidopsis
supporting_text: >-
the sole plastid UMP kinase (PUMPKIN) in Arabidopsis (Arabidopsis thaliana) associates
specifically with the introns of the plastid transcripts trnG-UCC, trnV-UAC, petB,
petD, and ndhA in vivo
- id: PMID:33911077
title: >-
Chromosome-scale assembly and analysis of biomass crop Miscanthus lutarioriparius genome
findings:
- statement: >-
M. lutarioriparius genome has 68,328 predicted gene models, unusually high even for
an allotetraploid
supporting_text: >-
A total of 68,328 gene models were predicted, with an average CDS length of 1215 bp
core_functions:
- molecular_function:
id: GO:0033862
label: UMP kinase activity
directly_involved_in:
- id: GO:0006221
label: pyrimidine nucleotide biosynthetic process
locations:
- id: GO:0005737
label: cytoplasm
description: >-
The N-terminal domain of this chimeric gene model encodes a UMP-CMP kinase
(EC 2.7.4.14) that catalyzes the phosphorylation of pyrimidine nucleoside
monophosphates (UMP, CMP, dCMP) to their diphosphate forms using ATP. This is
a core function of the adenylate kinase family UMP-CMP kinase subfamily, supported
by HAMAP rule MF_03172 and InterPro family IPR006266. The protein also has CMP
kinase (GO:0036431) and dCMP kinase (GO:0036430) activities. Note that this gene
model is almost certainly chimeric - the 714 AA protein is 2-3.5x larger than any
characterized UMP-CMP kinase, and contains an unrelated C-terminal CHI-fold fatty
acid-binding domain (Pfam PF16035, positions 555-650) from a separate gene that
was incorrectly fused during genome annotation [PMID:33911077].
supported_by:
- reference_id: PMID:9576794
supporting_text: >-
A cDNA encoding the Arabidopsis thaliana uridine 5'-monophosphate (UMP)/cytidine
5'-monophosphate (CMP) kinase was isolated by complementation of a Saccharomyces
cerevisiae ura6 mutant.
- reference_id: PMID:29392476
supporting_text: >-
our results suggest that UMP kinase activity plays an essential role in
chloroplast development and regulating cpATPase biogenesis in rice.