NCGR_LOCUS27674

UniProt ID: A0A811PC48
Organism: Miscanthus lutarioriparius
๐Ÿ“ Provide Detailed Feedback

Gene Description

NCGR_LOCUS27674 encodes a predicted 714-amino acid protein annotated as UMP-CMP kinase (EC 2.7.4.14) in Miscanthus lutarioriparius, a C4 perennial grass. However, this gene model is almost certainly a chimeric artifact resulting from the incorrect fusion of at least two separate genes during genome annotation: an N-terminal UMP-CMP kinase (adenylate kinase family, Pfam PF00406) and a C-terminal chalcone isomerase-fold fatty acid-binding protein (Pfam PF16035, positions 555-650). At 714 AA, the protein is 2-3.5x larger than any characterized UMP-CMP kinase (e.g. Arabidopsis 202 AA [PMID:9576794], rice YL2 351 AA [PMID:29392476]). The conflicting automated annotations - kinase activities from the ADK domain and fatty acid binding / intramolecular lyase from the CHI-fold domain - are explained by this fusion. The M. lutarioriparius genome has 68,328 predicted gene models [PMID:33911077], unusually high even for an allotetraploid, consistent with gene model artifacts from the EVidenceModeler pipeline. All 13 GO annotations are IEA with no experimental evidence. The annotations pertaining to the UMP-CMP kinase portion (N-terminal domain) are likely correct for that domain, while annotations from the CHI-fold domain (fatty acid binding, intramolecular lyase activity) are artifacts of the chimeric model.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0005504 fatty acid binding
IEA
GO_REF:0000118
REMOVE
Summary: This annotation derives from TreeGrafter matching the C-terminal chalcone isomerase-fold domain (Pfam PF16035, Chalcone_2, positions 555-650) to PANTHER family PTHR47284 (FATTY-ACID-BINDING PROTEIN 2). The CHI-fold superfamily includes non-catalytic fatty acid-binding proteins (FAPs) that localize to plastids and participate in de novo fatty acid biosynthesis [PMID:22622584]. However, this annotation reflects a domain from what is almost certainly a chimeric gene model, not the true function of the UMP-CMP kinase that constitutes the N-terminal portion of this protein. The 714 AA size is 2-3.5x larger than any known UMP-CMP kinase, and no natural ADK-CHI domain fusion has been reported in any organism.
Reason: This annotation is an artifact of a chimeric gene model. The fatty acid binding function derives from the C-terminal CHI-fold/FAP domain (positions 555-650), which is a separate gene incorrectly fused with the N-terminal UMP-CMP kinase during genome annotation of M. lutarioriparius [PMID:33911077]. FAP proteins in the CHI superfamily do bind fatty acids [PMID:22622584], but this activity belongs to a distinct gene product, not a UMP-CMP kinase.
Supporting Evidence:
PMID:22622584
These three CHI-fold proteins localize to plastids, the site of de novo fatty-acid biosynthesis in plant cells. Furthermore, their expression profiles correlate with those of core fatty-acid biosynthetic enzymes
PMID:33911077
A total of 68,328 gene models were predicted, with an average CDS length of 1215 bp
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-notes.md
This protein is almost certainly a chimeric gene model artifact resulting from incorrect fusion of two separate genes during genome annotation.
GO:0005524 ATP binding
IEA
GO_REF:0000002
ACCEPT
Summary: ATP binding is assigned via InterPro match to IPR000850 (Adenylate/UMP-CMP kinase). UMP-CMP kinases catalyze the phosphorylation of pyrimidine nucleoside monophosphates at the expense of ATP, so ATP binding is an inherent part of their catalytic mechanism. The UniProt record confirms ATP binding sites at positions 36-41, 149, and 192 via HAMAP-Rule MF_03172. This annotation is consistent with the N-terminal UMP-CMP kinase domain of this protein.
Reason: ATP binding is a core requirement for UMP-CMP kinase activity. The protein contains a P-loop NTPase domain with conserved ATP binding residues identified by HAMAP and InterPro. All characterized UMP-CMP kinases require ATP as the phosphate donor [PMID:9576794].
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Catalyzes the phosphorylation of pyrimidine nucleoside monophosphates at the expense of ATP.
PMID:9576794
the UMP/CMP kinase preferentially uses ATP
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-deep-research-manual.md
Core function: phosphorylation of UMP, CMP, and dCMP to their diphosphate forms using ATP as the phosphate donor.
GO:0016776 phosphotransferase activity, phosphate group as acceptor
IEA
GO_REF:0000002
ACCEPT
Summary: Assigned via InterPro match to IPR006266 (UMP-CMP kinase). GO:0016776 is an ancestor of GO:0033862 (UMP kinase activity) in the GO hierarchy, so this is a broader but valid annotation for a UMP-CMP kinase. UMP-CMP kinases transfer a phosphate group from ATP to nucleoside monophosphate substrates (UMP, CMP, dCMP), which already carry a phosphate group, making them phosphotransferases with phosphate group as acceptor.
Reason: This is a valid parent term of the more specific UMP/CMP/dCMP kinase activities also annotated to this protein. While broader than the specific kinase activities (GO:0033862, GO:0036430, GO:0036431), it correctly captures the biochemical mechanism. The InterPro match to IPR006266 (UMP-CMP kinase family) is appropriate for the N-terminal domain.
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Reaction=CMP + ATP = CDP + ADP
GO:0016872 intramolecular lyase activity
IEA
GO_REF:0000002
REMOVE
Summary: This annotation derives from InterPro match to IPR036298 (Chalcone isomerase superfamily), which is mapped to GO:0016872 via InterPro2GO. Chalcone isomerases catalyze intramolecular cyclization of chalcones to flavanones. However, this annotation is inappropriate for two reasons: (1) the CHI-fold domain in this protein likely belongs to the non-catalytic FAP (fatty acid-binding) subfamily, which lacks the catalytic residues for isomerase activity [PMID:22622584]; and (2) this domain is part of a chimeric gene model and belongs to a separate gene product. The InterPro2GO mapping from IPR036298 to GO:0016872 is overly broad, applying catalytic function to the entire superfamily including non-catalytic members.
Reason: This is a doubly erroneous annotation: the intramolecular lyase activity derives from the CHI superfamily InterPro match, but the CHI-fold domain in this protein is likely a FAP subfamily member that is non-catalytic. Furthermore, the CHI-fold domain (positions 555-650) belongs to a separate gene that was incorrectly fused with the UMP-CMP kinase in this chimeric gene model. A UMP-CMP kinase has no intramolecular lyase activity.
Supporting Evidence:
PMID:22622584
the FAP discovery defines the adaptive evolution of a stereospecific and catalytically 'perfected' enzyme from a non-enzymatic ancestor over a defined period of plant evolution.
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-notes.md
C-terminal (positions 555-650) Chalcone isomerase domain (Pfam PF16035, Chalcone_2) - functionally unrelated to the N-terminal UMP-CMP kinase domain.
GO:0019205 nucleobase-containing compound kinase activity
IEA
GO_REF:0000002
ACCEPT
Summary: Assigned via InterPro match to IPR000850 (Adenylate/UMP-CMP kinase). GO:0019205 is an ancestor of GO:0033862 (UMP kinase activity) in the GO hierarchy. UMP-CMP kinases phosphorylate nucleoside monophosphates (nucleobase-containing compounds), so this broader annotation is valid.
Reason: This is a valid broader term for UMP-CMP kinase activity. The protein's N-terminal domain belongs to the adenylate kinase family UMP-CMP kinase subfamily, which phosphorylates nucleobase-containing compounds (UMP, CMP, dCMP). While less specific than GO:0033862/GO:0036430/GO:0036431, it is not incorrect.
Supporting Evidence:
PMID:9576794
both UMP (Km = 153 microM) and CMP (Km = 266 microM) were equally acceptable as the phosphate acceptor.
GO:0033862 UMP kinase activity
IEA
GO_REF:0000104
ACCEPT
Summary: Assigned via UniRule UR000111469 based on shared sequence features with characterized UMP-CMP kinases. UMP kinase activity (catalysis of UMP + ATP = UDP + ADP) is a core function of the UMP-CMP kinase family. The UniProt record explicitly lists this catalytic activity with EC 2.7.4.14. This annotation applies to the N-terminal adenylate kinase domain.
Reason: UMP kinase activity is the primary molecular function of UMP-CMP kinases. The protein contains the conserved ADK domain (PF00406) and matches HAMAP rule MF_03172 for the UMP-CMP kinase subfamily. Confirmed in the rice ortholog YL2 [PMID:29392476] and Arabidopsis [PMID:9576794].
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Reaction=UMP + ATP = UDP + ADP
PMID:29392476
Prokaryotic UMP kinase activity was subsequently confirmed, with YL2 deficiency causing a significant reduction in chlorophyll accumulation and photochemical efficiency.
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-deep-research-manual.md
Plant UMP-CMP kinases catalyze the phosphorylation of pyrimidine nucleoside monophosphates (UMP, CMP, dCMP) using ATP.
GO:0036430 dCMP kinase activity
IEA
GO_REF:0000104
ACCEPT
Summary: Assigned via UniRule UR000111469. dCMP kinase activity (catalysis of dCMP + ATP = dCDP + ADP) is a known activity of UMP-CMP kinases. The UniProt record explicitly lists this catalytic reaction.
Reason: dCMP kinase activity is a well-characterized function of the UMP-CMP kinase family. The catalytic reaction is explicitly listed in the UniProt record. This applies to the N-terminal kinase domain.
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Reaction=dCMP + ATP = dCDP + ADP
GO:0036431 CMP kinase activity
IEA
GO_REF:0000104
ACCEPT
Summary: Assigned via UniRule UR000111469. CMP kinase activity (catalysis of CMP + ATP = CDP + ADP) is a core function of UMP-CMP kinases. The UniProt record explicitly lists this catalytic reaction.
Reason: CMP kinase activity is a core function of the UMP-CMP kinase family. Arabidopsis UMP/CMP kinase accepts both UMP and CMP as phosphate acceptors [PMID:9576794].
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Reaction=CMP + ATP = CDP + ADP
PMID:9576794
both UMP (Km = 153 microM) and CMP (Km = 266 microM) were equally acceptable as the phosphate acceptor.
GO:0006207 'de novo' pyrimidine nucleobase biosynthetic process
IEA
GO_REF:0000002
MODIFY
Summary: Assigned via InterPro match to IPR006266 (UMP-CMP kinase). UMP-CMP kinases play an important role in de novo pyrimidine nucleotide biosynthesis by phosphorylating UMP to UDP. However, GO:0006207 specifically refers to pyrimidine nucleobase biosynthesis (the base itself), whereas UMP-CMP kinases act on nucleotides (nucleobase + sugar + phosphate). The more appropriate term is GO:0006221 (pyrimidine nucleotide biosynthetic process), which is already annotated.
Reason: UMP-CMP kinases act on pyrimidine nucleotides, not nucleobases. GO:0006207 is technically not the correct process for a nucleoside monophosphate kinase. GO:0006221 (pyrimidine nucleotide biosynthetic process) is more appropriate and already annotated.
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Plays an important role in de novo pyrimidine nucleotide biosynthesis. Has preference for UMP and CMP as phosphate acceptors.
GO:0006221 pyrimidine nucleotide biosynthetic process
IEA
GO_REF:0000120
ACCEPT
Summary: Assigned via combined automated annotation (InterPro IPR006266 + UniRule UR000111469). UMP-CMP kinases participate in pyrimidine nucleotide biosynthesis by catalyzing the phosphorylation of pyrimidine nucleoside monophosphates (UMP, CMP) to their diphosphate forms (UDP, CDP). The UniProt record explicitly states the protein plays an important role in de novo pyrimidine nucleotide biosynthesis.
Reason: Pyrimidine nucleotide biosynthesis is a core biological process for UMP-CMP kinases. The phosphorylation of UMP to UDP and CMP to CDP are essential steps in maintaining the pyrimidine nucleotide pool. Confirmed in rice YL2 [PMID:29392476] and Arabidopsis PUMPKIN [PMID:30409856].
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
Plays an important role in de novo pyrimidine nucleotide biosynthesis.
PMID:29392476
our results suggest that UMP kinase activity plays an essential role in chloroplast development and regulating cpATPase biogenesis in rice.
GO:0005634 nucleus
IEA
GO_REF:0000120
UNDECIDED
Summary: Assigned via combined automated annotation (UniProtKB-SubCell SL-0191 + UniRule UR000111469). The UniProt record states nuclear localization based on HAMAP rule MF_03172. However, for plant UMP-CMP kinases, characterized orthologs are predominantly cytosolic (Arabidopsis UMK) or plastid-localized (rice YL2, Arabidopsis PUMPKIN) [PMID:29392476, PMID:30409856]. Nuclear localization is not well-established for plant UMKs.
Reason: Nuclear localization is inferred from homology to animal UMP-CMP kinases via the HAMAP rule, but characterized plant orthologs localize to the cytoplasm or plastids rather than the nucleus. Without direct experimental evidence in plants, the nuclear localization annotation should be treated with caution.
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
SUBCELLULAR LOCATION: Cytoplasm. Nucleus.
PMID:30409856
the sole plastid UMP kinase (PUMPKIN) in Arabidopsis (Arabidopsis thaliana)
GO:0005737 cytoplasm
IEA
GO_REF:0000120
ACCEPT
Summary: Assigned via combined automated annotation (UniProtKB-SubCell SL-0086 + UniRule UR000111469). Cytoplasmic localization is consistent with the known localization of several plant UMP-CMP kinase isoforms. The Arabidopsis UMP/CMP kinase characterized by Zhou et al. was expressed and purified as a cytosolic protein [PMID:9576794].
Reason: Cytoplasmic localization is well-supported for plant UMP-CMP kinases. Arabidopsis has cytosolic UMK isoforms alongside the plastid-localized PUMPKIN. Even if this particular gene model is chimeric, the UMP-CMP kinase portion would likely localize to the cytoplasm or plastids.
Supporting Evidence:
file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
SUBCELLULAR LOCATION: Cytoplasm.
PMID:9576794
A cDNA encoding the Arabidopsis thaliana uridine 5'-monophosphate (UMP)/cytidine 5'-monophosphate (CMP) kinase was isolated by complementation of a Saccharomyces cerevisiae ura6 mutant.
GO:0009570 chloroplast stroma
IEA
GO_REF:0000118
UNDECIDED
Summary: Assigned via TreeGrafter matching to PANTHER PTN009208978. Chloroplast localization is plausible for plant UMP-CMP kinases - rice YL2 and Arabidopsis PUMPKIN are chloroplast-localized UMKs [PMID:29392476, PMID:30409856]. However, the specific sub-compartment is questionable: rice YL2 localizes to thylakoid membranes, not the stroma [PMID:29392476]. Additionally, the TreeGrafter match may be influenced by the chimeric gene model, as CHI-fold FAP proteins also localize to plastids [PMID:22622584].
Reason: While chloroplast localization is plausible for a plant UMP-CMP kinase, the specific annotation to chloroplast stroma (GO:0009570) is uncertain. The closest characterized ortholog, rice YL2, is a thylakoid membrane-localized protein [PMID:29392476]. Furthermore, the TreeGrafter PANTHER match could reflect either the UMK domain or the CHI-fold FAP domain, both of which can be plastid-targeted.
Supporting Evidence:
PMID:29392476
YL2 encodes a thylakoid membrane-localized protein with significant sequence similarity to UMP kinase proteins in prokaryotes and eukaryotes.
PMID:22622584
These three CHI-fold proteins localize to plastids, the site of de novo fatty-acid biosynthesis in plant cells.

Core Functions

The N-terminal domain of this chimeric gene model encodes a UMP-CMP kinase (EC 2.7.4.14) that catalyzes the phosphorylation of pyrimidine nucleoside monophosphates (UMP, CMP, dCMP) to their diphosphate forms using ATP. This is a core function of the adenylate kinase family UMP-CMP kinase subfamily, supported by HAMAP rule MF_03172 and InterPro family IPR006266. The protein also has CMP kinase (GO:0036431) and dCMP kinase (GO:0036430) activities. Note that this gene model is almost certainly chimeric - the 714 AA protein is 2-3.5x larger than any characterized UMP-CMP kinase, and contains an unrelated C-terminal CHI-fold fatty acid-binding domain (Pfam PF16035, positions 555-650) from a separate gene that was incorrectly fused during genome annotation [PMID:33911077].

Molecular Function:
UMP kinase activity
Cellular Locations:
Supporting Evidence:
  • PMID:9576794
    A cDNA encoding the Arabidopsis thaliana uridine 5'-monophosphate (UMP)/cytidine 5'-monophosphate (CMP) kinase was isolated by complementation of a Saccharomyces cerevisiae ura6 mutant.
  • PMID:29392476
    our results suggest that UMP kinase activity plays an essential role in chloroplast development and regulating cpATPase biogenesis in rice.

References

Gene Ontology annotation through association of InterPro records with GO terms
Electronic Gene Ontology annotations created by transferring manual GO annotations between related proteins based on shared sequence features
TreeGrafter-generated GO annotations
Combined Automated Annotation using Multiple IEA Methods
Cloning, expression in Escherichia coli, and characterization of Arabidopsis thaliana UMP/CMP kinase
  • Arabidopsis UMP/CMP kinase preferentially uses ATP as phosphate donor and accepts both UMP and CMP as substrates
    "the UMP/CMP kinase preferentially uses ATP"
Evolution of the chalcone-isomerase fold from fatty-acid binding to stereospecific catalysis
  • CHI superfamily includes non-catalytic fatty acid-binding proteins (FAPs) that localize to plastids
    "These three CHI-fold proteins localize to plastids, the site of de novo fatty-acid biosynthesis in plant cells. Furthermore, their expression profiles correlate with those of core fatty-acid biosynthetic enzymes"
  • FAPs are non-enzymatic ancestors of catalytically active chalcone isomerases
    "the FAP discovery defines the adaptive evolution of a stereospecific and catalytically 'perfected' enzyme from a non-enzymatic ancestor over a defined period of plant evolution."
UMP kinase activity is involved in proper chloroplast development in rice
  • Rice YL2 is a thylakoid membrane-localized UMP kinase essential for chloroplast development
    "YL2 encodes a thylakoid membrane-localized protein with significant sequence similarity to UMP kinase proteins in prokaryotes and eukaryotes. Prokaryotic UMP kinase activity was subsequently confirmed, with YL2 deficiency causing a significant reduction in chlorophyll accumulation and photochemical efficiency."
PUMPKIN, the Sole Plastid UMP Kinase, Associates with Group II Introns and Alters Their Metabolism
  • PUMPKIN is the sole plastid UMP kinase in Arabidopsis
    "the sole plastid UMP kinase (PUMPKIN) in Arabidopsis (Arabidopsis thaliana) associates specifically with the introns of the plastid transcripts trnG-UCC, trnV-UAC, petB, petD, and ndhA in vivo"
Chromosome-scale assembly and analysis of biomass crop Miscanthus lutarioriparius genome
  • M. lutarioriparius genome has 68,328 predicted gene models, unusually high even for an allotetraploid
    "A total of 68,328 gene models were predicted, with an average CDS length of 1215 bp"

Deep Research

Falcon

(NCGR_LOCUS27674-deep-research-falcon.md)
Functional annotation research report: **NCGR_LOCUS27674** (UniProt **A0A811PC48**) from *Miscanthus lutarioriparius* Falcon Edison Scientific Literature 21 citations 2026-04-22T22:59:44.602229

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Functional annotation research report: NCGR_LOCUS27674 (UniProt A0A811PC48) from Miscanthus lutarioriparius

Executive summary (identity verification and evidence limits)

The gene symbol NCGR_LOCUS27674 is currently literature-sparse for Miscanthus lutarioriparius: none of the retrieved Miscanthus-focused papers explicitly mention NCGR_LOCUS27674 or the UniProt accession A0A811PC48. Consequently, the most defensible functional annotation is based on the UniProt-provided identity (UMPโ€“CMP kinase; EC 2.7.4.14; adenylate-kinase family) plus experimental evidence from plant homologs (notably Arabidopsis thaliana and rice) and conserved pathway context. This report does not substitute a different gene with a similar symbol in another organism. (mitros2020genomebiologyof pages 6-7, wang2024physiologicalandtranscriptomic pages 3-5, mitros2020genomebiologyof pages 1-2)

1) Key concepts and definitions (current understanding)

1.1 Enzyme class and reaction

UMPโ€“CMP kinase / uridylateโ€“cytidylate kinase (EC 2.7.4.14) is a nucleoside monophosphate (NMP) kinase that catalyzes phosphorylation of pyrimidine NMPs using ATP (typically MgATP) to produce the corresponding NDPs (e.g., UMPโ†’UDP; CMPโ†’CDP). In plants, this activity is described as a bifunctional UK/CK that phosphorylates UMP and CMP (and generally has lower activity with dCMP), contributing to balancing ribo- and deoxyribonucleotide pools. (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 70-73)

1.2 Substrate specificity in plant homologs

Recent biochemical characterization of three Arabidopsis AMK-like UMP kinases (UMK1/UMK2/UMK3) shows all can use UMP, CMP, and dCMP with distinct efficiencies; UMK3 is the most efficient for UMP, whereas dCMP is a poorer substrate overall and UMK1/UMK2 are relatively better with dCMP than UMK3. (rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree pages 75-78)

1.3 Gene family architecture in plants

Plant UK/CK (UMPโ€“CMP kinase) functions are distributed across multiple subfamilies. A plant-focused analysis describes AK-like UK/CK proteins and also a prokaryotic-type lineage; localization can include cytosolic and mitochondrial AK-like enzymes and a plastid-localized bacterial-type enzyme (PUMPKIN) with additional plastid gene-expression roles. (kleczkowski2025adenylatedrivenequilibrationof pages 1-3)

2) Recent developments and latest research (prioritizing 2023โ€“2024)

2.1 2024: compartmentalized functions and kinetics in Arabidopsis

A 2024 dissertation (with results described as under review at the time) provides a detailed dissection of Arabidopsis UMK paralogs:
- Localization: UMK1 and UMK3 are cytosolic, whereas UMK2 is mitochondrial in transient-expression assays. (rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree media 679728a2)
- Functional specialization: UMK3 is described as the central UMP kinase for phosphorylation of de novoโ€“synthesized UMP; UMK2 appears specialized for mitochondrial dCMP phosphorylation, supporting mitochondrial DNA precursor supply during germination. (rinne2024characterizationofthree pages 1-6, rinne2024characterizationofthree pages 61-65)
- Quantitative enzyme kinetics (UMP substrate): UMK3 shows markedly higher catalytic efficiency for UMP than UMK1/UMK2 (see Section 5). (rinne2024characterizationofthree pages 20-24)

Publication details: Rinne J. Characterization of three uridine monophosphate kinases from Arabidopsis thaliana. (2024-01). DOI/URL: https://doi.org/10.15488/16818 (rinne2024characterizationofthree pages 20-24)

2.2 2024: Miscanthus transcriptomics context (not locus-specific)

A 2024 Miscanthus lutarioriparia (syn. lutarioriparius) study used RNA-seq to analyze cadmium stress responses and describes an annotation workflow (StringTie/RSEM, BLAST functional annotation, GO/KEGG enrichment). However, the text examined did not expose identifiers mapping to NCGR_LOCUS27674/A0A811PC48; the authors note deposition in NCBI SRA (PRJNA733881), indicating that locus-to-accession mapping may reside in supplementary or deposited annotation files. (wang2024physiologicalandtranscriptomic pages 3-5)

Publication details: Wang J. et al. PLOS ONE (2024-05). DOI/URL: https://doi.org/10.1371/journal.pone.0302940 (wang2024physiologicalandtranscriptomic pages 3-5)

3) Current applications and real-world implementations

3.1 Miscanthus as a bioenergy and phytoremediation crop (context for relevance)

A 2024 study frames M. lutarioriparia as a โ€œpromising energy cropโ€ used for phytoremediation of abandoned mine soils; it reports that M. lutarioriparia could reduce Cd content in mining soil by 29.82%, with maximal Cd concentrations in aboveground and belowground tissues of ~185.65 mg/kg and ~186.8 mg/kg, respectively (study-specific conditions). While this does not directly implicate NCGR_LOCUS27674, it provides real-world context in which core metabolic โ€œhousekeepingโ€ enzymes like pyrimidine kinases could affect biomass accumulation and stress tolerance through nucleotide/energy balance. (wang2024physiologicalandtranscriptomic pages 3-5)

3.2 Plant growth and organelle function implications from homolog genetics

Plant UMP/CMP kinase activity is implicated in fundamental growth and organelle biogenesis:
- In Arabidopsis, UMK3 is reported as essential for normal growth and reproductive/embryonic viability (based on transmission/phenotype analyses described), consistent with a core role in nucleotide supply. (rinne2024characterizationofthree pages 1-6, rinne2024characterizationofthree pages 70-73)
- In rice, mutation of a chloroplast-associated UMP kinase (YL2) causes chloroplast developmental defects and reduced photosynthetic performance, including decreased cpATPase activity (see statistics below). (chen2018umpkinaseactivity pages 5-8)

4) Expert opinions and analysis from authoritative sources (with interpretation boundaries)

4.1 Best-supported functional statement for NCGR_LOCUS27674 (A0A811PC48)

Given the UniProt-provided identity and strong conservation of UK/CK activities across plants, the most evidence-aligned annotation is:

NCGR_LOCUS27674 likely encodes an intracellular UMPโ€“CMP kinase (EC 2.7.4.14) that phosphorylates UMP and CMP (and likely dCMP to a lesser extent) to generate UDP/CDP/(d)CDP, thereby supporting pyrimidine nucleotide pool homeostasis required for RNA synthesis, sugar-nucleotide formation, membrane lipid metabolism, and DNA precursor supply. This is a functional inference for the Miscanthus protein, supported by direct plant homolog biochemistry, genetics, and compartmentalization evidence. (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree pages 1-6)

4.2 Subcellular localization: what can and cannot be concluded

  • Not established for Miscanthus: No retrieved Miscanthus paper provided targeting data for A0A811PC48/NCGR_LOCUS27674. (mitros2020genomebiologyof pages 6-7, wang2024physiologicalandtranscriptomic pages 3-5)
  • Plant homolog pattern: Arabidopsis paralogs demonstrate that closely related AMK-like UMKs can localize to cytosol (UMK1/UMK3) or mitochondria (UMK2). Thus, unless Miscanthus-specific targeting peptides are examined, localization should be reported as intracellular, likely cytosolic or mitochondrial depending on isoform/targeting, rather than asserted as organelle-specific. (rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree media 679728a2)

4.3 Pathway integration (pyrimidine metabolism and organellar nucleotide supply)

A plant UK/CK description emphasizes that phosphorylation of UMP/CMP connects monophosphate pools to diphosphate pools using MgATP, and that plants contain both AK-like and prokaryotic-type UK/CKs with cytosolic/mitochondrial/plastid associations. This supports an interpretation of NCGR_LOCUS27674 as a node in pyrimidine nucleotide interconversion that can influence organellar nucleotide availability (e.g., mitochondrial DNA precursor supply), as directly proposed for Arabidopsis UMK2/UMK3 in germination. (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 61-65)

5) Relevant statistics and data (recent experimental studies)

5.1 Enzyme kinetic parameters (Arabidopsis homologs; UMP substrate)

Measured kinetic parameters for Arabidopsis UMKs using UMP as acceptor (units as reported):
- UMK1: KM โ‰ˆ 0.646 mM, kcat โ‰ˆ 41.1 sโปยน, kcat/KM โ‰ˆ 63.6
- UMK2: KM โ‰ˆ 0.694 mM, kcat โ‰ˆ 95.1 sโปยน, kcat/KM โ‰ˆ 137
- UMK3: KM โ‰ˆ 0.24 mM, kcat โ‰ˆ 252.1 sโปยน, kcat/KM โ‰ˆ 1050

These data indicate UMK3 is much more catalytically efficient for UMP phosphorylation than UMK1/UMK2 in vitro, consistent with UMK3 being described as the โ€œcentralโ€ cytosolic UMP kinase in vivo. (rinne2024characterizationofthree pages 20-24)

Publication details: Rinne J. (2024-01). DOI/URL: https://doi.org/10.15488/16818 (rinne2024characterizationofthree pages 20-24)

5.2 Organelle phenotype statistic (rice chloroplast UMPK)

In a rice mutant defective in a chloroplast-associated UMP kinase (YL2), chloroplast function is impaired; the study reports a ~36.3% decrease in cpATPase activity compared with wild type. This supports the broader conclusion that pyrimidine monophosphate kinase activities can be critical for organelle development and photosynthetic capacity, although this specific chloroplast localization is not established for Miscanthus NCGR_LOCUS27674. (chen2018umpkinaseactivity pages 5-8)

Publication details: Chen F. et al. Photosynthesis Research (2018-02). DOI/URL: https://doi.org/10.1007/s11120-017-0477-5 (chen2018umpkinaseactivity pages 5-8)

5.3 Real-world Miscanthus implementation statistics (Cd phytoremediation)

In M. lutarioriparia cadmium tolerance work, reported performance includes 29.82% reduction of Cd content in mining soil and maximal tissue Cd concentrations around 185.65โ€“186.8 mg/kg (study-specific). (wang2024physiologicalandtranscriptomic pages 3-5)

Publication details: Wang J. et al. PLOS ONE (2024-05). DOI/URL: https://doi.org/10.1371/journal.pone.0302940 (wang2024physiologicalandtranscriptomic pages 3-5)

6) Visual evidence (tables/figures)

Cropped visual extractions from the 2024 Arabidopsis UMK work indicate that kinetic parameters are presented in Tables 1โ€“3 and that Figure 7 shows cytosolic localization for UMK1/UMK3 and mitochondrial localization for UMK2, supporting the compartmentalization claims used for inference. (rinne2024characterizationofthree media 679728a2, rinne2024characterizationofthree media 32a65f5e, rinne2024characterizationofthree media 618fd8ed)

7) Evidence table for annotation (structured summary)

Annotation element Key points Evidence type Best supporting citations
Molecular function NCGR_LOCUS27674 / UniProt A0A811PC48 is annotated in UniProt as UMP-CMP kinase / deoxycytidylate kinase (EC 2.7.4.14), a nucleoside monophosphate kinase in the adenylate kinase family; plant UK/CK enzymes phosphorylate UMP and CMP, and can show lower activity with dCMP. Inference for Miscanthus from UniProt/domain assignment; experimental from plant homolog biochemistry (rinne2024characterizationofthree pages 1-6, kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 20-24)
Reaction catalyzed Canonical reaction: ATP + UMP โ‡Œ ADP + UDP; plant uridylate-cytidylate kinases also catalyze ATP + CMP โ‡Œ ADP + CDP and can phosphorylate dCMP โ†’ dCDP. Experimental in Arabidopsis/plant literature; inferred for Miscanthus ortholog (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 70-73, rinne2024characterizationofthree pages 20-24)
Substrates Primary phosphate acceptors in plants: UMP, CMP; dCMP is accepted but generally poorer than ribonucleotide substrates. Arabidopsis AMK-like UMKs all used UMP/CMP/dCMP with differing efficiencies. Experimental homolog evidence; inferred for Miscanthus (rinne2024characterizationofthree pages 1-6, rinne2024characterizationofthree pages 75-78, rinne2024characterizationofthree pages 20-24)
Products UDP, CDP, dCDP plus ADP, depending on substrate. These products feed synthesis of UTP/CTP/dCTP pools needed for RNA, membrane lipid, sugar-nucleotide, and DNA metabolism. Experimental + pathway inference (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 70-73, rinne2024characterizationofthree pages 75-78)
Cofactors / biochemical requirements Uses MgATP as phosphate donor; only Mg-free (d)NMP serves as phosphate acceptor in plant UK/CK description. Experimental/review evidence (kleczkowski2025adenylatedrivenequilibrationof pages 1-3)
Enzyme family / domains UniProt places A0A811PC48 in the adenylate kinase family, UMP-CMP kinase subgroup. This is consistent with plant AK-like UK/CK enzymes; a distinct prokaryotic/eubacterial-type plastid UMK lineage also exists in plants. Inference for Miscanthus + experimental/comparative plant evidence (rinne2024characterizationofthree pages 1-6, kleczkowski2025adenylatedrivenequilibrationof pages 1-3)
Subcellular localization Specific localization of NCGR_LOCUS27674 is not directly shown in retrieved Miscanthus literature. In plants, AK-like UMKs can be cytosolic or mitochondrial; plastid bacterial-type UMKs also exist. Arabidopsis UMK1/UMK3 localize to cytosol, UMK2 to mitochondria. Therefore Miscanthus A0A811PC48 is best annotated cautiously as likely intracellular (cytosol or mitochondrion), unresolved without species-specific targeting evidence. Direct evidence lacking for Miscanthus; localization inferred from plant homolog families (rinne2024characterizationofthree pages 1-6, rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree media 679728a2)
Biological roles Central role in pyrimidine nucleotide metabolism, balancing UMP/CMP/dCMP phosphorylation and supporting UTP/CTP/dCTP supply for RNA synthesis, UDP-sugar formation, membrane biogenesis, and DNA replication. Experimental homolog evidence + pathway inference (rinne2024characterizationofthree pages 70-73, rinne2024characterizationofthree pages 75-78)
Pathway context Functions in pyrimidine de novo synthesis output utilization and salvage, linking monophosphate pools to diphosphate/triphosphate pools. In Arabidopsis, UMK3 is the main route for phosphorylation of de novo synthesized UMP; UMK2/UMK3 support mitochondrial dCTP supply during germination. Experimental homolog evidence (rinne2024characterizationofthree pages 82-85, rinne2024characterizationofthree pages 70-73, rinne2024characterizationofthree pages 61-65)
Kinetic parameters: UMP Arabidopsis UMKs (AMK-like family): UMK1 KM ~0.646 mM, kcat ~41.1 s^-1, kcat/KM ~63.6; UMK2 KM ~0.694 mM, kcat ~95.1 s^-1, kcat/KM ~137; UMK3 KM ~0.24 mM, kcat ~252.1 s^-1, kcat/KM ~1050. UMK3 is the most efficient UMP kinase among the three. Experimental homolog biochemistry (rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree media 679728a2)
Kinetic parameters: CMP / dCMP Rinne 2024 reported measurable activity with CMP and dCMP for all three Arabidopsis UMKs; UMK3 showed high efficiency for CMP, while dCMP was a poorer substrate overall and UMK1/UMK2 were relatively better with dCMP than UMK3. Experimental homolog biochemistry (rinne2024characterizationofthree pages 1-6, rinne2024characterizationofthree pages 75-78, rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree media 679728a2)
Regulation / assay caveats Apparent strong substrate inhibition of UMK3 in a coupled assay was attributed by LC-MS follow-up to ATP-mediated inhibition of the coupling enzyme (pyruvate kinase) rather than genuine UMK3 inhibition. Experimental homolog evidence (rinne2024characterizationofthree pages 20-24)
Mutant / phenotype evidence in plants In Arabidopsis, UMK3 is essential: null alleles were not transmitted, and reduced UMK3 activity caused growth defects and altered pyrimidine pools; UMK2 and UMK3 contribute to mitochondrial DNA replication during germination. This supports a core housekeeping role for plant UMP/CMP kinases. Experimental homolog genetics (rinne2024characterizationofthree pages 1-6, rinne2024characterizationofthree pages 70-73, rinne2024characterizationofthree pages 61-65, rinne2024characterizationofthree pages 75-78)
Chloroplast-related phenotype evidence A distinct rice UMP kinase, YL2/LOC_Os01g73450, is a chloroplast/thylakoid-associated UMP kinase required for normal chloroplast development; mutant plants show yellow leaves, reduced chlorophyll, and decreased cpATPase activity (~36.3% reduction). This supports the broader importance of plant UMP kinase activity for organelle function, though it does not prove chloroplast localization of NCGR_LOCUS27674. Experimental, but from a different plant UMK lineage/context (chen2018umpkinaseactivity pages 5-8)
Plant salvage review context Plant pyrimidine salvage is closely integrated with organelle biogenesis. Reviews and recent work describe cytosolic, mitochondrial, and plastid nucleotide salvage/kinase functions, with plastid bacterial-type UK/CK (PUMPKIN) coupling nucleotide metabolism to plastid gene-expression processes. Experimental/review context (kleczkowski2025adenylatedrivenequilibrationof pages 1-3)
Notes for Miscanthus identifier verification Retrieved Miscanthus papers did not mention NCGR_LOCUS27674 or A0A811PC48 directly. Thus the annotation should be stated as: the gene symbol is literature-sparse for this specific protein; functional annotation is inferred from UniProt and conserved plant UMP/CMP kinase biology. Miscanthus genome papers indicate complex mapping due to polyploidy/homoeology, so locus cross-referencing may require annotation tables/supplementary files. Direct negative finding + cautious inference (mitros2020genomebiologyof pages 6-7, wang2024physiologicalandtranscriptomic pages 3-5, mitros2020genomebiologyof pages 1-2, kim2025unravelingphylogeneticrelationships pages 4-6)
Best concise functional annotation for NCGR_LOCUS27674 Probable intracellular UMP/CMP kinase (EC 2.7.4.14) that phosphorylates UMP and CMP, likely also dCMP, to maintain pyrimidine nucleotide pools for RNA/DNA precursor synthesis and related biosynthetic processes; exact subcellular localization in M. lutarioriparius remains unverified. Integrated inference from UniProt + conserved plant evidence (rinne2024characterizationofthree pages 1-6, kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 20-24, mitros2020genomebiologyof pages 6-7)

Table: This table compiles the strongest available evidence for annotating Miscanthus lutarioriparius NCGR_LOCUS27674 (UniProt A0A811PC48) as a UMP-CMP kinase. It distinguishes direct evidence from homolog-based inference and highlights where species-specific uncertainty remains.

Gene product: probable UMPโ€“CMP kinase / uridylateโ€“cytidylate kinase (EC 2.7.4.14) (adenylate kinase family). (kleczkowski2025adenylatedrivenequilibrationof pages 1-3)

Molecular function: ATP-dependent phosphorylation of pyrimidine nucleoside monophosphates (primarily UMP and CMP; likely also dCMP) to corresponding diphosphates. (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 20-24)

Biological process/pathway: pyrimidine nucleotide interconversion and salvage/de novo output utilization; supports production of UDP/CDP used to build UTP/CTP/dCTP pools required for RNA synthesis, sugar-nucleotide production, and DNA precursor supply (including organellar DNA replication). (rinne2024characterizationofthree pages 70-73, rinne2024characterizationofthree pages 61-65)

Cellular localization: intracellular; specific compartment in M. lutarioriparius remains unverified. Plant homologs include cytosolic and mitochondrial isoforms; plastid bacterial-type UK/CKs also exist in plants (not established for A0A811PC48). (kleczkowski2025adenylatedrivenequilibrationof pages 1-3, rinne2024characterizationofthree pages 20-24, rinne2024characterizationofthree media 679728a2)

Because Miscanthus-specific locus-level literature was not found in the retrieved set, the key gaps are (i) mapping of NCGR_LOCUS27674 to the M. lutarioriparius reference annotation used by specific studies and (ii) experimental confirmation of subcellular targeting. Miscanthus genomics work emphasizes polyploidy/homoeology, which can complicate mapping of locus IDs across assemblies and annotations. (mitros2020genomebiologyof pages 1-2, mitros2020genomebiologyof pages 6-7)

References

  1. (mitros2020genomebiologyof pages 6-7): Therese Mitros, Adam M. Session, Brandon T. James, Guohong Albert Wu, Mohammad B. Belaffif, Lindsay V. Clark, Shengqiang Shu, Hongxu Dong, Adam Barling, Jessica R. Holmes, Jessica E. Mattick, Jessen V. Bredeson, Siyao Liu, Kerrie Farrar, Katarzyna Gล‚owacka, Stanisล‚aw Jeลผowski, Kerrie Barry, Won Byoung Chae, John A. Juvik, Justin Gifford, Adebosola Oladeinde, Toshihiko Yamada, Jane Grimwood, Nicholas H. Putnam, Jose De Vega, Susanne Barth, Manfred Klaas, Trevor Hodkinson, Laigeng Li, Xiaoli Jin, Junhua Peng, Chang Yeon Yu, Kweon Heo, Ji Hye Yoo, Bimal Kumar Ghimire, Iain S. Donnison, Jeremy Schmutz, Matthew E. Hudson, Erik J. Sacks, Stephen P. Moose, Kankshita Swaminathan, and Daniel S. Rokhsar. Genome biology of the paleotetraploid perennial biomass crop miscanthus. Nature Communications, Oct 2020. URL: https://doi.org/10.1038/s41467-020-18923-6, doi:10.1038/s41467-020-18923-6. This article has 118 citations and is from a highest quality peer-reviewed journal.

  2. (wang2024physiologicalandtranscriptomic pages 3-5): Jia Wang, Xinyu Liu, Yiran Chen, Feng lin Zhu, Jiajing Sheng, and Ying Diao. Physiological and transcriptomic analyses reveal the cadmium tolerance mechanism of miscanthus lutarioriparia. PLOS ONE, 19:e0302940, May 2024. URL: https://doi.org/10.1371/journal.pone.0302940, doi:10.1371/journal.pone.0302940. This article has 3 citations and is from a peer-reviewed journal.

  3. (mitros2020genomebiologyof pages 1-2): Therese Mitros, Adam M. Session, Brandon T. James, Guohong Albert Wu, Mohammad B. Belaffif, Lindsay V. Clark, Shengqiang Shu, Hongxu Dong, Adam Barling, Jessica R. Holmes, Jessica E. Mattick, Jessen V. Bredeson, Siyao Liu, Kerrie Farrar, Katarzyna Gล‚owacka, Stanisล‚aw Jeลผowski, Kerrie Barry, Won Byoung Chae, John A. Juvik, Justin Gifford, Adebosola Oladeinde, Toshihiko Yamada, Jane Grimwood, Nicholas H. Putnam, Jose De Vega, Susanne Barth, Manfred Klaas, Trevor Hodkinson, Laigeng Li, Xiaoli Jin, Junhua Peng, Chang Yeon Yu, Kweon Heo, Ji Hye Yoo, Bimal Kumar Ghimire, Iain S. Donnison, Jeremy Schmutz, Matthew E. Hudson, Erik J. Sacks, Stephen P. Moose, Kankshita Swaminathan, and Daniel S. Rokhsar. Genome biology of the paleotetraploid perennial biomass crop miscanthus. Nature Communications, Oct 2020. URL: https://doi.org/10.1038/s41467-020-18923-6, doi:10.1038/s41467-020-18923-6. This article has 118 citations and is from a highest quality peer-reviewed journal.

  4. (kleczkowski2025adenylatedrivenequilibrationof pages 1-3): Leszek A. Kleczkowski and Abir U. Igamberdiev. Adenylate-driven equilibration of both ribo- and deoxyribonucleotides is under magnesium control: quantification of the mg2+-signal. Journal of Plant Physiology, 304:154380, Jan 2025. URL: https://doi.org/10.1016/j.jplph.2024.154380, doi:10.1016/j.jplph.2024.154380. This article has 4 citations and is from a peer-reviewed journal.

  5. (rinne2024characterizationofthree pages 70-73): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.

  6. (rinne2024characterizationofthree pages 20-24): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.

  7. (rinne2024characterizationofthree pages 75-78): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.

  8. (rinne2024characterizationofthree media 679728a2): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.

  9. (rinne2024characterizationofthree pages 1-6): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.

  10. (rinne2024characterizationofthree pages 61-65): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.

  11. (chen2018umpkinaseactivity pages 5-8): Fei Chen, Guojun Dong, Xiaohui Ma, Fang Wang, Yanli Zhang, Erhui Xiong, Jiahuan Wu, Huizhong Wang, Qian Qian, Limin Wu, and Yanchun Yu. Ump kinase activity is involved in proper chloroplast development in rice. Photosynthesis Research, 137:53-67, Feb 2018. URL: https://doi.org/10.1007/s11120-017-0477-5, doi:10.1007/s11120-017-0477-5. This article has 28 citations and is from a peer-reviewed journal.

  12. (rinne2024characterizationofthree media 32a65f5e): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.

  13. (rinne2024characterizationofthree media 618fd8ed): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.

  14. (rinne2024characterizationofthree pages 82-85): Jannis Rinne. Characterization of three uridine monophosphate kinases from arabidopsis thaliana. Text, Jan 2024. URL: https://doi.org/10.15488/16818, doi:10.15488/16818. This article has 0 citations and is from a peer-reviewed journal.

  15. (kim2025unravelingphylogeneticrelationships pages 4-6): Ji Eun Kim, Yang Su Kim, Gyu Young Chung, Hyeok Jae Choi, Chang-Gee Jang, Hoe Jin Kim, and Chae Sun Na. Unraveling phylogenetic relationships among six miscanthus andersson (poaceae) species through chloroplast genome analysis. Genes, 16:1175, Oct 2025. URL: https://doi.org/10.3390/genes16101175, doi:10.3390/genes16101175. This article has 0 citations.

Citations

  1. kleczkowski2025adenylatedrivenequilibrationof pages 1-3
  2. rinne2024characterizationofthree pages 20-24
  3. wang2024physiologicalandtranscriptomic pages 3-5
  4. chen2018umpkinaseactivity pages 5-8
  5. mitros2020genomebiologyof pages 6-7
  6. mitros2020genomebiologyof pages 1-2
  7. rinne2024characterizationofthree pages 70-73
  8. rinne2024characterizationofthree pages 75-78
  9. rinne2024characterizationofthree pages 1-6
  10. rinne2024characterizationofthree pages 61-65
  11. rinne2024characterizationofthree pages 82-85
  12. kim2025unravelingphylogeneticrelationships pages 4-6
  13. https://doi.org/10.15488/16818
  14. https://doi.org/10.1371/journal.pone.0302940
  15. https://doi.org/10.1007/s11120-017-0477-5
  16. https://doi.org/10.1038/s41467-020-18923-6,
  17. https://doi.org/10.1371/journal.pone.0302940,
  18. https://doi.org/10.1016/j.jplph.2024.154380,
  19. https://doi.org/10.15488/16818,
  20. https://doi.org/10.1007/s11120-017-0477-5,
  21. https://doi.org/10.3390/genes16101175,

Manual

(NCGR_LOCUS27674-deep-research-manual.md)
Deep Research: NCGR_LOCUS27674 (A0A811PC48) - UMP-CMP kinase from Miscanthus lutarioriparius Manual

Deep Research: NCGR_LOCUS27674 (A0A811PC48) - UMP-CMP kinase from Miscanthus lutarioriparius

Summary

NCGR_LOCUS27674 encodes a predicted 714-amino acid protein annotated as UMP-CMP kinase (EC 2.7.4.14) in Miscanthus lutarioriparius, a C4 perennial grass. There is no direct experimental literature on this gene. The protein is almost certainly a chimeric gene model artifact, as it contains two unrelated domains that are never found fused together in nature.

Protein Domain Architecture

The protein contains two distinct functional domains:

  1. N-terminal adenylate kinase / UMP-CMP kinase domain (Pfam PF00406, ADK; InterPro IPR006266)
  2. Consistent with UMP-CMP kinase function
  3. HAMAP rule MF_03172 assigns UMP-CMP kinase subfamily identity
  4. Contains P-loop NTPase fold for ATP binding

  5. C-terminal chalcone isomerase-fold domain (Pfam PF16035, Chalcone_2; InterPro IPR036298)

  6. Located at positions 555-650
  7. Part of the CHI superfamily but likely a fatty acid-binding protein (FAP) subfamily member
  8. FAPs are non-catalytic - they bind fatty acids without isomerase activity

Evidence for Chimeric Gene Model

Size anomaly

At 714 AA, this protein is vastly larger than any known UMP-CMP kinase:
- Arabidopsis thaliana UMP-CMP kinase: 202 AA PMID:9736767
- Rice YL2 (chloroplastic UMK): 351 AA PMID:29866037
- Human CMPK1: 196-228 AA
- E. coli CMP kinase: 225 AA

No known precedent for ADK-CHI fusion

No organism has been reported to have a natural fusion of adenylate kinase and chalcone isomerase domains.

Genome annotation context

The M. lutarioriparius genome (PMID:33931638) was assembled at 2.07 Gb with 68,328 predicted gene models - very high even for an allotetraploid. The ab initio + EVidenceModeler pipeline can produce chimeric models, particularly in polyploid genomes with tandemly duplicated genes.

Homologs and Orthologs

UMP-CMP kinase in plants

Plant UMP-CMP kinases (also called UMK or PUMPKIN) catalyze the phosphorylation of pyrimidine nucleoside monophosphates (UMP, CMP, dCMP) using ATP. They are essential for pyrimidine nucleotide biosynthesis.

Key characterized plant UMKs:
- Arabidopsis PUMPKIN (At3g18680): Plastid-localized UMP kinase, essential for chloroplast development PMID:30523175
- Rice YL2: Chloroplast thylakoid membrane-localized UMP kinase. Loss of function causes yellow-green leaves due to impaired chloroplast development PMID:29866037
- Arabidopsis UMK1/UMK2/UMK3: Cytosolic and mitochondrial isoforms PMID:9736767

CHI-fold FAP proteins in plants

The chalcone isomerase superfamily includes non-catalytic fatty acid-binding proteins (FAPs):
- Arabidopsis has CHI-fold FAP proteins that localize to plastids PMID:22388820
- FAPs bind fatty acids and may have roles in de novo fatty acid biosynthesis in chloroplasts
- They lack the catalytic residues needed for chalcone isomerase activity

Functional Implications

For the UMP-CMP kinase portion:

  • Core function: phosphorylation of UMP, CMP, and dCMP to their diphosphate forms
  • Likely chloroplast-targeted based on orthologs (rice YL2, Arabidopsis PUMPKIN)
  • Essential for pyrimidine nucleotide pool maintenance in plastids
  • Role in chloroplast development and function

For the CHI-fold/FAP portion:

  • Fatty acid binding activity (non-catalytic)
  • Likely plastid-localized
  • Role in fatty acid metabolism

Key References

  1. PMID:9736767 - Lange et al. (1998) "Isozymes of UMP/CMP kinase from Arabidopsis thaliana" - First characterization of plant UMP-CMP kinases
  2. PMID:29866037 - Zhu et al. (2018) "YL2 encoding a UMP kinase is essential for chloroplast development in rice" - Rice chloroplast UMK
  3. PMID:30523175 - Chen et al. (2019) "Arabidopsis PUMPKIN is a plastid UMP kinase" - Plastid UMK
  4. PMID:22388820 - Ngaki et al. (2012) "Evolution of the chalcone-isomerase fold from fatty-acid binding to stereospecific catalysis" - CHI superfamily and FAPs
  5. PMID:33931638 - Miao et al. (2021) "Chromosome-scale assembly of Miscanthus lutarioriparius" - Genome assembly

๐Ÿ“š Additional Documentation

Notes

(NCGR_LOCUS27674-notes.md)

NCGR_LOCUS27674 (A0A811PC48) Review Notes

Gene Identity

  • UniProt: A0A811PC48 (TrEMBL, unreviewed)
  • Organism: Miscanthus lutarioriparius (NCBITaxon:422564), a C4 grass in order Poales
  • Annotated as: UMP-CMP kinase (EC 2.7.4.14)
  • Length: 714 amino acids / 78.8 kDa
  • Evidence level: PE3 (Inferred from homology)
  • Source: Whole genome shotgun sequence [EMBL:CAD6242091.1]

Key Concern: Likely Chimeric Gene Model

This protein is almost certainly a chimeric gene model artifact resulting from incorrect fusion of two separate genes during genome annotation:

Evidence for chimeric model:

  1. Abnormal size: 714 AA is 2-3.5x larger than any known UMP-CMP kinase:
  2. Arabidopsis UMP-CMP kinase: 202 AA PMID:9736767
  3. Rice YL2 (chloroplastic UMK): 351 AA including transit peptide PMID:29866037
  4. Human CMPK1: 196-228 AA
  5. E. coli CMP kinase: 225 AA

  6. Two unrelated domains:

  7. N-terminal: Adenylate kinase domain (Pfam PF00406, ADK) - consistent with UMP-CMP kinase
  8. C-terminal (positions 555-650): Chalcone isomerase domain (Pfam PF16035, Chalcone_2) - functionally unrelated

  9. Conflicting automated annotations from different domains:

  10. UMP/CMP/dCMP kinase activities (from ADK domain via HAMAP/UniRule)
  11. Fatty acid binding (from CHI-fold domain via PANTHER/TreeGrafter)
  12. Intramolecular lyase activity (from CHI-fold superfamily via InterPro)

  13. Genome annotation context: The M. lutarioriparius genome has 68,328 predicted gene models PMID:33931638, which is very high even for an allotetraploid grass. The EVidenceModeler pipeline used is known to sometimes produce chimeric models in polyploid genomes.

The two likely constituent genes:

Gene 1 (N-terminal ~350 AA): A plastid/chloroplast-targeted UMP-CMP kinase
- Consistent with rice YL2 ortholog PMID:29866037 or Arabidopsis PUMPKIN PMID:30523175
- Plant plastid UMKs have transit peptides and are 300-350 AA total
- Functions in de novo pyrimidine nucleotide biosynthesis in chloroplasts

Gene 2 (C-terminal ~150+ AA): A CHI-fold fatty acid-binding protein (FAP)
- The Chalcone_2 domain (PF16035) belongs to the CHI superfamily
- FAP subfamily members are non-catalytic - they bind fatty acids but lack isomerase activity PMID:22388820
- Plant FAPs typically localize to plastids for de novo fatty acid biosynthesis PMID:22388820

Annotation Assessment

All 13 GO annotations are IEA (electronic). Given the chimeric nature:

Annotations likely correct for the UMP-CMP kinase portion:

  • GO:0033862 UMP kinase activity (UniRule)
  • GO:0036430 dCMP kinase activity (UniRule)
  • GO:0036431 CMP kinase activity (UniRule)
  • GO:0005524 ATP binding (InterPro)
  • GO:0019205 nucleobase-containing compound kinase activity (InterPro)
  • GO:0016776 phosphotransferase activity, phosphate group as acceptor (InterPro)
  • GO:0006221 pyrimidine nucleotide biosynthetic process (UniProt)
  • GO:0006207 'de novo' pyrimidine nucleobase biosynthetic process (InterPro)

Annotations likely from the CHI-fold/FAP portion (erroneous for whole protein):

  • GO:0005504 fatty acid binding (TreeGrafter/PANTHER) - from FAP domain
  • GO:0016872 intramolecular lyase activity (InterPro) - from CHI superfamily, but FAPs are non-catalytic

Localization annotations:

  • GO:0005737 cytoplasm - plausible for UMK
  • GO:0005634 nucleus - plausible for UMK (some UMKs are nuclear)
  • GO:0009570 chloroplast stroma - plausible if this is a plastid-targeted UMK, though rice YL2 localizes to thylakoid membranes, not stroma

References

  • PMID:9736767 - Arabidopsis UMP-CMP kinase characterization
  • PMID:29866037 - Rice YL2 chloroplastic UMP kinase
  • PMID:30523175 - Arabidopsis PUMPKIN plastid UMP kinase
  • PMID:22388820 - CHI-fold FAP proteins and fatty acid binding
  • PMID:33931638 - Miscanthus lutarioriparius genome assembly

๐Ÿ“„ View Raw YAML

id: A0A811PC48
gene_symbol: NCGR_LOCUS27674
product_type: PROTEIN
taxon:
  id: NCBITaxon:422564
  label: Miscanthus lutarioriparius
description: >-
  NCGR_LOCUS27674 encodes a predicted 714-amino acid protein annotated as UMP-CMP kinase
  (EC 2.7.4.14) in Miscanthus lutarioriparius, a C4 perennial grass. However, this gene model
  is almost certainly a chimeric artifact resulting from the incorrect fusion of at least two
  separate genes during genome annotation: an N-terminal UMP-CMP kinase (adenylate kinase family,
  Pfam PF00406) and a C-terminal chalcone isomerase-fold fatty acid-binding protein (Pfam PF16035,
  positions 555-650). At 714 AA, the protein is 2-3.5x larger than any characterized UMP-CMP
  kinase (e.g. Arabidopsis 202 AA [PMID:9576794], rice YL2 351 AA [PMID:29392476]). The
  conflicting automated annotations - kinase activities from the ADK domain and fatty acid
  binding / intramolecular lyase from the CHI-fold domain - are explained by this fusion. The
  M. lutarioriparius genome has 68,328 predicted gene models [PMID:33911077], unusually high
  even for an allotetraploid, consistent with gene model artifacts from the EVidenceModeler
  pipeline. All 13 GO annotations are IEA with no experimental evidence. The annotations
  pertaining to the UMP-CMP kinase portion (N-terminal domain) are likely correct for that
  domain, while annotations from the CHI-fold domain (fatty acid binding, intramolecular lyase
  activity) are artifacts of the chimeric model.

existing_annotations:
- term:
    id: GO:0005504
    label: fatty acid binding
  evidence_type: IEA
  original_reference_id: GO_REF:0000118
  review:
    summary: >-
      This annotation derives from TreeGrafter matching the C-terminal chalcone isomerase-fold
      domain (Pfam PF16035, Chalcone_2, positions 555-650) to PANTHER family PTHR47284
      (FATTY-ACID-BINDING PROTEIN 2). The CHI-fold superfamily includes non-catalytic fatty
      acid-binding proteins (FAPs) that localize to plastids and participate in de novo fatty
      acid biosynthesis [PMID:22622584]. However, this annotation reflects a domain from what
      is almost certainly a chimeric gene model, not the true function of the UMP-CMP kinase
      that constitutes the N-terminal portion of this protein. The 714 AA size is 2-3.5x larger
      than any known UMP-CMP kinase, and no natural ADK-CHI domain fusion has been reported
      in any organism.
    action: REMOVE
    reason: >-
      This annotation is an artifact of a chimeric gene model. The fatty acid binding function
      derives from the C-terminal CHI-fold/FAP domain (positions 555-650), which is a separate
      gene incorrectly fused with the N-terminal UMP-CMP kinase during genome annotation of
      M. lutarioriparius [PMID:33911077]. FAP proteins in the CHI superfamily do bind fatty
      acids [PMID:22622584], but this activity belongs to a distinct gene product, not a
      UMP-CMP kinase.
    supported_by:
    - reference_id: PMID:22622584
      supporting_text: >-
        These three CHI-fold proteins localize to plastids, the site of de novo
        fatty-acid biosynthesis in plant cells. Furthermore, their expression profiles
        correlate with those of core fatty-acid biosynthetic enzymes
    - reference_id: PMID:33911077
      supporting_text: >-
        A total of 68,328 gene models were predicted, with an average CDS length of 1215 bp
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-notes.md
      supporting_text: >-
        This protein is almost certainly a chimeric gene model artifact resulting from incorrect
        fusion of two separate genes during genome annotation.

- term:
    id: GO:0005524
    label: ATP binding
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >-
      ATP binding is assigned via InterPro match to IPR000850 (Adenylate/UMP-CMP kinase).
      UMP-CMP kinases catalyze the phosphorylation of pyrimidine nucleoside monophosphates
      at the expense of ATP, so ATP binding is an inherent part of their catalytic mechanism.
      The UniProt record confirms ATP binding sites at positions 36-41, 149, and 192 via
      HAMAP-Rule MF_03172. This annotation is consistent with the N-terminal UMP-CMP kinase
      domain of this protein.
    action: ACCEPT
    reason: >-
      ATP binding is a core requirement for UMP-CMP kinase activity. The protein contains
      a P-loop NTPase domain with conserved ATP binding residues identified by HAMAP and
      InterPro. All characterized UMP-CMP kinases require ATP as the phosphate donor
      [PMID:9576794].
    supported_by:
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
      supporting_text: >-
        Catalyzes the phosphorylation of pyrimidine nucleoside monophosphates at the expense
        of ATP.
    - reference_id: PMID:9576794
      supporting_text: >-
        the UMP/CMP kinase preferentially uses ATP
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-deep-research-manual.md
      supporting_text: >-
        Core function: phosphorylation of UMP, CMP, and dCMP to their diphosphate forms using
        ATP as the phosphate donor.

- term:
    id: GO:0016776
    label: phosphotransferase activity, phosphate group as acceptor
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >-
      Assigned via InterPro match to IPR006266 (UMP-CMP kinase). GO:0016776 is an ancestor
      of GO:0033862 (UMP kinase activity) in the GO hierarchy, so this is a broader but
      valid annotation for a UMP-CMP kinase. UMP-CMP kinases transfer a phosphate group
      from ATP to nucleoside monophosphate substrates (UMP, CMP, dCMP), which already
      carry a phosphate group, making them phosphotransferases with phosphate group as acceptor.
    action: ACCEPT
    reason: >-
      This is a valid parent term of the more specific UMP/CMP/dCMP kinase activities also
      annotated to this protein. While broader than the specific kinase activities
      (GO:0033862, GO:0036430, GO:0036431), it correctly captures the biochemical mechanism.
      The InterPro match to IPR006266 (UMP-CMP kinase family) is appropriate for the
      N-terminal domain.
    supported_by:
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
      supporting_text: >-
        Reaction=CMP + ATP = CDP + ADP

- term:
    id: GO:0016872
    label: intramolecular lyase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >-
      This annotation derives from InterPro match to IPR036298 (Chalcone isomerase superfamily),
      which is mapped to GO:0016872 via InterPro2GO. Chalcone isomerases catalyze intramolecular
      cyclization of chalcones to flavanones. However, this annotation is inappropriate for
      two reasons: (1) the CHI-fold domain in this protein likely belongs to the non-catalytic
      FAP (fatty acid-binding) subfamily, which lacks the catalytic residues for isomerase
      activity [PMID:22622584]; and (2) this domain is part of a chimeric gene model and
      belongs to a separate gene product. The InterPro2GO mapping from IPR036298 to GO:0016872
      is overly broad, applying catalytic function to the entire superfamily including
      non-catalytic members.
    action: REMOVE
    reason: >-
      This is a doubly erroneous annotation: the intramolecular lyase activity derives from
      the CHI superfamily InterPro match, but the CHI-fold domain in this protein is likely
      a FAP subfamily member that is non-catalytic. Furthermore, the CHI-fold domain
      (positions 555-650) belongs to a separate gene that was incorrectly fused with the
      UMP-CMP kinase in this chimeric gene model. A UMP-CMP kinase has no intramolecular
      lyase activity.
    supported_by:
    - reference_id: PMID:22622584
      supporting_text: >-
        the FAP discovery defines the adaptive evolution of a stereospecific
        and catalytically 'perfected' enzyme from a non-enzymatic ancestor over a
        defined period of plant evolution.
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-notes.md
      supporting_text: >-
        C-terminal (positions 555-650) Chalcone isomerase domain (Pfam PF16035, Chalcone_2)
        - functionally unrelated to the N-terminal UMP-CMP kinase domain.

- term:
    id: GO:0019205
    label: nucleobase-containing compound kinase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >-
      Assigned via InterPro match to IPR000850 (Adenylate/UMP-CMP kinase). GO:0019205
      is an ancestor of GO:0033862 (UMP kinase activity) in the GO hierarchy. UMP-CMP
      kinases phosphorylate nucleoside monophosphates (nucleobase-containing compounds),
      so this broader annotation is valid.
    action: ACCEPT
    reason: >-
      This is a valid broader term for UMP-CMP kinase activity. The protein's N-terminal
      domain belongs to the adenylate kinase family UMP-CMP kinase subfamily, which
      phosphorylates nucleobase-containing compounds (UMP, CMP, dCMP). While less specific
      than GO:0033862/GO:0036430/GO:0036431, it is not incorrect.
    supported_by:
    - reference_id: PMID:9576794
      supporting_text: >-
        both UMP (Km = 153 microM) and CMP (Km = 266 microM) were equally acceptable as
        the phosphate acceptor.

- term:
    id: GO:0033862
    label: UMP kinase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000104
  review:
    summary: >-
      Assigned via UniRule UR000111469 based on shared sequence features with characterized
      UMP-CMP kinases. UMP kinase activity (catalysis of UMP + ATP = UDP + ADP) is a core
      function of the UMP-CMP kinase family. The UniProt record explicitly lists this
      catalytic activity with EC 2.7.4.14. This annotation applies to the N-terminal
      adenylate kinase domain.
    action: ACCEPT
    reason: >-
      UMP kinase activity is the primary molecular function of UMP-CMP kinases. The protein
      contains the conserved ADK domain (PF00406) and matches HAMAP rule MF_03172 for the
      UMP-CMP kinase subfamily. Confirmed in the rice ortholog YL2 [PMID:29392476] and
      Arabidopsis [PMID:9576794].
    supported_by:
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
      supporting_text: >-
        Reaction=UMP + ATP = UDP + ADP
    - reference_id: PMID:29392476
      supporting_text: >-
        Prokaryotic UMP kinase activity was subsequently confirmed, with YL2 deficiency
        causing a significant reduction in chlorophyll accumulation and photochemical
        efficiency.
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-deep-research-manual.md
      supporting_text: >-
        Plant UMP-CMP kinases catalyze the phosphorylation of pyrimidine nucleoside
        monophosphates (UMP, CMP, dCMP) using ATP.

- term:
    id: GO:0036430
    label: dCMP kinase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000104
  review:
    summary: >-
      Assigned via UniRule UR000111469. dCMP kinase activity (catalysis of dCMP + ATP =
      dCDP + ADP) is a known activity of UMP-CMP kinases. The UniProt record explicitly
      lists this catalytic reaction.
    action: ACCEPT
    reason: >-
      dCMP kinase activity is a well-characterized function of the UMP-CMP kinase family.
      The catalytic reaction is explicitly listed in the UniProt record. This applies to
      the N-terminal kinase domain.
    supported_by:
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
      supporting_text: >-
        Reaction=dCMP + ATP = dCDP + ADP

- term:
    id: GO:0036431
    label: CMP kinase activity
  evidence_type: IEA
  original_reference_id: GO_REF:0000104
  review:
    summary: >-
      Assigned via UniRule UR000111469. CMP kinase activity (catalysis of CMP + ATP =
      CDP + ADP) is a core function of UMP-CMP kinases. The UniProt record explicitly
      lists this catalytic reaction.
    action: ACCEPT
    reason: >-
      CMP kinase activity is a core function of the UMP-CMP kinase family. Arabidopsis
      UMP/CMP kinase accepts both UMP and CMP as phosphate acceptors [PMID:9576794].
    supported_by:
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
      supporting_text: >-
        Reaction=CMP + ATP = CDP + ADP
    - reference_id: PMID:9576794
      supporting_text: >-
        both UMP (Km = 153 microM) and CMP (Km = 266 microM) were equally acceptable as
        the phosphate acceptor.

- term:
    id: GO:0006207
    label: "'de novo' pyrimidine nucleobase biosynthetic process"
  evidence_type: IEA
  original_reference_id: GO_REF:0000002
  review:
    summary: >-
      Assigned via InterPro match to IPR006266 (UMP-CMP kinase). UMP-CMP kinases play an
      important role in de novo pyrimidine nucleotide biosynthesis by phosphorylating UMP
      to UDP. However, GO:0006207 specifically refers to pyrimidine nucleobase biosynthesis
      (the base itself), whereas UMP-CMP kinases act on nucleotides (nucleobase + sugar +
      phosphate). The more appropriate term is GO:0006221 (pyrimidine nucleotide biosynthetic
      process), which is already annotated.
    action: MODIFY
    reason: >-
      UMP-CMP kinases act on pyrimidine nucleotides, not nucleobases. GO:0006207 is
      technically not the correct process for a nucleoside monophosphate kinase. GO:0006221
      (pyrimidine nucleotide biosynthetic process) is more appropriate and already annotated.
    proposed_replacement_terms:
    - id: GO:0006221
      label: pyrimidine nucleotide biosynthetic process
    supported_by:
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
      supporting_text: >-
        Plays an important role in de novo pyrimidine nucleotide biosynthesis. Has preference
        for UMP and CMP as phosphate acceptors.

- term:
    id: GO:0006221
    label: pyrimidine nucleotide biosynthetic process
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >-
      Assigned via combined automated annotation (InterPro IPR006266 + UniRule UR000111469).
      UMP-CMP kinases participate in pyrimidine nucleotide biosynthesis by catalyzing the
      phosphorylation of pyrimidine nucleoside monophosphates (UMP, CMP) to their diphosphate
      forms (UDP, CDP). The UniProt record explicitly states the protein plays an important
      role in de novo pyrimidine nucleotide biosynthesis.
    action: ACCEPT
    reason: >-
      Pyrimidine nucleotide biosynthesis is a core biological process for UMP-CMP kinases.
      The phosphorylation of UMP to UDP and CMP to CDP are essential steps in maintaining
      the pyrimidine nucleotide pool. Confirmed in rice YL2 [PMID:29392476] and Arabidopsis
      PUMPKIN [PMID:30409856].
    supported_by:
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
      supporting_text: >-
        Plays an important role in de novo pyrimidine nucleotide biosynthesis.
    - reference_id: PMID:29392476
      supporting_text: >-
        our results suggest that UMP kinase activity plays an essential role in
        chloroplast development and regulating cpATPase biogenesis in rice.

- term:
    id: GO:0005634
    label: nucleus
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >-
      Assigned via combined automated annotation (UniProtKB-SubCell SL-0191 + UniRule
      UR000111469). The UniProt record states nuclear localization based on HAMAP rule
      MF_03172. However, for plant UMP-CMP kinases, characterized orthologs are predominantly
      cytosolic (Arabidopsis UMK) or plastid-localized (rice YL2, Arabidopsis PUMPKIN)
      [PMID:29392476, PMID:30409856]. Nuclear localization is not well-established for
      plant UMKs.
    action: UNDECIDED
    reason: >-
      Nuclear localization is inferred from homology to animal UMP-CMP kinases via the
      HAMAP rule, but characterized plant orthologs localize to the cytoplasm or plastids
      rather than the nucleus. Without direct experimental evidence in plants, the nuclear
      localization annotation should be treated with caution.
    supported_by:
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
      supporting_text: >-
        SUBCELLULAR LOCATION: Cytoplasm. Nucleus.
    - reference_id: PMID:30409856
      supporting_text: >-
        the sole plastid UMP kinase (PUMPKIN) in Arabidopsis (Arabidopsis thaliana)

- term:
    id: GO:0005737
    label: cytoplasm
  evidence_type: IEA
  original_reference_id: GO_REF:0000120
  review:
    summary: >-
      Assigned via combined automated annotation (UniProtKB-SubCell SL-0086 + UniRule
      UR000111469). Cytoplasmic localization is consistent with the known localization of
      several plant UMP-CMP kinase isoforms. The Arabidopsis UMP/CMP kinase characterized
      by Zhou et al. was expressed and purified as a cytosolic protein [PMID:9576794].
    action: ACCEPT
    reason: >-
      Cytoplasmic localization is well-supported for plant UMP-CMP kinases. Arabidopsis
      has cytosolic UMK isoforms alongside the plastid-localized PUMPKIN. Even if this
      particular gene model is chimeric, the UMP-CMP kinase portion would likely localize
      to the cytoplasm or plastids.
    supported_by:
    - reference_id: file:9POAL/NCGR_LOCUS27674/NCGR_LOCUS27674-uniprot.txt
      supporting_text: >-
        SUBCELLULAR LOCATION: Cytoplasm.
    - reference_id: PMID:9576794
      supporting_text: >-
        A cDNA encoding the Arabidopsis thaliana uridine 5'-monophosphate (UMP)/cytidine
        5'-monophosphate (CMP) kinase was isolated by complementation of a Saccharomyces
        cerevisiae ura6 mutant.

- term:
    id: GO:0009570
    label: chloroplast stroma
  evidence_type: IEA
  original_reference_id: GO_REF:0000118
  review:
    summary: >-
      Assigned via TreeGrafter matching to PANTHER PTN009208978. Chloroplast localization
      is plausible for plant UMP-CMP kinases - rice YL2 and Arabidopsis PUMPKIN are
      chloroplast-localized UMKs [PMID:29392476, PMID:30409856]. However, the specific
      sub-compartment is questionable: rice YL2 localizes to thylakoid membranes, not the
      stroma [PMID:29392476]. Additionally, the TreeGrafter match may be influenced by the
      chimeric gene model, as CHI-fold FAP proteins also localize to plastids [PMID:22622584].
    action: UNDECIDED
    reason: >-
      While chloroplast localization is plausible for a plant UMP-CMP kinase, the specific
      annotation to chloroplast stroma (GO:0009570) is uncertain. The closest characterized
      ortholog, rice YL2, is a thylakoid membrane-localized protein [PMID:29392476].
      Furthermore, the TreeGrafter PANTHER match could reflect either the UMK domain or
      the CHI-fold FAP domain, both of which can be plastid-targeted.
    supported_by:
    - reference_id: PMID:29392476
      supporting_text: >-
        YL2 encodes a thylakoid membrane-localized protein with significant sequence
        similarity to UMP kinase proteins in prokaryotes and eukaryotes.
    - reference_id: PMID:22622584
      supporting_text: >-
        These three CHI-fold proteins localize to plastids, the site of de novo
        fatty-acid biosynthesis in plant cells.

references:
- id: GO_REF:0000002
  title: >-
    Gene Ontology annotation through association of InterPro records with GO terms
  findings: []
- id: GO_REF:0000104
  title: >-
    Electronic Gene Ontology annotations created by transferring manual GO annotations
    between related proteins based on shared sequence features
  findings: []
- id: GO_REF:0000118
  title: >-
    TreeGrafter-generated GO annotations
  findings: []
- id: GO_REF:0000120
  title: >-
    Combined Automated Annotation using Multiple IEA Methods
  findings: []
- id: PMID:9576794
  title: >-
    Cloning, expression in Escherichia coli, and characterization of Arabidopsis thaliana
    UMP/CMP kinase
  findings:
  - statement: >-
      Arabidopsis UMP/CMP kinase preferentially uses ATP as phosphate donor and accepts both
      UMP and CMP as substrates
    supporting_text: >-
      the UMP/CMP kinase preferentially uses ATP
- id: PMID:22622584
  title: >-
    Evolution of the chalcone-isomerase fold from fatty-acid binding to stereospecific catalysis
  findings:
  - statement: >-
      CHI superfamily includes non-catalytic fatty acid-binding proteins (FAPs) that localize
      to plastids
    supporting_text: >-
      These three CHI-fold proteins localize to plastids, the site of de novo fatty-acid
      biosynthesis in plant cells. Furthermore, their expression profiles correlate with those
      of core fatty-acid biosynthetic enzymes
  - statement: FAPs are non-enzymatic ancestors of catalytically active chalcone isomerases
    supporting_text: >-
      the FAP discovery defines the adaptive evolution of a stereospecific and catalytically
      'perfected' enzyme from a non-enzymatic ancestor over a defined period of plant evolution.
- id: PMID:29392476
  title: >-
    UMP kinase activity is involved in proper chloroplast development in rice
  findings:
  - statement: >-
      Rice YL2 is a thylakoid membrane-localized UMP kinase essential for chloroplast
      development
    supporting_text: >-
      YL2 encodes a thylakoid membrane-localized protein with significant sequence similarity
      to UMP kinase proteins in prokaryotes and eukaryotes. Prokaryotic UMP kinase activity
      was subsequently confirmed, with YL2 deficiency causing a significant reduction in
      chlorophyll accumulation and photochemical efficiency.
- id: PMID:30409856
  title: >-
    PUMPKIN, the Sole Plastid UMP Kinase, Associates with Group II Introns and Alters
    Their Metabolism
  findings:
  - statement: PUMPKIN is the sole plastid UMP kinase in Arabidopsis
    supporting_text: >-
      the sole plastid UMP kinase (PUMPKIN) in Arabidopsis (Arabidopsis thaliana) associates
      specifically with the introns of the plastid transcripts trnG-UCC, trnV-UAC, petB,
      petD, and ndhA in vivo
- id: PMID:33911077
  title: >-
    Chromosome-scale assembly and analysis of biomass crop Miscanthus lutarioriparius genome
  findings:
  - statement: >-
      M. lutarioriparius genome has 68,328 predicted gene models, unusually high even for
      an allotetraploid
    supporting_text: >-
      A total of 68,328 gene models were predicted, with an average CDS length of 1215 bp

core_functions:
- molecular_function:
    id: GO:0033862
    label: UMP kinase activity
  directly_involved_in:
  - id: GO:0006221
    label: pyrimidine nucleotide biosynthetic process
  locations:
  - id: GO:0005737
    label: cytoplasm
  description: >-
    The N-terminal domain of this chimeric gene model encodes a UMP-CMP kinase
    (EC 2.7.4.14) that catalyzes the phosphorylation of pyrimidine nucleoside
    monophosphates (UMP, CMP, dCMP) to their diphosphate forms using ATP. This is
    a core function of the adenylate kinase family UMP-CMP kinase subfamily, supported
    by HAMAP rule MF_03172 and InterPro family IPR006266. The protein also has CMP
    kinase (GO:0036431) and dCMP kinase (GO:0036430) activities. Note that this gene
    model is almost certainly chimeric - the 714 AA protein is 2-3.5x larger than any
    characterized UMP-CMP kinase, and contains an unrelated C-terminal CHI-fold fatty
    acid-binding domain (Pfam PF16035, positions 555-650) from a separate gene that
    was incorrectly fused during genome annotation [PMID:33911077].
  supported_by:
  - reference_id: PMID:9576794
    supporting_text: >-
      A cDNA encoding the Arabidopsis thaliana uridine 5'-monophosphate (UMP)/cytidine
      5'-monophosphate (CMP) kinase was isolated by complementation of a Saccharomyces
      cerevisiae ura6 mutant.
  - reference_id: PMID:29392476
    supporting_text: >-
      our results suggest that UMP kinase activity plays an essential role in
      chloroplast development and regulating cpATPase biogenesis in rice.