MetaboLights MTBLS1 → GO bridge coverage

MetaboLights MTBLS1 → GO bridge coverage

Input: MetaboLights MTBLS1

Generated by coverage_probe.py — all numbers
computed live from OLS4 (ChEBI), the Rhea REST API, and the GO rhea2go mapping.

Why this probe exists

Reported metabolite ChEBI ids rarely match Rhea participants directly,
for two reasons we test as successive normalization tiers:

  1. Protonation — Rhea writes participants in their major protonation
    state at pH 7.3 (citrate(3-), ATP(4-)); repositories report neutral
    forms. We expand over ChEBI is_protonated_form_of /
    is_deprotonated_form_of (the protonation family).
  2. Structure / skeleton — a study reports a generic, non-stereospecific
    compound (isoleucine) while Rhea uses the stereospecific zwitterion
    (L-isoleucine zwitterion). We expand over the broader structural
    relations (+ tautomer/enantiomer + generic→specific children), bounded
    to the seed's InChIKey skeleton. This tier is stereo/charge-blind, so
    it is reported separately as the more permissive fallback.

Headline

Per-metabolite

Tier reached: exact < proton (protonation) < struct (skeleton) < (miss).

Metabolite Seed ChEBI q Tier GO MF (exact→proton→struct) Rhea-matched form
p-aminohippuric acid CHEBI:104011 0 0→0→0
2-hydroxybutyric acid CHEBI:1148 0 proton 0→11→11 2-hydroxybutyrate (CHEBI:64552, q=-1)
acetoacetic acid CHEBI:15344 0 proton 0→21→21 acetoacetate (CHEBI:13705, q=-1)
acetone CHEBI:15347 0 exact 7→7→7 acetone (CHEBI:15347, q=0)
acetic acid CHEBI:15366 0 proton 0→83→83 acetate (CHEBI:30089, q=-1)
sarcosine CHEBI:15611 0 proton 0→10→10 sarcosine zwitterion (CHEBI:57433, q=0)
allantoin CHEBI:15676 0 exact 0→0→0 allantoin (CHEBI:15676, q=0)
succinic acid CHEBI:15741 0 proton 0→92→92 succinate(2-) (CHEBI:30031, q=-2)
nicotinic acid CHEBI:15940 0 proton 0→6→6 nicotinate (CHEBI:32544, q=-1)
ethanol CHEBI:16236 0 exact 8→8→8 ethanol (CHEBI:16236, q=0)
adenosine CHEBI:16335 0 exact 9→9→9 adenosine (CHEBI:16335, q=0)
alanine CHEBI:16449 0 proton 0→11→11 propionate (CHEBI:17272, q=-1)
3-methyl-2-oxobutanoic acid CHEBI:16530 0 proton 0→30→30 3-methyl-2-oxobutanoate (CHEBI:11851, q=-1)
methylguanidine CHEBI:16628 0 proton 0→1→1 methylguanidinium (CHEBI:57840, q=1)
uridine CHEBI:16704 0 exact 8→8→8 uridine (CHEBI:16704, q=0)
creatinine CHEBI:16737 0 exact 2→2→2 creatinine (CHEBI:16737, q=0)
2-oxoglutarate(2-) CHEBI:16810 -2 exact 148→152→152 2-oxoglutarate(2-) (CHEBI:16810, q=-2)
creatine CHEBI:16919 0 proton 0→5→5 creatine zwitterion (CHEBI:57947, q=0)
dimethylamine CHEBI:17170 0 proton 0→7→7 dimethylaminium (CHEBI:58040, q=1)
glycolic acid CHEBI:17497 0 proton 0→92→92 glycolate (CHEBI:29805, q=-1)
N-acetyl-L-glutamic acid CHEBI:17533 0 proton 0→133→133 L-glutamate(1-) (CHEBI:29985, q=-1)
N,N-dimethylglycine CHEBI:17724 0 struct 0→0→5 N,N-dimethylglycine zwitterion (CHEBI:58251, q=0)
fumaric acid CHEBI:18012 0 proton 0→18→18 fumarate(2-) (CHEBI:29806, q=-2)
N-benzoylglycine CHEBI:18089 0 proton 0→2→2 N-benzoylglycinate (CHEBI:606565, q=-1)
N-methylnicotinate CHEBI:18123 0 exact 1→6→6 N-methylnicotinate (CHEBI:18123, q=0)
trimethylamine CHEBI:18139 0 proton 0→6→6 trimethylammonium (CHEBI:58389, q=1)
tyrosine CHEBI:18186 0 proton 0→11→11 propionate (CHEBI:17272, q=-1)
citrulline CHEBI:18211 0 struct 0→0→7 L-citrulline zwitterion (CHEBI:57743, q=0)
glutamic acid CHEBI:18237 0 struct 0→0→132 L-glutamate(1-) (CHEBI:29985, q=-1)
ornithine CHEBI:18257 0 proton 0→5→5 valerate (CHEBI:31011, q=-1)
N-carbamoyl-beta-alanine CHEBI:18261 0 proton 0→13→13 N-carbamoyl-beta-alaninate (CHEBI:11892, q=-1)
3-hydroxybutyric acid CHEBI:20067 0 proton 0→11→11 3-hydroxybutyrate (CHEBI:37054, q=-1)
N-acetyl-L-aspartic acid CHEBI:21547 0 proton 0→40→40 N-acetyl-L-aspartate(2-) (CHEBI:16953, q=-2)
isoleucine CHEBI:24898 0 struct 0→0→11 L-isoleucine zwitterion (CHEBI:58045, q=0)
leucine CHEBI:25017 0 struct 0→0→13 L-leucine zwitterion (CHEBI:57427, q=0)
lysine CHEBI:25094 0 proton 0→9→9 hexanoate (CHEBI:17120, q=-1)
valine CHEBI:27266 0 struct 0→0→14 L-valine zwitterion (CHEBI:57762, q=0)
3-aminoisobutyric acid CHEBI:27389 0 proton 0→13→13 isobutyrate (CHEBI:48944, q=-1)
N-methyl-6-pyridone-3-carboxamide CHEBI:27410 0 0→0→0
histidine CHEBI:27570 0 struct 0→0→16 L-histidine zwitterion (CHEBI:57595, q=0)
3,4-dihydroxymandelic acid CHEBI:27637 0 proton 0→84→84 mandelate (CHEBI:25147, q=-1)
N-methyl-4-pyridone-3-carboxamide CHEBI:27838 0 0→0→0
phenylalanine CHEBI:28044 0 struct 0→0→26 L-phenylalanine zwitterion (CHEBI:58095, q=0)
glutamine CHEBI:28300 0 struct 0→0→39 L-glutamine zwitterion (CHEBI:58359, q=0)
oxaloacetic acid CHEBI:30744 0 proton 0→129→129 oxaloacetate(2-) (CHEBI:16452, q=-2)
formic acid CHEBI:30751 0 proton 0→43→43 formate (CHEBI:15740, q=-1)
citric acid CHEBI:30769 0 proton 0→8→8 citrate(3-) (CHEBI:16947, q=-3)
butyric acid CHEBI:30772 0 proton 0→11→11 butyrate (CHEBI:17968, q=-1)
methylmalonic acid CHEBI:30860 0 proton 0→8→8 methylmalonate(2-) (CHEBI:17453, q=-2)
2-oxoglutaric acid CHEBI:30915 0 proton 0→152→152 2-oxoglutarate(2-) (CHEBI:16810, q=-2)
pyruvic acid CHEBI:32816 0 proton 0→135→135 pyruvate (CHEBI:15361, q=-1)
phloretic acid CHEBI:32980 0 proton 0→12→12 phloretate (CHEBI:16331, q=-1)
3-methyl-2-oxovaleric acid CHEBI:35932 0 proton 0→6→6 3-methyl-2-oxovalerate (CHEBI:28654, q=-1)
indoxyl sulfate CHEBI:43355 0 proton 0→1→1 indoxyl sulfate(1-) (CHEBI:144643, q=-1)
2-hydroxyisobutyric acid CHEBI:50129 0 proton 0→14→14 2-hydroxyisobutyrate (CHEBI:19641, q=-1)
2-hydroxy-3-methylbutyric acid CHEBI:60645 0 proton 0→4→4 isovalerate (CHEBI:48942, q=-1)
2-hydroxypentanoic acid CHEBI:60647 0 proton 0→5→5 valerate (CHEBI:31011, q=-1)
N-hexanoylglycine CHEBI:64390 0 0→0→0
N-methylnicotinamide CHEBI:64399 0 0→0→0
3-(3-sulfooxyphenyl)propanoic acid CHEBI:64414 0 proton 0→11→11 propionate (CHEBI:17272, q=-1)
malic acid CHEBI:6650 0 proton 0→93→93 malate(2-) (CHEBI:15595, q=-2)
isocaproic acid CHEBI:74903 0 proton 0→3→3 isocaproate (CHEBI:74904, q=-1)
methyl N-acetylvalinate CHEBI:74911 0 0→0→0
2-hydroxypropanoic acid CHEBI:78320 0 proton 0→34→34 lactate (CHEBI:24996, q=-1)

Recovered by structure (skeleton) normalization

Generic / stereochemistry mismatches the protonation tier could not fix,
recovered by InChIKey-skeleton expansion:

Example GO molecular functions reached

Residual misses (after both normalization tiers)

Resolved to ChEBI but matched no Rhea reaction even after protonation and
skeleton normalization — typically derivatives Rhea represents only in a
conjugated/acylated form, or compounds genuinely absent from Rhea.

Method / reproducibility