Annotation-gain report: applying the ARO→GO mappings to UniProt
Candidate GO annotations that UniProtKB entries with a CARD cross-reference would gain from the
curated aro2go.sssom.yaml mappings (exact-or-narrower propagation). These
are leads for a curator, not automatic assertions. "New" = the proposed GO term is not
already present in the entry's UniProt GO annotations (any evidence code) and is not subsumed
by a more specific term the entry already has (subsumption-aware filtering).
Generated by annotation_gain_report.py from a UniProt snapshot (see data/uniprot_card_xrefs.tsv).
Summary
- UniProtKB entries with a CARD cross-reference: 4,182
- Entries matched by ≥1 mapping (exact or narrower): 3,609
- Entries gaining ≥1 new candidate GO annotation: 630
- Total candidate new annotations: 630
- Suppressed as redundant (entry already has a more specific term): 104
New candidate annotations by GO term
| GO term | label | entries gaining (new) | already annotated | suppressed (subsumed) |
|---|---|---|---|---|
| GO:0008800 | beta-lactamase activity | 448 | 2,795 | 0 |
| GO:0043838 | phosphatidylethanolamine:Kdo2-lipid A phosphoethanolamine transferase activity | 79 | 0 | 0 |
| GO:0008988 | rRNA (adenine-N6-)-methyltransferase activity | 29 | 0 | 12 |
| GO:0034069 | aminoglycoside N-acetyltransferase activity | 25 | 1 | 51 |
| GO:0034068 | aminoglycoside nucleotidyltransferase activity | 13 | 0 | 22 |
| GO:0070043 | rRNA (guanine-N7-)-methyltransferase activity | 12 | 0 | 0 |
| GO:0034071 | aminoglycoside phosphotransferase activity | 12 | 0 | 19 |
| GO:0050073 | macrolide 2'-kinase activity | 10 | 0 | 0 |
| GO:0004364 | glutathione transferase activity | 2 | 1 | 0 |
| GO:0008811 | chloramphenicol O-acetyltransferase activity | 0 | 30 | 0 |
| GO:0004146 | dihydrofolate reductase activity | 0 | 48 | 0 |
New candidate annotations by mapped ARO family/term
| mapped ARO | label | new annotations |
|---|---|---|
| ARO:3000001 | beta-lactamase | 448 |
| ARO:3004112 | phosphoethanolamine transferase conferring colistin resistance | 79 |
| ARO:3000560 | Erm 23S ribosomal RNA methyltransferase | 29 |
| ARO:3000121 | aminoglycoside acetyltransferase (AAC) | 25 |
| ARO:3000218 | aminoglycoside nucleotidyltransferase (ANT) | 13 |
| ARO:3000114 | aminoglycoside phosphotransferase (APH) | 12 |
| ARO:3004271 | 16S rRNA methyltransferase (G1405) | 12 |
| ARO:3000333 | macrolide phosphotransferase (MPH) | 8 |
| ARO:3000318 | mphB | 1 |
| ARO:3003209 | fosA5 | 1 |
| ARO:3000316 | mphA | 1 |
| ARO:3002872 | FosA3 | 1 |