MSA check of two pseudo-enzyme claims

Warnings (1)

MSA check of two pseudo-enzyme claims

Reproducible alignment-level verification of catalytic-residue loss for two of the
pseudo-enzyme examples in the IBA_REVIEW findings (Pattern 7).
Earlier passes relied on UniProt CAUTION/FUNCTION text; this analysis independently
inspects the actual residues.

Run: uv run python catalytic_residue_msa.py (FAMSA alignment via pyfamsa).
Target sequences come from the repo's local genes/.../*-uniprot.txt; the reference
active enzyme (DPYS) and the catalytic-residue positions are pulled live from the
UniProt REST feature tables (with a documented hardcoded fallback for the AGO2
positions if the API returns none — not triggered in the runs reported here, which
used the live metal-binding-site features).

1. Human Argonautes AGO1–4 — RNase-H-like catalytic site (DDH of the DEDH tetrad)

Residues at the positions UniProt annotates on AGO2 as divalent-metal-binding
(the catalytic Asp/Asp/His — i.e. the DDH metal-coordinating subset of the
canonical DEDH slicer tetrad; the catalytic-glutamate "finger" is discussed in
the caveats and is not in the metal-binding feature set):

AGO2 position AGO1 AGO2 AGO3 AGO4
D597 D D D D
D669 D D D G
H807 R H H R

2. CRMP/DPYSL family vs active dihydropyrimidinase (DPYS, Q14117)

Residues at DPYS's UniProt-annotated Zn(2+)-coordinating / active-site positions:

DPYS position DPYS DPYSL2 DPYSL3 DPYSL4 DPYSL5 CRMP1
H67 (Zn) H H H H S N
H69 (Zn) H R H R H Y
K159 (carbamate→Zn) K L M L Q Q
H192 (Zn) H H H H H H
H248 (Zn) H K K K N K
D326 (Zn) D A A A D G

Caveats