BIOREASON_COMPARISON / article

BIOREASON_COMPARISON / article

Paper drafts and supporting material for the BioReason-Pro / AI-AUGR manuscript, intended for ISMB 2026 Function-COSI.

Files

Thesis

Annotation databases face a practical deployment question — when is a new function-prediction method good enough to trust in production? — that CAFA-style aggregate metrics ($F_{\max}$, $S_{\min}$) cannot fully answer. AI-AUGR (Assessment via Unified Gene-evidence Review) is an agentic curation pipeline that complements CAFA-style evaluation by:

  1. Reading the narrative. Modern agentic predictors such as BioReason-Pro emit free-text functional summaries and chain-of-thought reasoning traces that sit outside bag-of-GO-terms scoring.
  2. Surfacing systematic failure modes. Pseudoenzyme blind spots, localisation defaults, paralog indistinguishability, missing organism-specific biology, neo-functionalisation, narrative–GO disconnect, and cross-kingdom fold bias — not visible in aggregate scores, decisive for deployment.
  3. Distinguishing novel insight from restatement. Most BioReason-Pro summaries narratively restate InterPro2GO. An aggregate score cannot see this; an agentic review can.

Evidence base

How to read this directory

For a reviewer coming in cold, read in this order:

  1. manuscript.tex — the full story.
  2. supplemental-benchmark-details.md — source availability and supplemental denominator checks.
  3. ../BIOREASON_COMPARISON.md — the underlying experimental log with per-organism breakdown, top performers, critical failures, and full failure-mode taxonomy.
  4. ../VALIDATING_ECOLI_PREDICTIONS.md — the de Crécy-Lagard positive-control experimental log.
  5. ../recapitulation-experiment/claude-expt-1/README.md — archived ESR-ECOLI-DET-Mini answer-key-withheld recap results.
  6. short-abstract.md — a 250-word pitch.

PDF build

Build the manuscript PDF directly in this project workspace:

cd projects/BIOREASON_COMPARISON
just pdf

The recipe runs latexmk in article/ and writes article/manuscript.pdf.
Generated PDFs and LaTeX build outputs are intentionally ignored here.