SPKW D. melanogaster (Drosophila) Subproject
Parent project: SPKW.md
Overview
Analysis of TRUE SPKW-unique annotations in D. melanogaster using closure-based query.
Key Statistics (2026-01-31)
| Metric | Count |
|---|---|
| TRUE SPKW-unique annotations | 4,101 |
| Unique genes | 2,753 |
| Unique terms | 198 |
Distribution by Aspect
| Aspect | Annotations | Genes | Terms |
|---|---|---|---|
| F (Molecular Function) | 2,501 (61%) | 1,695 | 71 |
| P (Biological Process) | 1,404 (34%) | 1,143 | 112 |
| C (Cellular Component) | 196 (5%) | 189 | 15 |
Top TRUE SPKW-Unique Terms
| Term | Label | Genes | Notes |
|---|---|---|---|
| GO:0046872 | metal ion binding | 1,088 | Very broad MF term |
| GO:0008270 | zinc ion binding | 436 | Broad MF term |
| GO:0051301 | cell division | 119 | Mix of legitimate and over-annotation |
| GO:0007165 | signal transduction | 115 | Broad term |
| GO:0045087 | innate immune response | 63 | Likely legitimate for AMPs |
| GO:0006915 | apoptotic process | 28 | Needs case-by-case review |
| GO:0031640 | killing of cells of another organism | 24 | Legitimate - lysozymes, AMPs |
| GO:0051321 | meiotic cell cycle | 20 | Different pattern than S. pombe |
Pattern Analysis
1. "Killing of cells of another organism" (GO:0031640) - LEGITIMATE
24 genes with this annotation are primarily:
- Lysozymes (LysB, LysD, LysE, LysP, LysS, LysX) - with "hydrolase activity, acting on glycosyl bonds"
- Antimicrobial peptides (Drosomycin, Metchnikowin, Baramicin)
- Spätzle (spz) - cytokine that activates Toll pathway
These are legitimate annotations for antimicrobial effectors.
2. Meiotic Cell Cycle (GO:0051321) - MIXED
Unlike S. pombe where ATG genes were wrongly annotated, D. mel meiotic annotations include:
- GATOR1 complex (Nprl2, Nprl3, Iml1) - These have EXPERIMENTAL evidence for germline/meiotic functions (PMID:25512509) alongside their TOR-regulation role
- Cell division genes (Bub3, Cdc37, cnn) - Potentially legitimate
- Some may still be over-annotations requiring case-by-case review
3. Apoptotic Process (GO:0006915) - NEEDS REVIEW
28 genes include:
- Legitimate: Buffy (Bcl-2 family), HtrA2, Opa1, ASPP, Daxx
- Questionable: Akt, Pdk1, Lkb1 (growth signaling kinases)
- Cross-talk: BI-1 (has both apoptosis AND autophagy annotations)
Key Differences from S. pombe
| Pattern | S. pombe | D. melanogaster |
|---|---|---|
| Meiotic cell cycle | ATG genes over-annotated (NO experimental meiotic evidence) | GATOR1 complex has experimental meiotic evidence |
| Organism context | Unicellular - meiosis = sporulation | Multicellular - germline development |
| Main issue | Autophagy conflated with meiosis | Less clear-cut; some legitimate meiotic functions |
Detailed Case Studies (Full Reviews Completed 2026-01-31)
Case 1: Buffy - Regulation vs Participation Conflation ✓
Gene: Buffy (Q8T8Y5) - Bcl-2 family anti-apoptotic protein
Review file: genes/DROME/[Buffy](../../genes/DROME/Buffy/Buffy-ai-review.html)/Buffy-ai-review.yaml
SPKW annotation: GO:0006915 (apoptotic process)
Experimental evidence: GO:0043066 (negative regulation of apoptotic process) via IMP/IGI
Review decision: MODIFY → GO:0043066 (negative regulation of apoptotic process)
Analysis: Buffy REGULATES apoptosis (as a Bcl-2 family member), it doesn't participate in or execute apoptosis. The SPKW annotation conflates regulatory role with process membership. Key evidence:
- "Buffy, the second Drosophila Bcl-2-like protein, is a pro-survival protein" (PMID:12853472)
- Overexpression inhibits developmental and irradiation-induced apoptosis
- Localizes to ER (not mitochondria like Debcl)
Category: SPKW captures correct biology but at wrong specificity
Case 2: Ced-12 - Legitimate SPKW Contribution ✓
Gene: Ced-12 (Q9VKB2) - ELMO homolog, engulfment machinery
Review file: genes/DROME/[Ced-12](../../genes/DROME/Ced-12/Ced-12-ai-review.html)/Ced-12-ai-review.yaml
SPKW annotations: GO:0006915 (apoptotic process), GO:0006909 (phagocytosis)
Experimental evidence (IMP/IGI): cell migration, myoblast fusion, border follicle cell migration, actin organization
Review decisions:
- GO:0006909 (phagocytosis) → ACCEPT - legitimate conserved function
- GO:0006915 (apoptotic process) → MODIFY to GO:0043652 (engulfment of apoptotic cell)
Analysis: This is a truly valuable SPKW contribution. Ced-12/ELMO is named after C. elegans CED-12, a core component of apoptotic corpse engulfment machinery. D. melanogaster experimental work focused on cell migration and myoblast fusion, but the conserved efferocytosis function is captured ONLY by SPKW. The modification from "apoptotic process" to "engulfment of apoptotic cell" clarifies that ELMO functions in the engulfing cell, not the dying cell.
Category: SPKW provides unique functional insight missing from experimental annotations
Case 3: Sin1 - Over-annotation via Signaling Pathway ✓
Gene: Sin1 (Q9V719) - TORC2 complex subunit
Review file: genes/DROME/[Sin1](../../genes/DROME/Sin1/Sin1-ai-review.html)/Sin1-ai-review.yaml
SPKW annotation: GO:0006915 (apoptotic process)
Experimental evidence: TORC2 signaling (IGI), insulin receptor signaling (IMP), Akt activation
Review decision: REMOVE
Analysis: Sin1 is a TORC2 subunit. The pathway to apoptosis is too indirect:
[Sin1](../../genes/DROME/Sin1/Sin1-ai-review.html) → TORC2 → Akt → Bad → apoptosis
(4-5 enzymatic steps)
This is classic signaling over-extension. Sin1's core function is TORC2 complex assembly and substrate recruitment via its CRIM and PH domains. The apoptosis connection is too many steps removed to justify direct annotation.
Category: SPKW over-extends from indirect signaling connection
Case 4: LysB - Correct Function, Wrong Context ✓
Gene: LysB (Q08694) - Lysozyme B, glycosyl hydrolase family 22
Review file: genes/DROME/[LysB](../../genes/DROME/LysB/LysB-ai-review.html)/LysB-ai-review.yaml
SPKW annotations:
- GO:0031640 (killing of cells of another organism)
- GO:0042742 (defense response to bacterium)
Review decisions:
- GO:0031640 → ACCEPT - lysozyme activity DOES kill bacteria
- GO:0042742 → MODIFY to GO:0007586 (digestion)
Analysis: LysB is a digestive enzyme, not an immune effector:
- Expressed in midgut (larvae and adults), NOT in fat body or hemocytes
- REPRESSED (not induced) after bacterial infection
- UniProt: "Unlikely to play an active role in the humoral immune defense. May have a function in the digestion of bacteria in the food."
The "killing of cells of another organism" annotation is correct - lysozyme enzymatic activity does result in bacterial death. But the "defense response to bacterium" annotation implies immune function, which is wrong for LysB. The biological context is digestion, not immunity.
Category: SPKW captures correct molecular outcome but wrong biological context
Categories of SPKW-Unique Annotations
| Category | Description | Example | Action |
|---|---|---|---|
| Regulation conflation | Annotated to process when evidence is for regulation | Buffy | Consider MODIFY to regulation term |
| Legitimate unique | SPKW captures biology missing from experimental data | Ced-12 | KEEP - valuable contribution |
| Signaling over-extension | Indirect pathway connection | Sin1 | Consider REMOVE or very broad term |
| Support process conflation | Process X active during process Y | S. pombe ATG | REMOVE |
| Correct but uninformative | Very broad terms (metal ion binding) | CG genes | Keep but low priority |
Lessons Learned from D. melanogaster Analysis
-
Organism biology matters: The same GO term can have different validity in different organisms. Meiotic annotations in unicellular yeasts (sporulation) differ fundamentally from multicellular organisms (germline development).
-
SPKW can capture legitimate unique biology: The "killing of cells of another organism" annotations for lysozymes and AMPs are appropriate and represent genuine SPKW contributions.
-
MF terms dominate TRUE SPKW-unique: 61% of unique contributions are Molecular Function, often very broad terms (metal ion binding, zinc ion binding). These may be accurate but uninformative.
-
GATOR1 is a special case: These genes regulate both TOR signaling (autophagy) AND have bona fide germline functions. The meiotic annotation may be appropriate here, unlike S. pombe ATG genes.
-
Apoptosis annotations need case-by-case review: Unlike the systematic ATG over-annotation in S. pombe, D. mel apoptosis includes a mix of legitimate (Bcl-2 family) and questionable (growth kinases) annotations.
Comparative Summary: S. pombe vs D. melanogaster
┌─────────────────────────────────────────────────────────────────────┐
│ SPKW Over-Annotation Patterns │
├─────────────────────────────────────────────────────────────────────┤
│ │
│ S. pombe (Unicellular Yeast) │
│ ──────────────────────────── │
│ Pattern: Expression during meiosis → "Meiosis" keyword │
│ Problem: ATG genes upregulated for autophagy during sporulation │
│ Result: SYSTEMATIC over-annotation (7/7 genes = 100%) │
│ Action: REMOVE meiotic cell cycle from all ATG genes │
│ │
│ D. melanogaster (Multicellular) │
│ ────────────────────────────── │
│ Pattern: More nuanced - some legitimate, some questionable │
│ Difference: Dedicated germline cells with distinct regulation │
│ Result: MIXED - requires case-by-case review │
│ Action: Cannot apply blanket removal; need individual assessment │
│ │
│ Key Insight: Same GO term, different validity by organism context │
│ │
└─────────────────────────────────────────────────────────────────────┘
Recommendations for SPKW Curation
-
Use closure-based queries: Reduces false positives by 70%+ in well-curated organisms
-
Check experimental evidence before removing: D. mel GATOR1 genes have experimental meiotic evidence that S. pombe ATG genes lack
-
Consider organism biology: Unicellular vs multicellular organisms have fundamentally different meiotic biology
-
Prioritize patterns over individual genes: Finding systematic over-annotation patterns (like ATG-meiosis in yeast) is more impactful than reviewing individual genes
-
Some SPKW annotations are legitimate: Don't assume all SPKW-unique = over-annotation. Lysozymes and AMPs in D. mel are correctly annotated.
-
Check for regulation vs participation: SPKW often conflates regulatory roles with process membership - consider if a "regulation of X" term would be more appropriate
-
Value novel functional insights: Cases like Ced-12 where SPKW captures known biology that experimental annotations miss are valuable contributions to GO
Review Files Location
genes/DROME/
├── [Buffy](../../genes/DROME/Buffy/Buffy-ai-review.html)/Buffy-ai-review.yaml
├── [Ced-12](../../genes/DROME/Ced-12/Ced-12-ai-review.html)/Ced-12-ai-review.yaml
├── [Sin1](../../genes/DROME/Sin1/Sin1-ai-review.html)/Sin1-ai-review.yaml
└── [LysB](../../genes/DROME/LysB/LysB-ai-review.html)/LysB-ai-review.yaml