SNIPE (Surface-associated Nuclease Inhibiting Phage Entry) is a 500 amino acid inner membrane-bound GIY-YIG family DNA endonuclease that provides direct anti-bacteriophage defence in Escherichia coli. SNIPE constitutively localizes to the inner membrane via an N-terminal single-pass transmembrane helix (aa 5-24) and pre-associates with the ManYZ mannose permease complex before phage infection. During phage genome injection, the central DUF4041 domain (aa 144-262, now designated the SNIPE-associated domain, IPR025280) binds both incoming phage DNA and the phage tape measure protein (TMP), positioning the C-terminal GIY-YIG nuclease domain (aa 357-450, catalytic residue E414) to cleave phage DNA as it crosses the inner membrane. This provides direct defence β the infected cell survives β in contrast to abortive infection systems. SNIPE does not affect phage adsorption, does not target pre-existing intracellular phage genomes (lysogens), and does not block plasmid transformation, indicating its activity is specific to the phage genome injection process. Originally identified as PD-lambda-1 in a functional screen of 71 diverse E. coli pangenomes (PMID:36123438), SNIPE homologues (~500 curated) are found across many bacterial phyla, with 33% of well-sequenced clades harbouring at least one homologue. The N-terminal region is highly variable and functions as a phage-specificity adapter, with 59% of homologues having one TM domain, 7% having two, and 34% using alternative membrane-targeting strategies (DivIVA domains, type III secretion domains, PH domains). SNIPE represents a previously unknown self/non-self discrimination strategy based on subcellular localization rather than sequence recognition (CRISPR-Cas) or modification detection (restriction-modification). Note: UniProt entry A0A8T9CRB7 is from E. coli strain T0181B.E-10. The experimentally characterized SNIPE was from strain MOD1-ECOR26 (GenBank RCP76574.1, locus APT27_20780). Both share RefSeq WP_001606968.1. SNIPE is not present in E. coli K-12 MG1655.
Definition: A defense response to a virus in which a membrane-anchored nuclease of the SNIPE family cleaves viral DNA during genome injection across the cell membrane. SNIPE proteins associate with inner membrane components at phage genome injection sites, bind phage tape measure proteins via their DUF4041 domain, and use a GIY-YIG nuclease domain to degrade incoming phage DNA before it enters the cytoplasm. Unlike restriction-modification or CRISPR-Cas systems, self/non-self discrimination is achieved through subcellular localization at the membrane rather than recognition of specific DNA sequences or modifications.
Justification: The current anti-foreign nucleic acid branch (GO:0099046 clearance of foreign intracellular nucleic acids) does not capture SNIPE mechanism, because SNIPE cleaves phage DNA during translocation at the membrane before the DNA becomes intracellular. A dedicated term under GO:0051607 (defense response to virus) is needed to represent this direct anti-phage mechanism.
Parent term: defense response to virus
Supporting Evidence:
Definition: A defense response to a virus in which the host targets viral nucleic acid for degradation or modification, preventing viral replication. Includes systems that act on nucleic acid after cell entry (e.g. CRISPR-Cas, restriction-modification) and systems that act during genome injection (e.g. SNIPE).
Justification: Optional intermediate grouping term matching the GO issue proposal. This would provide a clean parent for nucleic acid-targeting anti-viral systems such as CRISPR-Cas, restriction-modification, and SNIPE while preserving mechanistic distinctions in child terms.
Parent term: defense response to virus
Supporting Evidence:
| GO Term | Evidence | Action | Reason |
|---|---|---|---|
|
GO:0004520
DNA endonuclease activity
|
IDA
PMID:41741653 A membrane-bound nuclease directly cleaves phage DNA during ... |
NEW |
Summary: SNIPE contains a C-terminal GIY-YIG nuclease domain (aa 357-450) that endonucleolytically cleaves phage DNA during genome injection. Substitution of the predicted catalytic residue E414 to alanine abolishes both nuclease activity and phage defence (PMID:41741653). Radiolabelled 32P phage DNA is degraded from ~42 kb to fragments <100 bp and mononucleotides in SNIPE-expressing cells, dependent on E414 (PMID:41741653). The pattern of degradation (smear of fragments rather than defined products) is consistent with endonucleolytic cleavage, though in vitro reconstitution has not yet been performed. The GIY-YIG nuclease domain is the most conserved region across ~500 SNIPE homologues (PMID:41741653), and domain-level homology to the T5orf172/MUG113 family of GIY-YIG endonucleases is confirmed by InterPro/Pfam assignments (IPR018306, PF13455). GO:0004520 is the appropriate level of specificity; there is no GIY-YIG-specific child term in GO, and the data do not support a more specific term like GO:0015666 (restriction endodeoxyribonuclease activity) since SNIPE does not recognize specific DNA sequences or modification states.
Reason: Core molecular function of SNIPE, directly demonstrated by multiple experimental approaches in PMID:41741653: (1) 32P-labelled phage DNA cleavage assay showing degradation during injection, (2) E414A catalytic dead mutant abolishing cleavage and defence, (3) Gam-GFP double-strand break reporter showing SNIPE-dependent foci, (4) CFP-ParB foci reduced ~30-fold indicating DNA degradation during injection. The domain architecture (GIY-YIG nuclease, IPR018306/PF13455) independently supports endonuclease activity via homology to biochemically characterized family members. IDA evidence code is appropriate because the nuclease activity was directly assayed in vivo using radiolabelled substrate and catalytic mutant controls.
Supporting Evidence:
PMID:41741653
we demonstrate that SNIPE directly cleaves phage DNA during genome injection
PMID:41741653
Using radiolabelled phage DNA and time-lapse microscopy to track phage genomes, we demonstrate that SNIPE directly cleaves phage DNA during genome injection
PMID:36123438
a putative membrane-anchored protein with a central coiled-coil domain (DUF4041) and a C-terminal DNA binding/cleavage domain
file:ECOLX/SNIPE/SNIPE-deep-research-falcon.md
T5orf172 is a subfamily of GIY-YIG endonucleases ... the two profiles [T5orf172 and MUG113] are highly related
|
|
GO:0003690
double-stranded DNA binding
|
IDA
PMID:41741653 A membrane-bound nuclease directly cleaves phage DNA during ... |
NEW |
Summary: The DUF4041/SNIPE-associated domain (aa 144-262) of SNIPE has a positively charged surface that facilitates DNA binding. When the TM domain is removed (SNIPE(deltaTM E414A)-GFP), the protein co-localizes with the DAPI-stained nucleoid (Pearson R=0.76), and this co-localization is abolished when DUF4041 is also deleted (deltaDUF4041 shows R=0.13) (PMID:41741653). A DUF4041-only fusion to GFP also localizes to the nucleoid. ConSurf analysis shows the positively charged DNA-binding interface is conserved across SNIPE homologues (PMID:41741653). Note that the evidence for DNA binding is based on fluorescence microscopy co-localization with the nucleoid rather than a direct biochemical binding assay (e.g. EMSA); however, the domain-dependent co-localization pattern and the conserved positively charged surface provide strong support for direct DNA binding. Phage lambda DNA is dsDNA, making GO:0003690 the appropriate specific term. The original identification also noted the C-terminal region as a "DNA binding/cleavage domain" (PMID:36123438).
Reason: DNA binding by DUF4041 is demonstrated by microscopy showing domain-dependent nucleoid co-localization (PMID:41741653): removal of DUF4041 abolishes nucleoid association. This is further supported by the conserved positively charged surface identified by ConSurf analysis across ~500 homologues, and by domain annotation as a DNA-binding domain (IPR018306 Phage_T5_Orf172_DNA-bd). IDA evidence code is appropriate for microscopy-based localization experiments with domain deletion controls. The term GO:0003690 (double-stranded DNA binding) is preferred over GO:0003677 (DNA binding) because phage lambda DNA is dsDNA and the nucleoid is dsDNA.
Supporting Evidence:
PMID:36123438
a putative membrane-anchored protein with a central coiled-coil domain (DUF4041) and a C-terminal DNA binding/cleavage domain
|
|
GO:0051607
defense response to virus
|
IDA
PMID:41741653 A membrane-bound nuclease directly cleaves phage DNA during ... |
NEW |
Summary: SNIPE provides direct defence against bacteriophage lambda and diverse siphoviruses. SNIPE-expressing cells survive phage lambda infection at high MOI where control cells lyse (PMID:41741653). Defence is demonstrated by: time-lapse microscopy showing cell survival, growth curves at MOI 0.1-10, plaque assays showing 6-7 log reduction in phage titre, and the BASEL collection screen showing protection against diverse siphoviruses (both ManYZ-dependent and ManYZ-independent) (PMID:41741653). SNIPE provides direct defence (cell survival) rather than abortive infection. It was originally identified as one of 21 novel defence systems in a functional metagenomic screen of the E. coli pangenome (PMID:36123438).
Reason: Defence against bacteriophage is the primary evolved biological function of SNIPE, supported by multiple complementary experimental approaches (PMID:41741653) and by its original identification via a functional anti-phage defence screen (PMID:36123438). GO:0051607 (defense response to virus) is the most specific existing BP term. There is currently no GO term for "defense response to bacteriophage" specifically; a new term request has been drafted (see projects/SNIPE/go-issue-antiviral-nucleic-acid-defense.md). IDA evidence code is appropriate as defence was directly assayed by growth curves, plaque assays, and single-cell microscopy.
Supporting Evidence:
PMID:41741653
we characterize SNIPE, an anti-bacteriophage defence system that constitutively localizes to the bacterial cell membrane in Escherichia coli to block phage
PMID:41741653
SNIPE as a widespread bacterial defence system that exploits the spatial organization of phage genome injection to specifically target viral DNA, representing a previously unknown strategy for distinguishing self from non-self in prokaryotic immune systems
PMID:36123438
Our results unveil 21 conserved defence systems, none of which were previously detected as enriched in defence islands
|
|
GO:0045071
negative regulation of viral genome replication
|
IDA
PMID:41741653 A membrane-bound nuclease directly cleaves phage DNA during ... |
NEW |
Summary: SNIPE prevents phage genome replication by destroying the phage DNA during the injection process, before the genome enters the cytoplasm intact. CFP-ParB foci (a proxy for phage genome establishment) are reduced ~30-fold in SNIPE-expressing cells (PMID:41741653). Phage produced per cell is not significantly different between SNIPE and empty vector when inducing lysogens, confirming that SNIPE targets the injection step specifically and does not affect pre-existing intracellular phage genomes (PMID:41741653). While the net effect is prevention of viral genome replication, the mechanism is more precisely described as destruction of the phage genome during membrane translocation rather than regulation of replication per se. GO:0046597 (host-mediated suppression of symbiont invasion) may be a more mechanistically precise BP term since SNIPE blocks the entry/establishment of the phage genome rather than regulating its replication after entry.
Reason: The outcome of SNIPE activity is that viral genome replication is completely prevented, making GO:0045071 a defensible annotation. The evidence is strong: CFP-ParB foci reduction demonstrates that phage genomes do not establish in SNIPE-expressing cells (PMID:41741653). However, SNIPE acts upstream of replication -- it destroys the DNA during injection, not by regulating the replication machinery. This distinction may matter for precision. The annotation is retained because the net biological outcome (no viral genome replication) is accurately captured. See also the additional GO:0046597 annotation below for a more mechanistically precise complementary term. IDA evidence code is appropriate.
Supporting Evidence:
PMID:41741653
we demonstrate that SNIPE directly cleaves phage DNA during genome injection
PMID:41741653
SNIPE associates with host proteins essential for lambda genome entry and with the lambda tape measure protein, which facilitates lambda genome injection across the inner membrane
|
|
GO:0046597
host-mediated suppression of symbiont invasion
|
IDA
PMID:41741653 A membrane-bound nuclease directly cleaves phage DNA during ... |
NEW |
Summary: SNIPE inhibits phage genome entry into the bacterial cell by cleaving phage DNA during the injection process at the inner membrane. This directly suppresses the phage's invasion of the host cell. GO:0046597 is defined as "a process in which a host inhibits or disrupts the entry of a symbiont into a host cell." While bacteriophages inject their genome rather than entering bodily, the phage genome translocation across the inner membrane constitutes the functional "entry" event that SNIPE suppresses. SNIPE does not block phage adsorption to the cell surface -- phage adsorption is unaffected (PMID:41741653) -- but it destroys the phage DNA during membrane translocation, preventing genome establishment. This term complements GO:0045071 by providing a more mechanistically precise description of SNIPE's biological role.
Reason: SNIPE acts specifically at the step of phage genome injection across the inner membrane (PMID:41741653). It does not affect phage adsorption, does not target pre-existing lysogens, and does not block plasmid transformation -- it is specific to the injection process. GO:0046597 (host-mediated suppression of symbiont invasion) captures this specificity better than GO:0045071 (negative regulation of viral genome replication), which implies action on the replication process itself. GO:0046597 is the parent of several relevant child terms in GO and is compatible with bacterial hosts defending against phage. IDA evidence code is appropriate.
Supporting Evidence:
PMID:41741653
we characterize SNIPE, an anti-bacteriophage defence system that constitutively localizes to the bacterial cell membrane in Escherichia coli to block phage
PMID:36123438
We confirmed that each system did not affect phage adsorption (Extended Data Fig
|
|
GO:0006308
DNA catabolic process
|
IDA
PMID:41741653 A membrane-bound nuclease directly cleaves phage DNA during ... |
NEW |
Summary: SNIPE degrades phage DNA from ~42 kb to fragments less than 100 bp and mononucleotides during genome injection (PMID:41741653). This is a DNA catabolic process -- the breakdown of DNA. The 32P-labelled DNA tracking experiment directly demonstrates catabolism of the phage DNA substrate. While the MF annotation (GO:0004520 DNA endonuclease activity) captures the catalytic mechanism, GO:0006308 captures the biological process of DNA degradation that results from this activity.
Reason: The degradation of phage DNA from intact genomic molecules (~42 kb) to small fragments (<100 bp) and mononucleotides is directly demonstrated by the 32P-labelled DNA cleavage assay (PMID:41741653). This constitutes DNA catabolism. GO:0006308 provides a process-level annotation that complements the molecular function (GO:0004520) and the higher-level biological context (GO:0051607 defense response to virus). IDA evidence code is appropriate as the catabolic activity was directly measured using radiolabelled substrate.
Supporting Evidence:
PMID:41741653
we demonstrate that SNIPE directly cleaves phage DNA during genome injection
|
|
GO:0005887
integral component of plasma membrane
|
IDA
PMID:41741653 A membrane-bound nuclease directly cleaves phage DNA during ... |
NEW |
Summary: SNIPE is an integral inner membrane protein with a single-pass TM helix (aa 5-24). Topology was confirmed by PhoA-LacZalpha fusion analysis: PhoA was active only when inserted at the N-terminus (periplasmic side), and LacZalpha was active only when inserted downstream of the TM domain (cytoplasmic side) (PMID:41741653). SNIPE-GFP localizes uniformly to the cell membrane by fluorescence microscopy (PMID:41741653). Immunoblotting of fractionated cell lysates shows SNIPE is strongly enriched in the membrane fraction (PMID:41741653). DeepTMHMM and Phobius both predict a transmembrane helix at residues 6-24, consistent with the UniProt annotation (ECO:0000256|SAM:Phobius). In bacteria, GO:0005886 (plasma membrane) corresponds to the inner/cytoplasmic membrane. The single-pass TM helix makes SNIPE an integral (not peripheral) component, supporting GO:0005887 over GO:0005886.
Reason: Integral membrane localization is directly demonstrated by three independent experimental approaches in PMID:41741653: (1) PhoA/LacZalpha topology mapping confirming transmembrane orientation, (2) fluorescence microscopy of SNIPE-GFP showing uniform membrane localization, (3) membrane fractionation/immunoblotting showing enrichment in membrane fraction. This is further supported by the predicted TM helix (Phobius, DeepTMHMM) in the UniProt record. GO:0005887 (integral component of plasma membrane) is preferred over GO:0005886 (plasma membrane) because the single-pass TM helix makes SNIPE an integral membrane protein. IDA evidence code is appropriate for the topology mapping and fractionation experiments.
Supporting Evidence:
PMID:41741653
an anti-bacteriophage defence system that constitutively localizes to the bacterial cell membrane in Escherichia coli
PMID:41741653
Here, we characterize SNIPE, an anti-bacteriophage defence system that constitutively localizes to the bacterial cell membrane in Escherichia coli to block phage Ξ» infection
|
Q: Is GO:0004520 (DNA endonuclease activity) the correct MF term for SNIPE, or should a more specific child term (e.g. reflecting GIY-YIG family membership or the membrane-localized context) be used?
Q: Should the ManYZ pre-association be annotated as a separate protein complex, or is it better modelled as a GO-CAM causal association (SNIPE located_in plasma membrane, colocalizes_with ManYZ complex)?
Q: Does the tape measure protein (TMP) binding by DUF4041 warrant a specific MF annotation (e.g. viral protein binding) or is this better captured only in the GO-CAM model?
Experiment: The 32P cleavage assay (Fig 2d) shows a smear of fragments consistent with endonucleolytic cleavage, but the exact cleavage mechanism and products have not been biochemically characterized with purified SNIPE. In vitro reconstitution of SNIPE nuclease activity with defined DNA substrates would confirm endonuclease vs exonuclease activity, determine sequence specificity (if any), and characterize cleavage products.
Hypothesis: SNIPE cleaves phage DNA endonucleolytically (producing internal cuts) rather than exonucleolytically
Type: Biochemical assay
Experiment: Test representative SNIPE homologues lacking TM domains for phage defence in their native hosts or heterologous expression. Determine whether these homologues also pre-associate with inner membrane proteins at genome injection sites, and whether their N-terminal domains dictate phage specificity.
Hypothesis: SNIPE homologues with non-TM N-terminal regions (DivIVA, T3SS, PH domain) use analogous mechanisms but target different phage entry pathways
Type: Genetics/Functional assay
The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.
You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.
We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.
We are interested in where in or outside the cell the gene product carries out its function.
We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.
Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.
Research Report: Functional Annotation of βSNIPEβ (UniProt A0A8T9CRB7; ORF D4N09_22825) in Escherichia coli (ECOLX)
Target identity provided by the user specifies UniProt A0A8T9CRB7 as an E. coli protein described as βDUF4041 domain-containing protein,β with InterPro/Pfam domain annotations including Phage_T5_Orf172_DNA-bd (IPR018306), SNIPE (IPR025280; PF13250), and MUG113 (PF13455). In the tool-retrieved literature, βSNIPEβ did not appear as an E. coli gene symbol nor as PF13250/IPR025280, and no publication directly referenced A0A8T9CRB7 or ORF D4N09_22825. Therefore, the current evidence base supports only domain-context-based functional inference and does not support an experimentally validated gene-specific function for this particular E. coli protein.
Critically, the only directly relevant and recent evidence recovered concerns the related T5orf172/MUG113 family, which is treated as a GIYβYIG endonuclease subfamily and a common phage homing endonuclease gene (HEG) module. Barth et al. explicitly accept MUG113 (PF13455) for T5orf172 proteins because the profiles are highly related, supporting the interpretation that the MUG113-like part of the userβs protein is in the T5orf172-like HEG/nuclease space (https://doi.org/10.1101/2023.03.23.533998; posted 2023-03-23) (barth2023nucleasegenesoccupy pages 3-4).
2.1 Homing endonuclease genes (HEGs)
HEGs are βselfishβ mobile elements that spread by introducing site-specific DNA breaks in homologous target loci lacking the HEG, followed by repair via homologous recombination that copies the HEG into the cleaved allele (βhomingβ). This canonical model is explicitly described and used as a conceptual framework for T5orf172-family proteins in both bacteriophage and eukaryotic-virus-associated mobile elements (https://doi.org/10.1093/ve/veae051; published 2024-07) (barth2024genomicanalysisof pages 8-10).
2.2 GIYβYIG endonucleases and the T5orf172 subfamily
The GIYβYIG endonuclease superfamily comprises catalytic nuclease domains used by diverse mobile elements, including many phage-associated HEGs. In the retrieved corpus, T5orf172 is described as a GIYβYIG endonuclease subfamily: Barth et al. state βT5orf172 is a subfamily of GIY-YIG endonucleases,β and note that in at least one example, nuclease activity depends on a glutamate residue whose catalytic role is inferred from alignments to biochemically characterized GIYβYIG enzymes (https://doi.org/10.1101/2023.03.23.533998; posted 2023-03-23) (barth2023nucleasegenesoccupy pages 6-10, barth2023nucleasegenesoccupy pages 4-6).
2.3 Modular architecture: DNA-binding + nuclease domains
A recurring theme in T5orf172-like HEGs is domain modularity: a nuclease domain paired with variable DNA-binding modules that determine target specificity. Barth et al. operationalize this by trimming sequences to separate terminal nuclease domains from putative DNA-binding termini and then annotating DNA-binding domains (CapR-like and Zn-finger DUFs; RepA_N) (barth2023nucleasegenesoccupy pages 4-6). Their figures visually summarize the βbeads-on-a-stringβ repeat architecture for DNA-binding modules followed by a terminal nuclease domain in representative T5orf172 HEGs (barth2023nucleasegenesoccupy media 643002a1, barth2023nucleasegenesoccupy media fc0baff9).
2.4 Phage satellites and phageβsatellite conflict as a functional context
In Vibrio cholerae phage ICP1, and its satellites called PLEs (phage-inducible chromosomal island-like elements), there is an arms-race dynamic in which satellites inhibit phage production and phages counter-adapt. This system provides experimentally grounded examples where T5orf172-like nuclease modules (and HEG-derived DNA-binding modules) have clear functional roles in defense and counter-defense (barth2021achimericnuclease pages 12-14, barth2023nucleasegenesoccupy pages 18-20).
3.1 What is directly known for A0A8T9CRB7 (SNIPE in ECOLX)
No direct experimental evidence was retrieved for A0A8T9CRB7 itself, its gene/ORF D4N09_22825, or for PF13250/IPR025280 (βSNIPEβ) in E. coli. Consequently, claims about its precise biochemical activity (e.g., a specific cleavage motif or substrate), its physiological pathway in E. coli, or its cellular localization cannot be made from the retrieved evidence set.
3.2 High-confidence inference from the closely related T5orf172/MUG113 domain context
Given (i) the explicit statement that MUG113 (PF13455) is βhighly relatedβ to T5orf172 and accepted for T5orf172-containing proteins (barth2023nucleasegenesoccupy pages 3-4), and (ii) the repeated classification of T5orf172 as a GIYβYIG endonuclease subfamily (barth2023nucleasegenesoccupy pages 6-10, barth2023nucleasegenesoccupy pages 4-6), the strongest defensible functional inference is:
β’ The protein family context is consistent with a DNA endonuclease (GIYβYIG-like) that is frequently embedded in mobile genetic elements (especially phages) and often functions as a homing endonuclease or as a domesticated derivative that targets specific DNA sequences (barth2024genomicanalysisof pages 8-10, barth2023nucleasegenesoccupy pages 10-12).
β’ The functional βsubstrate classβ in characterized systems is DNA, and specificity is typically determined by fused DNA-binding modules (e.g., RepA_N origin-binding domains or zinc-finger-like repeats related to CapR) (barth2023nucleasegenesoccupy pages 18-20, barth2023nucleasegenesoccupy pages 15-18).
β’ Therefore, for A0A8T9CRB7, the most plausible primary molecular roleβconditional on its domain architecture being homologous to these systemsβis as a sequence-specific DNA-binding protein with a nuclease-associated region (or an HEG-derived DNA-binding module), potentially involved in mobile-element dynamics rather than core E. coli metabolism.
3.3 Likely cellular/biological compartment
Characterized T5orf172-like effectors and their domesticated derivatives act on DNA during phage infection or mobile-element activity; this implies intracellular action on nucleic acids (host or viral DNA), not secretion or membrane localization (barth2021achimericnuclease pages 12-14, barth2023nucleasegenesoccupy pages 18-20). However, localization for A0A8T9CRB7 in E. coli cannot be asserted without accession-specific evidence.
Because A0A8T9CRB7 lacks direct evidence, the most useful annotation anchors come from experimentally characterized T5orf172-containing proteins in related phage/satellite systems.
4.1 Odn/Gp88: a modular origin-targeting nuclease with a T5orf172-like nuclease domain
Barth et al. (eLife, 2021-07; https://doi.org/10.7554/elife.68339) report that ICP1 isolates lacking CRISPR-Cas instead encode gp88, a T5orf172-domain protein (pfam10544) whose activity is consistent with a GIYβYIG-like nuclease mechanism: it retains key catalytic residues and a conserved glutamate (E180) whose mutation (E180A) abolishes function (barth2021achimericnuclease pages 4-5, barth2021achimericnuclease pages 2-4). Functionally, gp88 is required for phage propagation on hosts carrying specific PLE variants (PLE1/4/5), linking the nuclease to anti-satellite immunity (barth2021achimericnuclease pages 4-5, barth2021achimericnuclease pages 2-4).
The same study provides direct substrate specificity evidence: purified Gp88 cleaves PCR fragments containing replication origin sequences from sensitive PLEs (PLE1/4/5) but not insensitive PLEs (PLE2/3) in vitro (assayed at 500 nM protein), and the catalytic-site mutant E180A abolishes cleavage (barth2021achimericnuclease pages 7-9). Mechanistically, Odn/Gp88 is presented as a chimeric nuclease in which DNA-binding specificity and nuclease function are separable modules; escape can occur via changes in iteron spacing/architecture, suggesting steric/spacing constraints similar to GIYβYIG endonucleases (barth2021achimericnuclease pages 12-14).
4.2 CapR: domesticated HEG-derived DNA-binding module used for transcriptional repression
Netter et al. (Nucleic Acids Research, 2021-11; https://doi.org/10.1101/2020.11.28.402263) describe CapR as a small (15.5 kDa) satellite-encoded DNA-binding protein that represses the ICP1 capsid morphogenesis operon. EMSAs show CapR binds specifically to the capsid operon promoter region (binds probe A; not a control probe), and mapping indicates binding upstream of the promoter core elements (netter2021aphagesatellite pages 11-13). CapR-mediated repression is tuned: it downregulates capsid operon transcription by ~3-fold (producing roughly one-third as many capsid morphogenesis proteins) without measurably inhibiting ICP1 progeny production under tested conditions, whereas stronger repression via CRISPRi harms both phage and satellite mobilization (netter2021aphagesatellite pages 13-15, netter2021aphagesatellite pages 11-13).
CapR is linked bioinformatically to domains of ICP1 homing endonucleases, supporting an βexpertβ evolutionary interpretation: a putative HEG-derived DNA-binding module was domesticated into a transcriptional repressor after nuclease loss, analogous to known HEG regulatory behaviors (netter2021aphagesatellite pages 13-15, netter2021aphagesatellite pages 15-16).
5.1 2023: Phage-wide genomics of T5orf172/MUG113 HEGs, domain architectures, and mobility statistics
Barth et al. (bioRxiv, posted 2023-03-23; https://doi.org/10.1101/2023.03.23.533998) provide large-scale comparative genomics of HEGs across phages, with emphasis on T5orf172. They report that T5orf172 HEGs are common across multiple phage clades and frequently occur in or near essential genes. Quantitative neighborhood statistics include 468 T5orf172 neighborhoods across 204 genomes, with 35% near terminase, 33% near ribonucleotide reductase, and 31% near capsid genes (barth2023nucleasegenesoccupy pages 10-12).
They also quantify DNA-binding module associations for T5orf172 HEGs: 95/229 (41.5%) encode a CapR-like domain, and 200/229 (87.3%) contain at least one of three Zn-finger-like domains (CapR, DUF723, or DUF4397), often as multiple repeats consistent with class V repeat proteins (βbeads on a stringβ) (barth2023nucleasegenesoccupy pages 15-18). These repetitive, modular architectures provide a mechanistic rationale for rapid specificity diversification via recombination and domain shuffling (barth2023nucleasegenesoccupy pages 15-18). Figures 5β6 visually depict these repeated DNA-binding domains linked to terminal nuclease modules and illustrate recombination events that generate deletions and domain swaps (barth2023nucleasegenesoccupy media 643002a1, barth2023nucleasegenesoccupy media fc0baff9).
Importantly for your targetβs domain set: the authors explicitly state that for proteins with T5orf172 domains, they also accepted MUG113 (PF13455) because βthe two profiles are highly related,β providing the clearest support in this corpus that MUG113-like proteins fall into the T5orf172 HEG/nuclease domain space (barth2023nucleasegenesoccupy pages 3-4).
5.2 2024: T5orf172 as a mobile HEG module in eukaryotic-virus-associated elements
Barth et al. (Virus Evolution, 2024-07; https://doi.org/10.1093/ve/veae051) extend the T5orf172 framing beyond bacteriophages, describing T5orf172 as a GIYβYIG subfamily domain and discussing its association with mobility, repeats, and recombination in hyperparasitic viruses (polinton-like viruses) associated with entomopoxviruses. They highlight hallmarks consistent with HEG-like mobility (multiple homologous copies, repetitive/modular architectures, flanking repeats, rearrangements) and explicitly define the HEG homing mechanism to interpret these patterns (barth2024genomicanalysisof pages 8-10). They further discuss possible βcollaborative homingβ scenarios where HEGs mobilize nearby cargo and intron-associated elements, framing T5orf172-like endonucleases as contributors to module turnover and replicon evolution (barth2024genomicanalysisof pages 10-11).
6.1 Natural, deployable design principles for sequence-targeted nucleases
The Odn/Gp88 case demonstrates a naturally evolved βprogrammable-likeβ paradigm: a DNA-binding specificity module (RepA_N-like origin-binding domain) fused to a GIYβYIG-like nuclease effector (T5orf172-like domain) creates a sequence/architecture-specific nuclease that targets satellite replication origins and provides immunity in the absence of CRISPR-Cas (barth2021achimericnuclease pages 7-9, barth2021achimericnuclease pages 4-5). This is relevant for synthetic biology and phage engineering conceptually, because it shows that modular domain shuffling can generate novel targeting specificities (barth2021achimericnuclease pages 12-14).
6.2 Transcriptional regulation and particle hijacking modules (phage satellite biology)
CapR represents a βdomesticated endonuclease-derivedβ DNA-binding module used to tune helper-phage gene expression to balance inhibition and satellite propagation (netter2021aphagesatellite pages 15-16, netter2021aphagesatellite pages 13-15). This is a real biological implementation of fine-tuned repression rather than maximal suppression.
6.3 Practical genome annotation and phage genomics
Barth et al. 2023 provides concrete quantitative heuristics used in annotation pipelines: detection of T5orf172/MUG113 domains via Pfam HMMs, trimming strategies to isolate nuclease vs DNA-binding termini, and the empirical prevalence of specific DNA-binding modules (CapR/DUF723/DUF4397; RepA_N) that co-occur with T5orf172-like nuclease domains (barth2023nucleasegenesoccupy pages 4-6, barth2023nucleasegenesoccupy pages 15-18). These are directly applicable to real-world functional annotation workflows for prophage and mobile-element proteins.
Across the retrieved works, the recurring expert interpretation is that T5orf172/MUG113-family genes are best viewed primarily through the lens of mobile-element biology (HEGs) rather than as stable, core cellular genes. Barth et al. emphasize the mobility signatures of these genes (presence/absence variation across isolates, frameshifts/deletions, intron/intein associations, proximity to essential genes) and caution that homology/domain presence does not automatically imply active βhoming,β because HEG-derived proteins can be domesticated for new roles (barth2023nucleasegenesoccupy pages 6-10, barth2023nucleasegenesoccupy pages 23-25). Netter et al. similarly frame CapR as a domesticated derivative of a putative homing endonuclease lineage, emphasizing repurposing of DNA-binding function after loss of nuclease function (netter2021aphagesatellite pages 13-15, netter2021aphagesatellite pages 15-16).
Quantitative data points relevant to domain-family function and to annotation:
β’ In phage neighborhood surveys: among 468 T5orf172 neighborhoods (204 genomes), 35% associate with terminase, 33% with ribonucleotide reductase, and 31% with capsid genes (Barth et al., bioRxiv posted 2023-03-23) (barth2023nucleasegenesoccupy pages 10-12).
β’ DNA-binding module prevalence in T5orf172 HEGs: 95/229 (41.5%) encode a CapR-like domain; 200/229 (87.3%) encode at least one of CapR/DUF723/DUF4397 Zn-finger domains; many encode multiple repeats (barth2023nucleasegenesoccupy pages 15-18).
β’ CapR/CRISPRi phenotypes (Netter et al., NAR 2021-11): CRISPRi targeting the capsid operon reduces capsid protein ~3-fold, reduces ICP1 plaquing efficiency to ~1/5, reduces ICP1 particle formation ~3-fold, and reduces PLE transduction by nearly an order of magnitude; in contrast, CapR-mediated repression does not inhibit ICP1 progeny despite ~3-fold transcriptional downregulation of the capsid operon (netter2021aphagesatellite pages 11-13, netter2021aphagesatellite pages 13-15).
β’ Origin-selective cleavage by Odn/Gp88: purified protein cleaves origin-containing DNA fragments from some PLEs but not others, and catalytic mutant E180A abolishes cleavage/activity (barth2021achimericnuclease pages 7-9).
Given the lack of accession-specific literature in the retrieved corpus, the following points cannot be responsibly concluded for A0A8T9CRB7 without additional targeted database retrieval and/or experiments:
β’ Whether A0A8T9CRB7 is encoded in a prophage region, plasmid, integrative element, or core chromosome.
β’ Whether it is an active nuclease, a DNA-binding protein lacking nuclease activity, or another type of mobile-element accessory factor.
β’ Its in vivo DNA targets, if any (e.g., origin sequences, repeat elements), and whether it participates in a recognizable E. coli pathway.
β’ Subcellular localization beyond the generic expectation of intracellular nucleic-acid interaction implied by the domain context.
Based strictly on the retrieved evidence and the domain context provided by the user, a conservative, evidence-aligned annotation for A0A8T9CRB7 would be:
β’ βPredicted mobile-element-associated DNA-binding protein with homing-endonuclease-related (GIYβYIG/T5orf172/MUG113-like) nuclease domain architecture; likely involved in sequence-specific DNA interactions in prophage/mobile element contexts; function uncharacterized in E. coli.β
This phrasing aligns with the demonstrated behavior of T5orf172/MUG113 proteins as modular HEG-derived nucleases and DNA-binding proteins that can be domesticated for regulation or anti-satellite defense, while explicitly acknowledging that SNIPE (PF13250/IPR025280) and the specific UniProt accession were not directly covered by the retrieved literature (barth2023nucleasegenesoccupy pages 4-6, barth2023nucleasegenesoccupy pages 6-10, netter2021aphagesatellite pages 13-15).
References (URLs and dates)
β’ Barth ZK, Dunham DT, Seed KD. Nuclease genes occupy boundaries of genetic exchange between bacteriophages. bioRxiv. Posted 2023-03-23. https://doi.org/10.1101/2023.03.23.533998 (barth2023nucleasegenesoccupy pages 3-4, barth2023nucleasegenesoccupy pages 15-18, barth2023nucleasegenesoccupy pages 10-12).
β’ Netter Z, Boyd CM, Silvas TV, Seed KD. A phage satellite tunes inducing phage gene expression using a domesticated endonuclease to balance inhibition and virion hijacking. Nucleic Acids Research. 2021-11. https://doi.org/10.1101/2020.11.28.402263 (netter2021aphagesatellite pages 11-13, netter2021aphagesatellite pages 13-15).
β’ Barth ZK, Nguyen MHT, Seed KD. A chimeric nuclease substitutes a phage CRISPR-Cas system to provide sequence-specific immunity against subviral parasites. eLife. 2021-07. https://doi.org/10.7554/elife.68339 (barth2021achimericnuclease pages 7-9, barth2021achimericnuclease pages 4-5).
β’ Barth ZK, Hicklin I, ThΓ©zΓ© J, et al. Genomic analysis of hyperparasitic viruses associated with entomopoxviruses. Virus Evolution. 2024-07. https://doi.org/10.1093/ve/veae051 (barth2024genomicanalysisof pages 8-10, barth2024genomicanalysisof pages 10-11).
References
(barth2023nucleasegenesoccupy pages 3-4): Zachary K Barth, Drew T Dunham, and Kimberley D Seed. Nuclease genes occupy boundaries of genetic exchange between bacteriophages. bioRxiv, Mar 2023. URL: https://doi.org/10.1101/2023.03.23.533998, doi:10.1101/2023.03.23.533998. This article has 20 citations.
(barth2024genomicanalysisof pages 8-10): Zachary K Barth, Ian Hicklin, Julien ThΓ©zΓ©, Jun Takatsuka, Madoka Nakai, Elisabeth A Herniou, Anne M Brown, and Frank O Aylward. Genomic analysis of hyperparasitic viruses associated with entomopoxviruses. Virus Evolution, Jul 2024. URL: https://doi.org/10.1093/ve/veae051, doi:10.1093/ve/veae051. This article has 6 citations and is from a peer-reviewed journal.
(barth2023nucleasegenesoccupy pages 6-10): Zachary K Barth, Drew T Dunham, and Kimberley D Seed. Nuclease genes occupy boundaries of genetic exchange between bacteriophages. bioRxiv, Mar 2023. URL: https://doi.org/10.1101/2023.03.23.533998, doi:10.1101/2023.03.23.533998. This article has 20 citations.
(barth2023nucleasegenesoccupy pages 4-6): Zachary K Barth, Drew T Dunham, and Kimberley D Seed. Nuclease genes occupy boundaries of genetic exchange between bacteriophages. bioRxiv, Mar 2023. URL: https://doi.org/10.1101/2023.03.23.533998, doi:10.1101/2023.03.23.533998. This article has 20 citations.
(barth2023nucleasegenesoccupy media 643002a1): Zachary K Barth, Drew T Dunham, and Kimberley D Seed. Nuclease genes occupy boundaries of genetic exchange between bacteriophages. bioRxiv, Mar 2023. URL: https://doi.org/10.1101/2023.03.23.533998, doi:10.1101/2023.03.23.533998. This article has 20 citations.
(barth2023nucleasegenesoccupy media fc0baff9): Zachary K Barth, Drew T Dunham, and Kimberley D Seed. Nuclease genes occupy boundaries of genetic exchange between bacteriophages. bioRxiv, Mar 2023. URL: https://doi.org/10.1101/2023.03.23.533998, doi:10.1101/2023.03.23.533998. This article has 20 citations.
(barth2021achimericnuclease pages 12-14): Zachary K Barth, Maria HT Nguyen, and Kimberley D Seed. A chimeric nuclease substitutes a phage crispr-cas system to provide sequence-specific immunity against subviral parasites. eLife, Jul 2021. URL: https://doi.org/10.7554/elife.68339, doi:10.7554/elife.68339. This article has 46 citations and is from a domain leading peer-reviewed journal.
(barth2023nucleasegenesoccupy pages 18-20): Zachary K Barth, Drew T Dunham, and Kimberley D Seed. Nuclease genes occupy boundaries of genetic exchange between bacteriophages. bioRxiv, Mar 2023. URL: https://doi.org/10.1101/2023.03.23.533998, doi:10.1101/2023.03.23.533998. This article has 20 citations.
(barth2023nucleasegenesoccupy pages 10-12): Zachary K Barth, Drew T Dunham, and Kimberley D Seed. Nuclease genes occupy boundaries of genetic exchange between bacteriophages. bioRxiv, Mar 2023. URL: https://doi.org/10.1101/2023.03.23.533998, doi:10.1101/2023.03.23.533998. This article has 20 citations.
(barth2023nucleasegenesoccupy pages 15-18): Zachary K Barth, Drew T Dunham, and Kimberley D Seed. Nuclease genes occupy boundaries of genetic exchange between bacteriophages. bioRxiv, Mar 2023. URL: https://doi.org/10.1101/2023.03.23.533998, doi:10.1101/2023.03.23.533998. This article has 20 citations.
(barth2021achimericnuclease pages 4-5): Zachary K Barth, Maria HT Nguyen, and Kimberley D Seed. A chimeric nuclease substitutes a phage crispr-cas system to provide sequence-specific immunity against subviral parasites. eLife, Jul 2021. URL: https://doi.org/10.7554/elife.68339, doi:10.7554/elife.68339. This article has 46 citations and is from a domain leading peer-reviewed journal.
(barth2021achimericnuclease pages 2-4): Zachary K Barth, Maria HT Nguyen, and Kimberley D Seed. A chimeric nuclease substitutes a phage crispr-cas system to provide sequence-specific immunity against subviral parasites. eLife, Jul 2021. URL: https://doi.org/10.7554/elife.68339, doi:10.7554/elife.68339. This article has 46 citations and is from a domain leading peer-reviewed journal.
(barth2021achimericnuclease pages 7-9): Zachary K Barth, Maria HT Nguyen, and Kimberley D Seed. A chimeric nuclease substitutes a phage crispr-cas system to provide sequence-specific immunity against subviral parasites. eLife, Jul 2021. URL: https://doi.org/10.7554/elife.68339, doi:10.7554/elife.68339. This article has 46 citations and is from a domain leading peer-reviewed journal.
(netter2021aphagesatellite pages 11-13): Zoe Netter, Caroline M. Boyd, Tania V. Silvas, and Kimberley D. Seed. A phage satellite tunes inducing phage gene expression using a domesticated endonuclease to balance inhibition and virion hijacking. Nucleic Acids Research, 49:4386-4401, Nov 2021. URL: https://doi.org/10.1101/2020.11.28.402263, doi:10.1101/2020.11.28.402263. This article has 35 citations and is from a highest quality peer-reviewed journal.
(netter2021aphagesatellite pages 13-15): Zoe Netter, Caroline M. Boyd, Tania V. Silvas, and Kimberley D. Seed. A phage satellite tunes inducing phage gene expression using a domesticated endonuclease to balance inhibition and virion hijacking. Nucleic Acids Research, 49:4386-4401, Nov 2021. URL: https://doi.org/10.1101/2020.11.28.402263, doi:10.1101/2020.11.28.402263. This article has 35 citations and is from a highest quality peer-reviewed journal.
(netter2021aphagesatellite pages 15-16): Zoe Netter, Caroline M. Boyd, Tania V. Silvas, and Kimberley D. Seed. A phage satellite tunes inducing phage gene expression using a domesticated endonuclease to balance inhibition and virion hijacking. Nucleic Acids Research, 49:4386-4401, Nov 2021. URL: https://doi.org/10.1101/2020.11.28.402263, doi:10.1101/2020.11.28.402263. This article has 35 citations and is from a highest quality peer-reviewed journal.
(barth2024genomicanalysisof pages 10-11): Zachary K Barth, Ian Hicklin, Julien ThΓ©zΓ©, Jun Takatsuka, Madoka Nakai, Elisabeth A Herniou, Anne M Brown, and Frank O Aylward. Genomic analysis of hyperparasitic viruses associated with entomopoxviruses. Virus Evolution, Jul 2024. URL: https://doi.org/10.1093/ve/veae051, doi:10.1093/ve/veae051. This article has 6 citations and is from a peer-reviewed journal.
(barth2023nucleasegenesoccupy pages 23-25): Zachary K Barth, Drew T Dunham, and Kimberley D Seed. Nuclease genes occupy boundaries of genetic exchange between bacteriophages. bioRxiv, Mar 2023. URL: https://doi.org/10.1101/2023.03.23.533998, doi:10.1101/2023.03.23.533998. This article has 20 citations.
Title: A membrane-bound nuclease directly cleaves phage DNA during genome injection
Authors: Saxton DS, DeWeirdt PC, Doering CR, Roney IJ & Laub MT
DOI: 10.1038/s41586-026-10207-1
PMID: 41741653
Three domains confirmed by AlphaFold, HHpred, and DeepTMHMM:
SNIPE provides direct defence (infected cell survives), distinct from abortive infection:
SNIPE represents a novel strategy distinct from:
- CRISPR-Cas: sequence-specific recognition via guide RNAs
- Restriction-modification: recognition of DNA methylation marks
- SNIPE: exploits spatial organization β phage DNA must pass through the membrane during injection, and SNIPE is positioned there to intercept it PMID:41741653
Suggest data-driven feature combinations for:
IPR025280 + GIY-YIG nuclease + membrane targeting features
to support an architecture-aware annotation strategy.
IPR025280 (SNIPE associated domain)IPR029330 (small control run to test script on a different input)Output folders:
results/IPR025280/results/IPR029330/Top architectures from results/IPR025280/architecture_table.tsv:
PF13250:IPR025280-PF13455 -> 1109 proteins (68.80%)PF13250:IPR025280-PF10544:IPR018306 -> 270 proteins (16.75%)PF13250:IPR025280 -> 139 proteins (8.62%)Top 3 cover 1518/1612 proteins (94.17%).
From results/IPR025280/cooccurring_domain_weighted.tsv:
PF13455 -> 1189 proteins (73.76%)PF10544 + IPR018306 -> 277 proteins (17.18%)Combined, these catalytic co-features cover 1466/1612 proteins (90.94%).
From results/IPR025280/nterm_tm_heuristic_overall.tsv:
From results/IPR025280/representative_extra_features.tsv:
Q7MG17: TMhelix 5-22, signal peptide 1-23Q6AMX9: TMhelix 7-24Rare but explicit membrane-associated accessory domains (from cooccurring_domain_weighted.tsv):
PF14470 / IPR039519 (bacterial PH domain): 3 proteins (0.19%)PF05154 / IPR007829 (TM2 domain): 1 protein (0.06%)Potentially membrane-linked but weaker evidence:
PF10708 / IPR018929 (DUF2510): 76 proteins (4.71%); Pfam description notes many members are putative membrane proteins.Use all of:
IPR025280 (SNIPE associated domain)PF13455 OR IPR018306/PF10544IPR007829 or IPR039519 when presentUse:
IPR025280PF13455 OR IPR018306/PF10544IPR018929 (DUF2510)with lower confidence due weaker direct membrane evidence.
Suggested architecture-aware implementation note:
IPR025280 + (PF13455 OR IPR018306/PF10544) + (N-term TM/signal peptide OR membrane-associated accessory domain such as IPR007829/IPR039519; optional lower-confidence IPR018929).
IPR029330)IPR025280)id: A0A8T9CRB7
gene_symbol: SNIPE
product_type: PROTEIN
status: DRAFT
taxon:
id: NCBITaxon:562
label: Escherichia coli
description: >-
SNIPE (Surface-associated Nuclease Inhibiting Phage Entry) is a 500 amino acid inner
membrane-bound
GIY-YIG family DNA endonuclease that provides direct anti-bacteriophage defence
in Escherichia coli.
SNIPE constitutively localizes to the inner membrane via an N-terminal single-pass
transmembrane helix
(aa 5-24) and pre-associates with the ManYZ mannose permease complex before phage
infection. During
phage genome injection, the central DUF4041 domain (aa 144-262, now designated the
SNIPE-associated
domain, IPR025280) binds both incoming phage DNA and the phage tape measure protein
(TMP), positioning
the C-terminal GIY-YIG nuclease domain (aa 357-450, catalytic residue E414) to cleave
phage DNA as it
crosses the inner membrane. This provides direct defence β the infected cell survives
β in contrast
to abortive infection systems. SNIPE does not affect phage adsorption, does not
target pre-existing
intracellular phage genomes (lysogens), and does not block plasmid transformation,
indicating its
activity is specific to the phage genome injection process. Originally identified
as PD-lambda-1 in
a functional screen of 71 diverse E. coli pangenomes (PMID:36123438), SNIPE homologues
(~500 curated)
are found across many bacterial phyla, with 33% of well-sequenced clades harbouring
at least one
homologue. The N-terminal region is highly variable and functions as a phage-specificity
adapter, with
59% of homologues having one TM domain, 7% having two, and 34% using alternative
membrane-targeting
strategies (DivIVA domains, type III secretion domains, PH domains). SNIPE represents
a previously
unknown self/non-self discrimination strategy based on subcellular localization
rather than sequence
recognition (CRISPR-Cas) or modification detection (restriction-modification).
Note: UniProt entry A0A8T9CRB7 is from E. coli strain T0181B.E-10. The experimentally
characterized
SNIPE was from strain MOD1-ECOR26 (GenBank RCP76574.1, locus APT27_20780). Both
share RefSeq
WP_001606968.1. SNIPE is not present in E. coli K-12 MG1655.
tags:
- anti-phage defence
- nuclease
- membrane protein
- innate immunity
- phage-host interaction
references:
- id: PMID:41741653
title: A membrane-bound nuclease directly cleaves phage DNA during genome injection
findings:
- statement: >-
SNIPE is a membrane-bound nuclease that directly cleaves phage DNA during
genome
injection, providing direct defence against bacteriophage
supporting_text: >-
we demonstrate that SNIPE directly cleaves phage DNA during genome injection
- statement: >-
SNIPE constitutively localizes to the bacterial cell membrane
supporting_text: >-
we characterize SNIPE, an anti-bacteriophage defence system that constitutively
localizes to the bacterial cell membrane in Escherichia coli to block phage
- statement: >-
SNIPE pre-associates with host proteins for lambda genome entry and with
the tape
measure protein during genome injection
supporting_text: >-
SNIPE associates with host proteins essential for Ξ» genome entry and with
the Ξ»
tape measure protein, which facilitates Ξ» genome injection across the inner
membrane
- statement: >-
Radiolabelled phage DNA tracking demonstrates SNIPE cleaves DNA during injection
supporting_text: >-
Using radiolabelled phage DNA and time-lapse microscopy to track phage genomes,
we
demonstrate that SNIPE directly cleaves phage DNA during genome injection
- statement: >-
SNIPE provides ManYZ-independent defence against siphoviruses via direct
TMP interaction
supporting_text: >-
SNIPE also defends against diverse siphoviruses, probably through direct
interactions
with their tape measure proteins
- statement: >-
SNIPE is a widespread defence system representing a novel self/non-self
discrimination strategy
supporting_text: >-
SNIPE as a widespread bacterial defence system that exploits the spatial
organization of
phage genome injection to specifically target viral DNA, representing a
previously unknown
strategy for distinguishing self from non-self in prokaryotic immune systems
- id: PMID:36123438
title: >-
A functional selection reveals previously undetected anti-phage defence systems
in the
E. coli pangenome
findings:
- statement: >-
SNIPE (originally PD-lambda-1) was identified in a functional metagenomic
screen of 71
diverse E. coli strains as a novel anti-phage defence system
supporting_text: >-
Our results unveil 21 conserved defence systems, none of which were previously
detected
as enriched in defence islands
- statement: >-
SNIPE is a membrane-anchored protein with DUF4041 and a GIY-YIG nuclease
domain
supporting_text: >-
a putative membrane-anchored protein with a central coiled-coil domain (DUF4041)
and a C-terminal DNA binding/cleavage domain
- statement: >-
Defence systems including SNIPE are carried primarily by prophages and mobile
genetic elements
supporting_text: >-
intact prophages and mobile genetic elements are primary reservoirs and
distributors
of defence systems in E. coli
- id: file:ECOLX/SNIPE/SNIPE-deep-research-falcon.md
title: Deep research on SNIPE/DUF4041 domain-containing protein (falcon)
findings:
- statement: >-
T5orf172/MUG113 (PF13455) is a GIY-YIG endonuclease subfamily; MUG113 profile
is
highly related to T5orf172, placing SNIPE's nuclease domain in the GIY-YIG
HEG space
supporting_text: >-
T5orf172 is a subfamily of GIY-YIG endonucleases ... the two profiles [T5orf172
and MUG113] are highly related
- statement: >-
T5orf172-family proteins have modular architecture with variable DNA-binding
modules
determining target specificity, paired with a conserved nuclease domain
supporting_text: >-
a nuclease domain paired with variable DNA-binding modules that determine
target
specificity
existing_annotations:
# No existing GO annotations for A0A8T9CRB7 β all annotations below are NEW proposals
# === Molecular Function ===
- term:
id: GO:0004520
label: DNA endonuclease activity
evidence_type: IDA
original_reference_id: PMID:41741653
review:
summary: >-
SNIPE contains a C-terminal GIY-YIG nuclease domain (aa 357-450) that endonucleolytically
cleaves phage DNA during genome injection. Substitution of the predicted catalytic
residue
E414 to alanine abolishes both nuclease activity and phage defence (PMID:41741653).
Radiolabelled 32P phage DNA is degraded from ~42 kb to fragments <100 bp and
mononucleotides in SNIPE-expressing cells, dependent on E414 (PMID:41741653).
The
pattern of degradation (smear of fragments rather than defined products) is
consistent
with endonucleolytic cleavage, though in vitro reconstitution has not yet
been performed.
The GIY-YIG nuclease domain is the most conserved region across ~500 SNIPE
homologues
(PMID:41741653), and domain-level homology to the T5orf172/MUG113 family of
GIY-YIG
endonucleases is confirmed by InterPro/Pfam assignments (IPR018306, PF13455).
GO:0004520
is the appropriate level of specificity; there is no GIY-YIG-specific child
term in GO,
and the data do not support a more specific term like GO:0015666 (restriction
endodeoxyribonuclease activity) since SNIPE does not recognize specific DNA
sequences
or modification states.
action: NEW
reason: >-
Core molecular function of SNIPE, directly demonstrated by multiple experimental
approaches in PMID:41741653: (1) 32P-labelled phage DNA cleavage assay showing
degradation during injection, (2) E414A catalytic dead mutant abolishing cleavage
and defence, (3) Gam-GFP double-strand break reporter showing SNIPE-dependent
foci,
(4) CFP-ParB foci reduced ~30-fold indicating DNA degradation during injection.
The domain architecture (GIY-YIG nuclease, IPR018306/PF13455) independently
supports
endonuclease activity via homology to biochemically characterized family members.
IDA evidence code is appropriate because the nuclease activity was directly
assayed
in vivo using radiolabelled substrate and catalytic mutant controls.
supported_by:
- reference_id: PMID:41741653
supporting_text: >-
we demonstrate that SNIPE directly cleaves phage DNA during genome injection
- reference_id: PMID:41741653
supporting_text: >-
Using radiolabelled phage DNA and time-lapse microscopy to track phage genomes,
we demonstrate that SNIPE directly cleaves phage DNA during genome injection
- reference_id: PMID:36123438
supporting_text: >-
a putative membrane-anchored protein with a central coiled-coil domain
(DUF4041)
and a C-terminal DNA binding/cleavage domain
- reference_id: file:ECOLX/SNIPE/SNIPE-deep-research-falcon.md
supporting_text: >-
T5orf172 is a subfamily of GIY-YIG endonucleases ... the two profiles
[T5orf172
and MUG113] are highly related
- term:
id: GO:0003690
label: double-stranded DNA binding
evidence_type: IDA
original_reference_id: PMID:41741653
review:
summary: >-
The DUF4041/SNIPE-associated domain (aa 144-262) of SNIPE has a positively
charged
surface that facilitates DNA binding. When the TM domain is removed
(SNIPE(deltaTM E414A)-GFP), the protein co-localizes with the DAPI-stained
nucleoid
(Pearson R=0.76), and this co-localization is abolished when DUF4041 is also
deleted
(deltaDUF4041 shows R=0.13) (PMID:41741653). A DUF4041-only fusion to GFP
also
localizes to the nucleoid. ConSurf analysis shows the positively charged DNA-binding
interface is conserved across SNIPE homologues (PMID:41741653). Note that
the evidence
for DNA binding is based on fluorescence microscopy co-localization with the
nucleoid
rather than a direct biochemical binding assay (e.g. EMSA); however, the domain-dependent
co-localization pattern and the conserved positively charged surface provide
strong
support for direct DNA binding. Phage lambda DNA is dsDNA, making GO:0003690
the
appropriate specific term. The original identification also noted the C-terminal
region
as a "DNA binding/cleavage domain" (PMID:36123438).
action: NEW
reason: >-
DNA binding by DUF4041 is demonstrated by microscopy showing domain-dependent
nucleoid
co-localization (PMID:41741653): removal of DUF4041 abolishes nucleoid association.
This is further supported by the conserved positively charged surface identified
by
ConSurf analysis across ~500 homologues, and by domain annotation as a DNA-binding
domain (IPR018306 Phage_T5_Orf172_DNA-bd). IDA evidence code is appropriate
for
microscopy-based localization experiments with domain deletion controls. The
term
GO:0003690 (double-stranded DNA binding) is preferred over GO:0003677 (DNA
binding)
because phage lambda DNA is dsDNA and the nucleoid is dsDNA.
supported_by:
- reference_id: PMID:36123438
supporting_text: >-
a putative membrane-anchored protein with a central coiled-coil domain
(DUF4041)
and a C-terminal DNA binding/cleavage domain
# === Biological Process ===
- term:
id: GO:0051607
label: defense response to virus
evidence_type: IDA
original_reference_id: PMID:41741653
review:
summary: >-
SNIPE provides direct defence against bacteriophage lambda and diverse siphoviruses.
SNIPE-expressing cells survive phage lambda infection at high MOI where control
cells
lyse (PMID:41741653). Defence is demonstrated by: time-lapse microscopy showing
cell
survival, growth curves at MOI 0.1-10, plaque assays showing 6-7 log reduction
in
phage titre, and the BASEL collection screen showing protection against diverse
siphoviruses (both ManYZ-dependent and ManYZ-independent) (PMID:41741653).
SNIPE
provides direct defence (cell survival) rather than abortive infection. It
was
originally identified as one of 21 novel defence systems in a functional metagenomic
screen of the E. coli pangenome (PMID:36123438).
action: NEW
reason: >-
Defence against bacteriophage is the primary evolved biological function of
SNIPE,
supported by multiple complementary experimental approaches (PMID:41741653)
and by
its original identification via a functional anti-phage defence screen (PMID:36123438).
GO:0051607 (defense response to virus) is the most specific existing BP term.
There
is currently no GO term for "defense response to bacteriophage" specifically;
a new
term request has been drafted (see projects/SNIPE/go-issue-antiviral-nucleic-acid-defense.md).
IDA evidence code is appropriate as defence was directly assayed by growth
curves,
plaque assays, and single-cell microscopy.
supported_by:
- reference_id: PMID:41741653
supporting_text: >-
we characterize SNIPE, an anti-bacteriophage defence system that constitutively
localizes to the bacterial cell membrane in Escherichia coli to block
phage
- reference_id: PMID:41741653
supporting_text: >-
SNIPE as a widespread bacterial defence system that exploits the spatial
organization
of phage genome injection to specifically target viral DNA, representing
a previously
unknown strategy for distinguishing self from non-self in prokaryotic
immune systems
- reference_id: PMID:36123438
supporting_text: >-
Our results unveil 21 conserved defence systems, none of which were previously
detected as enriched in defence islands
- term:
id: GO:0045071
label: negative regulation of viral genome replication
evidence_type: IDA
original_reference_id: PMID:41741653
review:
summary: >-
SNIPE prevents phage genome replication by destroying the phage DNA during
the injection
process, before the genome enters the cytoplasm intact. CFP-ParB foci (a proxy
for
phage genome establishment) are reduced ~30-fold in SNIPE-expressing cells
(PMID:41741653). Phage produced per cell is not significantly different between
SNIPE
and empty vector when inducing lysogens, confirming that SNIPE targets the
injection
step specifically and does not affect pre-existing intracellular phage genomes
(PMID:41741653). While the net effect is prevention of viral genome replication,
the
mechanism is more precisely described as destruction of the phage genome during
membrane translocation rather than regulation of replication per se. GO:0046597
(host-mediated suppression of symbiont invasion) may be a more mechanistically
precise
BP term since SNIPE blocks the entry/establishment of the phage genome rather
than
regulating its replication after entry.
action: NEW
reason: >-
The outcome of SNIPE activity is that viral genome replication is completely
prevented,
making GO:0045071 a defensible annotation. The evidence is strong: CFP-ParB
foci
reduction demonstrates that phage genomes do not establish in SNIPE-expressing
cells
(PMID:41741653). However, SNIPE acts upstream of replication -- it destroys
the DNA
during injection, not by regulating the replication machinery. This distinction
may
matter for precision. The annotation is retained because the net biological
outcome
(no viral genome replication) is accurately captured. See also the additional
GO:0046597 annotation below for a more mechanistically precise complementary
term.
IDA evidence code is appropriate.
supported_by:
- reference_id: PMID:41741653
supporting_text: >-
we demonstrate that SNIPE directly cleaves phage DNA during genome injection
- reference_id: PMID:41741653
supporting_text: >-
SNIPE associates with host proteins essential for lambda genome entry
and with the
lambda tape measure protein, which facilitates lambda genome injection
across the
inner membrane
- term:
id: GO:0046597
label: host-mediated suppression of symbiont invasion
evidence_type: IDA
original_reference_id: PMID:41741653
review:
summary: >-
SNIPE inhibits phage genome entry into the bacterial cell by cleaving phage
DNA
during the injection process at the inner membrane. This directly suppresses
the
phage's invasion of the host cell. GO:0046597 is defined as "a process in
which a
host inhibits or disrupts the entry of a symbiont into a host cell." While
bacteriophages inject their genome rather than entering bodily, the phage
genome
translocation across the inner membrane constitutes the functional "entry"
event that
SNIPE suppresses. SNIPE does not block phage adsorption to the cell surface
-- phage
adsorption is unaffected (PMID:41741653) -- but it destroys the phage DNA
during
membrane translocation, preventing genome establishment. This term complements
GO:0045071 by providing a more mechanistically precise description of SNIPE's
biological role.
action: NEW
reason: >-
SNIPE acts specifically at the step of phage genome injection across the inner
membrane (PMID:41741653). It does not affect phage adsorption, does not target
pre-existing lysogens, and does not block plasmid transformation -- it is
specific
to the injection process. GO:0046597 (host-mediated suppression of symbiont
invasion)
captures this specificity better than GO:0045071 (negative regulation of viral
genome
replication), which implies action on the replication process itself. GO:0046597
is
the parent of several relevant child terms in GO and is compatible with bacterial
hosts defending against phage. IDA evidence code is appropriate.
supported_by:
- reference_id: PMID:41741653
supporting_text: >-
we characterize SNIPE, an anti-bacteriophage defence system that constitutively
localizes to the bacterial cell membrane in Escherichia coli to block
phage
- reference_id: PMID:36123438
supporting_text: >-
We confirmed that each system did not affect phage adsorption (Extended Data
Fig
- term:
id: GO:0006308
label: DNA catabolic process
evidence_type: IDA
original_reference_id: PMID:41741653
review:
summary: >-
SNIPE degrades phage DNA from ~42 kb to fragments less than 100 bp and mononucleotides
during genome injection (PMID:41741653). This is a DNA catabolic process --
the
breakdown of DNA. The 32P-labelled DNA tracking experiment directly demonstrates
catabolism of the phage DNA substrate. While the MF annotation (GO:0004520
DNA
endonuclease activity) captures the catalytic mechanism, GO:0006308 captures
the
biological process of DNA degradation that results from this activity.
action: NEW
reason: >-
The degradation of phage DNA from intact genomic molecules (~42 kb) to small
fragments
(<100 bp) and mononucleotides is directly demonstrated by the 32P-labelled
DNA
cleavage assay (PMID:41741653). This constitutes DNA catabolism. GO:0006308
provides
a process-level annotation that complements the molecular function (GO:0004520)
and
the higher-level biological context (GO:0051607 defense response to virus).
IDA
evidence code is appropriate as the catabolic activity was directly measured
using
radiolabelled substrate.
supported_by:
- reference_id: PMID:41741653
supporting_text: >-
we demonstrate that SNIPE directly cleaves phage DNA during genome injection
# === Cellular Component ===
- term:
id: GO:0005887
label: integral component of plasma membrane
evidence_type: IDA
original_reference_id: PMID:41741653
review:
summary: >-
SNIPE is an integral inner membrane protein with a single-pass TM helix (aa
5-24).
Topology was confirmed by PhoA-LacZalpha fusion analysis: PhoA was active
only when
inserted at the N-terminus (periplasmic side), and LacZalpha was active only
when
inserted downstream of the TM domain (cytoplasmic side) (PMID:41741653). SNIPE-GFP
localizes uniformly to the cell membrane by fluorescence microscopy (PMID:41741653).
Immunoblotting of fractionated cell lysates shows SNIPE is strongly enriched
in the
membrane fraction (PMID:41741653). DeepTMHMM and Phobius both predict a transmembrane
helix at residues 6-24, consistent with the UniProt annotation (ECO:0000256|SAM:Phobius).
In bacteria, GO:0005886 (plasma membrane) corresponds to the inner/cytoplasmic
membrane. The single-pass TM helix makes SNIPE an integral (not peripheral)
component,
supporting GO:0005887 over GO:0005886.
action: NEW
reason: >-
Integral membrane localization is directly demonstrated by three independent
experimental approaches in PMID:41741653: (1) PhoA/LacZalpha topology mapping
confirming transmembrane orientation, (2) fluorescence microscopy of SNIPE-GFP
showing
uniform membrane localization, (3) membrane fractionation/immunoblotting showing
enrichment in membrane fraction. This is further supported by the predicted
TM helix
(Phobius, DeepTMHMM) in the UniProt record. GO:0005887 (integral component
of plasma
membrane) is preferred over GO:0005886 (plasma membrane) because the single-pass
TM
helix makes SNIPE an integral membrane protein. IDA evidence code is appropriate
for
the topology mapping and fractionation experiments.
supported_by:
- reference_id: PMID:41741653
supporting_text: >-
an anti-bacteriophage defence system that constitutively localizes to
the bacterial
cell membrane in Escherichia coli
- reference_id: PMID:41741653
supporting_text: >-
Here, we characterize SNIPE, an anti-bacteriophage defence system that constitutively
localizes to the bacterial cell membrane in Escherichia coli to block phage
Ξ» infection
core_functions:
- description: >-
SNIPE is a membrane-anchored DNA endonuclease that directly cleaves bacteriophage
DNA
during the genome injection process. It exploits the spatial organization of
phage genome
entry β the fact that phage DNA must cross the inner membrane β to distinguish
foreign
(viral) DNA from self (chromosomal) DNA. The GIY-YIG nuclease domain provides
the catalytic
endonuclease activity, with E414 as the essential catalytic residue. The DUF4041
domain
binds both phage DNA (via a conserved positively charged interface) and phage
tape measure
proteins (TMPs), positioning the nuclease at the site of genome injection. The
N-terminal
TM domain anchors SNIPE to the inner membrane, preventing autoimmune cleavage
of host
chromosomal DNA. SNIPE pre-associates with the ManYZ mannose permease complex,
which is
the inner membrane conduit for lambda genome injection, ensuring SNIPE is positioned
at
genome entry sites before infection occurs. This provides direct defence β the
infected
cell survives β in contrast to abortive infection systems that sacrifice the
infected cell.
SNIPE also provides ManYZ-independent defence against diverse siphoviruses,
probably through
direct interactions with their TMPs via the DUF4041 domain. The ~500 SNIPE homologues
across
bacterial phyla show conservation of the GIY-YIG nuclease and DUF4041 domains,
while the
N-terminal region is highly diversified and functions as a phage-specificity
adapter with
varied membrane-targeting strategies (TM helices, DivIVA domains, T3SS domains,
PH domains).
molecular_function:
id: GO:0004520
label: DNA endonuclease activity
directly_involved_in:
- id: GO:0051607
label: defense response to virus
- id: GO:0045071
label: negative regulation of viral genome replication
locations:
- id: GO:0005886
label: plasma membrane
supported_by:
- reference_id: PMID:41741653
supporting_text: >-
SNIPE as a widespread bacterial defence system that exploits the spatial
organization
of phage genome injection to specifically target viral DNA, representing
a previously
unknown strategy for distinguishing self from non-self in prokaryotic immune
systems
- reference_id: PMID:36123438
supporting_text: >-
A functional selection reveals previously undetected anti-phage defence
systems in
the E. coli pangenome
proposed_new_terms:
- proposed_name: SNIPE defense system
proposed_definition: >-
A defense response to a virus in which a membrane-anchored nuclease of the SNIPE
family cleaves viral DNA during genome injection across the cell membrane. SNIPE
proteins associate with inner membrane components at phage genome injection
sites,
bind phage tape measure proteins via their DUF4041 domain, and use a GIY-YIG
nuclease
domain to degrade incoming phage DNA before it enters the cytoplasm. Unlike
restriction-modification or CRISPR-Cas systems, self/non-self discrimination
is
achieved through subcellular localization at the membrane rather than recognition
of
specific DNA sequences or modifications.
justification: >-
The current anti-foreign nucleic acid branch (GO:0099046 clearance of foreign
intracellular nucleic acids) does not capture SNIPE mechanism, because SNIPE
cleaves
phage DNA during translocation at the membrane before the DNA becomes intracellular.
A dedicated term under GO:0051607 (defense response to virus) is needed to represent
this direct anti-phage mechanism.
proposed_parent:
id: GO:0051607
label: defense response to virus
supported_by:
- reference_id: PMID:41741653
supporting_text: >-
we demonstrate that SNIPE directly cleaves phage DNA during genome injection
- reference_id: PMID:36123438
supporting_text: >-
A functional selection reveals previously undetected anti-phage defence
systems
in the E. coli pangenome
- proposed_name: antiviral defense by targeting viral nucleic acid
proposed_definition: >-
A defense response to a virus in which the host targets viral nucleic acid for
degradation or modification, preventing viral replication. Includes systems
that act
on nucleic acid after cell entry (e.g. CRISPR-Cas, restriction-modification)
and
systems that act during genome injection (e.g. SNIPE).
justification: >-
Optional intermediate grouping term matching the GO issue proposal. This would
provide
a clean parent for nucleic acid-targeting anti-viral systems such as CRISPR-Cas,
restriction-modification, and SNIPE while preserving mechanistic distinctions
in child terms.
proposed_parent:
id: GO:0051607
label: defense response to virus
supported_by:
- reference_id: PMID:37460672
supporting_text: >-
The highly diverse antiphage defence systems of bacteria
suggested_questions:
- question: >-
Is GO:0004520 (DNA endonuclease activity) the correct MF term for SNIPE, or
should
a more specific child term (e.g. reflecting GIY-YIG family membership or the
membrane-localized context) be used?
experts: []
- question: >-
Should the ManYZ pre-association be annotated as a separate protein complex,
or is it
better modelled as a GO-CAM causal association (SNIPE located_in plasma membrane,
colocalizes_with ManYZ complex)?
experts: []
- question: >-
Does the tape measure protein (TMP) binding by DUF4041 warrant a specific MF
annotation
(e.g. viral protein binding) or is this better captured only in the GO-CAM model?
experts: []
suggested_experiments:
- hypothesis: >-
SNIPE cleaves phage DNA endonucleolytically (producing internal cuts) rather
than
exonucleolytically
description: >-
The 32P cleavage assay (Fig 2d) shows a smear of fragments consistent with
endonucleolytic cleavage, but the exact cleavage mechanism and products have
not been
biochemically characterized with purified SNIPE. In vitro reconstitution of
SNIPE
nuclease activity with defined DNA substrates would confirm endonuclease vs
exonuclease
activity, determine sequence specificity (if any), and characterize cleavage
products.
experiment_type: Biochemical assay
- hypothesis: >-
SNIPE homologues with non-TM N-terminal regions (DivIVA, T3SS, PH domain) use
analogous mechanisms but target different phage entry pathways
description: >-
Test representative SNIPE homologues lacking TM domains for phage defence in
their
native hosts or heterologous expression. Determine whether these homologues
also
pre-associate with inner membrane proteins at genome injection sites, and whether
their N-terminal domains dictate phage specificity.
experiment_type: Genetics/Functional assay