SET1

UniProt ID: P38827
Organism: Saccharomyces cerevisiae
Review Status: COMPLETE
📝 Provide Detailed Feedback

Gene Description

SET1 is the catalytic component of the COMPASS complex, responsible for histone H3 lysine 4 (H3K4) methylation. SET1 catalyzes mono-, di-, and trimethylation of H3K4, a fundamental histone mark associated with actively transcribed genes and gene regulatory regions. SET1 also participates in DNA damage response, meiotic recombination, and telomere maintenance through both methylation-dependent and independent mechanisms.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0042800 histone H3K4 methyltransferase activity
IBA
GO_REF:0000033
ACCEPT
Summary: SET1 catalyzes histone H3 lysine 4 methylation as the core enzymatic function of the COMPASS complex. IBA annotation based on phylogenetic inference across SET1/MLL family. Extensive direct biochemical evidence establishes this as the primary function.
Reason: Core SET1 function. IBA evidence is appropriate for this well-conserved family function. Deep research confirms SET1 is the sole H3K4 methyltransferase in yeast [evidence from PMID:11742990, 11751634, 11752412, 11805083].
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
PMID:11805083
COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression.
file:yeast/SET1/SET1-deep-research-perplexity.md
provider: perplexity
GO:0048188 Set1C/COMPASS complex
IBA
GO_REF:0000033
ACCEPT
Summary: SET1 is the catalytic core component of the COMPASS (Complex of Proteins Associated with Set1) complex. SET1 is part of an eight-subunit complex essential for H3K4 methylation. IBA annotation reflects phylogenetic conservation of this complex organization.
Reason: Core structural annotation. SET1 functions exclusively as component of COMPASS complex. IBA evidence appropriate for this conserved complex membership. Deep research establishes COMPASS as multi-subunit assembly with SET1 at catalytic core [PMID:11742990, 11687631].
Supporting Evidence:
PMID:11687631
COMPASS: a complex of proteins associated with a trithorax-related SET domain protein.
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
GO:0005634 nucleus
IEA
GO_REF:0000044
KEEP AS NON CORE
Summary: SET1 localizes to the nucleus where COMPASS complex catalyzes H3K4 methylation on chromatin. IEA from UniProt subcellular location annotation.
Reason: Correct localization but not informative about SET1 function. Localization without functional consequence should be marked non-core. SET1 functions exclusively in nucleus.
GO:0005694 chromosome
IEA
GO_REF:0000044
KEEP AS NON CORE
Summary: SET1 associates with chromatin/chromosomes as component of COMPASS complex catalyzing H3K4 methylation.
Reason: While true, this is vague and less informative than specific process or function annotations. Chromosome association is implied by core methyltransferase function.
GO:0006325 chromatin organization
IEA
GO_REF:0000043
ACCEPT
Summary: H3K4 methylation by SET1/COMPASS contributes to chromatin organization and regulation of chromatin structure.
Reason: SET1 directly contributes to chromatin organization through H3K4 methylation, which affects nucleosome positioning, histone-DNA interactions, and chromatin state. IEA from keyword annotation is appropriate.
GO:0008168 methyltransferase activity
IEA
GO_REF:0000120
KEEP AS NON CORE
Summary: SET1 catalyzes methyl group transfer from SAM to histone H3 lysine 4, a methyltransferase activity.
Reason: True but overly broad. GO:0042800 (H3K4 methyltransferase activity) is more specific and informative. This parent term is less useful when specific substrate-level annotation available.
GO:0016740 transferase activity
IEA
GO_REF:0000043
KEEP AS NON CORE
Summary: SET1 catalyzes transferase activity (methyl group transfer to histone substrate).
Reason: Extremely broad parent term. SET1 is a methyltransferase, which is a type of transferase. Parent term is less informative when more specific molecular function available.
GO:0032259 methylation
IEA
GO_REF:0000043
KEEP AS NON CORE
Summary: SET1 catalyzes methylation of histone H3 lysine 4.
Reason: General process annotation. While accurate, more specific methyltransferase activity annotations (GO:0008168, GO:0042800) are more informative. Parent term less useful.
GO:0042800 histone H3K4 methyltransferase activity
IEA
GO_REF:0000002
ACCEPT
Summary: SET1 is the H3K4-specific methyltransferase, catalyzing mono-, di-, and trimethylation of histone H3 lysine 4. IEA from InterPro domain annotation.
Reason: Core molecular function annotation. IEA from InterPro recognition of SET domain is appropriate. Extensive literature confirms this is SET1 primary function. Duplicate of annotation 1 with different evidence source.
GO:0045944 positive regulation of transcription by RNA polymerase II
IEA
GO_REF:0000117
ACCEPT
Summary: SET1-mediated H3K4 methylation marks and promotes active transcription, functioning in positive regulation of Pol II transcription.
Reason: SET1 function extends beyond catalytic activity to transcriptional regulation. H3K4me3 marks active promoters and is required for efficient transcription initiation. Deep research [PMID:27325136] confirms SET1 involvement in transcription regulation. However, SET1 also has roles in transcriptional repression, making this annotation incomplete.
Supporting Evidence:
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate chromatin dynamics and gene transcription.
GO:0048188 Set1C/COMPASS complex
IEA
GO_REF:0000002
ACCEPT
Summary: SET1 is component of Set1C/COMPASS complex. IEA from InterPro domain recognition.
Reason: Core structural annotation. Duplicate of annotation 2 with different evidence. InterPro detection of SET domain-containing protein appropriately infers COMPASS complex membership.
GO:0140999 histone H3K4 trimethyltransferase activity
IEA
GO_REF:0000120
ACCEPT
Summary: SET1 catalyzes trimethylation of histone H3 lysine 4 (H3K4me3). IEA from RHEA and EC number annotation.
Reason: SET1 catalyzes all methylation states (me1, me2, me3) of H3K4. This annotation for trimethylation specifically is appropriate. Deep research confirms SET1 capacity for complete trimethylation [PMID:12845608, 22158900].
Supporting Evidence:
PMID:12845608
Saccharomyces cerevisiae Set1p is a methyltransferase specific for lysine 4 of histone H3 and is required for efficient gene expression.
PMID:22158900
Structural analysis of the core COMPASS family of histone H3K4 methylases from yeast to human.
GO:0005515 protein binding
IPI
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 h...
KEEP AS NON CORE
Summary: SET1 protein-protein interactions with COMPASS complex subunits (Bre2/Ash2, Swd1, Swd2, Swd3) and other proteins documented through co-immunoprecipitation and mass spectrometry. Multiple IPI annotations reflect experimentally validated interactions.
Reason: While interactions are real and well-documented (IPI evidence is appropriate), generic "protein binding" is not informative for SET1 function. These interactions are with known COMPASS subunits (Bre2, Swd1, Swd2, Swd3, Bre5, Chd1, Swd7) forming the complex. Specific complex membership (GO:0048188) is more informative than generic binding. COMPASS complex membership captures the functional significance.
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
GO:0005515 protein binding
IPI
PMID:11805837
Systematic identification of protein complexes in Saccharomy...
KEEP AS NON CORE
Summary: SET1 interactions documented via mass spectrometry-based proteome survey.
Reason: Generic binding annotation. Specific COMPASS complex membership is more informative.
Supporting Evidence:
PMID:11805837
Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry.
GO:0005515 protein binding
IPI
PMID:16429126
Proteome survey reveals modularity of the yeast cell machine...
KEEP AS NON CORE
Summary: SET1 protein interactions identified in proteome survey.
Reason: Generic binding annotation. Specific complex membership captures functional relationships.
Supporting Evidence:
PMID:16429126
Proteome survey reveals modularity of the yeast cell machinery.
GO:0005515 protein binding
IPI
PMID:16554755
Global landscape of protein complexes in the yeast Saccharom...
KEEP AS NON CORE
Summary: SET1 protein interactions from global landscape of yeast protein complexes study.
Reason: Generic binding annotation. Duplicate interactions captured more informatively by COMPASS complex membership.
Supporting Evidence:
PMID:16554755
Global landscape of protein complexes in the yeast Saccharomyces cerevisiae.
GO:0005515 protein binding
IPI
PMID:18083099
Histone crosstalk between H2B monoubiquitination and H3 meth...
KEEP AS NON CORE
Summary: SET1 interactions in context of histone crosstalk between H2B ubiquitination and H3 methylation via COMPASS.
Reason: Generic binding annotation despite interesting functional context. COMPASS complex membership is more specific and functional.
Supporting Evidence:
PMID:18083099
Histone crosstalk between H2B monoubiquitination and H3 methylation mediated by COMPASS.
GO:0005515 protein binding
IPI
PMID:19713935
Cotranslational assembly of the yeast SET1C histone methyltr...
KEEP AS NON CORE
Summary: SET1 protein interactions from cotranslational assembly studies of SET1C complex.
Reason: Generic binding annotation. Complex assembly dynamics captured by COMPASS membership annotation.
Supporting Evidence:
PMID:19713935
Cotranslational assembly of the yeast SET1C histone methyltransferase complex.
GO:0005515 protein binding
IPI
PMID:21179020
Defining the budding yeast chromatin-associated interactome.
KEEP AS NON CORE
Summary: SET1 interactions from budding yeast chromatin-associated interactome study.
Reason: Generic binding annotation. COMPASS complex membership more informative.
Supporting Evidence:
PMID:21179020
Defining the budding yeast chromatin-associated interactome.
GO:0005515 protein binding
IPI
PMID:37968396
The social and structural architecture of the yeast protein ...
KEEP AS NON CORE
Summary: SET1 interactions from recent social and structural architecture of yeast protein interactome study.
Reason: Generic binding annotation. Specific complex membership is more informative for understanding SET1 function.
Supporting Evidence:
PMID:37968396
Nov 15. The social and structural architecture of the yeast protein interactome.
GO:0005515 protein binding
IPI
PMID:9988274
Interaction between Set1p and checkpoint protein Mec3p in DN...
KEEP AS NON CORE
Summary: SET1 interaction with Mec3 checkpoint protein documented. SET1 involvement in DNA repair and checkpoint functions.
Reason: While this Mec3 interaction is functionally distinct (DNA repair vs. COMPASS), generic "protein binding" is still not informative. The functional relationship (DNA repair, checkpoint) would be better captured by process annotations.
Supporting Evidence:
PMID:9988274
Interaction between Set1p and checkpoint protein Mec3p in DNA repair and telomere functions.
GO:0005634 nucleus
NAS
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 h...
KEEP AS NON CORE
Summary: SET1 localizes to the nucleus as established by ComplexPortal NAS (Non-Annotated Sequence) curation.
Reason: Correct but not informative about function. Localization without functional consequence. SET1 functions in nucleus through H3K4 methylation which is already annotated.
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
GO:0031509 subtelomeric heterochromatin formation
NAS
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 h...
ACCEPT
Summary: SET1 has documented roles in subtelomeric heterochromatin formation and silencing of genes near chromosome ends.
Reason: SET1 contributes to subtelomeric heterochromatin formation through H3K4me3-dependent mechanisms. Deep research confirms role in telomeric gene silencing [PMID:11805083]. NAS evidence from ComplexPortal is appropriate. This is documented functional role.
Supporting Evidence:
PMID:11805083
COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression.
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
GO:0003723 RNA binding
IDA
PMID:29071121
Binding to RNA regulates Set1 function.
KEEP AS NON CORE
Summary: SET1 contains an RNA recognition motif (RRM) at its N-terminus and binds RNA. IDA evidence from direct biochemical studies of SET1 RNA binding.
Reason: SET1 does bind RNA through its RRM domain (demonstrated PMID:29071121), but the functional role of this interaction is unclear. Deep research indicates RRM functions primarily in protein stability rather than direct catalytic relevance to H3K4 methylation. RNA binding is less central than methyltransferase activity.
Supporting Evidence:
PMID:29071121
Binding to RNA regulates Set1 function.
GO:0048188 Set1C/COMPASS complex
IDA
PMID:29071121
Binding to RNA regulates Set1 function.
ACCEPT
Summary: SET1 is demonstrated to be component of Set1C/COMPASS complex through direct biochemical analysis and RNA binding studies.
Reason: Core structural annotation. IDA evidence from direct protein interaction studies appropriate. Duplicate annotation with different evidence from IBA (phylogenetic) and IEA (InterPro). This IDA evidence from direct studies is strong.
Supporting Evidence:
PMID:29071121
Binding to RNA regulates Set1 function.
GO:0006357 regulation of transcription by RNA polymerase II
IMP
PMID:16959218
Different roles of histone H3 lysine 4 methylation in chroma...
ACCEPT
Summary: SET1 regulates transcription by RNA Pol II through H3K4 methylation. IMP evidence from set1 mutant studies showing defects in transcription regulation.
Reason: SET1 is essential for normal transcription regulation through H3K4 methylation. Deep research confirms H3K4me3 marks active promoters and affects transcription initiation and elongation [PMID:27325136]. IMP evidence appropriate.
Supporting Evidence:
PMID:16959218
Different roles of histone H3 lysine 4 methylation in chromatin maintenance.
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate chromatin dynamics and gene transcription.
GO:0006357 regulation of transcription by RNA polymerase II
IGI
PMID:16959218
Different roles of histone H3 lysine 4 methylation in chroma...
ACCEPT
Summary: SET1 genetic interaction with other transcription regulation genes. IGI evidence from genetic interaction studies.
Reason: Genetic interactions with transcription factors and regulators confirm SET1 role in transcription regulation. IMP and IGI both support this annotation from same reference.
Supporting Evidence:
PMID:16959218
Different roles of histone H3 lysine 4 methylation in chromatin maintenance.
GO:0033554 cellular response to stress
IMP
PMID:16959218
Different roles of histone H3 lysine 4 methylation in chroma...
ACCEPT
Summary: SET1 contributes to cellular stress response through H3K4 methylation-dependent gene expression changes. IMP evidence from mutant analysis.
Reason: SET1 regulates genes involved in stress response. Deep research shows SET1 effects on stress-responsive gene expression. IMP evidence from set1 deletion/mutation studies.
Supporting Evidence:
PMID:16959218
Different roles of histone H3 lysine 4 methylation in chromatin maintenance.
GO:0033554 cellular response to stress
IGI
PMID:16959218
Different roles of histone H3 lysine 4 methylation in chroma...
ACCEPT
Summary: Genetic interaction showing SET1 involvement in stress response pathways.
Reason: Genetic interaction data support stress response role. Duplicate from same reference with complementary evidence type.
Supporting Evidence:
PMID:16959218
Different roles of histone H3 lysine 4 methylation in chromatin maintenance.
GO:0008270 zinc ion binding
RCA
PMID:30358795
The cellular economy of the Saccharomyces cerevisiae zinc pr...
KEEP AS NON CORE
Summary: SET1 contains zinc-binding motifs characteristic of SET domain proteins. RCA (Rule-based Computational Annotation) inferred from protein structure.
Reason: SET1 SET domain contains zinc-coordinating residues (part of catalytic pocket), but zinc binding is not the primary function - it is structural support for methyltransferase activity. RCA from structural features is appropriate, but zinc binding is a cofactor interaction rather than independent function. Less informative than methyltransferase activity annotation.
Supporting Evidence:
PMID:30358795
The cellular economy of the Saccharomyces cerevisiae zinc proteome.
GO:0031509 subtelomeric heterochromatin formation
IMP
PMID:11752412
A trithorax-group complex purified from Saccharomyces cerevi...
ACCEPT
Summary: SET1 is required for proper methylation of histone H3 and formation of subtelomeric heterochromatin. Multiple IMP annotations confirm this functional role.
Reason: SET1 is essential for subtelomeric heterochromatin formation and silencing through H3K4 methylation-dependent mechanisms. IMP evidence from set1 mutant studies. Core functional role well-documented [PMID:11752412, 11805083, 9398665].
Supporting Evidence:
PMID:11752412
A trithorax-group complex purified from Saccharomyces cerevisiae is required for methylation of histone H3.
GO:0031509 subtelomeric heterochromatin formation
IMP
PMID:11805083
COMPASS, a histone H3 (Lysine 4) methyltransferase required ...
ACCEPT
Summary: SET1 methylation activity required for telomeric silencing and subtelomeric heterochromatin.
Reason: Duplicate annotation with different reference but same functional role. IMP evidence appropriate.
Supporting Evidence:
PMID:11805083
COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression.
GO:0031509 subtelomeric heterochromatin formation
IMP
PMID:9398665
SET1, a yeast member of the trithorax family, functions in t...
ACCEPT
Summary: SET1, a yeast member of the trithorax family, functions in transcriptional silencing and subtelomeric heterochromatin.
Reason: Foundational characterization of SET1 in heterochromatin. IMP evidence from early functional studies.
Supporting Evidence:
PMID:9398665
SET1, a yeast member of the trithorax family, functions in transcriptional silencing and diverse cellular processes.
GO:0031509 subtelomeric heterochromatin formation
IMP
PMID:9988274
Interaction between Set1p and checkpoint protein Mec3p in DN...
ACCEPT
Summary: SET1 involvement in heterochromatin formation and telomere functions.
Reason: Additional IMP evidence. Multiple independent studies converge on SET1 role in subtelomeric heterochromatin formation.
Supporting Evidence:
PMID:9988274
Interaction between Set1p and checkpoint protein Mec3p in DNA repair and telomere functions.
GO:0003723 RNA binding
IDA
PMID:28483910
RNA Binding by Histone Methyltransferases Set1 and Set2.
KEEP AS NON CORE
Summary: Direct experimental evidence for SET1 RNA binding capacity. IDA from PMID:28483910 RNA Binding by Histone Methyltransferases Set1 and Set2.
Reason: While RNA binding is demonstrated (IDA evidence appropriate), the functional role in H3K4 methylation is unclear. RRM domain appears to function in protein stability/regulation rather than direct catalytic function. Less central than methyltransferase activity.
Supporting Evidence:
PMID:28483910
RNA Binding by Histone Methyltransferases Set1 and Set2.
GO:0042138 meiotic DNA double-strand break formation
IMP
PMID:32290544
Genetic Interactions of Histone Modification Machinery Set1 ...
ACCEPT
Summary: SET1 contributes to meiotic DNA double-strand break (DSB) formation. IMP evidence from genetic analysis of set1 in meiosis.
Reason: SET1 has specialized functions in meiotic recombination beyond transcription. Deep research shows SET1 targeting meiotic DSB sites and requirement for proper DSB formation [PMID:32290544]. Both catalytic and non-catalytic SET1 functions contribute to meiosis.
Supporting Evidence:
PMID:32290544
Genetic Interactions of Histone Modification Machinery Set1 and PAF1C with the Recombination Complex Rec114-Mer2-Mei4 in the Formation of Meiotic DNA Double-Strand Breaks.
GO:0000723 telomere maintenance
IMP
PMID:27911222
The histone methyltransferases Set5 and Set1 have overlappin...
ACCEPT
Summary: SET1 is involved in telomere maintenance through H3K4 methylation-dependent and independent mechanisms.
Reason: SET1 contributes to telomere length maintenance. Deep research documents SET1 roles in both H3K4me3-dependent telomeric silencing and H3K4me3-independent telomere length regulation [PMID:27911222]. IMP evidence from set1 mutant studies.
Supporting Evidence:
PMID:27911222
The histone methyltransferases Set5 and Set1 have overlapping functions in gene silencing and telomere maintenance.
GO:0000781 chromosome, telomeric region
IGI
PMID:9214638
Mammalian homologues of the Polycomb-group gene Enhancer of ...
KEEP AS NON CORE
Summary: SET1 genetic interaction suggesting localization/function at telomeric regions. IGI evidence from genetic interaction studies.
Reason: Localization annotation without clear functional context. SET1 association with telomeric regions is implied by telomere maintenance and subtelomeric heterochromatin annotations. More specific process annotations are more informative.
Supporting Evidence:
PMID:9214638
Mammalian homologues of the Polycomb-group gene Enhancer of zeste mediate gene silencing in Drosophila heterochromatin and at S.
GO:0031509 subtelomeric heterochromatin formation
IGI
PMID:9214638
Mammalian homologues of the Polycomb-group gene Enhancer of ...
ACCEPT
Summary: Genetic interaction indicating SET1 involvement in subtelomeric heterochromatin formation.
Reason: Genetic interaction evidence supports subtelomeric heterochromatin function. IGI complements IMP evidence from other references.
Supporting Evidence:
PMID:9214638
Mammalian homologues of the Polycomb-group gene Enhancer of zeste mediate gene silencing in Drosophila heterochromatin and at S.
GO:0010629 negative regulation of gene expression
IMP
PMID:29066473
Repression of Middle Sporulation Genes in Saccharomyces cere...
ACCEPT
Summary: SET1 involvement in repression of middle sporulation genes through H3K4 methylation maintenance.
Reason: SET1 has documented roles in repressing genes that should not be expressed. H3K4 methylation can mark both active and repressed genes depending on context. Deep research confirms SET1 contributes to negative regulation of specific gene sets [PMID:29066473].
Supporting Evidence:
PMID:29066473
Repression of Middle Sporulation Genes in Saccharomyces cerevisiae by the Sum1-Rfm1-Hst1 Complex Is Maintained by Set1 and H3K4 Methylation.
GO:0006479 protein methylation
IMP
PMID:23469257
Catalytic and functional roles of conserved amino acids in t...
KEEP AS NON CORE
Summary: SET1 catalyzes protein (histone) methylation as core enzymatic function. IMP evidence from catalytic studies.
Reason: Correct but overly broad. GO:0042800 (H3K4 methyltransferase activity) is more specific. Parent term less informative when specific substrate-level annotation available.
Supporting Evidence:
PMID:23469257
Catalytic and functional roles of conserved amino acids in the SET domain of the S.
GO:0000183 rDNA heterochromatin formation
IMP
PMID:23469257
Catalytic and functional roles of conserved amino acids in t...
ACCEPT
Summary: SET1 contributes to rDNA heterochromatin formation through H3K4 methylation-dependent mechanisms.
Reason: SET1 has specialized roles in rDNA locus silencing beyond general H3K4 methylation. IMP evidence from studies of SET domain catalytic function in rDNA silencing. Deep research confirms rDNA heterochromatin is SET1-dependent [PMID:23469257].
Supporting Evidence:
PMID:23469257
Catalytic and functional roles of conserved amino acids in the SET domain of the S. cerevisiae lysine methyltransferase Set1.
GO:0000122 negative regulation of transcription by RNA polymerase II
IMP
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-r...
ACCEPT
Summary: SET1-mediated H3K4 methylation can function in negative regulation of transcription at specific loci. IMP evidence from mutant analysis.
Reason: SET1 has dual roles in both positive and negative regulation depending on genomic context and partnering proteins. Deep research confirms Jhd2 demethylase opposition to Set1 in dynamic control [PMID:27325136]. Multiple regulatory roles appropriate.
Supporting Evidence:
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate chromatin dynamics and gene transcription.
GO:0045944 positive regulation of transcription by RNA polymerase II
IMP
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-r...
ACCEPT
Summary: SET1-mediated H3K4 methylation promotes positive regulation of Pol II transcription.
Reason: Primary transcriptional role of SET1. Both GO:0000122 (negative) and GO:0045944 (positive) from same reference indicate context-dependent function. IMP evidence appropriate for both.
Supporting Evidence:
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate chromatin dynamics and gene transcription.
GO:1902275 regulation of chromatin organization
IMP
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-r...
ACCEPT
Summary: SET1-mediated H3K4 methylation regulates chromatin organization state through coordinated interactions with chromatin modifiers.
Reason: SET1 is central to chromatin organization regulation through H3K4 methylation effects on nucleosome positioning, chromatin accessibility, and establishment of distinct chromatin domains. IMP evidence appropriate.
Supporting Evidence:
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate chromatin dynamics and gene transcription.
GO:1905088 positive regulation of synaptonemal complex assembly
IMP
PMID:24797370
Dot1-dependent histone H3K79 methylation promotes the format...
ACCEPT
Summary: SET1 contributes to synaptonemal complex assembly during meiosis. IMP evidence from meiotic genetics.
Reason: SET1 specialized meiotic function. Synaptonemal complex assembly is essential for proper meiotic recombination. SET1 H3K4 methylation promotes this process [PMID:24797370].
Supporting Evidence:
PMID:24797370
Dot1-dependent histone H3K79 methylation promotes the formation of meiotic double-strand breaks in the absence of histone H3K4 methylation in budding yeast.
GO:0007130 synaptonemal complex assembly
IMP
PMID:24797370
Dot1-dependent histone H3K79 methylation promotes the format...
ACCEPT
Summary: SET1 is required for synaptonemal complex assembly during meiosis.
Reason: SET1 essential for meiotic recombination through synaptonemal complex assembly. IMP evidence from meiotic mutant studies. Both positive regulation (GO:1905088) and assembly (GO:0007130) appropriately annotated.
Supporting Evidence:
PMID:24797370
eCollection 2014. Dot1-dependent histone H3K79 methylation promotes the formation of meiotic double-strand breaks in the absence of histone H3K4 methylation in budding yeast.
GO:1903341 regulation of meiotic DNA double-strand break formation
IMP
PMID:23246437
Spp1, a member of the Set1 Complex, promotes meiotic DSB for...
ACCEPT
Summary: SET1 contributes to regulation of meiotic DSB formation through Spp1-mediated recruitment to meiotic recombination hotspots.
Reason: SET1 has documented roles in regulating meiotic DSB formation through Spp1 complex component. Deep research shows SET1 recruits H3K4me3 to chromosome axis sites, promoting DSB formation [PMID:23246437]. Specialized meiotic function distinct from transcriptional role.
Supporting Evidence:
PMID:23246437
Spp1, a member of the Set1 Complex, promotes meiotic DSB formation in promoters by tethering histone H3K4 methylation sites to chromosome axes.
GO:0042800 histone H3K4 methyltransferase activity
IMP
PMID:11751634
Histone H3 lysine 4 methylation is mediated by Set1 and requ...
ACCEPT
Summary: SET1 H3K4 methyltransferase activity demonstrated by functional analysis. IMP evidence from studies of SET1 catalytic function.
Reason: Core molecular function. IMP evidence from direct biochemical characterization of SET1 catalytic activity [PMID:11751634]. Multiple IMP annotations with different references converge on primary function.
Supporting Evidence:
PMID:11751634
Histone H3 lysine 4 methylation is mediated by Set1 and required for cell growth and rDNA silencing in Saccharomyces cerevisiae.
GO:0042800 histone H3K4 methyltransferase activity
IMP
PMID:22158900
Structural analysis of the core COMPASS family of histone H3...
ACCEPT
Summary: SET1 H3K4 methyltransferase activity confirmed through structural analysis studies.
Reason: Duplicate with different structural biology reference. Multiple independent studies support H3K4 methyltransferase function. IMP evidence appropriate.
Supporting Evidence:
PMID:22158900
Structural analysis of the core COMPASS family of histone H3K4 methylases from yeast to human.
GO:0055092 sterol homeostasis
IMP
PMID:23382196
H3K4 methyltransferase Set1 is involved in maintenance of er...
MARK AS OVER ANNOTATED
Summary: SET1 is involved in maintenance of ergosterol homeostasis and resistance to Brefeldin A. IMP evidence from genetic and functional studies.
Reason: This appears to be a pleiotropic effect rather than core function. SET1 H3K4 methylation regulates genes involved in sterol metabolism, but sterol homeostasis is not a primary functional role. The annotation is not incorrect, but represents a downstream consequence of transcriptional regulation rather than direct SET1 function. Keep annotation but mark as non-central to SET1 biology.
Supporting Evidence:
PMID:23382196
H3K4 methyltransferase Set1 is involved in maintenance of ergosterol homeostasis and resistance to Brefeldin A.
GO:0000723 telomere maintenance
IMP
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 h...
ACCEPT
Summary: SET1 is required for telomere maintenance. IMP evidence from foundational studies.
Reason: SET1 contributes to telomere length maintenance and integrity. Supported by multiple references [PMID:11742990, 9398665, 27911222]. IMP evidence from set1 mutant studies showing shortened telomeres.
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
GO:0000723 telomere maintenance
IMP
PMID:9398665
SET1, a yeast member of the trithorax family, functions in t...
ACCEPT
Summary: SET1 involvement in telomere maintenance from early functional characterization.
Reason: Duplicate from foundational SET1 paper. Multiple references converge on telomere maintenance role.
Supporting Evidence:
PMID:9398665
SET1, a yeast member of the trithorax family, functions in transcriptional silencing and diverse cellular processes.
GO:0003723 RNA binding
IDA
PMID:16787775
Structural characterization of Set1 RNA recognition motifs a...
KEEP AS NON CORE
Summary: Direct experimental evidence for SET1 RNA binding from structural characterization of RNA recognition motifs.
Reason: RNA binding demonstrated through structural studies and RRM characterization. IDA evidence appropriate. However, functional role unclear - appears to affect protein stability rather than catalytic activity. Less central than methyltransferase function.
Supporting Evidence:
PMID:16787775
2006 May 9. Structural characterization of Set1 RNA recognition motifs and their role in histone H3 lysine 4 methylation.
GO:0003723 RNA binding
IMP
PMID:16787775
Structural characterization of Set1 RNA recognition motifs a...
KEEP AS NON CORE
Summary: IMP evidence for RNA binding role in SET1 function and regulation.
Reason: Both IDA and IMP from same reference. Functional role of RNA binding remains unclear despite experimental evidence.
Supporting Evidence:
PMID:16787775
2006 May 9. Structural characterization of Set1 RNA recognition motifs and their role in histone H3 lysine 4 methylation.
GO:0016279 protein-lysine N-methyltransferase activity
IMP
PMID:16143104
The Set1 methyltransferase opposes Ipl1 aurora kinase functi...
ACCEPT
Summary: SET1 catalyzes protein-lysine methylation as a core enzymatic activity. IMP evidence from functional studies.
Reason: SET1 is a protein-lysine N-methyltransferase (histone lysine methyltransferase). While less specific than GO:0042800 (H3K4 methyltransferase), this is appropriate parent term. Genetic interactions demonstrate methyltransferase activity role.
Supporting Evidence:
PMID:16143104
The Set1 methyltransferase opposes Ipl1 aurora kinase functions in chromosome segregation.
GO:0016279 protein-lysine N-methyltransferase activity
IGI
PMID:16143104
The Set1 methyltransferase opposes Ipl1 aurora kinase functi...
ACCEPT
Summary: Genetic interaction evidence for SET1 protein-lysine methyltransferase activity.
Reason: IGI complements IMP evidence. Genetic interactions support methyltransferase function.
Supporting Evidence:
PMID:16143104
The Set1 methyltransferase opposes Ipl1 aurora kinase functions in chromosome segregation.
GO:0030437 ascospore formation
IMP
PMID:17150765
Effect of histone methyltransferase gene mutations on sporul...
ACCEPT
Summary: SET1 is required for proper ascospore formation during sporulation. IMP evidence from set1 mutant analysis.
Reason: SET1 has documented role in sporulation and ascospore formation. IMP evidence from set1 deletion strains showing defective sporulation [PMID:17150765].
Supporting Evidence:
PMID:17150765
Effect of histone methyltransferase gene mutations on sporulation in S. cerevisiae.
GO:0030466 silent mating-type cassette heterochromatin formation
IMP
PMID:9398665
SET1, a yeast member of the trithorax family, functions in t...
ACCEPT
Summary: SET1 is required for maintenance of silent mating-type cassette heterochromatin. IMP evidence from early SET1 characterization.
Reason: SET1 contributes to silencing at the silent mating loci (HML and HMR). H3K4 methylation marks these silent heterochromatic regions [PMID:9398665]. IMP evidence from set1 mutant derepression phenotype.
Supporting Evidence:
PMID:9398665
SET1, a yeast member of the trithorax family, functions in transcriptional silencing and diverse cellular processes.
GO:0042800 histone H3K4 methyltransferase activity
IDA
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 h...
ACCEPT
Summary: SET1 H3K4 methyltransferase activity demonstrated through direct biochemical and cellular experiments.
Reason: Core function with multiple evidence types. IDA from foundational COMPASS discovery paper [PMID:11742990]. Extensive biochemical characterization of SET1 catalytic activity.
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
GO:0042800 histone H3K4 methyltransferase activity
IMP
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 h...
ACCEPT
Summary: SET1 H3K4 methyltransferase activity confirmed through functional genetic analysis.
Reason: IMP evidence from set1 deletion showing loss of H3K4 methylation. Multiple evidence types from same paper converge on catalytic function.
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
GO:0042800 histone H3K4 methyltransferase activity
IMP
PMID:11752412
A trithorax-group complex purified from Saccharomyces cerevi...
ACCEPT
Summary: SET1-catalyzed H3K4 methylation demonstrated through trithorax-group complex characterization.
Reason: IMP evidence from trithorax complex purification and functional analysis [PMID:11752412].
Supporting Evidence:
PMID:11752412
Epub 2001 Dec 18. A trithorax-group complex purified from Saccharomyces cerevisiae is required for methylation of histone H3.
GO:0042800 histone H3K4 methyltransferase activity
IDA
PMID:11805083
COMPASS, a histone H3 (Lysine 4) methyltransferase required ...
ACCEPT
Summary: Direct evidence for SET1 histone H3K4 methyltransferase activity from COMPASS characterization.
Reason: IDA evidence from biochemical analysis of COMPASS complex activity [PMID:11805083].
Supporting Evidence:
PMID:11805083
2002 Jan 22. COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression.
GO:0042800 histone H3K4 methyltransferase activity
IMP
PMID:11805083
COMPASS, a histone H3 (Lysine 4) methyltransferase required ...
ACCEPT
Summary: IMP evidence for H3K4 methyltransferase function from mutational analysis.
Reason: Multiple evidence types (IDA and IMP) from same reference support catalytic function.
Supporting Evidence:
PMID:11805083
2002 Jan 22. COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression.
GO:0042800 histone H3K4 methyltransferase activity
IMP
PMID:12845608
Saccharomyces cerevisiae Set1p is a methyltransferase specif...
ACCEPT
Summary: SET1 is a methyltransferase specific for lysine 4 of histone H3 and required for efficient gene expression.
Reason: IMP evidence from studies of SET1 catalytic requirement for gene expression [PMID:12845608]. Converging evidence from multiple studies.
Supporting Evidence:
PMID:12845608
Saccharomyces cerevisiae Set1p is a methyltransferase specific for lysine 4 of histone H3 and is required for efficient gene expression.
GO:0048188 Set1C/COMPASS complex
IPI
PMID:11687631
COMPASS: a complex of proteins associated with a trithorax-r...
ACCEPT
Summary: SET1 protein-protein interactions with COMPASS subunits documented through direct interaction studies.
Reason: Core structural annotation. IPI evidence from COMPASS co-immunoprecipitation studies [PMID:11687631]. Specific complex membership is more informative than generic protein binding. Multiple IPI annotations with different partners all converge on COMPASS complex membership.
Supporting Evidence:
PMID:11687631
COMPASS: a complex of proteins associated with a trithorax-related SET domain protein.
GO:0048188 Set1C/COMPASS complex
IPI
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 h...
ACCEPT
Summary: SET1 interactions with COMPASS complex components demonstrated through multiple approaches.
Reason: Multiple independent IPI studies confirm COMPASS complex membership. Duplicate annotations appropriate for different reference sources.
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
GO:0048188 Set1C/COMPASS complex
IPI
PMID:11752412
A trithorax-group complex purified from Saccharomyces cerevi...
ACCEPT
Summary: IPI evidence for SET1 as component of trithorax-like COMPASS complex.
Reason: Convergent IPI evidence from trithorax complex co-purification studies.
Supporting Evidence:
PMID:11752412
Epub 2001 Dec 18. A trithorax-group complex purified from Saccharomyces cerevisiae is required for methylation of histone H3.

Core Functions

Catalytic component of the COMPASS histone methyltransferase complex responsible for mono-, di-, and trimethylation of histone H3 lysine 4 (H3K4), a fundamental mark of active transcription and gene regulatory regions

Supporting Evidence:
  • PMID:11742990
    The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.

References

Gene Ontology annotation through association of InterPro records with GO terms
Annotation inferences using phylogenetic trees
Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
Gene Ontology annotation based on UniProtKB/Swiss-Prot Subcellular Location vocabulary mapping, accompanied by conservative changes to GO terms applied by UniProt
Electronic Gene Ontology annotations created by ARBA machine learning models
Combined Automated Annotation using Multiple IEA Methods
COMPASS: a complex of proteins associated with a trithorax-related SET domain protein.
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
Histone H3 lysine 4 methylation is mediated by Set1 and required for cell growth and rDNA silencing in Saccharomyces cerevisiae.
A trithorax-group complex purified from Saccharomyces cerevisiae is required for methylation of histone H3.
COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression.
Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry.
Saccharomyces cerevisiae Set1p is a methyltransferase specific for lysine 4 of histone H3 and is required for efficient gene expression.
The Set1 methyltransferase opposes Ipl1 aurora kinase functions in chromosome segregation.
Proteome survey reveals modularity of the yeast cell machinery.
Global landscape of protein complexes in the yeast Saccharomyces cerevisiae.
Structural characterization of Set1 RNA recognition motifs and their role in histone H3 lysine 4 methylation.
Different roles of histone H3 lysine 4 methylation in chromatin maintenance.
Effect of histone methyltransferase gene mutations on sporulation in S. cerevisiae.
Histone crosstalk between H2B monoubiquitination and H3 methylation mediated by COMPASS.
Cotranslational assembly of the yeast SET1C histone methyltransferase complex.
Defining the budding yeast chromatin-associated interactome.
Structural analysis of the core COMPASS family of histone H3K4 methylases from yeast to human.
Spp1, a member of the Set1 Complex, promotes meiotic DSB formation in promoters by tethering histone H3K4 methylation sites to chromosome axes.
H3K4 methyltransferase Set1 is involved in maintenance of ergosterol homeostasis and resistance to Brefeldin A.
Catalytic and functional roles of conserved amino acids in the SET domain of the S. cerevisiae lysine methyltransferase Set1.
Dot1-dependent histone H3K79 methylation promotes the formation of meiotic double-strand breaks in the absence of histone H3K4 methylation in budding yeast.
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate chromatin dynamics and gene transcription.
The histone methyltransferases Set5 and Set1 have overlapping functions in gene silencing and telomere maintenance.
RNA Binding by Histone Methyltransferases Set1 and Set2.
Repression of Middle Sporulation Genes in Saccharomyces cerevisiae by the Sum1-Rfm1-Hst1 Complex Is Maintained by Set1 and H3K4 Methylation.
Binding to RNA regulates Set1 function.
The cellular economy of the Saccharomyces cerevisiae zinc proteome.
Genetic Interactions of Histone Modification Machinery Set1 and PAF1C with the Recombination Complex Rec114-Mer2-Mei4 in the Formation of Meiotic DNA Double-Strand Breaks.
The social and structural architecture of the yeast protein interactome.
Mammalian homologues of the Polycomb-group gene Enhancer of zeste mediate gene silencing in Drosophila heterochromatin and at S. cerevisiae telomeres.
SET1, a yeast member of the trithorax family, functions in transcriptional silencing and diverse cellular processes.
Interaction between Set1p and checkpoint protein Mec3p in DNA repair and telomere functions.
file:yeast/SET1/SET1-deep-research-perplexity.md
Deep research report on SET1

📚 Additional Documentation

Deep Research Perplexity

(SET1-deep-research-perplexity.md)

provider: perplexity
model: sonar-deep-research
cached: false
start_time: '2025-12-31T19:47:56.492784'
end_time: '2025-12-31T19:50:03.874625'
duration_seconds: 127.38
template_file: templates/gene_research_go_focused.md
template_variables:
organism: yeast
gene_id: SET1
gene_symbol: SET1
uniprot_accession: P38827
protein_description: 'RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific {ECO:0000303|PubMed:11805083}; EC=2.1.1.354 {ECO:0000269|PubMed:11805083,
ECO:0000305|PubMed:29071121}; AltName: Full=COMPASS component SET1 {ECO:0000303|PubMed:11742990};
AltName: Full=Lysine N-methyltransferase 2; AltName: Full=SET domain-containing
protein 1 {ECO:0000303|PubMed:11742990};'
gene_info: Name=SET1 {ECO:0000303|PubMed:9398665}; Synonyms=KMT2 {ECO:0000303|PubMed:18022353},
YTX1; OrderedLocusNames=YHR119W;
organism_full: Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast).
protein_family: Belongs to the class V-like SAM-binding methyltransferase
protein_domains: COMPASS_Set1_N-SET. (IPR024657); Nucleotide-bd_a/b_plait_sf. (IPR012677);
Post-SET_dom. (IPR003616); Set1-like. (IPR044570); Set1_fungi. (IPR017111)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
reasoning_effort: medium
search_domain_filter: []
return_citations: true
temperature: 0.0
citation_count: 48


Question

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: P38827
  • Protein Description: RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4 specific {ECO:0000303|PubMed:11805083}; EC=2.1.1.354 {ECO:0000269|PubMed:11805083, ECO:0000305|PubMed:29071121}; AltName: Full=COMPASS component SET1 {ECO:0000303|PubMed:11742990}; AltName: Full=Lysine N-methyltransferase 2; AltName: Full=SET domain-containing protein 1 {ECO:0000303|PubMed:11742990};
  • Gene Information: Name=SET1 {ECO:0000303|PubMed:9398665}; Synonyms=KMT2 {ECO:0000303|PubMed:18022353}, YTX1; OrderedLocusNames=YHR119W;
  • Organism (full): Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast).
  • Protein Family: Belongs to the class V-like SAM-binding methyltransferase
  • Key Domains: COMPASS_Set1_N-SET. (IPR024657); Nucleotide-bd_a/b_plait_sf. (IPR012677); Post-SET_dom. (IPR003616); Set1-like. (IPR044570); Set1_fungi. (IPR017111)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "SET1" matches the protein description above
  2. Verify the organism is correct: Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'SET1' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene SET1 (gene ID: SET1, UniProt: P38827) in yeast.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output


Let me analyze the query carefully. The user is asking for a comprehensive research report on the gene SET1 (UniProt: P38827) from Saccharomyces cerevisiae, which is the baker's yeast.

The user has provided context that this gene encodes a "Histone-lysine N-methyltransferase, H3 lysine-4 specific" and is a component of the COMPASS complex. The user specifically wants:

  1. Comprehensive research on the gene
  2. Primary function - what reaction is catalyzed, substrate specificity
  3. Localization - where in or outside the cell the product functions
  4. Signaling/biochemical pathways
  5. Evidence from experimental studies, structure, evolution, bioinformatics
  6. Should prioritize precise studies over high-throughput

I've been given 60 search results that are highly relevant to this query. Let me review what I have:

The search results cover:
- The discovery that Set1 is an H3K4 methyltransferase (sources 1, 2, 4, 21, 24)
- COMPASS complex composition and function (sources 3, 6, 7, 15, 19, 22, 35, 36, 40, 41, 55, 58)
- Structural basis for H3K4 methylation (sources 5, 15, 19, 22, 37, 56, 58)
- H3K4 methylation patterns and methylation state regulation (sources 14, 17, 32, 37, 56)
- Set1 functions beyond H3K4 methylation (sources 8, 10, 11, 20, 23, 51)
- The role of H2B ubiquitination in Set1 activation (sources 15, 22, 28, 36, 45, 48)
- Set1's role in transcription (sources 8, 13, 27, 31, 32, 33, 34, 38, 41, 49, 52, 57)

  • Set1's involvement in meiosis and DNA repair processes (sources 10, 20, 35, 43, 46, 51)
  • Regulatory domains and control mechanisms (sources 14, 37, 39, 40, 42, 51, 56, 58)
  • CFP1/Cfp1 targeting the complex to specific genomic locations (sources 34, 38, 41, 52)
  • Evolutionary conservation across species (sources 24, 35, 51)

Now I need to structure this into a comprehensive 10,000+ word report that follows the format requirements. The report should have:

  1. Title with summary paragraph
  2. Main body sections (at least 5 major sections)
  3. Conclusion
  4. Flowing narrative prose, no lists
  5. Proper citations throughout

Key topics to cover:
1. Introduction to Set1 and its discovery
2. Structure and enzymatic mechanism
3. The COMPASS complex and its assembly
4. Substrate specificity and catalytic activity
5. H3K4 methylation patterns and regulation
6. Transcriptional regulation roles
7. DNA repair and meiotic functions
8. Regulation of Set1 activity (H2B ubiquitination, complex assembly)
9. Evolutionary conservation and mammalian homologs
10. Cellular localization and association with transcription machinery

I'll begin drafting the comprehensive report now.

Set1: The Catalytic Foundation of Histone H3 Lysine 4 Methylation in Saccharomyces cerevisiae

The SET1 gene of Saccharomyces cerevisiae encodes a histone lysine methyltransferase with extraordinary functional significance in eukaryotic chromatin regulation, serving as the prototypical member of the SET1/MLL family of enzymes that catalyze the site-specific methylation of histone H3 at lysine 4 (H3K4)[1][4][7]. Set1p protein functions exclusively as the catalytic component of the COMPASS complex, a multi-subunit assemblage that mediates the mono-, di-, and trimethylation of H3K4 in coordination with actively transcribed genes, providing one of the most conserved and well-characterized histone modification marks in eukaryotic chromatin[3][12][55]. Since its initial identification as a member of the trithorax family of chromatin-modifying proteins and subsequent characterization as the first histone lysine 4-specific methyltransferase in budding yeast, Set1 has emerged as a crucial regulator of gene expression, DNA repair, meiotic recombination, and cellular differentiation, with mechanisms of action extending well beyond its canonical role in histone methylation[1][2][21][26].

Discovery and Initial Characterization of Set1 as a Histone Methyltransferase

The identification of Set1 as a histone methyltransferase represented a watershed moment in understanding chromatin-based gene regulation, arriving at a time when the enzymatic machinery responsible for histone lysine methylation remained largely mysterious. Prior to the characterization of Set1, researchers had identified SUV39 and G9a as histone methyltransferases specific for lysine 9 and lysine 27 of histone H3, establishing that SET domain-containing proteins could catalyze histone lysine methylation[1][24]. Through sequence analysis and phylogenetic comparison, investigators recognized that the SET domain of yeast Set1 and the related Drosophila trithorax proteins constituted a distinct branch within the SET domain-containing protein superfamily, separate from the SUV39/G9a lineage[1]. This observation prompted researchers to test whether Set1 might function as a histone methyltransferase with different substrate specificity. Biochemical purification experiments using affinity-tagged Set1 complex components revealed that when Set1 was associated with its cognate protein complex, it displayed robust enzymatic activity toward histone H3 peptides, but critically, this activity was specific for lysine 4 rather than lysine 9[1][2]. Chromatin immunoprecipitation assays subsequently demonstrated that H3K4 methylation was completely abolished in set1 deletion strains, establishing Set1 as the sole H3K4 methyltransferase in budding yeast[21]. This discovery was particularly significant because it provided the first evidence that different SET domain proteins could evolve to recognize and methylate distinct lysine residues on histone H3, establishing a principle that would prove applicable across the eukaryotic kingdom[1].

The initial characterization also revealed that Set1 alone, as an isolated protein, possessed minimal to negligible enzymatic activity toward histone substrates[3][12]. This finding immediately suggested that Set1 functioned within a larger protein assembly where additional subunits contributed essential regulatory and catalytic functions. Subsequent investigations identified multiple proteins that physically associated with Set1 through co-immunoprecipitation and tandem affinity purification strategies, leading to the discovery of a stable eight-subunit complex that was subsequently designated as COMPASS, for Complex of Proteins Associated with Set1[3][6][55]. This architectural organization, in which the catalytic subunit depends upon multiple regulatory partners for full enzymatic activity, would emerge as a characteristic feature of the SET1/MLL methyltransferase family throughout eukaryotes[3][35][40].

Enzymatic Mechanism and Catalytic Function

The catalytic mechanism by which Set1 methylates histone H3 at lysine 4 involves the transfer of a methyl group from the universal methyl donor S-adenosylmethionine (SAM) to the ε-amino nitrogen of the lysine residue, a nucleophilic substitution reaction that is mediated by the highly conserved SET domain positioned at the carboxy terminus of the protein[15][22][37]. The SET domain represents approximately 130 amino acids and adopts a distinctive three-dimensional fold that creates a binding pocket for both the protein substrate and the SAM cofactor[37][40]. Within this catalytic pocket, conserved amino acid residues position the methyl group of SAM to facilitate its transfer to the target lysine, while additional residues stabilize the substrate and coordinate the geometry necessary for efficient catalysis[37][40][56]. Structural studies of related SET domain proteins, particularly human MLL1 and other SET1 family members, have revealed that the SET domain in its isolated form adopts an "open" conformation that severely restricts enzymatic activity, with the catalytic pocket remaining incompletely formed[40][56]. Upon assembly of the full COMPASS complex, however, the SET domain undergoes significant conformational rearrangement to adopt a "closed" conformation in which the catalytic pocket achieves optimal geometry for both substrate binding and catalytic turnover[58].

The substrate specificity of Set1 is remarkable in its absolute restriction to histone H3 lysine 4, even in the context of purified core histones where other lysine residues on histone H3 (such as lysine 9, lysine 14, and lysine 27) are potential methylation targets[1][2][21]. This specificity reflects both the architecture of the catalytic pocket and the orientation of the histone H3 tail within the active site, where contacts between the COMPASS complex and the histone N-terminal domain position lysine 4 directly adjacent to the catalytic center while sterically excluding other lysine residues[19][22][58]. The specificity is further reinforced by recognition elements within the COMPASS core that directly engage specific residues within the histone H3 N-terminal tail, establishing contacts that ensure only the correct lysine residue can access the methylation machinery[19][58].

The COMPASS Complex: Architectural Organization and Subunit Functions

Set1 functions exclusively as the catalytic component of the COMPASS complex, which in budding yeast comprises eight distinct protein subunits that have been designated according to their molecular weights as Cps60, Cps50, Cps40, Cps35, Cps30, Cps25, Cps15, and the catalytic Set1 protein itself[3][55]. Each subunit performs a distinct and non-redundant function in the assembly, stability, and catalytic regulation of the complex, with structural and biochemical studies having elucidated the molecular basis for these contributions[3][58]. The Set1 SET domain is positioned at the geometric center of a Y-shaped complex arrangement, where it is directly contacted by multiple subunits including Cps60, Cps50, and Cps30, which together form a catalytic core module that provides the essential platform for methyltransferase activity[58]. Cps60, also known as Bre2 or its mammalian ortholog ASH2L, contributes a SPRY domain that directly stabilizes the SET-I motif, one of the most dynamic and catalytically critical regions within the SET domain[15][22][58]. Cps50, known as Swd1 or RbBP5 in mammals, provides a WD40 repeat β-propeller fold that scaffolds the SET domain while also making multiple contacts with nucleosomal histones, thereby positioning the histone substrate correctly relative to the catalytic center[15][19][22].

Cps30, orthologous to mammalian WDR5, makes extensive contacts with the SET-I helix and includes a region termed the SIM (SET-I interacting motif) that directly stabilizes the catalytically productive "closed" conformation of the SET domain[58]. Genetic and biochemical evidence demonstrates that Cps30 is specifically required for the highest levels of H3K4 trimethylation, with its absence resulting in a profound decrease in the proportion of H3K4me3 while leaving H3K4me2 and H3K4me1 largely intact[3][55][56]. This selective requirement reflects the energetic barrier to successive methylation rounds, where the stabilization of the SET domain conformation by Cps30 becomes increasingly critical for achieving complete trimethylation[56][58]. Cps40, also known as Spp1 or CFP1 in mammals, sits atop the complex and contains a CXXC zinc finger domain that recognizes non-methylated DNA sequences (particularly CpG-rich regions) in vertebrate cells and a PHD finger that recognizes H3K4me3, thereby enabling the complex to recognize its target sites on chromatin[34][38][41][52]. Cps35, also designated SWD2, represents the only essential subunit of the COMPASS complex in budding yeast and appears to be shared with other chromatin-modifying complexes, functioning to link H3K4 methylation with H2B monoubiquitination[3][55]. Cps25 and Cps15 serve supporting structural roles, with Cps25 contributing to the stability of the complex and Cps15 appearing to function in complex assembly[55].

The architectural organization of the COMPASS complex has been elucidated through complementary applications of cryo-electron microscopy and biochemical reconstruction, revealing a highly organized assembly in which each subunit occupies a distinct spatial position and makes specific contacts that collectively determine the catalytic properties of the enzyme[58]. The complex exhibits a characteristic Y-shaped morphology when viewed in certain orientations, with the SET domain positioned at the junction of the three arms, directly accessible to histone substrates[19][58]. Notably, structural studies have identified dynamic regions within the complex, particularly including the carboxy-terminal modules containing Cps60 and Cps25, which exhibit variable positioning relative to the catalytically competent core, suggesting that conformational flexibility may facilitate substrate engagement and product release during the catalytic cycle[58].

Regulation of H3K4 Methylation States: From Monomethylation to Trimethylation

The COMPASS complex possesses the remarkable capacity to catalyze sequential methylation reactions, progressively converting H3K4me0 to H3K4me1, then to H3K4me2, and finally to H3K4me3, with each methylation state displaying distinct localization patterns on chromatin and serving specific functional roles[3][12][17][37][55][56]. The distribution of these three methylation states across the genome is highly organized and reflects both the targeting mechanisms that recruit COMPASS to specific genomic regions and the kinetic parameters that govern successive methylation rounds[17]. In general, H3K4me3 displays a narrow enrichment immediately downstream of transcription start sites, typically peaking approximately 200 base pairs downstream of the nucleosome-depleted region, with this peak precisely coinciding with the +1 nucleosome position[17][56]. In contrast, H3K4me2 exhibits a much broader distribution, often extending further downstream into gene bodies and showing peak positions that vary considerably more among individual genes than is observed for H3K4me3[17]. H3K4me1 displays the most distributed pattern, with substantial enrichment at enhancers and promoter-distal regions, and generally lower peak intensities compared to the higher methylation states[17][56].

The molecular basis for this three-state hierarchy appears to reflect the intrinsic kinetic properties of the COMPASS enzyme working in conjunction with the dynamics of RNA Polymerase II transcription. The catalytic mechanism involves a Y/F switch residue (tyrosine 1052 in yeast Set1) positioned within the catalytic pocket, which functions to limit the intrinsic methyltransferase activity of the isolated SET domain to predominantly monomethylation[37][56]. This tyrosine residue creates a catalytic pocket that is well-suited for the first methyl transfer but creates steric constraints that make successive methylation rounds progressively more difficult without additional regulatory input[37][56]. The critical insight emerged from studies demonstrating that mutation of tyrosine 1052 to phenylalanine (Y1052F), which enlarges the catalytic pocket, dramatically enhances H3K4 trimethylation at the expense of the lower methylation states[56][57]. However, under wild-type conditions, the progression from monomethylation to higher states requires assistance from specific subunits, particularly Cps40 and Cps60, which working in concert with Cps30 modulate the catalytic pocket geometry and stabilize the SET domain conformation to permit higher-order methylation[56][58].

The transcription-dependent deposition of H3K4 methylation states reflects the intimate association of COMPASS with actively elongating RNA Polymerase II and the kinetic properties of the complex as it travels along genes[36][56]. During promoter-proximal regions where Polymerase II transcribes at relatively slow rates during the promoter escape phase, the COMPASS complex maintains sustained contact with its histone substrates, providing an extended window of opportunity for successive methylation reactions and thereby generating predominantly trimethylated nucleosomes[36][56]. In contrast, within downstream regions where Polymerase II elongates at much faster rates, the complex has less opportunity to convert H3K4me1 to higher states, resulting in the accumulation of predominantly monomethylated nucleosomes[36][56]. This elegant model elegantly explains how the pattern of H3K4 methylation emerges from the interplay between enzyme specificity, transcription dynamics, and chromatin structure, without requiring distinct targeting or targeting mechanisms for different methylation states[36][56].

Transcriptional Regulation and H3K4 Methylation as a Marker of Active Chromatin

The relationship between H3K4 trimethylation and active transcription represents one of the most robust correlations in chromatin biology, with genome-wide studies across diverse eukaryotic organisms consistently demonstrating strong positive correlation between H3K4me3 abundance at promoters and the expression levels of associated genes[2][8][13][17][27][56]. The mechanistic basis for this correlation appears multifactorial, reflecting both the recruitment of H3K4me3 by transcription factors and transcriptional machinery and the recognition of H3K4me3 by transcriptional regulators and chromatin remodeling complexes that facilitate further transcription initiation and elongation. Recent studies have revealed that H3K4me3 is recognized by specific protein modules, particularly PHD (plant homology domain) fingers and related recognition domains present in proteins such as Cps40, which contains a PHD finger that preferentially binds H3K4me3 with micromolar affinity compared to unmethylated H3K4 or lower methylation states[34][38][52]. Furthermore, the N-terminal transcriptional initiation factors and general transcription machinery components include H3K4me3 recognition modules, suggesting that this histone modification directly facilitates the recruitment and productive engagement of the transcription machinery at promoters[27][34].

The interplay between H3K4 methylation and H3K4 acetylation has emerged as a particularly important regulatory axis in the precise control of inducible genes[8][13][49][57]. Recent high-resolution studies have demonstrated that during the initial induction of genes in response to environmental signals, H3K4 acetylation increases at transcription start sites in preference to H3K4 methylation, enabling rapid gene activation even in the absence of H3K4me3[8][13][49][57]. As induction signals persist and continue, H3K4 methylation progressively increases while acetylation decreases, suggesting a temporal program in which acetylation mediates the rapid initial response while methylation provides long-term maintenance of the active state[8][13][49][57]. This interplay reflects the fact that H3K4 lysine residue can be modified by either methylation or acetylation, and these modifications are mutually exclusive, competing for the same residue[8][13][49][57]. The molecular mechanism appears to involve H3K4 acetylation-binding proteins and factors that accumulate in the absence of Set1 or in the absence of H3K4 methylation, suggesting that the deposition of the methyl mark actively displaces proteins that preferentially recognize the acetylated state[8][13][49][57].

Set1 function extends beyond its canonical role in transcriptional activation to include important roles in repressing or preventing inappropriate expression of genes that should not be transcribed under specific cellular conditions[2][8][11][13][49][57]. Studies of cells lacking Set1 have revealed not only decreases in expression of expected target genes but also increases in expression of genes that are normally repressed, particularly including genes involved in stress response, cell surface protein synthesis, and developmental programs that should remain silenced during standard growth conditions[8][13][49][57]. This apparent paradox—that loss of a protein associated with "active" transcription can lead to increased expression of specific genes—is explained by the principle that Set1 and H3K4 methylation function to establish and maintain repressive chromatin structure at specific genomic loci through mechanisms independent of direct transcription activation[8][11][13][49][57]. The precise mechanisms remain incompletely understood but appear to involve the recruitment of co-repressor complexes and chromatin-modifying activities through H3K4me3 recognition, or alternatively, the suppression of acetylation that would otherwise permit spurious transcription[8][11][13][49][57].

H2B Ubiquitination as an Activating Signal for COMPASS

The catalytic activity of COMPASS is profoundly stimulated by the monoubiquitination of histone H2B on lysine 123 (K123), a modification catalyzed by the ubiquitin-conjugating enzyme Rad6 and the E3 ubiquitin ligase Bre1[3][15][22][28][36][45][48][55]. This histone modification crosstalk represents one of the best-characterized examples of trans-histone communication, wherein modification of one histone residue directly regulates the catalytic activity of enzymes that modify a different histone[15][22][28][55]. H2B ubiquitination is itself dependent upon prior transcription of the associated genes and correlates strongly with the accumulation of H3K4 and H3K79 methylation marks, suggesting a coordinated regulatory system wherein active transcription marks the nucleosome for modification with ubiquitin, which in turn stimulates further histone methylation[3][28][36][55]. The mechanism by which H2Bub stimulates H3K4 methylation by COMPASS has been illuminated through cryo-electron microscopy structures of the COMPASS complex bound to ubiquitinated nucleosomes, revealing that the H2B-linked ubiquitin makes direct contacts with multiple COMPASS subunits, particularly including Swd1 (RbBP5) and the Set1 ARM helix region[15][22]. These contacts relieve an inhibitory effect imposed by the Set1 ARM helix in its unactivated state, thereby promoting the transition of the SET domain to the catalytically productive closed conformation[15][22].

The quantitative parameters of COMPASS activation by H2Bub are substantial, with measurements of H3K4 methylation activity increasing several-fold in the presence of ubiquitinated nucleosomes compared to unmodified nucleosomes[3][15][22][28][36][55]. The mechanism appears to involve both increased catalytic turnover rate and enhanced substrate binding, with ubiquitin-mediated conformational changes within the complex stabilizing the catalytically active form of the SET domain while simultaneously improving the positioning of the histone substrate relative to the catalytic center[15][22]. Notably, the stimulation by H2Bub is particularly pronounced for the higher methylation states (H3K4me2 and H3K4me3), suggesting that ubiquitination primarily facilitates the kinetically less favorable successive methylation rounds[3][15][22][28][36]. This regulation ensures that efficient trimethylation of H3K4 is coupled to active transcription and H2B ubiquitination, preventing promiscuous H3K4 trimethylation at inactive genomic regions and thereby maintaining the fidelity of this important chromatin mark[3][15][22][28][36].

The Cps35 (SWD2) subunit of COMPASS mediates the critical interaction with H2Bub, with this subunit exhibiting preferential interaction with ubiquitinated nucleosomes compared to unmodified nucleosomes, although the interaction appears to be indirect and mediated through other COMPASS components[3][55]. Loss of Cps35 results in profound defects in H3K4 di- and trimethylation despite leaving H3K4 monomethylation relatively intact, consistent with the preferential coupling of ubiquitination-stimulated catalysis to the higher methylation states[3][55]. This selective requirement reflects the energetic demands of completing successive methylation reactions, which become progressively more difficult without the conformational stabilization provided by ubiquitin-induced changes within the complex[3][55].

Cellular Localization and Association with Transcription Machinery

Set1 and the COMPASS complex localize primarily to the nucleus, where they associate with actively transcribing regions of chromatin in a dynamic and regulated manner that reflects the transcriptional status of individual genes[33][34][36][52]. The complex travels with elongating RNA Polymerase II along transcribed genes from the promoter region through the coding sequence, achieving this co-transcriptional localization through direct or indirect interactions between COMPASS components and the elongating polymerase, as well as through recognition of histone modifications that accumulate on actively transcribed genes[33][36][52]. The Paf1 complex (Paf1C), which comprises Paf1, Ctr9, Cdc73, Rtf1, and Leo1, represents a key intermediary in recruiting COMPASS to elongating Polymerase II, as Paf1C associates with the phosphorylated carboxyl-terminal domain (CTD) of Polymerase II throughout transcription and directly or indirectly recruits COMPASS to these sites[33][36][52]. In mammalian cells, the CFP1 component of the SET1 complex provides an additional targeting mechanism through recognition of non-methylated CpG-rich DNA sequences and recognition of H3K4me3 already deposited on chromatin, enabling the complex to efficiently find and occupy its target gene promoters[34][38][41][52].

The targeting of COMPASS to specific genomic regions represents a critical control point in regulating which genes receive H3K4 methylation and to what degree. In yeast, Spp1 (the Cps40 ortholog) contains a PHD finger that recognizes H3K4me3, potentially creating a positive feedback loop wherein pre-existing H3K4me3 marks facilitate the recruitment of additional COMPASS complexes to deposit further methylation[34][52]. In metazoan cells, CFP1 employs a more complex targeting mechanism that involves multivalent interactions with chromatin, wherein the CXXC zinc finger recognizes non-methylated DNA while the PHD finger recognizes H3K4me3, and these two interactions work in concert to achieve stable occupancy at target promoters while excluding ectopic binding at non-promoter regions[34][38][41][52]. Live-cell imaging studies have demonstrated that CFP1 dynamics within the nucleus are predominantly determined by its association with the SET1 complex, with the complex predominating over individual targeting domains in governing occupancy time and residency on chromatin[34][52]. The dynamic nature of COMPASS targeting reflects the continuous cycles of complex assembly, chromatin engagement, enzymatic activity, and dissociation that occur during transcription, with the complex likely undergoing cycles of engagement and release as the polymerase traverses genes[33][34][36][52].

Functions Beyond H3K4 Methylation: DNA Repair, Meiosis, and Chromosome Biology

While the canonical enzymatic function of Set1 involves the catalytic methylation of histone H3 at lysine 4, subsequent investigations have revealed that Set1 contributes to multiple cellular processes through both H3K4 methylation-dependent and H3K4 methylation-independent mechanisms[11][20][23][35][43][51]. Set1 plays important roles in DNA double-strand break (DSB) formation during meiosis, with set1 deletion strains displaying profound impairment in the induction of DSBs in response to meiotic signals[20][43]. Notably, this meiotic function does not absolutely require the histone methyltransferase activity of Set1, as catalytic mutants retain residual DSB formation capacity, suggesting that the SET domain provides essential non-catalytic scaffolding or signaling functions distinct from its enzymatic activity[20][43]. However, a more nuanced analysis revealed that specific point mutations within the SET domain that eliminate H3K4 methylation activity (such as G951S) do impair meiotic DSB formation, suggesting that while the methyltransferase activity may not be absolutely required under all circumstances, it does contribute meaningfully to this process[20][43].

The mechanism by which Set1 contributes to meiotic DSB formation appears to involve the targeting of histone H3K4 methylation sites to chromosome axes, where they can be recognized by components of the meiotic recombination machinery[20][31][43][46]. Spp1, a component of the Set1 complex, has been shown to associate with chromosome axes during meiosis through interactions with other meiotic proteins including Mer2, and the localization of Spp1 to these axis sites appears to depend on its association with the Set1 complex[31][46]. In cells where the Set1-Spp1 interaction is disrupted while maintaining Spp1's association with Mer2, DSB formation is severely impaired despite maintenance of normal H3K4 methylation levels elsewhere in the genome, demonstrating that the physical association of Spp1 with the Set1 complex is required for proper meiotic function[31][46]. Furthermore, Set1 has been shown to interact with the checkpoint protein Mec3, suggesting a role in coordinating meiotic progression with the completion of DNA replication and DSB formation[20][23][43].

Set1 contributes to the maintenance of chromosome structure and segregation during mitosis, as set1 deletion strains show partial loss of telomeric silencing and shortened telomeres, phenotypes reflecting defects in the establishment and maintenance of heterochromatin at chromosome ends[2][11][26][29]. The role of Set1 in subtelomeric gene repression specifically depends upon its enzymatic activity toward H3K4, as cells expressing catalytic mutants of Set1 recapitulate the subtelomeric derepression seen in set1 deletion strains[11][29]. However, the maintenance of telomere length involves both H3K4 methylation-dependent and H3K4 methylation-independent mechanisms, with evidence supporting a role for Set1 in regulating the expression of telomere maintenance genes through both transcriptional and posttranslational mechanisms[11][29]. The protein abundance of Stn1, a component of the CST complex that plays essential roles in telomere replication and protection, is regulated by Set1 through mechanisms that do not appear to correlate with mRNA expression changes, suggesting posttranslational control[11][29].

Set1 function in DNA damage response and repair represents another important dimension beyond transcriptional regulation, with evidence indicating that Set1 is recruited to sites of DNA double-strand breaks through association with chromatin remodeling complexes and serves to coordinate histone modification patterns with DNA repair gene expression[35][51]. Loss of MLL3, a mammalian Set1 ortholog, leads to decreased expression of genes involved in DNA damage response and homologous recombination, with these defects correlating with reduced H3K4me3 at target genes[35][51]. In mammals, the PTIP protein associates with MLL3 and MLL4 to recruit the MRE11 nuclease complex to stalled replication forks, with the recruitment correlating with MLL3/4-mediated H3K4 methylation, suggesting a direct role for these methyltransferases in coordinating the molecular response to replication stress[35][51]. SETD1A, another mammalian Set1 ortholog, has been shown to facilitate DNA repair through H3K4me3-dependent recruitment of histone chaperones and DNA repair factors including RAD51, promoting the protection of stalled replication forks from degradation and thereby facilitating homologous recombination-based repair[35][51].

Evolutionary Conservation and Structure-Function Analysis

The SET1 family of histone methyltransferases exhibits remarkable conservation from the unicellular yeast through invertebrates and vertebrates, with all family members sharing a highly conserved SET domain catalytic apparatus and similar overall architectures within their respective COMPASS-like complexes[24][35][40][51][55]. Humans encode six SET1 family members designated as SETD1A, SETD1B, MLL1 (also known as KMT2A), MLL2 (KMT2B), MLL3 (KMT2C), and MLL4 (KMT2D), each associated with distinct regulatory subunits and possessing specialized functions in human development and cellular regulation[35][40][51][55]. The budding yeast Set1 protein can functionally complement deletions of certain mammalian SET1 family members in some contexts, and conversely, mammalian SET1 proteins can substitute for yeast Set1 in select experiments, demonstrating that the fundamental catalytic mechanism and regulatory principles have been conserved across vast evolutionary distances[35][40][51]. However, the mammalian SET1 family members display distinct biochemical properties, with in vitro biochemical reconstitution studies revealing that MLL1/2 predominantly catalyze H3K4 mono- and dimethylation in the presence of the WRAD module (WDR5, RbBP5, ASH2L, and DPY30), while MLL3/4 primarily catalyze monomethylation, whereas SETD1A/B display the capacity for robust tri-methylation similar to yeast Set1[35][40][51].

The structural organization of Set1 and its orthologs includes a highly conserved SET domain at the carboxy terminus constituting the primary catalytic apparatus[21][24][37][51]. The yeast Set1 protein contains a RNA recognition motif (RRM) at its amino terminus, with structural and functional studies revealing that this motif interacts with RNA but that this interaction is not directly coupled to H3K4 methylation activity[39][51]. Mutation of the RRM destabilizes the Set1 protein and indirectly affects methylation through this destabilization mechanism rather than through direct impairment of catalytic function[39][51]. In mammalian SET1 family members, the RRM domains appear to play related roles in protein stability and subcellular localization, with these domains being particularly important for SETD1A and SETD1B, where they are positioned immediately adjacent to the SET domain and contain an "N-SET" motif that contributes to achieving high levels of H3K4 trimethylation[51]. The divergence in architecture between different SET1 family members appears to correlate with their specialized functions, with the simpler yeast Set1 serving as the sole H3K4 methyltransferase supporting all aspects of H3K4 methylation across the genome, while mammalian family members have specialized to regulate distinct genomic compartments and developmental programs[35][40][51].

Regulation of Set1 Expression and Activity Through Post-Translational Mechanisms

The activity and abundance of Set1 are regulated through multiple post-translational mechanisms that fine-tune its function in response to cellular conditions and developmental cues. Set1 protein stability appears to be tightly coupled to its incorporation into the COMPASS complex, with evidence indicating that Set1 undergoes rapid degradation when complex assembly is disrupted or when critical subunits are absent[3][35][51]. The mechanisms governing this regulation involve ubiquitin-mediated proteasomal degradation, with specific recognition elements on Set1 likely serving as substrates for E3 ubiquitin ligases under conditions of complex disruption[3][35][51]. Conversely, the H3K4 demethylase Jhd2 undergoes polyubiquitin-mediated degradation, with the cellular concentration of Jhd2 opposing the effects of Set1 on H3K4 methylation levels[53]. This regulatory mechanism enables cells to rapidly adjust the balance between H3K4 methylation and demethylation in response to developmental or environmental signals by modulating the abundance of the demethylase through ubiquitin-mediated proteolysis[53].

The transcription-dependent regulation of H3K4 methylation reflects the coordinated recruitment of COMPASS to actively transcribing genes through its association with the elongating Polymerase II and ancillary complexes such as Paf1C, with this recruitment occurring through phosphorylation-dependent protein-protein interactions[33][36][52]. The Ctk1 kinase, which phosphorylates the carboxyl-terminal domain (CTD) of Polymerase II at serine 2 residues during the transition from initiation to elongation, contributes to establishing the CTD phosphorylation landscape that recruits elongation factors including those involved in COMPASS recruitment[33][36]. The rate of Polymerase II transcription elongation influences the methylation state deposited by COMPASS, with slower transcription rates during promoter-proximal regions enabling longer substrate interaction times and thus greater progression toward trimethylation, while faster elongation rates limit the extent of methylation[36][56]. This elegant mechanism couples the degree of H3K4 methylation to the kinetics of transcription without requiring distinct targeting mechanisms for the different methylation states, instead exploiting the kinetic properties of the complex to generate the observed distribution of mono-, di-, and trimethylated nucleosomes[36][56].

Interdependence of Set1 with Other Chromatin-Modifying Activities

Set1 and H3K4 methylation function within an interconnected network of chromatin modifications and remodeling activities that collectively determine chromatin structure and gene expression state. Set1-mediated H3K4me2 and H3K4me3 specifically recruits the Set3 histone deacetylase complex to promoter-proximal regions through recognition of these methylation marks by the Set3 complex, establishing a domain with reduced histone acetylation immediately downstream of the highly acetylated nucleosome-depleted region at promoters[32]. This activity of Set1 thus indirectly suppresses nucleosome remodeling and acetylation in specific chromatin zones, contributing to the fine-tuning of gene expression by establishing distinct chromatin territories with different histone modification and nucleosome organization states[32]. The cooperation between H3K4 methylation and acetylation represents another critical interdependence, with evidence indicating that Set1 and its methyltransferase activity can suppress acetylation under certain circumstances through mechanisms that remain incompletely understood but likely involve either steric effects or recruitment of deacetylase complexes[8][13][32][49][57]. Conversely, H3K4 acetylation can accumulate when H3K4 methylation is absent or reduced, suggesting that acetylation marks the default modified state of the H3K4 residue in the absence of methyltransferase activity[8][13][49][57].

The Set2 methyltransferase catalyzes H3K36 methylation throughout transcribed regions, with this modification functioning complementarily to Set1-mediated H3K4 methylation in establishing distinct chromatin territories along gene bodies[33][36]. Set2 is recruited to elongating Polymerase II through Ctk1-mediated phosphorylation of Ser2 residues on the CTD, establishing a temporal hierarchy in which Set1 is primarily active during the slower promoter-proximal regions while Set2 becomes progressively more important during downstream elongation[33][36]. These two methyltransferases function in parallel but distinct pathways, each establishing histone modification patterns that reflect and reinforce the transcriptional state of different gene regions[33][36].

Cellular Responses and Physiological Consequences of Set1 Function

The genome-wide regulation of transcription by Set1-mediated H3K4 methylation ensures that cells maintain appropriate expression patterns of thousands of genes while suppressing inappropriate expression of programs that should be restricted to specific cellular conditions or developmental stages[2][8][13][21][49][57]. Loss of Set1 or Set1-mediated H3K4 methylation does not typically result in global transcriptional collapse but instead reveals the specialized regulatory role of H3K4 methylation in controlling genes whose expression must be precisely timed or spatially restricted. In yeast, set1 deletion strains display growth defects, defective sporulation and meiosis, shortened telomeres with derepressed subtelomeric genes, and defects in cell cycle regulation, reflecting the pleiotropic functions of Set1 beyond simple transcriptional activation[2][21][26][43][51]. In mammalian development, MLL1 is absolutely required for embryonic viability, as MLL1 null mice die early in development, while heterozygous MLL1 mice display retarded growth, abnormal hematopoiesis, and anterior-posterior skeletal patterning defects[35][51]. SETD1A and SETD1B knockout embryos display gastrulation failure and severe growth retardation respectively, suggesting non-redundant roles despite their high sequence similarity[35][51].

The specialized roles of different SET1 family members in mammalian development are exemplified by their distinct requirements in different cell lineages and developmental stages, with MLL complexes being particularly critical for the regulation of Hox gene expression during anterior-posterior body axis specification and for maintaining the proliferation of hematopoietic stem and progenitor cells[35][51]. These specialized functions reflect not only differences in the catalytic properties of the enzymes but also differences in the regulatory subunits with which they associate and the transcription factors through which they are recruited to specific genomic regions[35][40][51]. Recent evidence indicates that transcription factors directly interact with COMPASS-like complex components including CFP1 and WDR5, enabling sequence-specific recruitment of the methyltransferase activity to target gene promoters in response to extracellular signals or developmental cues[27][34][41][52]. This mechanism enables cells to dynamically regulate which genes receive H3K4 methylation in response to environmental signals such as ER stress, enabling signal-dependent transcriptional responses[27].

Conclusion and Perspectives on Set1 Function and Regulation

Set1 represents a paradigmatic example of how catalytic specificity, complex architecture, and dynamic regulation converge to establish precise control over chromatin structure and gene expression in eukaryotic cells[1][3][7][12][21][35][36][40][51][55]. The protein functions exclusively as the catalytic component of the COMPASS complex, where it catalyzes the site-specific methylation of histone H3 at lysine 4 with absolute specificity despite the presence of other lysine residues that could potentially serve as methylation targets[1][3][21][37]. The enzymatic mechanism involves the transfer of a methyl group from S-adenosylmethionine to the ε-amino nitrogen of lysine 4, with this reaction occurring exclusively in the context of the full COMPASS complex where multiple subunits contribute regulatory functions that stabilize the catalytic domain conformation and facilitate substrate recognition[15][22][37][40][58]. The catalytic domain exists in dynamic equilibrium between inactive "open" and catalytically productive "closed" conformations, with the transition to the closed state depending critically upon interactions with other COMPASS subunits, particularly Cps30 (WDR5) and Cps60 (ASH2L)[40][58]. Set1 catalyzes sequential rounds of methylation to generate H3K4me1, H3K4me2, and H3K4me3, with the distribution of these three states reflecting both the kinetic properties of the enzyme and the dynamics of transcription elongation[36][37][56].

The recruitment of Set1 and COMPASS to specific genomic regions occurs through multiple complementary mechanisms, including co-transcriptional recruitment via association with elongating Polymerase II, recognition of histone modifications including H3K4me3 itself, and in metazoans, recognition of non-methylated CpG-rich DNA sequences by the CFP1 targeting subunit[33][34][36][38][41][52]. The H2B monoubiquitination provides a critical regulatory signal that stimulates the catalytic activity of COMPASS specifically toward the higher methylation states, ensuring that efficient H3K4 trimethylation is coupled to active transcription and H2B ubiquitination[3][15][22][28][36]. The H3K4 methylation marks deposited by Set1 serve multifaceted roles in promoting and maintaining transcription through recruitment of transcriptional machinery components and co-activators, while also functioning in DNA repair, meiotic recombination, and chromosome segregation through both H3K4 methylation-dependent and -independent mechanisms[2][11][20][23][27][35][43][51].

The evolutionary conservation of Set1 from unicellular yeast through metazoans, coupled with the clinical significance of SET1 family members in human leukemogenesis (MLL translocations represent one of the most frequent recurring chromosomal abnormalities in acute leukemias), underscores the fundamental importance of H3K4 methylation for normal cellular function and the necessity of precise regulation of these enzymes in controlling cell fate decisions[35][51]. Future investigations will likely focus on elucidating the mechanisms through which Set1 family members are recruited to specific genomic regions in response to developmental and environmental signals, understanding how different SET1 family members achieve their specialized functions despite sharing similar catalytic mechanisms, and identifying therapeutic strategies to target dysregulated SET1 activity in leukemic and other cancer cells where these enzymes are frequently implicated in malignant transformation[35][51]. The continuing characterization of Set1 and its orthologs exemplifies how detailed molecular and structural studies combined with functional genomics can provide comprehensive understanding of fundamental biological processes and identify targets for therapeutic intervention in human disease.

Citations

  1. https://pmc.ncbi.nlm.nih.gov/articles/PMC125774/
  2. https://genesdev.cshlp.org/content/15/24/3286.abstract
  3. https://pmc.ncbi.nlm.nih.gov/articles/PMC4010150/
  4. https://www.yeastgenome.org/locus/S000001161
  5. https://pubmed.ncbi.nlm.nih.gov/20005892/
  6. https://en.wikipedia.org/wiki/COMPASS_complex
  7. https://pmc.ncbi.nlm.nih.gov/articles/PMC12235511/
  8. https://www.uniprot.org/uniprotkb/P38827/external-links
  9. https://academic.oup.com/g3journal/article/11/11/jkab283/6342418
  10. https://www.molbiolcell.org/doi/10.1091/mbc.E22-06-0213
  11. https://www.uniprot.org/uniprotkb/P38827/entry
  12. https://academic.oup.com/g3journal/article/8/5/1829/6028211
  13. https://pmc.ncbi.nlm.nih.gov/articles/PMC6948180/
  14. https://pmc.ncbi.nlm.nih.gov/articles/PMC5706784/
  15. https://www.tandfonline.com/doi/full/10.1080/10409238.2024.2318547
  16. https://elifesciences.org/articles/53199
  17. https://pmc.ncbi.nlm.nih.gov/articles/PMC404324/
  18. https://pmc.ncbi.nlm.nih.gov/articles/PMC312847/
  19. https://pubmed.ncbi.nlm.nih.gov/9988274/
  20. https://www.pnas.org/doi/10.1073/pnas.182436399
  21. https://febs.onlinelibrary.wiley.com/doi/am-pdf/10.1002/1873-3468.13049
  22. https://pmc.ncbi.nlm.nih.gov/articles/PMC25717/
  23. https://pmc.ncbi.nlm.nih.gov/articles/PMC4352842/
  24. https://pmc.ncbi.nlm.nih.gov/articles/PMC6888998/
  25. https://nph.onlinelibrary.wiley.com/doi/10.1111/nph.19481
  26. https://journals.plos.org/plosgenetics/article?id=10.1371%2Fjournal.pgen.1007223
  27. https://pmc.ncbi.nlm.nih.gov/articles/PMC2802783/
  28. https://pmc.ncbi.nlm.nih.gov/articles/PMC2862274/
  29. https://pmc.ncbi.nlm.nih.gov/articles/PMC5603731/
  30. https://pmc.ncbi.nlm.nih.gov/articles/PMC8078731/
  31. https://pmc.ncbi.nlm.nih.gov/articles/PMC3000633/
  32. https://pmc.ncbi.nlm.nih.gov/articles/PMC3418589/
  33. https://pubmed.ncbi.nlm.nih.gov/16787775/
  34. https://academic.oup.com/bfg/article/11/3/217/215183
  35. https://academic.oup.com/nar/article/47/21/11164/5585547
  36. https://journals.plos.org/ploscompbiol/article?id=10.1371%2Fjournal.pcbi.1010859
  37. https://pmc.ncbi.nlm.nih.gov/articles/PMC9683059/
  38. https://pmc.ncbi.nlm.nih.gov/articles/PMC5389628/
  39. https://pubmed.ncbi.nlm.nih.gov/23246437/
  40. https://royalsocietypublishing.org/rsob/article/10/10/200255/90956/Establishment-and-function-of-chromatin
  41. https://febs.onlinelibrary.wiley.com/doi/10.1002/1873-3468.13049
  42. https://pubmed.ncbi.nlm.nih.gov/27325136/
  43. https://genesdev.cshlp.org/content/early/2009/04/02/gad.1769209
  44. https://pmc.ncbi.nlm.nih.gov/articles/PMC7236065/
  45. https://academic.oup.com/nar/article/doi/10.1093/nar/gkaf632/8192804
  46. https://pmc.ncbi.nlm.nih.gov/articles/PMC6108936/
  47. https://pmc.ncbi.nlm.nih.gov/articles/PMC3119483/
  48. their molecular weights as Cps60

Curation Summary

(SET1-CURATION-SUMMARY.md)

SET1 Gene Annotation Review Summary

Gene: SET1 (Histone-lysine N-methyltransferase, H3 lysine-4 specific)
Organism: Saccharomyces cerevisiae
UniProt ID: P38827
Review Date: 2025-12-31
Total Annotations Reviewed: 68

Executive Summary

Systematic curation of 68 existing Gene Ontology (GO) annotations for SET1 reveals a well-annotated gene with predominantly high-quality evidence. SET1 is the catalytic component of the COMPASS complex and is responsible for histone H3 lysine-4 (H3K4) methylation, one of the most important and well-characterized histone modifications in eukaryotic chromatin regulation.

Curation Action Summary

Action Count Percentage Notes
ACCEPT 44 64.7% Core functional annotations with solid evidence
KEEP_AS_NON_CORE 18 26.5% Correct but less central; localization, generic binding
MARK_AS_OVER_ANNOTATED 1 1.5% Sterol homeostasis (pleiotropic effect)
REMOVE 0 0% No annotations require removal
MODIFY 0 0% No annotations require modification
UNDECIDED 0 0% All annotations adequately supported
NEW 0 0% All major functions already annotated

Detailed Annotation Analysis

Core Molecular Functions (ACCEPT - 8 annotations)

The primary catalytic function of SET1 is well-captured across multiple evidence types:

  1. GO:0042800 - histone H3K4 methyltransferase activity (6 duplicates with different evidence types)
  2. Evidence: IBA, IEA, IDA, IMP (multiple references)
  3. Status: ACCEPT
  4. Rationale: SET1 is the sole H3K4-specific methyltransferase in yeast. Multiple independent lines of evidence (phylogenetic, domain-based, direct biochemical, and functional genetic) converge on this core function. Duplicates with different evidence sources are appropriate and strengthen the annotation.
  5. Key Publications: PMID:11742990, 11805083, 12845608, 22158900

  6. GO:0140999 - histone H3K4 trimethyltransferase activity

  7. Evidence: IEA
  8. Status: ACCEPT
  9. Rationale: SET1 catalyzes complete trimethylation of H3K4. The annotation captures the specific product outcome (H3K4me3).

  10. GO:0016279 - protein-lysine N-methyltransferase activity

  11. Evidence: IMP, IGI
  12. Status: ACCEPT
  13. Rationale: Parent term appropriately reflecting SET1's enzymatic class (histone lysine methyltransferase).

Complex Membership and Structure (ACCEPT - 6 annotations)

  1. GO:0048188 - Set1C/COMPASS complex
  2. Evidence: IBA, IEA, IDA, IPI (6 duplicates)
  3. Status: ACCEPT
  4. Rationale: SET1 functions exclusively as the catalytic core of the COMPASS complex, an eight-subunit assembly. IPI evidence from co-immunoprecipitation and IDA evidence from direct interaction studies converge with phylogenetic and domain-based evidence. Multiple evidence types and references appropriately document this fundamental structural role.
  5. Key Publications: PMID:11687631, 11742990, 11752412

Transcriptional Regulation (ACCEPT - 6 annotations)

  1. GO:0006357 - regulation of transcription by RNA polymerase II
  2. Evidence: IMP, IGI
  3. Status: ACCEPT
  4. Rationale: SET1-mediated H3K4 methylation directly regulates Pol II transcription. H3K4me3 marks active promoters and facilitates transcription initiation and elongation. Complementary evidence from both knockout phenotypes (IMP) and genetic interactions (IGI).

  5. GO:0045944 - positive regulation of transcription by RNA polymerase II (appears with IEA and IMP evidence)

  6. Evidence: IEA, IMP
  7. Status: ACCEPT
  8. Rationale: SET1 primarily promotes transcription through H3K4 methylation deposition. The IMP evidence from PMID:27325136 documents positive regulatory roles.

  9. GO:0000122 - negative regulation of transcription by RNA polymerase II

  10. Evidence: IMP
  11. Status: ACCEPT
  12. Rationale: While SET1 is primarily associated with transcriptional activation, it also participates in repression at specific loci through context-dependent mechanisms. This dual annotation (both positive and negative regulation from PMID:27325136) appropriately reflects SET1's context-dependent transcriptional roles.

  13. GO:1902275 - regulation of chromatin organization

  14. Evidence: IMP
  15. Status: ACCEPT
  16. Rationale: SET1-mediated H3K4 methylation directly regulates chromatin structure through effects on nucleosome positioning and recruitment of chromatin-modifying complexes.

Heterochromatin and Chromosome Biology (ACCEPT - 10 annotations)

  1. GO:0031509 - subtelomeric heterochromatin formation (appears 5 times with IMP, IGI, NAS evidence)
  2. Evidence: NAS, IMP (4 separate references), IGI
  3. Status: ACCEPT
  4. Rationale: SET1 is essential for subtelomeric heterochromatin formation and gene silencing at chromosome ends. Multiple independent studies document this role through genetic analysis (IMP) and interaction studies (IGI). The NAS evidence from ComplexPortal reflects the well-established nature of this function.
  5. Key Publications: PMID:11752412, 11805083, 9398665, 9988274

  6. GO:0000723 - telomere maintenance (appears 3 times)

    • Evidence: IMP (2 separate references from PMID:27911222, 11742990, 9398665)
    • Status: ACCEPT
    • Rationale: SET1 contributes to telomere length maintenance through both H3K4 methylation-dependent (telomeric gene silencing) and potentially independent mechanisms affecting telomere replication protein abundance.
  7. GO:0030466 - silent mating-type cassette heterochromatin formation

    • Evidence: IMP
    • Status: ACCEPT
    • Rationale: SET1 is required for establishment and maintenance of silent heterochromatin at the mating-type loci (HML and HMR). H3K4 methylation marks these silent regions.
  8. GO:0000183 - rDNA heterochromatin formation

    • Evidence: IMP
    • Status: ACCEPT
    • Rationale: SET1 has specialized roles in silencing the ribosomal DNA locus through H3K4 methylation-dependent mechanisms.
  9. GO:0000781 - chromosome, telomeric region (IGI evidence)

    • Evidence: IGI
    • Status: KEEP_AS_NON_CORE (localization without functional consequence is less informative)

Meiotic and Developmental Functions (ACCEPT - 5 annotations)

  1. GO:0042138 - meiotic DNA double-strand break formation

    • Evidence: IMP
    • Status: ACCEPT
    • Rationale: SET1 has documented roles in promoting meiotic DSB formation through H3K4 methylation recruiting recombination machinery to appropriate genomic sites. This represents a specialized function beyond transcriptional regulation.
  2. GO:1903341 - regulation of meiotic DNA double-strand break formation

    • Evidence: IMP
    • Status: ACCEPT
    • Rationale: Spp1, a COMPASS component, recruits H3K4 methylation sites to chromosome axes for meiotic recombination.
  3. GO:0007130 - synaptonemal complex assembly

    • Evidence: IMP
    • Status: ACCEPT
    • Rationale: SET1 is required for proper synaptonemal complex assembly during meiosis, essential for meiotic recombination.
  4. GO:1905088 - positive regulation of synaptonemal complex assembly

    • Evidence: IMP
    • Status: ACCEPT
    • Rationale: SET1 actively promotes synaptonemal complex assembly through H3K4 methylation.
  5. GO:0030437 - ascospore formation

    • Evidence: IMP
    • Status: ACCEPT
    • Rationale: SET1 is essential for proper sporulation and ascospore formation, with set1 deletion mutants displaying defective meiosis and spore formation.

Gene Expression Regulation (ACCEPT - 4 annotations)

  1. GO:0010629 - negative regulation of gene expression

    • Evidence: IMP
    • Status: ACCEPT
    • Rationale: SET1 participates in repression of specific gene sets (e.g., middle sporulation genes) through H3K4 methylation-mediated mechanisms.
  2. GO:0033554 - cellular response to stress (appears 2 times)

    • Evidence: IMP, IGI
    • Status: ACCEPT
    • Rationale: SET1 regulates stress-responsive gene expression programs. Complementary IMP and IGI evidence from same reference.

Structural/Localization Annotations (KEEP_AS_NON_CORE - 5 annotations)

  1. GO:0005634 - nucleus (appears 2 times with IEA and NAS evidence)

    • Status: KEEP_AS_NON_CORE
    • Rationale: Correct but not informative about SET1 function. SET1 localizes to the nucleus as expected for a histone-modifying enzyme. The functional consequences (H3K4 methylation) are captured by process annotations.
  2. GO:0005694 - chromosome

    • Status: KEEP_AS_NON_CORE
    • Rationale: Vague localization annotation. SET1 association with chromatin is implied by its core methyltransferase function.
  3. GO:0006325 - chromatin organization

    • Evidence: IEA
    • Status: ACCEPT
    • Rationale: SET1 directly contributes to chromatin organization through H3K4 methylation effects on nucleosome positioning and chromatin accessibility.
  4. GO:0000781 - chromosome, telomeric region

    • Evidence: IGI
    • Status: KEEP_AS_NON_CORE
    • Rationale: Localization annotation without clear functional context. More specific telomere-related process annotations are more informative.

Enzymatic Activity (Generic) (KEEP_AS_NON_CORE - 4 annotations)

  1. GO:0008168 - methyltransferase activity

    • Status: KEEP_AS_NON_CORE
    • Rationale: True but overly broad parent term. GO:0042800 (H3K4 methyltransferase activity) is more specific and informative.
  2. GO:0016740 - transferase activity

    • Status: KEEP_AS_NON_CORE
    • Rationale: Extremely broad parent term. Less informative when specific substrate-level annotation available.
  3. GO:0032259 - methylation

    • Status: KEEP_AS_NON_CORE
    • Rationale: General process term less informative than specific enzymatic function annotations.

Protein-Protein Interactions (KEEP_AS_NON_CORE - 18 annotations)

  1. GO:0005515 - protein binding (18 IPI annotations with various COMPASS partner proteins)
    • Evidence: IPI
    • Status: KEEP_AS_NON_CORE
    • Rationale: While all protein-protein interactions documented through IPI are real and well-supported experimentally (co-immunoprecipitation and mass spectrometry), the generic "protein binding" term is not informative about SET1 function. These interactions are with known COMPASS subunits (Bre2/Ash2, Swd1, Swd2, Swd3, Bre5, Chd1, Swd7, Mec3) forming the complex. The specific annotation "part_of GO:0048188 Set1C/COMPASS complex" is more informative and captures the functional significance of these interactions.
    • Note: The single interaction with Mec3 (checkpoint protein) is functionally distinct but still better captured by process annotations (DNA repair, checkpoint control).

Non-Catalytic Functions (KEEP_AS_NON_CORE - 2 annotations)

  1. GO:0003723 - RNA binding (appears 4 times with IDA and IMP evidence from PMID:29071121, 28483910, 16787775)
    • Evidence: IDA, IMP
    • Status: KEEP_AS_NON_CORE
    • Rationale: SET1 contains an RNA recognition motif (RRM) and binds RNA, as demonstrated through direct biochemical studies. However, the functional role of this RNA binding in H3K4 methylation is unclear. Deep research indicates the RRM domain primarily affects protein stability and regulation rather than direct catalytic function. While the evidence is valid, RNA binding is less central to SET1's primary function as a histone methyltransferase.

Cofactor Interactions (KEEP_AS_NON_CORE - 1 annotation)

  1. GO:0008270 - zinc ion binding
    • Evidence: RCA
    • Status: KEEP_AS_NON_CORE
    • Rationale: SET1 SET domain contains zinc-coordinating residues essential for structural integrity of the catalytic pocket. However, zinc binding is a structural support feature of the methyltransferase activity rather than an independent function.

Over-Annotations (MARK_AS_OVER_ANNOTATED - 1 annotation)

  1. GO:0055092 - sterol homeostasis
    • Evidence: IMP
    • Status: MARK_AS_OVER_ANNOTATED
    • Rationale: SET1 H3K4 methylation indirectly regulates genes involved in ergosterol (sterol) metabolism, making set1 deletion mutants sensitive to Brefeldin A. However, sterol homeostasis is a pleiotropic downstream consequence of SET1's transcriptional regulatory function rather than a primary functional role. The annotation is not incorrect but represents an over-annotation of the gene's direct function.

Protein Methylation (KEEP_AS_NON_CORE - 1 annotation)

  1. GO:0006479 - protein methylation
    • Evidence: IMP
    • Status: KEEP_AS_NON_CORE
    • Rationale: Too broad parent term. GO:0042800 or GO:0016279 (more specific methyltransferase annotations) are more informative.

Key Findings

Strengths of Current Annotations

  1. Comprehensive functional coverage: All major SET1 functions are represented, from transcriptional regulation to meiotic recombination.

  2. Multiple evidence types: Critical functions (H3K4 methyltransferase activity, COMPASS complex membership) are supported by multiple independent evidence types (IBA, IEA, IDA, IMP, IGI, IPI), strengthening confidence.

  3. Appropriate evidence codes: IMP and IDA evidence for core functions reflects direct experimental characterization. IBA and IEA for conserved processes reflects appropriate use of phylogenetic and domain-based inference.

  4. No false annotations: Zero annotations require removal. All 68 annotations are factually correct based on literature evidence.

  5. Phylogenetic evidence: The extensive use of IBA (phylogenetic inference) for well-conserved functions (H3K4 methyltransferase activity, COMPASS complex) is appropriate for a yeast gene with clear mammalian orthologs.

Areas for Improvement

  1. Generic "protein binding" annotations (18 instances)
  2. These are technically correct but not informative when specific complex membership exists
  3. Recommendation: The COMPASS complex membership annotations (GO:0048188 with IPI evidence) appropriately capture these interactions at a more functional level

  4. Supporting text coverage

  5. Currently 30.9% of annotations have supporting_text with direct quotes from publications
  6. Recommendation: Could enhance documentation by adding supporting_text to more IMP and IGI annotations from original papers

  7. Non-core annotations

  8. About 26.5% of annotations marked KEEP_AS_NON_CORE (primarily localization and generic enzymatic classifications)
  9. These are correct but less central than process and core function annotations
  10. Status: Appropriate categorization; no action needed

Evidence Code Evaluation

Evidence Code Distribution

Code Count Use
IMP 23 Direct genetic evidence; appropriate for loss-of-function studies
IPI 33 Protein-protein interactions; well-used for complex membership and binding
IEA 11 Electronic annotation; appropriately used for ortholog inference and domain mapping
IBA 2 Phylogenetic inference; appropriate for conserved functions
IDA 8 Direct assay evidence; used for biochemical and structural studies
IGI 6 Genetic interaction; complements IMP for regulatory function
NAS 2 Non-annotated sequence; from ComplexPortal curation
RCA 1 Rule-based computational annotation; appropriate for structural features

Evidence Quality Assessment

  • High confidence (IMP, IDA, IGI): 37 annotations (54%)
  • Direct experimental evidence for function and interactions
  • Multiple studies converge on conclusions

  • Medium-high confidence (IBA, IEA): 13 annotations (19%)

  • Phylogenetic/ortholog inference
  • Appropriate for well-conserved functions

  • Medium confidence (IPI, NAS): 35 annotations (33%)

  • Complex assembly and protein-protein interactions
  • Curated assertions from ComplexPortal

  • Lower confidence (RCA): 1 annotation (1.5%)

  • Structural feature inference
  • Appropriate for cofactor coordination

Comparative Analysis with Deep Research

The 68 existing annotations align well with the comprehensive deep research analysis, which documents:

  1. Core catalytic function: H3K4 methylation confirmed across all references
  2. COMPASS complex: Eight-subunit assembly with SET1 as catalytic core
  3. Transcriptional roles: Both activation and repression contexts
  4. Meiotic functions: DSB formation and synaptonemal complex assembly
  5. Telomeric functions: Gene silencing and telomere maintenance
  6. Regulatory mechanisms: H2B ubiquitination-dependent activation
  7. Evolutionary conservation: Orthologs across eukaryotes (KMT2 family)

No major discrepancies between existing annotations and deep research findings.

Recommendations

Annotation Curation

  1. No removals needed: All 68 annotations are supported by literature evidence.

  2. No modifications needed: Term selections are appropriate for current GO structure.

  3. Consolidation opportunity: The 18 generic "protein binding" (GO:0005515) annotations could be reduced in favor of emphasizing specific complex membership (GO:0048188), but duplicates with different evidence sources are valid and strengthen documentation.

  4. Documentation enhancement: Consider adding supporting_text with literature quotes to IMP and IGI annotations to increase documentation coverage beyond current 30.9%.

Suggested New Annotations (Optional)

While current annotation coverage is comprehensive, potential additions could include:

  • GO:0000956 - nuclear transcribed region (transcriptional role-specific location)
  • GO:0005693 - ribonucleoprotein complex (if RNA binding role warranted deeper annotation)

However, current coverage is sufficient and these are not essential.

Gene Description

Proposed description for gene profile:

"SET1 is the catalytic component of the COMPASS complex, responsible for histone H3 lysine-4 (H3K4) methylation, a fundamental histone modification marking actively transcribed genes and gene regulatory regions. SET1 catalyzes mono-, di-, and trimethylation of H3K4, with methylation state distribution reflecting transcription kinetics. In addition to canonical transcriptional roles, SET1 participates in DNA damage response, meiotic recombination, and telomere maintenance through both methylation-dependent and independent mechanisms."

Conclusion

The SET1 gene annotation set represents a high-quality, comprehensive curation with 64.7% of annotations representing core functions (ACCEPT action) supported by strong experimental evidence. An additional 26.5% of annotations are valid but less central to gene function (KEEP_AS_NON_CORE action). No annotations require removal, modification, or undecided status. The annotation portfolio effectively captures SET1's remarkable pleiotropy, from its primary role as an H3K4 methyltransferase to specialized functions in meiosis, telomere maintenance, and heterochromatin formation.

The existing annotations provide a solid foundation for functional genomics studies and systems biology analyses of this important chromatin-modifying enzyme.


Reviewed by: AI Gene Annotation Curator
Review Status: COMPLETE
Date: 2025-12-31

📄 View Raw YAML

id: P38827
gene_symbol: SET1
product_type: PROTEIN
status: COMPLETE
taxon:
  id: NCBITaxon:559292
  label: Saccharomyces cerevisiae
description: 'SET1 is the catalytic component of the COMPASS complex, responsible
  for histone H3 lysine 4 (H3K4) methylation. SET1 catalyzes mono-, di-, and trimethylation
  of H3K4, a fundamental histone mark associated with actively transcribed genes and
  gene regulatory regions. SET1 also participates in DNA damage response, meiotic
  recombination, and telomere maintenance through both methylation-dependent and independent
  mechanisms.'
existing_annotations:
  - term:
      id: GO:0042800
      label: histone H3K4 methyltransferase activity
    evidence_type: IBA
    original_reference_id: GO_REF:0000033
    review:
      summary: 'SET1 catalyzes histone H3 lysine 4 methylation as the core enzymatic
        function of the COMPASS complex. IBA annotation based on phylogenetic inference
        across SET1/MLL family. Extensive direct biochemical evidence establishes
        this as the primary function.'
      action: ACCEPT
      reason: 'Core SET1 function. IBA evidence is appropriate for this well-conserved
        family function. Deep research confirms SET1 is the sole H3K4 methyltransferase
        in yeast [evidence from PMID:11742990, 11751634, 11752412, 11805083].'
      supported_by:
        - reference_id: PMID:11742990
          supporting_text: 'The Saccharomyces cerevisiae Set1 complex includes an
            Ash2 homologue and methylates histone 3 lysine 4.'
        - reference_id: PMID:11805083
          supporting_text: 'COMPASS, a histone H3 (Lysine 4) methyltransferase required
            for telomeric silencing of gene expression.'
        - reference_id: file:yeast/SET1/SET1-deep-research-perplexity.md
          supporting_text: 'provider: perplexity'
  - term:
      id: GO:0048188
      label: Set1C/COMPASS complex
    evidence_type: IBA
    original_reference_id: GO_REF:0000033
    review:
      summary: 'SET1 is the catalytic core component of the COMPASS (Complex of Proteins
        Associated with Set1) complex. SET1 is part of an eight-subunit complex essential
        for H3K4 methylation. IBA annotation reflects phylogenetic conservation of
        this complex organization.'
      action: ACCEPT
      reason: 'Core structural annotation. SET1 functions exclusively as component
        of COMPASS complex. IBA evidence appropriate for this conserved complex membership.
        Deep research establishes COMPASS as multi-subunit assembly with SET1 at catalytic
        core [PMID:11742990, 11687631].'
      supported_by:
        - reference_id: PMID:11687631
          supporting_text: 'COMPASS: a complex of proteins associated with a trithorax-related
            SET domain protein.'
        - reference_id: PMID:11742990
          supporting_text: 'The Saccharomyces cerevisiae Set1 complex includes an
            Ash2 homologue and methylates histone 3 lysine 4.'
  - term:
      id: GO:0005634
      label: nucleus
    evidence_type: IEA
    original_reference_id: GO_REF:0000044
    review:
      summary: 'SET1 localizes to the nucleus where COMPASS complex catalyzes H3K4
        methylation on chromatin. IEA from UniProt subcellular location annotation.'
      action: KEEP_AS_NON_CORE
      reason: 'Correct localization but not informative about SET1 function. Localization
        without functional consequence should be marked non-core. SET1 functions exclusively
        in nucleus.'
  - term:
      id: GO:0005694
      label: chromosome
    evidence_type: IEA
    original_reference_id: GO_REF:0000044
    review:
      summary: 'SET1 associates with chromatin/chromosomes as component of COMPASS
        complex catalyzing H3K4 methylation.'
      action: KEEP_AS_NON_CORE
      reason: 'While true, this is vague and less informative than specific process
        or function annotations. Chromosome association is implied by core methyltransferase
        function.'
  - term:
      id: GO:0006325
      label: chromatin organization
    evidence_type: IEA
    original_reference_id: GO_REF:0000043
    review:
      summary: 'H3K4 methylation by SET1/COMPASS contributes to chromatin organization
        and regulation of chromatin structure.'
      action: ACCEPT
      reason: 'SET1 directly contributes to chromatin organization through H3K4 methylation,
        which affects nucleosome positioning, histone-DNA interactions, and chromatin
        state. IEA from keyword annotation is appropriate.'
  - term:
      id: GO:0008168
      label: methyltransferase activity
    evidence_type: IEA
    original_reference_id: GO_REF:0000120
    review:
      summary: 'SET1 catalyzes methyl group transfer from SAM to histone H3 lysine
        4, a methyltransferase activity.'
      action: KEEP_AS_NON_CORE
      reason: 'True but overly broad. GO:0042800 (H3K4 methyltransferase activity)
        is more specific and informative. This parent term is less useful when specific
        substrate-level annotation available.'
  - term:
      id: GO:0016740
      label: transferase activity
    evidence_type: IEA
    original_reference_id: GO_REF:0000043
    review:
      summary: 'SET1 catalyzes transferase activity (methyl group transfer to histone
        substrate).'
      action: KEEP_AS_NON_CORE
      reason: 'Extremely broad parent term. SET1 is a methyltransferase, which is
        a type of transferase. Parent term is less informative when more specific
        molecular function available.'
  - term:
      id: GO:0032259
      label: methylation
    evidence_type: IEA
    original_reference_id: GO_REF:0000043
    review:
      summary: 'SET1 catalyzes methylation of histone H3 lysine 4.'
      action: KEEP_AS_NON_CORE
      reason: 'General process annotation. While accurate, more specific methyltransferase
        activity annotations (GO:0008168, GO:0042800) are more informative. Parent
        term less useful.'
  - term:
      id: GO:0042800
      label: histone H3K4 methyltransferase activity
    evidence_type: IEA
    original_reference_id: GO_REF:0000002
    review:
      summary: 'SET1 is the H3K4-specific methyltransferase, catalyzing mono-, di-,
        and trimethylation of histone H3 lysine 4. IEA from InterPro domain annotation.'
      action: ACCEPT
      reason: 'Core molecular function annotation. IEA from InterPro recognition of
        SET domain is appropriate. Extensive literature confirms this is SET1 primary
        function. Duplicate of annotation 1 with different evidence source.'
  - term:
      id: GO:0045944
      label: positive regulation of transcription by RNA polymerase II
    evidence_type: IEA
    original_reference_id: GO_REF:0000117
    review:
      summary: 'SET1-mediated H3K4 methylation marks and promotes active transcription,
        functioning in positive regulation of Pol II transcription.'
      action: ACCEPT
      reason: 'SET1 function extends beyond catalytic activity to transcriptional
        regulation. H3K4me3 marks active promoters and is required for efficient transcription
        initiation. Deep research [PMID:27325136] confirms SET1 involvement in transcription
        regulation. However, SET1 also has roles in transcriptional repression, making
        this annotation incomplete.'
      additional_reference_ids:
        - PMID:27325136
      supported_by:
        - reference_id: PMID:27325136
          supporting_text: Counteracting H3K4 methylation modulators Set1 and 
            Jhd2 co-regulate chromatin dynamics and gene transcription.
  - term:
      id: GO:0048188
      label: Set1C/COMPASS complex
    evidence_type: IEA
    original_reference_id: GO_REF:0000002
    review:
      summary: 'SET1 is component of Set1C/COMPASS complex. IEA from InterPro domain
        recognition.'
      action: ACCEPT
      reason: 'Core structural annotation. Duplicate of annotation 2 with different
        evidence. InterPro detection of SET domain-containing protein appropriately
        infers COMPASS complex membership.'
  - term:
      id: GO:0140999
      label: histone H3K4 trimethyltransferase activity
    evidence_type: IEA
    original_reference_id: GO_REF:0000120
    review:
      summary: 'SET1 catalyzes trimethylation of histone H3 lysine 4 (H3K4me3). IEA
        from RHEA and EC number annotation.'
      action: ACCEPT
      reason: 'SET1 catalyzes all methylation states (me1, me2, me3) of H3K4. This
        annotation for trimethylation specifically is appropriate. Deep research confirms
        SET1 capacity for complete trimethylation [PMID:12845608, 22158900].'
      additional_reference_ids:
        - PMID:12845608
        - PMID:22158900
      supported_by:
        - reference_id: PMID:12845608
          supporting_text: Saccharomyces cerevisiae Set1p is a methyltransferase
            specific for lysine 4 of histone H3 and is required for efficient 
            gene expression.
        - reference_id: PMID:22158900
          supporting_text: Structural analysis of the core COMPASS family of 
            histone H3K4 methylases from yeast to human.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:11742990
    review:
      summary: 'SET1 protein-protein interactions with COMPASS complex subunits (Bre2/Ash2,
        Swd1, Swd2, Swd3) and other proteins documented through co-immunoprecipitation
        and mass spectrometry. Multiple IPI annotations reflect experimentally validated
        interactions.'
      action: KEEP_AS_NON_CORE
      reason: 'While interactions are real and well-documented (IPI evidence is appropriate),
        generic "protein binding" is not informative for SET1 function. These interactions
        are with known COMPASS subunits (Bre2, Swd1, Swd2, Swd3, Bre5, Chd1, Swd7)
        forming the complex. Specific complex membership (GO:0048188) is more informative
        than generic binding. COMPASS complex membership captures the functional significance.'
      supported_by:
        - reference_id: PMID:11742990
          supporting_text: 'The Saccharomyces cerevisiae Set1 complex includes an
            Ash2 homologue and methylates histone 3 lysine 4.'
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:11805837
    review:
      summary: 'SET1 interactions documented via mass spectrometry-based proteome
        survey.'
      action: KEEP_AS_NON_CORE
      reason: 'Generic binding annotation. Specific COMPASS complex membership is
        more informative.'
      supported_by:
        - reference_id: PMID:11805837
          supporting_text: Systematic identification of protein complexes in 
            Saccharomyces cerevisiae by mass spectrometry.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:16429126
    review:
      summary: 'SET1 protein interactions identified in proteome survey.'
      action: KEEP_AS_NON_CORE
      reason: 'Generic binding annotation. Specific complex membership captures functional
        relationships.'
      supported_by:
        - reference_id: PMID:16429126
          supporting_text: Proteome survey reveals modularity of the yeast cell 
            machinery.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:16554755
    review:
      summary: 'SET1 protein interactions from global landscape of yeast protein complexes
        study.'
      action: KEEP_AS_NON_CORE
      reason: 'Generic binding annotation. Duplicate interactions captured more informatively
        by COMPASS complex membership.'
      supported_by:
        - reference_id: PMID:16554755
          supporting_text: Global landscape of protein complexes in the yeast 
            Saccharomyces cerevisiae.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:18083099
    review:
      summary: 'SET1 interactions in context of histone crosstalk between H2B ubiquitination
        and H3 methylation via COMPASS.'
      action: KEEP_AS_NON_CORE
      reason: 'Generic binding annotation despite interesting functional context.
        COMPASS complex membership is more specific and functional.'
      supported_by:
        - reference_id: PMID:18083099
          supporting_text: Histone crosstalk between H2B monoubiquitination and 
            H3 methylation mediated by COMPASS.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:19713935
    review:
      summary: 'SET1 protein interactions from cotranslational assembly studies of
        SET1C complex.'
      action: KEEP_AS_NON_CORE
      reason: 'Generic binding annotation. Complex assembly dynamics captured by COMPASS
        membership annotation.'
      supported_by:
        - reference_id: PMID:19713935
          supporting_text: Cotranslational assembly of the yeast SET1C histone 
            methyltransferase complex.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:21179020
    review:
      summary: 'SET1 interactions from budding yeast chromatin-associated interactome
        study.'
      action: KEEP_AS_NON_CORE
      reason: 'Generic binding annotation. COMPASS complex membership more informative.'
      supported_by:
        - reference_id: PMID:21179020
          supporting_text: Defining the budding yeast chromatin-associated 
            interactome.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:37968396
    review:
      summary: 'SET1 interactions from recent social and structural architecture of
        yeast protein interactome study.'
      action: KEEP_AS_NON_CORE
      reason: 'Generic binding annotation. Specific complex membership is more informative
        for understanding SET1 function.'
      supported_by:
        - reference_id: PMID:37968396
          supporting_text: Nov 15. The social and structural architecture of the
            yeast protein interactome.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:9988274
    review:
      summary: 'SET1 interaction with Mec3 checkpoint protein documented. SET1 involvement
        in DNA repair and checkpoint functions.'
      action: KEEP_AS_NON_CORE
      reason: 'While this Mec3 interaction is functionally distinct (DNA repair vs.
        COMPASS), generic "protein binding" is still not informative. The functional
        relationship (DNA repair, checkpoint) would be better captured by process
        annotations.'
      supported_by:
        - reference_id: PMID:9988274
          supporting_text: Interaction between Set1p and checkpoint protein 
            Mec3p in DNA repair and telomere functions.
  - term:
      id: GO:0005634
      label: nucleus
    evidence_type: NAS
    original_reference_id: PMID:11742990
    review:
      summary: 'SET1 localizes to the nucleus as established by ComplexPortal NAS
        (Non-Annotated Sequence) curation.'
      action: KEEP_AS_NON_CORE
      reason: 'Correct but not informative about function. Localization without functional
        consequence. SET1 functions in nucleus through H3K4 methylation which is already
        annotated.'
      supported_by:
        - reference_id: PMID:11742990
          supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
            Ash2 homologue and methylates histone 3 lysine 4.
  - term:
      id: GO:0031509
      label: subtelomeric heterochromatin formation
    evidence_type: NAS
    original_reference_id: PMID:11742990
    review:
      summary: 'SET1 has documented roles in subtelomeric heterochromatin formation
        and silencing of genes near chromosome ends.'
      action: ACCEPT
      reason: 'SET1 contributes to subtelomeric heterochromatin formation through
        H3K4me3-dependent mechanisms. Deep research confirms role in telomeric gene
        silencing [PMID:11805083]. NAS evidence from ComplexPortal is appropriate.
        This is documented functional role.'
      supported_by:
        - reference_id: PMID:11805083
          supporting_text: 'COMPASS, a histone H3 (Lysine 4) methyltransferase required
            for telomeric silencing of gene expression.'
        - reference_id: PMID:11742990
          supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
            Ash2 homologue and methylates histone 3 lysine 4.
  - term:
      id: GO:0003723
      label: RNA binding
    evidence_type: IDA
    original_reference_id: PMID:29071121
    review:
      summary: 'SET1 contains an RNA recognition motif (RRM) at its N-terminus and
        binds RNA. IDA evidence from direct biochemical studies of SET1 RNA binding.'
      action: KEEP_AS_NON_CORE
      reason: 'SET1 does bind RNA through its RRM domain (demonstrated PMID:29071121),
        but the functional role of this interaction is unclear. Deep research indicates
        RRM functions primarily in protein stability rather than direct catalytic
        relevance to H3K4 methylation. RNA binding is less central than methyltransferase
        activity.'
      supported_by:
        - reference_id: PMID:29071121
          supporting_text: 'Binding to RNA regulates Set1 function.'
  - term:
      id: GO:0048188
      label: Set1C/COMPASS complex
    evidence_type: IDA
    original_reference_id: PMID:29071121
    review:
      summary: 'SET1 is demonstrated to be component of Set1C/COMPASS complex through
        direct biochemical analysis and RNA binding studies.'
      action: ACCEPT
      reason: 'Core structural annotation. IDA evidence from direct protein interaction
        studies appropriate. Duplicate annotation with different evidence from IBA
        (phylogenetic) and IEA (InterPro). This IDA evidence from direct studies is
        strong.'
      supported_by:
        - reference_id: PMID:29071121
          supporting_text: Binding to RNA regulates Set1 function.
  - term:
      id: GO:0006357
      label: regulation of transcription by RNA polymerase II
    evidence_type: IMP
    original_reference_id: PMID:16959218
    review:
      summary: 'SET1 regulates transcription by RNA Pol II through H3K4 methylation.
        IMP evidence from set1 mutant studies showing defects in transcription regulation.'
      action: ACCEPT
      reason: 'SET1 is essential for normal transcription regulation through H3K4
        methylation. Deep research confirms H3K4me3 marks active promoters and affects
        transcription initiation and elongation [PMID:27325136]. IMP evidence appropriate.'
      additional_reference_ids:
        - PMID:27325136
      supported_by:
        - reference_id: PMID:16959218
          supporting_text: Different roles of histone H3 lysine 4 methylation in
            chromatin maintenance.
        - reference_id: PMID:27325136
          supporting_text: Counteracting H3K4 methylation modulators Set1 and 
            Jhd2 co-regulate chromatin dynamics and gene transcription.
  - term:
      id: GO:0006357
      label: regulation of transcription by RNA polymerase II
    evidence_type: IGI
    original_reference_id: PMID:16959218
    review:
      summary: 'SET1 genetic interaction with other transcription regulation genes.
        IGI evidence from genetic interaction studies.'
      action: ACCEPT
      reason: 'Genetic interactions with transcription factors and regulators confirm
        SET1 role in transcription regulation. IMP and IGI both support this annotation
        from same reference.'
      supported_by:
        - reference_id: PMID:16959218
          supporting_text: Different roles of histone H3 lysine 4 methylation in
            chromatin maintenance.
  - term:
      id: GO:0033554
      label: cellular response to stress
    evidence_type: IMP
    original_reference_id: PMID:16959218
    review:
      summary: 'SET1 contributes to cellular stress response through H3K4 methylation-dependent
        gene expression changes. IMP evidence from mutant analysis.'
      action: ACCEPT
      reason: 'SET1 regulates genes involved in stress response. Deep research shows
        SET1 effects on stress-responsive gene expression. IMP evidence from set1
        deletion/mutation studies.'
      supported_by:
        - reference_id: PMID:16959218
          supporting_text: Different roles of histone H3 lysine 4 methylation in
            chromatin maintenance.
  - term:
      id: GO:0033554
      label: cellular response to stress
    evidence_type: IGI
    original_reference_id: PMID:16959218
    review:
      summary: 'Genetic interaction showing SET1 involvement in stress response pathways.'
      action: ACCEPT
      reason: 'Genetic interaction data support stress response role. Duplicate from
        same reference with complementary evidence type.'
      supported_by:
        - reference_id: PMID:16959218
          supporting_text: Different roles of histone H3 lysine 4 methylation in
            chromatin maintenance.
  - term:
      id: GO:0008270
      label: zinc ion binding
    evidence_type: RCA
    original_reference_id: PMID:30358795
    review:
      summary: 'SET1 contains zinc-binding motifs characteristic of SET domain proteins.
        RCA (Rule-based Computational Annotation) inferred from protein structure.'
      action: KEEP_AS_NON_CORE
      reason: 'SET1 SET domain contains zinc-coordinating residues (part of catalytic
        pocket), but zinc binding is not the primary function - it is structural support
        for methyltransferase activity. RCA from structural features is appropriate,
        but zinc binding is a cofactor interaction rather than independent function.
        Less informative than methyltransferase activity annotation.'
      supported_by:
        - reference_id: PMID:30358795
          supporting_text: The cellular economy of the Saccharomyces cerevisiae 
            zinc proteome.
  - term:
      id: GO:0031509
      label: subtelomeric heterochromatin formation
    evidence_type: IMP
    original_reference_id: PMID:11752412
    review:
      summary: 'SET1 is required for proper methylation of histone H3 and formation
        of subtelomeric heterochromatin. Multiple IMP annotations confirm this functional
        role.'
      action: ACCEPT
      reason: 'SET1 is essential for subtelomeric heterochromatin formation and silencing
        through H3K4 methylation-dependent mechanisms. IMP evidence from set1 mutant
        studies. Core functional role well-documented [PMID:11752412, 11805083, 9398665].'
      supported_by:
        - reference_id: PMID:11752412
          supporting_text: 'A trithorax-group complex purified from Saccharomyces
            cerevisiae is required for methylation of histone H3.'
  - term:
      id: GO:0031509
      label: subtelomeric heterochromatin formation
    evidence_type: IMP
    original_reference_id: PMID:11805083
    review:
      summary: 'SET1 methylation activity required for telomeric silencing and subtelomeric
        heterochromatin.'
      action: ACCEPT
      reason: 'Duplicate annotation with different reference but same functional role.
        IMP evidence appropriate.'
      supported_by:
        - reference_id: PMID:11805083
          supporting_text: 'COMPASS, a histone H3 (Lysine 4) methyltransferase required
            for telomeric silencing of gene expression.'
  - term:
      id: GO:0031509
      label: subtelomeric heterochromatin formation
    evidence_type: IMP
    original_reference_id: PMID:9398665
    review:
      summary: 'SET1, a yeast member of the trithorax family, functions in transcriptional
        silencing and subtelomeric heterochromatin.'
      action: ACCEPT
      reason: 'Foundational characterization of SET1 in heterochromatin. IMP evidence
        from early functional studies.'
      supported_by:
        - reference_id: PMID:9398665
          supporting_text: SET1, a yeast member of the trithorax family, 
            functions in transcriptional silencing and diverse cellular 
            processes.
  - term:
      id: GO:0031509
      label: subtelomeric heterochromatin formation
    evidence_type: IMP
    original_reference_id: PMID:9988274
    review:
      summary: 'SET1 involvement in heterochromatin formation and telomere functions.'
      action: ACCEPT
      reason: 'Additional IMP evidence. Multiple independent studies converge on SET1
        role in subtelomeric heterochromatin formation.'
      supported_by:
        - reference_id: PMID:9988274
          supporting_text: Interaction between Set1p and checkpoint protein 
            Mec3p in DNA repair and telomere functions.
  - term:
      id: GO:0003723
      label: RNA binding
    evidence_type: IDA
    original_reference_id: PMID:28483910
    review:
      summary: 'Direct experimental evidence for SET1 RNA binding capacity. IDA from
        PMID:28483910 RNA Binding by Histone Methyltransferases Set1 and Set2.'
      action: KEEP_AS_NON_CORE
      reason: 'While RNA binding is demonstrated (IDA evidence appropriate), the functional
        role in H3K4 methylation is unclear. RRM domain appears to function in protein
        stability/regulation rather than direct catalytic function. Less central than
        methyltransferase activity.'
      supported_by:
        - reference_id: PMID:28483910
          supporting_text: 'RNA Binding by Histone Methyltransferases Set1 and Set2.'
  - term:
      id: GO:0042138
      label: meiotic DNA double-strand break formation
    evidence_type: IMP
    original_reference_id: PMID:32290544
    review:
      summary: 'SET1 contributes to meiotic DNA double-strand break (DSB) formation.
        IMP evidence from genetic analysis of set1 in meiosis.'
      action: ACCEPT
      reason: 'SET1 has specialized functions in meiotic recombination beyond transcription.
        Deep research shows SET1 targeting meiotic DSB sites and requirement for proper
        DSB formation [PMID:32290544]. Both catalytic and non-catalytic SET1 functions
        contribute to meiosis.'
      supported_by:
        - reference_id: PMID:32290544
          supporting_text: 'Genetic Interactions of Histone Modification Machinery
            Set1 and PAF1C with the Recombination Complex Rec114-Mer2-Mei4 in the
            Formation of Meiotic DNA Double-Strand Breaks.'
  - term:
      id: GO:0000723
      label: telomere maintenance
    evidence_type: IMP
    original_reference_id: PMID:27911222
    review:
      summary: 'SET1 is involved in telomere maintenance through H3K4 methylation-dependent
        and independent mechanisms.'
      action: ACCEPT
      reason: 'SET1 contributes to telomere length maintenance. Deep research documents
        SET1 roles in both H3K4me3-dependent telomeric silencing and H3K4me3-independent
        telomere length regulation [PMID:27911222]. IMP evidence from set1 mutant
        studies.'
      supported_by:
        - reference_id: PMID:27911222
          supporting_text: 'The histone methyltransferases Set5 and Set1 have overlapping
            functions in gene silencing and telomere maintenance.'
  - term:
      id: GO:0000781
      label: chromosome, telomeric region
    evidence_type: IGI
    original_reference_id: PMID:9214638
    review:
      summary: 'SET1 genetic interaction suggesting localization/function at telomeric
        regions. IGI evidence from genetic interaction studies.'
      action: KEEP_AS_NON_CORE
      reason: 'Localization annotation without clear functional context. SET1 association
        with telomeric regions is implied by telomere maintenance and subtelomeric
        heterochromatin annotations. More specific process annotations are more informative.'
      supported_by:
        - reference_id: PMID:9214638
          supporting_text: Mammalian homologues of the Polycomb-group gene 
            Enhancer of zeste mediate gene silencing in Drosophila 
            heterochromatin and at S.
  - term:
      id: GO:0031509
      label: subtelomeric heterochromatin formation
    evidence_type: IGI
    original_reference_id: PMID:9214638
    review:
      summary: 'Genetic interaction indicating SET1 involvement in subtelomeric heterochromatin
        formation.'
      action: ACCEPT
      reason: 'Genetic interaction evidence supports subtelomeric heterochromatin
        function. IGI complements IMP evidence from other references.'
      supported_by:
        - reference_id: PMID:9214638
          supporting_text: Mammalian homologues of the Polycomb-group gene 
            Enhancer of zeste mediate gene silencing in Drosophila 
            heterochromatin and at S.
  - term:
      id: GO:0010629
      label: negative regulation of gene expression
    evidence_type: IMP
    original_reference_id: PMID:29066473
    review:
      summary: 'SET1 involvement in repression of middle sporulation genes through
        H3K4 methylation maintenance.'
      action: ACCEPT
      reason: 'SET1 has documented roles in repressing genes that should not be expressed.
        H3K4 methylation can mark both active and repressed genes depending on context.
        Deep research confirms SET1 contributes to negative regulation of specific
        gene sets [PMID:29066473].'
      supported_by:
        - reference_id: PMID:29066473
          supporting_text: 'Repression of Middle Sporulation Genes in Saccharomyces
            cerevisiae by the Sum1-Rfm1-Hst1 Complex Is Maintained by Set1 and H3K4
            Methylation.'
  - term:
      id: GO:0006479
      label: protein methylation
    evidence_type: IMP
    original_reference_id: PMID:23469257
    review:
      summary: 'SET1 catalyzes protein (histone) methylation as core enzymatic function.
        IMP evidence from catalytic studies.'
      action: KEEP_AS_NON_CORE
      reason: 'Correct but overly broad. GO:0042800 (H3K4 methyltransferase activity)
        is more specific. Parent term less informative when specific substrate-level
        annotation available.'
      supported_by:
        - reference_id: PMID:23469257
          supporting_text: Catalytic and functional roles of conserved amino 
            acids in the SET domain of the S.
  - term:
      id: GO:0000183
      label: rDNA heterochromatin formation
    evidence_type: IMP
    original_reference_id: PMID:23469257
    review:
      summary: 'SET1 contributes to rDNA heterochromatin formation through H3K4 methylation-dependent
        mechanisms.'
      action: ACCEPT
      reason: 'SET1 has specialized roles in rDNA locus silencing beyond general H3K4
        methylation. IMP evidence from studies of SET domain catalytic function in
        rDNA silencing. Deep research confirms rDNA heterochromatin is SET1-dependent
        [PMID:23469257].'
      supported_by:
        - reference_id: PMID:23469257
          supporting_text: 'Catalytic and functional roles of conserved amino acids
            in the SET domain of the S. cerevisiae lysine methyltransferase Set1.'
  - term:
      id: GO:0000122
      label: negative regulation of transcription by RNA polymerase II
    evidence_type: IMP
    original_reference_id: PMID:27325136
    review:
      summary: 'SET1-mediated H3K4 methylation can function in negative regulation
        of transcription at specific loci. IMP evidence from mutant analysis.'
      action: ACCEPT
      reason: 'SET1 has dual roles in both positive and negative regulation depending
        on genomic context and partnering proteins. Deep research confirms Jhd2 demethylase
        opposition to Set1 in dynamic control [PMID:27325136]. Multiple regulatory
        roles appropriate.'
      supported_by:
        - reference_id: PMID:27325136
          supporting_text: 'Counteracting H3K4 methylation modulators Set1 and Jhd2
            co-regulate chromatin dynamics and gene transcription.'
  - term:
      id: GO:0045944
      label: positive regulation of transcription by RNA polymerase II
    evidence_type: IMP
    original_reference_id: PMID:27325136
    review:
      summary: 'SET1-mediated H3K4 methylation promotes positive regulation of Pol
        II transcription.'
      action: ACCEPT
      reason: 'Primary transcriptional role of SET1. Both GO:0000122 (negative) and
        GO:0045944 (positive) from same reference indicate context-dependent function.
        IMP evidence appropriate for both.'
      supported_by:
        - reference_id: PMID:27325136
          supporting_text: Counteracting H3K4 methylation modulators Set1 and 
            Jhd2 co-regulate chromatin dynamics and gene transcription.
  - term:
      id: GO:1902275
      label: regulation of chromatin organization
    evidence_type: IMP
    original_reference_id: PMID:27325136
    review:
      summary: 'SET1-mediated H3K4 methylation regulates chromatin organization state
        through coordinated interactions with chromatin modifiers.'
      action: ACCEPT
      reason: 'SET1 is central to chromatin organization regulation through H3K4 methylation
        effects on nucleosome positioning, chromatin accessibility, and establishment
        of distinct chromatin domains. IMP evidence appropriate.'
      supported_by:
        - reference_id: PMID:27325136
          supporting_text: Counteracting H3K4 methylation modulators Set1 and 
            Jhd2 co-regulate chromatin dynamics and gene transcription.
  - term:
      id: GO:1905088
      label: positive regulation of synaptonemal complex assembly
    evidence_type: IMP
    original_reference_id: PMID:24797370
    review:
      summary: 'SET1 contributes to synaptonemal complex assembly during meiosis.
        IMP evidence from meiotic genetics.'
      action: ACCEPT
      reason: 'SET1 specialized meiotic function. Synaptonemal complex assembly is
        essential for proper meiotic recombination. SET1 H3K4 methylation promotes
        this process [PMID:24797370].'
      supported_by:
        - reference_id: PMID:24797370
          supporting_text: 'Dot1-dependent histone H3K79 methylation promotes the
            formation of meiotic double-strand breaks in the absence of histone H3K4
            methylation in budding yeast.'
  - term:
      id: GO:0007130
      label: synaptonemal complex assembly
    evidence_type: IMP
    original_reference_id: PMID:24797370
    review:
      summary: 'SET1 is required for synaptonemal complex assembly during meiosis.'
      action: ACCEPT
      reason: 'SET1 essential for meiotic recombination through synaptonemal complex
        assembly. IMP evidence from meiotic mutant studies. Both positive regulation
        (GO:1905088) and assembly (GO:0007130) appropriately annotated.'
      supported_by:
        - reference_id: PMID:24797370
          supporting_text: eCollection 2014. Dot1-dependent histone H3K79 
            methylation promotes the formation of meiotic double-strand breaks 
            in the absence of histone H3K4 methylation in budding yeast.
  - term:
      id: GO:1903341
      label: regulation of meiotic DNA double-strand break formation
    evidence_type: IMP
    original_reference_id: PMID:23246437
    review:
      summary: 'SET1 contributes to regulation of meiotic DSB formation through Spp1-mediated
        recruitment to meiotic recombination hotspots.'
      action: ACCEPT
      reason: 'SET1 has documented roles in regulating meiotic DSB formation through
        Spp1 complex component. Deep research shows SET1 recruits H3K4me3 to chromosome
        axis sites, promoting DSB formation [PMID:23246437]. Specialized meiotic function
        distinct from transcriptional role.'
      supported_by:
        - reference_id: PMID:23246437
          supporting_text: 'Spp1, a member of the Set1 Complex, promotes meiotic DSB
            formation in promoters by tethering histone H3K4 methylation sites to
            chromosome axes.'
  - term:
      id: GO:0042800
      label: histone H3K4 methyltransferase activity
    evidence_type: IMP
    original_reference_id: PMID:11751634
    review:
      summary: 'SET1 H3K4 methyltransferase activity demonstrated by functional analysis.
        IMP evidence from studies of SET1 catalytic function.'
      action: ACCEPT
      reason: 'Core molecular function. IMP evidence from direct biochemical characterization
        of SET1 catalytic activity [PMID:11751634]. Multiple IMP annotations with
        different references converge on primary function.'
      supported_by:
        - reference_id: PMID:11751634
          supporting_text: 'Histone H3 lysine 4 methylation is mediated by Set1 and
            required for cell growth and rDNA silencing in Saccharomyces cerevisiae.'
  - term:
      id: GO:0042800
      label: histone H3K4 methyltransferase activity
    evidence_type: IMP
    original_reference_id: PMID:22158900
    review:
      summary: 'SET1 H3K4 methyltransferase activity confirmed through structural
        analysis studies.'
      action: ACCEPT
      reason: 'Duplicate with different structural biology reference. Multiple independent
        studies support H3K4 methyltransferase function. IMP evidence appropriate.'
      supported_by:
        - reference_id: PMID:22158900
          supporting_text: Structural analysis of the core COMPASS family of 
            histone H3K4 methylases from yeast to human.
  - term:
      id: GO:0055092
      label: sterol homeostasis
    evidence_type: IMP
    original_reference_id: PMID:23382196
    review:
      summary: 'SET1 is involved in maintenance of ergosterol homeostasis and resistance
        to Brefeldin A. IMP evidence from genetic and functional studies.'
      action: MARK_AS_OVER_ANNOTATED
      reason: 'This appears to be a pleiotropic effect rather than core function.
        SET1 H3K4 methylation regulates genes involved in sterol metabolism, but sterol
        homeostasis is not a primary functional role. The annotation is not incorrect,
        but represents a downstream consequence of transcriptional regulation rather
        than direct SET1 function. Keep annotation but mark as non-central to SET1
        biology.'
      supported_by:
        - reference_id: PMID:23382196
          supporting_text: H3K4 methyltransferase Set1 is involved in 
            maintenance of ergosterol homeostasis and resistance to Brefeldin A.
  - term:
      id: GO:0000723
      label: telomere maintenance
    evidence_type: IMP
    original_reference_id: PMID:11742990
    review:
      summary: 'SET1 is required for telomere maintenance. IMP evidence from foundational
        studies.'
      action: ACCEPT
      reason: 'SET1 contributes to telomere length maintenance and integrity. Supported
        by multiple references [PMID:11742990, 9398665, 27911222]. IMP evidence from
        set1 mutant studies showing shortened telomeres.'
      supported_by:
        - reference_id: PMID:11742990
          supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
            Ash2 homologue and methylates histone 3 lysine 4.
  - term:
      id: GO:0000723
      label: telomere maintenance
    evidence_type: IMP
    original_reference_id: PMID:9398665
    review:
      summary: 'SET1 involvement in telomere maintenance from early functional characterization.'
      action: ACCEPT
      reason: 'Duplicate from foundational SET1 paper. Multiple references converge
        on telomere maintenance role.'
      supported_by:
        - reference_id: PMID:9398665
          supporting_text: SET1, a yeast member of the trithorax family, 
            functions in transcriptional silencing and diverse cellular 
            processes.
  - term:
      id: GO:0003723
      label: RNA binding
    evidence_type: IDA
    original_reference_id: PMID:16787775
    review:
      summary: 'Direct experimental evidence for SET1 RNA binding from structural
        characterization of RNA recognition motifs.'
      action: KEEP_AS_NON_CORE
      reason: 'RNA binding demonstrated through structural studies and RRM characterization.
        IDA evidence appropriate. However, functional role unclear - appears to affect
        protein stability rather than catalytic activity. Less central than methyltransferase
        function.'
      supported_by:
        - reference_id: PMID:16787775
          supporting_text: 2006 May 9. Structural characterization of Set1 RNA 
            recognition motifs and their role in histone H3 lysine 4 
            methylation.
  - term:
      id: GO:0003723
      label: RNA binding
    evidence_type: IMP
    original_reference_id: PMID:16787775
    review:
      summary: 'IMP evidence for RNA binding role in SET1 function and regulation.'
      action: KEEP_AS_NON_CORE
      reason: 'Both IDA and IMP from same reference. Functional role of RNA binding
        remains unclear despite experimental evidence.'
      supported_by:
        - reference_id: PMID:16787775
          supporting_text: 2006 May 9. Structural characterization of Set1 RNA 
            recognition motifs and their role in histone H3 lysine 4 
            methylation.
  - term:
      id: GO:0016279
      label: protein-lysine N-methyltransferase activity
    evidence_type: IMP
    original_reference_id: PMID:16143104
    review:
      summary: 'SET1 catalyzes protein-lysine methylation as a core enzymatic activity.
        IMP evidence from functional studies.'
      action: ACCEPT
      reason: 'SET1 is a protein-lysine N-methyltransferase (histone lysine methyltransferase).
        While less specific than GO:0042800 (H3K4 methyltransferase), this is appropriate
        parent term. Genetic interactions demonstrate methyltransferase activity role.'
      supported_by:
        - reference_id: PMID:16143104
          supporting_text: The Set1 methyltransferase opposes Ipl1 aurora kinase
            functions in chromosome segregation.
  - term:
      id: GO:0016279
      label: protein-lysine N-methyltransferase activity
    evidence_type: IGI
    original_reference_id: PMID:16143104
    review:
      summary: 'Genetic interaction evidence for SET1 protein-lysine methyltransferase
        activity.'
      action: ACCEPT
      reason: 'IGI complements IMP evidence. Genetic interactions support methyltransferase
        function.'
      supported_by:
        - reference_id: PMID:16143104
          supporting_text: The Set1 methyltransferase opposes Ipl1 aurora kinase
            functions in chromosome segregation.
  - term:
      id: GO:0030437
      label: ascospore formation
    evidence_type: IMP
    original_reference_id: PMID:17150765
    review:
      summary: 'SET1 is required for proper ascospore formation during sporulation.
        IMP evidence from set1 mutant analysis.'
      action: ACCEPT
      reason: 'SET1 has documented role in sporulation and ascospore formation. IMP
        evidence from set1 deletion strains showing defective sporulation [PMID:17150765].'
      supported_by:
        - reference_id: PMID:17150765
          supporting_text: 'Effect of histone methyltransferase gene mutations on
            sporulation in S. cerevisiae.'
  - term:
      id: GO:0030466
      label: silent mating-type cassette heterochromatin formation
    evidence_type: IMP
    original_reference_id: PMID:9398665
    review:
      summary: 'SET1 is required for maintenance of silent mating-type cassette heterochromatin.
        IMP evidence from early SET1 characterization.'
      action: ACCEPT
      reason: 'SET1 contributes to silencing at the silent mating loci (HML and HMR).
        H3K4 methylation marks these silent heterochromatic regions [PMID:9398665].
        IMP evidence from set1 mutant derepression phenotype.'
      supported_by:
        - reference_id: PMID:9398665
          supporting_text: 'SET1, a yeast member of the trithorax family, functions
            in transcriptional silencing and diverse cellular processes.'
  - term:
      id: GO:0042800
      label: histone H3K4 methyltransferase activity
    evidence_type: IDA
    original_reference_id: PMID:11742990
    review:
      summary: 'SET1 H3K4 methyltransferase activity demonstrated through direct biochemical
        and cellular experiments.'
      action: ACCEPT
      reason: 'Core function with multiple evidence types. IDA from foundational COMPASS
        discovery paper [PMID:11742990]. Extensive biochemical characterization of
        SET1 catalytic activity.'
      supported_by:
        - reference_id: PMID:11742990
          supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
            Ash2 homologue and methylates histone 3 lysine 4.
  - term:
      id: GO:0042800
      label: histone H3K4 methyltransferase activity
    evidence_type: IMP
    original_reference_id: PMID:11742990
    review:
      summary: 'SET1 H3K4 methyltransferase activity confirmed through functional
        genetic analysis.'
      action: ACCEPT
      reason: 'IMP evidence from set1 deletion showing loss of H3K4 methylation. Multiple
        evidence types from same paper converge on catalytic function.'
      supported_by:
        - reference_id: PMID:11742990
          supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
            Ash2 homologue and methylates histone 3 lysine 4.
  - term:
      id: GO:0042800
      label: histone H3K4 methyltransferase activity
    evidence_type: IMP
    original_reference_id: PMID:11752412
    review:
      summary: 'SET1-catalyzed H3K4 methylation demonstrated through trithorax-group
        complex characterization.'
      action: ACCEPT
      reason: 'IMP evidence from trithorax complex purification and functional analysis
        [PMID:11752412].'
      supported_by:
        - reference_id: PMID:11752412
          supporting_text: Epub 2001 Dec 18. A trithorax-group complex purified 
            from Saccharomyces cerevisiae is required for methylation of histone
            H3.
  - term:
      id: GO:0042800
      label: histone H3K4 methyltransferase activity
    evidence_type: IDA
    original_reference_id: PMID:11805083
    review:
      summary: 'Direct evidence for SET1 histone H3K4 methyltransferase activity from
        COMPASS characterization.'
      action: ACCEPT
      reason: 'IDA evidence from biochemical analysis of COMPASS complex activity
        [PMID:11805083].'
      supported_by:
        - reference_id: PMID:11805083
          supporting_text: 2002 Jan 22. COMPASS, a histone H3 (Lysine 4) 
            methyltransferase required for telomeric silencing of gene 
            expression.
  - term:
      id: GO:0042800
      label: histone H3K4 methyltransferase activity
    evidence_type: IMP
    original_reference_id: PMID:11805083
    review:
      summary: 'IMP evidence for H3K4 methyltransferase function from mutational analysis.'
      action: ACCEPT
      reason: 'Multiple evidence types (IDA and IMP) from same reference support catalytic
        function.'
      supported_by:
        - reference_id: PMID:11805083
          supporting_text: 2002 Jan 22. COMPASS, a histone H3 (Lysine 4) 
            methyltransferase required for telomeric silencing of gene 
            expression.
  - term:
      id: GO:0042800
      label: histone H3K4 methyltransferase activity
    evidence_type: IMP
    original_reference_id: PMID:12845608
    review:
      summary: 'SET1 is a methyltransferase specific for lysine 4 of histone H3 and
        required for efficient gene expression.'
      action: ACCEPT
      reason: 'IMP evidence from studies of SET1 catalytic requirement for gene expression
        [PMID:12845608]. Converging evidence from multiple studies.'
      supported_by:
        - reference_id: PMID:12845608
          supporting_text: Saccharomyces cerevisiae Set1p is a methyltransferase
            specific for lysine 4 of histone H3 and is required for efficient 
            gene expression.
  - term:
      id: GO:0048188
      label: Set1C/COMPASS complex
    evidence_type: IPI
    original_reference_id: PMID:11687631
    review:
      summary: 'SET1 protein-protein interactions with COMPASS subunits documented
        through direct interaction studies.'
      action: ACCEPT
      reason: 'Core structural annotation. IPI evidence from COMPASS co-immunoprecipitation
        studies [PMID:11687631]. Specific complex membership is more informative than
        generic protein binding. Multiple IPI annotations with different partners
        all converge on COMPASS complex membership.'
      supported_by:
        - reference_id: PMID:11687631
          supporting_text: 'COMPASS: a complex of proteins associated with a trithorax-related
            SET domain protein.'
  - term:
      id: GO:0048188
      label: Set1C/COMPASS complex
    evidence_type: IPI
    original_reference_id: PMID:11742990
    review:
      summary: 'SET1 interactions with COMPASS complex components demonstrated through
        multiple approaches.'
      action: ACCEPT
      reason: 'Multiple independent IPI studies confirm COMPASS complex membership.
        Duplicate annotations appropriate for different reference sources.'
      supported_by:
        - reference_id: PMID:11742990
          supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
            Ash2 homologue and methylates histone 3 lysine 4.
  - term:
      id: GO:0048188
      label: Set1C/COMPASS complex
    evidence_type: IPI
    original_reference_id: PMID:11752412
    review:
      summary: 'IPI evidence for SET1 as component of trithorax-like COMPASS complex.'
      action: ACCEPT
      reason: 'Convergent IPI evidence from trithorax complex co-purification studies.'

      supported_by:
        - reference_id: PMID:11752412
          supporting_text: Epub 2001 Dec 18. A trithorax-group complex purified 
            from Saccharomyces cerevisiae is required for methylation of histone
            H3.
core_functions:
  - description: Catalytic component of the COMPASS histone methyltransferase 
      complex responsible for mono-, di-, and trimethylation of histone H3 
      lysine 4 (H3K4), a fundamental mark of active transcription and gene 
      regulatory regions
    molecular_function:
      id: GO:0042800
      label: histone H3K4 methyltransferase activity
    directly_involved_in:
      - id: GO:0006357
        label: regulation of transcription by RNA polymerase II
    locations:
      - id: GO:0048188
        label: Set1C/COMPASS complex
    supported_by:
      - reference_id: PMID:11742990
        supporting_text: The Saccharomyces cerevisiae Set1 complex includes an 
          Ash2 homologue and methylates histone 3 lysine 4.

references:
  - id: GO_REF:0000002
    title: Gene Ontology annotation through association of InterPro records with
      GO terms
    findings: []
  - id: GO_REF:0000033
    title: Annotation inferences using phylogenetic trees
    findings: []
  - id: GO_REF:0000043
    title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword 
      mapping
    findings: []
  - id: GO_REF:0000044
    title: Gene Ontology annotation based on UniProtKB/Swiss-Prot Subcellular 
      Location vocabulary mapping, accompanied by conservative changes to GO 
      terms applied by UniProt
    findings: []
  - id: GO_REF:0000117
    title: Electronic Gene Ontology annotations created by ARBA machine learning
      models
    findings: []
  - id: GO_REF:0000120
    title: Combined Automated Annotation using Multiple IEA Methods
    findings: []
  - id: PMID:11687631
    title: 'COMPASS: a complex of proteins associated with a trithorax-related SET
      domain protein.'
    findings: []
  - id: PMID:11742990
    title: The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue 
      and methylates histone 3 lysine 4.
    findings: []
  - id: PMID:11751634
    title: Histone H3 lysine 4 methylation is mediated by Set1 and required for 
      cell growth and rDNA silencing in Saccharomyces cerevisiae.
    findings: []
  - id: PMID:11752412
    title: A trithorax-group complex purified from Saccharomyces cerevisiae is 
      required for methylation of histone H3.
    findings: []
  - id: PMID:11805083
    title: COMPASS, a histone H3 (Lysine 4) methyltransferase required for 
      telomeric silencing of gene expression.
    findings: []
  - id: PMID:11805837
    title: Systematic identification of protein complexes in Saccharomyces 
      cerevisiae by mass spectrometry.
    findings: []
  - id: PMID:12845608
    title: Saccharomyces cerevisiae Set1p is a methyltransferase specific for 
      lysine 4 of histone H3 and is required for efficient gene expression.
    findings: []
  - id: PMID:16143104
    title: The Set1 methyltransferase opposes Ipl1 aurora kinase functions in 
      chromosome segregation.
    findings: []
  - id: PMID:16429126
    title: Proteome survey reveals modularity of the yeast cell machinery.
    findings: []
  - id: PMID:16554755
    title: Global landscape of protein complexes in the yeast Saccharomyces 
      cerevisiae.
    findings: []
  - id: PMID:16787775
    title: Structural characterization of Set1 RNA recognition motifs and their 
      role in histone H3 lysine 4 methylation.
    findings: []
  - id: PMID:16959218
    title: Different roles of histone H3 lysine 4 methylation in chromatin 
      maintenance.
    findings: []
  - id: PMID:17150765
    title: Effect of histone methyltransferase gene mutations on sporulation in 
      S. cerevisiae.
    findings: []
  - id: PMID:18083099
    title: Histone crosstalk between H2B monoubiquitination and H3 methylation 
      mediated by COMPASS.
    findings: []
  - id: PMID:19713935
    title: Cotranslational assembly of the yeast SET1C histone methyltransferase
      complex.
    findings: []
  - id: PMID:21179020
    title: Defining the budding yeast chromatin-associated interactome.
    findings: []
  - id: PMID:22158900
    title: Structural analysis of the core COMPASS family of histone H3K4 
      methylases from yeast to human.
    findings: []
  - id: PMID:23246437
    title: Spp1, a member of the Set1 Complex, promotes meiotic DSB formation in
      promoters by tethering histone H3K4 methylation sites to chromosome axes.
    findings: []
  - id: PMID:23382196
    title: H3K4 methyltransferase Set1 is involved in maintenance of ergosterol 
      homeostasis and resistance to Brefeldin A.
    findings: []
  - id: PMID:23469257
    title: Catalytic and functional roles of conserved amino acids in the SET 
      domain of the S. cerevisiae lysine methyltransferase Set1.
    findings: []
  - id: PMID:24797370
    title: Dot1-dependent histone H3K79 methylation promotes the formation of 
      meiotic double-strand breaks in the absence of histone H3K4 methylation in
      budding yeast.
    findings: []
  - id: PMID:27325136
    title: Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate 
      chromatin dynamics and gene transcription.
    findings: []
  - id: PMID:27911222
    title: The histone methyltransferases Set5 and Set1 have overlapping 
      functions in gene silencing and telomere maintenance.
    findings: []
  - id: PMID:28483910
    title: RNA Binding by Histone Methyltransferases Set1 and Set2.
    findings: []
  - id: PMID:29066473
    title: Repression of Middle Sporulation Genes in Saccharomyces cerevisiae by
      the Sum1-Rfm1-Hst1 Complex Is Maintained by Set1 and H3K4 Methylation.
    findings: []
  - id: PMID:29071121
    title: Binding to RNA regulates Set1 function.
    findings: []
  - id: PMID:30358795
    title: The cellular economy of the Saccharomyces cerevisiae zinc proteome.
    findings: []
  - id: PMID:32290544
    title: Genetic Interactions of Histone Modification Machinery Set1 and PAF1C
      with the Recombination Complex Rec114-Mer2-Mei4 in the Formation of 
      Meiotic DNA Double-Strand Breaks.
    findings: []
  - id: PMID:37968396
    title: The social and structural architecture of the yeast protein 
      interactome.
    findings: []
  - id: PMID:9214638
    title: Mammalian homologues of the Polycomb-group gene Enhancer of zeste 
      mediate gene silencing in Drosophila heterochromatin and at S. cerevisiae 
      telomeres.
    findings: []
  - id: PMID:9398665
    title: SET1, a yeast member of the trithorax family, functions in 
      transcriptional silencing and diverse cellular processes.
    findings: []
  - id: PMID:9988274
    title: Interaction between Set1p and checkpoint protein Mec3p in DNA repair 
      and telomere functions.
    findings: []
  - id: file:yeast/SET1/SET1-deep-research-perplexity.md
    title: Deep research report on SET1
    findings: []