SET1 is the catalytic component of the COMPASS complex, responsible for histone H3 lysine 4 (H3K4) methylation. SET1 catalyzes mono-, di-, and trimethylation of H3K4, a fundamental histone mark associated with actively transcribed genes and gene regulatory regions. SET1 also participates in DNA damage response, meiotic recombination, and telomere maintenance through both methylation-dependent and independent mechanisms.
| GO Term | Evidence | Action | Reason |
|---|---|---|---|
|
GO:0042800
histone H3K4 methyltransferase activity
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: SET1 catalyzes histone H3 lysine 4 methylation as the core enzymatic function of the COMPASS complex. IBA annotation based on phylogenetic inference across SET1/MLL family. Extensive direct biochemical evidence establishes this as the primary function.
Reason: Core SET1 function. IBA evidence is appropriate for this well-conserved family function. Deep research confirms SET1 is the sole H3K4 methyltransferase in yeast [evidence from PMID:11742990, 11751634, 11752412, 11805083].
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
PMID:11805083
COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression.
file:yeast/SET1/SET1-deep-research-perplexity.md
provider: perplexity
|
|
GO:0048188
Set1C/COMPASS complex
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: SET1 is the catalytic core component of the COMPASS (Complex of Proteins Associated with Set1) complex. SET1 is part of an eight-subunit complex essential for H3K4 methylation. IBA annotation reflects phylogenetic conservation of this complex organization.
Reason: Core structural annotation. SET1 functions exclusively as component of COMPASS complex. IBA evidence appropriate for this conserved complex membership. Deep research establishes COMPASS as multi-subunit assembly with SET1 at catalytic core [PMID:11742990, 11687631].
Supporting Evidence:
PMID:11687631
COMPASS: a complex of proteins associated with a trithorax-related SET domain protein.
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
|
|
GO:0005634
nucleus
|
IEA
GO_REF:0000044 |
KEEP AS NON CORE |
Summary: SET1 localizes to the nucleus where COMPASS complex catalyzes H3K4 methylation on chromatin. IEA from UniProt subcellular location annotation.
Reason: Correct localization but not informative about SET1 function. Localization without functional consequence should be marked non-core. SET1 functions exclusively in nucleus.
|
|
GO:0005694
chromosome
|
IEA
GO_REF:0000044 |
KEEP AS NON CORE |
Summary: SET1 associates with chromatin/chromosomes as component of COMPASS complex catalyzing H3K4 methylation.
Reason: While true, this is vague and less informative than specific process or function annotations. Chromosome association is implied by core methyltransferase function.
|
|
GO:0006325
chromatin organization
|
IEA
GO_REF:0000043 |
ACCEPT |
Summary: H3K4 methylation by SET1/COMPASS contributes to chromatin organization and regulation of chromatin structure.
Reason: SET1 directly contributes to chromatin organization through H3K4 methylation, which affects nucleosome positioning, histone-DNA interactions, and chromatin state. IEA from keyword annotation is appropriate.
|
|
GO:0008168
methyltransferase activity
|
IEA
GO_REF:0000120 |
KEEP AS NON CORE |
Summary: SET1 catalyzes methyl group transfer from SAM to histone H3 lysine 4, a methyltransferase activity.
Reason: True but overly broad. GO:0042800 (H3K4 methyltransferase activity) is more specific and informative. This parent term is less useful when specific substrate-level annotation available.
|
|
GO:0016740
transferase activity
|
IEA
GO_REF:0000043 |
KEEP AS NON CORE |
Summary: SET1 catalyzes transferase activity (methyl group transfer to histone substrate).
Reason: Extremely broad parent term. SET1 is a methyltransferase, which is a type of transferase. Parent term is less informative when more specific molecular function available.
|
|
GO:0032259
methylation
|
IEA
GO_REF:0000043 |
KEEP AS NON CORE |
Summary: SET1 catalyzes methylation of histone H3 lysine 4.
Reason: General process annotation. While accurate, more specific methyltransferase activity annotations (GO:0008168, GO:0042800) are more informative. Parent term less useful.
|
|
GO:0042800
histone H3K4 methyltransferase activity
|
IEA
GO_REF:0000002 |
ACCEPT |
Summary: SET1 is the H3K4-specific methyltransferase, catalyzing mono-, di-, and trimethylation of histone H3 lysine 4. IEA from InterPro domain annotation.
Reason: Core molecular function annotation. IEA from InterPro recognition of SET domain is appropriate. Extensive literature confirms this is SET1 primary function. Duplicate of annotation 1 with different evidence source.
|
|
GO:0045944
positive regulation of transcription by RNA polymerase II
|
IEA
GO_REF:0000117 |
ACCEPT |
Summary: SET1-mediated H3K4 methylation marks and promotes active transcription, functioning in positive regulation of Pol II transcription.
Reason: SET1 function extends beyond catalytic activity to transcriptional regulation. H3K4me3 marks active promoters and is required for efficient transcription initiation. Deep research [PMID:27325136] confirms SET1 involvement in transcription regulation. However, SET1 also has roles in transcriptional repression, making this annotation incomplete.
Supporting Evidence:
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate chromatin dynamics and gene transcription.
|
|
GO:0048188
Set1C/COMPASS complex
|
IEA
GO_REF:0000002 |
ACCEPT |
Summary: SET1 is component of Set1C/COMPASS complex. IEA from InterPro domain recognition.
Reason: Core structural annotation. Duplicate of annotation 2 with different evidence. InterPro detection of SET domain-containing protein appropriately infers COMPASS complex membership.
|
|
GO:0140999
histone H3K4 trimethyltransferase activity
|
IEA
GO_REF:0000120 |
ACCEPT |
Summary: SET1 catalyzes trimethylation of histone H3 lysine 4 (H3K4me3). IEA from RHEA and EC number annotation.
Reason: SET1 catalyzes all methylation states (me1, me2, me3) of H3K4. This annotation for trimethylation specifically is appropriate. Deep research confirms SET1 capacity for complete trimethylation [PMID:12845608, 22158900].
Supporting Evidence:
PMID:12845608
Saccharomyces cerevisiae Set1p is a methyltransferase specific for lysine 4 of histone H3 and is required for efficient gene expression.
PMID:22158900
Structural analysis of the core COMPASS family of histone H3K4 methylases from yeast to human.
|
|
GO:0005515
protein binding
|
IPI
PMID:11742990 The Saccharomyces cerevisiae Set1 complex includes an Ash2 h... |
KEEP AS NON CORE |
Summary: SET1 protein-protein interactions with COMPASS complex subunits (Bre2/Ash2, Swd1, Swd2, Swd3) and other proteins documented through co-immunoprecipitation and mass spectrometry. Multiple IPI annotations reflect experimentally validated interactions.
Reason: While interactions are real and well-documented (IPI evidence is appropriate), generic "protein binding" is not informative for SET1 function. These interactions are with known COMPASS subunits (Bre2, Swd1, Swd2, Swd3, Bre5, Chd1, Swd7) forming the complex. Specific complex membership (GO:0048188) is more informative than generic binding. COMPASS complex membership captures the functional significance.
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
|
|
GO:0005515
protein binding
|
IPI
PMID:11805837 Systematic identification of protein complexes in Saccharomy... |
KEEP AS NON CORE |
Summary: SET1 interactions documented via mass spectrometry-based proteome survey.
Reason: Generic binding annotation. Specific COMPASS complex membership is more informative.
Supporting Evidence:
PMID:11805837
Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry.
|
|
GO:0005515
protein binding
|
IPI
PMID:16429126 Proteome survey reveals modularity of the yeast cell machine... |
KEEP AS NON CORE |
Summary: SET1 protein interactions identified in proteome survey.
Reason: Generic binding annotation. Specific complex membership captures functional relationships.
Supporting Evidence:
PMID:16429126
Proteome survey reveals modularity of the yeast cell machinery.
|
|
GO:0005515
protein binding
|
IPI
PMID:16554755 Global landscape of protein complexes in the yeast Saccharom... |
KEEP AS NON CORE |
Summary: SET1 protein interactions from global landscape of yeast protein complexes study.
Reason: Generic binding annotation. Duplicate interactions captured more informatively by COMPASS complex membership.
Supporting Evidence:
PMID:16554755
Global landscape of protein complexes in the yeast Saccharomyces cerevisiae.
|
|
GO:0005515
protein binding
|
IPI
PMID:18083099 Histone crosstalk between H2B monoubiquitination and H3 meth... |
KEEP AS NON CORE |
Summary: SET1 interactions in context of histone crosstalk between H2B ubiquitination and H3 methylation via COMPASS.
Reason: Generic binding annotation despite interesting functional context. COMPASS complex membership is more specific and functional.
Supporting Evidence:
PMID:18083099
Histone crosstalk between H2B monoubiquitination and H3 methylation mediated by COMPASS.
|
|
GO:0005515
protein binding
|
IPI
PMID:19713935 Cotranslational assembly of the yeast SET1C histone methyltr... |
KEEP AS NON CORE |
Summary: SET1 protein interactions from cotranslational assembly studies of SET1C complex.
Reason: Generic binding annotation. Complex assembly dynamics captured by COMPASS membership annotation.
Supporting Evidence:
PMID:19713935
Cotranslational assembly of the yeast SET1C histone methyltransferase complex.
|
|
GO:0005515
protein binding
|
IPI
PMID:21179020 Defining the budding yeast chromatin-associated interactome. |
KEEP AS NON CORE |
Summary: SET1 interactions from budding yeast chromatin-associated interactome study.
Reason: Generic binding annotation. COMPASS complex membership more informative.
Supporting Evidence:
PMID:21179020
Defining the budding yeast chromatin-associated interactome.
|
|
GO:0005515
protein binding
|
IPI
PMID:37968396 The social and structural architecture of the yeast protein ... |
KEEP AS NON CORE |
Summary: SET1 interactions from recent social and structural architecture of yeast protein interactome study.
Reason: Generic binding annotation. Specific complex membership is more informative for understanding SET1 function.
Supporting Evidence:
PMID:37968396
Nov 15. The social and structural architecture of the yeast protein interactome.
|
|
GO:0005515
protein binding
|
IPI
PMID:9988274 Interaction between Set1p and checkpoint protein Mec3p in DN... |
KEEP AS NON CORE |
Summary: SET1 interaction with Mec3 checkpoint protein documented. SET1 involvement in DNA repair and checkpoint functions.
Reason: While this Mec3 interaction is functionally distinct (DNA repair vs. COMPASS), generic "protein binding" is still not informative. The functional relationship (DNA repair, checkpoint) would be better captured by process annotations.
Supporting Evidence:
PMID:9988274
Interaction between Set1p and checkpoint protein Mec3p in DNA repair and telomere functions.
|
|
GO:0005634
nucleus
|
NAS
PMID:11742990 The Saccharomyces cerevisiae Set1 complex includes an Ash2 h... |
KEEP AS NON CORE |
Summary: SET1 localizes to the nucleus as established by ComplexPortal NAS (Non-Annotated Sequence) curation.
Reason: Correct but not informative about function. Localization without functional consequence. SET1 functions in nucleus through H3K4 methylation which is already annotated.
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
|
|
GO:0031509
subtelomeric heterochromatin formation
|
NAS
PMID:11742990 The Saccharomyces cerevisiae Set1 complex includes an Ash2 h... |
ACCEPT |
Summary: SET1 has documented roles in subtelomeric heterochromatin formation and silencing of genes near chromosome ends.
Reason: SET1 contributes to subtelomeric heterochromatin formation through H3K4me3-dependent mechanisms. Deep research confirms role in telomeric gene silencing [PMID:11805083]. NAS evidence from ComplexPortal is appropriate. This is documented functional role.
Supporting Evidence:
PMID:11805083
COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression.
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
|
|
GO:0003723
RNA binding
|
IDA
PMID:29071121 Binding to RNA regulates Set1 function. |
KEEP AS NON CORE |
Summary: SET1 contains an RNA recognition motif (RRM) at its N-terminus and binds RNA. IDA evidence from direct biochemical studies of SET1 RNA binding.
Reason: SET1 does bind RNA through its RRM domain (demonstrated PMID:29071121), but the functional role of this interaction is unclear. Deep research indicates RRM functions primarily in protein stability rather than direct catalytic relevance to H3K4 methylation. RNA binding is less central than methyltransferase activity.
Supporting Evidence:
PMID:29071121
Binding to RNA regulates Set1 function.
|
|
GO:0048188
Set1C/COMPASS complex
|
IDA
PMID:29071121 Binding to RNA regulates Set1 function. |
ACCEPT |
Summary: SET1 is demonstrated to be component of Set1C/COMPASS complex through direct biochemical analysis and RNA binding studies.
Reason: Core structural annotation. IDA evidence from direct protein interaction studies appropriate. Duplicate annotation with different evidence from IBA (phylogenetic) and IEA (InterPro). This IDA evidence from direct studies is strong.
Supporting Evidence:
PMID:29071121
Binding to RNA regulates Set1 function.
|
|
GO:0006357
regulation of transcription by RNA polymerase II
|
IMP
PMID:16959218 Different roles of histone H3 lysine 4 methylation in chroma... |
ACCEPT |
Summary: SET1 regulates transcription by RNA Pol II through H3K4 methylation. IMP evidence from set1 mutant studies showing defects in transcription regulation.
Reason: SET1 is essential for normal transcription regulation through H3K4 methylation. Deep research confirms H3K4me3 marks active promoters and affects transcription initiation and elongation [PMID:27325136]. IMP evidence appropriate.
Supporting Evidence:
PMID:16959218
Different roles of histone H3 lysine 4 methylation in chromatin maintenance.
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate chromatin dynamics and gene transcription.
|
|
GO:0006357
regulation of transcription by RNA polymerase II
|
IGI
PMID:16959218 Different roles of histone H3 lysine 4 methylation in chroma... |
ACCEPT |
Summary: SET1 genetic interaction with other transcription regulation genes. IGI evidence from genetic interaction studies.
Reason: Genetic interactions with transcription factors and regulators confirm SET1 role in transcription regulation. IMP and IGI both support this annotation from same reference.
Supporting Evidence:
PMID:16959218
Different roles of histone H3 lysine 4 methylation in chromatin maintenance.
|
|
GO:0033554
cellular response to stress
|
IMP
PMID:16959218 Different roles of histone H3 lysine 4 methylation in chroma... |
ACCEPT |
Summary: SET1 contributes to cellular stress response through H3K4 methylation-dependent gene expression changes. IMP evidence from mutant analysis.
Reason: SET1 regulates genes involved in stress response. Deep research shows SET1 effects on stress-responsive gene expression. IMP evidence from set1 deletion/mutation studies.
Supporting Evidence:
PMID:16959218
Different roles of histone H3 lysine 4 methylation in chromatin maintenance.
|
|
GO:0033554
cellular response to stress
|
IGI
PMID:16959218 Different roles of histone H3 lysine 4 methylation in chroma... |
ACCEPT |
Summary: Genetic interaction showing SET1 involvement in stress response pathways.
Reason: Genetic interaction data support stress response role. Duplicate from same reference with complementary evidence type.
Supporting Evidence:
PMID:16959218
Different roles of histone H3 lysine 4 methylation in chromatin maintenance.
|
|
GO:0008270
zinc ion binding
|
RCA
PMID:30358795 The cellular economy of the Saccharomyces cerevisiae zinc pr... |
KEEP AS NON CORE |
Summary: SET1 contains zinc-binding motifs characteristic of SET domain proteins. RCA (Rule-based Computational Annotation) inferred from protein structure.
Reason: SET1 SET domain contains zinc-coordinating residues (part of catalytic pocket), but zinc binding is not the primary function - it is structural support for methyltransferase activity. RCA from structural features is appropriate, but zinc binding is a cofactor interaction rather than independent function. Less informative than methyltransferase activity annotation.
Supporting Evidence:
PMID:30358795
The cellular economy of the Saccharomyces cerevisiae zinc proteome.
|
|
GO:0031509
subtelomeric heterochromatin formation
|
IMP
PMID:11752412 A trithorax-group complex purified from Saccharomyces cerevi... |
ACCEPT |
Summary: SET1 is required for proper methylation of histone H3 and formation of subtelomeric heterochromatin. Multiple IMP annotations confirm this functional role.
Reason: SET1 is essential for subtelomeric heterochromatin formation and silencing through H3K4 methylation-dependent mechanisms. IMP evidence from set1 mutant studies. Core functional role well-documented [PMID:11752412, 11805083, 9398665].
Supporting Evidence:
PMID:11752412
A trithorax-group complex purified from Saccharomyces cerevisiae is required for methylation of histone H3.
|
|
GO:0031509
subtelomeric heterochromatin formation
|
IMP
PMID:11805083 COMPASS, a histone H3 (Lysine 4) methyltransferase required ... |
ACCEPT |
Summary: SET1 methylation activity required for telomeric silencing and subtelomeric heterochromatin.
Reason: Duplicate annotation with different reference but same functional role. IMP evidence appropriate.
Supporting Evidence:
PMID:11805083
COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression.
|
|
GO:0031509
subtelomeric heterochromatin formation
|
IMP
PMID:9398665 SET1, a yeast member of the trithorax family, functions in t... |
ACCEPT |
Summary: SET1, a yeast member of the trithorax family, functions in transcriptional silencing and subtelomeric heterochromatin.
Reason: Foundational characterization of SET1 in heterochromatin. IMP evidence from early functional studies.
Supporting Evidence:
PMID:9398665
SET1, a yeast member of the trithorax family, functions in transcriptional silencing and diverse cellular processes.
|
|
GO:0031509
subtelomeric heterochromatin formation
|
IMP
PMID:9988274 Interaction between Set1p and checkpoint protein Mec3p in DN... |
ACCEPT |
Summary: SET1 involvement in heterochromatin formation and telomere functions.
Reason: Additional IMP evidence. Multiple independent studies converge on SET1 role in subtelomeric heterochromatin formation.
Supporting Evidence:
PMID:9988274
Interaction between Set1p and checkpoint protein Mec3p in DNA repair and telomere functions.
|
|
GO:0003723
RNA binding
|
IDA
PMID:28483910 RNA Binding by Histone Methyltransferases Set1 and Set2. |
KEEP AS NON CORE |
Summary: Direct experimental evidence for SET1 RNA binding capacity. IDA from PMID:28483910 RNA Binding by Histone Methyltransferases Set1 and Set2.
Reason: While RNA binding is demonstrated (IDA evidence appropriate), the functional role in H3K4 methylation is unclear. RRM domain appears to function in protein stability/regulation rather than direct catalytic function. Less central than methyltransferase activity.
Supporting Evidence:
PMID:28483910
RNA Binding by Histone Methyltransferases Set1 and Set2.
|
|
GO:0042138
meiotic DNA double-strand break formation
|
IMP
PMID:32290544 Genetic Interactions of Histone Modification Machinery Set1 ... |
ACCEPT |
Summary: SET1 contributes to meiotic DNA double-strand break (DSB) formation. IMP evidence from genetic analysis of set1 in meiosis.
Reason: SET1 has specialized functions in meiotic recombination beyond transcription. Deep research shows SET1 targeting meiotic DSB sites and requirement for proper DSB formation [PMID:32290544]. Both catalytic and non-catalytic SET1 functions contribute to meiosis.
Supporting Evidence:
PMID:32290544
Genetic Interactions of Histone Modification Machinery Set1 and PAF1C with the Recombination Complex Rec114-Mer2-Mei4 in the Formation of Meiotic DNA Double-Strand Breaks.
|
|
GO:0000723
telomere maintenance
|
IMP
PMID:27911222 The histone methyltransferases Set5 and Set1 have overlappin... |
ACCEPT |
Summary: SET1 is involved in telomere maintenance through H3K4 methylation-dependent and independent mechanisms.
Reason: SET1 contributes to telomere length maintenance. Deep research documents SET1 roles in both H3K4me3-dependent telomeric silencing and H3K4me3-independent telomere length regulation [PMID:27911222]. IMP evidence from set1 mutant studies.
Supporting Evidence:
PMID:27911222
The histone methyltransferases Set5 and Set1 have overlapping functions in gene silencing and telomere maintenance.
|
|
GO:0000781
chromosome, telomeric region
|
IGI
PMID:9214638 Mammalian homologues of the Polycomb-group gene Enhancer of ... |
KEEP AS NON CORE |
Summary: SET1 genetic interaction suggesting localization/function at telomeric regions. IGI evidence from genetic interaction studies.
Reason: Localization annotation without clear functional context. SET1 association with telomeric regions is implied by telomere maintenance and subtelomeric heterochromatin annotations. More specific process annotations are more informative.
Supporting Evidence:
PMID:9214638
Mammalian homologues of the Polycomb-group gene Enhancer of zeste mediate gene silencing in Drosophila heterochromatin and at S.
|
|
GO:0031509
subtelomeric heterochromatin formation
|
IGI
PMID:9214638 Mammalian homologues of the Polycomb-group gene Enhancer of ... |
ACCEPT |
Summary: Genetic interaction indicating SET1 involvement in subtelomeric heterochromatin formation.
Reason: Genetic interaction evidence supports subtelomeric heterochromatin function. IGI complements IMP evidence from other references.
Supporting Evidence:
PMID:9214638
Mammalian homologues of the Polycomb-group gene Enhancer of zeste mediate gene silencing in Drosophila heterochromatin and at S.
|
|
GO:0010629
negative regulation of gene expression
|
IMP
PMID:29066473 Repression of Middle Sporulation Genes in Saccharomyces cere... |
ACCEPT |
Summary: SET1 involvement in repression of middle sporulation genes through H3K4 methylation maintenance.
Reason: SET1 has documented roles in repressing genes that should not be expressed. H3K4 methylation can mark both active and repressed genes depending on context. Deep research confirms SET1 contributes to negative regulation of specific gene sets [PMID:29066473].
Supporting Evidence:
PMID:29066473
Repression of Middle Sporulation Genes in Saccharomyces cerevisiae by the Sum1-Rfm1-Hst1 Complex Is Maintained by Set1 and H3K4 Methylation.
|
|
GO:0006479
protein methylation
|
IMP
PMID:23469257 Catalytic and functional roles of conserved amino acids in t... |
KEEP AS NON CORE |
Summary: SET1 catalyzes protein (histone) methylation as core enzymatic function. IMP evidence from catalytic studies.
Reason: Correct but overly broad. GO:0042800 (H3K4 methyltransferase activity) is more specific. Parent term less informative when specific substrate-level annotation available.
Supporting Evidence:
PMID:23469257
Catalytic and functional roles of conserved amino acids in the SET domain of the S.
|
|
GO:0000183
rDNA heterochromatin formation
|
IMP
PMID:23469257 Catalytic and functional roles of conserved amino acids in t... |
ACCEPT |
Summary: SET1 contributes to rDNA heterochromatin formation through H3K4 methylation-dependent mechanisms.
Reason: SET1 has specialized roles in rDNA locus silencing beyond general H3K4 methylation. IMP evidence from studies of SET domain catalytic function in rDNA silencing. Deep research confirms rDNA heterochromatin is SET1-dependent [PMID:23469257].
Supporting Evidence:
PMID:23469257
Catalytic and functional roles of conserved amino acids in the SET domain of the S. cerevisiae lysine methyltransferase Set1.
|
|
GO:0000122
negative regulation of transcription by RNA polymerase II
|
IMP
PMID:27325136 Counteracting H3K4 methylation modulators Set1 and Jhd2 co-r... |
ACCEPT |
Summary: SET1-mediated H3K4 methylation can function in negative regulation of transcription at specific loci. IMP evidence from mutant analysis.
Reason: SET1 has dual roles in both positive and negative regulation depending on genomic context and partnering proteins. Deep research confirms Jhd2 demethylase opposition to Set1 in dynamic control [PMID:27325136]. Multiple regulatory roles appropriate.
Supporting Evidence:
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate chromatin dynamics and gene transcription.
|
|
GO:0045944
positive regulation of transcription by RNA polymerase II
|
IMP
PMID:27325136 Counteracting H3K4 methylation modulators Set1 and Jhd2 co-r... |
ACCEPT |
Summary: SET1-mediated H3K4 methylation promotes positive regulation of Pol II transcription.
Reason: Primary transcriptional role of SET1. Both GO:0000122 (negative) and GO:0045944 (positive) from same reference indicate context-dependent function. IMP evidence appropriate for both.
Supporting Evidence:
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate chromatin dynamics and gene transcription.
|
|
GO:1902275
regulation of chromatin organization
|
IMP
PMID:27325136 Counteracting H3K4 methylation modulators Set1 and Jhd2 co-r... |
ACCEPT |
Summary: SET1-mediated H3K4 methylation regulates chromatin organization state through coordinated interactions with chromatin modifiers.
Reason: SET1 is central to chromatin organization regulation through H3K4 methylation effects on nucleosome positioning, chromatin accessibility, and establishment of distinct chromatin domains. IMP evidence appropriate.
Supporting Evidence:
PMID:27325136
Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate chromatin dynamics and gene transcription.
|
|
GO:1905088
positive regulation of synaptonemal complex assembly
|
IMP
PMID:24797370 Dot1-dependent histone H3K79 methylation promotes the format... |
ACCEPT |
Summary: SET1 contributes to synaptonemal complex assembly during meiosis. IMP evidence from meiotic genetics.
Reason: SET1 specialized meiotic function. Synaptonemal complex assembly is essential for proper meiotic recombination. SET1 H3K4 methylation promotes this process [PMID:24797370].
Supporting Evidence:
PMID:24797370
Dot1-dependent histone H3K79 methylation promotes the formation of meiotic double-strand breaks in the absence of histone H3K4 methylation in budding yeast.
|
|
GO:0007130
synaptonemal complex assembly
|
IMP
PMID:24797370 Dot1-dependent histone H3K79 methylation promotes the format... |
ACCEPT |
Summary: SET1 is required for synaptonemal complex assembly during meiosis.
Reason: SET1 essential for meiotic recombination through synaptonemal complex assembly. IMP evidence from meiotic mutant studies. Both positive regulation (GO:1905088) and assembly (GO:0007130) appropriately annotated.
Supporting Evidence:
PMID:24797370
eCollection 2014. Dot1-dependent histone H3K79 methylation promotes the formation of meiotic double-strand breaks in the absence of histone H3K4 methylation in budding yeast.
|
|
GO:1903341
regulation of meiotic DNA double-strand break formation
|
IMP
PMID:23246437 Spp1, a member of the Set1 Complex, promotes meiotic DSB for... |
ACCEPT |
Summary: SET1 contributes to regulation of meiotic DSB formation through Spp1-mediated recruitment to meiotic recombination hotspots.
Reason: SET1 has documented roles in regulating meiotic DSB formation through Spp1 complex component. Deep research shows SET1 recruits H3K4me3 to chromosome axis sites, promoting DSB formation [PMID:23246437]. Specialized meiotic function distinct from transcriptional role.
Supporting Evidence:
PMID:23246437
Spp1, a member of the Set1 Complex, promotes meiotic DSB formation in promoters by tethering histone H3K4 methylation sites to chromosome axes.
|
|
GO:0042800
histone H3K4 methyltransferase activity
|
IMP
PMID:11751634 Histone H3 lysine 4 methylation is mediated by Set1 and requ... |
ACCEPT |
Summary: SET1 H3K4 methyltransferase activity demonstrated by functional analysis. IMP evidence from studies of SET1 catalytic function.
Reason: Core molecular function. IMP evidence from direct biochemical characterization of SET1 catalytic activity [PMID:11751634]. Multiple IMP annotations with different references converge on primary function.
Supporting Evidence:
PMID:11751634
Histone H3 lysine 4 methylation is mediated by Set1 and required for cell growth and rDNA silencing in Saccharomyces cerevisiae.
|
|
GO:0042800
histone H3K4 methyltransferase activity
|
IMP
PMID:22158900 Structural analysis of the core COMPASS family of histone H3... |
ACCEPT |
Summary: SET1 H3K4 methyltransferase activity confirmed through structural analysis studies.
Reason: Duplicate with different structural biology reference. Multiple independent studies support H3K4 methyltransferase function. IMP evidence appropriate.
Supporting Evidence:
PMID:22158900
Structural analysis of the core COMPASS family of histone H3K4 methylases from yeast to human.
|
|
GO:0055092
sterol homeostasis
|
IMP
PMID:23382196 H3K4 methyltransferase Set1 is involved in maintenance of er... |
MARK AS OVER ANNOTATED |
Summary: SET1 is involved in maintenance of ergosterol homeostasis and resistance to Brefeldin A. IMP evidence from genetic and functional studies.
Reason: This appears to be a pleiotropic effect rather than core function. SET1 H3K4 methylation regulates genes involved in sterol metabolism, but sterol homeostasis is not a primary functional role. The annotation is not incorrect, but represents a downstream consequence of transcriptional regulation rather than direct SET1 function. Keep annotation but mark as non-central to SET1 biology.
Supporting Evidence:
PMID:23382196
H3K4 methyltransferase Set1 is involved in maintenance of ergosterol homeostasis and resistance to Brefeldin A.
|
|
GO:0000723
telomere maintenance
|
IMP
PMID:11742990 The Saccharomyces cerevisiae Set1 complex includes an Ash2 h... |
ACCEPT |
Summary: SET1 is required for telomere maintenance. IMP evidence from foundational studies.
Reason: SET1 contributes to telomere length maintenance and integrity. Supported by multiple references [PMID:11742990, 9398665, 27911222]. IMP evidence from set1 mutant studies showing shortened telomeres.
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
|
|
GO:0000723
telomere maintenance
|
IMP
PMID:9398665 SET1, a yeast member of the trithorax family, functions in t... |
ACCEPT |
Summary: SET1 involvement in telomere maintenance from early functional characterization.
Reason: Duplicate from foundational SET1 paper. Multiple references converge on telomere maintenance role.
Supporting Evidence:
PMID:9398665
SET1, a yeast member of the trithorax family, functions in transcriptional silencing and diverse cellular processes.
|
|
GO:0003723
RNA binding
|
IDA
PMID:16787775 Structural characterization of Set1 RNA recognition motifs a... |
KEEP AS NON CORE |
Summary: Direct experimental evidence for SET1 RNA binding from structural characterization of RNA recognition motifs.
Reason: RNA binding demonstrated through structural studies and RRM characterization. IDA evidence appropriate. However, functional role unclear - appears to affect protein stability rather than catalytic activity. Less central than methyltransferase function.
Supporting Evidence:
PMID:16787775
2006 May 9. Structural characterization of Set1 RNA recognition motifs and their role in histone H3 lysine 4 methylation.
|
|
GO:0003723
RNA binding
|
IMP
PMID:16787775 Structural characterization of Set1 RNA recognition motifs a... |
KEEP AS NON CORE |
Summary: IMP evidence for RNA binding role in SET1 function and regulation.
Reason: Both IDA and IMP from same reference. Functional role of RNA binding remains unclear despite experimental evidence.
Supporting Evidence:
PMID:16787775
2006 May 9. Structural characterization of Set1 RNA recognition motifs and their role in histone H3 lysine 4 methylation.
|
|
GO:0016279
protein-lysine N-methyltransferase activity
|
IMP
PMID:16143104 The Set1 methyltransferase opposes Ipl1 aurora kinase functi... |
ACCEPT |
Summary: SET1 catalyzes protein-lysine methylation as a core enzymatic activity. IMP evidence from functional studies.
Reason: SET1 is a protein-lysine N-methyltransferase (histone lysine methyltransferase). While less specific than GO:0042800 (H3K4 methyltransferase), this is appropriate parent term. Genetic interactions demonstrate methyltransferase activity role.
Supporting Evidence:
PMID:16143104
The Set1 methyltransferase opposes Ipl1 aurora kinase functions in chromosome segregation.
|
|
GO:0016279
protein-lysine N-methyltransferase activity
|
IGI
PMID:16143104 The Set1 methyltransferase opposes Ipl1 aurora kinase functi... |
ACCEPT |
Summary: Genetic interaction evidence for SET1 protein-lysine methyltransferase activity.
Reason: IGI complements IMP evidence. Genetic interactions support methyltransferase function.
Supporting Evidence:
PMID:16143104
The Set1 methyltransferase opposes Ipl1 aurora kinase functions in chromosome segregation.
|
|
GO:0030437
ascospore formation
|
IMP
PMID:17150765 Effect of histone methyltransferase gene mutations on sporul... |
ACCEPT |
Summary: SET1 is required for proper ascospore formation during sporulation. IMP evidence from set1 mutant analysis.
Reason: SET1 has documented role in sporulation and ascospore formation. IMP evidence from set1 deletion strains showing defective sporulation [PMID:17150765].
Supporting Evidence:
PMID:17150765
Effect of histone methyltransferase gene mutations on sporulation in S. cerevisiae.
|
|
GO:0030466
silent mating-type cassette heterochromatin formation
|
IMP
PMID:9398665 SET1, a yeast member of the trithorax family, functions in t... |
ACCEPT |
Summary: SET1 is required for maintenance of silent mating-type cassette heterochromatin. IMP evidence from early SET1 characterization.
Reason: SET1 contributes to silencing at the silent mating loci (HML and HMR). H3K4 methylation marks these silent heterochromatic regions [PMID:9398665]. IMP evidence from set1 mutant derepression phenotype.
Supporting Evidence:
PMID:9398665
SET1, a yeast member of the trithorax family, functions in transcriptional silencing and diverse cellular processes.
|
|
GO:0042800
histone H3K4 methyltransferase activity
|
IDA
PMID:11742990 The Saccharomyces cerevisiae Set1 complex includes an Ash2 h... |
ACCEPT |
Summary: SET1 H3K4 methyltransferase activity demonstrated through direct biochemical and cellular experiments.
Reason: Core function with multiple evidence types. IDA from foundational COMPASS discovery paper [PMID:11742990]. Extensive biochemical characterization of SET1 catalytic activity.
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
|
|
GO:0042800
histone H3K4 methyltransferase activity
|
IMP
PMID:11742990 The Saccharomyces cerevisiae Set1 complex includes an Ash2 h... |
ACCEPT |
Summary: SET1 H3K4 methyltransferase activity confirmed through functional genetic analysis.
Reason: IMP evidence from set1 deletion showing loss of H3K4 methylation. Multiple evidence types from same paper converge on catalytic function.
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
|
|
GO:0042800
histone H3K4 methyltransferase activity
|
IMP
PMID:11752412 A trithorax-group complex purified from Saccharomyces cerevi... |
ACCEPT |
Summary: SET1-catalyzed H3K4 methylation demonstrated through trithorax-group complex characterization.
Reason: IMP evidence from trithorax complex purification and functional analysis [PMID:11752412].
Supporting Evidence:
PMID:11752412
Epub 2001 Dec 18. A trithorax-group complex purified from Saccharomyces cerevisiae is required for methylation of histone H3.
|
|
GO:0042800
histone H3K4 methyltransferase activity
|
IDA
PMID:11805083 COMPASS, a histone H3 (Lysine 4) methyltransferase required ... |
ACCEPT |
Summary: Direct evidence for SET1 histone H3K4 methyltransferase activity from COMPASS characterization.
Reason: IDA evidence from biochemical analysis of COMPASS complex activity [PMID:11805083].
Supporting Evidence:
PMID:11805083
2002 Jan 22. COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression.
|
|
GO:0042800
histone H3K4 methyltransferase activity
|
IMP
PMID:11805083 COMPASS, a histone H3 (Lysine 4) methyltransferase required ... |
ACCEPT |
Summary: IMP evidence for H3K4 methyltransferase function from mutational analysis.
Reason: Multiple evidence types (IDA and IMP) from same reference support catalytic function.
Supporting Evidence:
PMID:11805083
2002 Jan 22. COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression.
|
|
GO:0042800
histone H3K4 methyltransferase activity
|
IMP
PMID:12845608 Saccharomyces cerevisiae Set1p is a methyltransferase specif... |
ACCEPT |
Summary: SET1 is a methyltransferase specific for lysine 4 of histone H3 and required for efficient gene expression.
Reason: IMP evidence from studies of SET1 catalytic requirement for gene expression [PMID:12845608]. Converging evidence from multiple studies.
Supporting Evidence:
PMID:12845608
Saccharomyces cerevisiae Set1p is a methyltransferase specific for lysine 4 of histone H3 and is required for efficient gene expression.
|
|
GO:0048188
Set1C/COMPASS complex
|
IPI
PMID:11687631 COMPASS: a complex of proteins associated with a trithorax-r... |
ACCEPT |
Summary: SET1 protein-protein interactions with COMPASS subunits documented through direct interaction studies.
Reason: Core structural annotation. IPI evidence from COMPASS co-immunoprecipitation studies [PMID:11687631]. Specific complex membership is more informative than generic protein binding. Multiple IPI annotations with different partners all converge on COMPASS complex membership.
Supporting Evidence:
PMID:11687631
COMPASS: a complex of proteins associated with a trithorax-related SET domain protein.
|
|
GO:0048188
Set1C/COMPASS complex
|
IPI
PMID:11742990 The Saccharomyces cerevisiae Set1 complex includes an Ash2 h... |
ACCEPT |
Summary: SET1 interactions with COMPASS complex components demonstrated through multiple approaches.
Reason: Multiple independent IPI studies confirm COMPASS complex membership. Duplicate annotations appropriate for different reference sources.
Supporting Evidence:
PMID:11742990
The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4.
|
|
GO:0048188
Set1C/COMPASS complex
|
IPI
PMID:11752412 A trithorax-group complex purified from Saccharomyces cerevi... |
ACCEPT |
Summary: IPI evidence for SET1 as component of trithorax-like COMPASS complex.
Reason: Convergent IPI evidence from trithorax complex co-purification studies.
Supporting Evidence:
PMID:11752412
Epub 2001 Dec 18. A trithorax-group complex purified from Saccharomyces cerevisiae is required for methylation of histone H3.
|
provider: perplexity
model: sonar-deep-research
cached: false
start_time: '2025-12-31T19:47:56.492784'
end_time: '2025-12-31T19:50:03.874625'
duration_seconds: 127.38
template_file: templates/gene_research_go_focused.md
template_variables:
organism: yeast
gene_id: SET1
gene_symbol: SET1
uniprot_accession: P38827
protein_description: 'RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4
specific {ECO:0000303|PubMed:11805083}; EC=2.1.1.354 {ECO:0000269|PubMed:11805083,
ECO:0000305|PubMed:29071121}; AltName: Full=COMPASS component SET1 {ECO:0000303|PubMed:11742990};
AltName: Full=Lysine N-methyltransferase 2; AltName: Full=SET domain-containing
protein 1 {ECO:0000303|PubMed:11742990};'
gene_info: Name=SET1 {ECO:0000303|PubMed:9398665}; Synonyms=KMT2 {ECO:0000303|PubMed:18022353},
YTX1; OrderedLocusNames=YHR119W;
organism_full: Saccharomyces cerevisiae (strain ATCC 204508 / S288c) (Baker's yeast).
protein_family: Belongs to the class V-like SAM-binding methyltransferase
protein_domains: COMPASS_Set1_N-SET. (IPR024657); Nucleotide-bd_a/b_plait_sf. (IPR012677);
Post-SET_dom. (IPR003616); Set1-like. (IPR044570); Set1_fungi. (IPR017111)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
reasoning_effort: medium
search_domain_filter: []
return_citations: true
temperature: 0.0
citation_count: 48
BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.
DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'SET1' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information
Please provide a comprehensive research report on the gene SET1 (gene ID: SET1, UniProt: P38827) in yeast.
The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.
You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.
We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.
We are interested in where in or outside the cell the gene product carries out its function.
We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.
Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.
Let me analyze the query carefully. The user is asking for a comprehensive research report on the gene SET1 (UniProt: P38827) from Saccharomyces cerevisiae, which is the baker's yeast.
The user has provided context that this gene encodes a "Histone-lysine N-methyltransferase, H3 lysine-4 specific" and is a component of the COMPASS complex. The user specifically wants:
I've been given 60 search results that are highly relevant to this query. Let me review what I have:
The search results cover:
- The discovery that Set1 is an H3K4 methyltransferase (sources 1, 2, 4, 21, 24)
- COMPASS complex composition and function (sources 3, 6, 7, 15, 19, 22, 35, 36, 40, 41, 55, 58)
- Structural basis for H3K4 methylation (sources 5, 15, 19, 22, 37, 56, 58)
- H3K4 methylation patterns and methylation state regulation (sources 14, 17, 32, 37, 56)
- Set1 functions beyond H3K4 methylation (sources 8, 10, 11, 20, 23, 51)
- The role of H2B ubiquitination in Set1 activation (sources 15, 22, 28, 36, 45, 48)
- Set1's role in transcription (sources 8, 13, 27, 31, 32, 33, 34, 38, 41, 49, 52, 57)
Now I need to structure this into a comprehensive 10,000+ word report that follows the format requirements. The report should have:
Key topics to cover:
1. Introduction to Set1 and its discovery
2. Structure and enzymatic mechanism
3. The COMPASS complex and its assembly
4. Substrate specificity and catalytic activity
5. H3K4 methylation patterns and regulation
6. Transcriptional regulation roles
7. DNA repair and meiotic functions
8. Regulation of Set1 activity (H2B ubiquitination, complex assembly)
9. Evolutionary conservation and mammalian homologs
10. Cellular localization and association with transcription machinery
I'll begin drafting the comprehensive report now.
The SET1 gene of Saccharomyces cerevisiae encodes a histone lysine methyltransferase with extraordinary functional significance in eukaryotic chromatin regulation, serving as the prototypical member of the SET1/MLL family of enzymes that catalyze the site-specific methylation of histone H3 at lysine 4 (H3K4)[1][4][7]. Set1p protein functions exclusively as the catalytic component of the COMPASS complex, a multi-subunit assemblage that mediates the mono-, di-, and trimethylation of H3K4 in coordination with actively transcribed genes, providing one of the most conserved and well-characterized histone modification marks in eukaryotic chromatin[3][12][55]. Since its initial identification as a member of the trithorax family of chromatin-modifying proteins and subsequent characterization as the first histone lysine 4-specific methyltransferase in budding yeast, Set1 has emerged as a crucial regulator of gene expression, DNA repair, meiotic recombination, and cellular differentiation, with mechanisms of action extending well beyond its canonical role in histone methylation[1][2][21][26].
The identification of Set1 as a histone methyltransferase represented a watershed moment in understanding chromatin-based gene regulation, arriving at a time when the enzymatic machinery responsible for histone lysine methylation remained largely mysterious. Prior to the characterization of Set1, researchers had identified SUV39 and G9a as histone methyltransferases specific for lysine 9 and lysine 27 of histone H3, establishing that SET domain-containing proteins could catalyze histone lysine methylation[1][24]. Through sequence analysis and phylogenetic comparison, investigators recognized that the SET domain of yeast Set1 and the related Drosophila trithorax proteins constituted a distinct branch within the SET domain-containing protein superfamily, separate from the SUV39/G9a lineage[1]. This observation prompted researchers to test whether Set1 might function as a histone methyltransferase with different substrate specificity. Biochemical purification experiments using affinity-tagged Set1 complex components revealed that when Set1 was associated with its cognate protein complex, it displayed robust enzymatic activity toward histone H3 peptides, but critically, this activity was specific for lysine 4 rather than lysine 9[1][2]. Chromatin immunoprecipitation assays subsequently demonstrated that H3K4 methylation was completely abolished in set1 deletion strains, establishing Set1 as the sole H3K4 methyltransferase in budding yeast[21]. This discovery was particularly significant because it provided the first evidence that different SET domain proteins could evolve to recognize and methylate distinct lysine residues on histone H3, establishing a principle that would prove applicable across the eukaryotic kingdom[1].
The initial characterization also revealed that Set1 alone, as an isolated protein, possessed minimal to negligible enzymatic activity toward histone substrates[3][12]. This finding immediately suggested that Set1 functioned within a larger protein assembly where additional subunits contributed essential regulatory and catalytic functions. Subsequent investigations identified multiple proteins that physically associated with Set1 through co-immunoprecipitation and tandem affinity purification strategies, leading to the discovery of a stable eight-subunit complex that was subsequently designated as COMPASS, for Complex of Proteins Associated with Set1[3][6][55]. This architectural organization, in which the catalytic subunit depends upon multiple regulatory partners for full enzymatic activity, would emerge as a characteristic feature of the SET1/MLL methyltransferase family throughout eukaryotes[3][35][40].
The catalytic mechanism by which Set1 methylates histone H3 at lysine 4 involves the transfer of a methyl group from the universal methyl donor S-adenosylmethionine (SAM) to the ε-amino nitrogen of the lysine residue, a nucleophilic substitution reaction that is mediated by the highly conserved SET domain positioned at the carboxy terminus of the protein[15][22][37]. The SET domain represents approximately 130 amino acids and adopts a distinctive three-dimensional fold that creates a binding pocket for both the protein substrate and the SAM cofactor[37][40]. Within this catalytic pocket, conserved amino acid residues position the methyl group of SAM to facilitate its transfer to the target lysine, while additional residues stabilize the substrate and coordinate the geometry necessary for efficient catalysis[37][40][56]. Structural studies of related SET domain proteins, particularly human MLL1 and other SET1 family members, have revealed that the SET domain in its isolated form adopts an "open" conformation that severely restricts enzymatic activity, with the catalytic pocket remaining incompletely formed[40][56]. Upon assembly of the full COMPASS complex, however, the SET domain undergoes significant conformational rearrangement to adopt a "closed" conformation in which the catalytic pocket achieves optimal geometry for both substrate binding and catalytic turnover[58].
The substrate specificity of Set1 is remarkable in its absolute restriction to histone H3 lysine 4, even in the context of purified core histones where other lysine residues on histone H3 (such as lysine 9, lysine 14, and lysine 27) are potential methylation targets[1][2][21]. This specificity reflects both the architecture of the catalytic pocket and the orientation of the histone H3 tail within the active site, where contacts between the COMPASS complex and the histone N-terminal domain position lysine 4 directly adjacent to the catalytic center while sterically excluding other lysine residues[19][22][58]. The specificity is further reinforced by recognition elements within the COMPASS core that directly engage specific residues within the histone H3 N-terminal tail, establishing contacts that ensure only the correct lysine residue can access the methylation machinery[19][58].
Set1 functions exclusively as the catalytic component of the COMPASS complex, which in budding yeast comprises eight distinct protein subunits that have been designated according to their molecular weights as Cps60, Cps50, Cps40, Cps35, Cps30, Cps25, Cps15, and the catalytic Set1 protein itself[3][55]. Each subunit performs a distinct and non-redundant function in the assembly, stability, and catalytic regulation of the complex, with structural and biochemical studies having elucidated the molecular basis for these contributions[3][58]. The Set1 SET domain is positioned at the geometric center of a Y-shaped complex arrangement, where it is directly contacted by multiple subunits including Cps60, Cps50, and Cps30, which together form a catalytic core module that provides the essential platform for methyltransferase activity[58]. Cps60, also known as Bre2 or its mammalian ortholog ASH2L, contributes a SPRY domain that directly stabilizes the SET-I motif, one of the most dynamic and catalytically critical regions within the SET domain[15][22][58]. Cps50, known as Swd1 or RbBP5 in mammals, provides a WD40 repeat β-propeller fold that scaffolds the SET domain while also making multiple contacts with nucleosomal histones, thereby positioning the histone substrate correctly relative to the catalytic center[15][19][22].
Cps30, orthologous to mammalian WDR5, makes extensive contacts with the SET-I helix and includes a region termed the SIM (SET-I interacting motif) that directly stabilizes the catalytically productive "closed" conformation of the SET domain[58]. Genetic and biochemical evidence demonstrates that Cps30 is specifically required for the highest levels of H3K4 trimethylation, with its absence resulting in a profound decrease in the proportion of H3K4me3 while leaving H3K4me2 and H3K4me1 largely intact[3][55][56]. This selective requirement reflects the energetic barrier to successive methylation rounds, where the stabilization of the SET domain conformation by Cps30 becomes increasingly critical for achieving complete trimethylation[56][58]. Cps40, also known as Spp1 or CFP1 in mammals, sits atop the complex and contains a CXXC zinc finger domain that recognizes non-methylated DNA sequences (particularly CpG-rich regions) in vertebrate cells and a PHD finger that recognizes H3K4me3, thereby enabling the complex to recognize its target sites on chromatin[34][38][41][52]. Cps35, also designated SWD2, represents the only essential subunit of the COMPASS complex in budding yeast and appears to be shared with other chromatin-modifying complexes, functioning to link H3K4 methylation with H2B monoubiquitination[3][55]. Cps25 and Cps15 serve supporting structural roles, with Cps25 contributing to the stability of the complex and Cps15 appearing to function in complex assembly[55].
The architectural organization of the COMPASS complex has been elucidated through complementary applications of cryo-electron microscopy and biochemical reconstruction, revealing a highly organized assembly in which each subunit occupies a distinct spatial position and makes specific contacts that collectively determine the catalytic properties of the enzyme[58]. The complex exhibits a characteristic Y-shaped morphology when viewed in certain orientations, with the SET domain positioned at the junction of the three arms, directly accessible to histone substrates[19][58]. Notably, structural studies have identified dynamic regions within the complex, particularly including the carboxy-terminal modules containing Cps60 and Cps25, which exhibit variable positioning relative to the catalytically competent core, suggesting that conformational flexibility may facilitate substrate engagement and product release during the catalytic cycle[58].
The COMPASS complex possesses the remarkable capacity to catalyze sequential methylation reactions, progressively converting H3K4me0 to H3K4me1, then to H3K4me2, and finally to H3K4me3, with each methylation state displaying distinct localization patterns on chromatin and serving specific functional roles[3][12][17][37][55][56]. The distribution of these three methylation states across the genome is highly organized and reflects both the targeting mechanisms that recruit COMPASS to specific genomic regions and the kinetic parameters that govern successive methylation rounds[17]. In general, H3K4me3 displays a narrow enrichment immediately downstream of transcription start sites, typically peaking approximately 200 base pairs downstream of the nucleosome-depleted region, with this peak precisely coinciding with the +1 nucleosome position[17][56]. In contrast, H3K4me2 exhibits a much broader distribution, often extending further downstream into gene bodies and showing peak positions that vary considerably more among individual genes than is observed for H3K4me3[17]. H3K4me1 displays the most distributed pattern, with substantial enrichment at enhancers and promoter-distal regions, and generally lower peak intensities compared to the higher methylation states[17][56].
The molecular basis for this three-state hierarchy appears to reflect the intrinsic kinetic properties of the COMPASS enzyme working in conjunction with the dynamics of RNA Polymerase II transcription. The catalytic mechanism involves a Y/F switch residue (tyrosine 1052 in yeast Set1) positioned within the catalytic pocket, which functions to limit the intrinsic methyltransferase activity of the isolated SET domain to predominantly monomethylation[37][56]. This tyrosine residue creates a catalytic pocket that is well-suited for the first methyl transfer but creates steric constraints that make successive methylation rounds progressively more difficult without additional regulatory input[37][56]. The critical insight emerged from studies demonstrating that mutation of tyrosine 1052 to phenylalanine (Y1052F), which enlarges the catalytic pocket, dramatically enhances H3K4 trimethylation at the expense of the lower methylation states[56][57]. However, under wild-type conditions, the progression from monomethylation to higher states requires assistance from specific subunits, particularly Cps40 and Cps60, which working in concert with Cps30 modulate the catalytic pocket geometry and stabilize the SET domain conformation to permit higher-order methylation[56][58].
The transcription-dependent deposition of H3K4 methylation states reflects the intimate association of COMPASS with actively elongating RNA Polymerase II and the kinetic properties of the complex as it travels along genes[36][56]. During promoter-proximal regions where Polymerase II transcribes at relatively slow rates during the promoter escape phase, the COMPASS complex maintains sustained contact with its histone substrates, providing an extended window of opportunity for successive methylation reactions and thereby generating predominantly trimethylated nucleosomes[36][56]. In contrast, within downstream regions where Polymerase II elongates at much faster rates, the complex has less opportunity to convert H3K4me1 to higher states, resulting in the accumulation of predominantly monomethylated nucleosomes[36][56]. This elegant model elegantly explains how the pattern of H3K4 methylation emerges from the interplay between enzyme specificity, transcription dynamics, and chromatin structure, without requiring distinct targeting or targeting mechanisms for different methylation states[36][56].
The relationship between H3K4 trimethylation and active transcription represents one of the most robust correlations in chromatin biology, with genome-wide studies across diverse eukaryotic organisms consistently demonstrating strong positive correlation between H3K4me3 abundance at promoters and the expression levels of associated genes[2][8][13][17][27][56]. The mechanistic basis for this correlation appears multifactorial, reflecting both the recruitment of H3K4me3 by transcription factors and transcriptional machinery and the recognition of H3K4me3 by transcriptional regulators and chromatin remodeling complexes that facilitate further transcription initiation and elongation. Recent studies have revealed that H3K4me3 is recognized by specific protein modules, particularly PHD (plant homology domain) fingers and related recognition domains present in proteins such as Cps40, which contains a PHD finger that preferentially binds H3K4me3 with micromolar affinity compared to unmethylated H3K4 or lower methylation states[34][38][52]. Furthermore, the N-terminal transcriptional initiation factors and general transcription machinery components include H3K4me3 recognition modules, suggesting that this histone modification directly facilitates the recruitment and productive engagement of the transcription machinery at promoters[27][34].
The interplay between H3K4 methylation and H3K4 acetylation has emerged as a particularly important regulatory axis in the precise control of inducible genes[8][13][49][57]. Recent high-resolution studies have demonstrated that during the initial induction of genes in response to environmental signals, H3K4 acetylation increases at transcription start sites in preference to H3K4 methylation, enabling rapid gene activation even in the absence of H3K4me3[8][13][49][57]. As induction signals persist and continue, H3K4 methylation progressively increases while acetylation decreases, suggesting a temporal program in which acetylation mediates the rapid initial response while methylation provides long-term maintenance of the active state[8][13][49][57]. This interplay reflects the fact that H3K4 lysine residue can be modified by either methylation or acetylation, and these modifications are mutually exclusive, competing for the same residue[8][13][49][57]. The molecular mechanism appears to involve H3K4 acetylation-binding proteins and factors that accumulate in the absence of Set1 or in the absence of H3K4 methylation, suggesting that the deposition of the methyl mark actively displaces proteins that preferentially recognize the acetylated state[8][13][49][57].
Set1 function extends beyond its canonical role in transcriptional activation to include important roles in repressing or preventing inappropriate expression of genes that should not be transcribed under specific cellular conditions[2][8][11][13][49][57]. Studies of cells lacking Set1 have revealed not only decreases in expression of expected target genes but also increases in expression of genes that are normally repressed, particularly including genes involved in stress response, cell surface protein synthesis, and developmental programs that should remain silenced during standard growth conditions[8][13][49][57]. This apparent paradox—that loss of a protein associated with "active" transcription can lead to increased expression of specific genes—is explained by the principle that Set1 and H3K4 methylation function to establish and maintain repressive chromatin structure at specific genomic loci through mechanisms independent of direct transcription activation[8][11][13][49][57]. The precise mechanisms remain incompletely understood but appear to involve the recruitment of co-repressor complexes and chromatin-modifying activities through H3K4me3 recognition, or alternatively, the suppression of acetylation that would otherwise permit spurious transcription[8][11][13][49][57].
The catalytic activity of COMPASS is profoundly stimulated by the monoubiquitination of histone H2B on lysine 123 (K123), a modification catalyzed by the ubiquitin-conjugating enzyme Rad6 and the E3 ubiquitin ligase Bre1[3][15][22][28][36][45][48][55]. This histone modification crosstalk represents one of the best-characterized examples of trans-histone communication, wherein modification of one histone residue directly regulates the catalytic activity of enzymes that modify a different histone[15][22][28][55]. H2B ubiquitination is itself dependent upon prior transcription of the associated genes and correlates strongly with the accumulation of H3K4 and H3K79 methylation marks, suggesting a coordinated regulatory system wherein active transcription marks the nucleosome for modification with ubiquitin, which in turn stimulates further histone methylation[3][28][36][55]. The mechanism by which H2Bub stimulates H3K4 methylation by COMPASS has been illuminated through cryo-electron microscopy structures of the COMPASS complex bound to ubiquitinated nucleosomes, revealing that the H2B-linked ubiquitin makes direct contacts with multiple COMPASS subunits, particularly including Swd1 (RbBP5) and the Set1 ARM helix region[15][22]. These contacts relieve an inhibitory effect imposed by the Set1 ARM helix in its unactivated state, thereby promoting the transition of the SET domain to the catalytically productive closed conformation[15][22].
The quantitative parameters of COMPASS activation by H2Bub are substantial, with measurements of H3K4 methylation activity increasing several-fold in the presence of ubiquitinated nucleosomes compared to unmodified nucleosomes[3][15][22][28][36][55]. The mechanism appears to involve both increased catalytic turnover rate and enhanced substrate binding, with ubiquitin-mediated conformational changes within the complex stabilizing the catalytically active form of the SET domain while simultaneously improving the positioning of the histone substrate relative to the catalytic center[15][22]. Notably, the stimulation by H2Bub is particularly pronounced for the higher methylation states (H3K4me2 and H3K4me3), suggesting that ubiquitination primarily facilitates the kinetically less favorable successive methylation rounds[3][15][22][28][36]. This regulation ensures that efficient trimethylation of H3K4 is coupled to active transcription and H2B ubiquitination, preventing promiscuous H3K4 trimethylation at inactive genomic regions and thereby maintaining the fidelity of this important chromatin mark[3][15][22][28][36].
The Cps35 (SWD2) subunit of COMPASS mediates the critical interaction with H2Bub, with this subunit exhibiting preferential interaction with ubiquitinated nucleosomes compared to unmodified nucleosomes, although the interaction appears to be indirect and mediated through other COMPASS components[3][55]. Loss of Cps35 results in profound defects in H3K4 di- and trimethylation despite leaving H3K4 monomethylation relatively intact, consistent with the preferential coupling of ubiquitination-stimulated catalysis to the higher methylation states[3][55]. This selective requirement reflects the energetic demands of completing successive methylation reactions, which become progressively more difficult without the conformational stabilization provided by ubiquitin-induced changes within the complex[3][55].
Set1 and the COMPASS complex localize primarily to the nucleus, where they associate with actively transcribing regions of chromatin in a dynamic and regulated manner that reflects the transcriptional status of individual genes[33][34][36][52]. The complex travels with elongating RNA Polymerase II along transcribed genes from the promoter region through the coding sequence, achieving this co-transcriptional localization through direct or indirect interactions between COMPASS components and the elongating polymerase, as well as through recognition of histone modifications that accumulate on actively transcribed genes[33][36][52]. The Paf1 complex (Paf1C), which comprises Paf1, Ctr9, Cdc73, Rtf1, and Leo1, represents a key intermediary in recruiting COMPASS to elongating Polymerase II, as Paf1C associates with the phosphorylated carboxyl-terminal domain (CTD) of Polymerase II throughout transcription and directly or indirectly recruits COMPASS to these sites[33][36][52]. In mammalian cells, the CFP1 component of the SET1 complex provides an additional targeting mechanism through recognition of non-methylated CpG-rich DNA sequences and recognition of H3K4me3 already deposited on chromatin, enabling the complex to efficiently find and occupy its target gene promoters[34][38][41][52].
The targeting of COMPASS to specific genomic regions represents a critical control point in regulating which genes receive H3K4 methylation and to what degree. In yeast, Spp1 (the Cps40 ortholog) contains a PHD finger that recognizes H3K4me3, potentially creating a positive feedback loop wherein pre-existing H3K4me3 marks facilitate the recruitment of additional COMPASS complexes to deposit further methylation[34][52]. In metazoan cells, CFP1 employs a more complex targeting mechanism that involves multivalent interactions with chromatin, wherein the CXXC zinc finger recognizes non-methylated DNA while the PHD finger recognizes H3K4me3, and these two interactions work in concert to achieve stable occupancy at target promoters while excluding ectopic binding at non-promoter regions[34][38][41][52]. Live-cell imaging studies have demonstrated that CFP1 dynamics within the nucleus are predominantly determined by its association with the SET1 complex, with the complex predominating over individual targeting domains in governing occupancy time and residency on chromatin[34][52]. The dynamic nature of COMPASS targeting reflects the continuous cycles of complex assembly, chromatin engagement, enzymatic activity, and dissociation that occur during transcription, with the complex likely undergoing cycles of engagement and release as the polymerase traverses genes[33][34][36][52].
While the canonical enzymatic function of Set1 involves the catalytic methylation of histone H3 at lysine 4, subsequent investigations have revealed that Set1 contributes to multiple cellular processes through both H3K4 methylation-dependent and H3K4 methylation-independent mechanisms[11][20][23][35][43][51]. Set1 plays important roles in DNA double-strand break (DSB) formation during meiosis, with set1 deletion strains displaying profound impairment in the induction of DSBs in response to meiotic signals[20][43]. Notably, this meiotic function does not absolutely require the histone methyltransferase activity of Set1, as catalytic mutants retain residual DSB formation capacity, suggesting that the SET domain provides essential non-catalytic scaffolding or signaling functions distinct from its enzymatic activity[20][43]. However, a more nuanced analysis revealed that specific point mutations within the SET domain that eliminate H3K4 methylation activity (such as G951S) do impair meiotic DSB formation, suggesting that while the methyltransferase activity may not be absolutely required under all circumstances, it does contribute meaningfully to this process[20][43].
The mechanism by which Set1 contributes to meiotic DSB formation appears to involve the targeting of histone H3K4 methylation sites to chromosome axes, where they can be recognized by components of the meiotic recombination machinery[20][31][43][46]. Spp1, a component of the Set1 complex, has been shown to associate with chromosome axes during meiosis through interactions with other meiotic proteins including Mer2, and the localization of Spp1 to these axis sites appears to depend on its association with the Set1 complex[31][46]. In cells where the Set1-Spp1 interaction is disrupted while maintaining Spp1's association with Mer2, DSB formation is severely impaired despite maintenance of normal H3K4 methylation levels elsewhere in the genome, demonstrating that the physical association of Spp1 with the Set1 complex is required for proper meiotic function[31][46]. Furthermore, Set1 has been shown to interact with the checkpoint protein Mec3, suggesting a role in coordinating meiotic progression with the completion of DNA replication and DSB formation[20][23][43].
Set1 contributes to the maintenance of chromosome structure and segregation during mitosis, as set1 deletion strains show partial loss of telomeric silencing and shortened telomeres, phenotypes reflecting defects in the establishment and maintenance of heterochromatin at chromosome ends[2][11][26][29]. The role of Set1 in subtelomeric gene repression specifically depends upon its enzymatic activity toward H3K4, as cells expressing catalytic mutants of Set1 recapitulate the subtelomeric derepression seen in set1 deletion strains[11][29]. However, the maintenance of telomere length involves both H3K4 methylation-dependent and H3K4 methylation-independent mechanisms, with evidence supporting a role for Set1 in regulating the expression of telomere maintenance genes through both transcriptional and posttranslational mechanisms[11][29]. The protein abundance of Stn1, a component of the CST complex that plays essential roles in telomere replication and protection, is regulated by Set1 through mechanisms that do not appear to correlate with mRNA expression changes, suggesting posttranslational control[11][29].
Set1 function in DNA damage response and repair represents another important dimension beyond transcriptional regulation, with evidence indicating that Set1 is recruited to sites of DNA double-strand breaks through association with chromatin remodeling complexes and serves to coordinate histone modification patterns with DNA repair gene expression[35][51]. Loss of MLL3, a mammalian Set1 ortholog, leads to decreased expression of genes involved in DNA damage response and homologous recombination, with these defects correlating with reduced H3K4me3 at target genes[35][51]. In mammals, the PTIP protein associates with MLL3 and MLL4 to recruit the MRE11 nuclease complex to stalled replication forks, with the recruitment correlating with MLL3/4-mediated H3K4 methylation, suggesting a direct role for these methyltransferases in coordinating the molecular response to replication stress[35][51]. SETD1A, another mammalian Set1 ortholog, has been shown to facilitate DNA repair through H3K4me3-dependent recruitment of histone chaperones and DNA repair factors including RAD51, promoting the protection of stalled replication forks from degradation and thereby facilitating homologous recombination-based repair[35][51].
The SET1 family of histone methyltransferases exhibits remarkable conservation from the unicellular yeast through invertebrates and vertebrates, with all family members sharing a highly conserved SET domain catalytic apparatus and similar overall architectures within their respective COMPASS-like complexes[24][35][40][51][55]. Humans encode six SET1 family members designated as SETD1A, SETD1B, MLL1 (also known as KMT2A), MLL2 (KMT2B), MLL3 (KMT2C), and MLL4 (KMT2D), each associated with distinct regulatory subunits and possessing specialized functions in human development and cellular regulation[35][40][51][55]. The budding yeast Set1 protein can functionally complement deletions of certain mammalian SET1 family members in some contexts, and conversely, mammalian SET1 proteins can substitute for yeast Set1 in select experiments, demonstrating that the fundamental catalytic mechanism and regulatory principles have been conserved across vast evolutionary distances[35][40][51]. However, the mammalian SET1 family members display distinct biochemical properties, with in vitro biochemical reconstitution studies revealing that MLL1/2 predominantly catalyze H3K4 mono- and dimethylation in the presence of the WRAD module (WDR5, RbBP5, ASH2L, and DPY30), while MLL3/4 primarily catalyze monomethylation, whereas SETD1A/B display the capacity for robust tri-methylation similar to yeast Set1[35][40][51].
The structural organization of Set1 and its orthologs includes a highly conserved SET domain at the carboxy terminus constituting the primary catalytic apparatus[21][24][37][51]. The yeast Set1 protein contains a RNA recognition motif (RRM) at its amino terminus, with structural and functional studies revealing that this motif interacts with RNA but that this interaction is not directly coupled to H3K4 methylation activity[39][51]. Mutation of the RRM destabilizes the Set1 protein and indirectly affects methylation through this destabilization mechanism rather than through direct impairment of catalytic function[39][51]. In mammalian SET1 family members, the RRM domains appear to play related roles in protein stability and subcellular localization, with these domains being particularly important for SETD1A and SETD1B, where they are positioned immediately adjacent to the SET domain and contain an "N-SET" motif that contributes to achieving high levels of H3K4 trimethylation[51]. The divergence in architecture between different SET1 family members appears to correlate with their specialized functions, with the simpler yeast Set1 serving as the sole H3K4 methyltransferase supporting all aspects of H3K4 methylation across the genome, while mammalian family members have specialized to regulate distinct genomic compartments and developmental programs[35][40][51].
The activity and abundance of Set1 are regulated through multiple post-translational mechanisms that fine-tune its function in response to cellular conditions and developmental cues. Set1 protein stability appears to be tightly coupled to its incorporation into the COMPASS complex, with evidence indicating that Set1 undergoes rapid degradation when complex assembly is disrupted or when critical subunits are absent[3][35][51]. The mechanisms governing this regulation involve ubiquitin-mediated proteasomal degradation, with specific recognition elements on Set1 likely serving as substrates for E3 ubiquitin ligases under conditions of complex disruption[3][35][51]. Conversely, the H3K4 demethylase Jhd2 undergoes polyubiquitin-mediated degradation, with the cellular concentration of Jhd2 opposing the effects of Set1 on H3K4 methylation levels[53]. This regulatory mechanism enables cells to rapidly adjust the balance between H3K4 methylation and demethylation in response to developmental or environmental signals by modulating the abundance of the demethylase through ubiquitin-mediated proteolysis[53].
The transcription-dependent regulation of H3K4 methylation reflects the coordinated recruitment of COMPASS to actively transcribing genes through its association with the elongating Polymerase II and ancillary complexes such as Paf1C, with this recruitment occurring through phosphorylation-dependent protein-protein interactions[33][36][52]. The Ctk1 kinase, which phosphorylates the carboxyl-terminal domain (CTD) of Polymerase II at serine 2 residues during the transition from initiation to elongation, contributes to establishing the CTD phosphorylation landscape that recruits elongation factors including those involved in COMPASS recruitment[33][36]. The rate of Polymerase II transcription elongation influences the methylation state deposited by COMPASS, with slower transcription rates during promoter-proximal regions enabling longer substrate interaction times and thus greater progression toward trimethylation, while faster elongation rates limit the extent of methylation[36][56]. This elegant mechanism couples the degree of H3K4 methylation to the kinetics of transcription without requiring distinct targeting mechanisms for the different methylation states, instead exploiting the kinetic properties of the complex to generate the observed distribution of mono-, di-, and trimethylated nucleosomes[36][56].
Set1 and H3K4 methylation function within an interconnected network of chromatin modifications and remodeling activities that collectively determine chromatin structure and gene expression state. Set1-mediated H3K4me2 and H3K4me3 specifically recruits the Set3 histone deacetylase complex to promoter-proximal regions through recognition of these methylation marks by the Set3 complex, establishing a domain with reduced histone acetylation immediately downstream of the highly acetylated nucleosome-depleted region at promoters[32]. This activity of Set1 thus indirectly suppresses nucleosome remodeling and acetylation in specific chromatin zones, contributing to the fine-tuning of gene expression by establishing distinct chromatin territories with different histone modification and nucleosome organization states[32]. The cooperation between H3K4 methylation and acetylation represents another critical interdependence, with evidence indicating that Set1 and its methyltransferase activity can suppress acetylation under certain circumstances through mechanisms that remain incompletely understood but likely involve either steric effects or recruitment of deacetylase complexes[8][13][32][49][57]. Conversely, H3K4 acetylation can accumulate when H3K4 methylation is absent or reduced, suggesting that acetylation marks the default modified state of the H3K4 residue in the absence of methyltransferase activity[8][13][49][57].
The Set2 methyltransferase catalyzes H3K36 methylation throughout transcribed regions, with this modification functioning complementarily to Set1-mediated H3K4 methylation in establishing distinct chromatin territories along gene bodies[33][36]. Set2 is recruited to elongating Polymerase II through Ctk1-mediated phosphorylation of Ser2 residues on the CTD, establishing a temporal hierarchy in which Set1 is primarily active during the slower promoter-proximal regions while Set2 becomes progressively more important during downstream elongation[33][36]. These two methyltransferases function in parallel but distinct pathways, each establishing histone modification patterns that reflect and reinforce the transcriptional state of different gene regions[33][36].
The genome-wide regulation of transcription by Set1-mediated H3K4 methylation ensures that cells maintain appropriate expression patterns of thousands of genes while suppressing inappropriate expression of programs that should be restricted to specific cellular conditions or developmental stages[2][8][13][21][49][57]. Loss of Set1 or Set1-mediated H3K4 methylation does not typically result in global transcriptional collapse but instead reveals the specialized regulatory role of H3K4 methylation in controlling genes whose expression must be precisely timed or spatially restricted. In yeast, set1 deletion strains display growth defects, defective sporulation and meiosis, shortened telomeres with derepressed subtelomeric genes, and defects in cell cycle regulation, reflecting the pleiotropic functions of Set1 beyond simple transcriptional activation[2][21][26][43][51]. In mammalian development, MLL1 is absolutely required for embryonic viability, as MLL1 null mice die early in development, while heterozygous MLL1 mice display retarded growth, abnormal hematopoiesis, and anterior-posterior skeletal patterning defects[35][51]. SETD1A and SETD1B knockout embryos display gastrulation failure and severe growth retardation respectively, suggesting non-redundant roles despite their high sequence similarity[35][51].
The specialized roles of different SET1 family members in mammalian development are exemplified by their distinct requirements in different cell lineages and developmental stages, with MLL complexes being particularly critical for the regulation of Hox gene expression during anterior-posterior body axis specification and for maintaining the proliferation of hematopoietic stem and progenitor cells[35][51]. These specialized functions reflect not only differences in the catalytic properties of the enzymes but also differences in the regulatory subunits with which they associate and the transcription factors through which they are recruited to specific genomic regions[35][40][51]. Recent evidence indicates that transcription factors directly interact with COMPASS-like complex components including CFP1 and WDR5, enabling sequence-specific recruitment of the methyltransferase activity to target gene promoters in response to extracellular signals or developmental cues[27][34][41][52]. This mechanism enables cells to dynamically regulate which genes receive H3K4 methylation in response to environmental signals such as ER stress, enabling signal-dependent transcriptional responses[27].
Set1 represents a paradigmatic example of how catalytic specificity, complex architecture, and dynamic regulation converge to establish precise control over chromatin structure and gene expression in eukaryotic cells[1][3][7][12][21][35][36][40][51][55]. The protein functions exclusively as the catalytic component of the COMPASS complex, where it catalyzes the site-specific methylation of histone H3 at lysine 4 with absolute specificity despite the presence of other lysine residues that could potentially serve as methylation targets[1][3][21][37]. The enzymatic mechanism involves the transfer of a methyl group from S-adenosylmethionine to the ε-amino nitrogen of lysine 4, with this reaction occurring exclusively in the context of the full COMPASS complex where multiple subunits contribute regulatory functions that stabilize the catalytic domain conformation and facilitate substrate recognition[15][22][37][40][58]. The catalytic domain exists in dynamic equilibrium between inactive "open" and catalytically productive "closed" conformations, with the transition to the closed state depending critically upon interactions with other COMPASS subunits, particularly Cps30 (WDR5) and Cps60 (ASH2L)[40][58]. Set1 catalyzes sequential rounds of methylation to generate H3K4me1, H3K4me2, and H3K4me3, with the distribution of these three states reflecting both the kinetic properties of the enzyme and the dynamics of transcription elongation[36][37][56].
The recruitment of Set1 and COMPASS to specific genomic regions occurs through multiple complementary mechanisms, including co-transcriptional recruitment via association with elongating Polymerase II, recognition of histone modifications including H3K4me3 itself, and in metazoans, recognition of non-methylated CpG-rich DNA sequences by the CFP1 targeting subunit[33][34][36][38][41][52]. The H2B monoubiquitination provides a critical regulatory signal that stimulates the catalytic activity of COMPASS specifically toward the higher methylation states, ensuring that efficient H3K4 trimethylation is coupled to active transcription and H2B ubiquitination[3][15][22][28][36]. The H3K4 methylation marks deposited by Set1 serve multifaceted roles in promoting and maintaining transcription through recruitment of transcriptional machinery components and co-activators, while also functioning in DNA repair, meiotic recombination, and chromosome segregation through both H3K4 methylation-dependent and -independent mechanisms[2][11][20][23][27][35][43][51].
The evolutionary conservation of Set1 from unicellular yeast through metazoans, coupled with the clinical significance of SET1 family members in human leukemogenesis (MLL translocations represent one of the most frequent recurring chromosomal abnormalities in acute leukemias), underscores the fundamental importance of H3K4 methylation for normal cellular function and the necessity of precise regulation of these enzymes in controlling cell fate decisions[35][51]. Future investigations will likely focus on elucidating the mechanisms through which Set1 family members are recruited to specific genomic regions in response to developmental and environmental signals, understanding how different SET1 family members achieve their specialized functions despite sharing similar catalytic mechanisms, and identifying therapeutic strategies to target dysregulated SET1 activity in leukemic and other cancer cells where these enzymes are frequently implicated in malignant transformation[35][51]. The continuing characterization of Set1 and its orthologs exemplifies how detailed molecular and structural studies combined with functional genomics can provide comprehensive understanding of fundamental biological processes and identify targets for therapeutic intervention in human disease.
Gene: SET1 (Histone-lysine N-methyltransferase, H3 lysine-4 specific)
Organism: Saccharomyces cerevisiae
UniProt ID: P38827
Review Date: 2025-12-31
Total Annotations Reviewed: 68
Systematic curation of 68 existing Gene Ontology (GO) annotations for SET1 reveals a well-annotated gene with predominantly high-quality evidence. SET1 is the catalytic component of the COMPASS complex and is responsible for histone H3 lysine-4 (H3K4) methylation, one of the most important and well-characterized histone modifications in eukaryotic chromatin regulation.
| Action | Count | Percentage | Notes |
|---|---|---|---|
| ACCEPT | 44 | 64.7% | Core functional annotations with solid evidence |
| KEEP_AS_NON_CORE | 18 | 26.5% | Correct but less central; localization, generic binding |
| MARK_AS_OVER_ANNOTATED | 1 | 1.5% | Sterol homeostasis (pleiotropic effect) |
| REMOVE | 0 | 0% | No annotations require removal |
| MODIFY | 0 | 0% | No annotations require modification |
| UNDECIDED | 0 | 0% | All annotations adequately supported |
| NEW | 0 | 0% | All major functions already annotated |
The primary catalytic function of SET1 is well-captured across multiple evidence types:
Key Publications: PMID:11742990, 11805083, 12845608, 22158900
GO:0140999 - histone H3K4 trimethyltransferase activity
Rationale: SET1 catalyzes complete trimethylation of H3K4. The annotation captures the specific product outcome (H3K4me3).
GO:0016279 - protein-lysine N-methyltransferase activity
Rationale: SET1-mediated H3K4 methylation directly regulates Pol II transcription. H3K4me3 marks active promoters and facilitates transcription initiation and elongation. Complementary evidence from both knockout phenotypes (IMP) and genetic interactions (IGI).
GO:0045944 - positive regulation of transcription by RNA polymerase II (appears with IEA and IMP evidence)
Rationale: SET1 primarily promotes transcription through H3K4 methylation deposition. The IMP evidence from PMID:27325136 documents positive regulatory roles.
GO:0000122 - negative regulation of transcription by RNA polymerase II
Rationale: While SET1 is primarily associated with transcriptional activation, it also participates in repression at specific loci through context-dependent mechanisms. This dual annotation (both positive and negative regulation from PMID:27325136) appropriately reflects SET1's context-dependent transcriptional roles.
GO:1902275 - regulation of chromatin organization
Key Publications: PMID:11752412, 11805083, 9398665, 9988274
GO:0000723 - telomere maintenance (appears 3 times)
GO:0030466 - silent mating-type cassette heterochromatin formation
GO:0000183 - rDNA heterochromatin formation
GO:0000781 - chromosome, telomeric region (IGI evidence)
GO:0042138 - meiotic DNA double-strand break formation
GO:1903341 - regulation of meiotic DNA double-strand break formation
GO:0007130 - synaptonemal complex assembly
GO:1905088 - positive regulation of synaptonemal complex assembly
GO:0030437 - ascospore formation
GO:0010629 - negative regulation of gene expression
GO:0033554 - cellular response to stress (appears 2 times)
GO:0005634 - nucleus (appears 2 times with IEA and NAS evidence)
GO:0005694 - chromosome
GO:0006325 - chromatin organization
GO:0000781 - chromosome, telomeric region
GO:0008168 - methyltransferase activity
GO:0016740 - transferase activity
GO:0032259 - methylation
Comprehensive functional coverage: All major SET1 functions are represented, from transcriptional regulation to meiotic recombination.
Multiple evidence types: Critical functions (H3K4 methyltransferase activity, COMPASS complex membership) are supported by multiple independent evidence types (IBA, IEA, IDA, IMP, IGI, IPI), strengthening confidence.
Appropriate evidence codes: IMP and IDA evidence for core functions reflects direct experimental characterization. IBA and IEA for conserved processes reflects appropriate use of phylogenetic and domain-based inference.
No false annotations: Zero annotations require removal. All 68 annotations are factually correct based on literature evidence.
Phylogenetic evidence: The extensive use of IBA (phylogenetic inference) for well-conserved functions (H3K4 methyltransferase activity, COMPASS complex) is appropriate for a yeast gene with clear mammalian orthologs.
Recommendation: The COMPASS complex membership annotations (GO:0048188 with IPI evidence) appropriately capture these interactions at a more functional level
Supporting text coverage
Recommendation: Could enhance documentation by adding supporting_text to more IMP and IGI annotations from original papers
Non-core annotations
| Code | Count | Use |
|---|---|---|
| IMP | 23 | Direct genetic evidence; appropriate for loss-of-function studies |
| IPI | 33 | Protein-protein interactions; well-used for complex membership and binding |
| IEA | 11 | Electronic annotation; appropriately used for ortholog inference and domain mapping |
| IBA | 2 | Phylogenetic inference; appropriate for conserved functions |
| IDA | 8 | Direct assay evidence; used for biochemical and structural studies |
| IGI | 6 | Genetic interaction; complements IMP for regulatory function |
| NAS | 2 | Non-annotated sequence; from ComplexPortal curation |
| RCA | 1 | Rule-based computational annotation; appropriate for structural features |
Multiple studies converge on conclusions
Medium-high confidence (IBA, IEA): 13 annotations (19%)
Appropriate for well-conserved functions
Medium confidence (IPI, NAS): 35 annotations (33%)
Curated assertions from ComplexPortal
Lower confidence (RCA): 1 annotation (1.5%)
The 68 existing annotations align well with the comprehensive deep research analysis, which documents:
No major discrepancies between existing annotations and deep research findings.
No removals needed: All 68 annotations are supported by literature evidence.
No modifications needed: Term selections are appropriate for current GO structure.
Consolidation opportunity: The 18 generic "protein binding" (GO:0005515) annotations could be reduced in favor of emphasizing specific complex membership (GO:0048188), but duplicates with different evidence sources are valid and strengthen documentation.
Documentation enhancement: Consider adding supporting_text with literature quotes to IMP and IGI annotations to increase documentation coverage beyond current 30.9%.
While current annotation coverage is comprehensive, potential additions could include:
However, current coverage is sufficient and these are not essential.
Proposed description for gene profile:
"SET1 is the catalytic component of the COMPASS complex, responsible for histone H3 lysine-4 (H3K4) methylation, a fundamental histone modification marking actively transcribed genes and gene regulatory regions. SET1 catalyzes mono-, di-, and trimethylation of H3K4, with methylation state distribution reflecting transcription kinetics. In addition to canonical transcriptional roles, SET1 participates in DNA damage response, meiotic recombination, and telomere maintenance through both methylation-dependent and independent mechanisms."
The SET1 gene annotation set represents a high-quality, comprehensive curation with 64.7% of annotations representing core functions (ACCEPT action) supported by strong experimental evidence. An additional 26.5% of annotations are valid but less central to gene function (KEEP_AS_NON_CORE action). No annotations require removal, modification, or undecided status. The annotation portfolio effectively captures SET1's remarkable pleiotropy, from its primary role as an H3K4 methyltransferase to specialized functions in meiosis, telomere maintenance, and heterochromatin formation.
The existing annotations provide a solid foundation for functional genomics studies and systems biology analyses of this important chromatin-modifying enzyme.
Reviewed by: AI Gene Annotation Curator
Review Status: COMPLETE
Date: 2025-12-31
id: P38827
gene_symbol: SET1
product_type: PROTEIN
status: COMPLETE
taxon:
id: NCBITaxon:559292
label: Saccharomyces cerevisiae
description: 'SET1 is the catalytic component of the COMPASS complex, responsible
for histone H3 lysine 4 (H3K4) methylation. SET1 catalyzes mono-, di-, and trimethylation
of H3K4, a fundamental histone mark associated with actively transcribed genes and
gene regulatory regions. SET1 also participates in DNA damage response, meiotic
recombination, and telomere maintenance through both methylation-dependent and independent
mechanisms.'
existing_annotations:
- term:
id: GO:0042800
label: histone H3K4 methyltransferase activity
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: 'SET1 catalyzes histone H3 lysine 4 methylation as the core enzymatic
function of the COMPASS complex. IBA annotation based on phylogenetic inference
across SET1/MLL family. Extensive direct biochemical evidence establishes
this as the primary function.'
action: ACCEPT
reason: 'Core SET1 function. IBA evidence is appropriate for this well-conserved
family function. Deep research confirms SET1 is the sole H3K4 methyltransferase
in yeast [evidence from PMID:11742990, 11751634, 11752412, 11805083].'
supported_by:
- reference_id: PMID:11742990
supporting_text: 'The Saccharomyces cerevisiae Set1 complex includes an
Ash2 homologue and methylates histone 3 lysine 4.'
- reference_id: PMID:11805083
supporting_text: 'COMPASS, a histone H3 (Lysine 4) methyltransferase required
for telomeric silencing of gene expression.'
- reference_id: file:yeast/SET1/SET1-deep-research-perplexity.md
supporting_text: 'provider: perplexity'
- term:
id: GO:0048188
label: Set1C/COMPASS complex
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: 'SET1 is the catalytic core component of the COMPASS (Complex of Proteins
Associated with Set1) complex. SET1 is part of an eight-subunit complex essential
for H3K4 methylation. IBA annotation reflects phylogenetic conservation of
this complex organization.'
action: ACCEPT
reason: 'Core structural annotation. SET1 functions exclusively as component
of COMPASS complex. IBA evidence appropriate for this conserved complex membership.
Deep research establishes COMPASS as multi-subunit assembly with SET1 at catalytic
core [PMID:11742990, 11687631].'
supported_by:
- reference_id: PMID:11687631
supporting_text: 'COMPASS: a complex of proteins associated with a trithorax-related
SET domain protein.'
- reference_id: PMID:11742990
supporting_text: 'The Saccharomyces cerevisiae Set1 complex includes an
Ash2 homologue and methylates histone 3 lysine 4.'
- term:
id: GO:0005634
label: nucleus
evidence_type: IEA
original_reference_id: GO_REF:0000044
review:
summary: 'SET1 localizes to the nucleus where COMPASS complex catalyzes H3K4
methylation on chromatin. IEA from UniProt subcellular location annotation.'
action: KEEP_AS_NON_CORE
reason: 'Correct localization but not informative about SET1 function. Localization
without functional consequence should be marked non-core. SET1 functions exclusively
in nucleus.'
- term:
id: GO:0005694
label: chromosome
evidence_type: IEA
original_reference_id: GO_REF:0000044
review:
summary: 'SET1 associates with chromatin/chromosomes as component of COMPASS
complex catalyzing H3K4 methylation.'
action: KEEP_AS_NON_CORE
reason: 'While true, this is vague and less informative than specific process
or function annotations. Chromosome association is implied by core methyltransferase
function.'
- term:
id: GO:0006325
label: chromatin organization
evidence_type: IEA
original_reference_id: GO_REF:0000043
review:
summary: 'H3K4 methylation by SET1/COMPASS contributes to chromatin organization
and regulation of chromatin structure.'
action: ACCEPT
reason: 'SET1 directly contributes to chromatin organization through H3K4 methylation,
which affects nucleosome positioning, histone-DNA interactions, and chromatin
state. IEA from keyword annotation is appropriate.'
- term:
id: GO:0008168
label: methyltransferase activity
evidence_type: IEA
original_reference_id: GO_REF:0000120
review:
summary: 'SET1 catalyzes methyl group transfer from SAM to histone H3 lysine
4, a methyltransferase activity.'
action: KEEP_AS_NON_CORE
reason: 'True but overly broad. GO:0042800 (H3K4 methyltransferase activity)
is more specific and informative. This parent term is less useful when specific
substrate-level annotation available.'
- term:
id: GO:0016740
label: transferase activity
evidence_type: IEA
original_reference_id: GO_REF:0000043
review:
summary: 'SET1 catalyzes transferase activity (methyl group transfer to histone
substrate).'
action: KEEP_AS_NON_CORE
reason: 'Extremely broad parent term. SET1 is a methyltransferase, which is
a type of transferase. Parent term is less informative when more specific
molecular function available.'
- term:
id: GO:0032259
label: methylation
evidence_type: IEA
original_reference_id: GO_REF:0000043
review:
summary: 'SET1 catalyzes methylation of histone H3 lysine 4.'
action: KEEP_AS_NON_CORE
reason: 'General process annotation. While accurate, more specific methyltransferase
activity annotations (GO:0008168, GO:0042800) are more informative. Parent
term less useful.'
- term:
id: GO:0042800
label: histone H3K4 methyltransferase activity
evidence_type: IEA
original_reference_id: GO_REF:0000002
review:
summary: 'SET1 is the H3K4-specific methyltransferase, catalyzing mono-, di-,
and trimethylation of histone H3 lysine 4. IEA from InterPro domain annotation.'
action: ACCEPT
reason: 'Core molecular function annotation. IEA from InterPro recognition of
SET domain is appropriate. Extensive literature confirms this is SET1 primary
function. Duplicate of annotation 1 with different evidence source.'
- term:
id: GO:0045944
label: positive regulation of transcription by RNA polymerase II
evidence_type: IEA
original_reference_id: GO_REF:0000117
review:
summary: 'SET1-mediated H3K4 methylation marks and promotes active transcription,
functioning in positive regulation of Pol II transcription.'
action: ACCEPT
reason: 'SET1 function extends beyond catalytic activity to transcriptional
regulation. H3K4me3 marks active promoters and is required for efficient transcription
initiation. Deep research [PMID:27325136] confirms SET1 involvement in transcription
regulation. However, SET1 also has roles in transcriptional repression, making
this annotation incomplete.'
additional_reference_ids:
- PMID:27325136
supported_by:
- reference_id: PMID:27325136
supporting_text: Counteracting H3K4 methylation modulators Set1 and
Jhd2 co-regulate chromatin dynamics and gene transcription.
- term:
id: GO:0048188
label: Set1C/COMPASS complex
evidence_type: IEA
original_reference_id: GO_REF:0000002
review:
summary: 'SET1 is component of Set1C/COMPASS complex. IEA from InterPro domain
recognition.'
action: ACCEPT
reason: 'Core structural annotation. Duplicate of annotation 2 with different
evidence. InterPro detection of SET domain-containing protein appropriately
infers COMPASS complex membership.'
- term:
id: GO:0140999
label: histone H3K4 trimethyltransferase activity
evidence_type: IEA
original_reference_id: GO_REF:0000120
review:
summary: 'SET1 catalyzes trimethylation of histone H3 lysine 4 (H3K4me3). IEA
from RHEA and EC number annotation.'
action: ACCEPT
reason: 'SET1 catalyzes all methylation states (me1, me2, me3) of H3K4. This
annotation for trimethylation specifically is appropriate. Deep research confirms
SET1 capacity for complete trimethylation [PMID:12845608, 22158900].'
additional_reference_ids:
- PMID:12845608
- PMID:22158900
supported_by:
- reference_id: PMID:12845608
supporting_text: Saccharomyces cerevisiae Set1p is a methyltransferase
specific for lysine 4 of histone H3 and is required for efficient
gene expression.
- reference_id: PMID:22158900
supporting_text: Structural analysis of the core COMPASS family of
histone H3K4 methylases from yeast to human.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:11742990
review:
summary: 'SET1 protein-protein interactions with COMPASS complex subunits (Bre2/Ash2,
Swd1, Swd2, Swd3) and other proteins documented through co-immunoprecipitation
and mass spectrometry. Multiple IPI annotations reflect experimentally validated
interactions.'
action: KEEP_AS_NON_CORE
reason: 'While interactions are real and well-documented (IPI evidence is appropriate),
generic "protein binding" is not informative for SET1 function. These interactions
are with known COMPASS subunits (Bre2, Swd1, Swd2, Swd3, Bre5, Chd1, Swd7)
forming the complex. Specific complex membership (GO:0048188) is more informative
than generic binding. COMPASS complex membership captures the functional significance.'
supported_by:
- reference_id: PMID:11742990
supporting_text: 'The Saccharomyces cerevisiae Set1 complex includes an
Ash2 homologue and methylates histone 3 lysine 4.'
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:11805837
review:
summary: 'SET1 interactions documented via mass spectrometry-based proteome
survey.'
action: KEEP_AS_NON_CORE
reason: 'Generic binding annotation. Specific COMPASS complex membership is
more informative.'
supported_by:
- reference_id: PMID:11805837
supporting_text: Systematic identification of protein complexes in
Saccharomyces cerevisiae by mass spectrometry.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:16429126
review:
summary: 'SET1 protein interactions identified in proteome survey.'
action: KEEP_AS_NON_CORE
reason: 'Generic binding annotation. Specific complex membership captures functional
relationships.'
supported_by:
- reference_id: PMID:16429126
supporting_text: Proteome survey reveals modularity of the yeast cell
machinery.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:16554755
review:
summary: 'SET1 protein interactions from global landscape of yeast protein complexes
study.'
action: KEEP_AS_NON_CORE
reason: 'Generic binding annotation. Duplicate interactions captured more informatively
by COMPASS complex membership.'
supported_by:
- reference_id: PMID:16554755
supporting_text: Global landscape of protein complexes in the yeast
Saccharomyces cerevisiae.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:18083099
review:
summary: 'SET1 interactions in context of histone crosstalk between H2B ubiquitination
and H3 methylation via COMPASS.'
action: KEEP_AS_NON_CORE
reason: 'Generic binding annotation despite interesting functional context.
COMPASS complex membership is more specific and functional.'
supported_by:
- reference_id: PMID:18083099
supporting_text: Histone crosstalk between H2B monoubiquitination and
H3 methylation mediated by COMPASS.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:19713935
review:
summary: 'SET1 protein interactions from cotranslational assembly studies of
SET1C complex.'
action: KEEP_AS_NON_CORE
reason: 'Generic binding annotation. Complex assembly dynamics captured by COMPASS
membership annotation.'
supported_by:
- reference_id: PMID:19713935
supporting_text: Cotranslational assembly of the yeast SET1C histone
methyltransferase complex.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:21179020
review:
summary: 'SET1 interactions from budding yeast chromatin-associated interactome
study.'
action: KEEP_AS_NON_CORE
reason: 'Generic binding annotation. COMPASS complex membership more informative.'
supported_by:
- reference_id: PMID:21179020
supporting_text: Defining the budding yeast chromatin-associated
interactome.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:37968396
review:
summary: 'SET1 interactions from recent social and structural architecture of
yeast protein interactome study.'
action: KEEP_AS_NON_CORE
reason: 'Generic binding annotation. Specific complex membership is more informative
for understanding SET1 function.'
supported_by:
- reference_id: PMID:37968396
supporting_text: Nov 15. The social and structural architecture of the
yeast protein interactome.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:9988274
review:
summary: 'SET1 interaction with Mec3 checkpoint protein documented. SET1 involvement
in DNA repair and checkpoint functions.'
action: KEEP_AS_NON_CORE
reason: 'While this Mec3 interaction is functionally distinct (DNA repair vs.
COMPASS), generic "protein binding" is still not informative. The functional
relationship (DNA repair, checkpoint) would be better captured by process
annotations.'
supported_by:
- reference_id: PMID:9988274
supporting_text: Interaction between Set1p and checkpoint protein
Mec3p in DNA repair and telomere functions.
- term:
id: GO:0005634
label: nucleus
evidence_type: NAS
original_reference_id: PMID:11742990
review:
summary: 'SET1 localizes to the nucleus as established by ComplexPortal NAS
(Non-Annotated Sequence) curation.'
action: KEEP_AS_NON_CORE
reason: 'Correct but not informative about function. Localization without functional
consequence. SET1 functions in nucleus through H3K4 methylation which is already
annotated.'
supported_by:
- reference_id: PMID:11742990
supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
Ash2 homologue and methylates histone 3 lysine 4.
- term:
id: GO:0031509
label: subtelomeric heterochromatin formation
evidence_type: NAS
original_reference_id: PMID:11742990
review:
summary: 'SET1 has documented roles in subtelomeric heterochromatin formation
and silencing of genes near chromosome ends.'
action: ACCEPT
reason: 'SET1 contributes to subtelomeric heterochromatin formation through
H3K4me3-dependent mechanisms. Deep research confirms role in telomeric gene
silencing [PMID:11805083]. NAS evidence from ComplexPortal is appropriate.
This is documented functional role.'
supported_by:
- reference_id: PMID:11805083
supporting_text: 'COMPASS, a histone H3 (Lysine 4) methyltransferase required
for telomeric silencing of gene expression.'
- reference_id: PMID:11742990
supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
Ash2 homologue and methylates histone 3 lysine 4.
- term:
id: GO:0003723
label: RNA binding
evidence_type: IDA
original_reference_id: PMID:29071121
review:
summary: 'SET1 contains an RNA recognition motif (RRM) at its N-terminus and
binds RNA. IDA evidence from direct biochemical studies of SET1 RNA binding.'
action: KEEP_AS_NON_CORE
reason: 'SET1 does bind RNA through its RRM domain (demonstrated PMID:29071121),
but the functional role of this interaction is unclear. Deep research indicates
RRM functions primarily in protein stability rather than direct catalytic
relevance to H3K4 methylation. RNA binding is less central than methyltransferase
activity.'
supported_by:
- reference_id: PMID:29071121
supporting_text: 'Binding to RNA regulates Set1 function.'
- term:
id: GO:0048188
label: Set1C/COMPASS complex
evidence_type: IDA
original_reference_id: PMID:29071121
review:
summary: 'SET1 is demonstrated to be component of Set1C/COMPASS complex through
direct biochemical analysis and RNA binding studies.'
action: ACCEPT
reason: 'Core structural annotation. IDA evidence from direct protein interaction
studies appropriate. Duplicate annotation with different evidence from IBA
(phylogenetic) and IEA (InterPro). This IDA evidence from direct studies is
strong.'
supported_by:
- reference_id: PMID:29071121
supporting_text: Binding to RNA regulates Set1 function.
- term:
id: GO:0006357
label: regulation of transcription by RNA polymerase II
evidence_type: IMP
original_reference_id: PMID:16959218
review:
summary: 'SET1 regulates transcription by RNA Pol II through H3K4 methylation.
IMP evidence from set1 mutant studies showing defects in transcription regulation.'
action: ACCEPT
reason: 'SET1 is essential for normal transcription regulation through H3K4
methylation. Deep research confirms H3K4me3 marks active promoters and affects
transcription initiation and elongation [PMID:27325136]. IMP evidence appropriate.'
additional_reference_ids:
- PMID:27325136
supported_by:
- reference_id: PMID:16959218
supporting_text: Different roles of histone H3 lysine 4 methylation in
chromatin maintenance.
- reference_id: PMID:27325136
supporting_text: Counteracting H3K4 methylation modulators Set1 and
Jhd2 co-regulate chromatin dynamics and gene transcription.
- term:
id: GO:0006357
label: regulation of transcription by RNA polymerase II
evidence_type: IGI
original_reference_id: PMID:16959218
review:
summary: 'SET1 genetic interaction with other transcription regulation genes.
IGI evidence from genetic interaction studies.'
action: ACCEPT
reason: 'Genetic interactions with transcription factors and regulators confirm
SET1 role in transcription regulation. IMP and IGI both support this annotation
from same reference.'
supported_by:
- reference_id: PMID:16959218
supporting_text: Different roles of histone H3 lysine 4 methylation in
chromatin maintenance.
- term:
id: GO:0033554
label: cellular response to stress
evidence_type: IMP
original_reference_id: PMID:16959218
review:
summary: 'SET1 contributes to cellular stress response through H3K4 methylation-dependent
gene expression changes. IMP evidence from mutant analysis.'
action: ACCEPT
reason: 'SET1 regulates genes involved in stress response. Deep research shows
SET1 effects on stress-responsive gene expression. IMP evidence from set1
deletion/mutation studies.'
supported_by:
- reference_id: PMID:16959218
supporting_text: Different roles of histone H3 lysine 4 methylation in
chromatin maintenance.
- term:
id: GO:0033554
label: cellular response to stress
evidence_type: IGI
original_reference_id: PMID:16959218
review:
summary: 'Genetic interaction showing SET1 involvement in stress response pathways.'
action: ACCEPT
reason: 'Genetic interaction data support stress response role. Duplicate from
same reference with complementary evidence type.'
supported_by:
- reference_id: PMID:16959218
supporting_text: Different roles of histone H3 lysine 4 methylation in
chromatin maintenance.
- term:
id: GO:0008270
label: zinc ion binding
evidence_type: RCA
original_reference_id: PMID:30358795
review:
summary: 'SET1 contains zinc-binding motifs characteristic of SET domain proteins.
RCA (Rule-based Computational Annotation) inferred from protein structure.'
action: KEEP_AS_NON_CORE
reason: 'SET1 SET domain contains zinc-coordinating residues (part of catalytic
pocket), but zinc binding is not the primary function - it is structural support
for methyltransferase activity. RCA from structural features is appropriate,
but zinc binding is a cofactor interaction rather than independent function.
Less informative than methyltransferase activity annotation.'
supported_by:
- reference_id: PMID:30358795
supporting_text: The cellular economy of the Saccharomyces cerevisiae
zinc proteome.
- term:
id: GO:0031509
label: subtelomeric heterochromatin formation
evidence_type: IMP
original_reference_id: PMID:11752412
review:
summary: 'SET1 is required for proper methylation of histone H3 and formation
of subtelomeric heterochromatin. Multiple IMP annotations confirm this functional
role.'
action: ACCEPT
reason: 'SET1 is essential for subtelomeric heterochromatin formation and silencing
through H3K4 methylation-dependent mechanisms. IMP evidence from set1 mutant
studies. Core functional role well-documented [PMID:11752412, 11805083, 9398665].'
supported_by:
- reference_id: PMID:11752412
supporting_text: 'A trithorax-group complex purified from Saccharomyces
cerevisiae is required for methylation of histone H3.'
- term:
id: GO:0031509
label: subtelomeric heterochromatin formation
evidence_type: IMP
original_reference_id: PMID:11805083
review:
summary: 'SET1 methylation activity required for telomeric silencing and subtelomeric
heterochromatin.'
action: ACCEPT
reason: 'Duplicate annotation with different reference but same functional role.
IMP evidence appropriate.'
supported_by:
- reference_id: PMID:11805083
supporting_text: 'COMPASS, a histone H3 (Lysine 4) methyltransferase required
for telomeric silencing of gene expression.'
- term:
id: GO:0031509
label: subtelomeric heterochromatin formation
evidence_type: IMP
original_reference_id: PMID:9398665
review:
summary: 'SET1, a yeast member of the trithorax family, functions in transcriptional
silencing and subtelomeric heterochromatin.'
action: ACCEPT
reason: 'Foundational characterization of SET1 in heterochromatin. IMP evidence
from early functional studies.'
supported_by:
- reference_id: PMID:9398665
supporting_text: SET1, a yeast member of the trithorax family,
functions in transcriptional silencing and diverse cellular
processes.
- term:
id: GO:0031509
label: subtelomeric heterochromatin formation
evidence_type: IMP
original_reference_id: PMID:9988274
review:
summary: 'SET1 involvement in heterochromatin formation and telomere functions.'
action: ACCEPT
reason: 'Additional IMP evidence. Multiple independent studies converge on SET1
role in subtelomeric heterochromatin formation.'
supported_by:
- reference_id: PMID:9988274
supporting_text: Interaction between Set1p and checkpoint protein
Mec3p in DNA repair and telomere functions.
- term:
id: GO:0003723
label: RNA binding
evidence_type: IDA
original_reference_id: PMID:28483910
review:
summary: 'Direct experimental evidence for SET1 RNA binding capacity. IDA from
PMID:28483910 RNA Binding by Histone Methyltransferases Set1 and Set2.'
action: KEEP_AS_NON_CORE
reason: 'While RNA binding is demonstrated (IDA evidence appropriate), the functional
role in H3K4 methylation is unclear. RRM domain appears to function in protein
stability/regulation rather than direct catalytic function. Less central than
methyltransferase activity.'
supported_by:
- reference_id: PMID:28483910
supporting_text: 'RNA Binding by Histone Methyltransferases Set1 and Set2.'
- term:
id: GO:0042138
label: meiotic DNA double-strand break formation
evidence_type: IMP
original_reference_id: PMID:32290544
review:
summary: 'SET1 contributes to meiotic DNA double-strand break (DSB) formation.
IMP evidence from genetic analysis of set1 in meiosis.'
action: ACCEPT
reason: 'SET1 has specialized functions in meiotic recombination beyond transcription.
Deep research shows SET1 targeting meiotic DSB sites and requirement for proper
DSB formation [PMID:32290544]. Both catalytic and non-catalytic SET1 functions
contribute to meiosis.'
supported_by:
- reference_id: PMID:32290544
supporting_text: 'Genetic Interactions of Histone Modification Machinery
Set1 and PAF1C with the Recombination Complex Rec114-Mer2-Mei4 in the
Formation of Meiotic DNA Double-Strand Breaks.'
- term:
id: GO:0000723
label: telomere maintenance
evidence_type: IMP
original_reference_id: PMID:27911222
review:
summary: 'SET1 is involved in telomere maintenance through H3K4 methylation-dependent
and independent mechanisms.'
action: ACCEPT
reason: 'SET1 contributes to telomere length maintenance. Deep research documents
SET1 roles in both H3K4me3-dependent telomeric silencing and H3K4me3-independent
telomere length regulation [PMID:27911222]. IMP evidence from set1 mutant
studies.'
supported_by:
- reference_id: PMID:27911222
supporting_text: 'The histone methyltransferases Set5 and Set1 have overlapping
functions in gene silencing and telomere maintenance.'
- term:
id: GO:0000781
label: chromosome, telomeric region
evidence_type: IGI
original_reference_id: PMID:9214638
review:
summary: 'SET1 genetic interaction suggesting localization/function at telomeric
regions. IGI evidence from genetic interaction studies.'
action: KEEP_AS_NON_CORE
reason: 'Localization annotation without clear functional context. SET1 association
with telomeric regions is implied by telomere maintenance and subtelomeric
heterochromatin annotations. More specific process annotations are more informative.'
supported_by:
- reference_id: PMID:9214638
supporting_text: Mammalian homologues of the Polycomb-group gene
Enhancer of zeste mediate gene silencing in Drosophila
heterochromatin and at S.
- term:
id: GO:0031509
label: subtelomeric heterochromatin formation
evidence_type: IGI
original_reference_id: PMID:9214638
review:
summary: 'Genetic interaction indicating SET1 involvement in subtelomeric heterochromatin
formation.'
action: ACCEPT
reason: 'Genetic interaction evidence supports subtelomeric heterochromatin
function. IGI complements IMP evidence from other references.'
supported_by:
- reference_id: PMID:9214638
supporting_text: Mammalian homologues of the Polycomb-group gene
Enhancer of zeste mediate gene silencing in Drosophila
heterochromatin and at S.
- term:
id: GO:0010629
label: negative regulation of gene expression
evidence_type: IMP
original_reference_id: PMID:29066473
review:
summary: 'SET1 involvement in repression of middle sporulation genes through
H3K4 methylation maintenance.'
action: ACCEPT
reason: 'SET1 has documented roles in repressing genes that should not be expressed.
H3K4 methylation can mark both active and repressed genes depending on context.
Deep research confirms SET1 contributes to negative regulation of specific
gene sets [PMID:29066473].'
supported_by:
- reference_id: PMID:29066473
supporting_text: 'Repression of Middle Sporulation Genes in Saccharomyces
cerevisiae by the Sum1-Rfm1-Hst1 Complex Is Maintained by Set1 and H3K4
Methylation.'
- term:
id: GO:0006479
label: protein methylation
evidence_type: IMP
original_reference_id: PMID:23469257
review:
summary: 'SET1 catalyzes protein (histone) methylation as core enzymatic function.
IMP evidence from catalytic studies.'
action: KEEP_AS_NON_CORE
reason: 'Correct but overly broad. GO:0042800 (H3K4 methyltransferase activity)
is more specific. Parent term less informative when specific substrate-level
annotation available.'
supported_by:
- reference_id: PMID:23469257
supporting_text: Catalytic and functional roles of conserved amino
acids in the SET domain of the S.
- term:
id: GO:0000183
label: rDNA heterochromatin formation
evidence_type: IMP
original_reference_id: PMID:23469257
review:
summary: 'SET1 contributes to rDNA heterochromatin formation through H3K4 methylation-dependent
mechanisms.'
action: ACCEPT
reason: 'SET1 has specialized roles in rDNA locus silencing beyond general H3K4
methylation. IMP evidence from studies of SET domain catalytic function in
rDNA silencing. Deep research confirms rDNA heterochromatin is SET1-dependent
[PMID:23469257].'
supported_by:
- reference_id: PMID:23469257
supporting_text: 'Catalytic and functional roles of conserved amino acids
in the SET domain of the S. cerevisiae lysine methyltransferase Set1.'
- term:
id: GO:0000122
label: negative regulation of transcription by RNA polymerase II
evidence_type: IMP
original_reference_id: PMID:27325136
review:
summary: 'SET1-mediated H3K4 methylation can function in negative regulation
of transcription at specific loci. IMP evidence from mutant analysis.'
action: ACCEPT
reason: 'SET1 has dual roles in both positive and negative regulation depending
on genomic context and partnering proteins. Deep research confirms Jhd2 demethylase
opposition to Set1 in dynamic control [PMID:27325136]. Multiple regulatory
roles appropriate.'
supported_by:
- reference_id: PMID:27325136
supporting_text: 'Counteracting H3K4 methylation modulators Set1 and Jhd2
co-regulate chromatin dynamics and gene transcription.'
- term:
id: GO:0045944
label: positive regulation of transcription by RNA polymerase II
evidence_type: IMP
original_reference_id: PMID:27325136
review:
summary: 'SET1-mediated H3K4 methylation promotes positive regulation of Pol
II transcription.'
action: ACCEPT
reason: 'Primary transcriptional role of SET1. Both GO:0000122 (negative) and
GO:0045944 (positive) from same reference indicate context-dependent function.
IMP evidence appropriate for both.'
supported_by:
- reference_id: PMID:27325136
supporting_text: Counteracting H3K4 methylation modulators Set1 and
Jhd2 co-regulate chromatin dynamics and gene transcription.
- term:
id: GO:1902275
label: regulation of chromatin organization
evidence_type: IMP
original_reference_id: PMID:27325136
review:
summary: 'SET1-mediated H3K4 methylation regulates chromatin organization state
through coordinated interactions with chromatin modifiers.'
action: ACCEPT
reason: 'SET1 is central to chromatin organization regulation through H3K4 methylation
effects on nucleosome positioning, chromatin accessibility, and establishment
of distinct chromatin domains. IMP evidence appropriate.'
supported_by:
- reference_id: PMID:27325136
supporting_text: Counteracting H3K4 methylation modulators Set1 and
Jhd2 co-regulate chromatin dynamics and gene transcription.
- term:
id: GO:1905088
label: positive regulation of synaptonemal complex assembly
evidence_type: IMP
original_reference_id: PMID:24797370
review:
summary: 'SET1 contributes to synaptonemal complex assembly during meiosis.
IMP evidence from meiotic genetics.'
action: ACCEPT
reason: 'SET1 specialized meiotic function. Synaptonemal complex assembly is
essential for proper meiotic recombination. SET1 H3K4 methylation promotes
this process [PMID:24797370].'
supported_by:
- reference_id: PMID:24797370
supporting_text: 'Dot1-dependent histone H3K79 methylation promotes the
formation of meiotic double-strand breaks in the absence of histone H3K4
methylation in budding yeast.'
- term:
id: GO:0007130
label: synaptonemal complex assembly
evidence_type: IMP
original_reference_id: PMID:24797370
review:
summary: 'SET1 is required for synaptonemal complex assembly during meiosis.'
action: ACCEPT
reason: 'SET1 essential for meiotic recombination through synaptonemal complex
assembly. IMP evidence from meiotic mutant studies. Both positive regulation
(GO:1905088) and assembly (GO:0007130) appropriately annotated.'
supported_by:
- reference_id: PMID:24797370
supporting_text: eCollection 2014. Dot1-dependent histone H3K79
methylation promotes the formation of meiotic double-strand breaks
in the absence of histone H3K4 methylation in budding yeast.
- term:
id: GO:1903341
label: regulation of meiotic DNA double-strand break formation
evidence_type: IMP
original_reference_id: PMID:23246437
review:
summary: 'SET1 contributes to regulation of meiotic DSB formation through Spp1-mediated
recruitment to meiotic recombination hotspots.'
action: ACCEPT
reason: 'SET1 has documented roles in regulating meiotic DSB formation through
Spp1 complex component. Deep research shows SET1 recruits H3K4me3 to chromosome
axis sites, promoting DSB formation [PMID:23246437]. Specialized meiotic function
distinct from transcriptional role.'
supported_by:
- reference_id: PMID:23246437
supporting_text: 'Spp1, a member of the Set1 Complex, promotes meiotic DSB
formation in promoters by tethering histone H3K4 methylation sites to
chromosome axes.'
- term:
id: GO:0042800
label: histone H3K4 methyltransferase activity
evidence_type: IMP
original_reference_id: PMID:11751634
review:
summary: 'SET1 H3K4 methyltransferase activity demonstrated by functional analysis.
IMP evidence from studies of SET1 catalytic function.'
action: ACCEPT
reason: 'Core molecular function. IMP evidence from direct biochemical characterization
of SET1 catalytic activity [PMID:11751634]. Multiple IMP annotations with
different references converge on primary function.'
supported_by:
- reference_id: PMID:11751634
supporting_text: 'Histone H3 lysine 4 methylation is mediated by Set1 and
required for cell growth and rDNA silencing in Saccharomyces cerevisiae.'
- term:
id: GO:0042800
label: histone H3K4 methyltransferase activity
evidence_type: IMP
original_reference_id: PMID:22158900
review:
summary: 'SET1 H3K4 methyltransferase activity confirmed through structural
analysis studies.'
action: ACCEPT
reason: 'Duplicate with different structural biology reference. Multiple independent
studies support H3K4 methyltransferase function. IMP evidence appropriate.'
supported_by:
- reference_id: PMID:22158900
supporting_text: Structural analysis of the core COMPASS family of
histone H3K4 methylases from yeast to human.
- term:
id: GO:0055092
label: sterol homeostasis
evidence_type: IMP
original_reference_id: PMID:23382196
review:
summary: 'SET1 is involved in maintenance of ergosterol homeostasis and resistance
to Brefeldin A. IMP evidence from genetic and functional studies.'
action: MARK_AS_OVER_ANNOTATED
reason: 'This appears to be a pleiotropic effect rather than core function.
SET1 H3K4 methylation regulates genes involved in sterol metabolism, but sterol
homeostasis is not a primary functional role. The annotation is not incorrect,
but represents a downstream consequence of transcriptional regulation rather
than direct SET1 function. Keep annotation but mark as non-central to SET1
biology.'
supported_by:
- reference_id: PMID:23382196
supporting_text: H3K4 methyltransferase Set1 is involved in
maintenance of ergosterol homeostasis and resistance to Brefeldin A.
- term:
id: GO:0000723
label: telomere maintenance
evidence_type: IMP
original_reference_id: PMID:11742990
review:
summary: 'SET1 is required for telomere maintenance. IMP evidence from foundational
studies.'
action: ACCEPT
reason: 'SET1 contributes to telomere length maintenance and integrity. Supported
by multiple references [PMID:11742990, 9398665, 27911222]. IMP evidence from
set1 mutant studies showing shortened telomeres.'
supported_by:
- reference_id: PMID:11742990
supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
Ash2 homologue and methylates histone 3 lysine 4.
- term:
id: GO:0000723
label: telomere maintenance
evidence_type: IMP
original_reference_id: PMID:9398665
review:
summary: 'SET1 involvement in telomere maintenance from early functional characterization.'
action: ACCEPT
reason: 'Duplicate from foundational SET1 paper. Multiple references converge
on telomere maintenance role.'
supported_by:
- reference_id: PMID:9398665
supporting_text: SET1, a yeast member of the trithorax family,
functions in transcriptional silencing and diverse cellular
processes.
- term:
id: GO:0003723
label: RNA binding
evidence_type: IDA
original_reference_id: PMID:16787775
review:
summary: 'Direct experimental evidence for SET1 RNA binding from structural
characterization of RNA recognition motifs.'
action: KEEP_AS_NON_CORE
reason: 'RNA binding demonstrated through structural studies and RRM characterization.
IDA evidence appropriate. However, functional role unclear - appears to affect
protein stability rather than catalytic activity. Less central than methyltransferase
function.'
supported_by:
- reference_id: PMID:16787775
supporting_text: 2006 May 9. Structural characterization of Set1 RNA
recognition motifs and their role in histone H3 lysine 4
methylation.
- term:
id: GO:0003723
label: RNA binding
evidence_type: IMP
original_reference_id: PMID:16787775
review:
summary: 'IMP evidence for RNA binding role in SET1 function and regulation.'
action: KEEP_AS_NON_CORE
reason: 'Both IDA and IMP from same reference. Functional role of RNA binding
remains unclear despite experimental evidence.'
supported_by:
- reference_id: PMID:16787775
supporting_text: 2006 May 9. Structural characterization of Set1 RNA
recognition motifs and their role in histone H3 lysine 4
methylation.
- term:
id: GO:0016279
label: protein-lysine N-methyltransferase activity
evidence_type: IMP
original_reference_id: PMID:16143104
review:
summary: 'SET1 catalyzes protein-lysine methylation as a core enzymatic activity.
IMP evidence from functional studies.'
action: ACCEPT
reason: 'SET1 is a protein-lysine N-methyltransferase (histone lysine methyltransferase).
While less specific than GO:0042800 (H3K4 methyltransferase), this is appropriate
parent term. Genetic interactions demonstrate methyltransferase activity role.'
supported_by:
- reference_id: PMID:16143104
supporting_text: The Set1 methyltransferase opposes Ipl1 aurora kinase
functions in chromosome segregation.
- term:
id: GO:0016279
label: protein-lysine N-methyltransferase activity
evidence_type: IGI
original_reference_id: PMID:16143104
review:
summary: 'Genetic interaction evidence for SET1 protein-lysine methyltransferase
activity.'
action: ACCEPT
reason: 'IGI complements IMP evidence. Genetic interactions support methyltransferase
function.'
supported_by:
- reference_id: PMID:16143104
supporting_text: The Set1 methyltransferase opposes Ipl1 aurora kinase
functions in chromosome segregation.
- term:
id: GO:0030437
label: ascospore formation
evidence_type: IMP
original_reference_id: PMID:17150765
review:
summary: 'SET1 is required for proper ascospore formation during sporulation.
IMP evidence from set1 mutant analysis.'
action: ACCEPT
reason: 'SET1 has documented role in sporulation and ascospore formation. IMP
evidence from set1 deletion strains showing defective sporulation [PMID:17150765].'
supported_by:
- reference_id: PMID:17150765
supporting_text: 'Effect of histone methyltransferase gene mutations on
sporulation in S. cerevisiae.'
- term:
id: GO:0030466
label: silent mating-type cassette heterochromatin formation
evidence_type: IMP
original_reference_id: PMID:9398665
review:
summary: 'SET1 is required for maintenance of silent mating-type cassette heterochromatin.
IMP evidence from early SET1 characterization.'
action: ACCEPT
reason: 'SET1 contributes to silencing at the silent mating loci (HML and HMR).
H3K4 methylation marks these silent heterochromatic regions [PMID:9398665].
IMP evidence from set1 mutant derepression phenotype.'
supported_by:
- reference_id: PMID:9398665
supporting_text: 'SET1, a yeast member of the trithorax family, functions
in transcriptional silencing and diverse cellular processes.'
- term:
id: GO:0042800
label: histone H3K4 methyltransferase activity
evidence_type: IDA
original_reference_id: PMID:11742990
review:
summary: 'SET1 H3K4 methyltransferase activity demonstrated through direct biochemical
and cellular experiments.'
action: ACCEPT
reason: 'Core function with multiple evidence types. IDA from foundational COMPASS
discovery paper [PMID:11742990]. Extensive biochemical characterization of
SET1 catalytic activity.'
supported_by:
- reference_id: PMID:11742990
supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
Ash2 homologue and methylates histone 3 lysine 4.
- term:
id: GO:0042800
label: histone H3K4 methyltransferase activity
evidence_type: IMP
original_reference_id: PMID:11742990
review:
summary: 'SET1 H3K4 methyltransferase activity confirmed through functional
genetic analysis.'
action: ACCEPT
reason: 'IMP evidence from set1 deletion showing loss of H3K4 methylation. Multiple
evidence types from same paper converge on catalytic function.'
supported_by:
- reference_id: PMID:11742990
supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
Ash2 homologue and methylates histone 3 lysine 4.
- term:
id: GO:0042800
label: histone H3K4 methyltransferase activity
evidence_type: IMP
original_reference_id: PMID:11752412
review:
summary: 'SET1-catalyzed H3K4 methylation demonstrated through trithorax-group
complex characterization.'
action: ACCEPT
reason: 'IMP evidence from trithorax complex purification and functional analysis
[PMID:11752412].'
supported_by:
- reference_id: PMID:11752412
supporting_text: Epub 2001 Dec 18. A trithorax-group complex purified
from Saccharomyces cerevisiae is required for methylation of histone
H3.
- term:
id: GO:0042800
label: histone H3K4 methyltransferase activity
evidence_type: IDA
original_reference_id: PMID:11805083
review:
summary: 'Direct evidence for SET1 histone H3K4 methyltransferase activity from
COMPASS characterization.'
action: ACCEPT
reason: 'IDA evidence from biochemical analysis of COMPASS complex activity
[PMID:11805083].'
supported_by:
- reference_id: PMID:11805083
supporting_text: 2002 Jan 22. COMPASS, a histone H3 (Lysine 4)
methyltransferase required for telomeric silencing of gene
expression.
- term:
id: GO:0042800
label: histone H3K4 methyltransferase activity
evidence_type: IMP
original_reference_id: PMID:11805083
review:
summary: 'IMP evidence for H3K4 methyltransferase function from mutational analysis.'
action: ACCEPT
reason: 'Multiple evidence types (IDA and IMP) from same reference support catalytic
function.'
supported_by:
- reference_id: PMID:11805083
supporting_text: 2002 Jan 22. COMPASS, a histone H3 (Lysine 4)
methyltransferase required for telomeric silencing of gene
expression.
- term:
id: GO:0042800
label: histone H3K4 methyltransferase activity
evidence_type: IMP
original_reference_id: PMID:12845608
review:
summary: 'SET1 is a methyltransferase specific for lysine 4 of histone H3 and
required for efficient gene expression.'
action: ACCEPT
reason: 'IMP evidence from studies of SET1 catalytic requirement for gene expression
[PMID:12845608]. Converging evidence from multiple studies.'
supported_by:
- reference_id: PMID:12845608
supporting_text: Saccharomyces cerevisiae Set1p is a methyltransferase
specific for lysine 4 of histone H3 and is required for efficient
gene expression.
- term:
id: GO:0048188
label: Set1C/COMPASS complex
evidence_type: IPI
original_reference_id: PMID:11687631
review:
summary: 'SET1 protein-protein interactions with COMPASS subunits documented
through direct interaction studies.'
action: ACCEPT
reason: 'Core structural annotation. IPI evidence from COMPASS co-immunoprecipitation
studies [PMID:11687631]. Specific complex membership is more informative than
generic protein binding. Multiple IPI annotations with different partners
all converge on COMPASS complex membership.'
supported_by:
- reference_id: PMID:11687631
supporting_text: 'COMPASS: a complex of proteins associated with a trithorax-related
SET domain protein.'
- term:
id: GO:0048188
label: Set1C/COMPASS complex
evidence_type: IPI
original_reference_id: PMID:11742990
review:
summary: 'SET1 interactions with COMPASS complex components demonstrated through
multiple approaches.'
action: ACCEPT
reason: 'Multiple independent IPI studies confirm COMPASS complex membership.
Duplicate annotations appropriate for different reference sources.'
supported_by:
- reference_id: PMID:11742990
supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
Ash2 homologue and methylates histone 3 lysine 4.
- term:
id: GO:0048188
label: Set1C/COMPASS complex
evidence_type: IPI
original_reference_id: PMID:11752412
review:
summary: 'IPI evidence for SET1 as component of trithorax-like COMPASS complex.'
action: ACCEPT
reason: 'Convergent IPI evidence from trithorax complex co-purification studies.'
supported_by:
- reference_id: PMID:11752412
supporting_text: Epub 2001 Dec 18. A trithorax-group complex purified
from Saccharomyces cerevisiae is required for methylation of histone
H3.
core_functions:
- description: Catalytic component of the COMPASS histone methyltransferase
complex responsible for mono-, di-, and trimethylation of histone H3
lysine 4 (H3K4), a fundamental mark of active transcription and gene
regulatory regions
molecular_function:
id: GO:0042800
label: histone H3K4 methyltransferase activity
directly_involved_in:
- id: GO:0006357
label: regulation of transcription by RNA polymerase II
locations:
- id: GO:0048188
label: Set1C/COMPASS complex
supported_by:
- reference_id: PMID:11742990
supporting_text: The Saccharomyces cerevisiae Set1 complex includes an
Ash2 homologue and methylates histone 3 lysine 4.
references:
- id: GO_REF:0000002
title: Gene Ontology annotation through association of InterPro records with
GO terms
findings: []
- id: GO_REF:0000033
title: Annotation inferences using phylogenetic trees
findings: []
- id: GO_REF:0000043
title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword
mapping
findings: []
- id: GO_REF:0000044
title: Gene Ontology annotation based on UniProtKB/Swiss-Prot Subcellular
Location vocabulary mapping, accompanied by conservative changes to GO
terms applied by UniProt
findings: []
- id: GO_REF:0000117
title: Electronic Gene Ontology annotations created by ARBA machine learning
models
findings: []
- id: GO_REF:0000120
title: Combined Automated Annotation using Multiple IEA Methods
findings: []
- id: PMID:11687631
title: 'COMPASS: a complex of proteins associated with a trithorax-related SET
domain protein.'
findings: []
- id: PMID:11742990
title: The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue
and methylates histone 3 lysine 4.
findings: []
- id: PMID:11751634
title: Histone H3 lysine 4 methylation is mediated by Set1 and required for
cell growth and rDNA silencing in Saccharomyces cerevisiae.
findings: []
- id: PMID:11752412
title: A trithorax-group complex purified from Saccharomyces cerevisiae is
required for methylation of histone H3.
findings: []
- id: PMID:11805083
title: COMPASS, a histone H3 (Lysine 4) methyltransferase required for
telomeric silencing of gene expression.
findings: []
- id: PMID:11805837
title: Systematic identification of protein complexes in Saccharomyces
cerevisiae by mass spectrometry.
findings: []
- id: PMID:12845608
title: Saccharomyces cerevisiae Set1p is a methyltransferase specific for
lysine 4 of histone H3 and is required for efficient gene expression.
findings: []
- id: PMID:16143104
title: The Set1 methyltransferase opposes Ipl1 aurora kinase functions in
chromosome segregation.
findings: []
- id: PMID:16429126
title: Proteome survey reveals modularity of the yeast cell machinery.
findings: []
- id: PMID:16554755
title: Global landscape of protein complexes in the yeast Saccharomyces
cerevisiae.
findings: []
- id: PMID:16787775
title: Structural characterization of Set1 RNA recognition motifs and their
role in histone H3 lysine 4 methylation.
findings: []
- id: PMID:16959218
title: Different roles of histone H3 lysine 4 methylation in chromatin
maintenance.
findings: []
- id: PMID:17150765
title: Effect of histone methyltransferase gene mutations on sporulation in
S. cerevisiae.
findings: []
- id: PMID:18083099
title: Histone crosstalk between H2B monoubiquitination and H3 methylation
mediated by COMPASS.
findings: []
- id: PMID:19713935
title: Cotranslational assembly of the yeast SET1C histone methyltransferase
complex.
findings: []
- id: PMID:21179020
title: Defining the budding yeast chromatin-associated interactome.
findings: []
- id: PMID:22158900
title: Structural analysis of the core COMPASS family of histone H3K4
methylases from yeast to human.
findings: []
- id: PMID:23246437
title: Spp1, a member of the Set1 Complex, promotes meiotic DSB formation in
promoters by tethering histone H3K4 methylation sites to chromosome axes.
findings: []
- id: PMID:23382196
title: H3K4 methyltransferase Set1 is involved in maintenance of ergosterol
homeostasis and resistance to Brefeldin A.
findings: []
- id: PMID:23469257
title: Catalytic and functional roles of conserved amino acids in the SET
domain of the S. cerevisiae lysine methyltransferase Set1.
findings: []
- id: PMID:24797370
title: Dot1-dependent histone H3K79 methylation promotes the formation of
meiotic double-strand breaks in the absence of histone H3K4 methylation in
budding yeast.
findings: []
- id: PMID:27325136
title: Counteracting H3K4 methylation modulators Set1 and Jhd2 co-regulate
chromatin dynamics and gene transcription.
findings: []
- id: PMID:27911222
title: The histone methyltransferases Set5 and Set1 have overlapping
functions in gene silencing and telomere maintenance.
findings: []
- id: PMID:28483910
title: RNA Binding by Histone Methyltransferases Set1 and Set2.
findings: []
- id: PMID:29066473
title: Repression of Middle Sporulation Genes in Saccharomyces cerevisiae by
the Sum1-Rfm1-Hst1 Complex Is Maintained by Set1 and H3K4 Methylation.
findings: []
- id: PMID:29071121
title: Binding to RNA regulates Set1 function.
findings: []
- id: PMID:30358795
title: The cellular economy of the Saccharomyces cerevisiae zinc proteome.
findings: []
- id: PMID:32290544
title: Genetic Interactions of Histone Modification Machinery Set1 and PAF1C
with the Recombination Complex Rec114-Mer2-Mei4 in the Formation of
Meiotic DNA Double-Strand Breaks.
findings: []
- id: PMID:37968396
title: The social and structural architecture of the yeast protein
interactome.
findings: []
- id: PMID:9214638
title: Mammalian homologues of the Polycomb-group gene Enhancer of zeste
mediate gene silencing in Drosophila heterochromatin and at S. cerevisiae
telomeres.
findings: []
- id: PMID:9398665
title: SET1, a yeast member of the trithorax family, functions in
transcriptional silencing and diverse cellular processes.
findings: []
- id: PMID:9988274
title: Interaction between Set1p and checkpoint protein Mec3p in DNA repair
and telomere functions.
findings: []
- id: file:yeast/SET1/SET1-deep-research-perplexity.md
title: Deep research report on SET1
findings: []