SOX2

UniProt ID: P48431
Organism: Homo sapiens
Review Status: COMPLETE
📝 Provide Detailed Feedback

Gene Description

SOX2 is a SOXB1 family transcription factor containing an HMG-box domain that binds the minor groove of DNA and functions as a pioneer factor. As a core pluripotency factor (one of the Yamanaka factors), SOX2 maintains embryonic stem cell self-renewal and neural stem cell identity. It cooperates with OCT4 on composite Sox/Oct DNA motifs to regulate thousands of enhancers controlling pluripotency and early neural fate specification. SOX2 is essential for maintaining neural progenitor cells in an undifferentiated state and preventing premature neuronal differentiation. Nuclear localization is regulated by NLS/NES sequences and post-translational modifications including methylation, acetylation, SUMOylation, and ubiquitination.

Existing Annotations Review

GO Term Evidence Action Reason
GO:0000122 negative regulation of transcription by RNA polymerase II
IBA
GO_REF:0000033
ACCEPT
Summary: SOX2 functions as both a transcriptional activator and repressor depending on context. SUMOylation of SOX2 enables transcriptional repression of centrosome/centromere and cell-cycle genes in neural stem cells, providing proliferative restraint.
Reason: This annotation accurately represents a documented function of SOX2. The deep research shows SUMOylation-dependent repression of cell-cycle genes in neural stem cells. While SOX2 is primarily an activator, its repressor function is well-established and represents a core regulatory mechanism for controlling neural stem cell proliferation.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
In neural stem cells, SUMOylation of SOX2 is required for transcriptional repression of centrosome/centromere and cell-cycle genes and for proliferative restraint
GO:0000122 negative regulation of transcription by RNA polymerase II
ISS
GO_REF:0000024
ACCEPT
Summary: ISS evidence from mouse orthologs supports the repressor function of SOX2.
Reason: ISS annotation based on ortholog data is consistent with experimental evidence from human SOX2 showing transcriptional repression. This is a valid inference from sequence similarity given the highly conserved nature of SOX2 function across mammals.
GO:0045944 positive regulation of transcription by RNA polymerase II
IBA
GO_REF:0000033
ACCEPT
Summary: SOX2 is fundamentally a transcriptional activator that cooperates with OCT4 to activate thousands of enhancers controlling pluripotency and neural fate genes. This is the primary and core molecular function of SOX2.
Reason: This annotation represents the core function of SOX2 as a transcriptional activator. Deep research shows SOX2 and OCT4 co-occupy thousands of OSN enhancers in pluripotent cells and activate genes essential for pluripotency and neural specification. Multiple IBA, IDA, and IEA annotations converge on this fundamental activity.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG) enhancers in pluripotent cells
file:genes/human/SOX2/SOX2-deep-research-falcon.md
1,898 SOX2-dependent neural-associated enhancers among 8,531 OSN-bound enhancers
GO:0045944 positive regulation of transcription by RNA polymerase II
IEA
GO_REF:0000107
ACCEPT
Summary: Electronic annotation based on ortholog data from Ensembl.
Reason: Consistent with IBA and IDA evidence for transcriptional activation. This electronic annotation accurately reflects SOX2's primary function.
GO:0045944 positive regulation of transcription by RNA polymerase II
IDA
PMID:21245162
Pluripotency factors regulate definitive endoderm specificat...
ACCEPT
Summary: Direct experimental evidence for SOX2 transcriptional activation.
Reason: IDA evidence provides direct experimental support for transcriptional activation function, consistent with the core role of SOX2.
Supporting Evidence:
PMID:21245162
Pluripotency factors regulate definitive endoderm specification through eomesodermin.
GO:0045944 positive regulation of transcription by RNA polymerase II
IDA
PMID:18027866
Aberrant expression of SOX2 upregulates MUC5AC gastric foveo...
ACCEPT
Summary: Independent IDA evidence for transcriptional activation.
Reason: Multiple independent experimental studies confirm transcriptional activation as a core SOX2 function.
Supporting Evidence:
PMID:18027866
Aberrant expression of SOX2 upregulates MUC5AC gastric foveolar mucin in mucinous cancers of the colorectum and related lesions.
GO:0045893 positive regulation of DNA-templated transcription
IDA
PMID:29253717
Regulation of ADAM10 by miR-140-5p and potential relevance f...
ACCEPT
Summary: IDA evidence for positive regulation of transcription.
Reason: This is a parent term of RNA pol II-specific transcriptional activation and accurately represents SOX2's core function. The term is at an appropriate level of specificity for general transcriptional activation function.
Supporting Evidence:
PMID:29253717
Regulation of ADAM10 by miR-140-5p and potential relevance for Alzheimer's disease.
GO:0045893 positive regulation of DNA-templated transcription
IDA
PMID:18407919
Novel mutations in LHX3 are associated with hypopituitarism ...
ACCEPT
Summary: Additional IDA evidence for transcriptional activation.
Reason: Multiple experimental studies support this fundamental function of SOX2 as a transcriptional activator.
Supporting Evidence:
PMID:18407919
Apr 10. Novel mutations in LHX3 are associated with hypopituitarism and sensorineural hearing loss.
GO:0006355 regulation of DNA-templated transcription
IEA
GO_REF:0000002
ACCEPT
Summary: Electronic annotation based on InterPro domain IPR022097 (SOX family).
Reason: This is a valid high-level annotation based on domain architecture. All SOX family transcription factors regulate transcription, and this annotation is appropriately general. While less specific than the positive/negative regulation terms, it is not incorrect.
GO:0006355 regulation of DNA-templated transcription
IDA
PMID:16153702
Core transcriptional regulatory circuitry in human embryonic...
ACCEPT
Summary: IDA evidence for general transcriptional regulation.
Reason: This parent term is supported by experimental evidence and is appropriately general for describing SOX2's overall transcriptional regulatory function.
Supporting Evidence:
PMID:16153702
Core transcriptional regulatory circuitry in human embryonic stem cells.
GO:0006355 regulation of DNA-templated transcription
NAS
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2...
ACCEPT
Summary: Non-traceable author statement from the original SOX2 cloning paper.
Reason: Historical NAS annotation from the original characterization of SOX2 is appropriate for general transcriptional regulation function.
Supporting Evidence:
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2 gene.
GO:0010468 regulation of gene expression
IMP
PMID:18388306
Regulation of self-renewal and pluripotency by Sox2 in human...
ACCEPT
Summary: IMP evidence showing mutant phenotype affecting gene expression regulation.
Reason: This is a valid high-level annotation demonstrating SOX2's role in gene expression control through mutant phenotype analysis. While very general, it is appropriate as a parent term encompassing transcriptional regulation.
Supporting Evidence:
PMID:18388306
Apr 3. Regulation of self-renewal and pluripotency by Sox2 in human embryonic stem cells.
GO:0030182 neuron differentiation
IBA
GO_REF:0000033
MODIFY
Summary: SOX2 plays a complex role in neuron differentiation by maintaining neural progenitors in an undifferentiated state and preventing premature neuronal differentiation.
Reason: This term is ambiguous for SOX2's actual role. SOX2 does NOT promote neuron differentiation; rather, it prevents premature differentiation by maintaining neural progenitor identity. A more accurate term would be GO:0045665 (negative regulation of neuron differentiation), which is already present in the annotation set. The IBA annotation likely captures involvement in the process broadly, but this could be misleading about SOX2's actual function.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
Keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation
UniProtKB:P48431
Keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation
GO:0045665 negative regulation of neuron differentiation
ISS
GO_REF:0000024
ACCEPT
Summary: SOX2 actively suppresses neuronal differentiation to maintain neural progenitor cells in an undifferentiated state. This is a core function in neural development.
Reason: This annotation accurately captures SOX2's essential role in preventing premature neuronal differentiation. UniProt states that SOX2 "keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation." This is central to its function in neural stem cell maintenance and represents a core activity.
Supporting Evidence:
UniProtKB:P48431
Keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation
GO:0097150 neuronal stem cell population maintenance
ISS
GO_REF:0000024
ACCEPT
Summary: SOX2 is essential for maintaining neural stem cell identity and self-renewal. This is one of its core functions in the nervous system.
Reason: This annotation represents a core function of SOX2 in neural biology. Deep research shows SOX2 is critical for neural stem cell maintenance and is described as a factor that "keeps neural cells undifferentiated." The term accurately captures SOX2's essential role in maintaining the neural stem cell pool.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
Downstream SRRT target that mediates the promotion of neural stem cell self-renewal
GO:0007420 brain development
IBA
GO_REF:0000033
KEEP AS NON CORE
Summary: SOX2 plays essential roles in brain development through maintaining neural progenitor populations and regulating neural fate specification.
Reason: While SOX2 is indeed involved in brain development, this term is very broad and captures pleiotropy rather than core molecular function. The more specific neural stem cell maintenance and neural specification functions are better representations of SOX2's primary roles. This annotation is not incorrect but represents a higher-level developmental outcome rather than the core cellular function.
GO:0030900 forebrain development
IBA
GO_REF:0000033
KEEP AS NON CORE
Summary: SOX2 expression and function in forebrain development is well-documented.
Reason: This is a more specific developmental process than general brain development, but still represents a regional/developmental outcome rather than core molecular function. SOX2 functions the same way (maintaining neural progenitors) across brain regions. The forebrain-specific annotation reflects where SOX2 is expressed and important, but the core function is neural progenitor maintenance regardless of brain region.
GO:0030900 forebrain development
IEP
PMID:18285410
SOX2 plays a critical role in the pituitary, forebrain, and ...
KEEP AS NON CORE
Summary: Expression pattern evidence showing SOX2 expression during forebrain development.
Reason: IEP evidence showing expression pattern during forebrain development is valid but does not demonstrate specific function. This is consistent with SOX2's role in maintaining neural progenitors throughout the developing brain including forebrain regions.
Supporting Evidence:
PMID:18285410
Epub 2008 Feb 19. SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
GO:0021781 glial cell fate commitment
NAS
PMID:17291498
Sox-2 is expressed by glial and progenitor cells and Pax-6 i...
KEEP AS NON CORE
Summary: NAS evidence for role in glial cell fate commitment.
Reason: While SOX2 is expressed in glial progenitors and can influence glial fate, this represents one specific outcome of SOX2's more general role in maintaining progenitor identity. The core function is maintaining undifferentiated stem/progenitor states; glial fate commitment is one context-dependent outcome. This is valid but peripheral to core function.
Supporting Evidence:
PMID:17291498
2006 Dec 20. Sox-2 is expressed by glial and progenitor cells and Pax-6 is expressed by neuroblasts in the human subventricular zone.
GO:0035019 somatic stem cell population maintenance
IMP
PMID:19736317
Interplay of Oct4 with Sox2 and Sox17: a molecular switch fr...
ACCEPT
Summary: IMP evidence showing SOX2 is required for maintaining somatic stem cell populations, particularly neural stem cells.
Reason: This annotation represents a core function of SOX2. Multiple independent IMP and IDA studies demonstrate SOX2's essential role in maintaining stem cell populations. This encompasses both embryonic and neural stem cell maintenance and represents a fundamental cellular function of SOX2.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
Critical for early embryogenesis and for embryonic stem cell pluripotency
PMID:19736317
Interplay of Oct4 with Sox2 and Sox17: a molecular switch from stem cell pluripotency to specifying a cardiac fate.
GO:0035019 somatic stem cell population maintenance
IDA
PMID:19409607
MicroRNA-145 regulates OCT4, SOX2, and KLF4 and represses pl...
ACCEPT
Summary: Independent IDA evidence for stem cell maintenance.
Reason: Multiple independent experimental studies support this core function.
Supporting Evidence:
PMID:19409607
Apr 30. MicroRNA-145 regulates OCT4, SOX2, and KLF4 and represses pluripotency in human embryonic stem cells.
GO:0035019 somatic stem cell population maintenance
IMP
PMID:18388306
Regulation of self-renewal and pluripotency by Sox2 in human...
ACCEPT
Summary: Additional IMP evidence from mutant phenotype analysis.
Reason: Mutant phenotype evidence provides strong support for essential role in stem cell maintenance.
Supporting Evidence:
PMID:18388306
Apr 3. Regulation of self-renewal and pluripotency by Sox2 in human embryonic stem cells.
GO:1902807 negative regulation of cell cycle G1/S phase transition
IDA
PMID:18268498
SOX2 is frequently downregulated in gastric cancers and inhi...
ACCEPT
Summary: IDA evidence for SOX2 role in cell cycle regulation.
Reason: This annotation is supported by direct experimental evidence and is consistent with SOX2's role in maintaining progenitor cells in a self-renewing state. Cell cycle regulation is an integral part of SOX2's function in controlling stem cell proliferation versus differentiation decisions.
Supporting Evidence:
PMID:18268498
Feb 12. SOX2 is frequently downregulated in gastric cancers and inhibits cell growth through cell-cycle arrest and apoptosis.
GO:0043410 positive regulation of MAPK cascade
IDA
PMID:18187129
Forced expression of Sox2 or Nanog in human bone marrow deri...
KEEP AS NON CORE
Summary: IDA evidence for SOX2 involvement in MAPK signaling.
Reason: While experimentally supported, MAPK cascade regulation represents a context-specific signaling outcome rather than a core molecular function. SOX2's primary function is as a transcription factor; MAPK regulation may be an indirect consequence or occur in specific cellular contexts (e.g., mesenchymal stem cells in this study). This is valid but peripheral to core transcriptional regulatory function.
Supporting Evidence:
PMID:18187129
Epub 2007 Dec 4. Forced expression of Sox2 or Nanog in human bone marrow derived mesenchymal stem cells maintains their expansion and differentiation capabilities.
GO:0048839 inner ear development
IBA
GO_REF:0000033
KEEP AS NON CORE
Summary: SOX2 expression and function in inner ear development.
Reason: SOX2 plays roles in sensory organ development including inner ear, but this represents tissue-specific developmental pleiotropy rather than core function. The core function is maintaining progenitor populations; inner ear development is one of many developmental contexts where this occurs.
GO:0048839 inner ear development
IEP
PMID:18407919
Novel mutations in LHX3 are associated with hypopituitarism ...
KEEP AS NON CORE
Summary: Expression pattern evidence during inner ear development.
Reason: IEP evidence supports expression during inner ear development but does not demonstrate specific function. Consistent with SOX2's broader role in sensory progenitor maintenance.
Supporting Evidence:
PMID:18407919
Apr 10. Novel mutations in LHX3 are associated with hypopituitarism and sensorineural hearing loss.
GO:0001654 eye development
IEP
PMID:18285410
SOX2 plays a critical role in the pituitary, forebrain, and ...
KEEP AS NON CORE
Summary: Expression pattern during eye development.
Reason: SOX2 mutations cause anophthalmia/microphthalmia, demonstrating clear importance in eye development. However, this represents tissue-specific developmental outcome rather than core molecular function. Like inner ear development, eye development is one context where SOX2's progenitor-maintaining function is critical.
Supporting Evidence:
PMID:18285410
Epub 2008 Feb 19. SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
GO:0021983 pituitary gland development
IEP
PMID:18285410
SOX2 plays a critical role in the pituitary, forebrain, and ...
KEEP AS NON CORE
Summary: Expression pattern during pituitary development.
Reason: Another tissue-specific developmental context where SOX2 maintains progenitor populations. Valid annotation but represents developmental pleiotropy rather than core cellular function.
Supporting Evidence:
PMID:18285410
Epub 2008 Feb 19. SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
GO:0001714 endodermal cell fate specification
IDA
PMID:21245162
Pluripotency factors regulate definitive endoderm specificat...
KEEP AS NON CORE
Summary: IDA evidence for role in endoderm specification.
Reason: SOX2 with OCT4 regulates germ layer specification including endoderm fate. While experimentally supported, this represents one developmental context among many. SOX2's core function is maintaining pluripotency and regulating lineage choice; endoderm specification is one specific outcome.
Supporting Evidence:
PMID:21245162
Pluripotency factors regulate definitive endoderm specification through eomesodermin.
GO:0001649 osteoblast differentiation
IDA
PMID:18187129
Forced expression of Sox2 or Nanog in human bone marrow deri...
KEEP AS NON CORE
Summary: IDA evidence for SOX2 role in osteoblast differentiation when ectopically expressed in mesenchymal stem cells.
Reason: This represents an experimental manipulation (forced SOX2 expression in mesenchymal stem cells) rather than SOX2's endogenous core function. While experimentally valid, osteoblast differentiation is not a physiological context where SOX2 normally functions and represents experimental pleiotropy.
Supporting Evidence:
PMID:18187129
Epub 2007 Dec 4. Forced expression of Sox2 or Nanog in human bone marrow derived mesenchymal stem cells maintains their expansion and differentiation capabilities.
GO:0090090 negative regulation of canonical Wnt signaling pathway
IDA
PMID:18285410
SOX2 plays a critical role in the pituitary, forebrain, and ...
KEEP AS NON CORE
Summary: IDA evidence for SOX2 regulation of Wnt signaling.
Reason: While experimentally supported, Wnt pathway regulation represents cross-talk between SOX2's transcriptional program and developmental signaling rather than a core molecular function. This is a valid regulatory connection but peripheral to SOX2's primary role as a transcriptional regulator of pluripotency and neural fate.
Supporting Evidence:
PMID:18285410
Epub 2008 Feb 19. SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
GO:0070848 response to growth factor
IDA
PMID:18187129
Forced expression of Sox2 or Nanog in human bone marrow deri...
KEEP AS NON CORE
Summary: IDA evidence for SOX2 response to growth factor stimulation.
Reason: This represents an upstream regulatory input rather than SOX2's core function. Growth factors can regulate SOX2 expression or activity, but this does not define what SOX2 itself does. This is valid annotation but describes regulation of SOX2 rather than SOX2's function.
Supporting Evidence:
PMID:18187129
Epub 2007 Dec 4. Forced expression of Sox2 or Nanog in human bone marrow derived mesenchymal stem cells maintains their expansion and differentiation capabilities.
GO:0009611 response to wounding
IEP
PMID:17982423
Stem cell markers in the human posterior limbus and corneal ...
KEEP AS NON CORE
Summary: Expression pattern evidence in corneal stem cells after wounding.
Reason: IEP evidence showing SOX2 expression in corneal stem cells during wound healing. This represents SOX2's role in maintaining stem cell populations in a regenerative context, which is consistent with core function but wound response itself is a specific physiological context rather than core cellular function.
Supporting Evidence:
PMID:17982423
Stem cell markers in the human posterior limbus and corneal endothelium of unwounded and wounded corneas.
GO:0006325 chromatin organization
NAS
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2...
ACCEPT
Summary: NAS evidence for chromatin organization, likely based on SOX2's ability to bend DNA and engage nucleosomes.
Reason: This annotation accurately captures SOX2's pioneer factor function. Deep research shows SOX2 engages nucleosomal DNA, bends DNA, and maintains chromatin accessibility at key regulatory sites. This is integral to SOX2's mechanism of action as a pioneer transcription factor and represents a core molecular activity.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
SOX2 can bind nucleosomal DNA in a position- and sequence-dependent manner and maintain chromatin accessibility at key regulatory sites
file:genes/human/SOX2/SOX2-deep-research-falcon.md
thousands of ATAC-seq peaks are lost within ~1 hour of SOX2 degradation
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2 gene.
GO:0001228 DNA-binding transcription activator activity, RNA polymerase II-specific
IBA
GO_REF:0000033
ACCEPT
Summary: SOX2 functions as a sequence-specific transcriptional activator binding to Sox motifs (WWCAAW consensus).
Reason: This annotation accurately represents SOX2's core molecular function. The HMG-box domain binds sequence-specifically to Sox motifs and activates transcription. This is supported by IBA phylogenetic inference and extensive experimental evidence.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
SOX2 contains a ~79-aa HMG box that binds the minor groove of cognate sequences (consensus WWCAAW; commonly the core TTGT)
GO:0001228 DNA-binding transcription activator activity, RNA polymerase II-specific
IEA
GO_REF:0000107
ACCEPT
Summary: Electronic annotation based on Ensembl ortholog data.
Reason: Electronic annotation is consistent with experimental and phylogenetic evidence for transcriptional activator function.
GO:0000981 DNA-binding transcription factor activity, RNA polymerase II-specific
ISA
GO_REF:0000113
ACCEPT
Summary: ISA evidence from TFClass database classification of SOX2 as a sequence-specific DNA-binding transcription factor.
Reason: This is a parent term of DNA-binding transcription activator activity and accurately describes SOX2's molecular function class. The ISA evidence from TFClass is appropriate for classifying SOX2 within the transcription factor family.
GO:0003700 DNA-binding transcription factor activity
IDA
PMID:18027866
Aberrant expression of SOX2 upregulates MUC5AC gastric foveo...
ACCEPT
Summary: IDA evidence for general transcription factor activity.
Reason: This is a high-level parent term accurately describing SOX2's molecular function. Multiple IDA studies support this classification.
Supporting Evidence:
PMID:18027866
Aberrant expression of SOX2 upregulates MUC5AC gastric foveolar mucin in mucinous cancers of the colorectum and related lesions.
GO:0003700 DNA-binding transcription factor activity
IDA
PMID:16153702
Core transcriptional regulatory circuitry in human embryonic...
ACCEPT
Summary: Additional IDA evidence for transcription factor activity.
Reason: Independent experimental support for transcription factor classification.
Supporting Evidence:
PMID:16153702
Core transcriptional regulatory circuitry in human embryonic stem cells.
GO:0003700 DNA-binding transcription factor activity
NAS
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2...
ACCEPT
Summary: NAS evidence from original SOX2 characterization.
Reason: Historical NAS annotation from the original SOX2 cloning and characterization is appropriate.
Supporting Evidence:
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2 gene.
GO:0000978 RNA polymerase II cis-regulatory region sequence-specific DNA binding
IBA
GO_REF:0000033
ACCEPT
Summary: SOX2 binds to cis-regulatory enhancer regions through sequence-specific recognition of Sox motifs.
Reason: This annotation accurately captures SOX2's binding to enhancer elements. Deep research shows SOX2 binds thousands of enhancers and recognizes specific DNA sequences. This represents a core molecular function of the HMG-box domain.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG) enhancers in pluripotent cells
GO:0000976 transcription cis-regulatory region binding
IEA
GO_REF:0000107
ACCEPT
Summary: Electronic annotation for cis-regulatory region binding.
Reason: This is a parent term of sequence-specific cis-regulatory binding and is supported by extensive evidence of SOX2 binding to enhancers.
GO:0000976 transcription cis-regulatory region binding
IDA
PMID:18407919
Novel mutations in LHX3 are associated with hypopituitarism ...
ACCEPT
Summary: IDA evidence for cis-regulatory binding.
Reason: Direct experimental evidence supports SOX2's binding to regulatory regions.
Supporting Evidence:
PMID:18407919
Apr 10. Novel mutations in LHX3 are associated with hypopituitarism and sensorineural hearing loss.
GO:0043565 sequence-specific DNA binding
IDA
PMID:20713518
Foxp1 coordinates cardiomyocyte proliferation through both c...
ACCEPT
Summary: IDA evidence for sequence-specific DNA binding.
Reason: This annotation accurately describes the fundamental DNA-binding property of the SOX2 HMG-box domain, which recognizes specific Sox motif sequences. This is a core molecular function.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
HMG box that binds the minor groove of cognate sequences (consensus WWCAAW)
PMID:20713518
Foxp1 coordinates cardiomyocyte proliferation through both cell-autonomous and nonautonomous mechanisms.
GO:0003677 DNA binding
IEA
GO_REF:0000043
ACCEPT
Summary: Electronic annotation based on UniProtKB keyword mapping.
Reason: This high-level term is appropriate for describing SOX2's DNA-binding capacity. While very general, it is not incorrect and is supported by HMG-box domain annotation.
GO:0003677 DNA binding
IDA
PMID:19427902
Purification of human transcription factors Nanog and Sox2, ...
ACCEPT
Summary: IDA evidence for DNA binding from biochemical study.
Reason: Direct experimental evidence for DNA binding is appropriate.
Supporting Evidence:
PMID:19427902
2009 May 8. Purification of human transcription factors Nanog and Sox2, each in complex with Skp, an Escherichia coli periplasmic chaperone.
GO:0003677 DNA binding
IDA
PMID:18285410
SOX2 plays a critical role in the pituitary, forebrain, and ...
ACCEPT
Summary: Additional IDA evidence for DNA binding.
Reason: Multiple independent experimental confirmations of DNA-binding activity.
Supporting Evidence:
PMID:18285410
Epub 2008 Feb 19. SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
GO:0003677 DNA binding
IDA
PMID:18407919
Novel mutations in LHX3 are associated with hypopituitarism ...
ACCEPT
Summary: Further IDA evidence for DNA binding.
Reason: Consistent experimental support for DNA-binding function.
Supporting Evidence:
PMID:18407919
Apr 10. Novel mutations in LHX3 are associated with hypopituitarism and sensorineural hearing loss.
GO:0003677 DNA binding
IDA
PMID:16153702
Core transcriptional regulatory circuitry in human embryonic...
ACCEPT
Summary: IDA evidence from embryonic stem cell study.
Reason: DNA binding is well-established core molecular function.
Supporting Evidence:
PMID:16153702
Core transcriptional regulatory circuitry in human embryonic stem cells.
GO:0003677 DNA binding
NAS
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2...
ACCEPT
Summary: NAS evidence from original SOX2 paper.
Reason: Historical annotation appropriate for fundamental DNA-binding function.
Supporting Evidence:
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2 gene.
GO:0005515 protein binding
IPI
PMID:19736317
Interplay of Oct4 with Sox2 and Sox17: a molecular switch fr...
REMOVE
Summary: IPI evidence for protein-protein interaction (OCT4).
Reason: The term "protein binding" is uninformative and does not tell us about SOX2's actual function. While SOX2 does bind OCT4 and other proteins, a more specific term like GO:0046982 (protein heterodimerization activity) or annotation of the specific complex (OCT4-SOX2 complex) would be more informative. Per curation guidelines, we should avoid generic "protein binding" annotations in favor of more specific molecular function terms.
Supporting Evidence:
PMID:19736317
Interplay of Oct4 with Sox2 and Sox17: a molecular switch from stem cell pluripotency to specifying a cardiac fate.
GO:0005515 protein binding
IPI
PMID:23667531
The SOX2-interactome in brain cancer cells identifies the re...
REMOVE
Summary: IPI evidence for protein interaction (USP9X, MSI2).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:23667531
The SOX2-interactome in brain cancer cells identifies the requirement of MSI2 and USP9X for the growth of brain tumor cells.
GO:0005515 protein binding
IPI
PMID:24048479
Deterministic direct reprogramming of somatic cells to pluri...
REMOVE
Summary: IPI evidence for protein interaction (MBD3).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:24048479
Deterministic direct reprogramming of somatic cells to pluripotency.
GO:0005515 protein binding
IPI
PMID:24525231
The PRKCI and SOX2 oncogenes are coamplified and cooperate t...
REMOVE
Summary: IPI evidence for protein interaction (PRKCI).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:24525231
The PRKCI and SOX2 oncogenes are coamplified and cooperate to activate Hedgehog signaling in lung squamous cell carcinoma.
GO:0005515 protein binding
IPI
PMID:25416956
A proteome-scale map of the human interactome network.
REMOVE
Summary: IPI evidence from interactome mapping (SUMO1).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:25416956
A proteome-scale map of the human interactome network.
GO:0005515 protein binding
IPI
PMID:25702638
The L1TD1 protein interactome reveals the importance of post...
REMOVE
Summary: IPI evidence for protein interaction (L1TD1, OCT4).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:25702638
Epub 2015 Feb 19. The L1TD1 protein interactome reveals the importance of post-transcriptional regulation in human pluripotency.
GO:0005515 protein binding
IPI
PMID:27334688
Oncogenic Sox2 regulates and cooperates with VRK1 in cell cy...
REMOVE
Summary: IPI evidence for protein interaction (VRK1).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:27334688
Oncogenic Sox2 regulates and cooperates with VRK1 in cell cycle progression and differentiation.
GO:0005515 protein binding
IPI
PMID:31515488
Extensive disruption of protein interactions by genetic vari...
REMOVE
Summary: IPI evidence for protein interaction (SUMO1).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:31515488
Extensive disruption of protein interactions by genetic variants across the allele frequency spectrum in human populations.
GO:0005515 protein binding
IPI
PMID:29358331
LSD1 demethylase and the methyl-binding protein PHF20L1 prev...
REMOVE
Summary: IPI evidence for protein interaction (PHF20L1).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:29358331
2018 Jan 22. LSD1 demethylase and the methyl-binding protein PHF20L1 prevent SET7 methyltransferase-dependent proteolysis of the stem-cell protein SOX2.
GO:0005515 protein binding
IPI
PMID:30442713
Proteolysis of methylated SOX2 protein is regulated by L3MBT...
REMOVE
Summary: IPI evidence for protein interaction (L3MBTL3).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:30442713
2018 Nov 15. Proteolysis of methylated SOX2 protein is regulated by L3MBTL3 and CRL4(DCAF5) ubiquitin ligase.
GO:0005515 protein binding
IPI
PMID:18456656
The molecular mechanism governing the oncogenic potential of...
REMOVE
Summary: IPI evidence for protein interaction.
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:18456656
2008 May 2. The molecular mechanism governing the oncogenic potential of SOX2 in breast cancer.
GO:0035198 miRNA binding
IDA
PMID:18710938
Oct4/Sox2-regulated miR-302 targets cyclin D1 in human embry...
KEEP AS NON CORE
Summary: IDA evidence for SOX2 binding to miR-302.
Reason: While experimentally validated, miRNA binding represents a non-canonical activity that is not part of SOX2's core transcription factor function. The primary function of SOX2 is sequence-specific DNA binding and transcriptional regulation. miRNA interaction may represent regulatory crosstalk but is peripheral to the core molecular function.
Supporting Evidence:
PMID:18710938
Aug 18. Oct4/Sox2-regulated miR-302 targets cyclin D1 in human embryonic stem cells.
GO:0005634 nucleus
IBA
GO_REF:0000033
ACCEPT
Summary: SOX2 is a nuclear transcription factor with NLS sequences directing nuclear localization.
Reason: Nuclear localization is essential for SOX2 function as a transcription factor. This annotation accurately represents where SOX2 performs its core function. UniProt describes nuclear localization regulated by NLS/NES sequences, and deep research confirms SOX2 is "predominantly nuclear."
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
Subcellular localization: SOX2 is nuclear; its HMG region and flanking sequences include NLS/NES
GO:0005634 nucleus
IEA
GO_REF:0000120
ACCEPT
Summary: Electronic annotation based on sequence analysis and ortholog data.
Reason: Electronic annotation is consistent with experimental evidence for nuclear localization.
GO:0005634 nucleus
IDA
PMID:17291498
Sox-2 is expressed by glial and progenitor cells and Pax-6 i...
ACCEPT
Summary: Direct experimental evidence for nuclear localization.
Reason: Multiple IDA studies confirm nuclear localization through microscopy or fractionation.
Supporting Evidence:
PMID:17291498
2006 Dec 20. Sox-2 is expressed by glial and progenitor cells and Pax-6 is expressed by neuroblasts in the human subventricular zone.
GO:0005634 nucleus
IDA
PMID:18027866
Aberrant expression of SOX2 upregulates MUC5AC gastric foveo...
ACCEPT
Summary: Additional IDA evidence for nuclear localization.
Reason: Consistent experimental evidence across multiple studies.
Supporting Evidence:
PMID:18027866
Aberrant expression of SOX2 upregulates MUC5AC gastric foveolar mucin in mucinous cancers of the colorectum and related lesions.
GO:0005634 nucleus
IDA
PMID:18285410
SOX2 plays a critical role in the pituitary, forebrain, and ...
ACCEPT
Summary: Further IDA evidence for nuclear localization.
Reason: Well-established nuclear localization.
Supporting Evidence:
PMID:18285410
Epub 2008 Feb 19. SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
GO:0005634 nucleus
IC
PMID:16153702
Core transcriptional regulatory circuitry in human embryonic...
ACCEPT
Summary: Inferred from curator statement based on DNA binding activity.
Reason: IC inference is appropriate - DNA-binding transcription factors must be nuclear to function.
Supporting Evidence:
PMID:16153702
Core transcriptional regulatory circuitry in human embryonic stem cells.
GO:0005634 nucleus
NAS
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2...
ACCEPT
Summary: NAS evidence from original SOX2 paper.
Reason: Historical annotation appropriate for nuclear localization.
Supporting Evidence:
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2 gene.
GO:0005654 nucleoplasm
IDA
GO_REF:0000052
ACCEPT
Summary: IDA evidence based on immunofluorescence localization.
Reason: Nucleoplasm is a more specific cellular component than nucleus, indicating SOX2 is soluble within the nuclear compartment rather than associated with nuclear envelope or other nuclear structures. This is appropriate for a transcription factor.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-1112609
ACCEPT
Summary: TAS evidence from Reactome pathway annotation (multiple Reactome entries).
Reason: Multiple Reactome pathway annotations place SOX2 in nucleoplasm, consistent with its function in transcriptional regulation. These TAS annotations are appropriate.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972948
ACCEPT
Summary: TAS from Reactome (OCT4, SOX2, NANOG bind GSC promoter).
Reason: Consistent with nucleoplasmic localization during transcriptional activity.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972949
ACCEPT
Summary: TAS from Reactome (OCT4, SOX2, NANOG bind EOMES promoter).
Reason: Multiple Reactome TAS annotations are consistent with nucleoplasm localization.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972956
ACCEPT
Summary: TAS from Reactome (STAT3 promoter binding).
Reason: Consistent nucleoplasm localization across Reactome pathways.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972960
ACCEPT
Summary: TAS from Reactome (FGF2 promoter binding).
Reason: Multiple consistent Reactome annotations support nucleoplasm localization.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972962
ACCEPT
Summary: TAS from Reactome (TDGF1 promoter binding).
Reason: Consistent Reactome pathway annotations.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972964
ACCEPT
Summary: TAS from Reactome (FOXD3 promoter binding).
Reason: Consistent with nucleoplasm localization.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972965
ACCEPT
Summary: TAS from Reactome (TSC22D1 promoter binding).
Reason: Multiple Reactome TAS annotations are appropriate.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972966
ACCEPT
Summary: TAS from Reactome (HHEX promoter binding).
Reason: Consistent Reactome annotations.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972967
ACCEPT
Summary: TAS from Reactome (EPHA1 promoter binding).
Reason: Appropriate TAS annotation from Reactome.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972975
ACCEPT
Summary: TAS from Reactome (SALL1 promoter binding).
Reason: Consistent with nucleoplasm localization during transcriptional regulation.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972977
ACCEPT
Summary: TAS from Reactome (DKK1 promoter binding).
Reason: Multiple consistent Reactome pathway annotations.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972978
ACCEPT
Summary: TAS from Reactome (ZIC3 promoter binding).
Reason: Appropriate Reactome TAS annotation.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-2972979
ACCEPT
Summary: TAS from Reactome (CDX2 promoter).
Reason: Consistent with nucleoplasm localization.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-452894
ACCEPT
Summary: TAS from Reactome (Expression of SOX2).
Reason: Appropriate Reactome annotation.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-480204
ACCEPT
Summary: TAS from Reactome (NANOG promoter binding).
Reason: Consistent nucleoplasm localization.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-480685
ACCEPT
Summary: TAS from Reactome (SOX2 promoter autoregulation).
Reason: Appropriate Reactome TAS annotation.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-5626938
ACCEPT
Summary: TAS from Reactome (Beta-catenin binds SOX proteins).
Reason: Consistent with nucleoplasm localization.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-6790036
ACCEPT
Summary: TAS from Reactome (STAT3-upregulated nuclear proteins).
Reason: Appropriate Reactome annotation.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-6800120
ACCEPT
Summary: TAS from Reactome (DPPA4 gene binding).
Reason: Consistent nucleoplasm localization.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-9732125
ACCEPT
Summary: TAS from Reactome (MITF promoter binding).
Reason: Appropriate Reactome TAS annotation.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-9833066
ACCEPT
Summary: TAS from Reactome (ZEB2 gene binding).
Reason: Consistent with nucleoplasm localization.
GO:0005654 nucleoplasm
TAS
Reactome:R-HSA-9926523
ACCEPT
Summary: TAS from Reactome (MITF-dependent SOX2 expression).
Reason: Final Reactome TAS annotation, consistent with others.
GO:0016607 nuclear speck
IEA
GO_REF:0000044
ACCEPT
Summary: Electronic annotation suggesting SOX2 localizes to nuclear speckles.
Reason: Nuclear speckles are subnuclear domains enriched in splicing factors and certain transcription factors. UniProt states SOX2 colocalizes in speckles with SOX6. While this is a more specialized localization than general nucleoplasm, it is consistent with SOX2's dynamic localization in active transcriptional regions. This annotation is appropriate.
Supporting Evidence:
UniProtKB:P48431
Colocalizes with SOX6 in speckles
GO:0000785 chromatin
ISA
GO_REF:0000113
ACCEPT
Summary: ISA evidence from TFClass database indicating SOX2 associates with chromatin.
Reason: This annotation is highly appropriate for SOX2 as a pioneer factor. Deep research shows SOX2 engages nucleosomal DNA and maintains chromatin accessibility. Chromatin localization is essential for SOX2's function as a pioneer transcription factor that can bind DNA in chromatin context.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
As a pioneer factor, SOX2 can bind nucleosomal DNA in a position- and sequence-dependent manner
GO:0005667 transcription regulator complex
TAS
PMID:19736317
Interplay of Oct4 with Sox2 and Sox17: a molecular switch fr...
ACCEPT
Summary: TAS evidence for SOX2 participation in transcriptional regulatory complexes, particularly with OCT4.
Reason: SOX2 forms complexes with OCT4 and other transcription factors to regulate gene expression. This is well-documented and represents an important aspect of SOX2 function - it does not act alone but cooperates with partner factors in multi-protein complexes.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
Structural biophysics and NMR show that SOX2's HMG domain binds nucleosomal DNA in a site- and sequence-dependent manner and forms synergistic complexes with full-length OCT4
PMID:19736317
Interplay of Oct4 with Sox2 and Sox17: a molecular switch from stem cell pluripotency to specifying a cardiac fate.
GO:0005737 cytoplasm
IEA
GO_REF:0000044
ACCEPT
Summary: Electronic annotation based on UniProtKB subcellular location vocabulary.
Reason: While SOX2 is predominantly nuclear, UniProt documents cytoplasmic localization that is regulated by post-translational modifications. Acetylation at K75 promotes nuclear export leading to cytoplasmic localization and proteasomal degradation. This represents a regulatory mechanism for controlling SOX2 levels and activity.
Supporting Evidence:
UniProtKB:P48431
Acetylation contributes to its nuclear localization and deacetylation by HDAC3 induces a cytoplasmic delocalization
file:genes/human/SOX2/SOX2-deep-research-falcon.md
acetylation at K75 within the SOX2 NES, mediated by CBP/p300 and ACSS3/ACSS2, promotes nuclear export
GO:0005737 cytoplasm
IDA
PMID:18027866
Aberrant expression of SOX2 upregulates MUC5AC gastric foveo...
ACCEPT
Summary: IDA evidence showing cytoplasmic localization of SOX2.
Reason: Direct experimental observation of cytoplasmic SOX2 is consistent with regulated nuclear-cytoplasmic shuttling. While not the primary site of function, cytoplasmic localization is physiologically relevant for regulation of SOX2 activity.
Supporting Evidence:
PMID:18027866
Aberrant expression of SOX2 upregulates MUC5AC gastric foveolar mucin in mucinous cancers of the colorectum and related lesions.
GO:0005829 cytosol
IDA
PMID:18000303
Evidence for the presence of stem cell-like progenitor cells...
ACCEPT
Summary: IDA evidence for cytosolic localization.
Reason: Cytosol is the aqueous component of cytoplasm. This annotation is consistent with cytoplasmic localization and represents SOX2 in the soluble cytoplasmic fraction, likely during nuclear-cytoplasmic transport or regulated export.
Supporting Evidence:
PMID:18000303
Evidence for the presence of stem cell-like progenitor cells in human adult pancreas.

Core Functions

SOX2 functions as a sequence-specific DNA-binding transcription factor with an HMG-box domain that binds Sox motifs (WWCAAW consensus). Acts as a pioneer factor engaging nucleosomal DNA to maintain chromatin accessibility at regulatory enhancers. Primarily activates transcription cooperating with OCT4 on composite Sox/Oct motifs, but also functions as a repressor when SUMOylated.

Supporting Evidence:
  • file:genes/human/SOX2/SOX2-deep-research-falcon.md
    SOX2 contains a ~79-aa HMG box that binds the minor groove of cognate sequences (consensus WWCAAW; commonly the core TTGT)
  • file:genes/human/SOX2/SOX2-deep-research-falcon.md
    SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG) enhancers in pluripotent cells
  • file:genes/human/SOX2/SOX2-deep-research-falcon.md
    As a pioneer factor, SOX2 can bind nucleosomal DNA in a position- and sequence-dependent manner and maintain chromatin accessibility at key regulatory sites

SOX2 is one of the four Yamanaka factors (with OCT4, KLF4, and MYC) sufficient for reprogramming somatic cells to induced pluripotent stem cells. Forms transcriptional regulatory complexes with OCT4 and NANOG to maintain embryonic stem cell self-renewal and pluripotency.

Supporting Evidence:
  • file:genes/human/SOX2/SOX2-deep-research-falcon.md
    Critical for early embryogenesis and for embryonic stem cell pluripotency
  • UniProtKB:P48431
    POU5F1/OCT4, SOX2, MYC/c-Myc and KLF4 are the four Yamanaka factors

SOX2 maintains neural stem cells in an undifferentiated state and prevents premature neuronal differentiation by counteracting proneural factors. Essential for neural progenitor self-renewal and maintaining the neural stem cell pool throughout the nervous system.

Supporting Evidence:
  • UniProtKB:P48431
    Keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation
  • file:genes/human/SOX2/SOX2-deep-research-falcon.md
    Downstream SRRT target that mediates the promotion of neural stem cell self-renewal

References

Gene Ontology annotation through association of InterPro records with GO terms
Manual transfer of experimentally-verified manual GO annotation data to orthologs by curator judgment of sequence similarity
Annotation inferences using phylogenetic trees
Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword mapping
Gene Ontology annotation based on UniProtKB/Swiss-Prot Subcellular Location vocabulary mapping
Gene Ontology annotation based on curation of immunofluorescence data
Automatic transfer of experimentally verified manual GO annotation data to orthologs using Ensembl Compara
Gene Ontology annotation of human sequence-specific DNA binding transcription factors based on TFClass database
Combined Automated Annotation using Multiple IEA Methods
file:genes/human/SOX2/SOX2-deep-research-falcon.md
Deep research report on SOX2 function and mechanisms
UniProtKB:P48431
UniProt entry for human SOX2 transcription factor
Core transcriptional regulatory circuitry in human embryonic stem cells
Sox-2 is expressed by glial and progenitor cells and Pax-6 is expressed by neuroblasts in the human subventricular zone.
Stem cell markers in the human posterior limbus and corneal endothelium of unwounded and wounded corneas.
Evidence for the presence of stem cell-like progenitor cells in human adult pancreas.
Aberrant expression of SOX2 upregulates MUC5AC gastric foveolar mucin in mucinous cancers of the colorectum and related lesions.
Forced expression of Sox2 or Nanog in human bone marrow derived mesenchymal stem cells maintains their expansion and differentiation capabilities.
SOX2 is frequently downregulated in gastric cancers and inhibits cell growth through cell-cycle arrest and apoptosis.
SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
Regulation of self-renewal and pluripotency by Sox2 in human embryonic stem cells.
Novel mutations in LHX3 are associated with hypopituitarism and sensorineural hearing loss.
The molecular mechanism governing the oncogenic potential of SOX2 in breast cancer.
Oct4/Sox2-regulated miR-302 targets cyclin D1 in human embryonic stem cells.
MicroRNA-145 regulates OCT4, SOX2, and KLF4 and represses pluripotency in human embryonic stem cells.
Purification of human transcription factors Nanog and Sox2, each in complex with Skp, an Escherichia coli periplasmic chaperone.
Interplay of Oct4 with Sox2 and Sox17: a molecular switch from stem cell pluripotency to specifying a cardiac fate.
Foxp1 coordinates cardiomyocyte proliferation through both cell-autonomous and nonautonomous mechanisms.
Pluripotency factors regulate definitive endoderm specification through eomesodermin.
The SOX2-interactome in brain cancer cells identifies the requirement of MSI2 and USP9X for the growth of brain tumor cells.
Deterministic direct reprogramming of somatic cells to pluripotency.
The PRKCI and SOX2 oncogenes are coamplified and cooperate to activate Hedgehog signaling in lung squamous cell carcinoma.
A proteome-scale map of the human interactome network.
The L1TD1 protein interactome reveals the importance of post-transcriptional regulation in human pluripotency.
Oncogenic Sox2 regulates and cooperates with VRK1 in cell cycle progression and differentiation.
Regulation of ADAM10 by miR-140-5p and potential relevance for Alzheimer's disease.
LSD1 demethylase and the methyl-binding protein PHF20L1 prevent SET7 methyltransferase-dependent proteolysis of the stem-cell protein SOX2.
Proteolysis of methylated SOX2 protein is regulated by L3MBTL3 and CRL4(DCAF5) ubiquitin ligase.
Extensive disruption of protein interactions by genetic variants across the allele frequency spectrum in human populations.
The cDNA sequence and chromosomal location of the human SOX2 gene.
Reactome:R-HSA-1112609
Reactome pathway annotation
Reactome:R-HSA-2972948
POU5F1 (OCT4), SOX2, NANOG bind GSC promoter
Reactome:R-HSA-2972949
POU5F1 (OCT4), SOX2, NANOG bind EOMES promoter
Reactome:R-HSA-2972956
POU5F1 (OCT4), SOX2, NANOG bind STAT3 promoter
Reactome:R-HSA-2972960
POU5F1 (OCT4), SOX2, NANOG bind FGF2 promoter
Reactome:R-HSA-2972962
POU5F1 (OCT4), SOX2, NANOG bind TDGF1 promoter
Reactome:R-HSA-2972964
POU5F1 (OCT4), SOX2, NANOG bind FOXD3 promoter
Reactome:R-HSA-2972965
POU5F1 (OCT4), SOX2, NANOG bind TSC22D1 promoter
Reactome:R-HSA-2972966
POU5F1 (OCT4), SOX2, NANOG bind HHEX promoter
Reactome:R-HSA-2972967
POU5F1 (OCT4), SOX2, NANOG bind EPHA1 promoter
Reactome:R-HSA-2972975
POU5F1 (OCT4), SOX2, NANOG bind SALL1 promoter
Reactome:R-HSA-2972977
POU5F1 (OCT4), SOX2, NANOG bind DKK1 promoter
Reactome:R-HSA-2972978
POU5F1 (OCT4), SOX2, NANOG bind ZIC3 promoter
Reactome:R-HSA-2972979
POU5F1 (OCT4), NANOG bind CDX2 promoter
Reactome:R-HSA-452894
Expression of SOX2
Reactome:R-HSA-480204
POU5F1 (OCT4), SOX2, NANOG, KLF4, PBX1, SMAD2 bind NANOG promoter
Reactome:R-HSA-480685
POU5F1 (OCT4), SOX2, NANOG bind SOX2 promoter
Reactome:R-HSA-5626938
Beta-catenin binds SOX proteins
Reactome:R-HSA-6790036
Expression of STAT3-upregulated nuclear proteins
Reactome:R-HSA-6800120
POU5F1 (OCT4), SOX2, NANOG bind DPPA4 gene
Reactome:R-HSA-9732125
SOX2 binds MITF promoter
Reactome:R-HSA-9833066
SOX2 binds ZEB2 gene
Reactome:R-HSA-9926523
MITF-M-dependent SOX2 gene expression

📚 Additional Documentation

Deep Research Falcon

(SOX2-deep-research-falcon.md)

provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2026-01-11T23:22:02.393412'
end_time: '2026-01-11T23:28:14.006202'
duration_seconds: 371.61
template_file: templates/gene_research_go_focused.md
template_variables:
organism: human
gene_id: SOX2
gene_symbol: SOX2
uniprot_accession: P48431
protein_description: 'RecName: Full=Transcription factor SOX-2;'
gene_info: Name=SOX2;
organism_full: Homo sapiens (Human).
protein_family: Not specified in UniProt
protein_domains: HMG_box_dom. (IPR009071); HMG_box_dom_sf. (IPR036910); SOX_fam.
(IPR022097); SRY-related_HMG-box_TF-like. (IPR050140); HMG_box (PF00505)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 46


Question

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: P48431
  • Protein Description: RecName: Full=Transcription factor SOX-2;
  • Gene Information: Name=SOX2;
  • Organism (full): Homo sapiens (Human).
  • Protein Family: Not specified in UniProt
  • Key Domains: HMG_box_dom. (IPR009071); HMG_box_dom_sf. (IPR036910); SOX_fam. (IPR022097); SRY-related_HMG-box_TF-like. (IPR050140); HMG_box (PF00505)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "SOX2" matches the protein description above
  2. Verify the organism is correct: Homo sapiens (Human).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'SOX2' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene SOX2 (gene ID: SOX2, UniProt: P48431) in human.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Output

Question: You are an expert researcher providing comprehensive, well-cited information.

Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies

Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.

Gene Research for Functional Annotation

⚠️ CRITICAL: Gene/Protein Identification Context

BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.

Target Gene/Protein Identity (from UniProt):

  • UniProt Accession: P48431
  • Protein Description: RecName: Full=Transcription factor SOX-2;
  • Gene Information: Name=SOX2;
  • Organism (full): Homo sapiens (Human).
  • Protein Family: Not specified in UniProt
  • Key Domains: HMG_box_dom. (IPR009071); HMG_box_dom_sf. (IPR036910); SOX_fam. (IPR022097); SRY-related_HMG-box_TF-like. (IPR050140); HMG_box (PF00505)

MANDATORY VERIFICATION STEPS:

  1. Check if the gene symbol "SOX2" matches the protein description above
  2. Verify the organism is correct: Homo sapiens (Human).
  3. Check if protein family/domains align with what you find in literature
  4. If you find literature for a DIFFERENT gene with the same or similar symbol, STOP

If Gene Symbol is Ambiguous or You Cannot Find Relevant Literature:

DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'SOX2' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information

Research Target:

Please provide a comprehensive research report on the gene SOX2 (gene ID: SOX2, UniProt: P48431) in human.

The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.

You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.

We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.

We are interested in where in or outside the cell the gene product carries out its function.

We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.

Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.

Research report: Human SOX2 (UniProt P48431)

Plan summary and verification
- Identity/domain verification: SOX2 is a human SRY-related HMG-box transcription factor with a conserved HMG DNA-binding domain and nuclear localization/export signals, consistent with UniProt P48431 and recent literature describing its HMG box, NLS/NES, and cooperative binding with OCT4 on composite Sox/Oct motifs (URLs: https://doi.org/10.7554/elife.83157; domain overview and motifs) (langroudi2023evaluatingtherole pages 39-42).

1) Key concepts and definitions with current understanding
- Molecular identity and domains: SOX2 contains a ~79-aa HMG box that binds the minor groove of cognate sequences (consensus WWCAAW; commonly the core TTGT) and bends DNA. It also carries nuclear localization and export signals overlapping regulatory regions that control nucleocytoplasmic shuttling (marelli2024sumodependenttranscriptionalrepression pages 1-2, langroudi2023evaluatingtherole pages 39-42). The HMG box mediates base- and shape-reading of DNA and enables pioneer-like chromatin engagement (ozden2023molecularmechanismof pages 1-2).
- DNA recognition and OCT4 cooperativity: Structural biophysics and NMR show that SOX2’s HMG domain binds nucleosomal DNA in a site- and sequence-dependent manner and forms synergistic complexes with full-length OCT4 on adjacent Sox/Oct motifs, with cooperativity enhanced at nucleosome edges (URL: https://doi.org/10.1016/j.jmb.2022.167916; published Jan 30, 2023) (gadea2023structuralplasticityof pages 1-3). Mechanistically, SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG) enhancers in pluripotent cells (URL: https://doi.org/10.1016/j.stemcr.2024.03.003; published May 14, 2024) (tsaytler2024earlyneuralspecification pages 1-2).
- Pioneer factor function: Acute degron-based depletion demonstrates that SOX2 maintains chromatin accessibility at a subset of its sites; thousands of ATAC-seq peaks are lost within ~1 hour of SOX2 degradation, and these SOX2-maintained sites are highly predictive of nearby nascent transcription (URL: https://doi.org/10.15252/embj.2022113150; published Sep 2023) (maresca2023pioneeractivitydistinguishes pages 1-2). Reviews and modeling converge that SOX2, with OCT4 and KLF4, exemplifies pioneer factors that can engage nucleosomal DNA and facilitate chromatin opening (URL: https://doi.org/10.1007/s12551-024-01205-6; published Jul 4, 2024) (orsetti2024structuraldynamicsin pages 1-3).
- Subcellular localization: SOX2 is nuclear; its HMG region and flanking sequences include NLS/NES, and post-translational modifications (PTMs) modulate nuclear export and stability (langroudi2023evaluatingtherole pages 39-42, seo2024acomprehensiveunderstanding pages 1-2).

2) Recent developments (2023–2024) and latest research
- Nucleosome engagement and positional specificity: Integrating NMR, biochemistry, and molecular modeling, SOX2 binds solvent-exposed nucleosomal sites with position dependence, forming super-stable complexes at SHL+5 and preferring SHL2 shapes; suboptimal sites promote partially unfolded, dynamic SOX2 states. OCT4 stabilizes SOX2 bending and engagement on specific adjacent motifs (2023; J Mol Biol) (gadea2023structuralplasticityof pages 1-3). Complementary modeling/biophysics show concurrent base-specific contacts (Arg51, Asn54, Tyr118) and shape-reading are required for sequence-specific nucleosomal recognition, especially at SHL2 (2023; JCIM) (ozden2023molecularmechanismof pages 1-2).
- Quantitative pioneer activity in cells: Acute SOX2 depletion produces rapid loss of thousands of accessible regions within ~1 hour; sites maintained by SOX2 most strongly predict transcriptional output, functionally validated at the Klf2 regulatory element by CRISPR (2023; EMBO J) (maresca2023pioneeractivitydistinguishes pages 1-2).
- Early neural specification logic: A 2024 Stem Cell Reports study identifies 1,898 SOX2-dependent neural-associated enhancers among 8,531 OSN-bound enhancers. Using dTAG-mediated rapid SOX2 ablation (SOX2 reduced to ~1.5%, 1.2%, 0.3% at 2/12/24 h), authors show these enhancers transiently activate immediately upon neural induction to prime Pax6 and other neural fate genes, but close rapidly under mesodermal cues (URL: https://doi.org/10.1016/j.stemcr.2024.03.003; May 14, 2024) (tsaytler2024earlyneuralspecification pages 1-2).
- Developmental/cancer enhancer control of SOX2: A 2023 Nucleic Acids Research paper maps a distal enhancer cluster, SRR124–134 (~124–134 kb downstream), required for SOX2 overexpression in breast and lung adenocarcinoma; deleting the cluster reduces SOX2, reprograms chromatin accessibility and transcriptomes, and correlates with patient tumor accessibility. The conserved SRR124/134 elements are essential for mouse development; homozygous deletion causes lethal failure of tracheoesophageal separation (URL: https://doi.org/10.1093/nar/gkad734; Sep 2023) (abatti2023epigeneticreprogrammingof pages 1-2).
- PTM regulation in cancer: In colorectal cancer models, acetylation at K75 within the SOX2 NES, mediated by CBP/p300 and ACSS3/ACSS2, promotes nuclear export and proteasomal degradation, while HDAC4 (regulated by miR-29a) deacetylates K75; O-GlcNAcylation at S246 further stabilizes/regulates SOX2 (URL: https://doi.org/10.3390/cancers16051035; Mar 3, 2024) (seo2024acomprehensiveunderstanding pages 1-2). In neural stem cells, SUMOylation of SOX2 is required for transcriptional repression of centrosome/centromere and cell-cycle genes and for proliferative restraint (URL: https://doi.org/10.1371/journal.pone.0298818; Mar 2024) (marelli2024sumodependenttranscriptionalrepression pages 22-23, marelli2024sumodependenttranscriptionalrepression pages 1-2).
- Pluripotency network context: Recent reviews reinforce that OCT4/SOX2/NANOG are core master regulators of pluripotency and self-renewal, operating with signaling (LIF/BMP, TGF-β family), chromatin remodeling, and epigenetic modulators (URL: https://doi.org/10.3390/ijms24098386; May 7, 2023) (varzideh2023molecularmechanismsunderlying pages 1-3).

3) Current applications and real-world implementations
- iPSC reprogramming: SOX2 is one of the original Yamanaka factors used for reprogramming human somatic cells to iPSCs and remains a standard component of protocols across cell sources, leveraged for disease modeling and regenerative medicine (varzideh2023molecularmechanismsunderlying pages 1-3). SOX2’s pioneer activity and collaboration with OCT4/KLF4 underlie enhancer priming and chromatin opening during reprogramming (maresca2023pioneeractivitydistinguishes pages 1-2, orsetti2024structuraldynamicsin pages 1-3).
- Enhancer targeting and biomarker translation: In breast and lung adenocarcinoma, SRR124–134 accessibility correlates with SOX2 overexpression in primary tumors, suggesting enhancer-state biomarkers and potential CRISPR-interrogated regulatory dependencies (abatti2023epigeneticreprogrammingof pages 1-2). In SCLC, clinical-pathologic frameworks increasingly consider lineage transcription factors and genomic amplifications (including 3q arms harboring SOX2) as part of emerging molecular stratification schemas and trial design (URL: https://doi.org/10.1038/s41571-024-00914-x; Jul 2024) (sen2024emergingadvancesin pages 1-3).

4) Expert opinions and analysis from authoritative sources
- Pioneer factors and chromatin: A 2024 Biophysical Reviews synthesis argues that multiple structures of TF–nucleosome complexes now clarify binding positions and the open question of whether PFs directly open nucleosomes versus recruiting remodelers. The review highlights SOX2/OCT4/KLF4 as canonical pioneers central to cell-fate transitions and recommends integrating NMR with molecular dynamics to resolve dynamic opening events (orsetti2024structuraldynamicsin pages 1-3). The EMBO Journal study provides in vivo functional criteria for SOX2 “pioneer-maintained” sites that are most predictive of gene activation, sharpening how to distinguish activating from non-activating binding (maresca2023pioneeractivitydistinguishes pages 1-2). Together, these works suggest that SOX2’s context-dependent ability to maintain accessibility is the best predictor of its regulatory impact.

5) Relevant statistics and data from recent studies (with URLs/dates)
- Acute loss-of-function kinetics: SOX2 protein eliminated within ~2 h using dTAG; ATAC-seq revealed 784 and 3,001 DARs at 12 h and 24 h, respectively, and thousands of SOX2-maintained accessible sites lost within ~1 h, with strong coupling to nascent transcription (EMBO J 2023; https://doi.org/10.15252/embj.2022113150; Sep 2023; Stem Cell Reports 2024; https://doi.org/10.1016/j.stemcr.2024.03.003; May 14, 2024) (maresca2023pioneeractivitydistinguishes pages 1-2, tsaytler2024earlyneuralspecification pages 1-2).
- Neural enhancers: 1,898 SOX2-dependent neural-associated enhancers among 8,531 OSN-bound enhancers; SOX2 protein reduced to ~1.5%, 1.2%, 0.3% at 2/12/24 h in KO, with neural induction rapidly activating these enhancers and up-regulating Pax6 (Stem Cell Reports 2024; https://doi.org/10.1016/j.stemcr.2024.03.003) (tsaytler2024earlyneuralspecification pages 1-2).
- Structural determinants of nucleosome engagement: At SHL2 and SHL+5, SOX2 achieves sequence- and shape-specific nucleosome recognition; key HMG residues (Arg51, Asn54, Tyr118) mediate base-specific recognition, and solvation/exposure (solvent-facing DNA) favors binding (JCIM 2023; https://doi.org/10.1021/acs.jcim.2c01520; Jun 12, 2023; J Mol Biol 2023; https://doi.org/10.1016/j.jmb.2022.167916; Jan 30, 2023) (ozden2023molecularmechanismof pages 1-2, gadea2023structuralplasticityof pages 1-3).
- Developmental enhancer control of SOX2: Deletion of SRR124–134 reduces SOX2, changes chromatin accessibility genome-wide, and in mice SRR124/134 deletion is lethal due to tracheoesophageal separation failure (NAR 2023; https://doi.org/10.1093/nar/gkad734) (abatti2023epigeneticreprogrammingof pages 1-2).
- PTMs controlling SOX2 behavior: In CRC cells, K75 acetylation (CBP/p300; ACSS3/ACSS2) promotes nuclear export and proteasomal degradation; HDAC4 deacetylates K75 under miR-29a control; O-GlcNAc at S246 regulates SOX2 stability/activity (Cancers 2024; https://doi.org/10.3390/cancers16051035; Mar 3, 2024). SUMOylation is required for repression of cell-cycle/centrosomal genes in human NSCs and for SOX2-mediated restraint of NSC proliferation (PLOS ONE 2024; https://doi.org/10.1371/journal.pone.0298818; Mar 2024) (seo2024acomprehensiveunderstanding pages 1-2, marelli2024sumodependenttranscriptionalrepression pages 22-23, marelli2024sumodependenttranscriptionalrepression pages 1-2).
- Cancer lineage context: Molecular reviews of SCLC note recurrent copy-number alterations including 3q arm (harboring SOX2), underpinning lineage programs and trial development (Nat Rev Clin Oncol 2024; https://doi.org/10.1038/s41571-024-00914-x) (sen2024emergingadvancesin pages 1-3). NAR 2023 further contextualizes SOX2 oncogenic roles across BRCA, GBM, LUAD, and LUSC and shows enhancer misactivation as a likely mechanism of SOX2 overexpression (abatti2023epigeneticreprogrammingof pages 1-2).

Functional annotation summary
- Primary function: SOX2 is a nuclear sequence-specific transcription factor of the SOXB1 family that binds the minor groove via an HMG domain, bends DNA, and partners with OCT4 on composite Sox/Oct motifs to regulate enhancers. As a pioneer factor, SOX2 can bind nucleosomal DNA in a position- and sequence-dependent manner and maintain chromatin accessibility at key regulatory sites that predict transcriptional output (gadea2023structuralplasticityof pages 1-3, ozden2023molecularmechanismof pages 1-2, maresca2023pioneeractivitydistinguishes pages 1-2).
- Cellular localization: Predominantly nuclear; NLS and NES in/around the HMG region and flanks; PTMs (e.g., K75 acetylation) regulate export and stability (langroudi2023evaluatingtherole pages 39-42, seo2024acomprehensiveunderstanding pages 1-2).
- Pathways/biological processes: SOX2 is central to the pluripotency GRN with OCT4/NANOG and primes early neural fate through a dedicated cohort of SOX2-dependent enhancers (varzideh2023molecularmechanismsunderlying pages 1-3, tsaytler2024earlyneuralspecification pages 1-2). In the nervous system, SUMO-dependent repression contributes to NSC proliferative control (marelli2024sumodependenttranscriptionalrepression pages 22-23, marelli2024sumodependenttranscriptionalrepression pages 1-2).
- Disease genetics and mechanisms: SOX2 haploinsufficiency is classically linked to anophthalmia/microphthalmia spectrum and broader neurodevelopmental phenotypes (references therein) (marelli2024sumodependenttranscriptionalrepression pages 1-2). In cancer, SOX2 is frequently overexpressed or amplified (3q) in squamous lineage tumors and contributes to stemness, plasticity, and therapy resistance; enhancer reprogramming of SRR124–134 underlies overexpression in breast/lung adenocarcinoma (sen2024emergingadvancesin pages 1-3, abatti2023epigeneticreprogrammingof pages 1-2). PTMs modulate SOX2 stability and localization in CRC, offering mechanistic links to tumor progression (seo2024acomprehensiveunderstanding pages 1-2).
- Applications: SOX2 is a core reprogramming factor for iPSC generation, enabling applications in disease modeling and regenerative medicine; its pioneer function is directly leveraged for chromatin remodeling in cell-fate engineering (varzideh2023molecularmechanismsunderlying pages 1-3, maresca2023pioneeractivitydistinguishes pages 1-2, orsetti2024structuraldynamicsin pages 1-3).

Notes on gene/protein identification check
- The gene symbol SOX2, organism Homo sapiens, and domain family (HMG box; SOX family) match UniProt P48431 and are consistent with all cited literature. No conflicting gene symbol usage was found.

References (with URLs/dates inline) with citation mapping
- Marelli et al., PLOS ONE, Mar 2024, https://doi.org/10.1371/journal.pone.0298818 (marelli2024sumodependenttranscriptionalrepression pages 22-23, marelli2024sumodependenttranscriptionalrepression pages 1-2).
- Langroudi (thesis/compendium), 2023; domain/NLS–NES overview (langroudi2023evaluatingtherole pages 39-42).
- Maresca et al., EMBO J, Sep 2023, https://doi.org/10.15252/embj.2022113150 (maresca2023pioneeractivitydistinguishes pages 1-2).
- Hu et al., Nucleic Acids Res, Aug 2023, https://doi.org/10.1093/nar/gkad597 (mechanistic context for Sox–Oct condensates and reprogramming; complements SOX2) (hu2023evaluationofthe pages 18-19).
- Tsaytler et al., Stem Cell Reports, May 14, 2024, https://doi.org/10.1016/j.stemcr.2024.03.003 (tsaytler2024earlyneuralspecification pages 1-2).
- Sen et al., Nat Rev Clin Oncol, Jul 2024, https://doi.org/10.1038/s41571-024-00914-x (sen2024emergingadvancesin pages 1-3).
- Seo et al., Cancers, Mar 3, 2024, https://doi.org/10.3390/cancers16051035 (seo2024acomprehensiveunderstanding pages 1-2).
- Abatti et al., Nucleic Acids Res, Sep 2023, https://doi.org/10.1093/nar/gkad734 (abatti2023epigeneticreprogrammingof pages 1-2).
- Ozden et al., J Chem Inf Model, Jun 12, 2023, https://doi.org/10.1021/acs.jcim.2c01520 (ozden2023molecularmechanismof pages 1-2).
- Gadea & Nikolova, J Mol Biol, Jan 30, 2023, https://doi.org/10.1016/j.jmb.2022.167916 (gadea2023structuralplasticityof pages 1-3).
- Varzideh et al., Int J Mol Sci, May 7, 2023, https://doi.org/10.3390/ijms24098386 (varzideh2023molecularmechanismsunderlying pages 1-3).
- Orsetti et al., Biophysical Reviews, Jul 4, 2024, https://doi.org/10.1007/s12551-024-01205-6 (orsetti2024structuraldynamicsin pages 1-3).

References

  1. (langroudi2023evaluatingtherole pages 39-42): L Langroudi. Evaluating the role of myelin regulatory factor in mouse development. Unknown journal, 2023.

  2. (marelli2024sumodependenttranscriptionalrepression pages 1-2): Elisa Marelli, Jaime Hughes, and Paul J. Scotting. Sumo-dependent transcriptional repression by sox2 inhibits the proliferation of neural stem cells. PLOS ONE, 19:e0298818, Mar 2024. URL: https://doi.org/10.1371/journal.pone.0298818, doi:10.1371/journal.pone.0298818. This article has 4 citations and is from a peer-reviewed journal.

  3. (ozden2023molecularmechanismof pages 1-2): Burcu Ozden, Ramachandran Boopathi, Ayşe Berçin Barlas, Imtiaz N. Lone, Jan Bednar, Carlo Petosa, Seyit Kale, Ali Hamiche, Dimitar Angelov, Stefan Dimitrov, and Ezgi Karaca. Molecular mechanism of nucleosome recognition by the pioneer transcription factor sox. Journal of Chemical Information and Modeling, 63:3839-3853, Jun 2023. URL: https://doi.org/10.1021/acs.jcim.2c01520, doi:10.1021/acs.jcim.2c01520. This article has 11 citations and is from a peer-reviewed journal.

  4. (gadea2023structuralplasticityof pages 1-3): Fabiana C. Malaga Gadea and Evgenia N. Nikolova. Structural plasticity of pioneer factor sox2 and dna bendability modulate nucleosome engagement and sox2-oct4 synergism. Journal of Molecular Biology, 435:167916, Jan 2023. URL: https://doi.org/10.1016/j.jmb.2022.167916, doi:10.1016/j.jmb.2022.167916. This article has 20 citations and is from a domain leading peer-reviewed journal.

  5. (tsaytler2024earlyneuralspecification pages 1-2): Pavel Tsaytler, Gaby Blaess, Manuela Scholze-Wittler, Frederic Koch, and Bernhard G. Herrmann. Early neural specification of stem cells is mediated by a set of sox2-dependent neural-associated enhancers. Stem Cell Reports, 19:618-628, May 2024. URL: https://doi.org/10.1016/j.stemcr.2024.03.003, doi:10.1016/j.stemcr.2024.03.003. This article has 4 citations and is from a domain leading peer-reviewed journal.

  6. (maresca2023pioneeractivitydistinguishes pages 1-2): M. Maresca, Teun van den Brand, Hangpeng Li, Hans Teunissen, James O. J. Davies, and Elzo de Wit. Pioneer activity distinguishes activating from non‐activating sox2 binding sites. The EMBO Journal, Sep 2023. URL: https://doi.org/10.15252/embj.2022113150, doi:10.15252/embj.2022113150. This article has 25 citations.

  7. (orsetti2024structuraldynamicsin pages 1-3): Andrea Orsetti, Daphne van Oosten, Roxana-Geanina Vasarhelyi, Theodor-Marian Dănescu, Jan Huertas, Hugo van Ingen, and Vlad Cojocaru. Structural dynamics in chromatin unraveling by pioneer transcription factors. Biophysical Reviews, 16:365-382, Jun 2024. URL: https://doi.org/10.1007/s12551-024-01205-6, doi:10.1007/s12551-024-01205-6. This article has 11 citations and is from a peer-reviewed journal.

  8. (seo2024acomprehensiveunderstanding pages 1-2): Yoojeong Seo, Dong Keon Kim, Jihye Park, Soo Jung Park, Jae Jun Park, Jae Hee Cheon, and Tae Il Kim. A comprehensive understanding of post-translational modification of sox2 via acetylation and o-glcnacylation in colorectal cancer. Cancers, 16:1035, Mar 2024. URL: https://doi.org/10.3390/cancers16051035, doi:10.3390/cancers16051035. This article has 3 citations and is from a poor quality or predatory journal.

  9. (abatti2023epigeneticreprogrammingof pages 1-2): Luis E Abatti, Patricia Lado-Fernández, Linh Huynh, Manuel Collado, Michael M Hoffman, and Jennifer A Mitchell. Epigenetic reprogramming of a distal developmental enhancer cluster drives sox2 overexpression in breast and lung adenocarcinoma. Nucleic Acids Research, 51:10109-10131, Sep 2023. URL: https://doi.org/10.1093/nar/gkad734, doi:10.1093/nar/gkad734. This article has 21 citations and is from a highest quality peer-reviewed journal.

  10. (marelli2024sumodependenttranscriptionalrepression pages 22-23): Elisa Marelli, Jaime Hughes, and Paul J. Scotting. Sumo-dependent transcriptional repression by sox2 inhibits the proliferation of neural stem cells. PLOS ONE, 19:e0298818, Mar 2024. URL: https://doi.org/10.1371/journal.pone.0298818, doi:10.1371/journal.pone.0298818. This article has 4 citations and is from a peer-reviewed journal.

  11. (varzideh2023molecularmechanismsunderlying pages 1-3): Fahimeh Varzideh, Jessica Gambardella, Urna Kansakar, Stanislovas S. Jankauskas, and Gaetano Santulli. Molecular mechanisms underlying pluripotency and self-renewal of embryonic stem cells. International Journal of Molecular Sciences, 24:8386, May 2023. URL: https://doi.org/10.3390/ijms24098386, doi:10.3390/ijms24098386. This article has 86 citations and is from a poor quality or predatory journal.

  12. (sen2024emergingadvancesin pages 1-3): Triparna Sen, Nobuyuki Takahashi, Subhamoy Chakraborty, Naoko Takebe, Amin H. Nassar, Nagla A. Karim, Sonam Puri, and Abdul Rafeh Naqash. Emerging advances in defining the molecular and therapeutic landscape of small-cell lung cancer. Nature reviews. Clinical oncology, 21:610-627, Jul 2024. URL: https://doi.org/10.1038/s41571-024-00914-x, doi:10.1038/s41571-024-00914-x. This article has 95 citations.

  13. (hu2023evaluationofthe pages 18-19): Haoqing Hu, D. Ho, D. Tan, C. MacCarthy, Cheng-han Yu, Mingxi Weng, H. Schöler, and R. Jauch. Evaluation of the determinants for improved pluripotency induction and maintenance by engineered sox17. Nucleic Acids Research, 51:8934-8956, Aug 2023. URL: https://doi.org/10.1093/nar/gkad597, doi:10.1093/nar/gkad597. This article has 7 citations and is from a highest quality peer-reviewed journal.

Citations

  1. langroudi2023evaluatingtherole pages 39-42
  2. ozden2023molecularmechanismof pages 1-2
  3. gadea2023structuralplasticityof pages 1-3
  4. tsaytler2024earlyneuralspecification pages 1-2
  5. maresca2023pioneeractivitydistinguishes pages 1-2
  6. orsetti2024structuraldynamicsin pages 1-3
  7. abatti2023epigeneticreprogrammingof pages 1-2
  8. seo2024acomprehensiveunderstanding pages 1-2
  9. varzideh2023molecularmechanismsunderlying pages 1-3
  10. sen2024emergingadvancesin pages 1-3
  11. marelli2024sumodependenttranscriptionalrepression pages 1-2
  12. hu2023evaluationofthe pages 18-19
  13. marelli2024sumodependenttranscriptionalrepression pages 22-23
  14. https://doi.org/10.7554/elife.83157;
  15. https://doi.org/10.1016/j.jmb.2022.167916;
  16. https://doi.org/10.1016/j.stemcr.2024.03.003;
  17. https://doi.org/10.15252/embj.2022113150;
  18. https://doi.org/10.1007/s12551-024-01205-6;
  19. https://doi.org/10.1093/nar/gkad734;
  20. https://doi.org/10.3390/cancers16051035;
  21. https://doi.org/10.1371/journal.pone.0298818;
  22. https://doi.org/10.3390/ijms24098386;
  23. https://doi.org/10.1038/s41571-024-00914-x;
  24. https://doi.org/10.1016/j.stemcr.2024.03.003
  25. https://doi.org/10.1021/acs.jcim.2c01520;
  26. https://doi.org/10.1093/nar/gkad734
  27. https://doi.org/10.1038/s41571-024-00914-x
  28. https://doi.org/10.1371/journal.pone.0298818
  29. https://doi.org/10.15252/embj.2022113150
  30. https://doi.org/10.1093/nar/gkad597
  31. https://doi.org/10.3390/cancers16051035
  32. https://doi.org/10.1021/acs.jcim.2c01520
  33. https://doi.org/10.1016/j.jmb.2022.167916
  34. https://doi.org/10.3390/ijms24098386
  35. https://doi.org/10.1007/s12551-024-01205-6
  36. https://doi.org/10.1371/journal.pone.0298818,
  37. https://doi.org/10.1021/acs.jcim.2c01520,
  38. https://doi.org/10.1016/j.jmb.2022.167916,
  39. https://doi.org/10.1016/j.stemcr.2024.03.003,
  40. https://doi.org/10.15252/embj.2022113150,
  41. https://doi.org/10.1007/s12551-024-01205-6,
  42. https://doi.org/10.3390/cancers16051035,
  43. https://doi.org/10.1093/nar/gkad734,
  44. https://doi.org/10.3390/ijms24098386,
  45. https://doi.org/10.1038/s41571-024-00914-x,
  46. https://doi.org/10.1093/nar/gkad597,

📄 View Raw YAML

id: P48431
gene_symbol: SOX2
product_type: PROTEIN
status: COMPLETE
taxon:
  id: NCBITaxon:9606
  label: Homo sapiens
description: 'SOX2 is a SOXB1 family transcription factor containing an HMG-box domain
  that binds the minor groove of DNA and functions as a pioneer factor. As a core
  pluripotency factor (one of the Yamanaka factors), SOX2 maintains embryonic stem
  cell self-renewal and neural stem cell identity. It cooperates with OCT4 on composite
  Sox/Oct DNA motifs to regulate thousands of enhancers controlling pluripotency and
  early neural fate specification. SOX2 is essential for maintaining neural progenitor
  cells in an undifferentiated state and preventing premature neuronal differentiation.
  Nuclear localization is regulated by NLS/NES sequences and post-translational modifications
  including methylation, acetylation, SUMOylation, and ubiquitination.'

existing_annotations:
# Biological Process Annotations - Transcriptional Regulation

  - term:
      id: GO:0000122
      label: negative regulation of transcription by RNA polymerase II
    evidence_type: IBA
    original_reference_id: GO_REF:0000033
    review:
      summary: SOX2 functions as both a transcriptional activator and repressor 
        depending on context. SUMOylation of SOX2 enables transcriptional 
        repression of centrosome/centromere and cell-cycle genes in neural stem 
        cells, providing proliferative restraint.
      action: ACCEPT
      reason: This annotation accurately represents a documented function of 
        SOX2. The deep research shows SUMOylation-dependent repression of 
        cell-cycle genes in neural stem cells. While SOX2 is primarily an 
        activator, its repressor function is well-established and represents a 
        core regulatory mechanism for controlling neural stem cell 
        proliferation.
      supported_by:
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "In neural stem cells, SUMOylation of SOX2 is required
            for transcriptional repression of centrosome/centromere and cell-cycle
            genes and for proliferative restraint"

  - term:
      id: GO:0000122
      label: negative regulation of transcription by RNA polymerase II
    evidence_type: ISS
    original_reference_id: GO_REF:0000024
    review:
      summary: ISS evidence from mouse orthologs supports the repressor function
        of SOX2.
      action: ACCEPT
      reason: ISS annotation based on ortholog data is consistent with 
        experimental evidence from human SOX2 showing transcriptional 
        repression. This is a valid inference from sequence similarity given the
        highly conserved nature of SOX2 function across mammals.

  - term:
      id: GO:0045944
      label: positive regulation of transcription by RNA polymerase II
    evidence_type: IBA
    original_reference_id: GO_REF:0000033
    review:
      summary: SOX2 is fundamentally a transcriptional activator that cooperates
        with OCT4 to activate thousands of enhancers controlling pluripotency 
        and neural fate genes. This is the primary and core molecular function 
        of SOX2.
      action: ACCEPT
      reason: This annotation represents the core function of SOX2 as a 
        transcriptional activator. Deep research shows SOX2 and OCT4 co-occupy 
        thousands of OSN enhancers in pluripotent cells and activate genes 
        essential for pluripotency and neural specification. Multiple IBA, IDA, 
        and IEA annotations converge on this fundamental activity.
      supported_by:
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG)
            enhancers in pluripotent cells"
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "1,898 SOX2-dependent neural-associated enhancers among
            8,531 OSN-bound enhancers"

  - term:
      id: GO:0045944
      label: positive regulation of transcription by RNA polymerase II
    evidence_type: IEA
    original_reference_id: GO_REF:0000107
    review:
      summary: Electronic annotation based on ortholog data from Ensembl.
      action: ACCEPT
      reason: Consistent with IBA and IDA evidence for transcriptional 
        activation. This electronic annotation accurately reflects SOX2's 
        primary function.

  - term:
      id: GO:0045944
      label: positive regulation of transcription by RNA polymerase II
    evidence_type: IDA
    original_reference_id: PMID:21245162
    review:
      summary: Direct experimental evidence for SOX2 transcriptional activation.
      action: ACCEPT
      reason: IDA evidence provides direct experimental support for 
        transcriptional activation function, consistent with the core role of 
        SOX2.

      supported_by:
        - reference_id: PMID:21245162
          supporting_text: Pluripotency factors regulate definitive endoderm 
            specification through eomesodermin.
  - term:
      id: GO:0045944
      label: positive regulation of transcription by RNA polymerase II
    evidence_type: IDA
    original_reference_id: PMID:18027866
    review:
      summary: Independent IDA evidence for transcriptional activation.
      action: ACCEPT
      reason: Multiple independent experimental studies confirm transcriptional 
        activation as a core SOX2 function.

      supported_by:
        - reference_id: PMID:18027866
          supporting_text: Aberrant expression of SOX2 upregulates MUC5AC 
            gastric foveolar mucin in mucinous cancers of the colorectum and 
            related lesions.
  - term:
      id: GO:0045893
      label: positive regulation of DNA-templated transcription
    evidence_type: IDA
    original_reference_id: PMID:29253717
    review:
      summary: IDA evidence for positive regulation of transcription.
      action: ACCEPT
      reason: This is a parent term of RNA pol II-specific transcriptional 
        activation and accurately represents SOX2's core function. The term is 
        at an appropriate level of specificity for general transcriptional 
        activation function.

      supported_by:
        - reference_id: PMID:29253717
          supporting_text: Regulation of ADAM10 by miR-140-5p and potential 
            relevance for Alzheimer's disease.
  - term:
      id: GO:0045893
      label: positive regulation of DNA-templated transcription
    evidence_type: IDA
    original_reference_id: PMID:18407919
    review:
      summary: Additional IDA evidence for transcriptional activation.
      action: ACCEPT
      reason: Multiple experimental studies support this fundamental function of
        SOX2 as a transcriptional activator.

      supported_by:
        - reference_id: PMID:18407919
          supporting_text: Apr 10. Novel mutations in LHX3 are associated with 
            hypopituitarism and sensorineural hearing loss.
  - term:
      id: GO:0006355
      label: regulation of DNA-templated transcription
    evidence_type: IEA
    original_reference_id: GO_REF:0000002
    review:
      summary: Electronic annotation based on InterPro domain IPR022097 (SOX 
        family).
      action: ACCEPT
      reason: This is a valid high-level annotation based on domain 
        architecture. All SOX family transcription factors regulate 
        transcription, and this annotation is appropriately general. While less 
        specific than the positive/negative regulation terms, it is not 
        incorrect.

  - term:
      id: GO:0006355
      label: regulation of DNA-templated transcription
    evidence_type: IDA
    original_reference_id: PMID:16153702
    review:
      summary: IDA evidence for general transcriptional regulation.
      action: ACCEPT
      reason: This parent term is supported by experimental evidence and is 
        appropriately general for describing SOX2's overall transcriptional 
        regulatory function.

      supported_by:
        - reference_id: PMID:16153702
          supporting_text: Core transcriptional regulatory circuitry in human 
            embryonic stem cells.
  - term:
      id: GO:0006355
      label: regulation of DNA-templated transcription
    evidence_type: NAS
    original_reference_id: PMID:7849401
    review:
      summary: Non-traceable author statement from the original SOX2 cloning 
        paper.
      action: ACCEPT
      reason: Historical NAS annotation from the original characterization of 
        SOX2 is appropriate for general transcriptional regulation function.

      supported_by:
        - reference_id: PMID:7849401
          supporting_text: The cDNA sequence and chromosomal location of the 
            human SOX2 gene.
  - term:
      id: GO:0010468
      label: regulation of gene expression
    evidence_type: IMP
    original_reference_id: PMID:18388306
    review:
      summary: IMP evidence showing mutant phenotype affecting gene expression 
        regulation.
      action: ACCEPT
      reason: This is a valid high-level annotation demonstrating SOX2's role in
        gene expression control through mutant phenotype analysis. While very 
        general, it is appropriate as a parent term encompassing transcriptional
        regulation.

# Biological Process - Neural Development and Differentiation

      supported_by:
        - reference_id: PMID:18388306
          supporting_text: Apr 3. Regulation of self-renewal and pluripotency by
            Sox2 in human embryonic stem cells.
  - term:
      id: GO:0030182
      label: neuron differentiation
    evidence_type: IBA
    original_reference_id: GO_REF:0000033
    review:
      summary: SOX2 plays a complex role in neuron differentiation by 
        maintaining neural progenitors in an undifferentiated state and 
        preventing premature neuronal differentiation.
      action: MODIFY
      reason: This term is ambiguous for SOX2's actual role. SOX2 does NOT 
        promote neuron differentiation; rather, it prevents premature 
        differentiation by maintaining neural progenitor identity. A more 
        accurate term would be GO:0045665 (negative regulation of neuron 
        differentiation), which is already present in the annotation set. The 
        IBA annotation likely captures involvement in the process broadly, but 
        this could be misleading about SOX2's actual function.
      proposed_replacement_terms:
        - id: GO:0045665
          label: negative regulation of neuron differentiation
      supported_by:
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "Keeps neural cells undifferentiated by counteracting the
            activity of proneural proteins and suppresses neuronal differentiation"
        - reference_id: UniProtKB:P48431
          supporting_text: "Keeps neural cells undifferentiated by counteracting the
            activity of proneural proteins and suppresses neuronal differentiation"

  - term:
      id: GO:0045665
      label: negative regulation of neuron differentiation
    evidence_type: ISS
    original_reference_id: GO_REF:0000024
    review:
      summary: SOX2 actively suppresses neuronal differentiation to maintain 
        neural progenitor cells in an undifferentiated state. This is a core 
        function in neural development.
      action: ACCEPT
      reason: This annotation accurately captures SOX2's essential role in 
        preventing premature neuronal differentiation. UniProt states that SOX2 
        "keeps neural cells undifferentiated by counteracting the activity of 
        proneural proteins and suppresses neuronal differentiation." This is 
        central to its function in neural stem cell maintenance and represents a
        core activity.
      supported_by:
        - reference_id: UniProtKB:P48431
          supporting_text: "Keeps neural cells undifferentiated by counteracting the
            activity of proneural proteins and suppresses neuronal differentiation"

  - term:
      id: GO:0097150
      label: neuronal stem cell population maintenance
    evidence_type: ISS
    original_reference_id: GO_REF:0000024
    review:
      summary: SOX2 is essential for maintaining neural stem cell identity and 
        self-renewal. This is one of its core functions in the nervous system.
      action: ACCEPT
      reason: This annotation represents a core function of SOX2 in neural 
        biology. Deep research shows SOX2 is critical for neural stem cell 
        maintenance and is described as a factor that "keeps neural cells 
        undifferentiated." The term accurately captures SOX2's essential role in
        maintaining the neural stem cell pool.
      supported_by:
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "Downstream SRRT target that mediates the promotion of
            neural stem cell self-renewal"

  - term:
      id: GO:0007420
      label: brain development
    evidence_type: IBA
    original_reference_id: GO_REF:0000033
    review:
      summary: SOX2 plays essential roles in brain development through 
        maintaining neural progenitor populations and regulating neural fate 
        specification.
      action: KEEP_AS_NON_CORE
      reason: While SOX2 is indeed involved in brain development, this term is 
        very broad and captures pleiotropy rather than core molecular function. 
        The more specific neural stem cell maintenance and neural specification 
        functions are better representations of SOX2's primary roles. This 
        annotation is not incorrect but represents a higher-level developmental 
        outcome rather than the core cellular function.

  - term:
      id: GO:0030900
      label: forebrain development
    evidence_type: IBA
    original_reference_id: GO_REF:0000033
    review:
      summary: SOX2 expression and function in forebrain development is 
        well-documented.
      action: KEEP_AS_NON_CORE
      reason: This is a more specific developmental process than general brain 
        development, but still represents a regional/developmental outcome 
        rather than core molecular function. SOX2 functions the same way 
        (maintaining neural progenitors) across brain regions. The 
        forebrain-specific annotation reflects where SOX2 is expressed and 
        important, but the core function is neural progenitor maintenance 
        regardless of brain region.

  - term:
      id: GO:0030900
      label: forebrain development
    evidence_type: IEP
    original_reference_id: PMID:18285410
    review:
      summary: Expression pattern evidence showing SOX2 expression during 
        forebrain development.
      action: KEEP_AS_NON_CORE
      reason: IEP evidence showing expression pattern during forebrain 
        development is valid but does not demonstrate specific function. This is
        consistent with SOX2's role in maintaining neural progenitors throughout
        the developing brain including forebrain regions.

      supported_by:
        - reference_id: PMID:18285410
          supporting_text: Epub 2008 Feb 19. SOX2 plays a critical role in the 
            pituitary, forebrain, and eye during human embryonic development.
  - term:
      id: GO:0021781
      label: glial cell fate commitment
    evidence_type: NAS
    original_reference_id: PMID:17291498
    review:
      summary: NAS evidence for role in glial cell fate commitment.
      action: KEEP_AS_NON_CORE
      reason: While SOX2 is expressed in glial progenitors and can influence 
        glial fate, this represents one specific outcome of SOX2's more general 
        role in maintaining progenitor identity. The core function is 
        maintaining undifferentiated stem/progenitor states; glial fate 
        commitment is one context-dependent outcome. This is valid but 
        peripheral to core function.

# Biological Process - Stem Cell Maintenance and Pluripotency

      supported_by:
        - reference_id: PMID:17291498
          supporting_text: 2006 Dec 20. Sox-2 is expressed by glial and 
            progenitor cells and Pax-6 is expressed by neuroblasts in the human 
            subventricular zone.
  - term:
      id: GO:0035019
      label: somatic stem cell population maintenance
    evidence_type: IMP
    original_reference_id: PMID:19736317
    review:
      summary: IMP evidence showing SOX2 is required for maintaining somatic 
        stem cell populations, particularly neural stem cells.
      action: ACCEPT
      reason: This annotation represents a core function of SOX2. Multiple 
        independent IMP and IDA studies demonstrate SOX2's essential role in 
        maintaining stem cell populations. This encompasses both embryonic and 
        neural stem cell maintenance and represents a fundamental cellular 
        function of SOX2.
      supported_by:
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "Critical for early embryogenesis and for embryonic stem
            cell pluripotency"

        - reference_id: PMID:19736317
          supporting_text: 'Interplay of Oct4 with Sox2 and Sox17: a molecular switch
            from stem cell pluripotency to specifying a cardiac fate.'
  - term:
      id: GO:0035019
      label: somatic stem cell population maintenance
    evidence_type: IDA
    original_reference_id: PMID:19409607
    review:
      summary: Independent IDA evidence for stem cell maintenance.
      action: ACCEPT
      reason: Multiple independent experimental studies support this core 
        function.

      supported_by:
        - reference_id: PMID:19409607
          supporting_text: Apr 30. MicroRNA-145 regulates OCT4, SOX2, and KLF4 
            and represses pluripotency in human embryonic stem cells.
  - term:
      id: GO:0035019
      label: somatic stem cell population maintenance
    evidence_type: IMP
    original_reference_id: PMID:18388306
    review:
      summary: Additional IMP evidence from mutant phenotype analysis.
      action: ACCEPT
      reason: Mutant phenotype evidence provides strong support for essential 
        role in stem cell maintenance.

# Biological Process - Cell Cycle and Proliferation

      supported_by:
        - reference_id: PMID:18388306
          supporting_text: Apr 3. Regulation of self-renewal and pluripotency by
            Sox2 in human embryonic stem cells.
  - term:
      id: GO:1902807
      label: negative regulation of cell cycle G1/S phase transition
    evidence_type: IDA
    original_reference_id: PMID:18268498
    review:
      summary: IDA evidence for SOX2 role in cell cycle regulation.
      action: ACCEPT
      reason: This annotation is supported by direct experimental evidence and 
        is consistent with SOX2's role in maintaining progenitor cells in a 
        self-renewing state. Cell cycle regulation is an integral part of SOX2's
        function in controlling stem cell proliferation versus differentiation 
        decisions.

      supported_by:
        - reference_id: PMID:18268498
          supporting_text: Feb 12. SOX2 is frequently downregulated in gastric 
            cancers and inhibits cell growth through cell-cycle arrest and 
            apoptosis.
  - term:
      id: GO:0043410
      label: positive regulation of MAPK cascade
    evidence_type: IDA
    original_reference_id: PMID:18187129
    review:
      summary: IDA evidence for SOX2 involvement in MAPK signaling.
      action: KEEP_AS_NON_CORE
      reason: While experimentally supported, MAPK cascade regulation represents
        a context-specific signaling outcome rather than a core molecular 
        function. SOX2's primary function is as a transcription factor; MAPK 
        regulation may be an indirect consequence or occur in specific cellular 
        contexts (e.g., mesenchymal stem cells in this study). This is valid but
        peripheral to core transcriptional regulatory function.

# Biological Process - Development and Fate Specification

      supported_by:
        - reference_id: PMID:18187129
          supporting_text: Epub 2007 Dec 4. Forced expression of Sox2 or Nanog 
            in human bone marrow derived mesenchymal stem cells maintains their 
            expansion and differentiation capabilities.
  - term:
      id: GO:0048839
      label: inner ear development
    evidence_type: IBA
    original_reference_id: GO_REF:0000033
    review:
      summary: SOX2 expression and function in inner ear development.
      action: KEEP_AS_NON_CORE
      reason: SOX2 plays roles in sensory organ development including inner ear,
        but this represents tissue-specific developmental pleiotropy rather than
        core function. The core function is maintaining progenitor populations; 
        inner ear development is one of many developmental contexts where this 
        occurs.

  - term:
      id: GO:0048839
      label: inner ear development
    evidence_type: IEP
    original_reference_id: PMID:18407919
    review:
      summary: Expression pattern evidence during inner ear development.
      action: KEEP_AS_NON_CORE
      reason: IEP evidence supports expression during inner ear development but 
        does not demonstrate specific function. Consistent with SOX2's broader 
        role in sensory progenitor maintenance.

      supported_by:
        - reference_id: PMID:18407919
          supporting_text: Apr 10. Novel mutations in LHX3 are associated with 
            hypopituitarism and sensorineural hearing loss.
  - term:
      id: GO:0001654
      label: eye development
    evidence_type: IEP
    original_reference_id: PMID:18285410
    review:
      summary: Expression pattern during eye development.
      action: KEEP_AS_NON_CORE
      reason: SOX2 mutations cause anophthalmia/microphthalmia, demonstrating 
        clear importance in eye development. However, this represents 
        tissue-specific developmental outcome rather than core molecular 
        function. Like inner ear development, eye development is one context 
        where SOX2's progenitor-maintaining function is critical.

      supported_by:
        - reference_id: PMID:18285410
          supporting_text: Epub 2008 Feb 19. SOX2 plays a critical role in the 
            pituitary, forebrain, and eye during human embryonic development.
  - term:
      id: GO:0021983
      label: pituitary gland development
    evidence_type: IEP
    original_reference_id: PMID:18285410
    review:
      summary: Expression pattern during pituitary development.
      action: KEEP_AS_NON_CORE
      reason: Another tissue-specific developmental context where SOX2 maintains
        progenitor populations. Valid annotation but represents developmental 
        pleiotropy rather than core cellular function.

      supported_by:
        - reference_id: PMID:18285410
          supporting_text: Epub 2008 Feb 19. SOX2 plays a critical role in the 
            pituitary, forebrain, and eye during human embryonic development.
  - term:
      id: GO:0001714
      label: endodermal cell fate specification
    evidence_type: IDA
    original_reference_id: PMID:21245162
    review:
      summary: IDA evidence for role in endoderm specification.
      action: KEEP_AS_NON_CORE
      reason: SOX2 with OCT4 regulates germ layer specification including 
        endoderm fate. While experimentally supported, this represents one 
        developmental context among many. SOX2's core function is maintaining 
        pluripotency and regulating lineage choice; endoderm specification is 
        one specific outcome.

      supported_by:
        - reference_id: PMID:21245162
          supporting_text: Pluripotency factors regulate definitive endoderm 
            specification through eomesodermin.
  - term:
      id: GO:0001649
      label: osteoblast differentiation
    evidence_type: IDA
    original_reference_id: PMID:18187129
    review:
      summary: IDA evidence for SOX2 role in osteoblast differentiation when 
        ectopically expressed in mesenchymal stem cells.
      action: KEEP_AS_NON_CORE
      reason: This represents an experimental manipulation (forced SOX2 
        expression in mesenchymal stem cells) rather than SOX2's endogenous core
        function. While experimentally valid, osteoblast differentiation is not 
        a physiological context where SOX2 normally functions and represents 
        experimental pleiotropy.

# Biological Process - Signaling and Response

      supported_by:
        - reference_id: PMID:18187129
          supporting_text: Epub 2007 Dec 4. Forced expression of Sox2 or Nanog 
            in human bone marrow derived mesenchymal stem cells maintains their 
            expansion and differentiation capabilities.
  - term:
      id: GO:0090090
      label: negative regulation of canonical Wnt signaling pathway
    evidence_type: IDA
    original_reference_id: PMID:18285410
    review:
      summary: IDA evidence for SOX2 regulation of Wnt signaling.
      action: KEEP_AS_NON_CORE
      reason: While experimentally supported, Wnt pathway regulation represents 
        cross-talk between SOX2's transcriptional program and developmental 
        signaling rather than a core molecular function. This is a valid 
        regulatory connection but peripheral to SOX2's primary role as a 
        transcriptional regulator of pluripotency and neural fate.

      supported_by:
        - reference_id: PMID:18285410
          supporting_text: Epub 2008 Feb 19. SOX2 plays a critical role in the 
            pituitary, forebrain, and eye during human embryonic development.
  - term:
      id: GO:0070848
      label: response to growth factor
    evidence_type: IDA
    original_reference_id: PMID:18187129
    review:
      summary: IDA evidence for SOX2 response to growth factor stimulation.
      action: KEEP_AS_NON_CORE
      reason: This represents an upstream regulatory input rather than SOX2's 
        core function. Growth factors can regulate SOX2 expression or activity, 
        but this does not define what SOX2 itself does. This is valid annotation
        but describes regulation of SOX2 rather than SOX2's function.

      supported_by:
        - reference_id: PMID:18187129
          supporting_text: Epub 2007 Dec 4. Forced expression of Sox2 or Nanog 
            in human bone marrow derived mesenchymal stem cells maintains their 
            expansion and differentiation capabilities.
  - term:
      id: GO:0009611
      label: response to wounding
    evidence_type: IEP
    original_reference_id: PMID:17982423
    review:
      summary: Expression pattern evidence in corneal stem cells after wounding.
      action: KEEP_AS_NON_CORE
      reason: IEP evidence showing SOX2 expression in corneal stem cells during 
        wound healing. This represents SOX2's role in maintaining stem cell 
        populations in a regenerative context, which is consistent with core 
        function but wound response itself is a specific physiological context 
        rather than core cellular function.

      supported_by:
        - reference_id: PMID:17982423
          supporting_text: Stem cell markers in the human posterior limbus and 
            corneal endothelium of unwounded and wounded corneas.
  - term:
      id: GO:0006325
      label: chromatin organization
    evidence_type: NAS
    original_reference_id: PMID:7849401
    review:
      summary: NAS evidence for chromatin organization, likely based on SOX2's 
        ability to bend DNA and engage nucleosomes.
      action: ACCEPT
      reason: This annotation accurately captures SOX2's pioneer factor 
        function. Deep research shows SOX2 engages nucleosomal DNA, bends DNA, 
        and maintains chromatin accessibility at key regulatory sites. This is 
        integral to SOX2's mechanism of action as a pioneer transcription factor
        and represents a core molecular activity.
      supported_by:
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "SOX2 can bind nucleosomal DNA in a position- and sequence-dependent
            manner and maintain chromatin accessibility at key regulatory sites"
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "thousands of ATAC-seq peaks are lost within ~1 hour of
            SOX2 degradation"

# Molecular Function - DNA Binding and Transcription Factor Activity

        - reference_id: PMID:7849401
          supporting_text: The cDNA sequence and chromosomal location of the 
            human SOX2 gene.
  - term:
      id: GO:0001228
      label: DNA-binding transcription activator activity, RNA polymerase 
        II-specific
    evidence_type: IBA
    original_reference_id: GO_REF:0000033
    review:
      summary: SOX2 functions as a sequence-specific transcriptional activator 
        binding to Sox motifs (WWCAAW consensus).
      action: ACCEPT
      reason: This annotation accurately represents SOX2's core molecular 
        function. The HMG-box domain binds sequence-specifically to Sox motifs 
        and activates transcription. This is supported by IBA phylogenetic 
        inference and extensive experimental evidence.
      supported_by:
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "SOX2 contains a ~79-aa HMG box that binds the minor groove
            of cognate sequences (consensus WWCAAW; commonly the core TTGT)"

  - term:
      id: GO:0001228
      label: DNA-binding transcription activator activity, RNA polymerase 
        II-specific
    evidence_type: IEA
    original_reference_id: GO_REF:0000107
    review:
      summary: Electronic annotation based on Ensembl ortholog data.
      action: ACCEPT
      reason: Electronic annotation is consistent with experimental and 
        phylogenetic evidence for transcriptional activator function.

  - term:
      id: GO:0000981
      label: DNA-binding transcription factor activity, RNA polymerase 
        II-specific
    evidence_type: ISA
    original_reference_id: GO_REF:0000113
    review:
      summary: ISA evidence from TFClass database classification of SOX2 as a 
        sequence-specific DNA-binding transcription factor.
      action: ACCEPT
      reason: This is a parent term of DNA-binding transcription activator 
        activity and accurately describes SOX2's molecular function class. The 
        ISA evidence from TFClass is appropriate for classifying SOX2 within the
        transcription factor family.

  - term:
      id: GO:0003700
      label: DNA-binding transcription factor activity
    evidence_type: IDA
    original_reference_id: PMID:18027866
    review:
      summary: IDA evidence for general transcription factor activity.
      action: ACCEPT
      reason: This is a high-level parent term accurately describing SOX2's 
        molecular function. Multiple IDA studies support this classification.

      supported_by:
        - reference_id: PMID:18027866
          supporting_text: Aberrant expression of SOX2 upregulates MUC5AC 
            gastric foveolar mucin in mucinous cancers of the colorectum and 
            related lesions.
  - term:
      id: GO:0003700
      label: DNA-binding transcription factor activity
    evidence_type: IDA
    original_reference_id: PMID:16153702
    review:
      summary: Additional IDA evidence for transcription factor activity.
      action: ACCEPT
      reason: Independent experimental support for transcription factor 
        classification.

      supported_by:
        - reference_id: PMID:16153702
          supporting_text: Core transcriptional regulatory circuitry in human 
            embryonic stem cells.
  - term:
      id: GO:0003700
      label: DNA-binding transcription factor activity
    evidence_type: NAS
    original_reference_id: PMID:7849401
    review:
      summary: NAS evidence from original SOX2 characterization.
      action: ACCEPT
      reason: Historical NAS annotation from the original SOX2 cloning and 
        characterization is appropriate.

      supported_by:
        - reference_id: PMID:7849401
          supporting_text: The cDNA sequence and chromosomal location of the 
            human SOX2 gene.
  - term:
      id: GO:0000978
      label: RNA polymerase II cis-regulatory region sequence-specific DNA 
        binding
    evidence_type: IBA
    original_reference_id: GO_REF:0000033
    review:
      summary: SOX2 binds to cis-regulatory enhancer regions through 
        sequence-specific recognition of Sox motifs.
      action: ACCEPT
      reason: This annotation accurately captures SOX2's binding to enhancer 
        elements. Deep research shows SOX2 binds thousands of enhancers and 
        recognizes specific DNA sequences. This represents a core molecular 
        function of the HMG-box domain.
      supported_by:
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG)
            enhancers in pluripotent cells"

  - term:
      id: GO:0000976
      label: transcription cis-regulatory region binding
    evidence_type: IEA
    original_reference_id: GO_REF:0000107
    review:
      summary: Electronic annotation for cis-regulatory region binding.
      action: ACCEPT
      reason: This is a parent term of sequence-specific cis-regulatory binding 
        and is supported by extensive evidence of SOX2 binding to enhancers.

  - term:
      id: GO:0000976
      label: transcription cis-regulatory region binding
    evidence_type: IDA
    original_reference_id: PMID:18407919
    review:
      summary: IDA evidence for cis-regulatory binding.
      action: ACCEPT
      reason: Direct experimental evidence supports SOX2's binding to regulatory
        regions.

      supported_by:
        - reference_id: PMID:18407919
          supporting_text: Apr 10. Novel mutations in LHX3 are associated with 
            hypopituitarism and sensorineural hearing loss.
  - term:
      id: GO:0043565
      label: sequence-specific DNA binding
    evidence_type: IDA
    original_reference_id: PMID:20713518
    review:
      summary: IDA evidence for sequence-specific DNA binding.
      action: ACCEPT
      reason: This annotation accurately describes the fundamental DNA-binding 
        property of the SOX2 HMG-box domain, which recognizes specific Sox motif
        sequences. This is a core molecular function.
      supported_by:
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "HMG box that binds the minor groove of cognate sequences
            (consensus WWCAAW)"

        - reference_id: PMID:20713518
          supporting_text: Foxp1 coordinates cardiomyocyte proliferation through
            both cell-autonomous and nonautonomous mechanisms.
  - term:
      id: GO:0003677
      label: DNA binding
    evidence_type: IEA
    original_reference_id: GO_REF:0000043
    review:
      summary: Electronic annotation based on UniProtKB keyword mapping.
      action: ACCEPT
      reason: This high-level term is appropriate for describing SOX2's 
        DNA-binding capacity. While very general, it is not incorrect and is 
        supported by HMG-box domain annotation.

  - term:
      id: GO:0003677
      label: DNA binding
    evidence_type: IDA
    original_reference_id: PMID:19427902
    review:
      summary: IDA evidence for DNA binding from biochemical study.
      action: ACCEPT
      reason: Direct experimental evidence for DNA binding is appropriate.

      supported_by:
        - reference_id: PMID:19427902
          supporting_text: 2009 May 8. Purification of human transcription 
            factors Nanog and Sox2, each in complex with Skp, an Escherichia 
            coli periplasmic chaperone.
  - term:
      id: GO:0003677
      label: DNA binding
    evidence_type: IDA
    original_reference_id: PMID:18285410
    review:
      summary: Additional IDA evidence for DNA binding.
      action: ACCEPT
      reason: Multiple independent experimental confirmations of DNA-binding 
        activity.

      supported_by:
        - reference_id: PMID:18285410
          supporting_text: Epub 2008 Feb 19. SOX2 plays a critical role in the 
            pituitary, forebrain, and eye during human embryonic development.
  - term:
      id: GO:0003677
      label: DNA binding
    evidence_type: IDA
    original_reference_id: PMID:18407919
    review:
      summary: Further IDA evidence for DNA binding.
      action: ACCEPT
      reason: Consistent experimental support for DNA-binding function.

      supported_by:
        - reference_id: PMID:18407919
          supporting_text: Apr 10. Novel mutations in LHX3 are associated with 
            hypopituitarism and sensorineural hearing loss.
  - term:
      id: GO:0003677
      label: DNA binding
    evidence_type: IDA
    original_reference_id: PMID:16153702
    review:
      summary: IDA evidence from embryonic stem cell study.
      action: ACCEPT
      reason: DNA binding is well-established core molecular function.

      supported_by:
        - reference_id: PMID:16153702
          supporting_text: Core transcriptional regulatory circuitry in human 
            embryonic stem cells.
  - term:
      id: GO:0003677
      label: DNA binding
    evidence_type: NAS
    original_reference_id: PMID:7849401
    review:
      summary: NAS evidence from original SOX2 paper.
      action: ACCEPT
      reason: Historical annotation appropriate for fundamental DNA-binding 
        function.

# Molecular Function - Protein Binding

      supported_by:
        - reference_id: PMID:7849401
          supporting_text: The cDNA sequence and chromosomal location of the 
            human SOX2 gene.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:19736317
    review:
      summary: IPI evidence for protein-protein interaction (OCT4).
      action: REMOVE
      reason: The term "protein binding" is uninformative and does not tell us 
        about SOX2's actual function. While SOX2 does bind OCT4 and other 
        proteins, a more specific term like GO:0046982 (protein 
        heterodimerization activity) or annotation of the specific complex 
        (OCT4-SOX2 complex) would be more informative. Per curation guidelines, 
        we should avoid generic "protein binding" annotations in favor of more 
        specific molecular function terms.

      supported_by:
        - reference_id: PMID:19736317
          supporting_text: 'Interplay of Oct4 with Sox2 and Sox17: a molecular switch
            from stem cell pluripotency to specifying a cardiac fate.'
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:23667531
    review:
      summary: IPI evidence for protein interaction (USP9X, MSI2).
      action: REMOVE
      reason: Generic protein binding term is uninformative per curation 
        guidelines.

      supported_by:
        - reference_id: PMID:23667531
          supporting_text: The SOX2-interactome in brain cancer cells identifies
            the requirement of MSI2 and USP9X for the growth of brain tumor 
            cells.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:24048479
    review:
      summary: IPI evidence for protein interaction (MBD3).
      action: REMOVE
      reason: Generic protein binding term is uninformative per curation 
        guidelines.

      supported_by:
        - reference_id: PMID:24048479
          supporting_text: Deterministic direct reprogramming of somatic cells 
            to pluripotency.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:24525231
    review:
      summary: IPI evidence for protein interaction (PRKCI).
      action: REMOVE
      reason: Generic protein binding term is uninformative per curation 
        guidelines.

      supported_by:
        - reference_id: PMID:24525231
          supporting_text: The PRKCI and SOX2 oncogenes are coamplified and 
            cooperate to activate Hedgehog signaling in lung squamous cell 
            carcinoma.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:25416956
    review:
      summary: IPI evidence from interactome mapping (SUMO1).
      action: REMOVE
      reason: Generic protein binding term is uninformative per curation 
        guidelines.

      supported_by:
        - reference_id: PMID:25416956
          supporting_text: A proteome-scale map of the human interactome 
            network.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:25702638
    review:
      summary: IPI evidence for protein interaction (L1TD1, OCT4).
      action: REMOVE
      reason: Generic protein binding term is uninformative per curation 
        guidelines.

      supported_by:
        - reference_id: PMID:25702638
          supporting_text: Epub 2015 Feb 19. The L1TD1 protein interactome 
            reveals the importance of post-transcriptional regulation in human 
            pluripotency.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:27334688
    review:
      summary: IPI evidence for protein interaction (VRK1).
      action: REMOVE
      reason: Generic protein binding term is uninformative per curation 
        guidelines.

      supported_by:
        - reference_id: PMID:27334688
          supporting_text: Oncogenic Sox2 regulates and cooperates with VRK1 in 
            cell cycle progression and differentiation.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:31515488
    review:
      summary: IPI evidence for protein interaction (SUMO1).
      action: REMOVE
      reason: Generic protein binding term is uninformative per curation 
        guidelines.

      supported_by:
        - reference_id: PMID:31515488
          supporting_text: Extensive disruption of protein interactions by 
            genetic variants across the allele frequency spectrum in human 
            populations.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:29358331
    review:
      summary: IPI evidence for protein interaction (PHF20L1).
      action: REMOVE
      reason: Generic protein binding term is uninformative per curation 
        guidelines.

      supported_by:
        - reference_id: PMID:29358331
          supporting_text: 2018 Jan 22. LSD1 demethylase and the methyl-binding 
            protein PHF20L1 prevent SET7 methyltransferase-dependent proteolysis
            of the stem-cell protein SOX2.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:30442713
    review:
      summary: IPI evidence for protein interaction (L3MBTL3).
      action: REMOVE
      reason: Generic protein binding term is uninformative per curation 
        guidelines.

      supported_by:
        - reference_id: PMID:30442713
          supporting_text: 2018 Nov 15. Proteolysis of methylated SOX2 protein 
            is regulated by L3MBTL3 and CRL4(DCAF5) ubiquitin ligase.
  - term:
      id: GO:0005515
      label: protein binding
    evidence_type: IPI
    original_reference_id: PMID:18456656
    review:
      summary: IPI evidence for protein interaction.
      action: REMOVE
      reason: Generic protein binding term is uninformative per curation 
        guidelines.

# Molecular Function - RNA Binding

      supported_by:
        - reference_id: PMID:18456656
          supporting_text: 2008 May 2. The molecular mechanism governing the 
            oncogenic potential of SOX2 in breast cancer.
  - term:
      id: GO:0035198
      label: miRNA binding
    evidence_type: IDA
    original_reference_id: PMID:18710938
    review:
      summary: IDA evidence for SOX2 binding to miR-302.
      action: KEEP_AS_NON_CORE
      reason: While experimentally validated, miRNA binding represents a 
        non-canonical activity that is not part of SOX2's core transcription 
        factor function. The primary function of SOX2 is sequence-specific DNA 
        binding and transcriptional regulation. miRNA interaction may represent 
        regulatory crosstalk but is peripheral to the core molecular function.

# Cellular Component - Nuclear Localization

      supported_by:
        - reference_id: PMID:18710938
          supporting_text: Aug 18. Oct4/Sox2-regulated miR-302 targets cyclin D1
            in human embryonic stem cells.
  - term:
      id: GO:0005634
      label: nucleus
    evidence_type: IBA
    original_reference_id: GO_REF:0000033
    review:
      summary: SOX2 is a nuclear transcription factor with NLS sequences 
        directing nuclear localization.
      action: ACCEPT
      reason: Nuclear localization is essential for SOX2 function as a 
        transcription factor. This annotation accurately represents where SOX2 
        performs its core function. UniProt describes nuclear localization 
        regulated by NLS/NES sequences, and deep research confirms SOX2 is 
        "predominantly nuclear."
      supported_by:
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "Subcellular localization: SOX2 is nuclear; its HMG region
            and flanking sequences include NLS/NES"

  - term:
      id: GO:0005634
      label: nucleus
    evidence_type: IEA
    original_reference_id: GO_REF:0000120
    review:
      summary: Electronic annotation based on sequence analysis and ortholog 
        data.
      action: ACCEPT
      reason: Electronic annotation is consistent with experimental evidence for
        nuclear localization.

  - term:
      id: GO:0005634
      label: nucleus
    evidence_type: IDA
    original_reference_id: PMID:17291498
    review:
      summary: Direct experimental evidence for nuclear localization.
      action: ACCEPT
      reason: Multiple IDA studies confirm nuclear localization through 
        microscopy or fractionation.

      supported_by:
        - reference_id: PMID:17291498
          supporting_text: 2006 Dec 20. Sox-2 is expressed by glial and 
            progenitor cells and Pax-6 is expressed by neuroblasts in the human 
            subventricular zone.
  - term:
      id: GO:0005634
      label: nucleus
    evidence_type: IDA
    original_reference_id: PMID:18027866
    review:
      summary: Additional IDA evidence for nuclear localization.
      action: ACCEPT
      reason: Consistent experimental evidence across multiple studies.

      supported_by:
        - reference_id: PMID:18027866
          supporting_text: Aberrant expression of SOX2 upregulates MUC5AC 
            gastric foveolar mucin in mucinous cancers of the colorectum and 
            related lesions.
  - term:
      id: GO:0005634
      label: nucleus
    evidence_type: IDA
    original_reference_id: PMID:18285410
    review:
      summary: Further IDA evidence for nuclear localization.
      action: ACCEPT
      reason: Well-established nuclear localization.

      supported_by:
        - reference_id: PMID:18285410
          supporting_text: Epub 2008 Feb 19. SOX2 plays a critical role in the 
            pituitary, forebrain, and eye during human embryonic development.
  - term:
      id: GO:0005634
      label: nucleus
    evidence_type: IC
    original_reference_id: PMID:16153702
    review:
      summary: Inferred from curator statement based on DNA binding activity.
      action: ACCEPT
      reason: IC inference is appropriate - DNA-binding transcription factors 
        must be nuclear to function.

      supported_by:
        - reference_id: PMID:16153702
          supporting_text: Core transcriptional regulatory circuitry in human 
            embryonic stem cells.
  - term:
      id: GO:0005634
      label: nucleus
    evidence_type: NAS
    original_reference_id: PMID:7849401
    review:
      summary: NAS evidence from original SOX2 paper.
      action: ACCEPT
      reason: Historical annotation appropriate for nuclear localization.

      supported_by:
        - reference_id: PMID:7849401
          supporting_text: The cDNA sequence and chromosomal location of the 
            human SOX2 gene.
  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: IDA
    original_reference_id: GO_REF:0000052
    review:
      summary: IDA evidence based on immunofluorescence localization.
      action: ACCEPT
      reason: Nucleoplasm is a more specific cellular component than nucleus, 
        indicating SOX2 is soluble within the nuclear compartment rather than 
        associated with nuclear envelope or other nuclear structures. This is 
        appropriate for a transcription factor.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-1112609
    review:
      summary: TAS evidence from Reactome pathway annotation (multiple Reactome 
        entries).
      action: ACCEPT
      reason: Multiple Reactome pathway annotations place SOX2 in nucleoplasm, 
        consistent with its function in transcriptional regulation. These TAS 
        annotations are appropriate.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972948
    review:
      summary: TAS from Reactome (OCT4, SOX2, NANOG bind GSC promoter).
      action: ACCEPT
      reason: Consistent with nucleoplasmic localization during transcriptional 
        activity.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972949
    review:
      summary: TAS from Reactome (OCT4, SOX2, NANOG bind EOMES promoter).
      action: ACCEPT
      reason: Multiple Reactome TAS annotations are consistent with nucleoplasm 
        localization.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972956
    review:
      summary: TAS from Reactome (STAT3 promoter binding).
      action: ACCEPT
      reason: Consistent nucleoplasm localization across Reactome pathways.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972960
    review:
      summary: TAS from Reactome (FGF2 promoter binding).
      action: ACCEPT
      reason: Multiple consistent Reactome annotations support nucleoplasm 
        localization.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972962
    review:
      summary: TAS from Reactome (TDGF1 promoter binding).
      action: ACCEPT
      reason: Consistent Reactome pathway annotations.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972964
    review:
      summary: TAS from Reactome (FOXD3 promoter binding).
      action: ACCEPT
      reason: Consistent with nucleoplasm localization.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972965
    review:
      summary: TAS from Reactome (TSC22D1 promoter binding).
      action: ACCEPT
      reason: Multiple Reactome TAS annotations are appropriate.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972966
    review:
      summary: TAS from Reactome (HHEX promoter binding).
      action: ACCEPT
      reason: Consistent Reactome annotations.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972967
    review:
      summary: TAS from Reactome (EPHA1 promoter binding).
      action: ACCEPT
      reason: Appropriate TAS annotation from Reactome.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972975
    review:
      summary: TAS from Reactome (SALL1 promoter binding).
      action: ACCEPT
      reason: Consistent with nucleoplasm localization during transcriptional 
        regulation.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972977
    review:
      summary: TAS from Reactome (DKK1 promoter binding).
      action: ACCEPT
      reason: Multiple consistent Reactome pathway annotations.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972978
    review:
      summary: TAS from Reactome (ZIC3 promoter binding).
      action: ACCEPT
      reason: Appropriate Reactome TAS annotation.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-2972979
    review:
      summary: TAS from Reactome (CDX2 promoter).
      action: ACCEPT
      reason: Consistent with nucleoplasm localization.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-452894
    review:
      summary: TAS from Reactome (Expression of SOX2).
      action: ACCEPT
      reason: Appropriate Reactome annotation.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-480204
    review:
      summary: TAS from Reactome (NANOG promoter binding).
      action: ACCEPT
      reason: Consistent nucleoplasm localization.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-480685
    review:
      summary: TAS from Reactome (SOX2 promoter autoregulation).
      action: ACCEPT
      reason: Appropriate Reactome TAS annotation.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-5626938
    review:
      summary: TAS from Reactome (Beta-catenin binds SOX proteins).
      action: ACCEPT
      reason: Consistent with nucleoplasm localization.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-6790036
    review:
      summary: TAS from Reactome (STAT3-upregulated nuclear proteins).
      action: ACCEPT
      reason: Appropriate Reactome annotation.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-6800120
    review:
      summary: TAS from Reactome (DPPA4 gene binding).
      action: ACCEPT
      reason: Consistent nucleoplasm localization.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-9732125
    review:
      summary: TAS from Reactome (MITF promoter binding).
      action: ACCEPT
      reason: Appropriate Reactome TAS annotation.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-9833066
    review:
      summary: TAS from Reactome (ZEB2 gene binding).
      action: ACCEPT
      reason: Consistent with nucleoplasm localization.

  - term:
      id: GO:0005654
      label: nucleoplasm
    evidence_type: TAS
    original_reference_id: Reactome:R-HSA-9926523
    review:
      summary: TAS from Reactome (MITF-dependent SOX2 expression).
      action: ACCEPT
      reason: Final Reactome TAS annotation, consistent with others.

  - term:
      id: GO:0016607
      label: nuclear speck
    evidence_type: IEA
    original_reference_id: GO_REF:0000044
    review:
      summary: Electronic annotation suggesting SOX2 localizes to nuclear 
        speckles.
      action: ACCEPT
      reason: Nuclear speckles are subnuclear domains enriched in splicing 
        factors and certain transcription factors. UniProt states SOX2 
        colocalizes in speckles with SOX6. While this is a more specialized 
        localization than general nucleoplasm, it is consistent with SOX2's 
        dynamic localization in active transcriptional regions. This annotation 
        is appropriate.
      supported_by:
        - reference_id: UniProtKB:P48431
          supporting_text: "Colocalizes with SOX6 in speckles"

  - term:
      id: GO:0000785
      label: chromatin
    evidence_type: ISA
    original_reference_id: GO_REF:0000113
    review:
      summary: ISA evidence from TFClass database indicating SOX2 associates 
        with chromatin.
      action: ACCEPT
      reason: This annotation is highly appropriate for SOX2 as a pioneer 
        factor. Deep research shows SOX2 engages nucleosomal DNA and maintains 
        chromatin accessibility. Chromatin localization is essential for SOX2's 
        function as a pioneer transcription factor that can bind DNA in 
        chromatin context.
      supported_by:
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "As a pioneer factor, SOX2 can bind nucleosomal DNA in
            a position- and sequence-dependent manner"

  - term:
      id: GO:0005667
      label: transcription regulator complex
    evidence_type: TAS
    original_reference_id: PMID:19736317
    review:
      summary: TAS evidence for SOX2 participation in transcriptional regulatory
        complexes, particularly with OCT4.
      action: ACCEPT
      reason: SOX2 forms complexes with OCT4 and other transcription factors to 
        regulate gene expression. This is well-documented and represents an 
        important aspect of SOX2 function - it does not act alone but cooperates
        with partner factors in multi-protein complexes.
      supported_by:
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "Structural biophysics and NMR show that SOX2's HMG domain
            binds nucleosomal DNA in a site- and sequence-dependent manner and forms
            synergistic complexes with full-length OCT4"

# Cellular Component - Cytoplasmic Localization

        - reference_id: PMID:19736317
          supporting_text: 'Interplay of Oct4 with Sox2 and Sox17: a molecular switch
            from stem cell pluripotency to specifying a cardiac fate.'
  - term:
      id: GO:0005737
      label: cytoplasm
    evidence_type: IEA
    original_reference_id: GO_REF:0000044
    review:
      summary: Electronic annotation based on UniProtKB subcellular location 
        vocabulary.
      action: ACCEPT
      reason: While SOX2 is predominantly nuclear, UniProt documents cytoplasmic
        localization that is regulated by post-translational modifications. 
        Acetylation at K75 promotes nuclear export leading to cytoplasmic 
        localization and proteasomal degradation. This represents a regulatory 
        mechanism for controlling SOX2 levels and activity.
      supported_by:
        - reference_id: UniProtKB:P48431
          supporting_text: "Acetylation contributes to its nuclear localization and
            deacetylation by HDAC3 induces a cytoplasmic delocalization"
        - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
          supporting_text: "acetylation at K75 within the SOX2 NES, mediated by CBP/p300
            and ACSS3/ACSS2, promotes nuclear export"

  - term:
      id: GO:0005737
      label: cytoplasm
    evidence_type: IDA
    original_reference_id: PMID:18027866
    review:
      summary: IDA evidence showing cytoplasmic localization of SOX2.
      action: ACCEPT
      reason: Direct experimental observation of cytoplasmic SOX2 is consistent 
        with regulated nuclear-cytoplasmic shuttling. While not the primary site
        of function, cytoplasmic localization is physiologically relevant for 
        regulation of SOX2 activity.

      supported_by:
        - reference_id: PMID:18027866
          supporting_text: Aberrant expression of SOX2 upregulates MUC5AC 
            gastric foveolar mucin in mucinous cancers of the colorectum and 
            related lesions.
  - term:
      id: GO:0005829
      label: cytosol
    evidence_type: IDA
    original_reference_id: PMID:18000303
    review:
      summary: IDA evidence for cytosolic localization.
      action: ACCEPT
      reason: Cytosol is the aqueous component of cytoplasm. This annotation is 
        consistent with cytoplasmic localization and represents SOX2 in the 
        soluble cytoplasmic fraction, likely during nuclear-cytoplasmic 
        transport or regulated export.

# Core Functions Summary

      supported_by:
        - reference_id: PMID:18000303
          supporting_text: Evidence for the presence of stem cell-like 
            progenitor cells in human adult pancreas.
core_functions:
  - molecular_function:
      id: GO:0001228
      label: DNA-binding transcription activator activity, RNA polymerase 
        II-specific
    directly_involved_in:
      - id: GO:0045944
        label: positive regulation of transcription by RNA polymerase II
      - id: GO:0000122
        label: negative regulation of transcription by RNA polymerase II
    locations:
      - id: GO:0005634
        label: nucleus
      - id: GO:0000785
        label: chromatin
    description: SOX2 functions as a sequence-specific DNA-binding transcription
      factor with an HMG-box domain that binds Sox motifs (WWCAAW consensus). 
      Acts as a pioneer factor engaging nucleosomal DNA to maintain chromatin 
      accessibility at regulatory enhancers. Primarily activates transcription 
      cooperating with OCT4 on composite Sox/Oct motifs, but also functions as a
      repressor when SUMOylated.
    supported_by:
      - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
        supporting_text: "SOX2 contains a ~79-aa HMG box that binds the minor groove
          of cognate sequences (consensus WWCAAW; commonly the core TTGT)"
      - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
        supporting_text: "SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG)
          enhancers in pluripotent cells"
      - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
        supporting_text: "As a pioneer factor, SOX2 can bind nucleosomal DNA in a
          position- and sequence-dependent manner and maintain chromatin accessibility
          at key regulatory sites"

  - molecular_function:
      id: GO:0001228
      label: DNA-binding transcription activator activity, RNA polymerase 
        II-specific
    directly_involved_in:
      - id: GO:0035019
        label: somatic stem cell population maintenance
    locations:
      - id: GO:0005634
        label: nucleus
    description: SOX2 is one of the four Yamanaka factors (with OCT4, KLF4, and 
      MYC) sufficient for reprogramming somatic cells to induced pluripotent 
      stem cells. Forms transcriptional regulatory complexes with OCT4 and NANOG
      to maintain embryonic stem cell self-renewal and pluripotency.
    supported_by:
      - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
        supporting_text: "Critical for early embryogenesis and for embryonic stem
          cell pluripotency"
      - reference_id: UniProtKB:P48431
        supporting_text: "POU5F1/OCT4, SOX2, MYC/c-Myc and KLF4 are the four Yamanaka
          factors"

  - molecular_function:
      id: GO:0001228
      label: DNA-binding transcription activator activity, RNA polymerase 
        II-specific
    directly_involved_in:
      - id: GO:0097150
        label: neuronal stem cell population maintenance
      - id: GO:0045665
        label: negative regulation of neuron differentiation
    locations:
      - id: GO:0005634
        label: nucleus
    description: SOX2 maintains neural stem cells in an undifferentiated state 
      and prevents premature neuronal differentiation by counteracting proneural
      factors. Essential for neural progenitor self-renewal and maintaining the 
      neural stem cell pool throughout the nervous system.
    supported_by:
      - reference_id: UniProtKB:P48431
        supporting_text: "Keeps neural cells undifferentiated by counteracting the
          activity of proneural proteins and suppresses neuronal differentiation"
      - reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
        supporting_text: "Downstream SRRT target that mediates the promotion of neural
          stem cell self-renewal"

references:
  - id: GO_REF:0000002
    title: Gene Ontology annotation through association of InterPro records with
      GO terms

  - id: GO_REF:0000024
    title: Manual transfer of experimentally-verified manual GO annotation data 
      to orthologs by curator judgment of sequence similarity

  - id: GO_REF:0000033
    title: Annotation inferences using phylogenetic trees

  - id: GO_REF:0000043
    title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword 
      mapping

  - id: GO_REF:0000044
    title: Gene Ontology annotation based on UniProtKB/Swiss-Prot Subcellular 
      Location vocabulary mapping

  - id: GO_REF:0000052
    title: Gene Ontology annotation based on curation of immunofluorescence data

  - id: GO_REF:0000107
    title: Automatic transfer of experimentally verified manual GO annotation 
      data to orthologs using Ensembl Compara

  - id: GO_REF:0000113
    title: Gene Ontology annotation of human sequence-specific DNA binding 
      transcription factors based on TFClass database

  - id: GO_REF:0000120
    title: Combined Automated Annotation using Multiple IEA Methods

  - id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
    title: Deep research report on SOX2 function and mechanisms

  - id: UniProtKB:P48431
    title: UniProt entry for human SOX2 transcription factor

  - id: PMID:16153702
    title: Core transcriptional regulatory circuitry in human embryonic stem 
      cells

  - id: PMID:17291498
    title: Sox-2 is expressed by glial and progenitor cells and Pax-6 is 
      expressed by neuroblasts in the human subventricular zone.

  - id: PMID:17982423
    title: Stem cell markers in the human posterior limbus and corneal 
      endothelium of unwounded and wounded corneas.

  - id: PMID:18000303
    title: Evidence for the presence of stem cell-like progenitor cells in human
      adult pancreas.

  - id: PMID:18027866
    title: Aberrant expression of SOX2 upregulates MUC5AC gastric foveolar mucin
      in mucinous cancers of the colorectum and related lesions.

  - id: PMID:18187129
    title: Forced expression of Sox2 or Nanog in human bone marrow derived 
      mesenchymal stem cells maintains their expansion and differentiation 
      capabilities.

  - id: PMID:18268498
    title: SOX2 is frequently downregulated in gastric cancers and inhibits cell
      growth through cell-cycle arrest and apoptosis.

  - id: PMID:18285410
    title: SOX2 plays a critical role in the pituitary, forebrain, and eye 
      during human embryonic development.

  - id: PMID:18388306
    title: Regulation of self-renewal and pluripotency by Sox2 in human 
      embryonic stem cells.

  - id: PMID:18407919
    title: Novel mutations in LHX3 are associated with hypopituitarism and 
      sensorineural hearing loss.

  - id: PMID:18456656
    title: The molecular mechanism governing the oncogenic potential of SOX2 in 
      breast cancer.

  - id: PMID:18710938
    title: Oct4/Sox2-regulated miR-302 targets cyclin D1 in human embryonic stem
      cells.

  - id: PMID:19409607
    title: MicroRNA-145 regulates OCT4, SOX2, and KLF4 and represses 
      pluripotency in human embryonic stem cells.

  - id: PMID:19427902
    title: Purification of human transcription factors Nanog and Sox2, each in 
      complex with Skp, an Escherichia coli periplasmic chaperone.

  - id: PMID:19736317
    title: 'Interplay of Oct4 with Sox2 and Sox17: a molecular switch from stem cell
      pluripotency to specifying a cardiac fate.'

  - id: PMID:20713518
    title: Foxp1 coordinates cardiomyocyte proliferation through both 
      cell-autonomous and nonautonomous mechanisms.

  - id: PMID:21245162
    title: Pluripotency factors regulate definitive endoderm specification 
      through eomesodermin.

  - id: PMID:23667531
    title: The SOX2-interactome in brain cancer cells identifies the requirement
      of MSI2 and USP9X for the growth of brain tumor cells.

  - id: PMID:24048479
    title: Deterministic direct reprogramming of somatic cells to pluripotency.

  - id: PMID:24525231
    title: The PRKCI and SOX2 oncogenes are coamplified and cooperate to 
      activate Hedgehog signaling in lung squamous cell carcinoma.

  - id: PMID:25416956
    title: A proteome-scale map of the human interactome network.

  - id: PMID:25702638
    title: The L1TD1 protein interactome reveals the importance of 
      post-transcriptional regulation in human pluripotency.

  - id: PMID:27334688
    title: Oncogenic Sox2 regulates and cooperates with VRK1 in cell cycle 
      progression and differentiation.

  - id: PMID:29253717
    title: Regulation of ADAM10 by miR-140-5p and potential relevance for 
      Alzheimer's disease.

  - id: PMID:29358331
    title: LSD1 demethylase and the methyl-binding protein PHF20L1 prevent SET7 
      methyltransferase-dependent proteolysis of the stem-cell protein SOX2.

  - id: PMID:30442713
    title: Proteolysis of methylated SOX2 protein is regulated by L3MBTL3 and 
      CRL4(DCAF5) ubiquitin ligase.

  - id: PMID:31515488
    title: Extensive disruption of protein interactions by genetic variants 
      across the allele frequency spectrum in human populations.

  - id: PMID:7849401
    title: The cDNA sequence and chromosomal location of the human SOX2 gene.

  - id: Reactome:R-HSA-1112609
    title: Reactome pathway annotation

  - id: Reactome:R-HSA-2972948
    title: POU5F1 (OCT4), SOX2, NANOG bind GSC promoter

  - id: Reactome:R-HSA-2972949
    title: POU5F1 (OCT4), SOX2, NANOG bind EOMES promoter

  - id: Reactome:R-HSA-2972956
    title: POU5F1 (OCT4), SOX2, NANOG bind STAT3 promoter

  - id: Reactome:R-HSA-2972960
    title: POU5F1 (OCT4), SOX2, NANOG bind FGF2 promoter

  - id: Reactome:R-HSA-2972962
    title: POU5F1 (OCT4), SOX2, NANOG bind TDGF1 promoter

  - id: Reactome:R-HSA-2972964
    title: POU5F1 (OCT4), SOX2, NANOG bind FOXD3 promoter

  - id: Reactome:R-HSA-2972965
    title: POU5F1 (OCT4), SOX2, NANOG bind TSC22D1 promoter

  - id: Reactome:R-HSA-2972966
    title: POU5F1 (OCT4), SOX2, NANOG bind HHEX promoter

  - id: Reactome:R-HSA-2972967
    title: POU5F1 (OCT4), SOX2, NANOG bind EPHA1 promoter

  - id: Reactome:R-HSA-2972975
    title: POU5F1 (OCT4), SOX2, NANOG bind SALL1 promoter

  - id: Reactome:R-HSA-2972977
    title: POU5F1 (OCT4), SOX2, NANOG bind DKK1 promoter

  - id: Reactome:R-HSA-2972978
    title: POU5F1 (OCT4), SOX2, NANOG bind ZIC3 promoter

  - id: Reactome:R-HSA-2972979
    title: POU5F1 (OCT4), NANOG bind CDX2 promoter

  - id: Reactome:R-HSA-452894
    title: Expression of SOX2

  - id: Reactome:R-HSA-480204
    title: POU5F1 (OCT4), SOX2, NANOG, KLF4, PBX1, SMAD2 bind NANOG promoter

  - id: Reactome:R-HSA-480685
    title: POU5F1 (OCT4), SOX2, NANOG bind SOX2 promoter

  - id: Reactome:R-HSA-5626938
    title: Beta-catenin binds SOX proteins

  - id: Reactome:R-HSA-6790036
    title: Expression of STAT3-upregulated nuclear proteins

  - id: Reactome:R-HSA-6800120
    title: POU5F1 (OCT4), SOX2, NANOG bind DPPA4 gene

  - id: Reactome:R-HSA-9732125
    title: SOX2 binds MITF promoter

  - id: Reactome:R-HSA-9833066
    title: SOX2 binds ZEB2 gene

  - id: Reactome:R-HSA-9926523
    title: MITF-M-dependent SOX2 gene expression