SOX2 is a SOXB1 family transcription factor containing an HMG-box domain that binds the minor groove of DNA and functions as a pioneer factor. As a core pluripotency factor (one of the Yamanaka factors), SOX2 maintains embryonic stem cell self-renewal and neural stem cell identity. It cooperates with OCT4 on composite Sox/Oct DNA motifs to regulate thousands of enhancers controlling pluripotency and early neural fate specification. SOX2 is essential for maintaining neural progenitor cells in an undifferentiated state and preventing premature neuronal differentiation. Nuclear localization is regulated by NLS/NES sequences and post-translational modifications including methylation, acetylation, SUMOylation, and ubiquitination.
| GO Term | Evidence | Action | Reason |
|---|---|---|---|
|
GO:0000122
negative regulation of transcription by RNA polymerase II
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: SOX2 functions as both a transcriptional activator and repressor depending on context. SUMOylation of SOX2 enables transcriptional repression of centrosome/centromere and cell-cycle genes in neural stem cells, providing proliferative restraint.
Reason: This annotation accurately represents a documented function of SOX2. The deep research shows SUMOylation-dependent repression of cell-cycle genes in neural stem cells. While SOX2 is primarily an activator, its repressor function is well-established and represents a core regulatory mechanism for controlling neural stem cell proliferation.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
In neural stem cells, SUMOylation of SOX2 is required for transcriptional repression of centrosome/centromere and cell-cycle genes and for proliferative restraint
|
|
GO:0000122
negative regulation of transcription by RNA polymerase II
|
ISS
GO_REF:0000024 |
ACCEPT |
Summary: ISS evidence from mouse orthologs supports the repressor function of SOX2.
Reason: ISS annotation based on ortholog data is consistent with experimental evidence from human SOX2 showing transcriptional repression. This is a valid inference from sequence similarity given the highly conserved nature of SOX2 function across mammals.
|
|
GO:0045944
positive regulation of transcription by RNA polymerase II
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: SOX2 is fundamentally a transcriptional activator that cooperates with OCT4 to activate thousands of enhancers controlling pluripotency and neural fate genes. This is the primary and core molecular function of SOX2.
Reason: This annotation represents the core function of SOX2 as a transcriptional activator. Deep research shows SOX2 and OCT4 co-occupy thousands of OSN enhancers in pluripotent cells and activate genes essential for pluripotency and neural specification. Multiple IBA, IDA, and IEA annotations converge on this fundamental activity.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG) enhancers in pluripotent cells
file:genes/human/SOX2/SOX2-deep-research-falcon.md
1,898 SOX2-dependent neural-associated enhancers among 8,531 OSN-bound enhancers
|
|
GO:0045944
positive regulation of transcription by RNA polymerase II
|
IEA
GO_REF:0000107 |
ACCEPT |
Summary: Electronic annotation based on ortholog data from Ensembl.
Reason: Consistent with IBA and IDA evidence for transcriptional activation. This electronic annotation accurately reflects SOX2's primary function.
|
|
GO:0045944
positive regulation of transcription by RNA polymerase II
|
IDA
PMID:21245162 Pluripotency factors regulate definitive endoderm specificat... |
ACCEPT |
Summary: Direct experimental evidence for SOX2 transcriptional activation.
Reason: IDA evidence provides direct experimental support for transcriptional activation function, consistent with the core role of SOX2.
Supporting Evidence:
PMID:21245162
Pluripotency factors regulate definitive endoderm specification through eomesodermin.
|
|
GO:0045944
positive regulation of transcription by RNA polymerase II
|
IDA
PMID:18027866 Aberrant expression of SOX2 upregulates MUC5AC gastric foveo... |
ACCEPT |
Summary: Independent IDA evidence for transcriptional activation.
Reason: Multiple independent experimental studies confirm transcriptional activation as a core SOX2 function.
Supporting Evidence:
PMID:18027866
Aberrant expression of SOX2 upregulates MUC5AC gastric foveolar mucin in mucinous cancers of the colorectum and related lesions.
|
|
GO:0045893
positive regulation of DNA-templated transcription
|
IDA
PMID:29253717 Regulation of ADAM10 by miR-140-5p and potential relevance f... |
ACCEPT |
Summary: IDA evidence for positive regulation of transcription.
Reason: This is a parent term of RNA pol II-specific transcriptional activation and accurately represents SOX2's core function. The term is at an appropriate level of specificity for general transcriptional activation function.
Supporting Evidence:
PMID:29253717
Regulation of ADAM10 by miR-140-5p and potential relevance for Alzheimer's disease.
|
|
GO:0045893
positive regulation of DNA-templated transcription
|
IDA
PMID:18407919 Novel mutations in LHX3 are associated with hypopituitarism ... |
ACCEPT |
Summary: Additional IDA evidence for transcriptional activation.
Reason: Multiple experimental studies support this fundamental function of SOX2 as a transcriptional activator.
Supporting Evidence:
PMID:18407919
Apr 10. Novel mutations in LHX3 are associated with hypopituitarism and sensorineural hearing loss.
|
|
GO:0006355
regulation of DNA-templated transcription
|
IEA
GO_REF:0000002 |
ACCEPT |
Summary: Electronic annotation based on InterPro domain IPR022097 (SOX family).
Reason: This is a valid high-level annotation based on domain architecture. All SOX family transcription factors regulate transcription, and this annotation is appropriately general. While less specific than the positive/negative regulation terms, it is not incorrect.
|
|
GO:0006355
regulation of DNA-templated transcription
|
IDA
PMID:16153702 Core transcriptional regulatory circuitry in human embryonic... |
ACCEPT |
Summary: IDA evidence for general transcriptional regulation.
Reason: This parent term is supported by experimental evidence and is appropriately general for describing SOX2's overall transcriptional regulatory function.
Supporting Evidence:
PMID:16153702
Core transcriptional regulatory circuitry in human embryonic stem cells.
|
|
GO:0006355
regulation of DNA-templated transcription
|
NAS
PMID:7849401 The cDNA sequence and chromosomal location of the human SOX2... |
ACCEPT |
Summary: Non-traceable author statement from the original SOX2 cloning paper.
Reason: Historical NAS annotation from the original characterization of SOX2 is appropriate for general transcriptional regulation function.
Supporting Evidence:
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2 gene.
|
|
GO:0010468
regulation of gene expression
|
IMP
PMID:18388306 Regulation of self-renewal and pluripotency by Sox2 in human... |
ACCEPT |
Summary: IMP evidence showing mutant phenotype affecting gene expression regulation.
Reason: This is a valid high-level annotation demonstrating SOX2's role in gene expression control through mutant phenotype analysis. While very general, it is appropriate as a parent term encompassing transcriptional regulation.
Supporting Evidence:
PMID:18388306
Apr 3. Regulation of self-renewal and pluripotency by Sox2 in human embryonic stem cells.
|
|
GO:0030182
neuron differentiation
|
IBA
GO_REF:0000033 |
MODIFY |
Summary: SOX2 plays a complex role in neuron differentiation by maintaining neural progenitors in an undifferentiated state and preventing premature neuronal differentiation.
Reason: This term is ambiguous for SOX2's actual role. SOX2 does NOT promote neuron differentiation; rather, it prevents premature differentiation by maintaining neural progenitor identity. A more accurate term would be GO:0045665 (negative regulation of neuron differentiation), which is already present in the annotation set. The IBA annotation likely captures involvement in the process broadly, but this could be misleading about SOX2's actual function.
Proposed replacements:
negative regulation of neuron differentiation
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
Keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation
UniProtKB:P48431
Keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation
|
|
GO:0045665
negative regulation of neuron differentiation
|
ISS
GO_REF:0000024 |
ACCEPT |
Summary: SOX2 actively suppresses neuronal differentiation to maintain neural progenitor cells in an undifferentiated state. This is a core function in neural development.
Reason: This annotation accurately captures SOX2's essential role in preventing premature neuronal differentiation. UniProt states that SOX2 "keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation." This is central to its function in neural stem cell maintenance and represents a core activity.
Supporting Evidence:
UniProtKB:P48431
Keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation
|
|
GO:0097150
neuronal stem cell population maintenance
|
ISS
GO_REF:0000024 |
ACCEPT |
Summary: SOX2 is essential for maintaining neural stem cell identity and self-renewal. This is one of its core functions in the nervous system.
Reason: This annotation represents a core function of SOX2 in neural biology. Deep research shows SOX2 is critical for neural stem cell maintenance and is described as a factor that "keeps neural cells undifferentiated." The term accurately captures SOX2's essential role in maintaining the neural stem cell pool.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
Downstream SRRT target that mediates the promotion of neural stem cell self-renewal
|
|
GO:0007420
brain development
|
IBA
GO_REF:0000033 |
KEEP AS NON CORE |
Summary: SOX2 plays essential roles in brain development through maintaining neural progenitor populations and regulating neural fate specification.
Reason: While SOX2 is indeed involved in brain development, this term is very broad and captures pleiotropy rather than core molecular function. The more specific neural stem cell maintenance and neural specification functions are better representations of SOX2's primary roles. This annotation is not incorrect but represents a higher-level developmental outcome rather than the core cellular function.
|
|
GO:0030900
forebrain development
|
IBA
GO_REF:0000033 |
KEEP AS NON CORE |
Summary: SOX2 expression and function in forebrain development is well-documented.
Reason: This is a more specific developmental process than general brain development, but still represents a regional/developmental outcome rather than core molecular function. SOX2 functions the same way (maintaining neural progenitors) across brain regions. The forebrain-specific annotation reflects where SOX2 is expressed and important, but the core function is neural progenitor maintenance regardless of brain region.
|
|
GO:0030900
forebrain development
|
IEP
PMID:18285410 SOX2 plays a critical role in the pituitary, forebrain, and ... |
KEEP AS NON CORE |
Summary: Expression pattern evidence showing SOX2 expression during forebrain development.
Reason: IEP evidence showing expression pattern during forebrain development is valid but does not demonstrate specific function. This is consistent with SOX2's role in maintaining neural progenitors throughout the developing brain including forebrain regions.
Supporting Evidence:
PMID:18285410
Epub 2008 Feb 19. SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
|
|
GO:0021781
glial cell fate commitment
|
NAS
PMID:17291498 Sox-2 is expressed by glial and progenitor cells and Pax-6 i... |
KEEP AS NON CORE |
Summary: NAS evidence for role in glial cell fate commitment.
Reason: While SOX2 is expressed in glial progenitors and can influence glial fate, this represents one specific outcome of SOX2's more general role in maintaining progenitor identity. The core function is maintaining undifferentiated stem/progenitor states; glial fate commitment is one context-dependent outcome. This is valid but peripheral to core function.
Supporting Evidence:
PMID:17291498
2006 Dec 20. Sox-2 is expressed by glial and progenitor cells and Pax-6 is expressed by neuroblasts in the human subventricular zone.
|
|
GO:0035019
somatic stem cell population maintenance
|
IMP
PMID:19736317 Interplay of Oct4 with Sox2 and Sox17: a molecular switch fr... |
ACCEPT |
Summary: IMP evidence showing SOX2 is required for maintaining somatic stem cell populations, particularly neural stem cells.
Reason: This annotation represents a core function of SOX2. Multiple independent IMP and IDA studies demonstrate SOX2's essential role in maintaining stem cell populations. This encompasses both embryonic and neural stem cell maintenance and represents a fundamental cellular function of SOX2.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
Critical for early embryogenesis and for embryonic stem cell pluripotency
PMID:19736317
Interplay of Oct4 with Sox2 and Sox17: a molecular switch from stem cell pluripotency to specifying a cardiac fate.
|
|
GO:0035019
somatic stem cell population maintenance
|
IDA
PMID:19409607 MicroRNA-145 regulates OCT4, SOX2, and KLF4 and represses pl... |
ACCEPT |
Summary: Independent IDA evidence for stem cell maintenance.
Reason: Multiple independent experimental studies support this core function.
Supporting Evidence:
PMID:19409607
Apr 30. MicroRNA-145 regulates OCT4, SOX2, and KLF4 and represses pluripotency in human embryonic stem cells.
|
|
GO:0035019
somatic stem cell population maintenance
|
IMP
PMID:18388306 Regulation of self-renewal and pluripotency by Sox2 in human... |
ACCEPT |
Summary: Additional IMP evidence from mutant phenotype analysis.
Reason: Mutant phenotype evidence provides strong support for essential role in stem cell maintenance.
Supporting Evidence:
PMID:18388306
Apr 3. Regulation of self-renewal and pluripotency by Sox2 in human embryonic stem cells.
|
|
GO:1902807
negative regulation of cell cycle G1/S phase transition
|
IDA
PMID:18268498 SOX2 is frequently downregulated in gastric cancers and inhi... |
ACCEPT |
Summary: IDA evidence for SOX2 role in cell cycle regulation.
Reason: This annotation is supported by direct experimental evidence and is consistent with SOX2's role in maintaining progenitor cells in a self-renewing state. Cell cycle regulation is an integral part of SOX2's function in controlling stem cell proliferation versus differentiation decisions.
Supporting Evidence:
PMID:18268498
Feb 12. SOX2 is frequently downregulated in gastric cancers and inhibits cell growth through cell-cycle arrest and apoptosis.
|
|
GO:0043410
positive regulation of MAPK cascade
|
IDA
PMID:18187129 Forced expression of Sox2 or Nanog in human bone marrow deri... |
KEEP AS NON CORE |
Summary: IDA evidence for SOX2 involvement in MAPK signaling.
Reason: While experimentally supported, MAPK cascade regulation represents a context-specific signaling outcome rather than a core molecular function. SOX2's primary function is as a transcription factor; MAPK regulation may be an indirect consequence or occur in specific cellular contexts (e.g., mesenchymal stem cells in this study). This is valid but peripheral to core transcriptional regulatory function.
Supporting Evidence:
PMID:18187129
Epub 2007 Dec 4. Forced expression of Sox2 or Nanog in human bone marrow derived mesenchymal stem cells maintains their expansion and differentiation capabilities.
|
|
GO:0048839
inner ear development
|
IBA
GO_REF:0000033 |
KEEP AS NON CORE |
Summary: SOX2 expression and function in inner ear development.
Reason: SOX2 plays roles in sensory organ development including inner ear, but this represents tissue-specific developmental pleiotropy rather than core function. The core function is maintaining progenitor populations; inner ear development is one of many developmental contexts where this occurs.
|
|
GO:0048839
inner ear development
|
IEP
PMID:18407919 Novel mutations in LHX3 are associated with hypopituitarism ... |
KEEP AS NON CORE |
Summary: Expression pattern evidence during inner ear development.
Reason: IEP evidence supports expression during inner ear development but does not demonstrate specific function. Consistent with SOX2's broader role in sensory progenitor maintenance.
Supporting Evidence:
PMID:18407919
Apr 10. Novel mutations in LHX3 are associated with hypopituitarism and sensorineural hearing loss.
|
|
GO:0001654
eye development
|
IEP
PMID:18285410 SOX2 plays a critical role in the pituitary, forebrain, and ... |
KEEP AS NON CORE |
Summary: Expression pattern during eye development.
Reason: SOX2 mutations cause anophthalmia/microphthalmia, demonstrating clear importance in eye development. However, this represents tissue-specific developmental outcome rather than core molecular function. Like inner ear development, eye development is one context where SOX2's progenitor-maintaining function is critical.
Supporting Evidence:
PMID:18285410
Epub 2008 Feb 19. SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
|
|
GO:0021983
pituitary gland development
|
IEP
PMID:18285410 SOX2 plays a critical role in the pituitary, forebrain, and ... |
KEEP AS NON CORE |
Summary: Expression pattern during pituitary development.
Reason: Another tissue-specific developmental context where SOX2 maintains progenitor populations. Valid annotation but represents developmental pleiotropy rather than core cellular function.
Supporting Evidence:
PMID:18285410
Epub 2008 Feb 19. SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
|
|
GO:0001714
endodermal cell fate specification
|
IDA
PMID:21245162 Pluripotency factors regulate definitive endoderm specificat... |
KEEP AS NON CORE |
Summary: IDA evidence for role in endoderm specification.
Reason: SOX2 with OCT4 regulates germ layer specification including endoderm fate. While experimentally supported, this represents one developmental context among many. SOX2's core function is maintaining pluripotency and regulating lineage choice; endoderm specification is one specific outcome.
Supporting Evidence:
PMID:21245162
Pluripotency factors regulate definitive endoderm specification through eomesodermin.
|
|
GO:0001649
osteoblast differentiation
|
IDA
PMID:18187129 Forced expression of Sox2 or Nanog in human bone marrow deri... |
KEEP AS NON CORE |
Summary: IDA evidence for SOX2 role in osteoblast differentiation when ectopically expressed in mesenchymal stem cells.
Reason: This represents an experimental manipulation (forced SOX2 expression in mesenchymal stem cells) rather than SOX2's endogenous core function. While experimentally valid, osteoblast differentiation is not a physiological context where SOX2 normally functions and represents experimental pleiotropy.
Supporting Evidence:
PMID:18187129
Epub 2007 Dec 4. Forced expression of Sox2 or Nanog in human bone marrow derived mesenchymal stem cells maintains their expansion and differentiation capabilities.
|
|
GO:0090090
negative regulation of canonical Wnt signaling pathway
|
IDA
PMID:18285410 SOX2 plays a critical role in the pituitary, forebrain, and ... |
KEEP AS NON CORE |
Summary: IDA evidence for SOX2 regulation of Wnt signaling.
Reason: While experimentally supported, Wnt pathway regulation represents cross-talk between SOX2's transcriptional program and developmental signaling rather than a core molecular function. This is a valid regulatory connection but peripheral to SOX2's primary role as a transcriptional regulator of pluripotency and neural fate.
Supporting Evidence:
PMID:18285410
Epub 2008 Feb 19. SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
|
|
GO:0070848
response to growth factor
|
IDA
PMID:18187129 Forced expression of Sox2 or Nanog in human bone marrow deri... |
KEEP AS NON CORE |
Summary: IDA evidence for SOX2 response to growth factor stimulation.
Reason: This represents an upstream regulatory input rather than SOX2's core function. Growth factors can regulate SOX2 expression or activity, but this does not define what SOX2 itself does. This is valid annotation but describes regulation of SOX2 rather than SOX2's function.
Supporting Evidence:
PMID:18187129
Epub 2007 Dec 4. Forced expression of Sox2 or Nanog in human bone marrow derived mesenchymal stem cells maintains their expansion and differentiation capabilities.
|
|
GO:0009611
response to wounding
|
IEP
PMID:17982423 Stem cell markers in the human posterior limbus and corneal ... |
KEEP AS NON CORE |
Summary: Expression pattern evidence in corneal stem cells after wounding.
Reason: IEP evidence showing SOX2 expression in corneal stem cells during wound healing. This represents SOX2's role in maintaining stem cell populations in a regenerative context, which is consistent with core function but wound response itself is a specific physiological context rather than core cellular function.
Supporting Evidence:
PMID:17982423
Stem cell markers in the human posterior limbus and corneal endothelium of unwounded and wounded corneas.
|
|
GO:0006325
chromatin organization
|
NAS
PMID:7849401 The cDNA sequence and chromosomal location of the human SOX2... |
ACCEPT |
Summary: NAS evidence for chromatin organization, likely based on SOX2's ability to bend DNA and engage nucleosomes.
Reason: This annotation accurately captures SOX2's pioneer factor function. Deep research shows SOX2 engages nucleosomal DNA, bends DNA, and maintains chromatin accessibility at key regulatory sites. This is integral to SOX2's mechanism of action as a pioneer transcription factor and represents a core molecular activity.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
SOX2 can bind nucleosomal DNA in a position- and sequence-dependent manner and maintain chromatin accessibility at key regulatory sites
file:genes/human/SOX2/SOX2-deep-research-falcon.md
thousands of ATAC-seq peaks are lost within ~1 hour of SOX2 degradation
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2 gene.
|
|
GO:0001228
DNA-binding transcription activator activity, RNA polymerase II-specific
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: SOX2 functions as a sequence-specific transcriptional activator binding to Sox motifs (WWCAAW consensus).
Reason: This annotation accurately represents SOX2's core molecular function. The HMG-box domain binds sequence-specifically to Sox motifs and activates transcription. This is supported by IBA phylogenetic inference and extensive experimental evidence.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
SOX2 contains a ~79-aa HMG box that binds the minor groove of cognate sequences (consensus WWCAAW; commonly the core TTGT)
|
|
GO:0001228
DNA-binding transcription activator activity, RNA polymerase II-specific
|
IEA
GO_REF:0000107 |
ACCEPT |
Summary: Electronic annotation based on Ensembl ortholog data.
Reason: Electronic annotation is consistent with experimental and phylogenetic evidence for transcriptional activator function.
|
|
GO:0000981
DNA-binding transcription factor activity, RNA polymerase II-specific
|
ISA
GO_REF:0000113 |
ACCEPT |
Summary: ISA evidence from TFClass database classification of SOX2 as a sequence-specific DNA-binding transcription factor.
Reason: This is a parent term of DNA-binding transcription activator activity and accurately describes SOX2's molecular function class. The ISA evidence from TFClass is appropriate for classifying SOX2 within the transcription factor family.
|
|
GO:0003700
DNA-binding transcription factor activity
|
IDA
PMID:18027866 Aberrant expression of SOX2 upregulates MUC5AC gastric foveo... |
ACCEPT |
Summary: IDA evidence for general transcription factor activity.
Reason: This is a high-level parent term accurately describing SOX2's molecular function. Multiple IDA studies support this classification.
Supporting Evidence:
PMID:18027866
Aberrant expression of SOX2 upregulates MUC5AC gastric foveolar mucin in mucinous cancers of the colorectum and related lesions.
|
|
GO:0003700
DNA-binding transcription factor activity
|
IDA
PMID:16153702 Core transcriptional regulatory circuitry in human embryonic... |
ACCEPT |
Summary: Additional IDA evidence for transcription factor activity.
Reason: Independent experimental support for transcription factor classification.
Supporting Evidence:
PMID:16153702
Core transcriptional regulatory circuitry in human embryonic stem cells.
|
|
GO:0003700
DNA-binding transcription factor activity
|
NAS
PMID:7849401 The cDNA sequence and chromosomal location of the human SOX2... |
ACCEPT |
Summary: NAS evidence from original SOX2 characterization.
Reason: Historical NAS annotation from the original SOX2 cloning and characterization is appropriate.
Supporting Evidence:
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2 gene.
|
|
GO:0000978
RNA polymerase II cis-regulatory region sequence-specific DNA binding
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: SOX2 binds to cis-regulatory enhancer regions through sequence-specific recognition of Sox motifs.
Reason: This annotation accurately captures SOX2's binding to enhancer elements. Deep research shows SOX2 binds thousands of enhancers and recognizes specific DNA sequences. This represents a core molecular function of the HMG-box domain.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG) enhancers in pluripotent cells
|
|
GO:0000976
transcription cis-regulatory region binding
|
IEA
GO_REF:0000107 |
ACCEPT |
Summary: Electronic annotation for cis-regulatory region binding.
Reason: This is a parent term of sequence-specific cis-regulatory binding and is supported by extensive evidence of SOX2 binding to enhancers.
|
|
GO:0000976
transcription cis-regulatory region binding
|
IDA
PMID:18407919 Novel mutations in LHX3 are associated with hypopituitarism ... |
ACCEPT |
Summary: IDA evidence for cis-regulatory binding.
Reason: Direct experimental evidence supports SOX2's binding to regulatory regions.
Supporting Evidence:
PMID:18407919
Apr 10. Novel mutations in LHX3 are associated with hypopituitarism and sensorineural hearing loss.
|
|
GO:0043565
sequence-specific DNA binding
|
IDA
PMID:20713518 Foxp1 coordinates cardiomyocyte proliferation through both c... |
ACCEPT |
Summary: IDA evidence for sequence-specific DNA binding.
Reason: This annotation accurately describes the fundamental DNA-binding property of the SOX2 HMG-box domain, which recognizes specific Sox motif sequences. This is a core molecular function.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
HMG box that binds the minor groove of cognate sequences (consensus WWCAAW)
PMID:20713518
Foxp1 coordinates cardiomyocyte proliferation through both cell-autonomous and nonautonomous mechanisms.
|
|
GO:0003677
DNA binding
|
IEA
GO_REF:0000043 |
ACCEPT |
Summary: Electronic annotation based on UniProtKB keyword mapping.
Reason: This high-level term is appropriate for describing SOX2's DNA-binding capacity. While very general, it is not incorrect and is supported by HMG-box domain annotation.
|
|
GO:0003677
DNA binding
|
IDA
PMID:19427902 Purification of human transcription factors Nanog and Sox2, ... |
ACCEPT |
Summary: IDA evidence for DNA binding from biochemical study.
Reason: Direct experimental evidence for DNA binding is appropriate.
Supporting Evidence:
PMID:19427902
2009 May 8. Purification of human transcription factors Nanog and Sox2, each in complex with Skp, an Escherichia coli periplasmic chaperone.
|
|
GO:0003677
DNA binding
|
IDA
PMID:18285410 SOX2 plays a critical role in the pituitary, forebrain, and ... |
ACCEPT |
Summary: Additional IDA evidence for DNA binding.
Reason: Multiple independent experimental confirmations of DNA-binding activity.
Supporting Evidence:
PMID:18285410
Epub 2008 Feb 19. SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
|
|
GO:0003677
DNA binding
|
IDA
PMID:18407919 Novel mutations in LHX3 are associated with hypopituitarism ... |
ACCEPT |
Summary: Further IDA evidence for DNA binding.
Reason: Consistent experimental support for DNA-binding function.
Supporting Evidence:
PMID:18407919
Apr 10. Novel mutations in LHX3 are associated with hypopituitarism and sensorineural hearing loss.
|
|
GO:0003677
DNA binding
|
IDA
PMID:16153702 Core transcriptional regulatory circuitry in human embryonic... |
ACCEPT |
Summary: IDA evidence from embryonic stem cell study.
Reason: DNA binding is well-established core molecular function.
Supporting Evidence:
PMID:16153702
Core transcriptional regulatory circuitry in human embryonic stem cells.
|
|
GO:0003677
DNA binding
|
NAS
PMID:7849401 The cDNA sequence and chromosomal location of the human SOX2... |
ACCEPT |
Summary: NAS evidence from original SOX2 paper.
Reason: Historical annotation appropriate for fundamental DNA-binding function.
Supporting Evidence:
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2 gene.
|
|
GO:0005515
protein binding
|
IPI
PMID:19736317 Interplay of Oct4 with Sox2 and Sox17: a molecular switch fr... |
REMOVE |
Summary: IPI evidence for protein-protein interaction (OCT4).
Reason: The term "protein binding" is uninformative and does not tell us about SOX2's actual function. While SOX2 does bind OCT4 and other proteins, a more specific term like GO:0046982 (protein heterodimerization activity) or annotation of the specific complex (OCT4-SOX2 complex) would be more informative. Per curation guidelines, we should avoid generic "protein binding" annotations in favor of more specific molecular function terms.
Supporting Evidence:
PMID:19736317
Interplay of Oct4 with Sox2 and Sox17: a molecular switch from stem cell pluripotency to specifying a cardiac fate.
|
|
GO:0005515
protein binding
|
IPI
PMID:23667531 The SOX2-interactome in brain cancer cells identifies the re... |
REMOVE |
Summary: IPI evidence for protein interaction (USP9X, MSI2).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:23667531
The SOX2-interactome in brain cancer cells identifies the requirement of MSI2 and USP9X for the growth of brain tumor cells.
|
|
GO:0005515
protein binding
|
IPI
PMID:24048479 Deterministic direct reprogramming of somatic cells to pluri... |
REMOVE |
Summary: IPI evidence for protein interaction (MBD3).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:24048479
Deterministic direct reprogramming of somatic cells to pluripotency.
|
|
GO:0005515
protein binding
|
IPI
PMID:24525231 The PRKCI and SOX2 oncogenes are coamplified and cooperate t... |
REMOVE |
Summary: IPI evidence for protein interaction (PRKCI).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:24525231
The PRKCI and SOX2 oncogenes are coamplified and cooperate to activate Hedgehog signaling in lung squamous cell carcinoma.
|
|
GO:0005515
protein binding
|
IPI
PMID:25416956 A proteome-scale map of the human interactome network. |
REMOVE |
Summary: IPI evidence from interactome mapping (SUMO1).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:25416956
A proteome-scale map of the human interactome network.
|
|
GO:0005515
protein binding
|
IPI
PMID:25702638 The L1TD1 protein interactome reveals the importance of post... |
REMOVE |
Summary: IPI evidence for protein interaction (L1TD1, OCT4).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:25702638
Epub 2015 Feb 19. The L1TD1 protein interactome reveals the importance of post-transcriptional regulation in human pluripotency.
|
|
GO:0005515
protein binding
|
IPI
PMID:27334688 Oncogenic Sox2 regulates and cooperates with VRK1 in cell cy... |
REMOVE |
Summary: IPI evidence for protein interaction (VRK1).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:27334688
Oncogenic Sox2 regulates and cooperates with VRK1 in cell cycle progression and differentiation.
|
|
GO:0005515
protein binding
|
IPI
PMID:31515488 Extensive disruption of protein interactions by genetic vari... |
REMOVE |
Summary: IPI evidence for protein interaction (SUMO1).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:31515488
Extensive disruption of protein interactions by genetic variants across the allele frequency spectrum in human populations.
|
|
GO:0005515
protein binding
|
IPI
PMID:29358331 LSD1 demethylase and the methyl-binding protein PHF20L1 prev... |
REMOVE |
Summary: IPI evidence for protein interaction (PHF20L1).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:29358331
2018 Jan 22. LSD1 demethylase and the methyl-binding protein PHF20L1 prevent SET7 methyltransferase-dependent proteolysis of the stem-cell protein SOX2.
|
|
GO:0005515
protein binding
|
IPI
PMID:30442713 Proteolysis of methylated SOX2 protein is regulated by L3MBT... |
REMOVE |
Summary: IPI evidence for protein interaction (L3MBTL3).
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:30442713
2018 Nov 15. Proteolysis of methylated SOX2 protein is regulated by L3MBTL3 and CRL4(DCAF5) ubiquitin ligase.
|
|
GO:0005515
protein binding
|
IPI
PMID:18456656 The molecular mechanism governing the oncogenic potential of... |
REMOVE |
Summary: IPI evidence for protein interaction.
Reason: Generic protein binding term is uninformative per curation guidelines.
Supporting Evidence:
PMID:18456656
2008 May 2. The molecular mechanism governing the oncogenic potential of SOX2 in breast cancer.
|
|
GO:0035198
miRNA binding
|
IDA
PMID:18710938 Oct4/Sox2-regulated miR-302 targets cyclin D1 in human embry... |
KEEP AS NON CORE |
Summary: IDA evidence for SOX2 binding to miR-302.
Reason: While experimentally validated, miRNA binding represents a non-canonical activity that is not part of SOX2's core transcription factor function. The primary function of SOX2 is sequence-specific DNA binding and transcriptional regulation. miRNA interaction may represent regulatory crosstalk but is peripheral to the core molecular function.
Supporting Evidence:
PMID:18710938
Aug 18. Oct4/Sox2-regulated miR-302 targets cyclin D1 in human embryonic stem cells.
|
|
GO:0005634
nucleus
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: SOX2 is a nuclear transcription factor with NLS sequences directing nuclear localization.
Reason: Nuclear localization is essential for SOX2 function as a transcription factor. This annotation accurately represents where SOX2 performs its core function. UniProt describes nuclear localization regulated by NLS/NES sequences, and deep research confirms SOX2 is "predominantly nuclear."
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
Subcellular localization: SOX2 is nuclear; its HMG region and flanking sequences include NLS/NES
|
|
GO:0005634
nucleus
|
IEA
GO_REF:0000120 |
ACCEPT |
Summary: Electronic annotation based on sequence analysis and ortholog data.
Reason: Electronic annotation is consistent with experimental evidence for nuclear localization.
|
|
GO:0005634
nucleus
|
IDA
PMID:17291498 Sox-2 is expressed by glial and progenitor cells and Pax-6 i... |
ACCEPT |
Summary: Direct experimental evidence for nuclear localization.
Reason: Multiple IDA studies confirm nuclear localization through microscopy or fractionation.
Supporting Evidence:
PMID:17291498
2006 Dec 20. Sox-2 is expressed by glial and progenitor cells and Pax-6 is expressed by neuroblasts in the human subventricular zone.
|
|
GO:0005634
nucleus
|
IDA
PMID:18027866 Aberrant expression of SOX2 upregulates MUC5AC gastric foveo... |
ACCEPT |
Summary: Additional IDA evidence for nuclear localization.
Reason: Consistent experimental evidence across multiple studies.
Supporting Evidence:
PMID:18027866
Aberrant expression of SOX2 upregulates MUC5AC gastric foveolar mucin in mucinous cancers of the colorectum and related lesions.
|
|
GO:0005634
nucleus
|
IDA
PMID:18285410 SOX2 plays a critical role in the pituitary, forebrain, and ... |
ACCEPT |
Summary: Further IDA evidence for nuclear localization.
Reason: Well-established nuclear localization.
Supporting Evidence:
PMID:18285410
Epub 2008 Feb 19. SOX2 plays a critical role in the pituitary, forebrain, and eye during human embryonic development.
|
|
GO:0005634
nucleus
|
IC
PMID:16153702 Core transcriptional regulatory circuitry in human embryonic... |
ACCEPT |
Summary: Inferred from curator statement based on DNA binding activity.
Reason: IC inference is appropriate - DNA-binding transcription factors must be nuclear to function.
Supporting Evidence:
PMID:16153702
Core transcriptional regulatory circuitry in human embryonic stem cells.
|
|
GO:0005634
nucleus
|
NAS
PMID:7849401 The cDNA sequence and chromosomal location of the human SOX2... |
ACCEPT |
Summary: NAS evidence from original SOX2 paper.
Reason: Historical annotation appropriate for nuclear localization.
Supporting Evidence:
PMID:7849401
The cDNA sequence and chromosomal location of the human SOX2 gene.
|
|
GO:0005654
nucleoplasm
|
IDA
GO_REF:0000052 |
ACCEPT |
Summary: IDA evidence based on immunofluorescence localization.
Reason: Nucleoplasm is a more specific cellular component than nucleus, indicating SOX2 is soluble within the nuclear compartment rather than associated with nuclear envelope or other nuclear structures. This is appropriate for a transcription factor.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-1112609 |
ACCEPT |
Summary: TAS evidence from Reactome pathway annotation (multiple Reactome entries).
Reason: Multiple Reactome pathway annotations place SOX2 in nucleoplasm, consistent with its function in transcriptional regulation. These TAS annotations are appropriate.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972948 |
ACCEPT |
Summary: TAS from Reactome (OCT4, SOX2, NANOG bind GSC promoter).
Reason: Consistent with nucleoplasmic localization during transcriptional activity.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972949 |
ACCEPT |
Summary: TAS from Reactome (OCT4, SOX2, NANOG bind EOMES promoter).
Reason: Multiple Reactome TAS annotations are consistent with nucleoplasm localization.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972956 |
ACCEPT |
Summary: TAS from Reactome (STAT3 promoter binding).
Reason: Consistent nucleoplasm localization across Reactome pathways.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972960 |
ACCEPT |
Summary: TAS from Reactome (FGF2 promoter binding).
Reason: Multiple consistent Reactome annotations support nucleoplasm localization.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972962 |
ACCEPT |
Summary: TAS from Reactome (TDGF1 promoter binding).
Reason: Consistent Reactome pathway annotations.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972964 |
ACCEPT |
Summary: TAS from Reactome (FOXD3 promoter binding).
Reason: Consistent with nucleoplasm localization.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972965 |
ACCEPT |
Summary: TAS from Reactome (TSC22D1 promoter binding).
Reason: Multiple Reactome TAS annotations are appropriate.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972966 |
ACCEPT |
Summary: TAS from Reactome (HHEX promoter binding).
Reason: Consistent Reactome annotations.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972967 |
ACCEPT |
Summary: TAS from Reactome (EPHA1 promoter binding).
Reason: Appropriate TAS annotation from Reactome.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972975 |
ACCEPT |
Summary: TAS from Reactome (SALL1 promoter binding).
Reason: Consistent with nucleoplasm localization during transcriptional regulation.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972977 |
ACCEPT |
Summary: TAS from Reactome (DKK1 promoter binding).
Reason: Multiple consistent Reactome pathway annotations.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972978 |
ACCEPT |
Summary: TAS from Reactome (ZIC3 promoter binding).
Reason: Appropriate Reactome TAS annotation.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-2972979 |
ACCEPT |
Summary: TAS from Reactome (CDX2 promoter).
Reason: Consistent with nucleoplasm localization.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-452894 |
ACCEPT |
Summary: TAS from Reactome (Expression of SOX2).
Reason: Appropriate Reactome annotation.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-480204 |
ACCEPT |
Summary: TAS from Reactome (NANOG promoter binding).
Reason: Consistent nucleoplasm localization.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-480685 |
ACCEPT |
Summary: TAS from Reactome (SOX2 promoter autoregulation).
Reason: Appropriate Reactome TAS annotation.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-5626938 |
ACCEPT |
Summary: TAS from Reactome (Beta-catenin binds SOX proteins).
Reason: Consistent with nucleoplasm localization.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-6790036 |
ACCEPT |
Summary: TAS from Reactome (STAT3-upregulated nuclear proteins).
Reason: Appropriate Reactome annotation.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-6800120 |
ACCEPT |
Summary: TAS from Reactome (DPPA4 gene binding).
Reason: Consistent nucleoplasm localization.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-9732125 |
ACCEPT |
Summary: TAS from Reactome (MITF promoter binding).
Reason: Appropriate Reactome TAS annotation.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-9833066 |
ACCEPT |
Summary: TAS from Reactome (ZEB2 gene binding).
Reason: Consistent with nucleoplasm localization.
|
|
GO:0005654
nucleoplasm
|
TAS
Reactome:R-HSA-9926523 |
ACCEPT |
Summary: TAS from Reactome (MITF-dependent SOX2 expression).
Reason: Final Reactome TAS annotation, consistent with others.
|
|
GO:0016607
nuclear speck
|
IEA
GO_REF:0000044 |
ACCEPT |
Summary: Electronic annotation suggesting SOX2 localizes to nuclear speckles.
Reason: Nuclear speckles are subnuclear domains enriched in splicing factors and certain transcription factors. UniProt states SOX2 colocalizes in speckles with SOX6. While this is a more specialized localization than general nucleoplasm, it is consistent with SOX2's dynamic localization in active transcriptional regions. This annotation is appropriate.
Supporting Evidence:
UniProtKB:P48431
Colocalizes with SOX6 in speckles
|
|
GO:0000785
chromatin
|
ISA
GO_REF:0000113 |
ACCEPT |
Summary: ISA evidence from TFClass database indicating SOX2 associates with chromatin.
Reason: This annotation is highly appropriate for SOX2 as a pioneer factor. Deep research shows SOX2 engages nucleosomal DNA and maintains chromatin accessibility. Chromatin localization is essential for SOX2's function as a pioneer transcription factor that can bind DNA in chromatin context.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
As a pioneer factor, SOX2 can bind nucleosomal DNA in a position- and sequence-dependent manner
|
|
GO:0005667
transcription regulator complex
|
TAS
PMID:19736317 Interplay of Oct4 with Sox2 and Sox17: a molecular switch fr... |
ACCEPT |
Summary: TAS evidence for SOX2 participation in transcriptional regulatory complexes, particularly with OCT4.
Reason: SOX2 forms complexes with OCT4 and other transcription factors to regulate gene expression. This is well-documented and represents an important aspect of SOX2 function - it does not act alone but cooperates with partner factors in multi-protein complexes.
Supporting Evidence:
file:genes/human/SOX2/SOX2-deep-research-falcon.md
Structural biophysics and NMR show that SOX2's HMG domain binds nucleosomal DNA in a site- and sequence-dependent manner and forms synergistic complexes with full-length OCT4
PMID:19736317
Interplay of Oct4 with Sox2 and Sox17: a molecular switch from stem cell pluripotency to specifying a cardiac fate.
|
|
GO:0005737
cytoplasm
|
IEA
GO_REF:0000044 |
ACCEPT |
Summary: Electronic annotation based on UniProtKB subcellular location vocabulary.
Reason: While SOX2 is predominantly nuclear, UniProt documents cytoplasmic localization that is regulated by post-translational modifications. Acetylation at K75 promotes nuclear export leading to cytoplasmic localization and proteasomal degradation. This represents a regulatory mechanism for controlling SOX2 levels and activity.
Supporting Evidence:
UniProtKB:P48431
Acetylation contributes to its nuclear localization and deacetylation by HDAC3 induces a cytoplasmic delocalization
file:genes/human/SOX2/SOX2-deep-research-falcon.md
acetylation at K75 within the SOX2 NES, mediated by CBP/p300 and ACSS3/ACSS2, promotes nuclear export
|
|
GO:0005737
cytoplasm
|
IDA
PMID:18027866 Aberrant expression of SOX2 upregulates MUC5AC gastric foveo... |
ACCEPT |
Summary: IDA evidence showing cytoplasmic localization of SOX2.
Reason: Direct experimental observation of cytoplasmic SOX2 is consistent with regulated nuclear-cytoplasmic shuttling. While not the primary site of function, cytoplasmic localization is physiologically relevant for regulation of SOX2 activity.
Supporting Evidence:
PMID:18027866
Aberrant expression of SOX2 upregulates MUC5AC gastric foveolar mucin in mucinous cancers of the colorectum and related lesions.
|
|
GO:0005829
cytosol
|
IDA
PMID:18000303 Evidence for the presence of stem cell-like progenitor cells... |
ACCEPT |
Summary: IDA evidence for cytosolic localization.
Reason: Cytosol is the aqueous component of cytoplasm. This annotation is consistent with cytoplasmic localization and represents SOX2 in the soluble cytoplasmic fraction, likely during nuclear-cytoplasmic transport or regulated export.
Supporting Evidence:
PMID:18000303
Evidence for the presence of stem cell-like progenitor cells in human adult pancreas.
|
provider: falcon
model: Edison Scientific Literature
cached: false
start_time: '2026-01-11T23:22:02.393412'
end_time: '2026-01-11T23:28:14.006202'
duration_seconds: 371.61
template_file: templates/gene_research_go_focused.md
template_variables:
organism: human
gene_id: SOX2
gene_symbol: SOX2
uniprot_accession: P48431
protein_description: 'RecName: Full=Transcription factor SOX-2;'
gene_info: Name=SOX2;
organism_full: Homo sapiens (Human).
protein_family: Not specified in UniProt
protein_domains: HMG_box_dom. (IPR009071); HMG_box_dom_sf. (IPR036910); SOX_fam.
(IPR022097); SRY-related_HMG-box_TF-like. (IPR050140); HMG_box (PF00505)
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
temperature: 0.1
citation_count: 46
BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.
DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'SOX2' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information
Please provide a comprehensive research report on the gene SOX2 (gene ID: SOX2, UniProt: P48431) in human.
The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.
You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.
We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.
We are interested in where in or outside the cell the gene product carries out its function.
We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.
Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.
Question: You are an expert researcher providing comprehensive, well-cited information.
Provide detailed information focusing on:
1. Key concepts and definitions with current understanding
2. Recent developments and latest research (prioritize 2023-2024 sources)
3. Current applications and real-world implementations
4. Expert opinions and analysis from authoritative sources
5. Relevant statistics and data from recent studies
Format as a comprehensive research report with proper citations. Include URLs and publication dates where available.
Always prioritize recent, authoritative sources and provide specific citations for all major claims.
BEFORE YOU BEGIN RESEARCH: You MUST verify you are researching the CORRECT gene/protein. Gene symbols can be ambiguous, especially for less well-characterized genes from non-model organisms.
DO NOT PROCEED WITH RESEARCH ON A DIFFERENT GENE. Instead:
- State clearly: "The gene symbol 'SOX2' is ambiguous or literature is limited for this specific protein"
- Explain what you found (e.g., "Found extensive literature on a different gene with the same symbol in a different organism")
- Describe the protein based ONLY on the UniProt information provided above
- Suggest that the protein function can be inferred from domain/family information
Please provide a comprehensive research report on the gene SOX2 (gene ID: SOX2, UniProt: P48431) in human.
The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.
You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.
We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.
We are interested in where in or outside the cell the gene product carries out its function.
We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.
Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.
Research report: Human SOX2 (UniProt P48431)
Plan summary and verification
- Identity/domain verification: SOX2 is a human SRY-related HMG-box transcription factor with a conserved HMG DNA-binding domain and nuclear localization/export signals, consistent with UniProt P48431 and recent literature describing its HMG box, NLS/NES, and cooperative binding with OCT4 on composite Sox/Oct motifs (URLs: https://doi.org/10.7554/elife.83157; domain overview and motifs) (langroudi2023evaluatingtherole pages 39-42).
1) Key concepts and definitions with current understanding
- Molecular identity and domains: SOX2 contains a ~79-aa HMG box that binds the minor groove of cognate sequences (consensus WWCAAW; commonly the core TTGT) and bends DNA. It also carries nuclear localization and export signals overlapping regulatory regions that control nucleocytoplasmic shuttling (marelli2024sumodependenttranscriptionalrepression pages 1-2, langroudi2023evaluatingtherole pages 39-42). The HMG box mediates base- and shape-reading of DNA and enables pioneer-like chromatin engagement (ozden2023molecularmechanismof pages 1-2).
- DNA recognition and OCT4 cooperativity: Structural biophysics and NMR show that SOX2’s HMG domain binds nucleosomal DNA in a site- and sequence-dependent manner and forms synergistic complexes with full-length OCT4 on adjacent Sox/Oct motifs, with cooperativity enhanced at nucleosome edges (URL: https://doi.org/10.1016/j.jmb.2022.167916; published Jan 30, 2023) (gadea2023structuralplasticityof pages 1-3). Mechanistically, SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG) enhancers in pluripotent cells (URL: https://doi.org/10.1016/j.stemcr.2024.03.003; published May 14, 2024) (tsaytler2024earlyneuralspecification pages 1-2).
- Pioneer factor function: Acute degron-based depletion demonstrates that SOX2 maintains chromatin accessibility at a subset of its sites; thousands of ATAC-seq peaks are lost within ~1 hour of SOX2 degradation, and these SOX2-maintained sites are highly predictive of nearby nascent transcription (URL: https://doi.org/10.15252/embj.2022113150; published Sep 2023) (maresca2023pioneeractivitydistinguishes pages 1-2). Reviews and modeling converge that SOX2, with OCT4 and KLF4, exemplifies pioneer factors that can engage nucleosomal DNA and facilitate chromatin opening (URL: https://doi.org/10.1007/s12551-024-01205-6; published Jul 4, 2024) (orsetti2024structuraldynamicsin pages 1-3).
- Subcellular localization: SOX2 is nuclear; its HMG region and flanking sequences include NLS/NES, and post-translational modifications (PTMs) modulate nuclear export and stability (langroudi2023evaluatingtherole pages 39-42, seo2024acomprehensiveunderstanding pages 1-2).
2) Recent developments (2023–2024) and latest research
- Nucleosome engagement and positional specificity: Integrating NMR, biochemistry, and molecular modeling, SOX2 binds solvent-exposed nucleosomal sites with position dependence, forming super-stable complexes at SHL+5 and preferring SHL2 shapes; suboptimal sites promote partially unfolded, dynamic SOX2 states. OCT4 stabilizes SOX2 bending and engagement on specific adjacent motifs (2023; J Mol Biol) (gadea2023structuralplasticityof pages 1-3). Complementary modeling/biophysics show concurrent base-specific contacts (Arg51, Asn54, Tyr118) and shape-reading are required for sequence-specific nucleosomal recognition, especially at SHL2 (2023; JCIM) (ozden2023molecularmechanismof pages 1-2).
- Quantitative pioneer activity in cells: Acute SOX2 depletion produces rapid loss of thousands of accessible regions within ~1 hour; sites maintained by SOX2 most strongly predict transcriptional output, functionally validated at the Klf2 regulatory element by CRISPR (2023; EMBO J) (maresca2023pioneeractivitydistinguishes pages 1-2).
- Early neural specification logic: A 2024 Stem Cell Reports study identifies 1,898 SOX2-dependent neural-associated enhancers among 8,531 OSN-bound enhancers. Using dTAG-mediated rapid SOX2 ablation (SOX2 reduced to ~1.5%, 1.2%, 0.3% at 2/12/24 h), authors show these enhancers transiently activate immediately upon neural induction to prime Pax6 and other neural fate genes, but close rapidly under mesodermal cues (URL: https://doi.org/10.1016/j.stemcr.2024.03.003; May 14, 2024) (tsaytler2024earlyneuralspecification pages 1-2).
- Developmental/cancer enhancer control of SOX2: A 2023 Nucleic Acids Research paper maps a distal enhancer cluster, SRR124–134 (~124–134 kb downstream), required for SOX2 overexpression in breast and lung adenocarcinoma; deleting the cluster reduces SOX2, reprograms chromatin accessibility and transcriptomes, and correlates with patient tumor accessibility. The conserved SRR124/134 elements are essential for mouse development; homozygous deletion causes lethal failure of tracheoesophageal separation (URL: https://doi.org/10.1093/nar/gkad734; Sep 2023) (abatti2023epigeneticreprogrammingof pages 1-2).
- PTM regulation in cancer: In colorectal cancer models, acetylation at K75 within the SOX2 NES, mediated by CBP/p300 and ACSS3/ACSS2, promotes nuclear export and proteasomal degradation, while HDAC4 (regulated by miR-29a) deacetylates K75; O-GlcNAcylation at S246 further stabilizes/regulates SOX2 (URL: https://doi.org/10.3390/cancers16051035; Mar 3, 2024) (seo2024acomprehensiveunderstanding pages 1-2). In neural stem cells, SUMOylation of SOX2 is required for transcriptional repression of centrosome/centromere and cell-cycle genes and for proliferative restraint (URL: https://doi.org/10.1371/journal.pone.0298818; Mar 2024) (marelli2024sumodependenttranscriptionalrepression pages 22-23, marelli2024sumodependenttranscriptionalrepression pages 1-2).
- Pluripotency network context: Recent reviews reinforce that OCT4/SOX2/NANOG are core master regulators of pluripotency and self-renewal, operating with signaling (LIF/BMP, TGF-β family), chromatin remodeling, and epigenetic modulators (URL: https://doi.org/10.3390/ijms24098386; May 7, 2023) (varzideh2023molecularmechanismsunderlying pages 1-3).
3) Current applications and real-world implementations
- iPSC reprogramming: SOX2 is one of the original Yamanaka factors used for reprogramming human somatic cells to iPSCs and remains a standard component of protocols across cell sources, leveraged for disease modeling and regenerative medicine (varzideh2023molecularmechanismsunderlying pages 1-3). SOX2’s pioneer activity and collaboration with OCT4/KLF4 underlie enhancer priming and chromatin opening during reprogramming (maresca2023pioneeractivitydistinguishes pages 1-2, orsetti2024structuraldynamicsin pages 1-3).
- Enhancer targeting and biomarker translation: In breast and lung adenocarcinoma, SRR124–134 accessibility correlates with SOX2 overexpression in primary tumors, suggesting enhancer-state biomarkers and potential CRISPR-interrogated regulatory dependencies (abatti2023epigeneticreprogrammingof pages 1-2). In SCLC, clinical-pathologic frameworks increasingly consider lineage transcription factors and genomic amplifications (including 3q arms harboring SOX2) as part of emerging molecular stratification schemas and trial design (URL: https://doi.org/10.1038/s41571-024-00914-x; Jul 2024) (sen2024emergingadvancesin pages 1-3).
4) Expert opinions and analysis from authoritative sources
- Pioneer factors and chromatin: A 2024 Biophysical Reviews synthesis argues that multiple structures of TF–nucleosome complexes now clarify binding positions and the open question of whether PFs directly open nucleosomes versus recruiting remodelers. The review highlights SOX2/OCT4/KLF4 as canonical pioneers central to cell-fate transitions and recommends integrating NMR with molecular dynamics to resolve dynamic opening events (orsetti2024structuraldynamicsin pages 1-3). The EMBO Journal study provides in vivo functional criteria for SOX2 “pioneer-maintained” sites that are most predictive of gene activation, sharpening how to distinguish activating from non-activating binding (maresca2023pioneeractivitydistinguishes pages 1-2). Together, these works suggest that SOX2’s context-dependent ability to maintain accessibility is the best predictor of its regulatory impact.
5) Relevant statistics and data from recent studies (with URLs/dates)
- Acute loss-of-function kinetics: SOX2 protein eliminated within ~2 h using dTAG; ATAC-seq revealed 784 and 3,001 DARs at 12 h and 24 h, respectively, and thousands of SOX2-maintained accessible sites lost within ~1 h, with strong coupling to nascent transcription (EMBO J 2023; https://doi.org/10.15252/embj.2022113150; Sep 2023; Stem Cell Reports 2024; https://doi.org/10.1016/j.stemcr.2024.03.003; May 14, 2024) (maresca2023pioneeractivitydistinguishes pages 1-2, tsaytler2024earlyneuralspecification pages 1-2).
- Neural enhancers: 1,898 SOX2-dependent neural-associated enhancers among 8,531 OSN-bound enhancers; SOX2 protein reduced to ~1.5%, 1.2%, 0.3% at 2/12/24 h in KO, with neural induction rapidly activating these enhancers and up-regulating Pax6 (Stem Cell Reports 2024; https://doi.org/10.1016/j.stemcr.2024.03.003) (tsaytler2024earlyneuralspecification pages 1-2).
- Structural determinants of nucleosome engagement: At SHL2 and SHL+5, SOX2 achieves sequence- and shape-specific nucleosome recognition; key HMG residues (Arg51, Asn54, Tyr118) mediate base-specific recognition, and solvation/exposure (solvent-facing DNA) favors binding (JCIM 2023; https://doi.org/10.1021/acs.jcim.2c01520; Jun 12, 2023; J Mol Biol 2023; https://doi.org/10.1016/j.jmb.2022.167916; Jan 30, 2023) (ozden2023molecularmechanismof pages 1-2, gadea2023structuralplasticityof pages 1-3).
- Developmental enhancer control of SOX2: Deletion of SRR124–134 reduces SOX2, changes chromatin accessibility genome-wide, and in mice SRR124/134 deletion is lethal due to tracheoesophageal separation failure (NAR 2023; https://doi.org/10.1093/nar/gkad734) (abatti2023epigeneticreprogrammingof pages 1-2).
- PTMs controlling SOX2 behavior: In CRC cells, K75 acetylation (CBP/p300; ACSS3/ACSS2) promotes nuclear export and proteasomal degradation; HDAC4 deacetylates K75 under miR-29a control; O-GlcNAc at S246 regulates SOX2 stability/activity (Cancers 2024; https://doi.org/10.3390/cancers16051035; Mar 3, 2024). SUMOylation is required for repression of cell-cycle/centrosomal genes in human NSCs and for SOX2-mediated restraint of NSC proliferation (PLOS ONE 2024; https://doi.org/10.1371/journal.pone.0298818; Mar 2024) (seo2024acomprehensiveunderstanding pages 1-2, marelli2024sumodependenttranscriptionalrepression pages 22-23, marelli2024sumodependenttranscriptionalrepression pages 1-2).
- Cancer lineage context: Molecular reviews of SCLC note recurrent copy-number alterations including 3q arm (harboring SOX2), underpinning lineage programs and trial development (Nat Rev Clin Oncol 2024; https://doi.org/10.1038/s41571-024-00914-x) (sen2024emergingadvancesin pages 1-3). NAR 2023 further contextualizes SOX2 oncogenic roles across BRCA, GBM, LUAD, and LUSC and shows enhancer misactivation as a likely mechanism of SOX2 overexpression (abatti2023epigeneticreprogrammingof pages 1-2).
Functional annotation summary
- Primary function: SOX2 is a nuclear sequence-specific transcription factor of the SOXB1 family that binds the minor groove via an HMG domain, bends DNA, and partners with OCT4 on composite Sox/Oct motifs to regulate enhancers. As a pioneer factor, SOX2 can bind nucleosomal DNA in a position- and sequence-dependent manner and maintain chromatin accessibility at key regulatory sites that predict transcriptional output (gadea2023structuralplasticityof pages 1-3, ozden2023molecularmechanismof pages 1-2, maresca2023pioneeractivitydistinguishes pages 1-2).
- Cellular localization: Predominantly nuclear; NLS and NES in/around the HMG region and flanks; PTMs (e.g., K75 acetylation) regulate export and stability (langroudi2023evaluatingtherole pages 39-42, seo2024acomprehensiveunderstanding pages 1-2).
- Pathways/biological processes: SOX2 is central to the pluripotency GRN with OCT4/NANOG and primes early neural fate through a dedicated cohort of SOX2-dependent enhancers (varzideh2023molecularmechanismsunderlying pages 1-3, tsaytler2024earlyneuralspecification pages 1-2). In the nervous system, SUMO-dependent repression contributes to NSC proliferative control (marelli2024sumodependenttranscriptionalrepression pages 22-23, marelli2024sumodependenttranscriptionalrepression pages 1-2).
- Disease genetics and mechanisms: SOX2 haploinsufficiency is classically linked to anophthalmia/microphthalmia spectrum and broader neurodevelopmental phenotypes (references therein) (marelli2024sumodependenttranscriptionalrepression pages 1-2). In cancer, SOX2 is frequently overexpressed or amplified (3q) in squamous lineage tumors and contributes to stemness, plasticity, and therapy resistance; enhancer reprogramming of SRR124–134 underlies overexpression in breast/lung adenocarcinoma (sen2024emergingadvancesin pages 1-3, abatti2023epigeneticreprogrammingof pages 1-2). PTMs modulate SOX2 stability and localization in CRC, offering mechanistic links to tumor progression (seo2024acomprehensiveunderstanding pages 1-2).
- Applications: SOX2 is a core reprogramming factor for iPSC generation, enabling applications in disease modeling and regenerative medicine; its pioneer function is directly leveraged for chromatin remodeling in cell-fate engineering (varzideh2023molecularmechanismsunderlying pages 1-3, maresca2023pioneeractivitydistinguishes pages 1-2, orsetti2024structuraldynamicsin pages 1-3).
Notes on gene/protein identification check
- The gene symbol SOX2, organism Homo sapiens, and domain family (HMG box; SOX family) match UniProt P48431 and are consistent with all cited literature. No conflicting gene symbol usage was found.
References (with URLs/dates inline) with citation mapping
- Marelli et al., PLOS ONE, Mar 2024, https://doi.org/10.1371/journal.pone.0298818 (marelli2024sumodependenttranscriptionalrepression pages 22-23, marelli2024sumodependenttranscriptionalrepression pages 1-2).
- Langroudi (thesis/compendium), 2023; domain/NLS–NES overview (langroudi2023evaluatingtherole pages 39-42).
- Maresca et al., EMBO J, Sep 2023, https://doi.org/10.15252/embj.2022113150 (maresca2023pioneeractivitydistinguishes pages 1-2).
- Hu et al., Nucleic Acids Res, Aug 2023, https://doi.org/10.1093/nar/gkad597 (mechanistic context for Sox–Oct condensates and reprogramming; complements SOX2) (hu2023evaluationofthe pages 18-19).
- Tsaytler et al., Stem Cell Reports, May 14, 2024, https://doi.org/10.1016/j.stemcr.2024.03.003 (tsaytler2024earlyneuralspecification pages 1-2).
- Sen et al., Nat Rev Clin Oncol, Jul 2024, https://doi.org/10.1038/s41571-024-00914-x (sen2024emergingadvancesin pages 1-3).
- Seo et al., Cancers, Mar 3, 2024, https://doi.org/10.3390/cancers16051035 (seo2024acomprehensiveunderstanding pages 1-2).
- Abatti et al., Nucleic Acids Res, Sep 2023, https://doi.org/10.1093/nar/gkad734 (abatti2023epigeneticreprogrammingof pages 1-2).
- Ozden et al., J Chem Inf Model, Jun 12, 2023, https://doi.org/10.1021/acs.jcim.2c01520 (ozden2023molecularmechanismof pages 1-2).
- Gadea & Nikolova, J Mol Biol, Jan 30, 2023, https://doi.org/10.1016/j.jmb.2022.167916 (gadea2023structuralplasticityof pages 1-3).
- Varzideh et al., Int J Mol Sci, May 7, 2023, https://doi.org/10.3390/ijms24098386 (varzideh2023molecularmechanismsunderlying pages 1-3).
- Orsetti et al., Biophysical Reviews, Jul 4, 2024, https://doi.org/10.1007/s12551-024-01205-6 (orsetti2024structuraldynamicsin pages 1-3).
References
(langroudi2023evaluatingtherole pages 39-42): L Langroudi. Evaluating the role of myelin regulatory factor in mouse development. Unknown journal, 2023.
(marelli2024sumodependenttranscriptionalrepression pages 1-2): Elisa Marelli, Jaime Hughes, and Paul J. Scotting. Sumo-dependent transcriptional repression by sox2 inhibits the proliferation of neural stem cells. PLOS ONE, 19:e0298818, Mar 2024. URL: https://doi.org/10.1371/journal.pone.0298818, doi:10.1371/journal.pone.0298818. This article has 4 citations and is from a peer-reviewed journal.
(ozden2023molecularmechanismof pages 1-2): Burcu Ozden, Ramachandran Boopathi, Ayşe Berçin Barlas, Imtiaz N. Lone, Jan Bednar, Carlo Petosa, Seyit Kale, Ali Hamiche, Dimitar Angelov, Stefan Dimitrov, and Ezgi Karaca. Molecular mechanism of nucleosome recognition by the pioneer transcription factor sox. Journal of Chemical Information and Modeling, 63:3839-3853, Jun 2023. URL: https://doi.org/10.1021/acs.jcim.2c01520, doi:10.1021/acs.jcim.2c01520. This article has 11 citations and is from a peer-reviewed journal.
(gadea2023structuralplasticityof pages 1-3): Fabiana C. Malaga Gadea and Evgenia N. Nikolova. Structural plasticity of pioneer factor sox2 and dna bendability modulate nucleosome engagement and sox2-oct4 synergism. Journal of Molecular Biology, 435:167916, Jan 2023. URL: https://doi.org/10.1016/j.jmb.2022.167916, doi:10.1016/j.jmb.2022.167916. This article has 20 citations and is from a domain leading peer-reviewed journal.
(tsaytler2024earlyneuralspecification pages 1-2): Pavel Tsaytler, Gaby Blaess, Manuela Scholze-Wittler, Frederic Koch, and Bernhard G. Herrmann. Early neural specification of stem cells is mediated by a set of sox2-dependent neural-associated enhancers. Stem Cell Reports, 19:618-628, May 2024. URL: https://doi.org/10.1016/j.stemcr.2024.03.003, doi:10.1016/j.stemcr.2024.03.003. This article has 4 citations and is from a domain leading peer-reviewed journal.
(maresca2023pioneeractivitydistinguishes pages 1-2): M. Maresca, Teun van den Brand, Hangpeng Li, Hans Teunissen, James O. J. Davies, and Elzo de Wit. Pioneer activity distinguishes activating from non‐activating sox2 binding sites. The EMBO Journal, Sep 2023. URL: https://doi.org/10.15252/embj.2022113150, doi:10.15252/embj.2022113150. This article has 25 citations.
(orsetti2024structuraldynamicsin pages 1-3): Andrea Orsetti, Daphne van Oosten, Roxana-Geanina Vasarhelyi, Theodor-Marian Dănescu, Jan Huertas, Hugo van Ingen, and Vlad Cojocaru. Structural dynamics in chromatin unraveling by pioneer transcription factors. Biophysical Reviews, 16:365-382, Jun 2024. URL: https://doi.org/10.1007/s12551-024-01205-6, doi:10.1007/s12551-024-01205-6. This article has 11 citations and is from a peer-reviewed journal.
(seo2024acomprehensiveunderstanding pages 1-2): Yoojeong Seo, Dong Keon Kim, Jihye Park, Soo Jung Park, Jae Jun Park, Jae Hee Cheon, and Tae Il Kim. A comprehensive understanding of post-translational modification of sox2 via acetylation and o-glcnacylation in colorectal cancer. Cancers, 16:1035, Mar 2024. URL: https://doi.org/10.3390/cancers16051035, doi:10.3390/cancers16051035. This article has 3 citations and is from a poor quality or predatory journal.
(abatti2023epigeneticreprogrammingof pages 1-2): Luis E Abatti, Patricia Lado-Fernández, Linh Huynh, Manuel Collado, Michael M Hoffman, and Jennifer A Mitchell. Epigenetic reprogramming of a distal developmental enhancer cluster drives sox2 overexpression in breast and lung adenocarcinoma. Nucleic Acids Research, 51:10109-10131, Sep 2023. URL: https://doi.org/10.1093/nar/gkad734, doi:10.1093/nar/gkad734. This article has 21 citations and is from a highest quality peer-reviewed journal.
(marelli2024sumodependenttranscriptionalrepression pages 22-23): Elisa Marelli, Jaime Hughes, and Paul J. Scotting. Sumo-dependent transcriptional repression by sox2 inhibits the proliferation of neural stem cells. PLOS ONE, 19:e0298818, Mar 2024. URL: https://doi.org/10.1371/journal.pone.0298818, doi:10.1371/journal.pone.0298818. This article has 4 citations and is from a peer-reviewed journal.
(varzideh2023molecularmechanismsunderlying pages 1-3): Fahimeh Varzideh, Jessica Gambardella, Urna Kansakar, Stanislovas S. Jankauskas, and Gaetano Santulli. Molecular mechanisms underlying pluripotency and self-renewal of embryonic stem cells. International Journal of Molecular Sciences, 24:8386, May 2023. URL: https://doi.org/10.3390/ijms24098386, doi:10.3390/ijms24098386. This article has 86 citations and is from a poor quality or predatory journal.
(sen2024emergingadvancesin pages 1-3): Triparna Sen, Nobuyuki Takahashi, Subhamoy Chakraborty, Naoko Takebe, Amin H. Nassar, Nagla A. Karim, Sonam Puri, and Abdul Rafeh Naqash. Emerging advances in defining the molecular and therapeutic landscape of small-cell lung cancer. Nature reviews. Clinical oncology, 21:610-627, Jul 2024. URL: https://doi.org/10.1038/s41571-024-00914-x, doi:10.1038/s41571-024-00914-x. This article has 95 citations.
(hu2023evaluationofthe pages 18-19): Haoqing Hu, D. Ho, D. Tan, C. MacCarthy, Cheng-han Yu, Mingxi Weng, H. Schöler, and R. Jauch. Evaluation of the determinants for improved pluripotency induction and maintenance by engineered sox17. Nucleic Acids Research, 51:8934-8956, Aug 2023. URL: https://doi.org/10.1093/nar/gkad597, doi:10.1093/nar/gkad597. This article has 7 citations and is from a highest quality peer-reviewed journal.
id: P48431
gene_symbol: SOX2
product_type: PROTEIN
status: COMPLETE
taxon:
id: NCBITaxon:9606
label: Homo sapiens
description: 'SOX2 is a SOXB1 family transcription factor containing an HMG-box domain
that binds the minor groove of DNA and functions as a pioneer factor. As a core
pluripotency factor (one of the Yamanaka factors), SOX2 maintains embryonic stem
cell self-renewal and neural stem cell identity. It cooperates with OCT4 on composite
Sox/Oct DNA motifs to regulate thousands of enhancers controlling pluripotency and
early neural fate specification. SOX2 is essential for maintaining neural progenitor
cells in an undifferentiated state and preventing premature neuronal differentiation.
Nuclear localization is regulated by NLS/NES sequences and post-translational modifications
including methylation, acetylation, SUMOylation, and ubiquitination.'
existing_annotations:
# Biological Process Annotations - Transcriptional Regulation
- term:
id: GO:0000122
label: negative regulation of transcription by RNA polymerase II
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: SOX2 functions as both a transcriptional activator and repressor
depending on context. SUMOylation of SOX2 enables transcriptional
repression of centrosome/centromere and cell-cycle genes in neural stem
cells, providing proliferative restraint.
action: ACCEPT
reason: This annotation accurately represents a documented function of
SOX2. The deep research shows SUMOylation-dependent repression of
cell-cycle genes in neural stem cells. While SOX2 is primarily an
activator, its repressor function is well-established and represents a
core regulatory mechanism for controlling neural stem cell
proliferation.
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "In neural stem cells, SUMOylation of SOX2 is required
for transcriptional repression of centrosome/centromere and cell-cycle
genes and for proliferative restraint"
- term:
id: GO:0000122
label: negative regulation of transcription by RNA polymerase II
evidence_type: ISS
original_reference_id: GO_REF:0000024
review:
summary: ISS evidence from mouse orthologs supports the repressor function
of SOX2.
action: ACCEPT
reason: ISS annotation based on ortholog data is consistent with
experimental evidence from human SOX2 showing transcriptional
repression. This is a valid inference from sequence similarity given the
highly conserved nature of SOX2 function across mammals.
- term:
id: GO:0045944
label: positive regulation of transcription by RNA polymerase II
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: SOX2 is fundamentally a transcriptional activator that cooperates
with OCT4 to activate thousands of enhancers controlling pluripotency
and neural fate genes. This is the primary and core molecular function
of SOX2.
action: ACCEPT
reason: This annotation represents the core function of SOX2 as a
transcriptional activator. Deep research shows SOX2 and OCT4 co-occupy
thousands of OSN enhancers in pluripotent cells and activate genes
essential for pluripotency and neural specification. Multiple IBA, IDA,
and IEA annotations converge on this fundamental activity.
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG)
enhancers in pluripotent cells"
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "1,898 SOX2-dependent neural-associated enhancers among
8,531 OSN-bound enhancers"
- term:
id: GO:0045944
label: positive regulation of transcription by RNA polymerase II
evidence_type: IEA
original_reference_id: GO_REF:0000107
review:
summary: Electronic annotation based on ortholog data from Ensembl.
action: ACCEPT
reason: Consistent with IBA and IDA evidence for transcriptional
activation. This electronic annotation accurately reflects SOX2's
primary function.
- term:
id: GO:0045944
label: positive regulation of transcription by RNA polymerase II
evidence_type: IDA
original_reference_id: PMID:21245162
review:
summary: Direct experimental evidence for SOX2 transcriptional activation.
action: ACCEPT
reason: IDA evidence provides direct experimental support for
transcriptional activation function, consistent with the core role of
SOX2.
supported_by:
- reference_id: PMID:21245162
supporting_text: Pluripotency factors regulate definitive endoderm
specification through eomesodermin.
- term:
id: GO:0045944
label: positive regulation of transcription by RNA polymerase II
evidence_type: IDA
original_reference_id: PMID:18027866
review:
summary: Independent IDA evidence for transcriptional activation.
action: ACCEPT
reason: Multiple independent experimental studies confirm transcriptional
activation as a core SOX2 function.
supported_by:
- reference_id: PMID:18027866
supporting_text: Aberrant expression of SOX2 upregulates MUC5AC
gastric foveolar mucin in mucinous cancers of the colorectum and
related lesions.
- term:
id: GO:0045893
label: positive regulation of DNA-templated transcription
evidence_type: IDA
original_reference_id: PMID:29253717
review:
summary: IDA evidence for positive regulation of transcription.
action: ACCEPT
reason: This is a parent term of RNA pol II-specific transcriptional
activation and accurately represents SOX2's core function. The term is
at an appropriate level of specificity for general transcriptional
activation function.
supported_by:
- reference_id: PMID:29253717
supporting_text: Regulation of ADAM10 by miR-140-5p and potential
relevance for Alzheimer's disease.
- term:
id: GO:0045893
label: positive regulation of DNA-templated transcription
evidence_type: IDA
original_reference_id: PMID:18407919
review:
summary: Additional IDA evidence for transcriptional activation.
action: ACCEPT
reason: Multiple experimental studies support this fundamental function of
SOX2 as a transcriptional activator.
supported_by:
- reference_id: PMID:18407919
supporting_text: Apr 10. Novel mutations in LHX3 are associated with
hypopituitarism and sensorineural hearing loss.
- term:
id: GO:0006355
label: regulation of DNA-templated transcription
evidence_type: IEA
original_reference_id: GO_REF:0000002
review:
summary: Electronic annotation based on InterPro domain IPR022097 (SOX
family).
action: ACCEPT
reason: This is a valid high-level annotation based on domain
architecture. All SOX family transcription factors regulate
transcription, and this annotation is appropriately general. While less
specific than the positive/negative regulation terms, it is not
incorrect.
- term:
id: GO:0006355
label: regulation of DNA-templated transcription
evidence_type: IDA
original_reference_id: PMID:16153702
review:
summary: IDA evidence for general transcriptional regulation.
action: ACCEPT
reason: This parent term is supported by experimental evidence and is
appropriately general for describing SOX2's overall transcriptional
regulatory function.
supported_by:
- reference_id: PMID:16153702
supporting_text: Core transcriptional regulatory circuitry in human
embryonic stem cells.
- term:
id: GO:0006355
label: regulation of DNA-templated transcription
evidence_type: NAS
original_reference_id: PMID:7849401
review:
summary: Non-traceable author statement from the original SOX2 cloning
paper.
action: ACCEPT
reason: Historical NAS annotation from the original characterization of
SOX2 is appropriate for general transcriptional regulation function.
supported_by:
- reference_id: PMID:7849401
supporting_text: The cDNA sequence and chromosomal location of the
human SOX2 gene.
- term:
id: GO:0010468
label: regulation of gene expression
evidence_type: IMP
original_reference_id: PMID:18388306
review:
summary: IMP evidence showing mutant phenotype affecting gene expression
regulation.
action: ACCEPT
reason: This is a valid high-level annotation demonstrating SOX2's role in
gene expression control through mutant phenotype analysis. While very
general, it is appropriate as a parent term encompassing transcriptional
regulation.
# Biological Process - Neural Development and Differentiation
supported_by:
- reference_id: PMID:18388306
supporting_text: Apr 3. Regulation of self-renewal and pluripotency by
Sox2 in human embryonic stem cells.
- term:
id: GO:0030182
label: neuron differentiation
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: SOX2 plays a complex role in neuron differentiation by
maintaining neural progenitors in an undifferentiated state and
preventing premature neuronal differentiation.
action: MODIFY
reason: This term is ambiguous for SOX2's actual role. SOX2 does NOT
promote neuron differentiation; rather, it prevents premature
differentiation by maintaining neural progenitor identity. A more
accurate term would be GO:0045665 (negative regulation of neuron
differentiation), which is already present in the annotation set. The
IBA annotation likely captures involvement in the process broadly, but
this could be misleading about SOX2's actual function.
proposed_replacement_terms:
- id: GO:0045665
label: negative regulation of neuron differentiation
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "Keeps neural cells undifferentiated by counteracting the
activity of proneural proteins and suppresses neuronal differentiation"
- reference_id: UniProtKB:P48431
supporting_text: "Keeps neural cells undifferentiated by counteracting the
activity of proneural proteins and suppresses neuronal differentiation"
- term:
id: GO:0045665
label: negative regulation of neuron differentiation
evidence_type: ISS
original_reference_id: GO_REF:0000024
review:
summary: SOX2 actively suppresses neuronal differentiation to maintain
neural progenitor cells in an undifferentiated state. This is a core
function in neural development.
action: ACCEPT
reason: This annotation accurately captures SOX2's essential role in
preventing premature neuronal differentiation. UniProt states that SOX2
"keeps neural cells undifferentiated by counteracting the activity of
proneural proteins and suppresses neuronal differentiation." This is
central to its function in neural stem cell maintenance and represents a
core activity.
supported_by:
- reference_id: UniProtKB:P48431
supporting_text: "Keeps neural cells undifferentiated by counteracting the
activity of proneural proteins and suppresses neuronal differentiation"
- term:
id: GO:0097150
label: neuronal stem cell population maintenance
evidence_type: ISS
original_reference_id: GO_REF:0000024
review:
summary: SOX2 is essential for maintaining neural stem cell identity and
self-renewal. This is one of its core functions in the nervous system.
action: ACCEPT
reason: This annotation represents a core function of SOX2 in neural
biology. Deep research shows SOX2 is critical for neural stem cell
maintenance and is described as a factor that "keeps neural cells
undifferentiated." The term accurately captures SOX2's essential role in
maintaining the neural stem cell pool.
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "Downstream SRRT target that mediates the promotion of
neural stem cell self-renewal"
- term:
id: GO:0007420
label: brain development
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: SOX2 plays essential roles in brain development through
maintaining neural progenitor populations and regulating neural fate
specification.
action: KEEP_AS_NON_CORE
reason: While SOX2 is indeed involved in brain development, this term is
very broad and captures pleiotropy rather than core molecular function.
The more specific neural stem cell maintenance and neural specification
functions are better representations of SOX2's primary roles. This
annotation is not incorrect but represents a higher-level developmental
outcome rather than the core cellular function.
- term:
id: GO:0030900
label: forebrain development
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: SOX2 expression and function in forebrain development is
well-documented.
action: KEEP_AS_NON_CORE
reason: This is a more specific developmental process than general brain
development, but still represents a regional/developmental outcome
rather than core molecular function. SOX2 functions the same way
(maintaining neural progenitors) across brain regions. The
forebrain-specific annotation reflects where SOX2 is expressed and
important, but the core function is neural progenitor maintenance
regardless of brain region.
- term:
id: GO:0030900
label: forebrain development
evidence_type: IEP
original_reference_id: PMID:18285410
review:
summary: Expression pattern evidence showing SOX2 expression during
forebrain development.
action: KEEP_AS_NON_CORE
reason: IEP evidence showing expression pattern during forebrain
development is valid but does not demonstrate specific function. This is
consistent with SOX2's role in maintaining neural progenitors throughout
the developing brain including forebrain regions.
supported_by:
- reference_id: PMID:18285410
supporting_text: Epub 2008 Feb 19. SOX2 plays a critical role in the
pituitary, forebrain, and eye during human embryonic development.
- term:
id: GO:0021781
label: glial cell fate commitment
evidence_type: NAS
original_reference_id: PMID:17291498
review:
summary: NAS evidence for role in glial cell fate commitment.
action: KEEP_AS_NON_CORE
reason: While SOX2 is expressed in glial progenitors and can influence
glial fate, this represents one specific outcome of SOX2's more general
role in maintaining progenitor identity. The core function is
maintaining undifferentiated stem/progenitor states; glial fate
commitment is one context-dependent outcome. This is valid but
peripheral to core function.
# Biological Process - Stem Cell Maintenance and Pluripotency
supported_by:
- reference_id: PMID:17291498
supporting_text: 2006 Dec 20. Sox-2 is expressed by glial and
progenitor cells and Pax-6 is expressed by neuroblasts in the human
subventricular zone.
- term:
id: GO:0035019
label: somatic stem cell population maintenance
evidence_type: IMP
original_reference_id: PMID:19736317
review:
summary: IMP evidence showing SOX2 is required for maintaining somatic
stem cell populations, particularly neural stem cells.
action: ACCEPT
reason: This annotation represents a core function of SOX2. Multiple
independent IMP and IDA studies demonstrate SOX2's essential role in
maintaining stem cell populations. This encompasses both embryonic and
neural stem cell maintenance and represents a fundamental cellular
function of SOX2.
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "Critical for early embryogenesis and for embryonic stem
cell pluripotency"
- reference_id: PMID:19736317
supporting_text: 'Interplay of Oct4 with Sox2 and Sox17: a molecular switch
from stem cell pluripotency to specifying a cardiac fate.'
- term:
id: GO:0035019
label: somatic stem cell population maintenance
evidence_type: IDA
original_reference_id: PMID:19409607
review:
summary: Independent IDA evidence for stem cell maintenance.
action: ACCEPT
reason: Multiple independent experimental studies support this core
function.
supported_by:
- reference_id: PMID:19409607
supporting_text: Apr 30. MicroRNA-145 regulates OCT4, SOX2, and KLF4
and represses pluripotency in human embryonic stem cells.
- term:
id: GO:0035019
label: somatic stem cell population maintenance
evidence_type: IMP
original_reference_id: PMID:18388306
review:
summary: Additional IMP evidence from mutant phenotype analysis.
action: ACCEPT
reason: Mutant phenotype evidence provides strong support for essential
role in stem cell maintenance.
# Biological Process - Cell Cycle and Proliferation
supported_by:
- reference_id: PMID:18388306
supporting_text: Apr 3. Regulation of self-renewal and pluripotency by
Sox2 in human embryonic stem cells.
- term:
id: GO:1902807
label: negative regulation of cell cycle G1/S phase transition
evidence_type: IDA
original_reference_id: PMID:18268498
review:
summary: IDA evidence for SOX2 role in cell cycle regulation.
action: ACCEPT
reason: This annotation is supported by direct experimental evidence and
is consistent with SOX2's role in maintaining progenitor cells in a
self-renewing state. Cell cycle regulation is an integral part of SOX2's
function in controlling stem cell proliferation versus differentiation
decisions.
supported_by:
- reference_id: PMID:18268498
supporting_text: Feb 12. SOX2 is frequently downregulated in gastric
cancers and inhibits cell growth through cell-cycle arrest and
apoptosis.
- term:
id: GO:0043410
label: positive regulation of MAPK cascade
evidence_type: IDA
original_reference_id: PMID:18187129
review:
summary: IDA evidence for SOX2 involvement in MAPK signaling.
action: KEEP_AS_NON_CORE
reason: While experimentally supported, MAPK cascade regulation represents
a context-specific signaling outcome rather than a core molecular
function. SOX2's primary function is as a transcription factor; MAPK
regulation may be an indirect consequence or occur in specific cellular
contexts (e.g., mesenchymal stem cells in this study). This is valid but
peripheral to core transcriptional regulatory function.
# Biological Process - Development and Fate Specification
supported_by:
- reference_id: PMID:18187129
supporting_text: Epub 2007 Dec 4. Forced expression of Sox2 or Nanog
in human bone marrow derived mesenchymal stem cells maintains their
expansion and differentiation capabilities.
- term:
id: GO:0048839
label: inner ear development
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: SOX2 expression and function in inner ear development.
action: KEEP_AS_NON_CORE
reason: SOX2 plays roles in sensory organ development including inner ear,
but this represents tissue-specific developmental pleiotropy rather than
core function. The core function is maintaining progenitor populations;
inner ear development is one of many developmental contexts where this
occurs.
- term:
id: GO:0048839
label: inner ear development
evidence_type: IEP
original_reference_id: PMID:18407919
review:
summary: Expression pattern evidence during inner ear development.
action: KEEP_AS_NON_CORE
reason: IEP evidence supports expression during inner ear development but
does not demonstrate specific function. Consistent with SOX2's broader
role in sensory progenitor maintenance.
supported_by:
- reference_id: PMID:18407919
supporting_text: Apr 10. Novel mutations in LHX3 are associated with
hypopituitarism and sensorineural hearing loss.
- term:
id: GO:0001654
label: eye development
evidence_type: IEP
original_reference_id: PMID:18285410
review:
summary: Expression pattern during eye development.
action: KEEP_AS_NON_CORE
reason: SOX2 mutations cause anophthalmia/microphthalmia, demonstrating
clear importance in eye development. However, this represents
tissue-specific developmental outcome rather than core molecular
function. Like inner ear development, eye development is one context
where SOX2's progenitor-maintaining function is critical.
supported_by:
- reference_id: PMID:18285410
supporting_text: Epub 2008 Feb 19. SOX2 plays a critical role in the
pituitary, forebrain, and eye during human embryonic development.
- term:
id: GO:0021983
label: pituitary gland development
evidence_type: IEP
original_reference_id: PMID:18285410
review:
summary: Expression pattern during pituitary development.
action: KEEP_AS_NON_CORE
reason: Another tissue-specific developmental context where SOX2 maintains
progenitor populations. Valid annotation but represents developmental
pleiotropy rather than core cellular function.
supported_by:
- reference_id: PMID:18285410
supporting_text: Epub 2008 Feb 19. SOX2 plays a critical role in the
pituitary, forebrain, and eye during human embryonic development.
- term:
id: GO:0001714
label: endodermal cell fate specification
evidence_type: IDA
original_reference_id: PMID:21245162
review:
summary: IDA evidence for role in endoderm specification.
action: KEEP_AS_NON_CORE
reason: SOX2 with OCT4 regulates germ layer specification including
endoderm fate. While experimentally supported, this represents one
developmental context among many. SOX2's core function is maintaining
pluripotency and regulating lineage choice; endoderm specification is
one specific outcome.
supported_by:
- reference_id: PMID:21245162
supporting_text: Pluripotency factors regulate definitive endoderm
specification through eomesodermin.
- term:
id: GO:0001649
label: osteoblast differentiation
evidence_type: IDA
original_reference_id: PMID:18187129
review:
summary: IDA evidence for SOX2 role in osteoblast differentiation when
ectopically expressed in mesenchymal stem cells.
action: KEEP_AS_NON_CORE
reason: This represents an experimental manipulation (forced SOX2
expression in mesenchymal stem cells) rather than SOX2's endogenous core
function. While experimentally valid, osteoblast differentiation is not
a physiological context where SOX2 normally functions and represents
experimental pleiotropy.
# Biological Process - Signaling and Response
supported_by:
- reference_id: PMID:18187129
supporting_text: Epub 2007 Dec 4. Forced expression of Sox2 or Nanog
in human bone marrow derived mesenchymal stem cells maintains their
expansion and differentiation capabilities.
- term:
id: GO:0090090
label: negative regulation of canonical Wnt signaling pathway
evidence_type: IDA
original_reference_id: PMID:18285410
review:
summary: IDA evidence for SOX2 regulation of Wnt signaling.
action: KEEP_AS_NON_CORE
reason: While experimentally supported, Wnt pathway regulation represents
cross-talk between SOX2's transcriptional program and developmental
signaling rather than a core molecular function. This is a valid
regulatory connection but peripheral to SOX2's primary role as a
transcriptional regulator of pluripotency and neural fate.
supported_by:
- reference_id: PMID:18285410
supporting_text: Epub 2008 Feb 19. SOX2 plays a critical role in the
pituitary, forebrain, and eye during human embryonic development.
- term:
id: GO:0070848
label: response to growth factor
evidence_type: IDA
original_reference_id: PMID:18187129
review:
summary: IDA evidence for SOX2 response to growth factor stimulation.
action: KEEP_AS_NON_CORE
reason: This represents an upstream regulatory input rather than SOX2's
core function. Growth factors can regulate SOX2 expression or activity,
but this does not define what SOX2 itself does. This is valid annotation
but describes regulation of SOX2 rather than SOX2's function.
supported_by:
- reference_id: PMID:18187129
supporting_text: Epub 2007 Dec 4. Forced expression of Sox2 or Nanog
in human bone marrow derived mesenchymal stem cells maintains their
expansion and differentiation capabilities.
- term:
id: GO:0009611
label: response to wounding
evidence_type: IEP
original_reference_id: PMID:17982423
review:
summary: Expression pattern evidence in corneal stem cells after wounding.
action: KEEP_AS_NON_CORE
reason: IEP evidence showing SOX2 expression in corneal stem cells during
wound healing. This represents SOX2's role in maintaining stem cell
populations in a regenerative context, which is consistent with core
function but wound response itself is a specific physiological context
rather than core cellular function.
supported_by:
- reference_id: PMID:17982423
supporting_text: Stem cell markers in the human posterior limbus and
corneal endothelium of unwounded and wounded corneas.
- term:
id: GO:0006325
label: chromatin organization
evidence_type: NAS
original_reference_id: PMID:7849401
review:
summary: NAS evidence for chromatin organization, likely based on SOX2's
ability to bend DNA and engage nucleosomes.
action: ACCEPT
reason: This annotation accurately captures SOX2's pioneer factor
function. Deep research shows SOX2 engages nucleosomal DNA, bends DNA,
and maintains chromatin accessibility at key regulatory sites. This is
integral to SOX2's mechanism of action as a pioneer transcription factor
and represents a core molecular activity.
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "SOX2 can bind nucleosomal DNA in a position- and sequence-dependent
manner and maintain chromatin accessibility at key regulatory sites"
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "thousands of ATAC-seq peaks are lost within ~1 hour of
SOX2 degradation"
# Molecular Function - DNA Binding and Transcription Factor Activity
- reference_id: PMID:7849401
supporting_text: The cDNA sequence and chromosomal location of the
human SOX2 gene.
- term:
id: GO:0001228
label: DNA-binding transcription activator activity, RNA polymerase
II-specific
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: SOX2 functions as a sequence-specific transcriptional activator
binding to Sox motifs (WWCAAW consensus).
action: ACCEPT
reason: This annotation accurately represents SOX2's core molecular
function. The HMG-box domain binds sequence-specifically to Sox motifs
and activates transcription. This is supported by IBA phylogenetic
inference and extensive experimental evidence.
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "SOX2 contains a ~79-aa HMG box that binds the minor groove
of cognate sequences (consensus WWCAAW; commonly the core TTGT)"
- term:
id: GO:0001228
label: DNA-binding transcription activator activity, RNA polymerase
II-specific
evidence_type: IEA
original_reference_id: GO_REF:0000107
review:
summary: Electronic annotation based on Ensembl ortholog data.
action: ACCEPT
reason: Electronic annotation is consistent with experimental and
phylogenetic evidence for transcriptional activator function.
- term:
id: GO:0000981
label: DNA-binding transcription factor activity, RNA polymerase
II-specific
evidence_type: ISA
original_reference_id: GO_REF:0000113
review:
summary: ISA evidence from TFClass database classification of SOX2 as a
sequence-specific DNA-binding transcription factor.
action: ACCEPT
reason: This is a parent term of DNA-binding transcription activator
activity and accurately describes SOX2's molecular function class. The
ISA evidence from TFClass is appropriate for classifying SOX2 within the
transcription factor family.
- term:
id: GO:0003700
label: DNA-binding transcription factor activity
evidence_type: IDA
original_reference_id: PMID:18027866
review:
summary: IDA evidence for general transcription factor activity.
action: ACCEPT
reason: This is a high-level parent term accurately describing SOX2's
molecular function. Multiple IDA studies support this classification.
supported_by:
- reference_id: PMID:18027866
supporting_text: Aberrant expression of SOX2 upregulates MUC5AC
gastric foveolar mucin in mucinous cancers of the colorectum and
related lesions.
- term:
id: GO:0003700
label: DNA-binding transcription factor activity
evidence_type: IDA
original_reference_id: PMID:16153702
review:
summary: Additional IDA evidence for transcription factor activity.
action: ACCEPT
reason: Independent experimental support for transcription factor
classification.
supported_by:
- reference_id: PMID:16153702
supporting_text: Core transcriptional regulatory circuitry in human
embryonic stem cells.
- term:
id: GO:0003700
label: DNA-binding transcription factor activity
evidence_type: NAS
original_reference_id: PMID:7849401
review:
summary: NAS evidence from original SOX2 characterization.
action: ACCEPT
reason: Historical NAS annotation from the original SOX2 cloning and
characterization is appropriate.
supported_by:
- reference_id: PMID:7849401
supporting_text: The cDNA sequence and chromosomal location of the
human SOX2 gene.
- term:
id: GO:0000978
label: RNA polymerase II cis-regulatory region sequence-specific DNA
binding
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: SOX2 binds to cis-regulatory enhancer regions through
sequence-specific recognition of Sox motifs.
action: ACCEPT
reason: This annotation accurately captures SOX2's binding to enhancer
elements. Deep research shows SOX2 binds thousands of enhancers and
recognizes specific DNA sequences. This represents a core molecular
function of the HMG-box domain.
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG)
enhancers in pluripotent cells"
- term:
id: GO:0000976
label: transcription cis-regulatory region binding
evidence_type: IEA
original_reference_id: GO_REF:0000107
review:
summary: Electronic annotation for cis-regulatory region binding.
action: ACCEPT
reason: This is a parent term of sequence-specific cis-regulatory binding
and is supported by extensive evidence of SOX2 binding to enhancers.
- term:
id: GO:0000976
label: transcription cis-regulatory region binding
evidence_type: IDA
original_reference_id: PMID:18407919
review:
summary: IDA evidence for cis-regulatory binding.
action: ACCEPT
reason: Direct experimental evidence supports SOX2's binding to regulatory
regions.
supported_by:
- reference_id: PMID:18407919
supporting_text: Apr 10. Novel mutations in LHX3 are associated with
hypopituitarism and sensorineural hearing loss.
- term:
id: GO:0043565
label: sequence-specific DNA binding
evidence_type: IDA
original_reference_id: PMID:20713518
review:
summary: IDA evidence for sequence-specific DNA binding.
action: ACCEPT
reason: This annotation accurately describes the fundamental DNA-binding
property of the SOX2 HMG-box domain, which recognizes specific Sox motif
sequences. This is a core molecular function.
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "HMG box that binds the minor groove of cognate sequences
(consensus WWCAAW)"
- reference_id: PMID:20713518
supporting_text: Foxp1 coordinates cardiomyocyte proliferation through
both cell-autonomous and nonautonomous mechanisms.
- term:
id: GO:0003677
label: DNA binding
evidence_type: IEA
original_reference_id: GO_REF:0000043
review:
summary: Electronic annotation based on UniProtKB keyword mapping.
action: ACCEPT
reason: This high-level term is appropriate for describing SOX2's
DNA-binding capacity. While very general, it is not incorrect and is
supported by HMG-box domain annotation.
- term:
id: GO:0003677
label: DNA binding
evidence_type: IDA
original_reference_id: PMID:19427902
review:
summary: IDA evidence for DNA binding from biochemical study.
action: ACCEPT
reason: Direct experimental evidence for DNA binding is appropriate.
supported_by:
- reference_id: PMID:19427902
supporting_text: 2009 May 8. Purification of human transcription
factors Nanog and Sox2, each in complex with Skp, an Escherichia
coli periplasmic chaperone.
- term:
id: GO:0003677
label: DNA binding
evidence_type: IDA
original_reference_id: PMID:18285410
review:
summary: Additional IDA evidence for DNA binding.
action: ACCEPT
reason: Multiple independent experimental confirmations of DNA-binding
activity.
supported_by:
- reference_id: PMID:18285410
supporting_text: Epub 2008 Feb 19. SOX2 plays a critical role in the
pituitary, forebrain, and eye during human embryonic development.
- term:
id: GO:0003677
label: DNA binding
evidence_type: IDA
original_reference_id: PMID:18407919
review:
summary: Further IDA evidence for DNA binding.
action: ACCEPT
reason: Consistent experimental support for DNA-binding function.
supported_by:
- reference_id: PMID:18407919
supporting_text: Apr 10. Novel mutations in LHX3 are associated with
hypopituitarism and sensorineural hearing loss.
- term:
id: GO:0003677
label: DNA binding
evidence_type: IDA
original_reference_id: PMID:16153702
review:
summary: IDA evidence from embryonic stem cell study.
action: ACCEPT
reason: DNA binding is well-established core molecular function.
supported_by:
- reference_id: PMID:16153702
supporting_text: Core transcriptional regulatory circuitry in human
embryonic stem cells.
- term:
id: GO:0003677
label: DNA binding
evidence_type: NAS
original_reference_id: PMID:7849401
review:
summary: NAS evidence from original SOX2 paper.
action: ACCEPT
reason: Historical annotation appropriate for fundamental DNA-binding
function.
# Molecular Function - Protein Binding
supported_by:
- reference_id: PMID:7849401
supporting_text: The cDNA sequence and chromosomal location of the
human SOX2 gene.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:19736317
review:
summary: IPI evidence for protein-protein interaction (OCT4).
action: REMOVE
reason: The term "protein binding" is uninformative and does not tell us
about SOX2's actual function. While SOX2 does bind OCT4 and other
proteins, a more specific term like GO:0046982 (protein
heterodimerization activity) or annotation of the specific complex
(OCT4-SOX2 complex) would be more informative. Per curation guidelines,
we should avoid generic "protein binding" annotations in favor of more
specific molecular function terms.
supported_by:
- reference_id: PMID:19736317
supporting_text: 'Interplay of Oct4 with Sox2 and Sox17: a molecular switch
from stem cell pluripotency to specifying a cardiac fate.'
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:23667531
review:
summary: IPI evidence for protein interaction (USP9X, MSI2).
action: REMOVE
reason: Generic protein binding term is uninformative per curation
guidelines.
supported_by:
- reference_id: PMID:23667531
supporting_text: The SOX2-interactome in brain cancer cells identifies
the requirement of MSI2 and USP9X for the growth of brain tumor
cells.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:24048479
review:
summary: IPI evidence for protein interaction (MBD3).
action: REMOVE
reason: Generic protein binding term is uninformative per curation
guidelines.
supported_by:
- reference_id: PMID:24048479
supporting_text: Deterministic direct reprogramming of somatic cells
to pluripotency.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:24525231
review:
summary: IPI evidence for protein interaction (PRKCI).
action: REMOVE
reason: Generic protein binding term is uninformative per curation
guidelines.
supported_by:
- reference_id: PMID:24525231
supporting_text: The PRKCI and SOX2 oncogenes are coamplified and
cooperate to activate Hedgehog signaling in lung squamous cell
carcinoma.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:25416956
review:
summary: IPI evidence from interactome mapping (SUMO1).
action: REMOVE
reason: Generic protein binding term is uninformative per curation
guidelines.
supported_by:
- reference_id: PMID:25416956
supporting_text: A proteome-scale map of the human interactome
network.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:25702638
review:
summary: IPI evidence for protein interaction (L1TD1, OCT4).
action: REMOVE
reason: Generic protein binding term is uninformative per curation
guidelines.
supported_by:
- reference_id: PMID:25702638
supporting_text: Epub 2015 Feb 19. The L1TD1 protein interactome
reveals the importance of post-transcriptional regulation in human
pluripotency.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:27334688
review:
summary: IPI evidence for protein interaction (VRK1).
action: REMOVE
reason: Generic protein binding term is uninformative per curation
guidelines.
supported_by:
- reference_id: PMID:27334688
supporting_text: Oncogenic Sox2 regulates and cooperates with VRK1 in
cell cycle progression and differentiation.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:31515488
review:
summary: IPI evidence for protein interaction (SUMO1).
action: REMOVE
reason: Generic protein binding term is uninformative per curation
guidelines.
supported_by:
- reference_id: PMID:31515488
supporting_text: Extensive disruption of protein interactions by
genetic variants across the allele frequency spectrum in human
populations.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:29358331
review:
summary: IPI evidence for protein interaction (PHF20L1).
action: REMOVE
reason: Generic protein binding term is uninformative per curation
guidelines.
supported_by:
- reference_id: PMID:29358331
supporting_text: 2018 Jan 22. LSD1 demethylase and the methyl-binding
protein PHF20L1 prevent SET7 methyltransferase-dependent proteolysis
of the stem-cell protein SOX2.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:30442713
review:
summary: IPI evidence for protein interaction (L3MBTL3).
action: REMOVE
reason: Generic protein binding term is uninformative per curation
guidelines.
supported_by:
- reference_id: PMID:30442713
supporting_text: 2018 Nov 15. Proteolysis of methylated SOX2 protein
is regulated by L3MBTL3 and CRL4(DCAF5) ubiquitin ligase.
- term:
id: GO:0005515
label: protein binding
evidence_type: IPI
original_reference_id: PMID:18456656
review:
summary: IPI evidence for protein interaction.
action: REMOVE
reason: Generic protein binding term is uninformative per curation
guidelines.
# Molecular Function - RNA Binding
supported_by:
- reference_id: PMID:18456656
supporting_text: 2008 May 2. The molecular mechanism governing the
oncogenic potential of SOX2 in breast cancer.
- term:
id: GO:0035198
label: miRNA binding
evidence_type: IDA
original_reference_id: PMID:18710938
review:
summary: IDA evidence for SOX2 binding to miR-302.
action: KEEP_AS_NON_CORE
reason: While experimentally validated, miRNA binding represents a
non-canonical activity that is not part of SOX2's core transcription
factor function. The primary function of SOX2 is sequence-specific DNA
binding and transcriptional regulation. miRNA interaction may represent
regulatory crosstalk but is peripheral to the core molecular function.
# Cellular Component - Nuclear Localization
supported_by:
- reference_id: PMID:18710938
supporting_text: Aug 18. Oct4/Sox2-regulated miR-302 targets cyclin D1
in human embryonic stem cells.
- term:
id: GO:0005634
label: nucleus
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: SOX2 is a nuclear transcription factor with NLS sequences
directing nuclear localization.
action: ACCEPT
reason: Nuclear localization is essential for SOX2 function as a
transcription factor. This annotation accurately represents where SOX2
performs its core function. UniProt describes nuclear localization
regulated by NLS/NES sequences, and deep research confirms SOX2 is
"predominantly nuclear."
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "Subcellular localization: SOX2 is nuclear; its HMG region
and flanking sequences include NLS/NES"
- term:
id: GO:0005634
label: nucleus
evidence_type: IEA
original_reference_id: GO_REF:0000120
review:
summary: Electronic annotation based on sequence analysis and ortholog
data.
action: ACCEPT
reason: Electronic annotation is consistent with experimental evidence for
nuclear localization.
- term:
id: GO:0005634
label: nucleus
evidence_type: IDA
original_reference_id: PMID:17291498
review:
summary: Direct experimental evidence for nuclear localization.
action: ACCEPT
reason: Multiple IDA studies confirm nuclear localization through
microscopy or fractionation.
supported_by:
- reference_id: PMID:17291498
supporting_text: 2006 Dec 20. Sox-2 is expressed by glial and
progenitor cells and Pax-6 is expressed by neuroblasts in the human
subventricular zone.
- term:
id: GO:0005634
label: nucleus
evidence_type: IDA
original_reference_id: PMID:18027866
review:
summary: Additional IDA evidence for nuclear localization.
action: ACCEPT
reason: Consistent experimental evidence across multiple studies.
supported_by:
- reference_id: PMID:18027866
supporting_text: Aberrant expression of SOX2 upregulates MUC5AC
gastric foveolar mucin in mucinous cancers of the colorectum and
related lesions.
- term:
id: GO:0005634
label: nucleus
evidence_type: IDA
original_reference_id: PMID:18285410
review:
summary: Further IDA evidence for nuclear localization.
action: ACCEPT
reason: Well-established nuclear localization.
supported_by:
- reference_id: PMID:18285410
supporting_text: Epub 2008 Feb 19. SOX2 plays a critical role in the
pituitary, forebrain, and eye during human embryonic development.
- term:
id: GO:0005634
label: nucleus
evidence_type: IC
original_reference_id: PMID:16153702
review:
summary: Inferred from curator statement based on DNA binding activity.
action: ACCEPT
reason: IC inference is appropriate - DNA-binding transcription factors
must be nuclear to function.
supported_by:
- reference_id: PMID:16153702
supporting_text: Core transcriptional regulatory circuitry in human
embryonic stem cells.
- term:
id: GO:0005634
label: nucleus
evidence_type: NAS
original_reference_id: PMID:7849401
review:
summary: NAS evidence from original SOX2 paper.
action: ACCEPT
reason: Historical annotation appropriate for nuclear localization.
supported_by:
- reference_id: PMID:7849401
supporting_text: The cDNA sequence and chromosomal location of the
human SOX2 gene.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: IDA
original_reference_id: GO_REF:0000052
review:
summary: IDA evidence based on immunofluorescence localization.
action: ACCEPT
reason: Nucleoplasm is a more specific cellular component than nucleus,
indicating SOX2 is soluble within the nuclear compartment rather than
associated with nuclear envelope or other nuclear structures. This is
appropriate for a transcription factor.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-1112609
review:
summary: TAS evidence from Reactome pathway annotation (multiple Reactome
entries).
action: ACCEPT
reason: Multiple Reactome pathway annotations place SOX2 in nucleoplasm,
consistent with its function in transcriptional regulation. These TAS
annotations are appropriate.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972948
review:
summary: TAS from Reactome (OCT4, SOX2, NANOG bind GSC promoter).
action: ACCEPT
reason: Consistent with nucleoplasmic localization during transcriptional
activity.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972949
review:
summary: TAS from Reactome (OCT4, SOX2, NANOG bind EOMES promoter).
action: ACCEPT
reason: Multiple Reactome TAS annotations are consistent with nucleoplasm
localization.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972956
review:
summary: TAS from Reactome (STAT3 promoter binding).
action: ACCEPT
reason: Consistent nucleoplasm localization across Reactome pathways.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972960
review:
summary: TAS from Reactome (FGF2 promoter binding).
action: ACCEPT
reason: Multiple consistent Reactome annotations support nucleoplasm
localization.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972962
review:
summary: TAS from Reactome (TDGF1 promoter binding).
action: ACCEPT
reason: Consistent Reactome pathway annotations.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972964
review:
summary: TAS from Reactome (FOXD3 promoter binding).
action: ACCEPT
reason: Consistent with nucleoplasm localization.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972965
review:
summary: TAS from Reactome (TSC22D1 promoter binding).
action: ACCEPT
reason: Multiple Reactome TAS annotations are appropriate.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972966
review:
summary: TAS from Reactome (HHEX promoter binding).
action: ACCEPT
reason: Consistent Reactome annotations.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972967
review:
summary: TAS from Reactome (EPHA1 promoter binding).
action: ACCEPT
reason: Appropriate TAS annotation from Reactome.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972975
review:
summary: TAS from Reactome (SALL1 promoter binding).
action: ACCEPT
reason: Consistent with nucleoplasm localization during transcriptional
regulation.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972977
review:
summary: TAS from Reactome (DKK1 promoter binding).
action: ACCEPT
reason: Multiple consistent Reactome pathway annotations.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972978
review:
summary: TAS from Reactome (ZIC3 promoter binding).
action: ACCEPT
reason: Appropriate Reactome TAS annotation.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-2972979
review:
summary: TAS from Reactome (CDX2 promoter).
action: ACCEPT
reason: Consistent with nucleoplasm localization.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-452894
review:
summary: TAS from Reactome (Expression of SOX2).
action: ACCEPT
reason: Appropriate Reactome annotation.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-480204
review:
summary: TAS from Reactome (NANOG promoter binding).
action: ACCEPT
reason: Consistent nucleoplasm localization.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-480685
review:
summary: TAS from Reactome (SOX2 promoter autoregulation).
action: ACCEPT
reason: Appropriate Reactome TAS annotation.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-5626938
review:
summary: TAS from Reactome (Beta-catenin binds SOX proteins).
action: ACCEPT
reason: Consistent with nucleoplasm localization.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-6790036
review:
summary: TAS from Reactome (STAT3-upregulated nuclear proteins).
action: ACCEPT
reason: Appropriate Reactome annotation.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-6800120
review:
summary: TAS from Reactome (DPPA4 gene binding).
action: ACCEPT
reason: Consistent nucleoplasm localization.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-9732125
review:
summary: TAS from Reactome (MITF promoter binding).
action: ACCEPT
reason: Appropriate Reactome TAS annotation.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-9833066
review:
summary: TAS from Reactome (ZEB2 gene binding).
action: ACCEPT
reason: Consistent with nucleoplasm localization.
- term:
id: GO:0005654
label: nucleoplasm
evidence_type: TAS
original_reference_id: Reactome:R-HSA-9926523
review:
summary: TAS from Reactome (MITF-dependent SOX2 expression).
action: ACCEPT
reason: Final Reactome TAS annotation, consistent with others.
- term:
id: GO:0016607
label: nuclear speck
evidence_type: IEA
original_reference_id: GO_REF:0000044
review:
summary: Electronic annotation suggesting SOX2 localizes to nuclear
speckles.
action: ACCEPT
reason: Nuclear speckles are subnuclear domains enriched in splicing
factors and certain transcription factors. UniProt states SOX2
colocalizes in speckles with SOX6. While this is a more specialized
localization than general nucleoplasm, it is consistent with SOX2's
dynamic localization in active transcriptional regions. This annotation
is appropriate.
supported_by:
- reference_id: UniProtKB:P48431
supporting_text: "Colocalizes with SOX6 in speckles"
- term:
id: GO:0000785
label: chromatin
evidence_type: ISA
original_reference_id: GO_REF:0000113
review:
summary: ISA evidence from TFClass database indicating SOX2 associates
with chromatin.
action: ACCEPT
reason: This annotation is highly appropriate for SOX2 as a pioneer
factor. Deep research shows SOX2 engages nucleosomal DNA and maintains
chromatin accessibility. Chromatin localization is essential for SOX2's
function as a pioneer transcription factor that can bind DNA in
chromatin context.
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "As a pioneer factor, SOX2 can bind nucleosomal DNA in
a position- and sequence-dependent manner"
- term:
id: GO:0005667
label: transcription regulator complex
evidence_type: TAS
original_reference_id: PMID:19736317
review:
summary: TAS evidence for SOX2 participation in transcriptional regulatory
complexes, particularly with OCT4.
action: ACCEPT
reason: SOX2 forms complexes with OCT4 and other transcription factors to
regulate gene expression. This is well-documented and represents an
important aspect of SOX2 function - it does not act alone but cooperates
with partner factors in multi-protein complexes.
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "Structural biophysics and NMR show that SOX2's HMG domain
binds nucleosomal DNA in a site- and sequence-dependent manner and forms
synergistic complexes with full-length OCT4"
# Cellular Component - Cytoplasmic Localization
- reference_id: PMID:19736317
supporting_text: 'Interplay of Oct4 with Sox2 and Sox17: a molecular switch
from stem cell pluripotency to specifying a cardiac fate.'
- term:
id: GO:0005737
label: cytoplasm
evidence_type: IEA
original_reference_id: GO_REF:0000044
review:
summary: Electronic annotation based on UniProtKB subcellular location
vocabulary.
action: ACCEPT
reason: While SOX2 is predominantly nuclear, UniProt documents cytoplasmic
localization that is regulated by post-translational modifications.
Acetylation at K75 promotes nuclear export leading to cytoplasmic
localization and proteasomal degradation. This represents a regulatory
mechanism for controlling SOX2 levels and activity.
supported_by:
- reference_id: UniProtKB:P48431
supporting_text: "Acetylation contributes to its nuclear localization and
deacetylation by HDAC3 induces a cytoplasmic delocalization"
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "acetylation at K75 within the SOX2 NES, mediated by CBP/p300
and ACSS3/ACSS2, promotes nuclear export"
- term:
id: GO:0005737
label: cytoplasm
evidence_type: IDA
original_reference_id: PMID:18027866
review:
summary: IDA evidence showing cytoplasmic localization of SOX2.
action: ACCEPT
reason: Direct experimental observation of cytoplasmic SOX2 is consistent
with regulated nuclear-cytoplasmic shuttling. While not the primary site
of function, cytoplasmic localization is physiologically relevant for
regulation of SOX2 activity.
supported_by:
- reference_id: PMID:18027866
supporting_text: Aberrant expression of SOX2 upregulates MUC5AC
gastric foveolar mucin in mucinous cancers of the colorectum and
related lesions.
- term:
id: GO:0005829
label: cytosol
evidence_type: IDA
original_reference_id: PMID:18000303
review:
summary: IDA evidence for cytosolic localization.
action: ACCEPT
reason: Cytosol is the aqueous component of cytoplasm. This annotation is
consistent with cytoplasmic localization and represents SOX2 in the
soluble cytoplasmic fraction, likely during nuclear-cytoplasmic
transport or regulated export.
# Core Functions Summary
supported_by:
- reference_id: PMID:18000303
supporting_text: Evidence for the presence of stem cell-like
progenitor cells in human adult pancreas.
core_functions:
- molecular_function:
id: GO:0001228
label: DNA-binding transcription activator activity, RNA polymerase
II-specific
directly_involved_in:
- id: GO:0045944
label: positive regulation of transcription by RNA polymerase II
- id: GO:0000122
label: negative regulation of transcription by RNA polymerase II
locations:
- id: GO:0005634
label: nucleus
- id: GO:0000785
label: chromatin
description: SOX2 functions as a sequence-specific DNA-binding transcription
factor with an HMG-box domain that binds Sox motifs (WWCAAW consensus).
Acts as a pioneer factor engaging nucleosomal DNA to maintain chromatin
accessibility at regulatory enhancers. Primarily activates transcription
cooperating with OCT4 on composite Sox/Oct motifs, but also functions as a
repressor when SUMOylated.
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "SOX2 contains a ~79-aa HMG box that binds the minor groove
of cognate sequences (consensus WWCAAW; commonly the core TTGT)"
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "SOX2 and OCT4 co-occupy thousands of OSN (OCT4/SOX2/NANOG)
enhancers in pluripotent cells"
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "As a pioneer factor, SOX2 can bind nucleosomal DNA in a
position- and sequence-dependent manner and maintain chromatin accessibility
at key regulatory sites"
- molecular_function:
id: GO:0001228
label: DNA-binding transcription activator activity, RNA polymerase
II-specific
directly_involved_in:
- id: GO:0035019
label: somatic stem cell population maintenance
locations:
- id: GO:0005634
label: nucleus
description: SOX2 is one of the four Yamanaka factors (with OCT4, KLF4, and
MYC) sufficient for reprogramming somatic cells to induced pluripotent
stem cells. Forms transcriptional regulatory complexes with OCT4 and NANOG
to maintain embryonic stem cell self-renewal and pluripotency.
supported_by:
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "Critical for early embryogenesis and for embryonic stem
cell pluripotency"
- reference_id: UniProtKB:P48431
supporting_text: "POU5F1/OCT4, SOX2, MYC/c-Myc and KLF4 are the four Yamanaka
factors"
- molecular_function:
id: GO:0001228
label: DNA-binding transcription activator activity, RNA polymerase
II-specific
directly_involved_in:
- id: GO:0097150
label: neuronal stem cell population maintenance
- id: GO:0045665
label: negative regulation of neuron differentiation
locations:
- id: GO:0005634
label: nucleus
description: SOX2 maintains neural stem cells in an undifferentiated state
and prevents premature neuronal differentiation by counteracting proneural
factors. Essential for neural progenitor self-renewal and maintaining the
neural stem cell pool throughout the nervous system.
supported_by:
- reference_id: UniProtKB:P48431
supporting_text: "Keeps neural cells undifferentiated by counteracting the
activity of proneural proteins and suppresses neuronal differentiation"
- reference_id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
supporting_text: "Downstream SRRT target that mediates the promotion of neural
stem cell self-renewal"
references:
- id: GO_REF:0000002
title: Gene Ontology annotation through association of InterPro records with
GO terms
- id: GO_REF:0000024
title: Manual transfer of experimentally-verified manual GO annotation data
to orthologs by curator judgment of sequence similarity
- id: GO_REF:0000033
title: Annotation inferences using phylogenetic trees
- id: GO_REF:0000043
title: Gene Ontology annotation based on UniProtKB/Swiss-Prot keyword
mapping
- id: GO_REF:0000044
title: Gene Ontology annotation based on UniProtKB/Swiss-Prot Subcellular
Location vocabulary mapping
- id: GO_REF:0000052
title: Gene Ontology annotation based on curation of immunofluorescence data
- id: GO_REF:0000107
title: Automatic transfer of experimentally verified manual GO annotation
data to orthologs using Ensembl Compara
- id: GO_REF:0000113
title: Gene Ontology annotation of human sequence-specific DNA binding
transcription factors based on TFClass database
- id: GO_REF:0000120
title: Combined Automated Annotation using Multiple IEA Methods
- id: file:genes/human/SOX2/SOX2-deep-research-falcon.md
title: Deep research report on SOX2 function and mechanisms
- id: UniProtKB:P48431
title: UniProt entry for human SOX2 transcription factor
- id: PMID:16153702
title: Core transcriptional regulatory circuitry in human embryonic stem
cells
- id: PMID:17291498
title: Sox-2 is expressed by glial and progenitor cells and Pax-6 is
expressed by neuroblasts in the human subventricular zone.
- id: PMID:17982423
title: Stem cell markers in the human posterior limbus and corneal
endothelium of unwounded and wounded corneas.
- id: PMID:18000303
title: Evidence for the presence of stem cell-like progenitor cells in human
adult pancreas.
- id: PMID:18027866
title: Aberrant expression of SOX2 upregulates MUC5AC gastric foveolar mucin
in mucinous cancers of the colorectum and related lesions.
- id: PMID:18187129
title: Forced expression of Sox2 or Nanog in human bone marrow derived
mesenchymal stem cells maintains their expansion and differentiation
capabilities.
- id: PMID:18268498
title: SOX2 is frequently downregulated in gastric cancers and inhibits cell
growth through cell-cycle arrest and apoptosis.
- id: PMID:18285410
title: SOX2 plays a critical role in the pituitary, forebrain, and eye
during human embryonic development.
- id: PMID:18388306
title: Regulation of self-renewal and pluripotency by Sox2 in human
embryonic stem cells.
- id: PMID:18407919
title: Novel mutations in LHX3 are associated with hypopituitarism and
sensorineural hearing loss.
- id: PMID:18456656
title: The molecular mechanism governing the oncogenic potential of SOX2 in
breast cancer.
- id: PMID:18710938
title: Oct4/Sox2-regulated miR-302 targets cyclin D1 in human embryonic stem
cells.
- id: PMID:19409607
title: MicroRNA-145 regulates OCT4, SOX2, and KLF4 and represses
pluripotency in human embryonic stem cells.
- id: PMID:19427902
title: Purification of human transcription factors Nanog and Sox2, each in
complex with Skp, an Escherichia coli periplasmic chaperone.
- id: PMID:19736317
title: 'Interplay of Oct4 with Sox2 and Sox17: a molecular switch from stem cell
pluripotency to specifying a cardiac fate.'
- id: PMID:20713518
title: Foxp1 coordinates cardiomyocyte proliferation through both
cell-autonomous and nonautonomous mechanisms.
- id: PMID:21245162
title: Pluripotency factors regulate definitive endoderm specification
through eomesodermin.
- id: PMID:23667531
title: The SOX2-interactome in brain cancer cells identifies the requirement
of MSI2 and USP9X for the growth of brain tumor cells.
- id: PMID:24048479
title: Deterministic direct reprogramming of somatic cells to pluripotency.
- id: PMID:24525231
title: The PRKCI and SOX2 oncogenes are coamplified and cooperate to
activate Hedgehog signaling in lung squamous cell carcinoma.
- id: PMID:25416956
title: A proteome-scale map of the human interactome network.
- id: PMID:25702638
title: The L1TD1 protein interactome reveals the importance of
post-transcriptional regulation in human pluripotency.
- id: PMID:27334688
title: Oncogenic Sox2 regulates and cooperates with VRK1 in cell cycle
progression and differentiation.
- id: PMID:29253717
title: Regulation of ADAM10 by miR-140-5p and potential relevance for
Alzheimer's disease.
- id: PMID:29358331
title: LSD1 demethylase and the methyl-binding protein PHF20L1 prevent SET7
methyltransferase-dependent proteolysis of the stem-cell protein SOX2.
- id: PMID:30442713
title: Proteolysis of methylated SOX2 protein is regulated by L3MBTL3 and
CRL4(DCAF5) ubiquitin ligase.
- id: PMID:31515488
title: Extensive disruption of protein interactions by genetic variants
across the allele frequency spectrum in human populations.
- id: PMID:7849401
title: The cDNA sequence and chromosomal location of the human SOX2 gene.
- id: Reactome:R-HSA-1112609
title: Reactome pathway annotation
- id: Reactome:R-HSA-2972948
title: POU5F1 (OCT4), SOX2, NANOG bind GSC promoter
- id: Reactome:R-HSA-2972949
title: POU5F1 (OCT4), SOX2, NANOG bind EOMES promoter
- id: Reactome:R-HSA-2972956
title: POU5F1 (OCT4), SOX2, NANOG bind STAT3 promoter
- id: Reactome:R-HSA-2972960
title: POU5F1 (OCT4), SOX2, NANOG bind FGF2 promoter
- id: Reactome:R-HSA-2972962
title: POU5F1 (OCT4), SOX2, NANOG bind TDGF1 promoter
- id: Reactome:R-HSA-2972964
title: POU5F1 (OCT4), SOX2, NANOG bind FOXD3 promoter
- id: Reactome:R-HSA-2972965
title: POU5F1 (OCT4), SOX2, NANOG bind TSC22D1 promoter
- id: Reactome:R-HSA-2972966
title: POU5F1 (OCT4), SOX2, NANOG bind HHEX promoter
- id: Reactome:R-HSA-2972967
title: POU5F1 (OCT4), SOX2, NANOG bind EPHA1 promoter
- id: Reactome:R-HSA-2972975
title: POU5F1 (OCT4), SOX2, NANOG bind SALL1 promoter
- id: Reactome:R-HSA-2972977
title: POU5F1 (OCT4), SOX2, NANOG bind DKK1 promoter
- id: Reactome:R-HSA-2972978
title: POU5F1 (OCT4), SOX2, NANOG bind ZIC3 promoter
- id: Reactome:R-HSA-2972979
title: POU5F1 (OCT4), NANOG bind CDX2 promoter
- id: Reactome:R-HSA-452894
title: Expression of SOX2
- id: Reactome:R-HSA-480204
title: POU5F1 (OCT4), SOX2, NANOG, KLF4, PBX1, SMAD2 bind NANOG promoter
- id: Reactome:R-HSA-480685
title: POU5F1 (OCT4), SOX2, NANOG bind SOX2 promoter
- id: Reactome:R-HSA-5626938
title: Beta-catenin binds SOX proteins
- id: Reactome:R-HSA-6790036
title: Expression of STAT3-upregulated nuclear proteins
- id: Reactome:R-HSA-6800120
title: POU5F1 (OCT4), SOX2, NANOG bind DPPA4 gene
- id: Reactome:R-HSA-9732125
title: SOX2 binds MITF promoter
- id: Reactome:R-HSA-9833066
title: SOX2 binds ZEB2 gene
- id: Reactome:R-HSA-9926523
title: MITF-M-dependent SOX2 gene expression