Epiphycan (EPYC) is a Class III small leucine-rich proteoglycan (SLRP) that functions as a secreted extracellular matrix component primarily regulating collagen fibrillogenesis and matrix organization in cartilage and other connective tissues. The protein contains six leucine-rich repeat (LRR) domains and is post-translationally modified with dermatan sulfate glycosaminoglycan chains. EPYC's core functions include: (1) regulating collagen type II fibril assembly in cartilage by binding collagen fibrils and controlling lateral growth; (2) organizing three-dimensional extracellular matrix architecture through interactions with collagen, hyaluronan, and other proteoglycans; and (3) modulating growth factor signaling by binding and sequestering TGF-β and BMPs. The protein is essential for maintaining articular cartilage integrity and preventing osteoarthritis, as demonstrated by progressive joint degeneration in EPYC-deficient mice. EPYC is expressed primarily in skeletal tissues including growth plate cartilage, articular cartilage, and intervertebral discs, as well as in ligaments and placenta
| GO Term | Evidence | Action | Reason |
|---|---|---|---|
|
GO:0031012
extracellular matrix
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: This annotation is correct and represents the core localization of EPYC. Epiphycan is a
secreted proteoglycan with an N-terminal signal peptide (aa 1-19) that directs it to the
extracellular matrix where it functions exclusively. The protein is integrated into the
organized three-dimensional scaffold of the ECM through interactions with collagen fibrils,
hyaluronan, and other matrix components [file:human/EPYC/EPYC-deep-research-perplexity.md
"functions exclusively in extracellular contexts...becomes integrated into the organized
three-dimensional scaffold of the extracellular matrix"]. This is a fundamental and essential
annotation for EPYC function.
Supporting Evidence:
file:human/EPYC/EPYC-deep-research-perplexity.md
provider: perplexity
|
|
GO:0060348
bone development
|
IBA
GO_REF:0000033 |
KEEP AS NON CORE |
Summary: This annotation is supported but represents a non-core function. EPYC does influence bone
development through mechanisms involving matrix organization and growth factor sequestration
during endochondral ossification [file:human/EPYC/EPYC-deep-research-perplexity.md
"protein influences bone development...during endochondral ossification"]. However, EPYC's
primary role is in cartilage matrix organization, with bone development being a secondary
consequence. EPYC-deficient mice show relatively subtle skeletal abnormalities early in
development with age-dependent changes, indicating it's not essential for initial bone
formation but contributes to skeletal homeostasis.
|
|
GO:0061975
articular cartilage development
|
IBA
GO_REF:0000033 |
ACCEPT |
Summary: This annotation is correct and represents a core biological process for EPYC. Epiphycan is
essential for articular cartilage development and maintenance. EPYC is highly expressed in
articular cartilage and is critical for maintaining joint integrity. EPYC-deficient mice
develop progressive osteoarthritis with articular cartilage erosions and proteoglycan loss,
demonstrating its essential role [file:human/EPYC/EPYC-deep-research-perplexity.md
"essential for maintaining articular cartilage integrity...EPYC-deficient mice develop
osteoarthritis spontaneously with age...articular cartilage erosions"]. This is a fundamental
function of EPYC.
|
|
GO:0031012
extracellular matrix
|
IEA
GO_REF:0000107 |
ACCEPT |
Summary: Extracellular matrix localization is correct for EPYC. While this duplicates the IBA annotation, both provide support for this cellular component.
Reason: EPYC is a component of the extracellular matrix, particularly in cartilage. Multiple evidence codes supporting the same correct annotation provide additional confidence.
|
|
GO:0005539
glycosaminoglycan binding
|
TAS
PMID:8975717 Characterization of human DSPG3, a small dermatan sulfate pr... |
REMOVE |
Summary: This annotation is problematic. EPYC itself IS a dermatan sulfate proteoglycan - it carries
GAG chains as post-translational modifications [PMID:8975717 "small dermatan sulfate
proteoglycan"]. The term "glycosaminoglycan binding" suggests EPYC binds to GAGs as ligands,
which is not its primary characterized function. EPYC's GAG chains mediate interactions with
other molecules (collagen, growth factors), but "glycosaminoglycan binding" is not an accurate
description of EPYC's molecular function. This should be removed or replaced with a more
accurate term like "extracellular matrix structural constituent" or "collagen binding".
Supporting Evidence:
PMID:8975717
Characterization of human DSPG3, a small dermatan sulfate proteoglycan.
|
|
GO:0007565
female pregnancy
|
TAS
PMID:8975717 Characterization of human DSPG3, a small dermatan sulfate pr... |
MARK AS OVER ANNOTATED |
Summary: This annotation is based solely on expression evidence - EPYC is expressed in placental
tissue [PMID:8975717 "expressed in cartilage, as well as ligament and placental tissues"].
However, mere expression in placenta does not establish a specific role in pregnancy. The
term "female pregnancy" is too broad and doesn't describe EPYC's actual molecular or cellular
function. EPYC likely plays a structural role in placental ECM organization, but this hasn't
been experimentally demonstrated. This is an over-annotation based on tissue expression alone.
Supporting Evidence:
PMID:8975717
Characterization of human DSPG3, a small dermatan sulfate proteoglycan.
|
|
GO:0005518
collagen binding
|
TAS
file:human/EPYC/EPYC-deep-research-perplexity.md |
NEW |
Summary: This is a core molecular function of EPYC. The protein binds to collagen type II fibrils
via its leucine-rich repeat domains and regulates collagen fibril organization in cartilage.
This is well-supported by structural and functional studies showing EPYC's role in controlling
collagen fibrillogenesis through direct binding interactions.
|
|
GO:0030199
collagen fibril organization
|
TAS
file:human/EPYC/EPYC-deep-research-perplexity.md |
NEW |
Summary: This is a core biological process for EPYC. The protein regulates collagen fibril assembly
by controlling lateral fibril growth through steric hindrance, maintaining optimal
intermolecular spacing. This is essential for maintaining proper cartilage matrix properties.
|
|
GO:0005201
extracellular matrix structural constituent
|
TAS
file:human/EPYC/EPYC-deep-research-perplexity.md |
NEW |
Summary: This accurately describes EPYC's role as a structural component of the ECM. EPYC serves
as an organizational node that facilitates interactions between collagen fibrils, hyaluronan,
and other proteoglycans. This is more accurate than "glycosaminoglycan binding" for describing
EPYC's structural function.
|
|
GO:0030198
extracellular matrix organization
|
TAS
file:human/EPYC/EPYC-deep-research-perplexity.md |
NEW |
Summary: This is a core biological process for EPYC. The protein organizes the three-dimensional
architecture of the ECM through multiple interactions with matrix components, contributing
to tissue homeostasis and mechanical properties.
|
|
GO:0050431
transforming growth factor beta binding
|
TAS
file:human/EPYC/EPYC-deep-research-perplexity.md |
NEW |
Summary: This is a core regulatory function of EPYC. The protein binds and sequesters TGF-β isoforms
and BMPs through its dermatan sulfate GAG chains, reducing their bioavailability and modulating
downstream signaling pathways that regulate matrix remodeling and cell differentiation.
|
provider: perplexity
model: sonar-deep-research
cached: false
start_time: '2025-11-11T18:37:01.188560'
end_time: '2025-11-11T18:39:22.097219'
duration_seconds: 140.91
template_file: templates/gene_research_go_focused.md
template_variables:
organism: human
gene_id: EPYC
gene_symbol: EPYC
provider_config:
timeout: 600
max_retries: 3
parameters:
allowed_domains: []
reasoning_effort: medium
search_domain_filter: []
return_citations: true
temperature: 0.0
citation_count: 46
Please provide a comprehensive research report on the gene EPYC (EPYC) in human.
The research report should be a detailed narrative explaining the function, biological processes, and localization of the gene product. Citations should be given for all claims.
You should prioritize authoritative reviews and primary scientific literature when conducting research. You can supplement
this with annotations you find in gene/protein databases, but these can be outdated or inaccurate.
We are specifically interested in the primary function of the gene - for enzymes, what reaction is catalyzed, and what is the substrate specificity? For transporters, what is the substrate? For structural proteins or adapters, what is the broader structural role? For signaling molecules, what is the role in the pathway.
We are interested in where in or outside the cell the gene product carries out its function.
We are also interested in the signaling or biochemical pathways in which the gene functions. We are less interested in broad pleiotropic effects, except where these elucidate the precise role.
Include evidence where possible. We are interested in both experimental evidence as well as inference from structure, evolution, or bioinformatic analysis. Precise studies should be prioritized over high-throughput, where available.
Let me analyze the user's query and the provided search results to create a comprehensive research report on the EPYC gene and its protein product, epiphycan.
From planning my approach:
It's composed of seven exons
Biological Function:
Involved in tissue development and repair
Structure:
Signal peptide for secretion
Localization:
Can be found in all extracellular matrices
Signaling Pathways:
Modulates growth factor signaling through matrix protein interactions
Disease Associations:
Rheumatoid arthritis
Protein Interactions:
, particularly with collagen II
- Growth factor engagement including TGF-β and BMPs
- PLCG2 and CRK signaling molecules
- Other SLRP family members
- Cell surface receptors via integrins
The protein's functional role emerges through its capacity to bind collagen II and interact with growth factors like TGF-β and BMPs, while also engaging signaling molecules such as PLCG2 and CRK. Post-translational modifications including glycosylation and sulfation regulate its structural integrity and binding capacity, making these chemical modifications critical for its biological activity.
For the comprehensive research report, I'm organizing the content into distinct sections that build from molecular structure through tissue distribution to functional mechanisms. The framework begins with foundational concepts about epiphycan within the SLRP family, progresses through its structural architecture including leucine-rich repeats and glycosaminoglycan chains, then addresses its localization patterns and developmental expression. The subsequent sections examine its roles in collagen fibrillogenesis and cartilage development, followed by detailed analysis of signaling pathways involving growth factors and intracellular cascades like PI3K/AKT signaling. This structure maintains narrative continuity while reaching the target length through expanded discussion of each mechanism and its biological significance.
Epiphycan (EPYC), also known as dermatan sulfate proteoglycan 3 (DSPG3), represents a significant member of the small leucine-rich proteoglycan (SLRP) family with essential roles in extracellular matrix organization, tissue development, and homeostasis. Located on chromosome 12 and encoding a 309-amino acid protein with a molecular mass of approximately 35.5 kilodaltons, EPYC functions primarily as a modulator of collagen fibrillogenesis and a regulator of cell signaling through multiple pathways including transforming growth factor-beta (TGF-β), phosphatidylinositol 3-kinase/protein kinase B (PI3K/AKT), and focal adhesion signaling[6][13][16]. The protein is secreted as a decorated extracellular matrix component bearing chondroitin/dermatan sulfate glycosaminoglycan chains, through which it orchestrates critical interactions between cells and their matrix environment[2][3]. Recent research has revealed diverse roles for EPYC extending from cartilage development and osteoarthritis pathogenesis to tumor progression in pancreatic and ovarian cancers, establishing it as both a fundamental regulator of tissue architecture and an emerging biomarker for disease prognosis[3][47][55]. This comprehensive report synthesizes current knowledge of EPYC's structural features, localization patterns, biochemical functions, signaling mechanisms, and clinical relevance.
The small leucine-rich proteoglycan family represents the largest proteoglycan family in humans, comprising eighteen distinct members organized into five distinct classes based on their evolutionary conservation, genomic organization, and structural features[7][52]. These molecules are defined fundamentally by the presence of tandem leucine-rich repeat (LRR) domains flanked by N- and C-terminal cysteine-rich clusters, which create the characteristic curved solenoid structure characteristic of all SLRP family members[7]. The core protein domains of SLRPs typically range from 36 to 77 kilodaltons and may be substituted with various types of glycosaminoglycan (GAG) side chains, including chondroitin sulfate (CS), dermatan sulfate (DS), and keratan sulfate (KS), although some SLRP members exist as non-glycosylated proteins depending on tissue context and developmental stage[7][29]. Because SLRPs are present in virtually all extracellular matrices of bilaterian animals and exhibit wide tissue distribution, they represent ancient and highly conserved molecules with deep evolutionary significance[2]. The functional diversity of this protein family emerges not merely from amino acid sequence variation but fundamentally from the differential organization of their leucine-rich repeat domains, their capacity for post-translational modification, and their tissue-specific regulation of glycosaminoglycan substitution[2][7].
Epiphycan occupies a specific position within SLRP classification as a Class III member alongside opticin and osteoglycin (mimecan)[7][8][50]. The EPYC gene comprises seven exons organized on chromosome 12, position 12q21.33, with a genomic span of approximately 41.3 kilobases[13][48]. The resulting protein contains between six and eight leucine-rich repeat domains, which represents the defining characteristic of Class III SLRPs and distinguishes them from the more elaborate Class I and II members carrying ten to twelve LRRs[7][8]. Each individual leucine-rich repeat in EPYC spans approximately twenty to twenty-nine amino acids and consists of highly conserved segments displaying characteristic patterns such as LxxLxLxxNxL and variable segments that form diverse secondary structures including alpha-helices and beta-turns[19]. The curved solenoid architecture arising from these repeating structures creates a characteristic molecular shape wherein the concave inner surface serves as the ligand-binding interface, while the convex exterior provides structural scaffolding[19]. The N- and C-terminal regions flanking the LRR domains are rich in cysteine residues that form disulfide bonds, creating terminal caps essential for protein stability and proper three-dimensional folding[2][7].
The epiphycan protein is synthesized as a precursor containing an N-terminal signal peptide approximately twenty amino acids in length, which directs the nascent polypeptide chain to the endoplasmic reticulum during translation and is subsequently cleaved to yield the mature secreted protein[41]. Following synthesis, EPYC undergoes extensive post-translational modification within the secretory pathway, with glycosaminoglycan chain attachment occurring primarily through the Golgi apparatus via coordinated enzymatic reactions[7][37]. The protein accepts chondroitin and dermatan sulfate GAG chains as post-translational modifications, with these structures being constructed from repeating disaccharide units comprising N-acetylgalactosamine and uronic acid residues[27][30]. The specific composition and sulfation pattern of these GAG chains undergo further enzymatic processing including epimerization of glucuronic acid to iduronic acid and sulfation at specific positions on both the hexosamine and uronic acid components, reactions catalyzed by dedicated sulfotransferase and epimerase enzymes that confer functional diversity to the modified proteoglycan[27][37].
Post-translational modifications beyond GAG attachment include N-linked and O-linked glycosylation through mannose and glucose residues, modifications that prove critical for protein folding, structural stability, and proper presentation of ligand-binding epitopes[37]. Tyrosine residues within specific regions of EPYC undergo sulfation, a relatively common but underappreciated post-translational modification catalyzed by tyrosylprotein sulfotransferase in the trans-Golgi network[37][40]. These tyrosine sulfations appear particularly important in Class II SLRPs such as fibromodulin but require further investigation in Class III members like EPYC[37]. The overall modification state of epiphycan varies significantly depending on the tissue source, developmental stage, aging, and pathological context, reflecting dynamic regulation of the enzymatic machinery responsible for GAG biosynthesis[7][37]. This dynamic composition explains in part how a single protein core can function in remarkably diverse biological contexts and disease states.
EPYC functions exclusively in extracellular contexts, with the protein predicted and annotated as secreted based on its N-terminal signal peptide sequence and absence of transmembrane domains[41]. The secretion process delivers epiphycan to the extracellular space where it assembles into complex three-dimensional matrices alongside other proteoglycans, structural proteins including collagens and elastin, and various glycoproteins that together constitute the functional extracellular matrix[2][6][9]. The mature protein contains no intracellular localization signals and demonstrates no evidence of intracellular accumulation in normal physiological conditions, consistent with its role as a purely secreted matrix molecule[41][6]. Following secretion, EPYC becomes integrated into the organized three-dimensional scaffold of the extracellular matrix through multiple molecular interactions with collagen fibrils, hyaluronan, and other matrix components, positions that prove critical for its regulatory functions[2][9].
EPYC shows preferential expression in skeletal tissues, with particularly high levels in the growth plate cartilage, articular cartilage, and intervertebral discs where it participates in establishing and maintaining the ordered matrix architecture essential for skeletal function[6][24]. During embryonic development, epiphycan mRNA transcripts appear initially around embryonic day 12.5 in developing cartilage elements and exhibit a tightly spatially and temporally regulated expression pattern during chondrogenesis[25]. The expression timing of EPYC relative to other cartilage markers reveals important developmental relationships, with EPYC mRNA expression occurring later than type II collagen, which marks early chondrogenic differentiation, but appearing in the growth plate before and being excluded from the zone of hypertrophic chondrocytes expressing collagen type X[25]. Protein localization studies using immunofluorescence histochemistry demonstrate that EPYC protein persists throughout all zones of growth plate cartilage, including the hypertrophic zone, even in regions where the gene is no longer actively transcribed[25][28]. This discordance between mRNA and protein localization patterns suggests that epiphycan protein accumulates in the extracellular matrix and persists there during tissue remodeling and cartilage maturation.
Beyond skeletal tissues, EPYC distribution extends to virtually all tissues containing specialized extracellular matrices, including fibrous tissues such as tendons, ligaments, and the cornea, and to the encapsulating matrices of major organs including the meninges and pericardium[7][29]. The widespread distribution of EPYC across diverse connective tissues, combined with its particularly high expression in fibrous tissues, reflects the fundamental importance of this molecule in organizing and regulating extracellular matrix properties across multiple tissue types[7]. However, the expression level and GAG modification state of EPYC varies substantially between different tissues, with such variation likely reflecting tissue-specific functional requirements and developmental history[6][7]. Single-cell expression analysis indicates that EPYC production occurs in chondrocytes of cartilage tissues and in fibroblasts of connective tissues, with these specialized matrix-producing cells serving as the primary sources of EPYC protein in their respective tissues[41].
The primary and best-characterized function of EPYC involves the regulation of collagen fibrillogenesis and the maintenance of ordered collagen matrix architecture, functions shared with other SLRP family members but with specific characteristics reflecting EPYC's unique structural features and biochemical properties[8][24][29]. Collagen fibrils form through a multistep process beginning with self-assembly of collagen triple helix monomers followed by lateral association and cross-linking of individual fibrils into larger organized fiber bundles, processes that generate the mechanical properties essential for tissue function[9][26]. In this assembly process, EPYC and other SLRPs function as regulators, controlling the kinetics of fibril formation, limiting lateral growth of individual fibrils, and maintaining appropriate intermolecular spacing between adjacent fibrils[7][29][31]. Through direct binding interactions with collagen fibrils, particularly with type II collagen present in cartilage and other collagen types in different tissues, EPYC exerts steric hindrance effects that prevent excessive lateral fusion of fibril structures[24][31]. This regulatory function proves essential for maintaining the optimal biomechanical properties of tissues, as aberrant collagen fibrillogenesis results in structural defects affecting tissue stiffness, elasticity, and load-bearing capacity[31].
The role of EPYC in collagen regulation becomes evident through analysis of knockout mouse models wherein EPYC-deficient mice display minimal developmental abnormalities at birth but develop progressive osteoarthritis with age, with the severity of degeneration increasing dramatically in mice deficient in both EPYC and biglycan, another SLRP family member[8][24][33]. Histological examination of cartilage from these doubly deficient animals reveals increased collagen loss, particularly in superficial zones of articular cartilage, associated with a deficiency of superficial chondrocytes and profound disruption of normal joint architecture[24]. These findings demonstrate that EPYC, while not absolutely essential for embryonic development, proves critical for the long-term maintenance of joint integrity and the prevention of cartilage degeneration under the mechanical stresses imposed during normal skeletal loading[8][24][33].
Beyond its role in regulating the physical assembly of collagen matrices, EPYC participates in organizing the three-dimensional architecture of the extracellular matrix through interactions with multiple other matrix components[3][6]. The proteoglycan functions as an organization node, facilitating interactions between collagen fibrils, hyaluronic acid, link proteins, and other proteoglycans that together generate the complex viscoelastic properties characteristic of functional cartilage and other connective tissues[2][9][34]. Through these multi-molecular interactions, EPYC contributes to what is often termed "matrix organization," a process that involves the spatial arrangement of matrix molecules into their proper three-dimensional configuration essential for tissue function[3]. This organizational role extends to regulating the accessibility of matrix components to proteolytic degradation by matrix metalloproteinases and other enzymes, a function with clear implications for understanding tissue remodeling and pathological matrix degradation observed in diseases such as osteoarthritis[26][36].
EPYC demonstrates important functions beyond simple structural organization, with documented roles in bone formation and in establishing the conditions necessary for normal chondrogenesis[4][6]. The protein influences bone development through mechanisms that may involve both direct effects on osteoblast function and indirect effects mediated through matrix organization and sequestration of bone-active growth factors[6]. During endochondral ossification, the developmental process through which cartilage serves as a temporary template for bone formation, EPYC participates in the remodeling of epiphyseal cartilage preceding the formation of secondary ossification centers[15]. The protein may contribute to this process through its interactions with transforming growth factor-beta (TGF-β) family members, including bone morphogenetic proteins (BMPs), which regulate both chondrocyte differentiation and osteoblast maturation[6][14].
The role of EPYC in bone formation becomes apparent through examination of knockout mouse phenotypes, which while showing relatively subtle skeletal abnormalities early in development, display progressive changes in long bone morphometry with age, including effects on femur length in males as assessed at nine months of age[8][24]. These age-dependent skeletal phenotypes suggest that EPYC functions in the maintenance of normal bone homeostasis and the prevention of age-related skeletal changes rather than serving as an absolutely required factor for bone formation itself[8]. The protein may function by regulating local growth factor availability within bone matrix compartments or by maintaining the mechanical properties of the bone matrix necessary for proper skeletal loading and remodeling[6].
A major class of EPYC functions involves the binding and sequestration of soluble growth factors and cytokines, particularly the transforming growth factor-beta superfamily and bone morphogenetic proteins[3][6][29]. Like other SLRP family members, EPYC binds TGF-β isoforms through a mechanism that reduces TGF-β bioavailability and modulates its signaling activity in a dose-dependent manner[26][29]. This function proves particularly significant in tissues where TGF-β signaling drives pathological fibrosis or inflammation, a context in which SLRP-mediated sequestration of TGF-β provides a biological brake on excess fibrogenic signaling[26][29]. The chondroitin and dermatan sulfate GAG chains attached to EPYC contribute substantially to these growth factor interactions, with the sulfated structures serving as high-affinity ligands for multiple bioactive molecules[27][30].
The interaction between EPYC and growth factors extends beyond simple sequestration to include functional modulation of signaling outcomes. Through binding to growth factors or to their cognate receptors, EPYC can either attenuate or enhance specific signaling pathways depending on the cellular context and the nature of the EPYC-growth factor-receptor complex[3][6]. Recent evidence suggests that EPYC may interact with specific signaling molecules including phospholipase Cγ2 (PLCG2) and the adaptor protein CRK, both implicated in signal transduction cascades activated by receptor tyrosine kinases and non-receptor tyrosine kinases[51]. These interactions position EPYC not merely as a passive sequestrant of diffusible signals but as an active participant in cellular decision-making regarding proliferation, survival, and differentiation. The interaction with CRK protein, a known effector in insulin-like growth factor (IGF) signaling pathways, suggests potential cross-talk between ECM-resident EPYC and growth factor receptor signaling cascades that regulate cell survival and metabolism[51].
Evidence from multiple sources demonstrates that EPYC dysfunction associates with osteoarthritis development and progression, a relationship emerging from both genetic studies and functional investigations of EPYC expression in disease tissues. Epiphycan-deficient mice develop osteoarthritis spontaneously with age, with the onset of pathological changes appearing earliest in mice carrying deletions in both EPYC and biglycan, suggesting synergistic effects between these two Class I and Class III SLRP family members[8][24][33]. The osteoarthritis phenotype in EPYC-deficient animals includes classic hallmarks of the disease including articular cartilage erosions, proteoglycan loss from the matrix, subchondral bone sclerosis, and osteophyte growth, morphological changes that closely resemble those observed in human osteoarthritis patients[33].
Mechanistic investigation of EPYC's role in osteoarthritis has identified alterations in matrix protein expression and signaling pathway activation in animals lacking EPYC[8]. Microarray analysis of knee joints from EPYC/biglycan double-deficient mice reveals increased matrix protein expression compared with wild-type mice, including upregulation of other SLRP family members such as asporin, fibromodulin, and lumican[8][24][33]. This compensatory upregulation of alternative SLRPs likely reflects attempts by the tissue to restore normal matrix organization in the face of EPYC deficiency, but these compensatory mechanisms prove insufficient to prevent cartilage degeneration[33]. The expression alterations suggest a model in which EPYC loss results in matrix disorganization that triggers secondary responses including altered transcription of matrix genes and potential activation of catabolic pathways[33].
Gene expression profiling studies employing machine learning approaches identify EPYC as a major feature gene distinguishing osteoarthritis tissue from normal cartilage, with EPYC expression elevated in osteoarthritic tissue samples[55]. Paradoxically, while EPYC knockout mice develop osteoarthritis, osteoarthritic tissue shows elevated EPYC expression, a pattern suggesting that increased EPYC production may represent a reparative response by chondrocytes attempting to restore damaged matrix architecture[55]. This interpretation aligns with the observation that in normal cartilage, EPYC expression remains relatively low and may become transiently upregulated during periods of matrix damage or stress[25][55]. The apparent contradiction between EPYC deficiency causing osteoarthritis and EPYC upregulation in osteoarthritic tissue may reflect distinct roles in prevention of degeneration versus response to matrix damage, a distinction that requires further investigation.
EPYC associations extend beyond skeletal tissues to include roles in corneal development and maintenance, revealed through genetic studies identifying EPYC mutations in patients with corneal dystrophies. Posterior amorphous corneal dystrophy (PACD), a rare autosomal dominant corneal disorder characterized by partial or complete posterior lamellar corneal opacification and decreased corneal thickness, shows genetic linkage to the chromosomal region harboring EPYC[39]. Affected patients exhibit disorganization of posterior corneal stromal lamellae with an abnormal collagenous layer within or adjacent to Descemet's membrane, architectural changes consistent with disrupted collagen matrix organization similar to defects observed when SLRP function is compromised[39]. While no mutations in EPYC were identified in the initially reported PACD family, the candidate gene approach highlighting EPYC alongside other SLRPs including keratocan, lumican, and decorin emphasizes the importance of this proteoglycan family in maintaining corneal architecture[39][26].
EPYC additionally associates with cornea plana (keratoconus), another inherited corneal disorder characterized by corneal flattening and stromal thinning[39][42]. The shared candidate gene status of EPYC with other SLRP family members in corneal dystrophies reflects the universal dependence of corneal transparency and structure on proper collagen matrix organization, a process in which multiple SLRP family members participate redundantly[26][29]. The cornea, with its requirement for exceptional transparency to permit light transmission, demonstrates particular sensitivity to disruptions in matrix organization, such that even modest alterations in collagen fibril diameter or interfibrillar spacing result in clinically apparent stromal opacification[26]. This extreme sensitivity of the cornea to matrix disorganization makes it a sensitive biosensor for SLRP dysfunction and has facilitated the identification of multiple SLRP mutations in corneal disease through molecular genetic studies[26].
Recent research reveals that EPYC expression patterns and functions extend into malignant disease contexts, where the proteoglycan promotes tumor progression and serves as a prognostic indicator of poor outcomes. In pancreatic cancer, integrated bioinformatics analysis of The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) datasets identifies EPYC as an independent risk factor predicting poor prognosis[47][58]. Functional studies demonstrate that EPYC promotes pancreatic cancer cell proliferation through activation of the PI3K-AKT signaling pathway both in cell culture and in vivo tumor models[47][58]. Knockdown of EPYC in pancreatic cancer cell lines reduces cancer cell proliferation, demonstrating direct pro-tumorigenic effects of this ECM proteoglycan[47]. The emergence of EPYC as a prognostic biomarker in pancreatic cancer exemplifies how ECM components traditionally viewed as passive structural molecules now emerge as active regulators of tumor cell behavior.
In ovarian cancer, EPYC expression levels correlate with metastatic potential and clinical outcomes, with EPYC showing higher expression in metastatic ovarian cancer tissue compared to primary tumors, and higher expression in primary cancers compared to normal ovaries[3][44]. Functional interference studies in which EPYC expression is suppressed using small interfering RNA demonstrate that EPYC knockdown reduces ovarian cancer cell migration, invasion, and proliferation, confirming active roles for EPYC in promoting hallmark cancer behaviors[3]. Mechanistically, EPYC appears to promote epithelial-to-mesenchymal transition (EMT), a cellular process associated with increased migratory and invasive capacity through multiple signaling pathways including TGF-β/PI3K-AKT and focal adhesion pathways[3][44]. EPYC-mediated activation of these pathways leads to suppression of the epithelial marker E-cadherin (CDH1) through upregulation of the transcriptional repressor SNAI2, a critical step in EMT[44].
Gene ontology enrichment analysis demonstrates that EPYC primarily functions in extracellular matrix organization, confirming its fundamental role as a matrix protein[3][44]. Pathway enrichment analysis reveals that EPYC-associated gene expression changes activate multiple signaling cascades including PI3K/AKT, focal adhesion, and TGF-β pathways[3][44]. This multiplicity of activated pathways reflects the complex mechanisms through which ECM components influence cancer cell behavior through bidirectional signaling between matrix-resident proteins and cell surface receptors. The findings establish EPYC as an emerging therapeutic target in cancers where ECM-promoted tumor progression contributes to poor outcomes.
The phosphatidylinositol 3-kinase/protein kinase B (PI3K/AKT) signaling cascade represents a major effector pathway activated by EPYC in both cancer and normal physiological contexts[3][47][55]. This pathway initiates when ligands bind to receptor tyrosine kinases or when integrins engage ECM molecules including EPYC-containing matrix through focal adhesion complexes[20][32][35]. Upon receptor activation, PI3K catalyzes the phosphorylation of phosphatidylinositol 4,5-bisphosphate to generate the signaling lipid phosphatidylinositol 3,4,5-trisphosphate (PIP3), which serves as a membrane docking site for proteins containing pleckstrin homology domains, including AKT and its upstream activator PDK1[20][23]. Once recruited to the membrane, AKT becomes phosphorylated and activated, whereupon it phosphorylates multiple downstream effectors regulating cell survival, proliferation, metabolism, and cytoskeletal dynamics[20][23]. In the context of EPYC signaling, PI3K/AKT activation promotes cell survival through suppression of pro-apoptotic proteins and cell proliferation through modulation of cell cycle checkpoint proteins[47][58].
The focal adhesion pathway constitutes another major signaling cascade activated downstream of EPYC-ECM interactions, with focal adhesions forming complex multi-protein assemblies at sites where cell surface integrins engage ECM ligands[20][32][35]. Within focal adhesions, the non-receptor tyrosine kinase focal adhesion kinase (FAK) becomes activated through interactions with integrin cytoplasmic domains, leading to FAK autophosphorylation at tyrosine 397 and subsequent recruitment and activation of Src-family kinases[32][35]. FAK and Src then phosphorylate multiple adaptor proteins including paxillin, p130Cas, and Talin, creating a signaling scaffold that activates downstream pathways regulating cell migration, survival, and proliferation[20][32][35]. These FAK-Src signaling complexes activate additional kinases including the integrin-linked kinase (ILK), which further propagates signals through multiple effector pathways[20][32][35]. The activation of focal adhesion kinase downstream of EPYC-integrin interactions demonstrates how ECM-resident proteoglycans participate in bidirectional signaling between cells and their matrix environment.
EPYC plays crucial roles in developmental processes involving tissue remodeling and matrix reorganization, functions reflecting the general dependence of development on properly regulated ECM assembly and proteoglycan function[6][14][15]. During embryonic skeletal development, EPYC expression precisely correlates with stages of chondrogenesis and endochondral ossification, appearing in developing cartilage elements in the precise temporal and spatial pattern necessary for proper tissue organization[25]. The protein emerges as an important intermediate marker for chondrogenesis, appearing later than early markers of chondrocyte differentiation such as type II collagen but before terminal differentiation markers[25]. This intermediate expression timing positions EPYC in a critical developmental window when cartilage tissue requires specialized matrix organization distinct from both early cartilage anlagen and mature, mineralized cartilage.
The regulatory factors controlling EPYC expression during development remain incompletely characterized but likely include both transcriptional regulation responsive to developmental signals and post-transcriptional regulation of GAG modification state. The growth plate cartilage, a tissue continuously undergoing developmental remodeling through zones of resting, proliferating, and hypertrophic chondrocytes, shows EPYC protein localization throughout its organized structure even though the EPYC gene becomes progressively downregulated as chondrocytes differentiate toward terminal hypertrophy[25][28]. This persistence of EPYC protein in matrix compartments despite reduced transcription suggests accumulation of long-lived matrix-resident EPYC that persists through tissue remodeling cycles and may provide substrate for matrix repair mechanisms. The developmental remodeling of EPYC expression thus reflects complex interplay between synthetic, secretory, and degradative processes governing matrix homeostasis.
EPYC functions within the context of a larger proteoglycan family displaying both functional redundancy and functional specialization, characteristics that emerge from structural similarities combined with evolutionarily divergent sequences and regulatory patterns[7][29][31][52]. The Class III SLRP family, comprising EPYC, opticin, and osteoglycin (mimecan), represents a distinct lineage within the broader SLRP family tree, with sequence and structural characteristics distinguishing these molecules from Class I and II members[7][50][52]. While all SLRPs share fundamental structural organization based on leucine-rich repeat domains and typically interact with collagen matrices to regulate fibrillogenesis, individual SLRP members display specialized functions reflecting their distinct binding specificities and tissue expression patterns[7][29].
Evidence for functional redundancy among SLRP family members emerges from studies of doubly deficient knockout mice, wherein deletion of both EPYC and biglycan produces a phenotype more severe than deletion of either molecule alone[8][24][33]. The enhanced osteoarthritis phenotype in EPYC/biglycan double-deficient mice compared to singly deficient animals suggests that while individual SLRPs are partly dispensable due to compensatory activity of other family members, combined loss of multiple SLRPs reveals their essential aggregate function in joint maintenance[33]. Moreover, molecular characterization of joint tissues from doubly deficient mice reveals compensatory upregulation of other SLRP family members including asporin, fibromodulin, and lumican, demonstrating that tissues attempt to restore normal SLRP levels when individual members are deleted[8][24][33]. This compensation appears partially effective in delaying osteoarthritis onset but ultimately insufficient to prevent age-related cartilage degeneration, suggesting that individual SLRPs possess at least partially specialized functions not entirely redundant with their family members[33].
EPYC demonstrates particular functional relationships with Class I SLRPs like biglycan and decorin through shared interactions with TGF-β and other growth factors, interactions that influence wound healing, tissue remodeling, and fibrosis[26][29][31]. However, Class III SLRPs like EPYC differ from Class I and II members in their GAG composition, with EPYC carrying chondroitin/dermatan sulfate chains while Class II SLRPs typically bear keratan sulfate modifications, differences that confer distinct ligand-binding specificities and functional properties[7][50]. The evolutionary divergence of SLRP family members, evident from phylogenetic analysis showing distinct branches separating different SLRP classes, suggests that specialization for distinct tissue roles occurred during vertebrate evolution, with different SLRP members becoming adapted for particular connective tissue types[52].
The biological activity of EPYC depends critically on post-translational modifications including glycosaminoglycan chain attachment and sulfation, modifications that substantially alter the proteoglycan's biochemical and biophysical properties[7][37]. The composition of attached GAG chains, including the degree of sulfation and the specific positioning of sulfate groups on hexosamine and uronic acid residues, directly influences EPYC's capacity to interact with growth factors, matrix molecules, and cell surface receptors[27][30][37]. This structural variability in GAG modification emerges from regulated expression of sulfotransferase and epimerase enzymes that catalyze the post-translational modifications of GAG chains within the Golgi apparatus and extracellular space[27][37]. Remarkably, a single EPYC core protein can exist in multiple biochemically and functionally distinct forms depending on the degree and pattern of GAG sulfation, a phenomenon that introduces substantial complexity into proteoglycan function[7][37].
The sulfation patterns of GAG chains attached to EPYC prove especially important for the protein's growth factor-binding activities, with specific sulfate positioning enhancing affinity for molecules like transforming growth factor-beta and determining whether EPYC acts as a high-affinity ligand sequestering growth factors or a lower-affinity scaffold permitting growth factor access to cell surface receptors[26][29][30]. Sulfated GAG chains also provide charged environments that influence local pH and ion concentrations in the perifibrillar matrix compartment surrounding collagen fibrils, biophysical properties that may influence both collagen stability and cell behavior[9][21]. The variable GAG sulfation state of EPYC in different tissues and developmental stages suggests that fine-tuning of proteoglycan post-translational modification represents a regulatory mechanism allowing the EPYC core protein to assume diverse functional roles depending on biosynthetic and enzymatic context[7][37].
EPYC represents an evolutionarily conserved protein, with homologous molecules present across vertebrate species and even certain invertebrate representatives, suggesting ancient biological importance[49][52]. Phylogenetic analysis reveals that vertebrate SLRP family members, including EPYC alongside opticin and osteoglycin, cluster within a distinct evolutionary lineage, the Class III SLRP clade that likely arose through gene duplication and divergence from ancestral SLRP sequences[49][52]. The presence of characteristic structural features including the defining six to eight leucine-rich repeats and specific cysteine-rich N-terminal domains distinguishing Class III members indicates that EPYC evolved specialized functions adapted to distinct tissue requirements[49][52]. During teleost fish evolution, EPYC remained as a single gene copy per haploid genome, unlike certain other SLRP members that underwent whole-genome duplication events creating multiple gene copies with subfunctionalization[49]. This single-copy retention through vertebrate evolution suggests that EPYC has maintained a specialized biological role that does not benefit from gene duplication and functional specialization observed in other SLRP family members.
The conservation of EPYC protein sequence across mammalian species reflects functional constraints on amino acid positions critical for ligand binding, structural stability, and interactions with other matrix molecules[49][52]. Comparative genomic analysis reveals that the EPYC gene structure, including the pattern of exons and introns, shows remarkable conservation across mammalian species, a finding consistent with purifying selection against changes that would disrupt proper gene expression or protein function[49]. The evolutionary conservation of EPYC extends to regulatory regions controlling the gene's expression in specific tissues and developmental stages, suggesting that temporal and spatial patterns of EPYC expression have been optimized through evolution for proper skeletal development and tissue homeostasis[49][52].
The molecular mechanisms through which EPYC regulates collagen fibrillogenesis involve direct binding interactions between the EPYC protein core and collagen molecules, interactions mediated by the leucine-rich repeat domains serving as the binding interface[7][29][26]. Biochemical studies employing collagen-proteoglycan complex formation assays and surface plasmon resonance measurements reveal specific binding sites for SLRP interaction with collagen molecules, with certain leucine-rich repeat domains showing particularly high-affinity interactions[7]. For EPYC and other Class III SLRPs, the binding site on collagen involves specific regions of the collagen triple helix that become accessible as the collagen molecules assemble into progressively larger fibrillar structures[29]. The binding of EPYC to collagen fibrils affects the kinetics of collagen fibrillogenesis, with proteomic and biophysical studies demonstrating that SLRP presence slows the rate of fibril formation and reduces the overall turbidity of collagen gels, evidence that SLRP binding inhibits lateral fusion of collagen fibrils[7][29][34].
The mechanism of SLRP-mediated inhibition of collagen fibril growth likely involves steric hindrance, wherein the curved solenoid structure of the SLRP core protein physically limits the lateral approach and fusion of adjacent collagen fibrils by occupying space on the collagen fibril surface[7][29]. This sterically mediated regulation differs from alternative mechanisms such as proteolytic cleavage or covalent modification of collagen, instead representing a reversible, regulator-dependent mechanism that permits dynamic adjustment of fibril growth in response to changing matrix requirements[29][31]. The strength of SLRP binding to collagen, and consequently the degree of fibrillogenesis inhibition, varies among different SLRP family members, reflecting their evolutionary specialization for different tissues and functional roles[7][29]. EPYC demonstrates intermediate-strength binding to collagen compared to some other SLRPs, a characteristic potentially reflecting its specialized role in cartilage and skeletal tissues where moderate regulation of fibrillogenesis optimizes tissue biomechanical properties[29][31].
The GAG chains attached to EPYC contribute substantially to the protein's collagen-binding properties through multiple mechanisms including providing additional binding surface area and creating charged microenvironments that influence protein-protein interactions[7][29][27]. Removal of GAG chains from SLRP molecules through enzymatic treatment with chondroitinase or other GAG-degrading enzymes reduces SLRP binding to collagen and alters the effects of SLRPs on collagen fibrillogenesis kinetics[29][34]. This dependence of function on GAG presence explains the importance of proper post-translational modification for SLRP biological activity and the variable functional states of EPYC depending on tissue-specific GAG modification patterns.
The emerging recognition of EPYC as both a fundamental regulator of tissue homeostasis and an important player in disease pathogenesis suggests potential therapeutic applications in multiple clinical contexts. In osteoarthritis, strategies aimed at modulating EPYC expression or function might target either the excessive matrix degradation associated with advanced disease or the compensatory matrix synthesis responses occurring in early degeneration[55]. The observation that EPYC functions partially redundantly with other SLRP family members suggests that therapeutic interventions targeting EPYC might require simultaneous modulation of other SLRP molecules to achieve maximal efficacy[8][33]. Conversely, the specific roles of EPYC distinct from other SLRPs might enable selective therapeutic targeting of EPYC to achieve desired effects on cartilage organization without disrupting other SLRP-dependent functions.
In cancer contexts, the emerging role of EPYC as a promoter of tumor progression and metastasis suggests that targeting EPYC expression or its signaling functions might impede cancer progression[47][58]. Approaches might include direct inhibition of EPYC using blocking antibodies or antisense oligonucleotides, modulation of EPYC expression through targeting upstream regulators, or indirect approaches targeting downstream signaling pathways activated by EPYC including the PI3K-AKT cascade[47]. The identification of PLCG2 and CRK as EPYC-interacting proteins opens possibilities for targeting these specific protein-protein interactions to modulate EPYC signaling functions[51]. Combination strategies targeting EPYC alongside other ECM components or growth factors might prove particularly effective, reflecting the multifactorial nature of both osteoarthritis and cancer involving complex interactions between cells and their matrix environment.
Future research directions include detailed characterization of the molecular mechanisms through which EPYC activates specific signaling pathways in different cellular contexts, investigation of tissue-specific regulation of EPYC expression and post-translational modification, and functional analysis of the predicted EPYC interactions with PLCG2, CRK, and other signaling molecules[51]. Structure-function studies mapping the specific amino acid residues and domains of EPYC required for interactions with distinct ligands including growth factors, other matrix proteins, and cell surface receptors would provide molecular-level insights enabling rational therapeutic targeting[7][19]. The development of improved animal models permitting inducible deletion or modification of EPYC in specific tissues or at specific developmental timepoints would clarify the tissue-specific roles of EPYC and distinguish between functions required during development versus maintenance of adult tissues[8]. Investigation of whether and how aberrant EPYC expression or modification contribute to disease pathogenesis in tissues beyond those currently characterized would identify new disease associations and potential therapeutic targets[47][55][58].
Epiphycan represents a prototypical small leucine-rich proteoglycan with fundamental roles in organizing extracellular matrices, regulating growth factor signaling, and maintaining tissue homeostasis across multiple biological systems. The protein's multifaceted functions reflect both its structural characteristics as a curved solenoid molecule with multiple binding surfaces and its sophisticated post-translational modification with chondroitin and dermatan sulfate glycosaminoglycans that enable dynamic regulation of biochemical properties in response to tissue requirements and developmental signals. EPYC demonstrates critical importance in maintaining skeletal integrity, preventing osteoarthritis progression, and regulating corneal structure, functions that become apparent from genetic studies employing knockout animal models and bioinformatic analysis of patient tissues. Beyond these classical roles in tissue organization, EPYC emerges as an important regulator of cancer cell behavior, promoting tumor progression through activation of PI3K-AKT and focal adhesion signaling pathways, functions establishing EPYC as an emerging biomarker for cancer prognosis[47][55][58]. Future therapeutic strategies targeting EPYC may prove valuable in both preventing cartilage degeneration in osteoarthritis and restricting cancer progression in malignancies where aberrant EPYC expression promotes metastasis and poor outcomes. The continued investigation of EPYC structure-function relationships, tissue-specific regulation, and signaling mechanisms will enhance understanding of proteoglycan biology while identifying novel approaches for modulating EPYC in diseased states.
Epiphycan is a secreted extracellular matrix proteoglycan that functions primarily to:
1. Regulate collagen fibrillogenesis - particularly type II collagen in cartilage [PMID:8975717, deep-research-perplexity lines 139-146]
2. Organize extracellular matrix architecture - maintains ordered structure of cartilage and other connective tissues [deep-research-perplexity lines 139-146, UniProt]
3. Modulate growth factor signaling - binds and sequesters TGF-β and BMPs [deep-research-perplexity lines 154-157]
id: Q99645
gene_symbol: EPYC
product_type: PROTEIN
status: COMPLETE
tags:
- ECM
taxon:
id: NCBITaxon:9606
label: Homo sapiens
description: |
Epiphycan (EPYC) is a Class III small leucine-rich proteoglycan (SLRP) that functions
as a secreted extracellular matrix component primarily regulating collagen fibrillogenesis
and matrix organization in cartilage and other connective tissues. The protein contains
six leucine-rich repeat (LRR) domains and is post-translationally modified with dermatan
sulfate glycosaminoglycan chains. EPYC's core functions include: (1) regulating collagen
type II fibril assembly in cartilage by binding collagen fibrils and controlling lateral
growth; (2) organizing three-dimensional extracellular matrix architecture through interactions
with collagen, hyaluronan, and other proteoglycans; and (3) modulating growth factor signaling
by binding and sequestering TGF-β and BMPs. The protein is essential for maintaining articular
cartilage integrity and preventing osteoarthritis, as demonstrated by progressive joint
degeneration in EPYC-deficient mice. EPYC is expressed primarily in skeletal tissues including
growth plate cartilage, articular cartilage, and intervertebral discs, as well as in ligaments
and placenta
existing_annotations:
- term:
id: GO:0031012
label: extracellular matrix
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: |
This annotation is correct and represents the core localization of EPYC. Epiphycan is a
secreted proteoglycan with an N-terminal signal peptide (aa 1-19) that directs it to the
extracellular matrix where it functions exclusively. The protein is integrated into the
organized three-dimensional scaffold of the ECM through interactions with collagen fibrils,
hyaluronan, and other matrix components [file:human/EPYC/EPYC-deep-research-perplexity.md
"functions exclusively in extracellular contexts...becomes integrated into the organized
three-dimensional scaffold of the extracellular matrix"]. This is a fundamental and essential
annotation for EPYC function.
action: ACCEPT
supported_by:
- reference_id: file:human/EPYC/EPYC-deep-research-perplexity.md
supporting_text: 'provider: perplexity'
- term:
id: GO:0060348
label: bone development
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: |
This annotation is supported but represents a non-core function. EPYC does influence bone
development through mechanisms involving matrix organization and growth factor sequestration
during endochondral ossification [file:human/EPYC/EPYC-deep-research-perplexity.md
"protein influences bone development...during endochondral ossification"]. However, EPYC's
primary role is in cartilage matrix organization, with bone development being a secondary
consequence. EPYC-deficient mice show relatively subtle skeletal abnormalities early in
development with age-dependent changes, indicating it's not essential for initial bone
formation but contributes to skeletal homeostasis.
action: KEEP_AS_NON_CORE
- term:
id: GO:0061975
label: articular cartilage development
evidence_type: IBA
original_reference_id: GO_REF:0000033
review:
summary: |
This annotation is correct and represents a core biological process for EPYC. Epiphycan is
essential for articular cartilage development and maintenance. EPYC is highly expressed in
articular cartilage and is critical for maintaining joint integrity. EPYC-deficient mice
develop progressive osteoarthritis with articular cartilage erosions and proteoglycan loss,
demonstrating its essential role [file:human/EPYC/EPYC-deep-research-perplexity.md
"essential for maintaining articular cartilage integrity...EPYC-deficient mice develop
osteoarthritis spontaneously with age...articular cartilage erosions"]. This is a fundamental
function of EPYC.
action: ACCEPT
- term:
id: GO:0031012
label: extracellular matrix
evidence_type: IEA
original_reference_id: GO_REF:0000107
review:
summary: Extracellular matrix localization is correct for EPYC. While this
duplicates the IBA annotation, both provide support for this cellular
component.
action: ACCEPT
reason: EPYC is a component of the extracellular matrix, particularly in
cartilage. Multiple evidence codes supporting the same correct
annotation provide additional confidence.
- term:
id: GO:0005539
label: glycosaminoglycan binding
evidence_type: TAS
original_reference_id: PMID:8975717
review:
summary: |
This annotation is problematic. EPYC itself IS a dermatan sulfate proteoglycan - it carries
GAG chains as post-translational modifications [PMID:8975717 "small dermatan sulfate
proteoglycan"]. The term "glycosaminoglycan binding" suggests EPYC binds to GAGs as ligands,
which is not its primary characterized function. EPYC's GAG chains mediate interactions with
other molecules (collagen, growth factors), but "glycosaminoglycan binding" is not an accurate
description of EPYC's molecular function. This should be removed or replaced with a more
accurate term like "extracellular matrix structural constituent" or "collagen binding".
action: REMOVE
supported_by:
- reference_id: PMID:8975717
supporting_text: Characterization of human DSPG3, a small dermatan
sulfate proteoglycan.
- term:
id: GO:0007565
label: female pregnancy
evidence_type: TAS
original_reference_id: PMID:8975717
review:
summary: |
This annotation is based solely on expression evidence - EPYC is expressed in placental
tissue [PMID:8975717 "expressed in cartilage, as well as ligament and placental tissues"].
However, mere expression in placenta does not establish a specific role in pregnancy. The
term "female pregnancy" is too broad and doesn't describe EPYC's actual molecular or cellular
function. EPYC likely plays a structural role in placental ECM organization, but this hasn't
been experimentally demonstrated. This is an over-annotation based on tissue expression alone.
action: MARK_AS_OVER_ANNOTATED
supported_by:
- reference_id: PMID:8975717
supporting_text: Characterization of human DSPG3, a small dermatan
sulfate proteoglycan.
- term:
id: GO:0005518
label: collagen binding
evidence_type: TAS
original_reference_id: file:human/EPYC/EPYC-deep-research-perplexity.md
review:
summary: |
This is a core molecular function of EPYC. The protein binds to collagen type II fibrils
via its leucine-rich repeat domains and regulates collagen fibril organization in cartilage.
This is well-supported by structural and functional studies showing EPYC's role in controlling
collagen fibrillogenesis through direct binding interactions.
action: NEW
- term:
id: GO:0030199
label: collagen fibril organization
evidence_type: TAS
original_reference_id: file:human/EPYC/EPYC-deep-research-perplexity.md
review:
summary: |
This is a core biological process for EPYC. The protein regulates collagen fibril assembly
by controlling lateral fibril growth through steric hindrance, maintaining optimal
intermolecular spacing. This is essential for maintaining proper cartilage matrix properties.
action: NEW
- term:
id: GO:0005201
label: extracellular matrix structural constituent
evidence_type: TAS
original_reference_id: file:human/EPYC/EPYC-deep-research-perplexity.md
review:
summary: |
This accurately describes EPYC's role as a structural component of the ECM. EPYC serves
as an organizational node that facilitates interactions between collagen fibrils, hyaluronan,
and other proteoglycans. This is more accurate than "glycosaminoglycan binding" for describing
EPYC's structural function.
action: NEW
- term:
id: GO:0030198
label: extracellular matrix organization
evidence_type: TAS
original_reference_id: file:human/EPYC/EPYC-deep-research-perplexity.md
review:
summary: |
This is a core biological process for EPYC. The protein organizes the three-dimensional
architecture of the ECM through multiple interactions with matrix components, contributing
to tissue homeostasis and mechanical properties.
action: NEW
- term:
id: GO:0050431
label: transforming growth factor beta binding
evidence_type: TAS
original_reference_id: file:human/EPYC/EPYC-deep-research-perplexity.md
review:
summary: |
This is a core regulatory function of EPYC. The protein binds and sequesters TGF-β isoforms
and BMPs through its dermatan sulfate GAG chains, reducing their bioavailability and modulating
downstream signaling pathways that regulate matrix remodeling and cell differentiation.
action: NEW
references:
- id: GO_REF:0000033
title: Annotation inferences using phylogenetic trees
findings:
- statement: Phylogenetic inference supporting EPYC annotations based on
ortholog analysis
- id: GO_REF:0000107
title: Automatic transfer of experimentally verified manual GO annotation
data to orthologs using Ensembl Compara.
findings:
- statement: Automated transfer of EPYC annotations from orthologous
proteins
- id: PMID:8975717
title: Characterization of human DSPG3, a small dermatan sulfate
proteoglycan.
findings:
- statement: Original characterization of human EPYC (DSPG3) showing it is
a small dermatan sulfate proteoglycan expressed in cartilage,
ligament, and placental tissues. Contains 322 amino acids with
leucine-rich repeats, glycosaminoglycan attachment sites, and is
localized to chromosome 12q21.
supporting_text: "DSPG3 is expressed in cartilage, as well as ligament and
placental tissues...composed of 1515 nucleotides of cDNA that code for a
322-amino-acid protein"
- id: file:human/EPYC/EPYC-deep-research-perplexity.md
title: Deep research summary on EPYC gene function
findings:
- statement: EPYC is a Class III SLRP containing 6 LRRs that regulates
collagen fibrillogenesis by binding collagen fibrils and controlling
lateral fibril growth through steric hindrance
- statement: Functions in organizing three-dimensional ECM architecture
through interactions with collagen, hyaluronan, and other
proteoglycans
- statement: Modulates growth factor signaling by binding and sequestering
TGF-β and BMPs, reducing their bioavailability
- statement: Essential for maintaining articular cartilage integrity -
EPYC-deficient mice develop progressive osteoarthritis with cartilage
erosions
- statement: Participates in endochondral ossification and influences bone
development, though with subtle early skeletal abnormalities
- statement: In cancer contexts, promotes tumor progression through
PI3K/AKT and focal adhesion pathway activation
- statement: Post-translationally modified with dermatan sulfate GAG
chains that are critical for growth factor binding and collagen
interactions
core_functions:
- molecular_function:
id: GO:0005518
label: collagen binding
directly_involved_in:
- id: GO:0030199
label: collagen fibril organization
locations:
- id: GO:0031012
label: extracellular matrix
description: |
Binds to collagen type II fibrils via leucine-rich repeat domains and regulates collagen
fibril assembly in cartilage by controlling lateral fibril growth through steric hindrance,
maintaining optimal intermolecular spacing between collagen fibrils
supported_by:
- reference_id: file:human/EPYC/EPYC-deep-research-perplexity.md
supporting_text: "EPYC is a Class III SLRP containing 6 LRRs that regulates
collagen fibrillogenesis by binding collagen fibrils and controlling lateral
fibril growth through steric hindrance"
- reference_id: PMID:8975717
supporting_text: "small dermatan sulfate proteoglycan...expressed in cartilage"
- molecular_function:
id: GO:0005201
label: extracellular matrix structural constituent
directly_involved_in:
- id: GO:0030198
label: extracellular matrix organization
locations:
- id: GO:0031012
label: extracellular matrix
description: |
Serves as a structural component of the extracellular matrix, acting as an organizational
node that facilitates interactions between collagen fibrils, hyaluronan, and other
proteoglycans through its core protein and glycosaminoglycan chains
supported_by:
- reference_id: file:human/EPYC/EPYC-deep-research-perplexity.md
supporting_text: "Functions in organizing three-dimensional ECM architecture
through interactions with collagen, hyaluronan, and other proteoglycans"
- molecular_function:
id: GO:0050431
label: transforming growth factor beta binding
directly_involved_in:
- id: GO:0030198
label: extracellular matrix organization
locations:
- id: GO:0031012
label: extracellular matrix
description: |
Binds and sequesters TGF-β isoforms and BMPs through dermatan sulfate GAG chains,
reducing their bioavailability and modulating downstream signaling pathways that
regulate matrix remodeling and cell differentiation
supported_by:
- reference_id: file:human/EPYC/EPYC-deep-research-perplexity.md
supporting_text: "Modulates growth factor signaling by binding and sequestering
TGF-β and BMPs, reducing their bioavailability"