Post-clustering interpretation of gene expression data using functional enrichment and network analysis

Oleg R. Yarema; Denys O. Senchyshen; Sergii A. Babichev

doi:10.15276/aait.08.2025.16

Published:
2025-09-25

Keywords:

Computational biology data analysis, bioinformatics, integrative analysis, gene expression data, post-clustering interpretation, functional enrichment, Gene Ontology, Kyoto Encyclopedia of Genes and Genomes, Reactome, Cytoscape, network-based analysis

How to Cite

Yarema O. R.; Senchyshen D. O.; Babichev S. A. "Post-clustering interpretation of gene expression data using functional enrichment and network analysis". Publ. Nauka i Tekhnika. Odesa: Ukraine. AAIT 8 (3), 249–262. https://doi.org/10.15276/aait.08.2025.16.

Oleg R. Yarema

Ivan Franko National University of Lviv, 1, Universytetska Str. Lviv, 79000, Ukraine

https://orcid.org/0000-0003-3736-4820

Denys O. Senchyshen

Kherson State University, 14b, Shevchenko Str, Sivka-Voynylivska, Ivano-Frankivsk Oblast, 77311, Ukraine

https://orcid.org/0000-0002-4311-7095

Sergii A. Babichev

Kherson State University, 14b, Shevchenko Str, Sivka-Voynylivska, Ivano-Frankivsk Oblast, 77311, Ukraine

https://orcid.org/0000-0001-6797-1467

Abstract

Clustering of gene expression profiles is a core technique used to reveal hidden biological structures and differentiate disease subtypes in high-dimensional biomedical datasets. Nevertheless, translating cluster structures into biologically meaningful insights requires integrative analytical strategies that go beyond unsupervised learning. In this work, we introduce a novel integrative computational approach that emphasizes post-clustering interpretation by combining statistical functional enrichment with network-based modeling. Clusters of gene expression profiles, previously identified in patients with distinct cancer types, were subjected to enrichment analysis using Gene Ontology, the Kyoto Encyclopedia of Genes and Genomes, and Reactome databases. The enrichment was performed with the g:Profiler tool, allowing the detection of significantly overrepresented biological processes, molecular functions, cellular components, and signaling pathways within each cluster. To visualize and further interpret the enriched functional categories, Cytoscape software was employed. Functional interaction networks were constructed using two key modules: ClueGO, which integrates Gene Ontology and pathway annotation into a functionally grouped network, and CluePedia, which expands these networks by showing relationships between genes and enriched terms. This network-based visualization enabled deeper biological interpretation and facilitated the identification of core functional themes. The analysis revealed that each gene cluster is associated with distinct biological processes, such as immune signaling, metabolic pathways, DNA repair, or cell cycle regulation. The novelty of the proposed approach lies in its systematic integration of enrichment statistics with graph-based visualization, ensuring both computational rigor and biological interpretability. These findings confirm that the method can extract biologically consistent knowledge from complex gene expression data. 
In summary, the study presents an innovative post-clustering interpretation strategy that bridges unsupervised machine learning and functional genomics. This approach advances the explainability of computational analysis and supports its application in disease subtyping, biomarker discovery, and personalized medicine research.
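The statistical idea behind the enrichment step can be illustrated with a minimal sketch. It is not the authors' pipeline and not g:Profiler's implementation: it runs a plain hypergeometric over-representation test with Benjamini-Hochberg correction, whereas g:Profiler applies its own multiple-testing procedure by default. The gene identifiers (`G1` to `G20`), the term-to-gene mapping, and the function names are hypothetical toy values chosen only for illustration.

```python
from math import comb


def hypergeom_sf(k, N, K, n):
    """P(X >= k): chance of seeing at least k annotated genes when drawing
    n genes from a background of N genes, K of which carry the annotation."""
    total = comb(N, n)
    upper = min(K, n)
    return sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1)) / total


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def enrich(cluster, background, term_to_genes, alpha=0.05):
    """Over-representation test of one gene cluster against annotation terms.
    Simplification: only terms that overlap the cluster are tested and corrected."""
    background = set(background)
    cluster = set(cluster) & background
    rows = []
    for term, genes in term_to_genes.items():
        annotated = set(genes) & background
        overlap = cluster & annotated
        if not overlap:
            continue
        p = hypergeom_sf(len(overlap), len(background), len(annotated), len(cluster))
        rows.append((term, sorted(overlap), p))
    adjusted = benjamini_hochberg([p for _, _, p in rows])
    hits = [(t, g, p, q) for (t, g, p), q in zip(rows, adjusted) if q <= alpha]
    return sorted(hits, key=lambda row: row[3])


# Hypothetical toy data: 20 background genes and three annotation terms.
background = [f"G{i}" for i in range(1, 21)]
term_to_genes = {
    "DNA repair": ["G1", "G2", "G3", "G4", "G5"],
    "cell cycle": ["G4", "G5", "G6", "G7", "G8", "G9"],
    "immune signaling": ["G15", "G16", "G17", "G18", "G19", "G20"],
}
cluster = ["G1", "G2", "G3", "G4"]

results = enrich(cluster, background, term_to_genes)
for term, genes, p, q in results:
    print(f"{term}: overlap={genes}, p={p:.4g}, adjusted p={q:.4g}")
```

In this toy setting only the term sharing all four cluster genes passes the threshold; the term sharing a single gene does not. Terms retained this way are the kind of input that a network tool such as ClueGO would then group and visualize.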

Issue

Vol. 8 No. 3 (2025): Applied Aspects of Information Technology

Section

Computer science and software engineering

Author Biographies

Oleg R. Yarema, Ivan Franko National University of Lviv, 1, Universytetska Str. Lviv, 79000, Ukraine

Ph.D., Associate Professor, Department of Digital Economics and Business Analytics

Scopus Author ID: 59250847800

Denys O. Senchyshen, Kherson State University, 14b, Shevchenko Str, Sivka-Voynylivska, Ivano-Frankivsk Oblast, 77311, Ukraine

PhD Student, Department of Computer Science and Software Engineering

Sergii A. Babichev, Kherson State University, 14b, Shevchenko Str, Sivka-Voynylivska, Ivano-Frankivsk Oblast, 77311, Ukraine

Doctor of Engineering Sciences, Professor, Department of Informatics. Jan Evangelista Purkyně University in Ústí nad Labem, Pasteurova 3632/15, 400 96 Ústí nad Labem, Czech Republic

Scopus Author ID: 57189091127
