IJCOPE Journal

UGC Logo DOI / ISO Logo

International Journal of Creative and Open Research in Engineering and Management

A Peer-Reviewed, Open-Access International Journal Supporting Multidisciplinary Research, Digital Publishing Standards, DOI Registration, and Academic Indexing.
Journal Information
ISSN: 3108-1754 (Online)
Crossref DOI: Available
ISO Certification: 9001:2015
Publication Fee: 599/- INR
Compliance: UGC Journal Norms
License: CC BY 4.0
Peer Review: Double Blind
Volume 02, Issue 05

Published on: May 2026

AN EXPLAINABLE MACHINE LEARNING FRAMEWORK FOR PREDICTING STEM CELL DIFFERENTIATION INTO INSULIN-PRODUCING BETA CELLS USING GENE EXPRESSION BIOMARKERS

R. Pavan Teja

Y. Dayanand Kumar

Department of Computer Science and Artificial Intelligence

Central University of Andhra Pradesh

Ananthapuramu Andhra Pradesh India

Article Status

Plagiarism Passed Peer Reviewed Open Access

Available Documents

Abstract

Stem cell differentiation into insulin-producing pancreatic beta cells represents a transformative avenue for type 1 diabetes therapy. Existing computational approaches predominantly rely on convolutional neural networks (CNNs) applied to microscopic cell images; while achieving high classification accuracy, such methods suffer from limited interpretability and dependence on expensive imaging infrastructure. This paper presents BetaXplain, an explainable machine learning framework that predicts differentiation success from quantitative gene expression measurements rather than visual features. The system models differentiation outcome as a binary classification problem over a feature vector of five biologically validated transcription factor biomarkers — PDX1, NKX6.1, NGN3, INS, and MAFA — derived from the public GSE83139 gene expression dataset. We evaluate three classifiers — Random Forest, Support Vector Machine, and XGBoost — with XGBoost achieving the highest accuracy of 92.4%, precision of 91.8%, and recall of 93.1%. Prediction outputs are augmented with gene importance rankings and radar-chart visualizations that provide biologically interpretable explanations of each decision. Comparative analysis against CNN-based baselines demonstrates that BetaXplain matches predictive performance while substantially improving transparency, reducing infrastructure requirements, and enabling direct biological insight. The framework constitutes a step toward clinically actionable, interpretable AI in regenerative medicine.

Index Terms—stem cell differentiation, gene expression, explainable AI, XGBoost, pancreatic beta cells, bioinformatics, transcription factor biomarkers

How to Cite this Paper

Teja, R. P. (2026). An Explainable Machine Learning Framework for Predicting Stem Cell Differentiation into Insulin-Producing Beta Cells Using Gene Expression Biomarkers. International Journal of Creative and Open Research in Engineering and Management, <i>02</i>(05). https://doi.org/10.55041/ijcope.v2i5.255

Teja, R.. "An Explainable Machine Learning Framework for Predicting Stem Cell Differentiation into Insulin-Producing Beta Cells Using Gene Expression Biomarkers." International Journal of Creative and Open Research in Engineering and Management, vol. 02, no. 05, 2026, pp. . doi:https://doi.org/10.55041/ijcope.v2i5.255.

Teja, R.. "An Explainable Machine Learning Framework for Predicting Stem Cell Differentiation into Insulin-Producing Beta Cells Using Gene Expression Biomarkers." International Journal of Creative and Open Research in Engineering and Management 02, no. 05 (2026). https://doi.org/https://doi.org/10.55041/ijcope.v2i5.255.

Search & Index

References

[1] International Diabetes Federation, "IDF Diabetes Atlas, 10th ed.," Brussels, Belgium: IDF, 2021. [Online]. Available: https://www.diabetesatlas.org

[2] F. W. Pagliuca et al., "Generation of functional human pancreatic beta cells in vitro," Cell, vol. 159, no. 2, pp. 428–439, 2014.

[3] D. Kusumoto et al., "Automated deep learning-based system to identify endothelial cells derived from induced pluripotent stem cells," Stem Cell Reports, vol. 10, no. 6, pp. 1687–1695, 2018.

[4] A. Rezania et al., "Reversal of diabetes with insulin-producing cells derived in vitro from human pluripotent stem cells," Nature Biotechnology, vol. 32, no. 11, pp. 1121–1133, 2014.

[5] A. Veres et al., "Charting cellular identity during human in vitro β-cell differentiation," Nature, vol. 569, no. 7756, pp. 368–373, 2019.

[6] T. Chen and C. Guestrin, "XGBoost: A scalable tree boosting system," in Proc. 22nd ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining (KDD), pp. 785–794, 2016.

[7] A. Esteva et al., "Dermatologist-level classification of skin cancer with deep neural networks," Nature, vol. 542, no. 7639, pp. 115–118, 2017.

[8] S. M. Lundberg and S.-I. Lee, "A unified approach to interpreting model predictions," in Advances in Neural Information Processing Systems (NeurIPS), vol. 30, pp. 4765–4774, 2017.

Ethical Compliance & Review Process

  • All submissions are screened under plagiarism detection.
  • Review follows editorial policy.
  • Authors retain copyright.
  • Peer Review Type: Double-Blind Peer Review
  • Published on: May 08 2026
CCBYNC

This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. You are free to share and adapt this work for non-commercial purposes with proper attribution.

View License
Scroll to Top