Published on: April 2026
MEDI AI : A DISEASE PREDICTION SYSTEM
Riddhi Dubey G.vyshnavi B.Bhargavi
Dr. R.R.S. RAVI KUMARI
Article Status
Available Documents
Abstract
The system utilizes a dataset containing various diseases and their corresponding symptoms. Data preprocessing techniques such as missing value handling, encoding, normalization, and structuring are applied to improve data quality and enhance model performance. Multiple machine learning algorithms such as Logistic Regression, Decision Tree, Random Forest, Naive Bayes, Support Vector Machine, and K-Nearest Neighbors are implemented and evaluated using performance metrics like accuracy, precision, recall, F1-score, and confusion matrix.
Experimental results indicate that ensemble models such as Random Forest provide higher prediction accuracy compared to other algorithms. The best-performing model is integrated into a web-based application that allows users to input symptoms and receive real-time disease predictions. The proposed system helps in early diagnosis, reduces dependency on immediate medical consultation, and improves healthcare accessibility, especially in remote areas.
How to Cite this Paper
Dubey, R., G.vyshnavi, & B.Bhargavi, (2026). MEDI AI : A DISEASE PREDICTION SYSTEM. International Journal of Creative and Open Research in Engineering and Management, <i>02</i>(04). https://doi.org/10.55041/ijcope.v2i4.887
Dubey, Riddhi, et al.. "MEDI AI : A DISEASE PREDICTION SYSTEM." International Journal of Creative and Open Research in Engineering and Management, vol. 02, no. 04, 2026, pp. . doi:https://doi.org/10.55041/ijcope.v2i4.887.
Dubey, Riddhi, G.vyshnavi, and B.Bhargavi. "MEDI AI : A DISEASE PREDICTION SYSTEM." International Journal of Creative and Open Research in Engineering and Management 02, no. 04 (2026). https://doi.org/https://doi.org/10.55041/ijcope.v2i4.887.
References
[1]M. Mitchell, Machine Learning, New York, USA: McGraw-Hill, 1997.[2] F. Pedregosa et al., “Scikit-learn: Machine Learning in Python,” Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.
[3] L. Breiman, “Random Forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, 2001.
[4] J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, 3rd ed., Morgan Kaufmann, 2011.
[5] D. Dua and C. Graff, “UCI Machine Learning Repository,” University of California, Irvine, 2017. Available: https://archive.ics.uci.edu
[6] Kaggle, “Disease Prediction Using Symptoms Dataset,”Available: https://www.kaggle.com/datasets/itachi9604/disease-symptom-description-dataset
[7] Pandas Documentation, Available: https://pandas.pydata.org/docs/
[8] NumPy Documentation, Available: https://numpy.org/doc/
[9] Scikit-learn Documentation, Available: https://scikit-learn.org/stable/
[10] World Health Organization (WHO), “Global Health Observatory Data Repository,” Available: https://www.who.int/data
Ethical Compliance & Review Process
- •All submissions are screened under plagiarism detection.
- •Review follows editorial policy.
- •Authors retain copyright.
- •Peer Review Type: Double-Blind Peer Review
- •Published on: Apr 29 2026
This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. You are free to share and adapt this work for non-commercial purposes with proper attribution.

