This paper is published in Volume-11, Issue-2, 2025
Area
Machine Learning
Author
Taaha Ansari, Vaishali M. Bagade
Org/Univ
Alamuri Ratnamala Institute of Engineering and Technology, Tute, Maharashtra, India
Keywords
Diabetes Prediction, Machine Learning, Logistic Regression, KNN, F1-Score, AUC.
Citations
IEEE
Taaha Ansari, Vaishali M. Bagade. Comparative Analysis of Machine Learning Models for Diabetes Prediction: A Performance Evaluation Study, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.
APA
Taaha Ansari, Vaishali M. Bagade (2025). Comparative Analysis of Machine Learning Models for Diabetes Prediction: A Performance Evaluation Study. International Journal of Advance Research, Ideas and Innovations in Technology, 11(2) www.IJARIIT.com.
MLA
Taaha Ansari, Vaishali M. Bagade. "Comparative Analysis of Machine Learning Models for Diabetes Prediction: A Performance Evaluation Study." International Journal of Advance Research, Ideas and Innovations in Technology 11.2 (2025). www.IJARIIT.com.
Taaha Ansari, Vaishali M. Bagade. Comparative Analysis of Machine Learning Models for Diabetes Prediction: A Performance Evaluation Study, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.
APA
Taaha Ansari, Vaishali M. Bagade (2025). Comparative Analysis of Machine Learning Models for Diabetes Prediction: A Performance Evaluation Study. International Journal of Advance Research, Ideas and Innovations in Technology, 11(2) www.IJARIIT.com.
MLA
Taaha Ansari, Vaishali M. Bagade. "Comparative Analysis of Machine Learning Models for Diabetes Prediction: A Performance Evaluation Study." International Journal of Advance Research, Ideas and Innovations in Technology 11.2 (2025). www.IJARIIT.com.
Abstract
Diabetes is a chronic disease affecting millions worldwide, necessitating early diagnosis and effective prediction models for improved healthcare outcomes. This study evaluates seven machine learning algorithms for diabetes prediction using healthcare data. We compared Logistic Regression, K-Nearest Neighbors (KNN), Random Forest, Decision Tree, AdaBoost, XGBoost, and Support Vector Machine (SVM) models. The analysis focused on key performance metrics: accuracy, precision, recall, F1-score, and Area Under the Curve (AUC). Results showed that logistic regression achieved the highest overall performance with 79% accuracy and 0.88 AUC, suggesting its potential utility in clinical diabetes prediction applications.