This paper is published in Volume-11, Issue-2, 2025
Area
Machine Learning
Author
Shaba Khatoon, Dr. Ankita Srivastava, Dr. Shish Ahmad
Org/Univ
Integral University Lucknow, Dashauli, Uttar Pradesh, India
Keywords
Insurance Fraud Detection, Machine Learning, Random Forest, Classification, Smote, Imbalanced Data, Feature Engineering, Fraud Analytics, Predictive Modelling, Roc-Auc Curve, Ensemble Learning, Data Preprocessing, Supervised Learning, Automated Detection Systems, Financial Fraud.
Citations
IEEE
Shaba Khatoon, Dr. Ankita Srivastava, Dr. Shish Ahmad. Detection of Insurance Fraud Using Machine Learning: A Data-Driven Approach for Accurate Classification, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.
APA
Shaba Khatoon, Dr. Ankita Srivastava, Dr. Shish Ahmad (2025). Detection of Insurance Fraud Using Machine Learning: A Data-Driven Approach for Accurate Classification. International Journal of Advance Research, Ideas and Innovations in Technology, 11(2) www.IJARIIT.com.
MLA
Shaba Khatoon, Dr. Ankita Srivastava, Dr. Shish Ahmad. "Detection of Insurance Fraud Using Machine Learning: A Data-Driven Approach for Accurate Classification." International Journal of Advance Research, Ideas and Innovations in Technology 11.2 (2025). www.IJARIIT.com.
Shaba Khatoon, Dr. Ankita Srivastava, Dr. Shish Ahmad. Detection of Insurance Fraud Using Machine Learning: A Data-Driven Approach for Accurate Classification, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.
APA
Shaba Khatoon, Dr. Ankita Srivastava, Dr. Shish Ahmad (2025). Detection of Insurance Fraud Using Machine Learning: A Data-Driven Approach for Accurate Classification. International Journal of Advance Research, Ideas and Innovations in Technology, 11(2) www.IJARIIT.com.
MLA
Shaba Khatoon, Dr. Ankita Srivastava, Dr. Shish Ahmad. "Detection of Insurance Fraud Using Machine Learning: A Data-Driven Approach for Accurate Classification." International Journal of Advance Research, Ideas and Innovations in Technology 11.2 (2025). www.IJARIIT.com.
Abstract
Insurance fraud exerts a considerable economic strain on both insurers and policyholders, eroding the foundational trust that sustains the insurance industry. This study presents the development of a robust, scalable machine learning framework tailored for the detection of fraudulent claims. To mitigate the prevalent issue of class imbalance in fraud detection, a synthetically balanced dataset was constructed using the Synthetic Minority Oversampling Technique (SMOTE). The methodological approach incorporates a Random Forest classifier, augmented with domain-informed feature engineering—such as income-to- liability metrics and other fiscal indicators—designed to enhance discriminative power. Model performance is rigorously assessed through a suite of evaluation criteria, including accuracy, precision, recall, F1-score, confusion matrix, and both ROC-AUC and precision-recall curves. The experimental outcomes reveal exceptional classification efficacy, with the model achieving near-flawless predictive accuracy on the balanced dataset. When benchmarked against baseline models, the proposed system demonstrates superior detection capability, particularly in minimizing false negatives. This research underscores the effectiveness of ensemble-based algorithms in the fraud analytics domain and sets the stage for practical deployment in operational insurance environments. Prospective advancements may involve the integration of deep neural networks and real-time anomaly detection using streaming architectures to further enhance scalability and responsiveness.
